CN116987681A - 10-deacetylbaccatin III 10 beta-O-acetyltransferase mutant and application thereof - Google Patents
10-deacetylbaccatin III 10 beta-O-acetyltransferase mutant and application thereof Download PDFInfo
- Publication number
- CN116987681A CN116987681A CN202210444105.0A CN202210444105A CN116987681A CN 116987681 A CN116987681 A CN 116987681A CN 202210444105 A CN202210444105 A CN 202210444105A CN 116987681 A CN116987681 A CN 116987681A
- Authority
- CN
- China
- Prior art keywords
- leu
- val
- ser
- gly
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- YWLXLRUDGLRYDR-ZHPRIASZSA-N 5beta,20-epoxy-1,7beta,10beta,13alpha-tetrahydroxy-9-oxotax-11-ene-2alpha,4alpha-diyl 4-acetate 2-benzoate Chemical compound O([C@H]1[C@H]2[C@@](C([C@H](O)C3=C(C)[C@@H](O)C[C@]1(O)C3(C)C)=O)(C)[C@@H](O)C[C@H]1OC[C@]12OC(=O)C)C(=O)C1=CC=CC=C1 YWLXLRUDGLRYDR-ZHPRIASZSA-N 0.000 title claims abstract description 24
- TYLVGQKNNUHXIP-MHHARFCSSA-N 10-deacetyltaxol Chemical compound O([C@H]1[C@H]2[C@@](C([C@H](O)C3=C(C)[C@@H](OC(=O)[C@H](O)[C@@H](NC(=O)C=4C=CC=CC=4)C=4C=CC=CC=4)C[C@]1(O)C3(C)C)=O)(C)[C@@H](O)C[C@H]1OC[C@]12OC(=O)C)C(=O)C1=CC=CC=C1 TYLVGQKNNUHXIP-MHHARFCSSA-N 0.000 claims abstract description 71
- 102000008300 Mutant Proteins Human genes 0.000 claims abstract description 38
- 108010021466 Mutant Proteins Proteins 0.000 claims abstract description 38
- 229930182986 10-Deacetyltaxol Natural products 0.000 claims abstract description 35
- 125000002252 acyl group Chemical group 0.000 claims abstract description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 6
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 6
- 230000000694 effects Effects 0.000 claims description 16
- 230000035772 mutation Effects 0.000 claims description 16
- 239000013612 plasmid Substances 0.000 claims description 12
- 230000004048 modification Effects 0.000 claims description 9
- 238000012986 modification Methods 0.000 claims description 9
- 150000001413 amino acids Chemical class 0.000 claims description 8
- 239000002773 nucleotide Substances 0.000 claims description 8
- 125000003729 nucleotide group Chemical group 0.000 claims description 8
- 101000885693 Taxus cuspidata 10-deacetylbaccatin III 10-O-acetyltransferase Proteins 0.000 claims description 7
- 238000006640 acetylation reaction Methods 0.000 claims description 6
- 102200130585 rs778210210 Human genes 0.000 claims description 6
- 230000021736 acetylation Effects 0.000 claims description 5
- 229920001223 polyethylene glycol Polymers 0.000 claims description 4
- 102000057234 Acyl transferases Human genes 0.000 claims description 2
- 108700016155 Acyl transferases Proteins 0.000 claims description 2
- 241000589158 Agrobacterium Species 0.000 claims description 2
- 241000351920 Aspergillus nidulans Species 0.000 claims description 2
- 244000063299 Bacillus subtilis Species 0.000 claims description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 claims description 2
- 101710135898 Myc proto-oncogene protein Proteins 0.000 claims description 2
- 102100038895 Myc proto-oncogene protein Human genes 0.000 claims description 2
- 241000235648 Pichia Species 0.000 claims description 2
- 239000002202 Polyethylene glycol Substances 0.000 claims description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 claims description 2
- 101710150448 Transcriptional regulator Myc Proteins 0.000 claims description 2
- 125000000218 acetic acid group Chemical group C(C)(=O)* 0.000 claims description 2
- 230000029936 alkylation Effects 0.000 claims description 2
- 238000005804 alkylation reaction Methods 0.000 claims description 2
- 230000009435 amidation Effects 0.000 claims description 2
- 238000007112 amidation reaction Methods 0.000 claims description 2
- 230000006287 biotinylation Effects 0.000 claims description 2
- 238000007413 biotinylation Methods 0.000 claims description 2
- CRFNGMNYKDXRTN-CITAKDKDSA-N butyryl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CCC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 CRFNGMNYKDXRTN-CITAKDKDSA-N 0.000 claims description 2
- 108010048367 enhanced green fluorescent protein Proteins 0.000 claims description 2
- 230000013595 glycosylation Effects 0.000 claims description 2
- 238000006206 glycosylation reaction Methods 0.000 claims description 2
- 230000026731 phosphorylation Effects 0.000 claims description 2
- 238000006366 phosphorylation reaction Methods 0.000 claims description 2
- 229920001184 polypeptide Polymers 0.000 claims description 2
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 2
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 2
- QAQREVBBADEHPA-IEXPHMLFSA-N propionyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 QAQREVBBADEHPA-IEXPHMLFSA-N 0.000 claims description 2
- 238000007363 ring formation reaction Methods 0.000 claims description 2
- 102200091761 rs33926449 Human genes 0.000 claims description 2
- 102000040430 polynucleotide Human genes 0.000 claims 5
- 108091033319 polynucleotide Proteins 0.000 claims 5
- 239000002157 polynucleotide Substances 0.000 claims 5
- GKNWJYPYFJMILD-UHFFFAOYSA-N 1-[4-(2h-triazolo[4,5-c]pyridin-4-ylperoxy)-2h-triazolo[4,5-c]pyridin-6-yl]decan-1-one Chemical compound N=1C(C(=O)CCCCCCCCC)=CC=2NN=NC=2C=1OOC1=NC=CC2=C1N=NN2 GKNWJYPYFJMILD-UHFFFAOYSA-N 0.000 claims 1
- 229940123237 Taxane Drugs 0.000 claims 1
- 230000010933 acylation Effects 0.000 claims 1
- 238000005917 acylation reaction Methods 0.000 claims 1
- 229930012538 Paclitaxel Natural products 0.000 abstract description 18
- 229960001592 paclitaxel Drugs 0.000 abstract description 18
- RCINICONZNJXQF-MZXODVADSA-N taxol Chemical compound O([C@@H]1[C@@]2(C[C@@H](C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-MZXODVADSA-N 0.000 abstract description 18
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical group O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 abstract description 16
- 239000000758 substrate Substances 0.000 abstract description 14
- 229940100228 acetyl coenzyme a Drugs 0.000 abstract description 3
- 125000002887 hydroxy group Chemical group [H]O* 0.000 abstract description 3
- 230000002194 synthesizing effect Effects 0.000 abstract description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 56
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 56
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 56
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 56
- 108010047857 aspartylglycine Proteins 0.000 description 56
- 108010079547 glutamylmethionine Proteins 0.000 description 56
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 54
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 37
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 37
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 37
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 35
- 108020004414 DNA Proteins 0.000 description 34
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 30
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 29
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 28
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 28
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 28
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 28
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 28
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 28
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 28
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 28
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 28
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 28
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 28
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 28
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 28
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 28
- ZMWDUIIACVLIHK-GHCJXIJMSA-N Asn-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N ZMWDUIIACVLIHK-GHCJXIJMSA-N 0.000 description 28
- OGMDXNFGPOPZTK-GUBZILKMSA-N Asn-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N OGMDXNFGPOPZTK-GUBZILKMSA-N 0.000 description 28
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 28
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 28
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 28
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 28
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 28
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 28
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 28
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 28
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 28
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 28
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 28
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 28
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 28
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 28
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 28
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 28
- 108010090461 DFG peptide Proteins 0.000 description 28
- UVAOVENCIONMJP-GUBZILKMSA-N Gln-Cys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O UVAOVENCIONMJP-GUBZILKMSA-N 0.000 description 28
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 28
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 28
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 28
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 28
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 28
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 28
- UENPHLAAKDPZQY-XKBZYTNZSA-N Glu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O UENPHLAAKDPZQY-XKBZYTNZSA-N 0.000 description 28
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 28
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 28
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 28
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 28
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 28
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 28
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 28
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 28
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 28
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 28
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 28
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 28
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 28
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 28
- 241000880493 Leptailurus serval Species 0.000 description 28
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 28
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 28
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 28
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 28
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 28
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 28
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 28
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 28
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 28
- MSFITIBEMPWCBD-ULQDDVLXSA-N Leu-Val-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MSFITIBEMPWCBD-ULQDDVLXSA-N 0.000 description 28
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 28
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 28
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 28
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 28
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 28
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 28
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 28
- OLWAOWXIADGIJG-AVGNSLFASA-N Met-Arg-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(O)=O OLWAOWXIADGIJG-AVGNSLFASA-N 0.000 description 28
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 28
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 28
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 28
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 28
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 28
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 28
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 28
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 28
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 28
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 28
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 28
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 28
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 28
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 28
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 28
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 28
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 28
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 28
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 28
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 28
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 28
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 28
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 28
- OZUJUVFWMHTWCZ-HOCLYGCPSA-N Trp-Gly-His Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OZUJUVFWMHTWCZ-HOCLYGCPSA-N 0.000 description 28
- ZHZLQVLQBDBQCQ-WDSOQIARSA-N Trp-Lys-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ZHZLQVLQBDBQCQ-WDSOQIARSA-N 0.000 description 28
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 28
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 28
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 28
- AFWXOGHZEKARFH-ACRUOGEOSA-N Tyr-Tyr-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 AFWXOGHZEKARFH-ACRUOGEOSA-N 0.000 description 28
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 28
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 28
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 28
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 28
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 28
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 28
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 28
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 28
- 108010013835 arginine glutamate Proteins 0.000 description 28
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 28
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 28
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 28
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 28
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 28
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 28
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 28
- 108010089804 glycyl-threonine Proteins 0.000 description 28
- 108010081551 glycylphenylalanine Proteins 0.000 description 28
- 108010040030 histidinoalanine Proteins 0.000 description 28
- 108010092114 histidylphenylalanine Proteins 0.000 description 28
- 108010085325 histidylproline Proteins 0.000 description 28
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 28
- 108010054155 lysyllysine Proteins 0.000 description 28
- 108010017391 lysylvaline Proteins 0.000 description 28
- 108010031719 prolyl-serine Proteins 0.000 description 28
- 108010061238 threonyl-glycine Proteins 0.000 description 28
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 28
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 28
- SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 27
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 27
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 27
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 27
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 27
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 27
- 108010003137 tyrosyltyrosine Proteins 0.000 description 27
- 108010073969 valyllysine Proteins 0.000 description 27
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 26
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 25
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 25
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 25
- 230000003197 catalytic effect Effects 0.000 description 25
- 108010037850 glycylvaline Proteins 0.000 description 24
- 108090000623 proteins and genes Proteins 0.000 description 22
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 21
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 21
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 19
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 19
- 238000006243 chemical reaction Methods 0.000 description 19
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 16
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 15
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 14
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 14
- 102000004169 proteins and genes Human genes 0.000 description 14
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 12
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 12
- WEWCEPOYKANMGZ-MMWGEVLESA-N Ile-Cys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N WEWCEPOYKANMGZ-MMWGEVLESA-N 0.000 description 11
- 239000000047 product Substances 0.000 description 11
- 108010077112 prolyl-proline Proteins 0.000 description 11
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 9
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 7
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 7
- 239000000872 buffer Substances 0.000 description 7
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- 238000012408 PCR amplification Methods 0.000 description 6
- 239000012634 fragment Substances 0.000 description 6
- 238000004128 high performance liquid chromatography Methods 0.000 description 5
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 4
- 241001116500 Taxus Species 0.000 description 4
- 238000000246 agarose gel electrophoresis Methods 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 230000003139 buffering effect Effects 0.000 description 4
- BHZOKUMUHVTPBX-UHFFFAOYSA-M sodium acetic acid acetate Chemical compound [Na+].CC(O)=O.CC([O-])=O BHZOKUMUHVTPBX-UHFFFAOYSA-M 0.000 description 4
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 3
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 3
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 3
- 239000000370 acceptor Substances 0.000 description 3
- 108010027371 asparaginyl-leucyl-prolyl-arginine Proteins 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- OVMSOCFBDVBLFW-VHLOTGQHSA-N 5beta,20-epoxy-1,7beta,13alpha-trihydroxy-9-oxotax-11-ene-2alpha,4alpha,10beta-triyl 4,10-diacetate 2-benzoate Chemical compound O([C@@H]1[C@@]2(C[C@H](O)C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)O)C(=O)C1=CC=CC=C1 OVMSOCFBDVBLFW-VHLOTGQHSA-N 0.000 description 2
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 241001674343 Taxus x media Species 0.000 description 2
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 239000012084 conversion product Substances 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 230000014509 gene expression Effects 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 238000009630 liquid culture Methods 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 238000000034 method Methods 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- YWLXLRUDGLRYDR-SKXCCXORSA-N 10-dab iii Chemical compound O([C@H]1C2[C@@](C([C@H](O)C3=C(C)[C@@H](O)C[C@]1(O)C3(C)C)=O)(C)[C@@H](O)C[C@H]1OC[C@]12OC(=O)C)C(=O)C1=CC=CC=C1 YWLXLRUDGLRYDR-SKXCCXORSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- WAJDEKCJRKGRPG-CIUDSAMLSA-N Cys-His-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N WAJDEKCJRKGRPG-CIUDSAMLSA-N 0.000 description 1
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 1
- AWTDTFXPVCTHAK-BJDJZHNGSA-N Ile-Cys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N AWTDTFXPVCTHAK-BJDJZHNGSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 241001116498 Taxus baccata Species 0.000 description 1
- 241000202349 Taxus brevifolia Species 0.000 description 1
- 241000015728 Taxus canadensis Species 0.000 description 1
- 235000009065 Taxus cuspidata Nutrition 0.000 description 1
- 241001052560 Thallis Species 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- 229930014667 baccatin III Natural products 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 238000007036 catalytic synthesis reaction Methods 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000012468 concentrated sample Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 229930004069 diterpene Natural products 0.000 description 1
- 150000004141 diterpene derivatives Chemical class 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 239000006167 equilibration buffer Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 239000006225 natural substrate Substances 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000012064 sodium phosphate buffer Substances 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1025—Acyltransferases (2.3)
- C12N9/1029—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/02—Oxygen as only ring hetero atoms
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y203/00—Acyltransferases (2.3)
- C12Y203/01—Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
- C12Y203/01167—10-Deacetylbaccatin III 10-O-acetyltransferase (2.3.1.167)
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The invention belongs to the field of bioengineering, relates to a 10-deacetylbaccatin III 10 beta-O-acetyltransferase mutant and application thereof, and provides a series of 10-deacetylbaccatin III 10 beta-O-acetyltransferase (DBAT) mutant proteins which can specifically transfer acyl groups such as acetyl coenzyme A and the like to a C10 hydroxyl group of 10-deacetyltaxane to generate taxol or analogues thereof. The invention relates to amino acid sequences of DBAT mutants, nucleotide sequences encoding the amino acid sequences, and application of DBAT mutant proteins in synthesizing taxol or analogues thereof by utilizing 10-deacetyltaxane substrates such as 10-deacetyltaxol and the like; in particular, 10-deacetyl taxol is used as a substrate, and acetyl coenzyme A is used as an acyl donor to directly synthesize taxol.
Description
Technical Field
The invention belongs to the field of bioengineering, and in particular relates to a series of mutant proteins of 10-deacetylbaccatin III 10 beta-O-acetyltransferase (DBAT), which transfer acyl groups from but not limited to acetyl coenzyme A to 10-deacetyltaxane to generate taxol or analogues thereof; acyl acceptors for mutant proteins include, but are not limited to, 10-desacetyltaxol and 10-desacetylbaccatin III. The invention relates to amino acid sequences of these mutant proteins, nucleotide sequences encoding these amino acid sequences and uses thereof.
Background
10-De-acetyltransferase (DBAT) is an essential acylase in the taxol biosynthetic pathway and is responsible for catalyzing the acetylation of the hydroxyl group at the C-10 position on 10-De-acetylbaccatin III (10-DAB), the product baccatin III being the last diterpene intermediate in the taxol biosynthetic pathway [ Croteau R, et al, taxol biosynthesis and molecular genetics, phytochem Rev.,2006,5 (1): 75-97].
DBAT also has acetylation of the unnatural substrate 10-Deacetyltaxol (DT) [ Li BJ, et al, improving 10-deacetylbaccatin III-10-beta-O-acetyltransferase catalytic fitness for Taxol production Nat Commun.,2017,8:15544-15556.]Paclitaxel can be produced by only one step of acetylation reaction on the hydroxyl group at the C-10 position,). However, the acetylation efficiency of DT by wild-type DBAT (exemplified by DBAT derived from Taxus northeast) is extremely low (0.26U/mg), about 0.13% of that of its catalytic natural substrate 10-DAB (206.6U/mg).
The applicant of the present invention has performed protein engineering on DBATs of 6 different yew sources, and known that the DBATs of the 6 different sources have higher sequence consistency (about 98%), and only 21 different sites from each other (fig. 1), but the 6 DBATs have larger difference on the catalytic efficiency of DT, wherein the activity of DBATs to acetylate DT is the highest [ Li BJ, et al, improving 10-deacetylbaccatin iii-10- β -O-acetyltransferase catalytic fitness for Taxol production. The DBATcus three-dimensional structure obtained by homologous modeling is analyzed, and the difference sites are far away from the catalytic center, and the rule of affecting the catalytic efficiency is not known, so that the applicant of the invention performs DNA buffering on the 6 DBATs, and the random combination of different difference sites can be realized by breaking and rearranging template sequences of different sources. Through a large number of screening, a series of mutants containing R345C mutation are obtained, the activity of the acetylated DT is obviously improved, and the activity can reach more than 11 times of DBAT (DBATcus) from Taxus northeast.
Disclosure of Invention
The technical problems to be solved are as follows: the invention provides a DBAT series mutant protein containing R345C mutation, a nucleotide sequence for encoding the mutant protein, a recombinant plasmid containing the nucleotide sequence, a recombinant cell containing the nucleotide sequence or the recombinant plasmid, and application of the mutant protein, the nucleotide sequence, the recombinant plasmid or the recombinant cell in the aspect of catalyzing and synthesizing taxol or analogues thereof.
In order to solve the technical problems of the invention, the following technical scheme is provided:
the first aspect of the technical scheme of the invention is as follows: there is provided a mutant protein of 10-deacetylbaccatin III 10 beta-O-acetyltransferase DBAT, characterized in that the mutant protein contains a mutation of R345C on the basis of the sequence shown in SEQ ID NO. 1.
In some preferred embodiments, the mutant protein further has one or more amino acid mutations selected from the group consisting of: G38R, V39A, V A, L123H, V158M, G163S, Q213R, C42216R, P217K, S219A, L E, S L, F301V, E350M, S351D, L383I, S396G, S37415L, M417L, S420L, I421V, V M.
In some preferred embodiments, the amino acid sequence of the mutant protein is a derivative polypeptide selected from the group consisting of the amino acid sequences shown in SEQ ID NO. 2-SEQ ID NO. 28, and having an homology of 85% (preferably 95%) to any one of the amino acid sequences shown in SEQ ID NO. 2-SEQ ID NO. 28, and having an acyltransferase activity.
In some preferred embodiments, conventional modifications may be made to the mutant proteins described above; the conventional modification comprises acetylation, amidation, cyclization, glycosylation, phosphorylation, alkylation, biotinylation, fluorescent group modification, polyethylene glycol PEG modification and immobilization modification;
alternatively, in some preferred embodiments, tags for detection or purification are attached to these mutant proteins; the label comprises 6X His, GST, EGFP, MBP, nus, HA, igG, FLAG, c-Myc and proficiency eXact.
The second aspect of the technical scheme of the invention is that: nucleotide sequences encoding the mutant proteins of the first aspect are provided, wherein the nucleotide sequences are SEQ ID NO 30-SEQ ID NO 56, and nucleotide sequences having a homology of 95% (preferably 98%) with any one of the sequences shown in SEQ ID NO 30-SEQ ID NO 56.
The third aspect of the technical scheme of the invention is that: providing a recombinant plasmid comprising the nucleotide sequence of the second aspect.
The fourth aspect of the technical scheme of the invention is that: providing a recombinant cell comprising the nucleotide sequence of the second aspect or the recombinant plasmid of the third aspect, said recombinant cell being constructed using a host cell comprising: coli, pichia, saccharomyces cerevisiae, aspergillus nidulans, bacillus subtilis or agrobacterium.
The fifth aspect of the technical scheme of the invention is that: providing the mutant protein of the first aspect, the nucleotide sequence of the second aspect, the recombinant plasmid of the third aspect and the recombinant cell of the fourth aspect for the application in the catalytic synthesis of taxol or analogues thereof; the acyl acceptors include, but are not limited to, 10-deacetyltaxol, 10-deacetylbaccatin III; the acyl donors include, but are not limited to, acetyl-CoA, propionyl-CoA and butyryl-CoA.
A preferred acyl donor substrate is acetyl coa and a preferred acyl acceptor substrate is 10-Deacetyltaxol (DT);
beneficial technical effects
The invention utilizes protein engineering to modify 10-deacetylbaccatin III 10 beta-O-acetyltransferase (DBAT) to obtain 27 mutant proteins with remarkably improved catalytic activity to unnatural acyl receptor substrate DT compared with DBATcus. The mutant proteins can catalyze DT to directly generate taxol with high efficiency. The invention can simplify the synthesis steps of the taxol or the analogues thereof, solves the problems of less taxol or analogues thereof resources and difficult synthesis, and has the advantage of environmental friendliness compared with a chemical semi-synthesis method.
Drawings
FIG. 1 is a differential site analysis of a DNA buffering template.
FIG. 2 is a schematic diagram of the basic principle of construction of DBAT mutants using DNA buffering technology.
FIG. 3 is an HPLC analysis of DBAT mutant cells for conversion of 10-DAB.
FIG. 4 is a schematic diagram of SDS-PAGE bands of the target protein.
FIG. 5 is an HPLC analysis of DBAT family mutant protein catalytic unnatural substrate DT.
FIG. 6 is an optimum temperature curve and an optimum pH curve.
FIG. 7 is a plot of product concentration versus time for 2-283-F301V/E350M catalytic DT.
Detailed Description
The invention is further illustrated by the following examples, which are intended to be illustrative only and are not intended to limit the scope of the claims in any way.
Example 1: construction of DBAT DNA shuffling mutant
By artificially synthesizing the dbat gene sequence (dbat) of taxus northeast cus GenBank Accession: Q9M6E2.1), taxus brevifolia origin dbat (dbat) bre GenBank Accession, EU 107143.1) G38R/F301V mutant Gene sequence (dbat) bre G38R/F301V ) The dbat gene sequence (dbat) of Taxus media med GenBank Accession AY 452666.1), and the dbat gene sequence (dbat) of Taxus canadensis can GenBank Accession EU 107134.1), the dbat gene sequence of Taxus baccata (dbat) bac GenBank Accession AF 456342.1), and dbat gene sequence (dbat) of Taxus media wal GenBank Accession:EU 107140.1) as template.
The differential site analysis of the 6 DBAT proteins is shown in figure 1. The upstream and downstream primers used are shown in Table 1.
TABLE 1.6 upstream and downstream primers for amplification of dbat Gene sequences
The PCR amplification system of the 6 dbat gene fragments is as follows:
the PCR amplification conditions were as follows:
the PCR product was detected by agarose gel electrophoresis at 1.0%, and the PCR product was purified and recovered.
The following procedure was followed as described for the BTN131178DNA buffering kit: the PCR product was fragmented and all fragments ranging from 25 to 150bp were recovered by 3.0% agarose gel electrophoresis. 500ng of the recovered fragments were subjected to primer-free PCR amplification. Purifying and recovering 1 kb-1.5 kb fragment by 1.0% agarose gel electrophoresis, and using the fragment as template and dbat cus F/R is the primer for final PCR amplification. The basic principle diagram of constructing DBAT mutant by using DNA shuffle technology is shown in figure 2.
After the final amplified fragment was recovered, the resultant was digested with Nde I and Sal I, ligated with the pCWori vector digested with the same double enzymes, E.coli Trans109 competent was obtained, and positive transformants were selected by colony PCR.
Example 2: prokaryotic expression of DBAT mutant protein and bacterial activity screening
Single colonies of each positive transformant obtained in example 1 were picked up in 96-well deep well plates, each well containing 500. Mu.L of TBA (final Amp concentration of 100. Mu.g/mL) liquid medium, and after shaking culture at 37℃and 220rpm for about 12 hours, the culture of each single colony was inoculated with 100. Mu.L of seed liquid: 900. Mu.L of fresh TBA medium was inoculated in parallel into 3 wells of a 96-well plate, and after shaking culture at 37℃and 220rpm for 1 hour, 1. Mu.L of 1M IPTG was added to each well, followed by shaking culture at 18℃and 220rpm for 20 hours, and the 96-well plate was centrifuged to collect the cells after induction culture.
Primary screening mutants with significantly improved catalytic activity on DT were screened in positive variant strains relative to wild type DBATcus using 10-DAB as substrate. To each well, 100. Mu.L of 50mM sodium acetate-acetic acid buffer (pH 5.5) was added as a control to resuspend the cells, then 2.5. Mu.L of 10mM 10-DAB (DMSO was dissolved) was added, and after shaking reaction at 40℃and 250rpm for 2 hours, 400. Mu.L of methanol was added to terminate the reaction, and 50. Mu.L of the detection transformation product was applied by HPLC (see FIG. 3).
Example 3: specific activity determination of DBAT mutant protein catalytic substrate DT
Selecting a mutant flat plate colony (positive variant strain) with the bacterial level catalytic activity higher than DBATcus obtained in the example 2, inoculating the colony into 10mL TBA (Amp final concentration is 100 mug/mL) liquid culture medium, and placing the culture medium at 37 ℃ and shaking at 220rpm for overnight; transferring the recombinant bacteria cultured overnight into 200mL of fresh TBA liquid culture medium according to the proportion of 1%, and culturing for 2-3 h at 37 ℃ and 220 rpm; waiting bacteria liquid OD 600 At approximately 0.8, IPTG was added to a final concentration of 1mM, inducing culture conditions: 18 ℃, 220rpm, 20h. And centrifuging and collecting the induced thalli.
The bacterial cells after induction of expression were resuspended in equilibration buffer (20 mM imidazole, 100mM NaCl, 20mM Tris-HCl, pH 7.5) at a ratio of 50mL buffer to 1L bacterial cell pellet collected, broken up at a high pressure of 800bar for 5 times, centrifuged at 12000rpm at 4℃for 1h, and the supernatant was filtered through a 0.45 μm filter, purified by HisTrap HP (5 mL) preloaded with nickel column, and the fractions eluted with 200mM imidazole buffer were collected (the band of the protein of interest is shown in FIG. 4). The obtained target protein eluent was concentrated by a ultrafiltration tube having a molecular weight cut-off (Molecular Weight Cutoff, MWCO) of 30kDa under centrifugation conditions of 4000g for 30min, and the protein concentration of the concentrated sample was measured.
The specific activity detection system of DBATcus and mutant catalytic DT thereof is as follows:
100. Mu.L of the reaction system contained 0.25mg/mL of protein, 500. Mu.M (404.5. Mu.g/mL) of acetyl CoA, 500. Mu.M (405.94. Mu.g/mL) of DT, 20mM of imidazole, 5% of DMSO (V/V), and 50mM of sodium acetate-acetic acid buffer solution at pH5.5 were used to make up to 100. Mu.L, after shaking reaction at 37.5℃for 3 hours, 400. Mu.L of methanol was added to terminate the reaction, and 50. Mu.L of the conversion product was detected by HPLC (see FIG. 5).
The specific activity of DBATcus and its mutant catalyzing DT is shown in Table 2, and the mutation site compared to DBATcus is shown in Table 3. As can be seen from the data in the table, the specific activity of the mutant catalytic DT is obviously improved compared with DBATcus, and the specific activity of the mutant catalytic DT can be up to 7.9 times.
TABLE 2 specific Activity and relative enzymatic Activity of DBATcus and its mutants catalyzing DT
TABLE 3 mutation sites of DNA stuffer mutants (compared to DBATcus)
Example 4: acquisition of DBAT combination mutant and determination of specific Activity of catalytic DT
The method of whole plasmid PCR amplification was used, using the respective mutant primers shown in Table 4:
taking pCWori-2-60 as a template, introducing F301V mutation, and constructing a 2-60-F301V combined mutant;
taking pCWori-2-181 as a template, respectively introducing G38R, P217K, F301V, E350M, S351D, S G mutation, constructing 2-181-G38R, 2-181-P217K, 2-181-F301V, 2-181-E350M, 2-181-S351D and 2-181-S396G combined mutant, and continuously introducing S396G mutation on the basis of pCWori-2-80-R345C to construct 2-80-R345C/S396G combined mutant;
the pCWori-2-283 is used as a template, and the G38R, P217K, F301V, E350M, S351D, S396G mutation is respectively introduced to construct 2-283-G38R, 2-283-P217K, 2-283-F301V, 2-283-E350M, 2-283-S351D and 2-283-S396G combined mutants;
P217K, E and 350M, S D mutation are respectively introduced on the basis of pCWori-2-283-F301V to construct 2-283-F301V/P217K, 2-283-F301V/E350M and 2-283-F301V/S351D combined mutants;
in addition, P217K, S D mutation is continuously introduced by taking pCWori-2-283-F301V/E350M as a template, and 2-283-F301V/E350M/P217K, 2-283-F301V/E350M/S351D mutants and 2-283-F301V/E350M/P217K/S351D mutants are constructed.
TABLE 4 upstream and downstream primers for introducing the mutant into the respective mutation sites
The PCR system for whole plasmid amplification was as follows:
the PCR amplification conditions were as follows:
the PCR product was detected by agarose gel electrophoresis at 1.0%, and the PCR product was purified and recovered.
The PCR product was digested with Dpn I, and the digestion system was as follows:
enzyme cutting conditions: and enzyme cutting at 37 ℃ for 5 hours.
All the cleavage products were transformed into competent cells of E.coli Trans 109. Colonies were picked for plasmid extraction and sequencing validation.
The mutants were subjected to protein-induced expression, purification and specific activity determination of catalytic substrate DT as described in example 3, and the results are shown in Table 5, in which the activity of the combined mutants was significantly improved, up to 11.23 times, compared to DBATcus.
TABLE 5 specific Activity and relative enzymatic Activity of the combined mutant protein catalytic DT
TABLE 6 mutation sites of the combination mutants (compared to DBATcus)
Example 5: determination of optimal reaction conditions for DBAT mutant protein catalytic substrate DT
The enzyme catalysis system is the same as in example 3.
The determination of the optimum reaction pH was carried out at 37.5℃and the buffer composition and pH were set to 50mM sodium acetate-acetic acid buffer (pH 4.0, 4.5, 5.0, 5.5, 6.0, respectively) and 50mM sodium phosphate buffer (pH 6.5, 7.0, 7.5, 8.0, respectively).
The optimum reaction temperature was measured at pH5.5, and the temperatures were set to 25, 27.5, 30, 32.5, 35, 37.5, 40, 42.5, 45, 47.5, and 50 (. Degree.C.).
The results are shown in FIG. 6 and Table 7.
TABLE 7 optimum reaction conditions for DBATcus and several highly active mutant protein catalytic substrates DT
Example 6: dynamic parameter determination of DBATcus and mutant protein catalytic substrate DT
The catalytic system comprises: the 100. Mu.L reaction system comprises 0.5mg/mL of protein, 1mM of acetyl CoA, different concentrations of DT, 20mM of imidazole and 5% of DMSO (V/V), the different proteins are respectively supplemented to 100. Mu.L by using buffers with optimal pH, and the reaction is carried out by shaking for 1h at the respective optimal temperatures, and the conversion products are detected by HPLC.
The final DT concentrations were 2000, 1000, 500, 250, 125, 62.5, 31.25, 15.625. Mu.M, respectively.
The results are shown in Table 8, where the conversion efficiency of 2-60 is 7.2 times that of DBATcus, and the conversion efficiency of 2-283-P217K/F301V/E350M/S351D is 13.99 times that of DBATcus.
TABLE 8 kinetic parameters of DBATcus and mutant protein catalytic substrate DT
Example 7: product time-concentration profile for 2-283-F301V/E350M catalytic DT
The catalytic system comprises: 1mL of the reaction system contains 1mg/mL of protein, 10mM of acetyl-CoA, 10mM of DT, 20mM of imidazole, 5% of DMSO (V/V), and 50mM of sodium acetate-acetic acid buffer pH5.5, which is supplemented to 1mL. The production of Taxol was measured at 3h, 6h, 9h, 12h, 24h, 27h, 30h, 33h and 36h, respectively, and when the reaction was carried out for 12h, 1mg/mL final protein and 10mM final acetyl CoA were added to the system.
Reaction conditions: 32.5℃pH5.5, 600rpm.
The results are shown in fig. 7, which shows:
under the catalysis of 2-283-F301V/E350M, the yield of taxol increases at a relatively high speed in the first 6h of reaction, the yield increasing speed is gradually reduced from 6h to 12h, the reaction is continued for 24h after the protein is added in the 12h, and the highest yield of taxol is 2.27+/-0.04 mg/mL.
Sequence listing
<110> institute of medicine at the national academy of medical science
<120> 10-deacetylbaccatin III 10 beta-O-acetyltransferase mutant and application thereof
<160> 56
<170> SIPOSequenceListing 1.0
<210> 1
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 1
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Arg Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 2
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 2
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser His Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Met Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile His Pro Pro Thr Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Leu Ile Met Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 3
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 3
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Arg Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Ser Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile His Pro Pro Thr Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Leu Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 4
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 4
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 5
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 5
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Ile Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Val Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 6
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 6
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Ala Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Ile Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 7
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 7
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Leu Phe
405 410 415
Leu Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 8
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 8
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Ala Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile His Pro Pro Thr Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Leu Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 9
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 9
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile Arg Pro Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 10
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 10
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Val Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 11
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 11
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Arg Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Leu Phe
405 410 415
Leu Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 12
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 12
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Gln Leu Ile Cys Lys Pro Ser Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Leu Phe
405 410 415
Leu Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 13
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 13
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Val Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Leu Phe
405 410 415
Leu Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 14
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 14
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Met Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Leu Phe
405 410 415
Leu Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 15
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 15
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Asp Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Leu Phe
405 410 415
Leu Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 16
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 16
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Gln Leu Ile Cys Pro Pro Ser Thr Phe Gly Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Gly Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Leu Phe
405 410 415
Leu Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 17
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 17
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Arg Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile Arg Pro Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 18
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 18
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile Arg Lys Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 19
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 19
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile Arg Pro Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Val Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 20
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 20
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile Arg Pro Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Met Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 21
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 21
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile Arg Pro Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Asp Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 22
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 22
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile Arg Pro Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Phe Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Gly Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 23
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 23
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile Arg Lys Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Val Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 24
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 24
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile Arg Pro Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Val Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Met Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 25
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 25
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile Arg Pro Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Val Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Glu Asp Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 26
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 26
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile Arg Lys Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Val Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Met Ser Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 27
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 27
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile Arg Pro Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Val Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Met Asp Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 28
<211> 440
<212> PRT
<213> Artificial Sequence
<400> 28
Met Ala Gly Ser Thr Glu Phe Val Val Arg Ser Leu Glu Arg Val Met
1 5 10 15
Val Ala Pro Ser Gln Pro Ser Pro Lys Ala Phe Leu Gln Leu Ser Thr
20 25 30
Leu Asp Asn Leu Pro Gly Val Arg Glu Asn Ile Phe Asn Thr Leu Leu
35 40 45
Val Tyr Asn Ala Ser Asp Arg Val Ser Val Asp Pro Ala Lys Val Ile
50 55 60
Arg Gln Ala Leu Ser Lys Val Leu Val Tyr Tyr Ser Pro Phe Ala Gly
65 70 75 80
Arg Leu Arg Lys Lys Glu Asn Gly Asp Leu Glu Val Glu Cys Thr Gly
85 90 95
Glu Gly Ala Leu Phe Val Glu Ala Met Ala Asp Thr Asp Leu Ser Val
100 105 110
Leu Gly Asp Leu Asp Asp Tyr Ser Pro Ser Leu Glu Gln Leu Leu Phe
115 120 125
Cys Leu Pro Pro Asp Thr Asp Ile Glu Asp Ile His Pro Leu Val Val
130 135 140
Gln Val Thr Arg Phe Thr Cys Gly Gly Phe Val Val Gly Val Ser Phe
145 150 155 160
Cys His Gly Ile Cys Asp Gly Leu Gly Ala Gly Gln Phe Leu Ile Ala
165 170 175
Met Gly Glu Met Ala Arg Gly Glu Ile Lys Pro Ser Ser Glu Pro Ile
180 185 190
Trp Lys Arg Glu Leu Leu Lys Pro Glu Asp Pro Leu Tyr Arg Phe Gln
195 200 205
Tyr Tyr His Phe Arg Leu Ile Arg Lys Pro Ser Thr Phe Glu Lys Ile
210 215 220
Val Gln Gly Ser Leu Val Ile Thr Ser Glu Thr Ile Asn Cys Ile Lys
225 230 235 240
Gln Cys Leu Arg Glu Glu Ser Lys Glu Phe Cys Ser Ala Phe Glu Val
245 250 255
Val Ser Ala Leu Ala Trp Ile Ala Arg Thr Arg Ala Leu Gln Ile Pro
260 265 270
His Ser Glu Asn Val Lys Leu Ile Phe Ala Met Asp Met Arg Lys Leu
275 280 285
Phe Asn Pro Pro Leu Ser Lys Gly Tyr Tyr Gly Asn Val Val Gly Thr
290 295 300
Val Cys Ala Met Asp Asn Val Lys Asp Leu Leu Ser Gly Ser Leu Leu
305 310 315 320
Arg Val Val Arg Ile Ile Lys Lys Ala Lys Val Ser Leu Asn Glu His
325 330 335
Phe Thr Ser Thr Ile Val Thr Pro Cys Ser Gly Ser Asp Met Asp Ile
340 345 350
Asn Tyr Glu Asn Ile Val Gly Phe Gly Asp Arg Arg Arg Leu Gly Phe
355 360 365
Asp Glu Val Asp Phe Gly Trp Gly His Ala Asp Asn Val Ser Leu Val
370 375 380
Gln His Gly Leu Lys Asp Val Ser Val Val Gln Ser Tyr Phe Leu Phe
385 390 395 400
Ile Arg Pro Pro Lys Asn Asn Pro Asp Gly Ile Lys Ile Leu Ser Phe
405 410 415
Met Pro Pro Ser Ile Val Lys Ser Phe Lys Phe Glu Met Glu Thr Met
420 425 430
Thr Asn Lys Tyr Val Thr Lys Pro
435 440
<210> 29
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 29
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggagatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagccg 600
gaagaccctt tataccggtt ccagtattat cactttcaat tgatttgccc gccttcaaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
atcgtgacac cccgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 30
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 30
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggagatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacatg agcaactact tttttgtctt ccacctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg gatgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagcca 600
gaagaccctt tatatcggtt ccagtattat cactttcgat tgattcaccc gcctacgaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagattttg ggtggggaca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtta 1260
ataatgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 31
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 31
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc acgggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc 480
tgccatagta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcaccc gcctacgaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaat 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtttctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagattttg ggtggggaca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtta 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 32
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 32
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggagatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccacctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagccg 600
gaagaccctt tataccggtt ccagtattat cactttcaat tgatttgccc gccttcaaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacatctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgaggaa attatttaat ccgccacttt cgaagggata ctacggtaat 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtggggaca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 33
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 33
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc tgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggagatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccacctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtagg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagccg 600
gaagaccctt tataccggtt ccagtattat cactttcaat tgatttgccc gccttcaaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtattaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatc 840
tttgcaatgg acatgaggaa attatttaat ccaccacttt cgaagggata ctacggtaat 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtggggaca tgcagataat 1140
gtaagtatcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
gtagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 34
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 34
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc tgcagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccacctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagccg 600
gaagaccctt tataccggtt ccagtattat cactttcaat tgatttgccc gccttcaaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacatctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatc 840
tttgcaatgg acatgaggaa attatttaat ccaccacttt cgaagggata ctacggtaat 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
atcgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtggggaca tgcagataat 1140
gtaagtatcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 35
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 35
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggagatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccacctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagccg 600
gaagaccctt tataccggtt ccagtattat cactttcaat tgatttgccc gccttcaaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgaggaa attatttaat ccgccacttt cgaagggata ctacggtaat 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tattgttcct gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 36
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 36
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggcgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggagatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccacctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcaccc gcctacgaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt tgaagggata ctacggtaat 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtggggaca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 37
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 37
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgattacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaagggggg agattaagcc ctcctcggag ccgatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcgccc gccttcgaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaac 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 38
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 38
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggagatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccacctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagccg 600
gaagaccctt tataccggtt ccagtattat cactttcaat tgatttgccc gccttcaaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacatctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgaggaa attatttaat ccgccacttt cgaagggata ctacggtaat 900
gtcgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtggggaca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 39
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 39
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aagagtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggagatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccacctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagccg 600
gaagaccctt tataccggtt ccagtattat cactttcaat tgatttgccc gccttcaaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgaggaa attatttaat ccgccacttt cgaagggata ctacggtaat 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tattgttcct gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 40
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 40
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggagatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccacctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagccg 600
gaagaccctt tataccggtt ccagtattat cactttcaat tgatttgcaa gccttcaaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgaggaa attatttaat ccgccacttt cgaagggata ctacggtaat 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tattgttcct gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 41
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 41
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggagatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccacctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagccg 600
gaagaccctt tataccggtt ccagtattat cactttcaat tgatttgccc gccttcaaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgaggaa attatttaat ccgccacttt cgaagggata ctacggtaat 900
gtggttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tattgttcct gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 42
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 42
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggagatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccacctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagccg 600
gaagaccctt tataccggtt ccagtattat cactttcaat tgatttgccc gccttcaaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgaggaa attatttaat ccgccacttt cgaagggata ctacggtaat 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatatg agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tattgttcct gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 43
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 43
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggagatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccacctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagccg 600
gaagaccctt tataccggtt ccagtattat cactttcaat tgatttgccc gccttcaaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgaggaa attatttaat ccgccacttt cgaagggata ctacggtaat 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag gacatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tattgttcct gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 44
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 44
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggagatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgactacagt 360
ccttcacttg agcaactact tttttgtctt ccacctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaaggggag agattaagcc ctcctcggag ccaatatgga agagagaatt gctgaagccg 600
gaagaccctt tataccggtt ccagtattat cactttcaat tgatttgccc gccttcaaca 660
ttcgggaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacaag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgaggaa attatttaat ccgccacttt cgaagggata ctacggtaat 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaggata ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tattgttcct gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 45
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 45
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aagagtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgattacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaagggggg agattaagcc ctcctcggag ccgatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcgccc gccttcgaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaac 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 46
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 46
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgattacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaagggggg agattaagcc ctcctcggag ccgatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcgcaa accttcgaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaac 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 47
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 47
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgattacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaagggggg agattaagcc ctcctcggag ccgatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcgccc gccttcgaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaac 900
gttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 48
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 48
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgattacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaagggggg agattaagcc ctcctcggag ccgatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcgccc gccttcgaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaac 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatatg agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 49
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 49
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgattacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaagggggg agattaagcc ctcctcggag ccgatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcgccc gccttcgaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaac 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag gatatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 50
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 50
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgattacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaagggggg agattaagcc ctcctcggag ccgatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcgccc gccttcgaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaac 900
tttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaggata ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 51
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 51
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgattacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaagggggg agattaagcc ctcctcggag ccgatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcgcaa gccttcgaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaac 900
gttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 52
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 52
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgattacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaagggggg agattaagcc ctcctcggag ccgatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcgccc gccttcgaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaac 900
gttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatatg agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 53
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 53
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgattacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaagggggg agattaagcc ctcctcggag ccgatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcgccc gccttcgaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaac 900
gttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatgag gatatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 54
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 54
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgattacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaagggggg agattaagcc ctcctcggag ccgatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcgcaa gccttcgaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaac 900
gttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatatg agtatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 55
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 55
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgattacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaagggggg agattaagcc ctcctcggag ccgatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcgccc gccttcgaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaac 900
gttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatatg gatatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
<210> 56
<211> 1323
<212> DNA
<213> Artificial Sequence
<400> 56
atggcaggct caacagaatt tgtggtaaga agcttagaga gagtgatggt ggctccaagc 60
cagccatcgc ccaaagcttt cctgcagctc tccacccttg acaatctacc aggggtgaga 120
gaaaacattt ttaacacctt gttagtctac aatgcctcag acagagtttc cgtagatcct 180
gcaaaagtaa ttcggcaggc tctctccaag gtgttggtgt actattcccc ttttgcaggg 240
cgtctcagga aaaaagaaaa tggggatctt gaagtggagt gcacagggga gggtgctctg 300
tttgtggaag ccatggctga cactgacctc tcagtcttag gagatttgga tgattacagt 360
ccttcacttg agcaactact tttttgtctt ccgcctgata cagatattga ggacatccat 420
cctctggtgg ttcaggtaac tcgttttaca tgtggaggtt ttgttgtggg ggtgagtttc 480
tgccatggta tatgtgatgg actaggagca ggccagtttc ttatagccat gggagagatg 540
gcaagggggg agattaagcc ctcctcggag ccgatatgga agagagaatt gctgaagcca 600
gaagaccctt tataccggtt ccagtattat cactttcgat tgattcgcaa gccttcgaca 660
ttcgagaaaa tagttcaagg atctcttgtt ataacctctg agacaataaa ttgtatcaaa 720
caatgcctta gggaagaaag taaagaattt tgctctgcgt tcgaagttgt atctgcattg 780
gcttggatag caaggacgag ggctcttcaa attccacata gtgagaatgt gaagcttatt 840
tttgcaatgg acatgagaaa attatttaat ccaccacttt cgaagggata ctacggtaac 900
gttgttggta ccgtatgtgc aatggataat gtcaaggacc tattaagtgg atctcttttg 960
cgtgttgtaa ggattataaa gaaagcaaag gtctctttaa atgagcattt cacgtcaaca 1020
attgtgacac cctgttctgg atcagatatg gatatcaatt atgaaaacat agttggattt 1080
ggtgatcgaa ggcgattggg atttgatgaa gtagactttg ggtgggggca tgcagataat 1140
gtaagtctcg tgcaacatgg attgaaggat gtttcagtcg tgcaaagtta ttttcttttc 1200
atacgacctc ccaagaataa ccccgatgga atcaagatcc tatcgttcat gcccccgtca 1260
atagtgaaat ccttcaaatt tgaaatggaa accatgacaa acaaatatgt aactaagcct 1320
tga 1323
Claims (13)
1. A mutant protein of 10-deacetylbaccatin iii 10 beta-O-acetyltransferase DBAT, wherein the amino acid sequence of said mutant protein has at least an R345C amino acid mutation compared to the wild type protein DBAT shown in SEQ ID NO 1.
2. The mutant protein of claim 1, wherein the mutant protein further has one or more amino acid mutations selected from the group consisting of: G38R, V39A, V A, L123H, V158M, G163S, Q213R, C42216R, P217K, S219A, L E, S L, F301V, E350M, S351D, L383I, S396G, S37415L, M417L, S420L, I421V, V M.
3. Mutant protein according to claim 1 or 2, wherein the amino acid sequence of the mutant protein is selected from the group consisting of the amino acid sequences shown in SEQ ID NO 2 to SEQ ID NO 28, and derived polypeptides having homology of more than or equal to 85%, preferably more than or equal to 95%, with any one of the amino acid sequences shown in SEQ ID NO 2 to SEQ ID NO 28 and having an acyltransferase activity.
4. A mutant protein according to any one of claims 1 to 3, wherein the mutant protein further has one or more modifications selected from the group consisting of: acetylation, amidation, cyclization, glycosylation, phosphorylation, alkylation, biotinylation, fluorophore modification, polyethylene glycol PEG modification, immobilization modification.
5. The mutant protein according to any one of claims 1 to 4, wherein the mutant protein is further linked to one or more tags selected from the group consisting of: 6X His, GST, EGFP, MBP, nus, HA, igG, FLAG, c-Myc, proficiency eXact.
6. A polynucleotide encoding the mutant protein of any one of claims 1-5.
7. The polynucleotide according to claim 6, wherein the polynucleotide is selected from the group consisting of the sequences shown in SEQ ID NO30 to SEQ ID NO 56, and nucleotide sequences having a homology of 95% or more, preferably a homology of 98% or more with the sequence shown in any one of SEQ ID NO30 to SEQ ID NO 56.
8. A recombinant plasmid comprising the polynucleotide of claim 6 or 7.
9. A recombinant cell incorporating the polynucleotide of any one of claims 6 or 7 or the recombinant plasmid of claim 8.
10. The recombinant cell of claim 9, wherein the recombinant cell is constructed using a host cell that: coli, pichia, saccharomyces cerevisiae, aspergillus nidulans, bacillus subtilis or agrobacterium.
11. Use of a mutant protein according to any one of claims 1 to 5 or a nucleotide sequence according to claim 6 or 7 or a recombinant plasmid according to claim 8 or a recombinant cell according to claim 9 for catalyzing the acylation of 10-deacetyltaxane to 10-acyl taxane.
12. The use according to claim 11, wherein said 10-deacetyltaxane comprises 10-deacetyl taxol, 10-deacetyl baccatin iii.
13. The use according to claim 11 or 12, wherein the acyl donor comprises acetyl coa, propionyl coa and butyryl coa.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210444105.0A CN116987681A (en) | 2022-04-26 | 2022-04-26 | 10-deacetylbaccatin III 10 beta-O-acetyltransferase mutant and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210444105.0A CN116987681A (en) | 2022-04-26 | 2022-04-26 | 10-deacetylbaccatin III 10 beta-O-acetyltransferase mutant and application thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116987681A true CN116987681A (en) | 2023-11-03 |
Family
ID=88523730
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210444105.0A Pending CN116987681A (en) | 2022-04-26 | 2022-04-26 | 10-deacetylbaccatin III 10 beta-O-acetyltransferase mutant and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116987681A (en) |
-
2022
- 2022-04-26 CN CN202210444105.0A patent/CN116987681A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8669088B2 (en) | Enzyme associated with equol synthesis | |
CN109266630B (en) | Lipase and application thereof in preparation of brivaracetam intermediate | |
CN108103039B (en) | Fucosyltransferase mutants and screening method and application thereof | |
US20220348974A1 (en) | Biotin synthases for efficient production of biotin | |
CN106754818B (en) | Heat-resistant esterase mutant and preparation method and application thereof | |
CN113980925B (en) | Catalytic synthesis of paclitaxel and its derivatives using 10-deacetylbaccatin III 10 beta-O-acetyltransferase mutant | |
CN116987681A (en) | 10-deacetylbaccatin III 10 beta-O-acetyltransferase mutant and application thereof | |
CN109355271A (en) | A kind of epoxide hydrolase and its application in ocean rhodotorula source | |
CN115125222A (en) | Synthesis of taxol and its analogs by using 10-deacetylbaccatin III10 beta-O-acetyltransferase mutant as catalyst | |
CN112410353B (en) | fkbS gene, genetic engineering bacterium containing fkbS gene, and preparation method and application of fkbS gene | |
CN109402085B (en) | Recombinant strains, methods for synthesizing simvastatin, and related enzymes | |
CN112708571A (en) | Recombinant yeast for producing chondroitin sulfate with controllable molecular weight by fermentation and application thereof | |
US7273755B2 (en) | Compositions and methods for altering biosynthesis of taxanes and taxane-related compounds | |
CN111363709A (en) | Genetic engineering bacterium for improving isoprene yield and construction method and application thereof | |
CN114369582B (en) | Brucella bifidus source ester synthetase JG536_25355, coding gene and application | |
Su et al. | Cloning of the N-acylamino acid racemase gene from Amycolatopsis azurea and biochemical characterization of the gene product | |
CN111094558A (en) | Thioesterase variants with improved activity for production of medium-chain fatty acid derivatives | |
CN112342203B (en) | Application of ribosome sigma factor, mutant thereof and protein obtained by encoding to increase yield of lipstatin | |
CN102517265A (en) | Esterase, and preparation method and application thereof | |
CN114292825A (en) | Synthetic method of tropinone | |
EP1675946A1 (en) | Process for producing penicillin | |
CN117586892A (en) | Genetically engineered bacterium for producing neotame B0 and preparation method and application thereof | |
CN117210429A (en) | Histidine trimethylase EgtD mutant and application thereof | |
WO2017132924A1 (en) | Enzyme for biosynthesis of isoprene and isopentenyl, and mutant thereof | |
CN116103360A (en) | Method for preparing selenoamino acid by using enzyme method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication |