CN1950507A - 重组水痘-带状疱疹病毒 - Google Patents
重组水痘-带状疱疹病毒 Download PDFInfo
- Publication number
- CN1950507A CN1950507A CNA2005800145230A CN200580014523A CN1950507A CN 1950507 A CN1950507 A CN 1950507A CN A2005800145230 A CNA2005800145230 A CN A2005800145230A CN 200580014523 A CN200580014523 A CN 200580014523A CN 1950507 A CN1950507 A CN 1950507A
- Authority
- CN
- China
- Prior art keywords
- gene
- orf
- zone
- flank region
- zoster virus
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 241000701085 Human alphaherpesvirus 3 Species 0.000 title claims abstract description 278
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 1482
- 238000000034 method Methods 0.000 claims abstract description 145
- 239000012634 fragment Substances 0.000 claims abstract description 121
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 92
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 92
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 92
- 238000002744 homologous recombination Methods 0.000 claims abstract description 35
- 230000006801 homologous recombination Effects 0.000 claims abstract description 35
- 210000004027 cell Anatomy 0.000 claims description 158
- 241000700605 Viruses Species 0.000 claims description 86
- 229960005486 vaccine Drugs 0.000 claims description 53
- 241000894006 Bacteria Species 0.000 claims description 47
- 230000001580 bacterial effect Effects 0.000 claims description 42
- 230000000968 intestinal effect Effects 0.000 claims description 39
- 238000002360 preparation method Methods 0.000 claims description 39
- 239000003814 drug Substances 0.000 claims description 37
- 230000008676 import Effects 0.000 claims description 37
- 239000003550 marker Substances 0.000 claims description 35
- 230000008859 change Effects 0.000 claims description 32
- 230000006798 recombination Effects 0.000 claims description 32
- 238000005215 recombination Methods 0.000 claims description 32
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 claims description 31
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 claims description 31
- 239000002773 nucleotide Substances 0.000 claims description 24
- 125000003729 nucleotide group Chemical group 0.000 claims description 24
- 239000000203 mixture Substances 0.000 claims description 23
- 239000008194 pharmaceutical composition Substances 0.000 claims description 18
- 108010043121 Green Fluorescent Proteins Proteins 0.000 claims description 16
- 102000004144 Green Fluorescent Proteins Human genes 0.000 claims description 15
- 239000005090 green fluorescent protein Substances 0.000 claims description 15
- 210000004962 mammalian cell Anatomy 0.000 claims description 14
- 239000013600 plasmid vector Substances 0.000 claims description 9
- 239000013598 vector Substances 0.000 abstract description 10
- 230000008569 process Effects 0.000 abstract description 8
- 239000008196 pharmacological composition Substances 0.000 abstract 1
- 108700026244 Open Reading Frames Proteins 0.000 description 675
- 150000001413 amino acids Chemical class 0.000 description 181
- 241000282326 Felis catus Species 0.000 description 122
- 229940024606 amino acid Drugs 0.000 description 104
- 235000001014 amino acid Nutrition 0.000 description 104
- 239000013612 plasmid Substances 0.000 description 43
- 108020004414 DNA Proteins 0.000 description 36
- 239000003795 chemical substances by application Substances 0.000 description 35
- 108090000765 processed proteins & peptides Proteins 0.000 description 30
- 230000003449 preventive effect Effects 0.000 description 29
- 230000001225 therapeutic effect Effects 0.000 description 26
- 239000000463 material Substances 0.000 description 25
- 229920001184 polypeptide Polymers 0.000 description 24
- 102000004196 processed proteins & peptides Human genes 0.000 description 24
- 230000001717 pathogenic effect Effects 0.000 description 19
- 239000000969 carrier Substances 0.000 description 18
- 241000196324 Embryophyta Species 0.000 description 16
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 15
- 230000006870 function Effects 0.000 description 15
- 238000003752 polymerase chain reaction Methods 0.000 description 15
- 102000004169 proteins and genes Human genes 0.000 description 15
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 14
- 230000000694 effects Effects 0.000 description 14
- 108091033319 polynucleotide Proteins 0.000 description 14
- 102000040430 polynucleotide Human genes 0.000 description 14
- 239000002157 polynucleotide Substances 0.000 description 14
- 235000018102 proteins Nutrition 0.000 description 14
- 108010061238 threonyl-glycine Proteins 0.000 description 14
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 13
- 108010005233 alanylglutamic acid Proteins 0.000 description 13
- 108010050848 glycylleucine Proteins 0.000 description 13
- 230000036961 partial effect Effects 0.000 description 13
- 238000012360 testing method Methods 0.000 description 13
- 230000002238 attenuated effect Effects 0.000 description 12
- 201000010099 disease Diseases 0.000 description 12
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 12
- 230000008521 reorganization Effects 0.000 description 12
- 239000000126 substance Substances 0.000 description 12
- 239000000758 substrate Substances 0.000 description 12
- -1 wherein Substances 0.000 description 12
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 11
- 241000880493 Leptailurus serval Species 0.000 description 11
- 241001465754 Metazoa Species 0.000 description 11
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 11
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 11
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 10
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- 108010008355 arginyl-glutamine Proteins 0.000 description 10
- 230000008034 disappearance Effects 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 10
- 239000000047 product Substances 0.000 description 10
- 108010031719 prolyl-serine Proteins 0.000 description 10
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 9
- 108010062796 arginyllysine Proteins 0.000 description 9
- 108010057821 leucylproline Proteins 0.000 description 9
- 238000004519 manufacturing process Methods 0.000 description 9
- 239000000243 solution Substances 0.000 description 9
- 108020004705 Codon Proteins 0.000 description 8
- 208000007514 Herpes zoster Diseases 0.000 description 8
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 8
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 8
- 108010047857 aspartylglycine Proteins 0.000 description 8
- 230000001419 dependent effect Effects 0.000 description 8
- MURGITYSBWUQTI-UHFFFAOYSA-N fluorescin Chemical compound OC(=O)C1=CC=CC=C1C1C2=CC=C(O)C=C2OC2=CC(O)=CC=C21 MURGITYSBWUQTI-UHFFFAOYSA-N 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- 108010037850 glycylvaline Proteins 0.000 description 8
- 108010040030 histidinoalanine Proteins 0.000 description 8
- 239000007924 injection Substances 0.000 description 8
- 238000002347 injection Methods 0.000 description 8
- 108010000761 leucylarginine Proteins 0.000 description 8
- 108091008146 restriction endonucleases Proteins 0.000 description 8
- 239000000523 sample Substances 0.000 description 8
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 7
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 7
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 7
- 108010093581 aspartyl-proline Proteins 0.000 description 7
- 239000003937 drug carrier Substances 0.000 description 7
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 7
- 108010077515 glycylproline Proteins 0.000 description 7
- 108010034529 leucyl-lysine Proteins 0.000 description 7
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 7
- 108010090894 prolylleucine Proteins 0.000 description 7
- 239000013605 shuttle vector Substances 0.000 description 7
- 229940021648 varicella vaccine Drugs 0.000 description 7
- 230000003612 virological effect Effects 0.000 description 7
- 101710169336 5'-deoxyadenosine deaminase Proteins 0.000 description 6
- 102000055025 Adenosine deaminases Human genes 0.000 description 6
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 6
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 6
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 6
- 108010079364 N-glycylalanine Proteins 0.000 description 6
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 6
- 229940124863 Varicella-zoster virus vaccine Drugs 0.000 description 6
- 108010068380 arginylarginine Proteins 0.000 description 6
- 108010060035 arginylproline Proteins 0.000 description 6
- 150000001875 compounds Chemical class 0.000 description 6
- 229940079593 drug Drugs 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 6
- 108010081551 glycylphenylalanine Proteins 0.000 description 6
- 208000015181 infectious disease Diseases 0.000 description 6
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 6
- 108010051242 phenylalanylserine Proteins 0.000 description 6
- 230000002062 proliferating effect Effects 0.000 description 6
- 108010004914 prolylarginine Proteins 0.000 description 6
- 108010053725 prolylvaline Proteins 0.000 description 6
- 108010048818 seryl-histidine Proteins 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- 241000701161 unidentified adenovirus Species 0.000 description 6
- 241000186226 Corynebacterium glutamicum Species 0.000 description 5
- 241000698776 Duma Species 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 5
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 5
- 108010065920 Insulin Lispro Proteins 0.000 description 5
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 5
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 5
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 5
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 5
- 206010035226 Plasma cell myeloma Diseases 0.000 description 5
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 5
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 5
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- 108010087924 alanylproline Proteins 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 239000000284 extract Substances 0.000 description 5
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 5
- 239000007788 liquid Substances 0.000 description 5
- 229930182817 methionine Natural products 0.000 description 5
- 201000000050 myeloid neoplasm Diseases 0.000 description 5
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 238000011144 upstream manufacturing Methods 0.000 description 5
- 108010073969 valyllysine Proteins 0.000 description 5
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 4
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 4
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 4
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 4
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 4
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 4
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 4
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 4
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 4
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 4
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 4
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 4
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 4
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 4
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 4
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 4
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 4
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 4
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 4
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 4
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 4
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 4
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 4
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 4
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 4
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 4
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 4
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 4
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 4
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 4
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 4
- 241000607720 Serratia Species 0.000 description 4
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 4
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 4
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 4
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 4
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 210000004507 artificial chromosome Anatomy 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 4
- 108010054813 diprotin B Proteins 0.000 description 4
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010085325 histidylproline Proteins 0.000 description 4
- 238000011081 inoculation Methods 0.000 description 4
- 201000001441 melanoma Diseases 0.000 description 4
- 238000002493 microarray Methods 0.000 description 4
- 238000012856 packing Methods 0.000 description 4
- 239000004033 plastic Substances 0.000 description 4
- 238000012797 qualification Methods 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 239000003981 vehicle Substances 0.000 description 4
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 3
- 102100033350 ATP-dependent translocase ABCB1 Human genes 0.000 description 3
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 3
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 3
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 3
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 3
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 3
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 3
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 3
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 3
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 3
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 3
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 3
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 3
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 3
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 3
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 3
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 3
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 3
- 239000004475 Arginine Substances 0.000 description 3
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 3
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 3
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 3
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 3
- 108010070255 Aspartate-ammonia ligase Proteins 0.000 description 3
- 241000193830 Bacillus <bacterium> Species 0.000 description 3
- 241000186146 Brevibacterium Species 0.000 description 3
- 241000186216 Corynebacterium Species 0.000 description 3
- 108010051219 Cre recombinase Proteins 0.000 description 3
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 3
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 3
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 3
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 3
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 3
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 3
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 3
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 3
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 3
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 3
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 3
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 3
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 3
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 3
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 3
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 3
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 3
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 3
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 3
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 3
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 3
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 3
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 3
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 3
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 3
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 3
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 3
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 3
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 3
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 3
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 3
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 3
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 3
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 3
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 3
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 3
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 3
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 3
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 3
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 3
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 3
- 241001467578 Microbacterium Species 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 3
- VCUFZILGIRCDQQ-KRWDZBQOSA-N N-[[(5S)-2-oxo-3-(2-oxo-3H-1,3-benzoxazol-6-yl)-1,3-oxazolidin-5-yl]methyl]-2-[[3-(trifluoromethoxy)phenyl]methylamino]pyrimidine-5-carboxamide Chemical compound O=C1O[C@H](CN1C1=CC2=C(NC(O2)=O)C=C1)CNC(=O)C=1C=NC(=NC=1)NCC1=CC(=CC=C1)OC(F)(F)F VCUFZILGIRCDQQ-KRWDZBQOSA-N 0.000 description 3
- 102000052812 Ornithine decarboxylases Human genes 0.000 description 3
- 108700005126 Ornithine decarboxylases Proteins 0.000 description 3
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 3
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 3
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 3
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 3
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 3
- QAAYIXYLEMRULP-SRVKXCTJSA-N Pro-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 QAAYIXYLEMRULP-SRVKXCTJSA-N 0.000 description 3
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 3
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 3
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 3
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 3
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 3
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 3
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 3
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 3
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 3
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 3
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 3
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 3
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 3
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 3
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 3
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 3
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 3
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 3
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 3
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 3
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 3
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 3
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 3
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 3
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 3
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 3
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 3
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 3
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 3
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 3
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 3
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 3
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 3
- 108010081404 acein-2 Proteins 0.000 description 3
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 238000010171 animal model Methods 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 3
- 229960003121 arginine Drugs 0.000 description 3
- 108010036533 arginylvaline Proteins 0.000 description 3
- 229960005261 aspartic acid Drugs 0.000 description 3
- 235000003704 aspartic acid Nutrition 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 230000037396 body weight Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 238000001914 filtration Methods 0.000 description 3
- 239000011521 glass Substances 0.000 description 3
- 229960002989 glutamic acid Drugs 0.000 description 3
- 150000004676 glycans Chemical class 0.000 description 3
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 3
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 3
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 108010036413 histidylglycine Proteins 0.000 description 3
- 108010025306 histidylleucine Proteins 0.000 description 3
- 230000005847 immunogenicity Effects 0.000 description 3
- 238000007912 intraperitoneal administration Methods 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010012058 leucyltyrosine Proteins 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- HPNSFSBZBAHARI-UHFFFAOYSA-N micophenolic acid Natural products OC1=C(CC=C(C)CCC(O)=O)C(OC)=C(C)C2=C1C(=O)OC2 HPNSFSBZBAHARI-UHFFFAOYSA-N 0.000 description 3
- 238000002703 mutagenesis Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 229960000951 mycophenolic acid Drugs 0.000 description 3
- HPNSFSBZBAHARI-RUDMXATFSA-N mycophenolic acid Chemical compound OC1=C(C\C=C(/C)CCC(O)=O)C(OC)=C(C)C2=C1C(=O)OC2 HPNSFSBZBAHARI-RUDMXATFSA-N 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 229920001282 polysaccharide Polymers 0.000 description 3
- 239000005017 polysaccharide Substances 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 3
- 230000001954 sterilising effect Effects 0.000 description 3
- 238000007920 subcutaneous administration Methods 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 231100000419 toxicity Toxicity 0.000 description 3
- 230000001988 toxicity Effects 0.000 description 3
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 3
- 108010012050 valyl-aspartyl-prolyl-proline Proteins 0.000 description 3
- 235000013311 vegetables Nutrition 0.000 description 3
- 210000001835 viscera Anatomy 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- LRFVTYWOQMYALW-UHFFFAOYSA-N 9H-xanthine Chemical compound O=C1NC(=O)NC2=C1NC=N2 LRFVTYWOQMYALW-UHFFFAOYSA-N 0.000 description 2
- 101000818123 Acholeplasma phage L2 Uncharacterized 17.2 kDa protein Proteins 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 2
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 2
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 2
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 2
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 2
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 2
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 2
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 2
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 2
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 2
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 2
- AETQNIIFKCMVHP-UVBJJODRSA-N Ala-Trp-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AETQNIIFKCMVHP-UVBJJODRSA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 2
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 2
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 2
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 2
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 2
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 2
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 2
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 2
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 2
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 2
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 2
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 2
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 2
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 2
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 2
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 2
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 2
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 2
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 2
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 2
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- YHXNKGKUDJCAHB-PBCZWWQYSA-N Asn-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YHXNKGKUDJCAHB-PBCZWWQYSA-N 0.000 description 2
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 2
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 2
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 2
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 2
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 2
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 2
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 2
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 2
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 2
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 2
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 2
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 2
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 2
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 2
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 2
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 2
- 108010003415 Aspartate Aminotransferases Proteins 0.000 description 2
- 102000004625 Aspartate Aminotransferases Human genes 0.000 description 2
- 241000020089 Atacta Species 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000193744 Bacillus amyloliquefaciens Species 0.000 description 2
- 201000006082 Chickenpox Diseases 0.000 description 2
- 241000282552 Chlorocebus aethiops Species 0.000 description 2
- 206010009944 Colon cancer Diseases 0.000 description 2
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 2
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 2
- KJJASVYBTKRYSN-FXQIFTODSA-N Cys-Pro-Asp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC(=O)O)C(=O)O KJJASVYBTKRYSN-FXQIFTODSA-N 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 241001411320 Eriogonum inflatum Species 0.000 description 2
- 108060002716 Exonuclease Proteins 0.000 description 2
- 108010010803 Gelatin Proteins 0.000 description 2
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 2
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 2
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 2
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 2
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 2
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 2
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 2
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 2
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 2
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 2
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 2
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 2
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 2
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 2
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 2
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 2
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 2
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 2
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 2
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- CAVMESABQIKFKT-IUCAKERBSA-N Glu-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N CAVMESABQIKFKT-IUCAKERBSA-N 0.000 description 2
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 2
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 2
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 2
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 2
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 2
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 2
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 2
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 2
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 2
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 2
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 2
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 2
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 2
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 2
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 2
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- FWKQNCXZGNBPFD-UHFFFAOYSA-N Guaiazulene Chemical compound CC(C)C1=CC=C(C)C2=CC=C(C)C2=C1 FWKQNCXZGNBPFD-UHFFFAOYSA-N 0.000 description 2
- 101000818121 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 18.2 kDa protein in rep-hol intergenic region Proteins 0.000 description 2
- 101000976889 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 19.2 kDa protein in cox-rep intergenic region Proteins 0.000 description 2
- 101000768938 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 8.9 kDa protein in int-C1 intergenic region Proteins 0.000 description 2
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 2
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 2
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 2
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 2
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 2
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 2
- DMAPKBANYNZHNR-ULQDDVLXSA-N His-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DMAPKBANYNZHNR-ULQDDVLXSA-N 0.000 description 2
- 101000911390 Homo sapiens Coagulation factor VIII Proteins 0.000 description 2
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 2
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 2
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 2
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 2
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 2
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 2
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 2
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 2
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 2
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 2
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 2
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 2
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 2
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 2
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 2
- 101000790842 Klebsiella pneumoniae Uncharacterized 65.4 kDa protein in cps region Proteins 0.000 description 2
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 2
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- FZMNAYBEFGZEIF-AVGNSLFASA-N Leu-Met-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N FZMNAYBEFGZEIF-AVGNSLFASA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 2
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 2
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 2
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 2
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 2
- DAOSYIZXRCOKII-SRVKXCTJSA-N Lys-His-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O DAOSYIZXRCOKII-SRVKXCTJSA-N 0.000 description 2
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 2
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 2
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 2
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 2
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 2
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 2
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 2
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 2
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- 108010047230 Member 1 Subfamily B ATP Binding Cassette Transporter Proteins 0.000 description 2
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 2
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 2
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 2
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 2
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 2
- 241001529936 Murinae Species 0.000 description 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 2
- XUYPXLNMDZIRQH-LURJTMIESA-N N-acetyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC(C)=O XUYPXLNMDZIRQH-LURJTMIESA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 108010065395 Neuropep-1 Proteins 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- 101000781204 Orgyia pseudotsugata multicapsid polyhedrosis virus Uncharacterized 36.6 kDa protein Proteins 0.000 description 2
- 108010055012 Orotidine-5'-phosphate decarboxylase Proteins 0.000 description 2
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 2
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 2
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 2
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 2
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 2
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 2
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 2
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 2
- 101710118890 Photosystem II reaction center protein Ycf12 Proteins 0.000 description 2
- 239000004793 Polystyrene Substances 0.000 description 2
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 2
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 2
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 2
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 2
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 2
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 2
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 2
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 2
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 2
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 2
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 2
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 2
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 2
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 2
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 2
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 2
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 2
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 2
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 2
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- SNSYSBUTTJBPDG-OKZBNKHCSA-N Pro-Trp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N4CCC[C@@H]4C(=O)O SNSYSBUTTJBPDG-OKZBNKHCSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 2
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 2
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 2
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 2
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 2
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 2
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 2
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 2
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 2
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 2
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- QNBVFKZSSRYNFX-CUJWVEQBSA-N Ser-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N)O QNBVFKZSSRYNFX-CUJWVEQBSA-N 0.000 description 2
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 2
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 2
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 241000881765 Serratia ficaria Species 0.000 description 2
- 241000607715 Serratia marcescens Species 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- 206010042566 Superinfection Diseases 0.000 description 2
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 2
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 2
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 2
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 2
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 2
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 2
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 2
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 2
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 2
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 2
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 2
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 2
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 2
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 2
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 2
- BJJRNAVDQGREGC-HOUAVDHOSA-N Thr-Trp-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O BJJRNAVDQGREGC-HOUAVDHOSA-N 0.000 description 2
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 2
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 2
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 108010022394 Threonine synthase Proteins 0.000 description 2
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 description 2
- DVIIYMVCSUQOJG-QEJZJMRPSA-N Trp-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DVIIYMVCSUQOJG-QEJZJMRPSA-N 0.000 description 2
- GQNCRIFNDVFRNF-BPUTZDHNSA-N Trp-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O GQNCRIFNDVFRNF-BPUTZDHNSA-N 0.000 description 2
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 2
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 2
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 2
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 2
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 2
- BQASAMYRHNCKQE-IHRRRGAJSA-N Tyr-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BQASAMYRHNCKQE-IHRRRGAJSA-N 0.000 description 2
- 101710198378 Uncharacterized 10.8 kDa protein in cox-rep intergenic region Proteins 0.000 description 2
- 101710110895 Uncharacterized 7.3 kDa protein in cox-rep intergenic region Proteins 0.000 description 2
- 101710134973 Uncharacterized 9.7 kDa protein in cox-rep intergenic region Proteins 0.000 description 2
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 2
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 2
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 2
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 2
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 2
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 2
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 2
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 2
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 2
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- 206010046980 Varicella Diseases 0.000 description 2
- 108010027570 Xanthine phosphoribosyltransferase Proteins 0.000 description 2
- LPQOADBMXVRBNX-UHFFFAOYSA-N ac1ldcw0 Chemical compound Cl.C1CN(C)CCN1C1=C(F)C=C2C(=O)C(C(O)=O)=CN3CCSC1=C32 LPQOADBMXVRBNX-UHFFFAOYSA-N 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- WQZGKKKJIJFFOK-PQMKYFCFSA-N alpha-D-mannose Chemical compound OC[C@H]1O[C@H](O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-PQMKYFCFSA-N 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940121363 anti-inflammatory agent Drugs 0.000 description 2
- 239000002260 anti-inflammatory agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 239000003963 antioxidant agent Substances 0.000 description 2
- 230000003078 antioxidant effect Effects 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- CNBGNNVCVSKAQZ-UHFFFAOYSA-N benzydamine Chemical compound C12=CC=CC=C2C(OCCCN(C)C)=NN1CC1=CC=CC=C1 CNBGNNVCVSKAQZ-UHFFFAOYSA-N 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 235000014633 carbohydrates Nutrition 0.000 description 2
- 239000001913 cellulose Substances 0.000 description 2
- 229920002678 cellulose Polymers 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 2
- 208000029742 colonic neoplasm Diseases 0.000 description 2
- 239000000470 constituent Substances 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 125000004122 cyclic group Chemical group 0.000 description 2
- 239000007933 dermal patch Substances 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 238000007877 drug screening Methods 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 102000013165 exonuclease Human genes 0.000 description 2
- 210000004700 fetal blood Anatomy 0.000 description 2
- 230000001605 fetal effect Effects 0.000 description 2
- 239000008273 gelatin Substances 0.000 description 2
- 229920000159 gelatin Polymers 0.000 description 2
- 235000019322 gelatine Nutrition 0.000 description 2
- 235000011852 gelatine desserts Nutrition 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 101150106093 gpt gene Proteins 0.000 description 2
- FUZZWVXGSFPDMH-UHFFFAOYSA-N hexanoic acid Chemical compound CCCCCC(O)=O FUZZWVXGSFPDMH-UHFFFAOYSA-N 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 102000057593 human F8 Human genes 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000002458 infectious effect Effects 0.000 description 2
- 230000001524 infective effect Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000007918 intramuscular administration Methods 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 108010043612 kentsin Proteins 0.000 description 2
- 210000003292 kidney cell Anatomy 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 208000032839 leukemia Diseases 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010072591 lysyl-leucyl-alanyl-arginine Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 229960000485 methotrexate Drugs 0.000 description 2
- 238000000520 microinjection Methods 0.000 description 2
- 238000001393 microlithography Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 210000001616 monocyte Anatomy 0.000 description 2
- 238000004264 monolayer culture Methods 0.000 description 2
- 210000003928 nasal cavity Anatomy 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 239000002674 ointment Substances 0.000 description 2
- 238000002515 oligonucleotide synthesis Methods 0.000 description 2
- 210000001672 ovary Anatomy 0.000 description 2
- 229940049547 paraxin Drugs 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 229920003023 plastic Polymers 0.000 description 2
- 229920002223 polystyrene Polymers 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 2
- 230000005855 radiation Effects 0.000 description 2
- 229940047431 recombinate Drugs 0.000 description 2
- 210000000664 rectum Anatomy 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 239000003419 rna directed dna polymerase inhibitor Substances 0.000 description 2
- 239000004065 semiconductor Substances 0.000 description 2
- 108010015840 seryl-prolyl-lysyl-lysine Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 239000000377 silicon dioxide Substances 0.000 description 2
- 235000012239 silicon dioxide Nutrition 0.000 description 2
- 229960001866 silicon dioxide Drugs 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 210000001562 sternum Anatomy 0.000 description 2
- 238000010254 subcutaneous injection Methods 0.000 description 2
- 239000007929 subcutaneous injection Substances 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 210000003462 vein Anatomy 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- PCDUALPXEOKZPE-DXCABUDRSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O PCDUALPXEOKZPE-DXCABUDRSA-N 0.000 description 1
- DWNBOPVKNPVNQG-LURJTMIESA-N (2s)-4-hydroxy-2-(propylamino)butanoic acid Chemical compound CCCN[C@H](C(O)=O)CCO DWNBOPVKNPVNQG-LURJTMIESA-N 0.000 description 1
- BAPRUDZDYCKSOQ-RITPCOANSA-N (2s,4r)-1-acetyl-4-hydroxypyrrolidine-2-carboxylic acid Chemical compound CC(=O)N1C[C@H](O)C[C@H]1C(O)=O BAPRUDZDYCKSOQ-RITPCOANSA-N 0.000 description 1
- OJHZNMVJJKMFGX-RNWHKREASA-N (4r,4ar,7ar,12bs)-9-methoxy-3-methyl-1,2,4,4a,5,6,7a,13-octahydro-4,12-methanobenzofuro[3,2-e]isoquinoline-7-one;2,3-dihydroxybutanedioic acid Chemical compound OC(=O)C(O)C(O)C(O)=O.O=C([C@@H]1O2)CC[C@H]3[C@]4([H])N(C)CC[C@]13C1=C2C(OC)=CC=C1C4 OJHZNMVJJKMFGX-RNWHKREASA-N 0.000 description 1
- FPVKHBSQESCIEP-UHFFFAOYSA-N (8S)-3-(2-deoxy-beta-D-erythro-pentofuranosyl)-3,6,7,8-tetrahydroimidazo[4,5-d][1,3]diazepin-8-ol Natural products C1C(O)C(CO)OC1N1C(NC=NCC2O)=C2N=C1 FPVKHBSQESCIEP-UHFFFAOYSA-N 0.000 description 1
- ZGNLFUXWZJGETL-YUSKDDKASA-N (Z)-[(2S)-2-amino-2-carboxyethyl]-hydroxyimino-oxidoazanium Chemical compound N[C@@H](C\[N+]([O-])=N\O)C(O)=O ZGNLFUXWZJGETL-YUSKDDKASA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- XUDSQIDNHJMBBW-FOWTUZBSSA-N 2-[4-[(e)-n-hydroxy-c-methylcarbonimidoyl]phenoxy]-1-piperidin-1-ylethanone Chemical compound C1=CC(C(=N/O)/C)=CC=C1OCC(=O)N1CCCCC1 XUDSQIDNHJMBBW-FOWTUZBSSA-N 0.000 description 1
- ZBYVTTSIVDYQSO-UHFFFAOYSA-N 2-azaniumyl-4-(hydroxyamino)-4-oxobutanoate Chemical compound OC(=O)C(N)CC(=O)NO ZBYVTTSIVDYQSO-UHFFFAOYSA-N 0.000 description 1
- WTLKTXIHIHFSGU-UHFFFAOYSA-N 2-nitrosoguanidine Chemical compound NC(N)=NN=O WTLKTXIHIHFSGU-UHFFFAOYSA-N 0.000 description 1
- WOVTUUKKGNHVFZ-UHFFFAOYSA-N 4-(fluoren-9-ylidenemethyl)benzenecarboximidamide Chemical compound C1=CC(C(=N)N)=CC=C1C=C1C2=CC=CC=C2C2=CC=CC=C21 WOVTUUKKGNHVFZ-UHFFFAOYSA-N 0.000 description 1
- DVEQCIBLXRSYPH-UHFFFAOYSA-N 5-butyl-1-cyclohexylbarbituric acid Chemical compound O=C1C(CCCC)C(=O)NC(=O)N1C1CCCCC1 DVEQCIBLXRSYPH-UHFFFAOYSA-N 0.000 description 1
- FRXSZNDVFUDTIR-UHFFFAOYSA-N 6-methoxy-1,2,3,4-tetrahydroquinoline Chemical compound N1CCCC2=CC(OC)=CC=C21 FRXSZNDVFUDTIR-UHFFFAOYSA-N 0.000 description 1
- 101710115267 ATP synthase protein MI25 Proteins 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 1
- 101100230376 Acetivibrio thermocellus (strain ATCC 27405 / DSM 1237 / JCM 9322 / NBRC 103400 / NCIMB 10682 / NRRL B-4536 / VPI 7372) celI gene Proteins 0.000 description 1
- 101000621943 Acholeplasma phage L2 Probable integrase/recombinase Proteins 0.000 description 1
- 101000748061 Acholeplasma phage L2 Uncharacterized 16.1 kDa protein Proteins 0.000 description 1
- 101000818089 Acholeplasma phage L2 Uncharacterized 25.6 kDa protein Proteins 0.000 description 1
- 101000827329 Acholeplasma phage L2 Uncharacterized 26.1 kDa protein Proteins 0.000 description 1
- 101000768957 Acholeplasma phage L2 Uncharacterized 37.2 kDa protein Proteins 0.000 description 1
- 101000818108 Acholeplasma phage L2 Uncharacterized 81.3 kDa protein Proteins 0.000 description 1
- 101000823746 Acidianus ambivalens Uncharacterized 17.7 kDa protein in bps2 3'region Proteins 0.000 description 1
- 101000916369 Acidianus ambivalens Uncharacterized protein in sor 5'region Proteins 0.000 description 1
- 101000769342 Acinetobacter guillouiae Uncharacterized protein in rpoN-murA intergenic region Proteins 0.000 description 1
- 229920000178 Acrylic resin Polymers 0.000 description 1
- 239000004925 Acrylic resin Substances 0.000 description 1
- 101000823696 Actinobacillus pleuropneumoniae Uncharacterized glycosyltransferase in aroQ 3'region Proteins 0.000 description 1
- 101000786513 Agrobacterium tumefaciens (strain 15955) Uncharacterized protein outside the virF region Proteins 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- WYPUMLRSQMKIJU-BPNCWPANSA-N Ala-Arg-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WYPUMLRSQMKIJU-BPNCWPANSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- NJIFPLAJSVUQOZ-JBDRJPRFSA-N Ala-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C)N NJIFPLAJSVUQOZ-JBDRJPRFSA-N 0.000 description 1
- OILNWMNBLIHXQK-ZLUOBGJFSA-N Ala-Cys-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O OILNWMNBLIHXQK-ZLUOBGJFSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 1
- 108010076441 Ala-His-His Proteins 0.000 description 1
- SHKGHIFSEAGTNL-DLOVCJGASA-N Ala-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 SHKGHIFSEAGTNL-DLOVCJGASA-N 0.000 description 1
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 1
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- CWRBRVZBMVJENN-UVBJJODRSA-N Ala-Trp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N CWRBRVZBMVJENN-UVBJJODRSA-N 0.000 description 1
- SFPRJVVDZNLUTG-OWLDWWDNSA-N Ala-Trp-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFPRJVVDZNLUTG-OWLDWWDNSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- 101000618005 Alkalihalobacillus pseudofirmus (strain ATCC BAA-2126 / JCM 17055 / OF4) Uncharacterized protein BpOF4_00885 Proteins 0.000 description 1
- 101000618348 Allochromatium vinosum (strain ATCC 17899 / DSM 180 / NBRC 103801 / NCIMB 10441 / D) Uncharacterized protein Alvin_0065 Proteins 0.000 description 1
- 241000269328 Amphibia Species 0.000 description 1
- 102100020724 Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Human genes 0.000 description 1
- 101000748781 Anthoceros angustus Uncharacterized 3.0 kDa protein in psbT-psbN intergenic region Proteins 0.000 description 1
- 101000812031 Anthoceros angustus Uncharacterized 5.9 kDa protein in rps16-psbA intergenic region Proteins 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 1
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- WOPFJPHVBWKZJH-SRVKXCTJSA-N Arg-Arg-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O WOPFJPHVBWKZJH-SRVKXCTJSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- DPNHSNLIULPOBH-GUBZILKMSA-N Arg-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DPNHSNLIULPOBH-GUBZILKMSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- RRGPUNYIPJXJBU-GUBZILKMSA-N Arg-Asp-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O RRGPUNYIPJXJBU-GUBZILKMSA-N 0.000 description 1
- YUGFLWBWAJFGKY-BQBZGAKWSA-N Arg-Cys-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O YUGFLWBWAJFGKY-BQBZGAKWSA-N 0.000 description 1
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 1
- SVHRPCMZTWZROG-DCAQKATOSA-N Arg-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N SVHRPCMZTWZROG-DCAQKATOSA-N 0.000 description 1
- JVMKBJNSRZWDBO-FXQIFTODSA-N Arg-Cys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O JVMKBJNSRZWDBO-FXQIFTODSA-N 0.000 description 1
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- MTANSHNQTWPZKP-KKUMJFAQSA-N Arg-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O MTANSHNQTWPZKP-KKUMJFAQSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- NYZGVTGOMPHSJW-CIUDSAMLSA-N Arg-Glu-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N NYZGVTGOMPHSJW-CIUDSAMLSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- XUUXCWCKKCZEAW-YFKPBYRVSA-N Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N XUUXCWCKKCZEAW-YFKPBYRVSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 1
- IRRMIGDCPOPZJW-ULQDDVLXSA-N Arg-His-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IRRMIGDCPOPZJW-ULQDDVLXSA-N 0.000 description 1
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 1
- CVKOQHYVDVYJSI-QTKMDUPCSA-N Arg-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N)O CVKOQHYVDVYJSI-QTKMDUPCSA-N 0.000 description 1
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- JEXPNDORFYHJTM-IHRRRGAJSA-N Arg-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JEXPNDORFYHJTM-IHRRRGAJSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- JQFZHHSQMKZLRU-IUCAKERBSA-N Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N JQFZHHSQMKZLRU-IUCAKERBSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 1
- JBIRFLWXWDSDTR-CYDGBPFRSA-N Arg-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N JBIRFLWXWDSDTR-CYDGBPFRSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- KSUALAGYYLQSHJ-RCWTZXSCSA-N Arg-Met-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSUALAGYYLQSHJ-RCWTZXSCSA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 1
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 1
- KSHJMDSNSKDJPU-QTKMDUPCSA-N Arg-Thr-His Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KSHJMDSNSKDJPU-QTKMDUPCSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- XOZYYXMHMIEJET-XIRDDKMYSA-N Arg-Trp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XOZYYXMHMIEJET-XIRDDKMYSA-N 0.000 description 1
- UGJLILSJKSBVIR-ZFWWWQNUSA-N Arg-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)NCC(O)=O)=CNC2=C1 UGJLILSJKSBVIR-ZFWWWQNUSA-N 0.000 description 1
- YHZQOSXDTFRZKU-WDSOQIARSA-N Arg-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 YHZQOSXDTFRZKU-WDSOQIARSA-N 0.000 description 1
- PYDIIVKGTBRIEL-SZMVWBNQSA-N Arg-Trp-Pro Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(O)=O PYDIIVKGTBRIEL-SZMVWBNQSA-N 0.000 description 1
- AZHXYLJRGVMQKW-UMPQAUOISA-N Arg-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N)O AZHXYLJRGVMQKW-UMPQAUOISA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- CIBWFJFMOBIFTE-CIUDSAMLSA-N Asn-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N CIBWFJFMOBIFTE-CIUDSAMLSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 1
- HUAOKVVEVHACHR-CIUDSAMLSA-N Asn-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N HUAOKVVEVHACHR-CIUDSAMLSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 1
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 1
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 1
- FVKHEKVYFTZWDX-GHCJXIJMSA-N Asn-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FVKHEKVYFTZWDX-GHCJXIJMSA-N 0.000 description 1
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- NNDSLVWAQAUPPP-GUBZILKMSA-N Asn-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N NNDSLVWAQAUPPP-GUBZILKMSA-N 0.000 description 1
- WCRQQIPFSXFIRN-LPEHRKFASA-N Asn-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N WCRQQIPFSXFIRN-LPEHRKFASA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- GFGUPLIETCNQGF-DCAQKATOSA-N Asn-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O GFGUPLIETCNQGF-DCAQKATOSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- AECPDLSSUMDUAA-ZKWXMUAHSA-N Asn-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N AECPDLSSUMDUAA-ZKWXMUAHSA-N 0.000 description 1
- XZFONYMRYTVLPL-NHCYSSNCSA-N Asn-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N XZFONYMRYTVLPL-NHCYSSNCSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- ZVTDYGWRRPMFCL-WFBYXXMGSA-N Asp-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N ZVTDYGWRRPMFCL-WFBYXXMGSA-N 0.000 description 1
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 1
- SNAWMGHSCHKSDK-GUBZILKMSA-N Asp-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SNAWMGHSCHKSDK-GUBZILKMSA-N 0.000 description 1
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- ODNWIBOCFGMRTP-SRVKXCTJSA-N Asp-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CN=CN1 ODNWIBOCFGMRTP-SRVKXCTJSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 1
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- SAKCBXNPWDRWPE-BQBZGAKWSA-N Asp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N SAKCBXNPWDRWPE-BQBZGAKWSA-N 0.000 description 1
- XFQOQUWGVCVYON-DCAQKATOSA-N Asp-Met-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 XFQOQUWGVCVYON-DCAQKATOSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 1
- KOWYNSKRPUWSFG-IHPCNDPISA-N Asp-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC(=O)O)N KOWYNSKRPUWSFG-IHPCNDPISA-N 0.000 description 1
- NONWUQAWAANERO-BZSNNMDCSA-N Asp-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 NONWUQAWAANERO-BZSNNMDCSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- RMFITHMDQGFSDC-UBHSHLNASA-N Asp-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RMFITHMDQGFSDC-UBHSHLNASA-N 0.000 description 1
- MRYDJCIIVRXVGG-QEJZJMRPSA-N Asp-Trp-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O MRYDJCIIVRXVGG-QEJZJMRPSA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 101100214868 Autographa californica nuclear polyhedrosis virus AC54 gene Proteins 0.000 description 1
- 101100074342 Autographa californica nuclear polyhedrosis virus LEF-11 gene Proteins 0.000 description 1
- 101100351191 Autographa californica nuclear polyhedrosis virus PCNA gene Proteins 0.000 description 1
- 101000781117 Autographa californica nuclear polyhedrosis virus Uncharacterized 12.4 kDa protein in CTL-LEF2 intergenic region Proteins 0.000 description 1
- 101000770875 Autographa californica nuclear polyhedrosis virus Uncharacterized 14.2 kDa protein in PK1-LEF1 intergenic region Proteins 0.000 description 1
- 101000781183 Autographa californica nuclear polyhedrosis virus Uncharacterized 20.4 kDa protein in IAP1-SOD intergenic region Proteins 0.000 description 1
- 101000847476 Autographa californica nuclear polyhedrosis virus Uncharacterized 54.7 kDa protein in IAP1-SOD intergenic region Proteins 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 101000967489 Azorhizobium caulinodans (strain ATCC 43989 / DSM 5975 / JCM 20966 / LMG 6465 / NBRC 14845 / NCIMB 13405 / ORS 571) Uncharacterized protein AZC_3924 Proteins 0.000 description 1
- 101000708323 Azospirillum brasilense Uncharacterized 28.8 kDa protein in nifR3-like 5'region Proteins 0.000 description 1
- 101000770311 Azotobacter chroococcum mcd 1 Uncharacterized 19.8 kDa protein in nifW 5'region Proteins 0.000 description 1
- 101000823761 Bacillus licheniformis Uncharacterized 9.4 kDa protein in flaL 3'region Proteins 0.000 description 1
- 101000819719 Bacillus methanolicus Uncharacterized N-acetyltransferase in lysA 3'region Proteins 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 101000789586 Bacillus subtilis (strain 168) UPF0702 transmembrane protein YkjA Proteins 0.000 description 1
- 101000748761 Bacillus subtilis (strain 168) Uncharacterized MFS-type transporter YcxA Proteins 0.000 description 1
- 101000792624 Bacillus subtilis (strain 168) Uncharacterized protein YbxH Proteins 0.000 description 1
- 101000736075 Bacillus subtilis (strain 168) Uncharacterized protein YcbP Proteins 0.000 description 1
- 101000736076 Bacillus subtilis (strain 168) Uncharacterized protein YcbR Proteins 0.000 description 1
- 101000790792 Bacillus subtilis (strain 168) Uncharacterized protein YckC Proteins 0.000 description 1
- 101000765620 Bacillus subtilis (strain 168) Uncharacterized protein YlxP Proteins 0.000 description 1
- 101000819705 Bacillus subtilis (strain 168) Uncharacterized protein YlxR Proteins 0.000 description 1
- 101000786247 Bacillus subtilis (strain 168) Uncharacterized protein YqaT Proteins 0.000 description 1
- 101000916134 Bacillus subtilis (strain 168) Uncharacterized protein YqxJ Proteins 0.000 description 1
- 101000948218 Bacillus subtilis (strain 168) Uncharacterized protein YtxJ Proteins 0.000 description 1
- 101000718627 Bacillus thuringiensis subsp. kurstaki Putative RNA polymerase sigma-G factor Proteins 0.000 description 1
- 101000641200 Bombyx mori densovirus Putative non-structural protein Proteins 0.000 description 1
- 101000754349 Bordetella pertussis (strain Tohama I / ATCC BAA-589 / NCTC 13251) UPF0065 protein BP0148 Proteins 0.000 description 1
- 101000774107 Borrelia burgdorferi (strain ATCC 35210 / B31 / CIP 102532 / DSM 4680) Uncharacterized protein BB_0266 Proteins 0.000 description 1
- 101000631235 Borrelia burgdorferi (strain ATCC 35210 / B31 / CIP 102532 / DSM 4680) Uncharacterized protein BB_0268 Proteins 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 101000827633 Caldicellulosiruptor sp. (strain Rt8B.4) Uncharacterized 23.9 kDa protein in xynA 3'region Proteins 0.000 description 1
- 101000736909 Campylobacter jejuni Probable nucleotidyltransferase Proteins 0.000 description 1
- 229920001661 Chitosan Polymers 0.000 description 1
- 101000748745 Chlamydomonas reinhardtii Uncharacterized 6.2 kDa protein in psaC-petL intergenic region Proteins 0.000 description 1
- 101000626907 Chlamydomonas reinhardtii Uncharacterized 7.3 kDa protein in petA 5'region Proteins 0.000 description 1
- KRKNYBCHXYNGOX-UHFFFAOYSA-K Citrate Chemical compound [O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O KRKNYBCHXYNGOX-UHFFFAOYSA-K 0.000 description 1
- 101000947628 Claviceps purpurea Uncharacterized 11.8 kDa protein Proteins 0.000 description 1
- 101000947633 Claviceps purpurea Uncharacterized 13.8 kDa protein Proteins 0.000 description 1
- 101000686796 Clostridium perfringens Replication protein Proteins 0.000 description 1
- 101000947615 Clostridium perfringens Uncharacterized 38.4 kDa protein Proteins 0.000 description 1
- 101100007328 Cocos nucifera COS-1 gene Proteins 0.000 description 1
- YOOVTUPUBVHMPG-UHFFFAOYSA-N Coformycin Natural products OC1C(O)C(CO)OC1N1C(NC=NCC2O)=C2N=C1 YOOVTUPUBVHMPG-UHFFFAOYSA-N 0.000 description 1
- 241000148131 Colibacter Species 0.000 description 1
- 206010010099 Combined immunodeficiency Diseases 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 102100039200 Constitutive coactivator of PPAR-gamma-like protein 2 Human genes 0.000 description 1
- 102100031725 Cortactin-binding protein 2 Human genes 0.000 description 1
- 241001517047 Corynebacterium acetoacidophilum Species 0.000 description 1
- 241000807905 Corynebacterium glutamicum ATCC 14067 Species 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- 241000938605 Crocodylia Species 0.000 description 1
- 101000651958 Crotalus durissus terrificus Snaclec crotocetin-1 Proteins 0.000 description 1
- 101000792449 Cyanophora paradoxa Uncharacterized 3.4 kDa protein in atpE-petA intergenic region Proteins 0.000 description 1
- UKVGHFORADMBEN-GUBZILKMSA-N Cys-Arg-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UKVGHFORADMBEN-GUBZILKMSA-N 0.000 description 1
- CEZSLNCYQUFOSL-BQBZGAKWSA-N Cys-Arg-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O CEZSLNCYQUFOSL-BQBZGAKWSA-N 0.000 description 1
- JTNKVWLMDHIUOG-IHRRRGAJSA-N Cys-Arg-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JTNKVWLMDHIUOG-IHRRRGAJSA-N 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- KLLFLHBKSJAUMZ-ACZMJKKPSA-N Cys-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N KLLFLHBKSJAUMZ-ACZMJKKPSA-N 0.000 description 1
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 1
- HHABWQIFXZPZCK-ACZMJKKPSA-N Cys-Gln-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HHABWQIFXZPZCK-ACZMJKKPSA-N 0.000 description 1
- FIADUEYFRSCCIK-CIUDSAMLSA-N Cys-Glu-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIADUEYFRSCCIK-CIUDSAMLSA-N 0.000 description 1
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 1
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 1
- ODDOYXKAHLKKQY-MMWGEVLESA-N Cys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N ODDOYXKAHLKKQY-MMWGEVLESA-N 0.000 description 1
- DYBIDOHFRRUMLW-CIUDSAMLSA-N Cys-Leu-Cys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O DYBIDOHFRRUMLW-CIUDSAMLSA-N 0.000 description 1
- HKALUUKHYNEDRS-GUBZILKMSA-N Cys-Leu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HKALUUKHYNEDRS-GUBZILKMSA-N 0.000 description 1
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 1
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 1
- VXLXATVURDNDCG-CIUDSAMLSA-N Cys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N VXLXATVURDNDCG-CIUDSAMLSA-N 0.000 description 1
- IDFVDSBJNMPBSX-SRVKXCTJSA-N Cys-Lys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O IDFVDSBJNMPBSX-SRVKXCTJSA-N 0.000 description 1
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 1
- PEZINYWZBQNTIX-NAKRPEOUSA-N Cys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N PEZINYWZBQNTIX-NAKRPEOUSA-N 0.000 description 1
- HJGUQJJJXQGXGJ-FXQIFTODSA-N Cys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HJGUQJJJXQGXGJ-FXQIFTODSA-N 0.000 description 1
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- YWEHYKGJWHPGPY-XGEHTFHBSA-N Cys-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N)O YWEHYKGJWHPGPY-XGEHTFHBSA-N 0.000 description 1
- ZLFRUAFDAIFNHN-LKXGYXEUSA-N Cys-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)O ZLFRUAFDAIFNHN-LKXGYXEUSA-N 0.000 description 1
- MSWBLPLBSLQVME-XIRDDKMYSA-N Cys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CS)=CNC2=C1 MSWBLPLBSLQVME-XIRDDKMYSA-N 0.000 description 1
- LLUXQOVDMQZMPJ-KKUMJFAQSA-N Cys-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CS)CC1=CC=C(O)C=C1 LLUXQOVDMQZMPJ-KKUMJFAQSA-N 0.000 description 1
- MQQLYEHXSBJTRK-FXQIFTODSA-N Cys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N MQQLYEHXSBJTRK-FXQIFTODSA-N 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 1
- 101150026402 DBP gene Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 239000004375 Dextrin Substances 0.000 description 1
- 229920001353 Dextrin Polymers 0.000 description 1
- 102100024746 Dihydrofolate reductase Human genes 0.000 description 1
- 102100024101 DnaJ homolog subfamily C member 28 Human genes 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- QZKRHPLGUJDVAR-UHFFFAOYSA-K EDTA trisodium salt Chemical compound [Na+].[Na+].[Na+].OC(=O)CN(CC([O-])=O)CCN(CC([O-])=O)CC([O-])=O QZKRHPLGUJDVAR-UHFFFAOYSA-K 0.000 description 1
- 239000004278 EU approved seasoning Substances 0.000 description 1
- LVGKNOAMLMIIKO-UHFFFAOYSA-N Elaidinsaeure-aethylester Natural products CCCCCCCCC=CCCCCCCCC(=O)OCC LVGKNOAMLMIIKO-UHFFFAOYSA-N 0.000 description 1
- 101000948901 Enterobacteria phage T4 Uncharacterized 16.0 kDa protein in segB-ipI intergenic region Proteins 0.000 description 1
- 101000964391 Enterococcus faecalis UPF0145 protein Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 239000004593 Epoxy Substances 0.000 description 1
- 101000805958 Equine herpesvirus 4 (strain 1942) Virion protein US10 homolog Proteins 0.000 description 1
- 101100173936 Escherichia coli (strain K12) flmA gene Proteins 0.000 description 1
- 101000790442 Escherichia coli Insertion element IS2 uncharacterized 11.1 kDa protein Proteins 0.000 description 1
- 101000788129 Escherichia coli Uncharacterized protein in sul1 3'region Proteins 0.000 description 1
- 101000788370 Escherichia phage P2 Uncharacterized 12.9 kDa protein in GpA 3'region Proteins 0.000 description 1
- 101000788354 Escherichia phage P2 Uncharacterized 8.2 kDa protein in gpA 5'region Proteins 0.000 description 1
- 101100226347 Escherichia phage lambda exo gene Proteins 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- VGGSQFUCUMXWEO-UHFFFAOYSA-N Ethene Chemical compound C=C VGGSQFUCUMXWEO-UHFFFAOYSA-N 0.000 description 1
- 101000748786 Euglena longa Uncharacterized 7.2 kDa protein in rps2-rps9 intergenic region Proteins 0.000 description 1
- 101000792446 Euglena longa Uncharacterized 8.7 kDa protein in rpl22-rpl23 intergenic region Proteins 0.000 description 1
- YCKRFDGAMUMZLT-UHFFFAOYSA-N Fluorine atom Chemical compound [F] YCKRFDGAMUMZLT-UHFFFAOYSA-N 0.000 description 1
- 101000770304 Frankia alni UPF0460 protein in nifX-nifW intergenic region Proteins 0.000 description 1
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 description 1
- 101150066002 GFP gene Proteins 0.000 description 1
- 241001200922 Gagata Species 0.000 description 1
- 101000797344 Geobacillus stearothermophilus Putative tRNA (cytidine(34)-2'-O)-methyltransferase Proteins 0.000 description 1
- 101000748410 Geobacillus stearothermophilus Uncharacterized protein in fumA 3'region Proteins 0.000 description 1
- 101000787096 Geobacillus stearothermophilus Uncharacterized protein in gldA 3'region Proteins 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- MQANCSUBSBJNLU-KKUMJFAQSA-N Gln-Arg-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQANCSUBSBJNLU-KKUMJFAQSA-N 0.000 description 1
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 1
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 1
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- SOIAHPSKKUYREP-CIUDSAMLSA-N Gln-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SOIAHPSKKUYREP-CIUDSAMLSA-N 0.000 description 1
- OIIIRRTWYLCQNW-ACZMJKKPSA-N Gln-Cys-Asn Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O OIIIRRTWYLCQNW-ACZMJKKPSA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 1
- GFLNKSQHOBOMNM-AVGNSLFASA-N Gln-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GFLNKSQHOBOMNM-AVGNSLFASA-N 0.000 description 1
- SBHVGKBYOQKAEA-SDDRHHMPSA-N Gln-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SBHVGKBYOQKAEA-SDDRHHMPSA-N 0.000 description 1
- FYAULIGIFPPOAA-ZPFDUUQYSA-N Gln-Ile-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O FYAULIGIFPPOAA-ZPFDUUQYSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 1
- ZVQZXPADLZIQFF-FHWLQOOXSA-N Gln-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 ZVQZXPADLZIQFF-FHWLQOOXSA-N 0.000 description 1
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- YLABFXCRQQMMHS-AVGNSLFASA-N Gln-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YLABFXCRQQMMHS-AVGNSLFASA-N 0.000 description 1
- SJMJMEWQMBJYPR-DZKIICNBSA-N Gln-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N SJMJMEWQMBJYPR-DZKIICNBSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 1
- RTOOAKXIJADOLL-GUBZILKMSA-N Glu-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N RTOOAKXIJADOLL-GUBZILKMSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 1
- UENPHLAAKDPZQY-XKBZYTNZSA-N Glu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O UENPHLAAKDPZQY-XKBZYTNZSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- QLPYYTDOUQNJGQ-AVGNSLFASA-N Glu-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N QLPYYTDOUQNJGQ-AVGNSLFASA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- CHDWDBPJOZVZSE-KKUMJFAQSA-N Glu-Phe-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CHDWDBPJOZVZSE-KKUMJFAQSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- UQULNJAARAXSPO-ZCWPNWOLSA-N Glu-Thr-Thr-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UQULNJAARAXSPO-ZCWPNWOLSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- NTHIHAUEXVTXQG-KKUMJFAQSA-N Glu-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O NTHIHAUEXVTXQG-KKUMJFAQSA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 1
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- XZRZILPOZBVTDB-GJZGRUSLSA-N Gly-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)C(O)=O)=CNC2=C1 XZRZILPOZBVTDB-GJZGRUSLSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- XXGQRGQPGFYECI-WDSKDSINSA-N Gly-Cys-Glu Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(O)=O XXGQRGQPGFYECI-WDSKDSINSA-N 0.000 description 1
- LGQZOQRDEUIZJY-YUMQZZPRSA-N Gly-Cys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CS)NC(=O)CN)C(O)=O LGQZOQRDEUIZJY-YUMQZZPRSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 1
- YNIMVVJTPWCUJH-KBPBESRZSA-N Gly-His-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YNIMVVJTPWCUJH-KBPBESRZSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- UYPPAMNTTMJHJW-KCTSRDHCSA-N Gly-Ile-Trp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UYPPAMNTTMJHJW-KCTSRDHCSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- UVTSZKIATYSKIR-RYUDHWBXSA-N Gly-Tyr-Glu Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O UVTSZKIATYSKIR-RYUDHWBXSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- 229930186217 Glycolipid Natural products 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 229920002683 Glycosaminoglycan Polymers 0.000 description 1
- 101000626903 Guillardia theta Uncharacterized 6.1 kDa protein Proteins 0.000 description 1
- 101000792437 Guillardia theta Uncharacterized 7.8 kDa protein Proteins 0.000 description 1
- 101000626971 Guillardia theta Uncharacterized 8.1 kDa protein Proteins 0.000 description 1
- 101000772675 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) UPF0438 protein HI_0847 Proteins 0.000 description 1
- 101000631019 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) Uncharacterized protein HI_0350 Proteins 0.000 description 1
- 101000912350 Haemophilus phage HP1 (strain HP1c1) DNA N-6-adenine-methyltransferase Proteins 0.000 description 1
- 101000933958 Haemophilus phage HP1 (strain HP1c1) Major capsid protein Proteins 0.000 description 1
- 101000743338 Haemophilus phage HP1 (strain HP1c1) Probable head completion/stabilization protein Proteins 0.000 description 1
- 101001066788 Haemophilus phage HP1 (strain HP1c1) Probable portal protein Proteins 0.000 description 1
- 101001052021 Haemophilus phage HP1 (strain HP1c1) Probable tail fiber protein Proteins 0.000 description 1
- 101000854890 Haemophilus phage HP1 (strain HP1c1) Probable terminase, ATPase subunit Proteins 0.000 description 1
- 101000743335 Haemophilus phage HP1 (strain HP1c1) Probable terminase, endonuclease subunit Proteins 0.000 description 1
- 101000748063 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 11.1 kDa protein in rep-hol intergenic region Proteins 0.000 description 1
- 101000758973 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 11.3 kDa protein in lys 3'region Proteins 0.000 description 1
- 101000758963 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 12.7 kDa protein in lys 3'region Proteins 0.000 description 1
- 101000976893 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 14.1 kDa protein in cox-rep intergenic region Proteins 0.000 description 1
- 101000818057 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 14.9 kDa protein in rep-hol intergenic region Proteins 0.000 description 1
- 101000786896 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 19.2 kDa protein in rep-hol intergenic region Proteins 0.000 description 1
- 101000708358 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 23.3 kDa protein in lys 3'region Proteins 0.000 description 1
- 101000786921 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 26.0 kDa protein in rep-hol intergenic region Proteins 0.000 description 1
- 101000948764 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 58.7 kDa protein in lys 3'region Proteins 0.000 description 1
- 101000768945 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 7.9 kDa protein in int-C1 intergenic region Proteins 0.000 description 1
- 101000748060 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 8.3 kDa protein in rep-hol intergenic region Proteins 0.000 description 1
- 101000977016 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 8.9 kDa protein in int-C1 intergenic region Proteins 0.000 description 1
- 108010054147 Hemoglobins Proteins 0.000 description 1
- 241000700586 Herpesviridae Species 0.000 description 1
- 101000626607 Herpetosiphon aurantiacus Putative type II restriction enzyme HgiDII Proteins 0.000 description 1
- 101000623276 Herpetosiphon aurantiacus Uncharacterized 10.2 kDa protein in HgiBIM 5'region Proteins 0.000 description 1
- 101000623175 Herpetosiphon aurantiacus Uncharacterized 10.2 kDa protein in HgiCIIM 5'region Proteins 0.000 description 1
- 101000626850 Herpetosiphon aurantiacus Uncharacterized 10.2 kDa protein in HgiEIM 5'region Proteins 0.000 description 1
- 101000748192 Herpetosiphon aurantiacus Uncharacterized 15.4 kDa protein in HgiDIIM 5'region Proteins 0.000 description 1
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 1
- IPIVXQQRZXEUGW-UWJYBYFXSA-N His-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IPIVXQQRZXEUGW-UWJYBYFXSA-N 0.000 description 1
- GMIWMPUGTFQFHK-KCTSRDHCSA-N His-Ala-Trp Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O GMIWMPUGTFQFHK-KCTSRDHCSA-N 0.000 description 1
- PROLDOGUBQJNPG-RWMBFGLXSA-N His-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O PROLDOGUBQJNPG-RWMBFGLXSA-N 0.000 description 1
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 1
- WZOGEMJIZBNFBK-CIUDSAMLSA-N His-Asp-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WZOGEMJIZBNFBK-CIUDSAMLSA-N 0.000 description 1
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 1
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 1
- BQYZXYCEKYJKAM-VGDYDELISA-N His-Cys-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQYZXYCEKYJKAM-VGDYDELISA-N 0.000 description 1
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 1
- XMENRVZYPBKBIL-AVGNSLFASA-N His-Glu-His Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XMENRVZYPBKBIL-AVGNSLFASA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 1
- SYIPVNMWBZXKMU-HJPIBITLSA-N His-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N SYIPVNMWBZXKMU-HJPIBITLSA-N 0.000 description 1
- CNHSMSFYVARZLI-YJRXYDGGSA-N His-His-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CNHSMSFYVARZLI-YJRXYDGGSA-N 0.000 description 1
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- BCZFOHDMCDXPDA-BZSNNMDCSA-N His-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)O BCZFOHDMCDXPDA-BZSNNMDCSA-N 0.000 description 1
- SLFSYFJKSIVSON-SRVKXCTJSA-N His-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SLFSYFJKSIVSON-SRVKXCTJSA-N 0.000 description 1
- FBCURAVMSXNOLP-JYJNAYRXSA-N His-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBCURAVMSXNOLP-JYJNAYRXSA-N 0.000 description 1
- SGLXGEDPYJPGIQ-ACRUOGEOSA-N His-Phe-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N SGLXGEDPYJPGIQ-ACRUOGEOSA-N 0.000 description 1
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 1
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 1
- DGLAHESNTJWGDO-SRVKXCTJSA-N His-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DGLAHESNTJWGDO-SRVKXCTJSA-N 0.000 description 1
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- FCPSGEVYIVXPPO-QTKMDUPCSA-N His-Thr-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FCPSGEVYIVXPPO-QTKMDUPCSA-N 0.000 description 1
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 1
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- 101000785414 Homo sapiens Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Proteins 0.000 description 1
- 101001053999 Homo sapiens DnaJ homolog subfamily C member 28 Proteins 0.000 description 1
- 101000833492 Homo sapiens Jouberin Proteins 0.000 description 1
- 101001028836 Homo sapiens M-phase-specific PLK1-interacting protein Proteins 0.000 description 1
- 101000651236 Homo sapiens NCK-interacting protein with SH3 domain Proteins 0.000 description 1
- 101000652805 Homo sapiens Protein shisa-8 Proteins 0.000 description 1
- 101000820589 Homo sapiens Succinate-hydroxymethylglutarate CoA-transferase Proteins 0.000 description 1
- 101000667300 Homo sapiens WD repeat-containing protein 19 Proteins 0.000 description 1
- 101100064352 Human herpesvirus 8 type P (isolate GK18) DUT gene Proteins 0.000 description 1
- 101100100297 Human herpesvirus 8 type P (isolate GK18) TRM3 gene Proteins 0.000 description 1
- 101100283436 Human herpesvirus 8 type P (isolate GK18) gM gene Proteins 0.000 description 1
- 206010020649 Hyperkeratosis Diseases 0.000 description 1
- 108010091358 Hypoxanthine Phosphoribosyltransferase Proteins 0.000 description 1
- 102000018251 Hypoxanthine Phosphoribosyltransferase Human genes 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- CISBRYJZMFWOHJ-JBDRJPRFSA-N Ile-Ala-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N CISBRYJZMFWOHJ-JBDRJPRFSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 1
- VZIFYHYNQDIPLI-HJWJTTGWSA-N Ile-Arg-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N VZIFYHYNQDIPLI-HJWJTTGWSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- SJIGTGZVQGLMGG-NAKRPEOUSA-N Ile-Cys-Arg Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O SJIGTGZVQGLMGG-NAKRPEOUSA-N 0.000 description 1
- FHCNLXMTQJNJNH-KBIXCLLPSA-N Ile-Cys-Gln Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)O FHCNLXMTQJNJNH-KBIXCLLPSA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- JHCVYQKVKOLAIU-NAKRPEOUSA-N Ile-Cys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N JHCVYQKVKOLAIU-NAKRPEOUSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 1
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 1
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- JXMSHKFPDIUYGS-SIUGBPQLSA-N Ile-Glu-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N JXMSHKFPDIUYGS-SIUGBPQLSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 1
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 1
- JNDYZNJRRNFYIR-VGDYDELISA-N Ile-His-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N JNDYZNJRRNFYIR-VGDYDELISA-N 0.000 description 1
- KOPIAUWNLKKELG-SIGLWIIPSA-N Ile-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N KOPIAUWNLKKELG-SIGLWIIPSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- JJQQGCMKLOEGAV-OSUNSFLBSA-N Ile-Thr-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)O)N JJQQGCMKLOEGAV-OSUNSFLBSA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- HZVRQFKRALAMQS-SLBDDTMCSA-N Ile-Trp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZVRQFKRALAMQS-SLBDDTMCSA-N 0.000 description 1
- RWHRUZORDWZESH-ZQINRCPSSA-N Ile-Trp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RWHRUZORDWZESH-ZQINRCPSSA-N 0.000 description 1
- MGUTVMBNOMJLKC-VKOGCVSHSA-N Ile-Trp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C(C)C)C(=O)O)N MGUTVMBNOMJLKC-VKOGCVSHSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- REXAUQBGSGDEJY-IGISWZIWSA-N Ile-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N REXAUQBGSGDEJY-IGISWZIWSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical group O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 241000500891 Insecta Species 0.000 description 1
- 102100024407 Jouberin Human genes 0.000 description 1
- 101000782488 Junonia coenia densovirus (isolate pBRJ/1990) Putative non-structural protein NS2 Proteins 0.000 description 1
- 101000578717 Klebsiella pneumoniae Mannose-1-phosphate guanylyltransferase Proteins 0.000 description 1
- 101000957786 Klebsiella pneumoniae Phosphomannomutase Proteins 0.000 description 1
- 101000827627 Klebsiella pneumoniae Putative low molecular weight protein-tyrosine-phosphatase Proteins 0.000 description 1
- 101001015100 Klebsiella pneumoniae UDP-glucose:undecaprenyl-phosphate glucose-1-phosphate transferase Proteins 0.000 description 1
- 101000790838 Klebsiella pneumoniae UPF0053 protein in cps region Proteins 0.000 description 1
- 101000790837 Klebsiella pneumoniae Uncharacterized 18.9 kDa protein in cps region Proteins 0.000 description 1
- 101000790844 Klebsiella pneumoniae Uncharacterized 24.8 kDa protein in cps region Proteins 0.000 description 1
- 101000790840 Klebsiella pneumoniae Uncharacterized 49.5 kDa protein in cps region Proteins 0.000 description 1
- 101000811523 Klebsiella pneumoniae Uncharacterized 55.8 kDa protein in cps region Proteins 0.000 description 1
- 101000768313 Klebsiella pneumoniae Uncharacterized membrane protein in cps region Proteins 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- MLFKVJCWGUZWNV-UHFFFAOYSA-N L-alanosine Natural products OC(=O)C(N)CN(O)N=O MLFKVJCWGUZWNV-UHFFFAOYSA-N 0.000 description 1
- GZYFIMLSHBLMKF-UHFFFAOYSA-N L-albizziine Natural products OC(=O)C(N)CNC(N)=O GZYFIMLSHBLMKF-UHFFFAOYSA-N 0.000 description 1
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical compound C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- 101000818409 Lactococcus lactis subsp. lactis Uncharacterized HTH-type transcriptional regulator in lacX 3'region Proteins 0.000 description 1
- 101000904276 Lactococcus phage P008 Gene product 38 Proteins 0.000 description 1
- 101000878851 Leptolyngbya boryana Putative Fe(2+) transport protein A Proteins 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- GPXFZVUVPCFTMG-AVGNSLFASA-N Leu-Arg-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C GPXFZVUVPCFTMG-AVGNSLFASA-N 0.000 description 1
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- USTCFDAQCLDPBD-XIRDDKMYSA-N Leu-Asn-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N USTCFDAQCLDPBD-XIRDDKMYSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- MJTOYIHCKVQICL-ULQDDVLXSA-N Leu-Met-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MJTOYIHCKVQICL-ULQDDVLXSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- 101000756286 Lymantria dispar multicapsid nuclear polyhedrosis virus Uncharacterized 10.9 kDa protein in LEF8-FP intergenic region Proteins 0.000 description 1
- 101000759330 Lymantria dispar multicapsid nuclear polyhedrosis virus Uncharacterized protein in LEF8-FP intergenic region Proteins 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- ZTPWXNOOKAXPPE-DCAQKATOSA-N Lys-Arg-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N ZTPWXNOOKAXPPE-DCAQKATOSA-N 0.000 description 1
- BRSGXFITDXFMFF-IHRRRGAJSA-N Lys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N BRSGXFITDXFMFF-IHRRRGAJSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 1
- MKBIVWXCFINCLE-SRVKXCTJSA-N Lys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N MKBIVWXCFINCLE-SRVKXCTJSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- XFBBBRDEQIPGNR-KATARQTJSA-N Lys-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O XFBBBRDEQIPGNR-KATARQTJSA-N 0.000 description 1
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 1
- MTBBHUKKPWKXBT-ULQDDVLXSA-N Lys-Met-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MTBBHUKKPWKXBT-ULQDDVLXSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 1
- USPJSTBDIGJPFK-PMVMPFDFSA-N Lys-Tyr-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O USPJSTBDIGJPFK-PMVMPFDFSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- 102100037185 M-phase-specific PLK1-interacting protein Human genes 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 108010038016 Mannose-1-phosphate guanylyltransferase Proteins 0.000 description 1
- 101000626970 Marchantia polymorpha Uncharacterized 3.3 kDa protein in psbT-psbN intergenic region Proteins 0.000 description 1
- 101000626905 Marchantia polymorpha Uncharacterized 3.8 kDa protein in ycf12-psaM intergenic region Proteins 0.000 description 1
- 101000748779 Marchantia polymorpha Uncharacterized 6.4 kDa protein in atpA-psbA intergenic region Proteins 0.000 description 1
- 101000788487 Marchantia polymorpha Uncharacterized mitochondrial protein ymf25 Proteins 0.000 description 1
- 101000788489 Marchantia polymorpha Uncharacterized mitochondrial protein ymf26 Proteins 0.000 description 1
- 101000788491 Marchantia polymorpha Uncharacterized mitochondrial protein ymf27 Proteins 0.000 description 1
- 101000747938 Marchantia polymorpha Uncharacterized mitochondrial protein ymf31 Proteins 0.000 description 1
- 101000747949 Marchantia polymorpha Uncharacterized mitochondrial protein ymf32 Proteins 0.000 description 1
- 229920000877 Melamine resin Polymers 0.000 description 1
- 239000004640 Melamine resin Substances 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- ZAJNRWKGHWGPDQ-SDDRHHMPSA-N Met-Arg-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N ZAJNRWKGHWGPDQ-SDDRHHMPSA-N 0.000 description 1
- JQECLVNLAZGHRQ-CIUDSAMLSA-N Met-Asp-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O JQECLVNLAZGHRQ-CIUDSAMLSA-N 0.000 description 1
- RPEPZINUYHUBKG-FXQIFTODSA-N Met-Cys-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O RPEPZINUYHUBKG-FXQIFTODSA-N 0.000 description 1
- OXHSZBRPUGNMKW-DCAQKATOSA-N Met-Gln-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OXHSZBRPUGNMKW-DCAQKATOSA-N 0.000 description 1
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 1
- HHCOOFPGNXKFGR-HJGDQZAQSA-N Met-Gln-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HHCOOFPGNXKFGR-HJGDQZAQSA-N 0.000 description 1
- UKUMISIRZAVYOG-CIUDSAMLSA-N Met-Glu-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O UKUMISIRZAVYOG-CIUDSAMLSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 1
- STLBOMUOQNIALW-BQBZGAKWSA-N Met-Gly-Cys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O STLBOMUOQNIALW-BQBZGAKWSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- SCKPOOMCTFEVTN-QTKMDUPCSA-N Met-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCSC)N)O SCKPOOMCTFEVTN-QTKMDUPCSA-N 0.000 description 1
- RVYDCISQIGHAFC-ZPFDUUQYSA-N Met-Ile-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O RVYDCISQIGHAFC-ZPFDUUQYSA-N 0.000 description 1
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 1
- RBGLBUDVQVPTEG-DCAQKATOSA-N Met-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N RBGLBUDVQVPTEG-DCAQKATOSA-N 0.000 description 1
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 1
- OCRSGGIJBDUXHU-WDSOQIARSA-N Met-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OCRSGGIJBDUXHU-WDSOQIARSA-N 0.000 description 1
- XGIQKEAKUSPCBU-SRVKXCTJSA-N Met-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)N XGIQKEAKUSPCBU-SRVKXCTJSA-N 0.000 description 1
- CNAGWYQWQDMUGC-IHRRRGAJSA-N Met-Phe-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CNAGWYQWQDMUGC-IHRRRGAJSA-N 0.000 description 1
- IILAGWCGKJSBGB-IHRRRGAJSA-N Met-Phe-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IILAGWCGKJSBGB-IHRRRGAJSA-N 0.000 description 1
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 1
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 1
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 1
- HMEVNCOJHJTLNB-BVSLBCMMSA-N Met-Trp-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N HMEVNCOJHJTLNB-BVSLBCMMSA-N 0.000 description 1
- QZUCCDSNETVAIS-RYQLBKOJSA-N Met-Trp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N QZUCCDSNETVAIS-RYQLBKOJSA-N 0.000 description 1
- NBEFNGUZUOUGFG-KKUMJFAQSA-N Met-Tyr-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NBEFNGUZUOUGFG-KKUMJFAQSA-N 0.000 description 1
- FZDOBWIKRQORAC-ULQDDVLXSA-N Met-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N FZDOBWIKRQORAC-ULQDDVLXSA-N 0.000 description 1
- ATBJCCFCJXCNGZ-UFYCRDLUSA-N Met-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 ATBJCCFCJXCNGZ-UFYCRDLUSA-N 0.000 description 1
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 101000758828 Methanosarcina barkeri (strain Fusaro / DSM 804) Uncharacterized protein Mbar_A1602 Proteins 0.000 description 1
- 101000804418 Methanothermobacter thermautotrophicus (strain ATCC 29096 / DSM 1053 / JCM 10044 / NBRC 100330 / Delta H) Uncharacterized protein MTH_1463 Proteins 0.000 description 1
- 241000144155 Microbacterium ammoniaphilum Species 0.000 description 1
- 101001122401 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF3 Proteins 0.000 description 1
- 101001130841 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF5 Proteins 0.000 description 1
- 101001055788 Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) Pentapeptide repeat protein MfpA Proteins 0.000 description 1
- OVRNDRQMDRJTHS-CBQIKETKSA-N N-Acetyl-D-Galactosamine Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@H](O)[C@@H]1O OVRNDRQMDRJTHS-CBQIKETKSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acelyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 description 1
- OTCCIMWXFLJLIA-BYPYZUCNSA-N N-acetyl-L-aspartic acid Chemical compound CC(=O)N[C@H](C(O)=O)CC(O)=O OTCCIMWXFLJLIA-BYPYZUCNSA-N 0.000 description 1
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 description 1
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108091036732 NRON Proteins 0.000 description 1
- BLXXJMDCKKHMKV-UHFFFAOYSA-N Nabumetone Chemical compound C1=C(CCC(C)=O)C=CC2=CC(OC)=CC=C21 BLXXJMDCKKHMKV-UHFFFAOYSA-N 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 101100521838 Nicotiana tabacum pbf1 gene Proteins 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 101150101223 ORF29 gene Proteins 0.000 description 1
- 101150020791 ORF37 gene Proteins 0.000 description 1
- 101150016564 ORF39 gene Proteins 0.000 description 1
- 101150075249 ORF40 gene Proteins 0.000 description 1
- 101150006817 ORF43 gene Proteins 0.000 description 1
- 101150050790 ORF49 gene Proteins 0.000 description 1
- 101150104094 ORF52 gene Proteins 0.000 description 1
- 101150098690 ORF54 gene Proteins 0.000 description 1
- 101710087110 ORF6 protein Proteins 0.000 description 1
- 101150092861 ORF71 gene Proteins 0.000 description 1
- 241000702259 Orbivirus Species 0.000 description 1
- 101100389785 Orgyia pseudotsugata multicapsid polyhedrosis virus ETM gene Proteins 0.000 description 1
- 101100069690 Orgyia pseudotsugata multicapsid polyhedrosis virus GTA gene Proteins 0.000 description 1
- 101100306237 Orgyia pseudotsugata multicapsid polyhedrosis virus LEF-8 gene Proteins 0.000 description 1
- 101000740670 Orgyia pseudotsugata multicapsid polyhedrosis virus Protein C42 Proteins 0.000 description 1
- 101100096140 Orgyia pseudotsugata multicapsid polyhedrosis virus SOD gene Proteins 0.000 description 1
- 101000666843 Orgyia pseudotsugata multicapsid polyhedrosis virus Uncharacterized 24.0 kDa protein Proteins 0.000 description 1
- 101000770899 Orgyia pseudotsugata multicapsid polyhedrosis virus Uncharacterized 24.3 kDa protein Proteins 0.000 description 1
- 101000770870 Orgyia pseudotsugata multicapsid polyhedrosis virus Uncharacterized 37.2 kDa protein Proteins 0.000 description 1
- 101000805098 Orgyia pseudotsugata multicapsid polyhedrosis virus Uncharacterized 73.1 kDa protein Proteins 0.000 description 1
- 102100037214 Orotidine 5'-phosphate decarboxylase Human genes 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 239000002033 PVDF binder Substances 0.000 description 1
- 101100378791 Paenarthrobacter nicotinovorans aldh gene Proteins 0.000 description 1
- 101100156835 Paenarthrobacter nicotinovorans xdh gene Proteins 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- MQWISMJKHOUEMW-ULQDDVLXSA-N Phe-Arg-His Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 MQWISMJKHOUEMW-ULQDDVLXSA-N 0.000 description 1
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 1
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- VUYCNYVLKACHPA-KKUMJFAQSA-N Phe-Asp-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VUYCNYVLKACHPA-KKUMJFAQSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- PDUVELWDJZOUEI-IHRRRGAJSA-N Phe-Cys-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PDUVELWDJZOUEI-IHRRRGAJSA-N 0.000 description 1
- KKYHKZCMETTXEO-AVGNSLFASA-N Phe-Cys-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKYHKZCMETTXEO-AVGNSLFASA-N 0.000 description 1
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 1
- RLUMIJXNHJVUCO-JBACZVJFSA-N Phe-Gln-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 RLUMIJXNHJVUCO-JBACZVJFSA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- XEXSSIBQYNKFBX-KBPBESRZSA-N Phe-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 XEXSSIBQYNKFBX-KBPBESRZSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 1
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 1
- HTXVATDVCRFORF-MGHWNKPDSA-N Phe-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N HTXVATDVCRFORF-MGHWNKPDSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 1
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 1
- VHDNDCPMHQMXIR-IHRRRGAJSA-N Phe-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHDNDCPMHQMXIR-IHRRRGAJSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- ABEFOXGAIIJDCL-SFJXLCSZSA-N Phe-Thr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ABEFOXGAIIJDCL-SFJXLCSZSA-N 0.000 description 1
- KCIKTPHTEYBXMG-BVSLBCMMSA-N Phe-Trp-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCIKTPHTEYBXMG-BVSLBCMMSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- GNZCMRRSXOBHLC-JYJNAYRXSA-N Phe-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N GNZCMRRSXOBHLC-JYJNAYRXSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- 101000769182 Photorhabdus luminescens Uncharacterized protein in pnp 3'region Proteins 0.000 description 1
- 101100481711 Pneumococcus phage Dp-1 TMP gene Proteins 0.000 description 1
- RVGRUAULSDPKGF-UHFFFAOYSA-N Poloxamer Chemical compound C1CO1.CC1CO1 RVGRUAULSDPKGF-UHFFFAOYSA-N 0.000 description 1
- 101710159752 Poly(3-hydroxyalkanoate) polymerase subunit PhaE Proteins 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 229920002367 Polyisobutene Polymers 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 239000004372 Polyvinyl alcohol Substances 0.000 description 1
- 229920001328 Polyvinylidene chloride Polymers 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- OGRYXQOUFHAMPI-DCAQKATOSA-N Pro-Cys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O OGRYXQOUFHAMPI-DCAQKATOSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- DRIJZWBRGMJCDD-DCAQKATOSA-N Pro-Gln-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O DRIJZWBRGMJCDD-DCAQKATOSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 1
- TYMBHHITTMGGPI-NAKRPEOUSA-N Pro-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 TYMBHHITTMGGPI-NAKRPEOUSA-N 0.000 description 1
- FJLODLCIOJUDRG-PYJNHQTQSA-N Pro-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FJLODLCIOJUDRG-PYJNHQTQSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- ZTMLZUNPFDGPKY-VKOGCVSHSA-N Pro-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ZTMLZUNPFDGPKY-VKOGCVSHSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- BLJMJZOMZRCESA-GUBZILKMSA-N Pro-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BLJMJZOMZRCESA-GUBZILKMSA-N 0.000 description 1
- JFBJPBZSTMXGKL-JYJNAYRXSA-N Pro-Met-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JFBJPBZSTMXGKL-JYJNAYRXSA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- BXHRXLMCYSZSIY-STECZYCISA-N Pro-Tyr-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O BXHRXLMCYSZSIY-STECZYCISA-N 0.000 description 1
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 1
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 101710130262 Probable Vpr-like protein Proteins 0.000 description 1
- 101710197985 Probable protein Rev Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 102100040307 Protein FAM3B Human genes 0.000 description 1
- 101000961392 Pseudescherichia vulneris Uncharacterized 29.9 kDa protein in crtE 3'region Proteins 0.000 description 1
- 101000731030 Pseudomonas oleovorans Poly(3-hydroxyalkanoate) polymerase 2 Proteins 0.000 description 1
- 101001065485 Pseudomonas putida Probable fatty acid methyltransferase Proteins 0.000 description 1
- 241000589774 Pseudomonas sp. Species 0.000 description 1
- XESARGFCSKSFID-UHFFFAOYSA-N Pyrazofurin Natural products OC1=C(C(=O)N)NN=C1C1C(O)C(O)C(CO)O1 XESARGFCSKSFID-UHFFFAOYSA-N 0.000 description 1
- 108010091086 Recombinases Proteins 0.000 description 1
- 102000018120 Recombinases Human genes 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- 101000711023 Rhizobium leguminosarum bv. trifolii Uncharacterized protein in tfuA 3'region Proteins 0.000 description 1
- 101000974028 Rhizobium leguminosarum bv. viciae (strain 3841) Putative cystathionine beta-lyase Proteins 0.000 description 1
- 101000756519 Rhodobacter capsulatus (strain ATCC BAA-309 / NBRC 16581 / SB1003) Uncharacterized protein RCAP_rcc00048 Proteins 0.000 description 1
- 101000748499 Rhodobacter capsulatus Uncharacterized 104.1 kDa protein in hypE 3'region Proteins 0.000 description 1
- 101000748505 Rhodobacter capsulatus Uncharacterized 16.1 kDa protein in hypE 3'region Proteins 0.000 description 1
- 101000827754 Rhodobacter capsulatus Uncharacterized 5.8 kDa protein in puhA 5'region Proteins 0.000 description 1
- 101000948219 Rhodococcus erythropolis Uncharacterized 11.5 kDa protein in thcD 3'region Proteins 0.000 description 1
- 101000948156 Rhodococcus erythropolis Uncharacterized 47.3 kDa protein in thcA 5'region Proteins 0.000 description 1
- 101000917565 Rhodococcus fascians Uncharacterized 33.6 kDa protein in fasciation locus Proteins 0.000 description 1
- 241000190932 Rhodopseudomonas Species 0.000 description 1
- MEFKEPWMEQBLKI-AIRLBKTGSA-N S-adenosyl-L-methioninate Chemical compound O[C@@H]1[C@H](O)[C@@H](C[S+](CC[C@H](N)C([O-])=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 MEFKEPWMEQBLKI-AIRLBKTGSA-N 0.000 description 1
- 101000790284 Saimiriine herpesvirus 2 (strain 488) Uncharacterized 9.5 kDa protein in DHFR 3'region Proteins 0.000 description 1
- 101000814063 Salmonella phage P22 Uncharacterized 6.6 kDa protein in eae-abc2 intergenic region Proteins 0.000 description 1
- 101000953980 Salmonella phage P22 Uncharacterized 7.7 kDa protein in gp5-gp4 intergenic region Proteins 0.000 description 1
- 101000953981 Salmonella phage P22 Uncharacterized 7.8 kDa protein in ral-gp17 intergenic region Proteins 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- RZUOXAKGNHXZTB-GUBZILKMSA-N Ser-Arg-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RZUOXAKGNHXZTB-GUBZILKMSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- COAHUSQNSVFYBW-FXQIFTODSA-N Ser-Asn-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O COAHUSQNSVFYBW-FXQIFTODSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 1
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- HZNFKPJCGZXKIC-DCAQKATOSA-N Ser-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N HZNFKPJCGZXKIC-DCAQKATOSA-N 0.000 description 1
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- OJFFAQFRCVPHNN-JYBASQMISA-N Ser-Thr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OJFFAQFRCVPHNN-JYBASQMISA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 241000218654 Serratia fonticola Species 0.000 description 1
- 241000607717 Serratia liquefaciens Species 0.000 description 1
- 108010071390 Serum Albumin Proteins 0.000 description 1
- 102000007562 Serum Albumin Human genes 0.000 description 1
- 101000992423 Severe acute respiratory syndrome coronavirus 2 Putative ORF9c protein Proteins 0.000 description 1
- 229910052581 Si3N4 Inorganic materials 0.000 description 1
- 108010052160 Site-specific recombinase Proteins 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 229930182558 Sterol Natural products 0.000 description 1
- 101000936719 Streptococcus gordonii Accessory Sec system protein Asp3 Proteins 0.000 description 1
- 101000936711 Streptococcus gordonii Accessory secretory protein Asp4 Proteins 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 101000929863 Streptomyces cinnamonensis Monensin polyketide synthase putative ketoacyl reductase Proteins 0.000 description 1
- 101000788499 Streptomyces coelicolor Uncharacterized oxidoreductase in mprA 5'region Proteins 0.000 description 1
- 101000788468 Streptomyces coelicolor Uncharacterized protein in mprR 3'region Proteins 0.000 description 1
- 101001102841 Streptomyces griseus Purine nucleoside phosphorylase ORF3 Proteins 0.000 description 1
- 101000708364 Streptomyces griseus Uncharacterized 31.2 kDa protein in rplA-rplJ intergenic region Proteins 0.000 description 1
- 101000708557 Streptomyces lincolnensis Uncharacterized 17.2 kDa protein in melC2-rnhH intergenic region Proteins 0.000 description 1
- 101000953979 Streptomyces lividans Uncharacterized 6.6 kDa protein Proteins 0.000 description 1
- 101000845085 Streptomyces violaceoruber Granaticin polyketide synthase putative ketoacyl reductase 1 Proteins 0.000 description 1
- 102100021652 Succinate-hydroxymethylglutarate CoA-transferase Human genes 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- 101000649826 Thermotoga neapolitana Putative anti-sigma factor antagonist TM1081 homolog Proteins 0.000 description 1
- 101000711771 Thiocystis violacea Uncharacterized 76.5 kDa protein in phbC 3'region Proteins 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- VUKVQVNKIIZBPO-HOUAVDHOSA-N Thr-Asp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VUKVQVNKIIZBPO-HOUAVDHOSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- DGOJNGCGEYOBKN-BWBBJGPYSA-N Thr-Cys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O DGOJNGCGEYOBKN-BWBBJGPYSA-N 0.000 description 1
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 1
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- MQUZMZBFKCHVOB-HJGDQZAQSA-N Thr-Gln-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O MQUZMZBFKCHVOB-HJGDQZAQSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- VUSAEKOXGNEYNE-PBCZWWQYSA-N Thr-His-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VUSAEKOXGNEYNE-PBCZWWQYSA-N 0.000 description 1
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 1
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- FBQHKSPOIAFUEI-OWLDWWDNSA-N Thr-Trp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O FBQHKSPOIAFUEI-OWLDWWDNSA-N 0.000 description 1
- UMFLBPIPAJMNIM-LYARXQMPSA-N Thr-Trp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N)O UMFLBPIPAJMNIM-LYARXQMPSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 1
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 1
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 1
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 1
- SJPDTIQHLBQPFO-VLCNGCBASA-N Thr-Tyr-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SJPDTIQHLBQPFO-VLCNGCBASA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- 101000764204 Trieres chinensis Uncharacterized 3.3 kDa protein in rpl11-trnW intergenic region Proteins 0.000 description 1
- 101000792450 Trieres chinensis Uncharacterized 4.7 kDa protein in ycf33-trnY intergenic region Proteins 0.000 description 1
- 101000748762 Trieres chinensis Uncharacterized 5.4 kDa protein in trnK-psbC intergenic region Proteins 0.000 description 1
- 101000626900 Trieres chinensis Uncharacterized 5.5 kDa protein in ccsA-rps6 intergenic region Proteins 0.000 description 1
- 101000768114 Triticum aestivum Uncharacterized protein ycf70 Proteins 0.000 description 1
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 1
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- KZIQDVNORJKTMO-WDSOQIARSA-N Trp-Arg-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N KZIQDVNORJKTMO-WDSOQIARSA-N 0.000 description 1
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 1
- QNTBGBCOEYNAPV-CWRNSKLLSA-N Trp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O QNTBGBCOEYNAPV-CWRNSKLLSA-N 0.000 description 1
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 1
- VEYXZZGMIBKXCN-UBHSHLNASA-N Trp-Asp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VEYXZZGMIBKXCN-UBHSHLNASA-N 0.000 description 1
- LTLBNCDNXQCOLB-UBHSHLNASA-N Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 LTLBNCDNXQCOLB-UBHSHLNASA-N 0.000 description 1
- WQYPAGQDXAJNED-AAEUAGOBSA-N Trp-Cys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N WQYPAGQDXAJNED-AAEUAGOBSA-N 0.000 description 1
- KDWZQYUTMJSYRJ-BHYGNILZSA-N Trp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O KDWZQYUTMJSYRJ-BHYGNILZSA-N 0.000 description 1
- DVWAIHZOPSYMSJ-ZVZYQTTQSA-N Trp-Glu-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 DVWAIHZOPSYMSJ-ZVZYQTTQSA-N 0.000 description 1
- BEWOXKJJMBKRQL-AAEUAGOBSA-N Trp-Gly-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N BEWOXKJJMBKRQL-AAEUAGOBSA-N 0.000 description 1
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 1
- DNUJCLUFRGGSDJ-YLVFBTJISA-N Trp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DNUJCLUFRGGSDJ-YLVFBTJISA-N 0.000 description 1
- ORQGVWIUHICVKE-KCTSRDHCSA-N Trp-His-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O ORQGVWIUHICVKE-KCTSRDHCSA-N 0.000 description 1
- FHVCMIMUGUFIOJ-IHPCNDPISA-N Trp-His-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CC4=CN=CN4)C(=O)O)N FHVCMIMUGUFIOJ-IHPCNDPISA-N 0.000 description 1
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- KRCPXGSWDOGHAM-XIRDDKMYSA-N Trp-Lys-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O KRCPXGSWDOGHAM-XIRDDKMYSA-N 0.000 description 1
- RERRMBXDSFMBQE-ZFWWWQNUSA-N Trp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERRMBXDSFMBQE-ZFWWWQNUSA-N 0.000 description 1
- GQEXFCQNAJHJTI-IHPCNDPISA-N Trp-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GQEXFCQNAJHJTI-IHPCNDPISA-N 0.000 description 1
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 1
- DYIXEGROAOVQPK-VFAJRCTISA-N Trp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DYIXEGROAOVQPK-VFAJRCTISA-N 0.000 description 1
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 description 1
- RPTAWXPQXXCUGL-OYDLWJJNSA-N Trp-Trp-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O RPTAWXPQXXCUGL-OYDLWJJNSA-N 0.000 description 1
- YTHWAWACWGWBLE-MNSWYVGCSA-N Trp-Tyr-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 YTHWAWACWGWBLE-MNSWYVGCSA-N 0.000 description 1
- MXKUGFHWYYKVDV-SZMVWBNQSA-N Trp-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(C)C)C(O)=O MXKUGFHWYYKVDV-SZMVWBNQSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 239000006035 Tryptophane Substances 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 1
- SDNVRAKIJVKAGS-LKTVYLICSA-N Tyr-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N SDNVRAKIJVKAGS-LKTVYLICSA-N 0.000 description 1
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 1
- YRBHLWWGSSQICE-IHRRRGAJSA-N Tyr-Asp-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O YRBHLWWGSSQICE-IHRRRGAJSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 1
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 1
- ULHJJQYGMWONTD-HKUYNNGSSA-N Tyr-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ULHJJQYGMWONTD-HKUYNNGSSA-N 0.000 description 1
- JHORGUYURUBVOM-KKUMJFAQSA-N Tyr-His-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O JHORGUYURUBVOM-KKUMJFAQSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 1
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 1
- KHUVIWRRFMPVHD-JYJNAYRXSA-N Tyr-Met-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O KHUVIWRRFMPVHD-JYJNAYRXSA-N 0.000 description 1
- AUZADXNWQMBZOO-JYJNAYRXSA-N Tyr-Pro-Arg Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 AUZADXNWQMBZOO-JYJNAYRXSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 1
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- MDXLPNRXCFOBTL-BZSNNMDCSA-N Tyr-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MDXLPNRXCFOBTL-BZSNNMDCSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 1
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 1
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 1
- RMRFSFXLFWWAJZ-HJOGWXRNSA-N Tyr-Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 RMRFSFXLFWWAJZ-HJOGWXRNSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- 101710173665 Uncharacterized 5.8 kDa protein Proteins 0.000 description 1
- 101710095001 Uncharacterized protein in nifU 5'region Proteins 0.000 description 1
- 101710172361 Uncharacterized protein ycf17 Proteins 0.000 description 1
- 229920001807 Urea-formaldehyde Polymers 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- JFAWZADYPRMRCO-UBHSHLNASA-N Val-Ala-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JFAWZADYPRMRCO-UBHSHLNASA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- WKWJJQZZZBBWKV-JYJNAYRXSA-N Val-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WKWJJQZZZBBWKV-JYJNAYRXSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 1
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 1
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 1
- XJFXZQKJQGYFMM-GUBZILKMSA-N Val-Cys-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N XJFXZQKJQGYFMM-GUBZILKMSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- IWZYXFRGWKEKBJ-GVXVVHGQSA-N Val-Gln-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IWZYXFRGWKEKBJ-GVXVVHGQSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 1
- PWCJARIQERIIGF-BZSNNMDCSA-N Val-Met-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PWCJARIQERIIGF-BZSNNMDCSA-N 0.000 description 1
- ILMVQSHENUZYIZ-JYJNAYRXSA-N Val-Met-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N ILMVQSHENUZYIZ-JYJNAYRXSA-N 0.000 description 1
- YQMILNREHKTFBS-IHRRRGAJSA-N Val-Phe-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YQMILNREHKTFBS-IHRRRGAJSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- UEXPMFIAZZHEAD-HSHDSVGOSA-N Val-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N)O UEXPMFIAZZHEAD-HSHDSVGOSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 1
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 101000711318 Vibrio alginolyticus Uncharacterized 11.6 kDa protein in scrR 3'region Proteins 0.000 description 1
- 101000827562 Vibrio alginolyticus Uncharacterized protein in proC 3'region Proteins 0.000 description 1
- 101000778915 Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633) Uncharacterized membrane protein VP2115 Proteins 0.000 description 1
- 102100039744 WD repeat-containing protein 19 Human genes 0.000 description 1
- 101000736254 Zea mays Uncharacterized protein ycf70 Proteins 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- DHKHKXVYLBGOIT-UHFFFAOYSA-N acetaldehyde Diethyl Acetal Natural products CCOC(C)OCC DHKHKXVYLBGOIT-UHFFFAOYSA-N 0.000 description 1
- 150000001241 acetals Chemical class 0.000 description 1
- ODFJOVXVLFUVNQ-UHFFFAOYSA-N acetarsol Chemical compound CC(=O)NC1=CC([As](O)(O)=O)=CC=C1O ODFJOVXVLFUVNQ-UHFFFAOYSA-N 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- XECAHXYUAAWDEL-UHFFFAOYSA-N acrylonitrile butadiene styrene Chemical compound C=CC=C.C=CC#N.C=CC1=CC=CC=C1 XECAHXYUAAWDEL-UHFFFAOYSA-N 0.000 description 1
- 229920000122 acrylonitrile butadiene styrene Polymers 0.000 description 1
- 239000004676 acrylonitrile butadiene styrene Substances 0.000 description 1
- 230000010933 acylation Effects 0.000 description 1
- 238000005917 acylation reaction Methods 0.000 description 1
- 229960001570 ademetionine Drugs 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 239000003905 agrochemical Substances 0.000 description 1
- 229950005033 alanosine Drugs 0.000 description 1
- 108010039538 alanyl-glycyl-aspartyl-valine Proteins 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 230000029936 alkylation Effects 0.000 description 1
- 238000005804 alkylation reaction Methods 0.000 description 1
- 239000000956 alloy Substances 0.000 description 1
- 229910045601 alloy Inorganic materials 0.000 description 1
- SRBFZHDQGSBBOR-LECHCGJUSA-N alpha-D-xylose Chemical compound O[C@@H]1CO[C@H](O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-LECHCGJUSA-N 0.000 description 1
- PNEYBMLMFCGWSK-UHFFFAOYSA-N aluminium oxide Inorganic materials [O-2].[O-2].[O-2].[Al+3].[Al+3] PNEYBMLMFCGWSK-UHFFFAOYSA-N 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 210000002821 alveolar epithelial cell Anatomy 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 239000002647 aminoglycoside antibiotic agent Substances 0.000 description 1
- ISRODTBNJUAWEJ-UHFFFAOYSA-N amixetrine Chemical compound C=1C=CC=CC=1C(OCCC(C)C)CN1CCCC1 ISRODTBNJUAWEJ-UHFFFAOYSA-N 0.000 description 1
- 229950001993 amixetrine Drugs 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 239000003708 ampul Substances 0.000 description 1
- 239000012491 analyte Substances 0.000 description 1
- 229940111131 antiinflammatory and antirheumatic product propionic acid derivative Drugs 0.000 description 1
- 229940124522 antiretrovirals Drugs 0.000 description 1
- 239000003903 antiretrovirus agent Substances 0.000 description 1
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 1
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 1
- 210000001367 artery Anatomy 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- BYFMCKSPFYVMOU-UHFFFAOYSA-N bendazac Chemical compound C12=CC=CC=C2C(OCC(=O)O)=NN1CC1=CC=CC=C1 BYFMCKSPFYVMOU-UHFFFAOYSA-N 0.000 description 1
- 229960005149 bendazac Drugs 0.000 description 1
- 229960000333 benzydamine Drugs 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000029918 bioluminescence Effects 0.000 description 1
- 238000005415 bioluminescence Methods 0.000 description 1
- 229950003872 bucolome Drugs 0.000 description 1
- 229910052793 cadmium Inorganic materials 0.000 description 1
- BDOSMKKIYDKNTQ-UHFFFAOYSA-N cadmium atom Chemical compound [Cd] BDOSMKKIYDKNTQ-UHFFFAOYSA-N 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 150000001782 cephems Chemical class 0.000 description 1
- 230000000973 chemotherapeutic effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- YOOVTUPUBVHMPG-LODYRLCVSA-O coformycin(1+) Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C([NH+]=CNC[C@H]2O)=C2N=C1 YOOVTUPUBVHMPG-LODYRLCVSA-O 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 239000012297 crystallization seed Substances 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 238000013016 damping Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 235000019425 dextrin Nutrition 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- PWHROYKAGRUWDQ-UHFFFAOYSA-N difenpiramide Chemical compound C=1C=CC=NC=1NC(=O)CC(C=C1)=CC=C1C1=CC=CC=C1 PWHROYKAGRUWDQ-UHFFFAOYSA-N 0.000 description 1
- 210000001840 diploid cell Anatomy 0.000 description 1
- 150000002016 disaccharides Chemical class 0.000 description 1
- 239000012153 distilled water Substances 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 230000000857 drug effect Effects 0.000 description 1
- 238000004043 dyeing Methods 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- VLCYCQAOQCDTCN-UHFFFAOYSA-N eflornithine Chemical compound NCCCC(N)(C(F)F)C(O)=O VLCYCQAOQCDTCN-UHFFFAOYSA-N 0.000 description 1
- 210000001671 embryonic stem cell Anatomy 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 239000002532 enzyme inhibitor Substances 0.000 description 1
- LVGKNOAMLMIIKO-QXMHVHEDSA-N ethyl oleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC LVGKNOAMLMIIKO-QXMHVHEDSA-N 0.000 description 1
- 229940093471 ethyl oleate Drugs 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 210000001508 eye Anatomy 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 238000011049 filling Methods 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 239000011737 fluorine Substances 0.000 description 1
- 229910052731 fluorine Inorganic materials 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 235000011194 food seasoning agent Nutrition 0.000 description 1
- 229910052839 forsterite Inorganic materials 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 101150055782 gH gene Proteins 0.000 description 1
- 238000012252 genetic analysis Methods 0.000 description 1
- 235000003869 genetically modified organism Nutrition 0.000 description 1
- 239000003862 glucocorticoid Substances 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 229960002449 glycine Drugs 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010008671 glycyl-tryptophyl-methionine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 229960002350 guaiazulen Drugs 0.000 description 1
- 230000026030 halogenation Effects 0.000 description 1
- 238000005658 halogenation reaction Methods 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 210000002216 heart Anatomy 0.000 description 1
- SPSXSWRZQFPVTJ-ZQQKUFEYSA-N hepatitis b vaccine Chemical compound C([C@H](NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC1N=CN=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)OC(=O)CNC(=O)CNC(=O)[C@H](C)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@@H](N)CCCNC(N)=N)C1=CC=CC=C1 SPSXSWRZQFPVTJ-ZQQKUFEYSA-N 0.000 description 1
- 229940124736 hepatitis-B vaccine Drugs 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 229920001477 hydrophilic polymer Polymers 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 238000005805 hydroxylation reaction Methods 0.000 description 1
- 239000005457 ice water Substances 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 239000002955 immunomodulating agent Substances 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 229910010272 inorganic material Inorganic materials 0.000 description 1
- 239000011147 inorganic material Substances 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- XUWPJKDMEZSVTP-LTYMHZPRSA-N kalafungina Chemical compound O=C1C2=C(O)C=CC=C2C(=O)C2=C1[C@@H](C)O[C@H]1[C@@H]2OC(=O)C1 XUWPJKDMEZSVTP-LTYMHZPRSA-N 0.000 description 1
- 239000010410 layer Substances 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 239000003120 macrolide antibiotic agent Substances 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- HCWCAKKEBCNQJP-UHFFFAOYSA-N magnesium orthosilicate Chemical compound [Mg+2].[Mg+2].[O-][Si]([O-])([O-])[O-] HCWCAKKEBCNQJP-UHFFFAOYSA-N 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 230000002906 microbiologic effect Effects 0.000 description 1
- 238000004377 microelectronic Methods 0.000 description 1
- 238000005459 micromachining Methods 0.000 description 1
- 238000005497 microtitration Methods 0.000 description 1
- 229940029985 mineral supplement Drugs 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 101150026150 mt gene Proteins 0.000 description 1
- 231100000219 mutagenic Toxicity 0.000 description 1
- 230000003505 mutagenic effect Effects 0.000 description 1
- 229950006780 n-acetylglucosamine Drugs 0.000 description 1
- 229960004270 nabumetone Drugs 0.000 description 1
- 210000003360 nephrocyte Anatomy 0.000 description 1
- 210000005036 nerve Anatomy 0.000 description 1
- 230000007830 nerve conduction Effects 0.000 description 1
- 238000006386 neutralization reaction Methods 0.000 description 1
- 229960000965 nimesulide Drugs 0.000 description 1
- HYWYRSMBCFDLJT-UHFFFAOYSA-N nimesulide Chemical compound CS(=O)(=O)NC1=CC=C([N+]([O-])=O)C=C1OC1=CC=CC=C1 HYWYRSMBCFDLJT-UHFFFAOYSA-N 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 239000000041 non-steroidal anti-inflammatory agent Substances 0.000 description 1
- 231100000956 nontoxicity Toxicity 0.000 description 1
- 239000000346 nonvolatile oil Substances 0.000 description 1
- 108010058731 nopaline synthase Proteins 0.000 description 1
- 210000001331 nose Anatomy 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 101150055367 orf47 gene Proteins 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 239000011368 organic material Substances 0.000 description 1
- 229960004534 orgotein Drugs 0.000 description 1
- 108010070915 orgotein Proteins 0.000 description 1
- 229940037201 oris Drugs 0.000 description 1
- 229960005113 oxaceprol Drugs 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 238000007911 parenteral administration Methods 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- FPVKHBSQESCIEP-JQCXWYLXSA-N pentostatin Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(N=CNC[C@H]2O)=C2N=C1 FPVKHBSQESCIEP-JQCXWYLXSA-N 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- XKFIQZCHJUUSBA-UHFFFAOYSA-N perisoxal Chemical compound C1=C(C=2C=CC=CC=2)ON=C1C(O)CN1CCCCC1 XKFIQZCHJUUSBA-UHFFFAOYSA-N 0.000 description 1
- 229950005491 perisoxal Drugs 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-N phosphoric acid Substances OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 1
- 150000003016 phosphoric acids Chemical class 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 239000002504 physiological saline solution Substances 0.000 description 1
- 229950006452 pifoxime Drugs 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- XESARGFCSKSFID-FLLFQEBCSA-N pirazofurin Chemical compound OC1=C(C(=O)N)NN=C1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 XESARGFCSKSFID-FLLFQEBCSA-N 0.000 description 1
- 229920001983 poloxamer Polymers 0.000 description 1
- 229960000502 poloxamer Drugs 0.000 description 1
- 229920000724 poly(L-arginine) polymer Polymers 0.000 description 1
- 229920002285 poly(styrene-co-acrylonitrile) Polymers 0.000 description 1
- 229920002492 poly(sulfone) Polymers 0.000 description 1
- 229920002239 polyacrylonitrile Polymers 0.000 description 1
- 108010011110 polyarginine Proteins 0.000 description 1
- 239000004417 polycarbonate Substances 0.000 description 1
- 229920000515 polycarbonate Polymers 0.000 description 1
- 150000004291 polyenes Chemical class 0.000 description 1
- 229920000573 polyethylene Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920000139 polyethylene terephthalate Polymers 0.000 description 1
- 239000005020 polyethylene terephthalate Substances 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 239000003910 polypeptide antibiotic agent Substances 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 229920000136 polysorbate Polymers 0.000 description 1
- 229950008882 polysorbate Drugs 0.000 description 1
- 239000011118 polyvinyl acetate Substances 0.000 description 1
- 229920002451 polyvinyl alcohol Polymers 0.000 description 1
- 239000004800 polyvinyl chloride Substances 0.000 description 1
- 229920000915 polyvinyl chloride Polymers 0.000 description 1
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 230000001902 propagating effect Effects 0.000 description 1
- 229960001801 proxazole Drugs 0.000 description 1
- OLTAWOVKGWWERU-UHFFFAOYSA-N proxazole Chemical compound C=1C=CC=CC=1C(CC)C1=NOC(CCN(CC)CC)=N1 OLTAWOVKGWWERU-UHFFFAOYSA-N 0.000 description 1
- JEXVQSWXXUJEMA-UHFFFAOYSA-N pyrazol-3-one Chemical compound O=C1C=CN=N1 JEXVQSWXXUJEMA-UHFFFAOYSA-N 0.000 description 1
- 150000003217 pyrazoles Chemical class 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 101150079601 recA gene Proteins 0.000 description 1
- 229950008942 renytoline Drugs 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 229920003987 resole Polymers 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical class OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 229910052594 sapphire Inorganic materials 0.000 description 1
- 239000010980 sapphire Substances 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 238000012882 sequential analysis Methods 0.000 description 1
- 239000003352 sequestering agent Substances 0.000 description 1
- SQVRNKJHWKZAKO-OQPLDHBCSA-N sialic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)OC1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-OQPLDHBCSA-N 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- 239000010703 silicon Substances 0.000 description 1
- HBMJWWWQQXIZIP-UHFFFAOYSA-N silicon carbide Chemical compound [Si+]#[C-] HBMJWWWQQXIZIP-UHFFFAOYSA-N 0.000 description 1
- 229910010271 silicon carbide Inorganic materials 0.000 description 1
- HQVNEWCFYHHQES-UHFFFAOYSA-N silicon nitride Chemical compound N12[Si]34N5[Si]62N3[Si]51N64 HQVNEWCFYHHQES-UHFFFAOYSA-N 0.000 description 1
- 229910052814 silicon oxide Inorganic materials 0.000 description 1
- 229920002050 silicone resin Polymers 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000011343 solid material Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 238000005507 spraying Methods 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- 150000003432 sterols Chemical class 0.000 description 1
- 235000003702 sterols Nutrition 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000002910 structure generation Methods 0.000 description 1
- KDYFGRWQOYBRFD-UHFFFAOYSA-L succinate(2-) Chemical compound [O-]C(=O)CCC([O-])=O KDYFGRWQOYBRFD-UHFFFAOYSA-L 0.000 description 1
- 150000005846 sugar alcohols Chemical class 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 239000000375 suspending agent Substances 0.000 description 1
- 229920001059 synthetic polymer Polymers 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 229960003676 tenidap Drugs 0.000 description 1
- LXIKEPCNDFVJKC-QXMHVHEDSA-N tenidap Chemical compound C12=CC(Cl)=CC=C2N(C(=O)N)C(=O)\C1=C(/O)C1=CC=CS1 LXIKEPCNDFVJKC-QXMHVHEDSA-N 0.000 description 1
- 101150065190 term gene Proteins 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
- TUGDLVFMIQZYPA-UHFFFAOYSA-N tetracopper;tetrazinc Chemical compound [Cu+2].[Cu+2].[Cu+2].[Cu+2].[Zn+2].[Zn+2].[Zn+2].[Zn+2] TUGDLVFMIQZYPA-UHFFFAOYSA-N 0.000 description 1
- 229940072172 tetracycline antibiotic Drugs 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- 239000011573 trace mineral Substances 0.000 description 1
- 235000013619 trace mineral Nutrition 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- FYZXEMANQYHCFX-UHFFFAOYSA-K tripotassium;2-[2-[bis(carboxylatomethyl)amino]ethyl-(carboxymethyl)amino]acetate Chemical compound [K+].[K+].[K+].OC(=O)CN(CC([O-])=O)CCN(CC([O-])=O)CC([O-])=O FYZXEMANQYHCFX-UHFFFAOYSA-K 0.000 description 1
- 229960004799 tryptophan Drugs 0.000 description 1
- 108010014563 tryptophyl-cysteinyl-serine Proteins 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 238000002525 ultrasonication Methods 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 229920006305 unsaturated polyester Polymers 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 229940124742 varicella zoster vaccine Drugs 0.000 description 1
- 210000003501 vero cell Anatomy 0.000 description 1
- 229920002554 vinyl polymer Polymers 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
- 238000009736 wetting Methods 0.000 description 1
- 229940075420 xanthine Drugs 0.000 description 1
- 229960003487 xylose Drugs 0.000 description 1
- VLCYCQAOQCDTCN-ZCFIWIBFSA-N α-difluoromethylornithine Chemical compound NCCC[C@@](N)(C(F)F)C(O)=O VLCYCQAOQCDTCN-ZCFIWIBFSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
- A61K39/245—Herpetoviridae, e.g. herpes simplex virus
- A61K39/25—Varicella-zoster virus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/20—Antivirals for DNA viruses
- A61P31/22—Antivirals for DNA viruses for herpes viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
- A61P37/02—Immunomodulators
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
- A61K2039/5254—Virus avirulent or attenuated
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/16011—Herpesviridae
- C12N2710/16711—Varicellovirus, e.g. human herpesvirus 3, Varicella Zoster, pseudorabies
- C12N2710/16722—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/16011—Herpesviridae
- C12N2710/16711—Varicellovirus, e.g. human herpesvirus 3, Varicella Zoster, pseudorabies
- C12N2710/16734—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/16011—Herpesviridae
- C12N2710/16711—Varicellovirus, e.g. human herpesvirus 3, Varicella Zoster, pseudorabies
- C12N2710/16761—Methods of inactivation or attenuation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/50—Vectors for producing vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2820/00—Vectors comprising a special origin of replication system
- C12N2820/55—Vectors comprising a special origin of replication system from bacteria
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Virology (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Medicinal Chemistry (AREA)
- Genetics & Genomics (AREA)
- Immunology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Microbiology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Animal Behavior & Ethology (AREA)
- Pharmacology & Pharmacy (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Epidemiology (AREA)
- Mycology (AREA)
- Biophysics (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- Oncology (AREA)
- Communicable Diseases (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
本发明提供重组水痘-带状疱疹病毒及其制备方法,和包含重组水痘-带状疱疹病毒的药物组合物;还提供包含所述水痘-带状疱疹病毒基因组基因和BAC载体序列的载体、包含这种载体的细胞以及能与水痘-带状疱疹病毒进行同源重组的片段、包含所述的BAC载体序列的核酸盒。为此,本发明通过开发利用BAC载体序列制备重组水痘-带状疱疹病毒的方法,解决了上述课题。
Description
技术领域
本发明涉及重组水痘-带状疱疹病毒,特别是利用BAC(大肠杆菌人工染色体)制备的重组水痘-带状疱疹病毒,以及包含此种病毒的药物组合物。此外,本发明还涉及包含水痘-带状疱疹病毒基因组基因和BAC载体序列的载体,以及包含此种载体的细胞。本发明还涉及制备重组水痘-带状疱疹病毒的方法。再有,本发明涉及包含能与水痘-带状疱疹病毒基因组进行同源重组的片段以及BAC载体序列的核酸盒。
背景技术
水痘-带状疱疹病毒(varicella-zoster virus;VZV)是属于人疱疹病毒科(Herpesviridae)的病毒,是表现出两种不同临床表象疾病(水痘和带状疱疹)的病因。这种病毒的初期感染引起水痘(水疱疮)。其后,此种病毒潜伏感染神经节,经年累月后由于某种诱因重新活化引发带状疱疹(形成病毒颗粒,通过神经传导,达到表皮细胞,在神经分布的区域形成水痘症状)。
VZV基因组是双链DNA,包含约125000个碱基。全部的碱基序列已由Davison等人确定,已知该基因组中至少存在着72个基因。
VZV疫苗的研发困难重重。VZV疫苗Oka株是由高桥等人开发的(特公昭53-41202号),是世界上唯一针对水痘-带状疱疹病毒的疫苗。现有的减毒活水痘疫苗是利用以来源于减毒水痘病毒Oka株的病毒作为种子制备的,目前在世界各国广泛应用(Requirements forVaricella(Live)Adopted 1984;Revised 1993:WHO Technical ReportSeries,No.848,pp.22-38,1994)。这一Oka株是由如下方式获得的,即将从表现典型的水痘症状的患儿分离的病毒(Oka原株),利用人胎儿肺细胞在34℃下传代12代,再利用豚鼠胚胎细胞传代11代,然后利用人二倍体细胞传代数代来获得。所述的Oka原株虽然具有强病源性,但Oka疫苗株(Oka株)即使对正常的儿童接种也几乎没有发现副作用。因此,Oka株作为几乎没有病源性的疫苗株是有用的。
病毒疫苗在进行传代培养时有可能改变其基因型。另外,由在Oka株自身的制备过程中经过多次传代培养,Oka株自身也可能具备遗传多样性。实践中,为了确保疫苗的安全性和有效性,考虑到在制备疫苗过程中,由于传代培养所导致的疫苗遗传改变,制定了种子批次系统作为对认可进行生产的水痘种子病毒的传代数的限制,即以确认种子时的传代数为0代,并把从0代开始总传代数在10代以内的病毒用作疫苗。
另一方面,通过水痘疫苗的效果追踪,以及售后调查(PMS:post-marketing Surveillance),从流行病学的观点,有必要对从自然感染的水痘患者分离的水痘病毒的新鲜野生株与来自上述Oka株的疫苗株之间的病毒学差异进行分析,故已进行了应用免疫学和遗传工程学进行分析的多种尝试。例如,已有基于下述文献进行的试验报道,即水痘病毒株间基因结构、DNA碱基序列之间的差异(Journal of GeneralVirology,59,660-668,1986;同前67,1759-1816,1986),限制酶Pst I酶切位点的有无(Japanese Journal of Experimental Medicine,59,233-237,1989),根据应用PCR(聚合酶链式反应)进行的RFLP(限制性片段长度多态性)判定(Journal of Virology,66,1016-1020,1992),以及将上述Pst I酶切位点的有无与RFLP分析相结合(Journalof Clinical Microbiology,33,658-660,1995)。尽管通过这些试验提出了为鉴别新鲜的野生型菌株与来自Oka株的疫苗株的条件,但考虑到Oka株自身的遗传多样性的问题,仍缺乏可信性,且不具备确定性,故而仍然存在疫苗的品质管理问题。再有,已知有利用水痘病毒的基因14区域鉴定水痘病毒Oka株的方法(US Patent No.6,093,535),利用基因62区域鉴定减毒活水痘疫苗病毒株的方法(WO 00/50603),这些方法中的任意一种技术均可以用于鉴定水痘病毒Oka株(强毒亲株),由亲株衍生的疫苗株(减毒的Oka株),以及Oka株之外的水痘病毒株之间的差异。但是以用于减毒活水痘疫苗的品质管理和品质保证的制剂标准尚不完备。
目前,用于评定和确认疫苗品质的方法,还没有进行例如,通过对种子病毒和疫苗病毒的基因组DNA以直接的或定量的遗传分析进行品质管理,因此用作活疫苗的减毒株的品质管理和品质保证的准确度无法计算,故而含混不清。因此,提高品质管理和品质保证的准确度对于确保减毒活水痘疫苗的有效性。安全性·均质性至关重要。然而,如上所述,由于尚未确立其方法,使其成为亟待解决的课题。
此外,为了研制出一种优于Oka株的、经改变的水痘-带状疱疹病毒疫苗,也需要开发经诱变的重组水痘-带状疱疹病毒及其制备方法。
专利文献1:特公昭53-41202号
专利文献2:米国特许第6,093,535号
专利文献3:国际公开番号WO 00/50603
非专利文献1:Requirements for Varicella Vaccine(Live)Adopted 1984;Revised 1993:WHO Technical Report Series,No.848,pp.22-38,1994
非专利文献2:Journal of General Virology,59,660-668,1986
非专利文献3:Journal of General Virology,67,1759-1816,1986
非专利文献4:Japanese Journal of Experimental Medicine,59,233-237,1989
非专利文献5:Journal of Virology,66,1016-1020,1992
非专利文献6:Journal of Clinical Microbiology,33,658-660,1995
发明内容
本发明的一个目的在于提高水痘带状疱疹疫苗的品质控制和品质保证的精度,以确保、保证减毒活水痘疫苗的有效性,安全性和均质性。此外,本发明的课题还在于为了研制一种优于Oka株的,经改变的水痘-带状疱疹病毒疫苗,建立通过诱变生产重组水痘-带状疱疹病毒的方法,并提供此种病毒。
为实现上述目的,本发明提供了重组水痘-带状疱疹病毒,及其生产方法,例如利用BAC(大肠杆菌人工染色体)由单一病毒株生产重组水痘-带状疱疹病毒的方法。
发明概述
本发明人通过开发利用BAC载体序列制造重组水痘-带状疱疹病毒的方法,完成了本发明。本发明提供了下述内容。
1.重组水痘-带状疱疹病毒。
2.1项中所述的重组水痘-带状疱疹病毒,其中,包含BAC载体序列。
3.如2项中所述的重组水痘-带状疱疹病毒,其中,至少部分BAC载体序列插入到水痘-带状疱疹病毒基因组的非必需区域之中。
4.如3项中所述的重组水痘-带状疱疹病毒,其中,所述的非必需区域选自下述区域:基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
5.如4项中所述的重组水痘-带状疱疹病毒,其中,所述的非必需区域是基因11的ORF的侧翼区域或基因12的ORF的侧翼区域。
6.如2项所述的重组水痘-带状疱疹病毒,其中,至少部分BAC载体序列插入到水痘-带状疱疹病毒基因组的基因62的ORF区域。
7.如2项中的重组水痘-带状疱疹病毒,其中,所述的BAC载体序列包含重组蛋白依赖的重组序列。
8.如2项中所述的重组水痘-带状疱疹病毒,其中,所述的BAC载体序列包含选择标记。
9.如8项中所述的重组水痘-带状疱疹病毒,其中,所述的选择标记是药物选择标记。
10.如2项中的重组水痘-带状疱疹病毒,其中,所述的选择标记是编码绿荧光蛋白的基因。
11.如2项中的重组水痘-带状疱疹病毒,其中,所述的水痘-带状疱疹病毒基因组来自野生株。
12.如2项中的重组水痘-带状疱疹病毒,其中,所述的水痘-带状疱疹病毒基因组来自突变型病毒株。
13.如2项中的重组水痘-带状疱疹病毒,其中,所述的水痘-带状疱疹病毒基因组来自Oka疫苗株。
14.如2项中的重组水痘-带状疱疹病毒,其中,所述的水痘-带状疱疹病毒基因组在基因62和基因6中带有突变。
15.如14项中的重组水痘-带状疱疹病毒,其中,所述的基因62在SEQ ID NO.5中至少包含下述(a)-(d)中的碱基取代:
(a)第2110位的G取代;
(b)第3100位的G取代;
(c)第3818位的C取代;和
(d)第4006位的G取代,
以及前述基因6在SEQ ID NO.8的碱基序列中至少具有第5745位碱基是G的碱基取代。
16.如2项中的重组水痘-带状疱疹病毒,其中,所述的BAC载体序列是具有SEQ ID NO.:7中所示序列的载体。
17.包含1项中所述病毒的药物组合物。
18.如17项所述的药物组合物,其中,组合物是疫苗的形式。
19.载体,其包含除基因62之外的水痘-带状疱疹病毒基因组必需基因和BAC载体序列。
20.如19项所述的载体,其进一步包含基因62。
21.如19项所述的载体,其中,当所述载体插入哺乳动物细胞时,哺乳动物细胞产生水痘-带状疱疹病毒。
22.如19项所述的载体,其中,来自前述水痘-带状疱疹病毒基因组的序列与BAC载体序列相连的部分位于该水痘-带状疱疹病毒基因组非必需区域内。
23.如22项所述的载体,其中,所述的非必需区域选自下述区域:基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
24.如23项所述的载体,其中,所述的连接部分是基因11的ORF侧翼区域或基因12的ORF侧翼区域。
25.如19项所述的载体,其中,所述的来自水痘-带状疱疹病毒基因组的序列与前述BAC载体序列相连的部分位于水痘-带状疱疹病毒基因组中基因62的ORF中的区域内。
26.如19项所述的载体,其中,所述的BAC载体序列包含重组蛋白依赖的重组序列。
27.如19项所述的载体,其中,所述的BAC载体序列包含选择标记。
28.如27项所述的载体,其中,所述的选择标记是药物选择标记。
29.如27项所述的载体,其中,所述的选择标记是编码绿荧光蛋白的基因。
30.如19项所述的载体,其中,所述的水痘-带状疱疹病毒基因组来自野生株。
31.如19项所述的载体,其中,所述的水痘-带状疱疹病毒基因组来自突变型病毒株。
32.如19项所述的载体,其中,所述的水痘-带状疱疹病毒基因组来自Oka疫苗株。
33.如19项所述的载体,其中,所述的水痘-带状疱疹病毒基因组带有在基因62和基因6中的突变。
34.如33项中的载体,其中,所述的基因62在SEQ ID NO.5中至少包含下述(a)-(d)中的碱基取代:
(a)第2110位的G取代;
(b)第3100位的G取代;
(c)第3818位的C取代;和
(d)第4006位的G取代,
此外,基因6在SEQ ID NO.8的碱基序列中至少包含第5745位碱基是G的碱基取代。
35.如19项所述的载体,其中,是所述的BAC载体序列包含SEQID NO.:7所示的序列的载体。
36.包含19项所述载体的细胞。
37.如36项所述的细胞,其中,所述的细胞是细菌细胞。
38.如37项所述的细胞,其中,所述的细菌细胞是大肠杆菌细胞。
39.如36项所述的细胞,其中所述的细胞是哺乳动物细胞。
40.如39项所述的哺乳动物细胞,其中,所述的哺乳动物细胞是来自人的细胞。
41.由如39项所述的哺乳动物细胞产生的病毒。
42.包含如41项所述病毒的药物组合物。
43.如42项所述的药物组合物,其中,所述的组合物是疫苗形式。
44.生产重组水痘-带状疱疹病毒的方法,它是重组水痘-带状疱疹病毒的生产方法,该方法包含以下步骤:
将含有除基因62之外的水痘-带状疱疹病毒基因组必需基因与BAC载体序列的载体导入哺乳动物宿主细胞的步骤;和
培养该哺乳动物宿主细胞,产生重组水痘-带状疱疹病毒的步骤。
45.如44项所述的方法,其中,所述的载体还包含基因62。
46.如44项所述的方法,其中,所述的哺乳动物宿主细胞来源于人。
47.如44项所述的方法,其中,所述的BAC载体序列至少包含两种重组蛋白依赖的重组序列。
48.如47项所述的方法,其中,进一步包含引起上述两个重组蛋白依赖的重组序列之间重组的步骤。
49.如44项所述的方法,其中,来自前述水痘-带状疱疹病毒基因组的序列与BAC载体序列相连的部位位于该水痘-带状疱疹病毒基因组的非必需区域内。
50.如49项所述的方法,其中,所述的非必需区域选自下述区域:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
51.如50项所述的载体,其中,所述的非必需区域是基因11的ORF侧翼区域或基因12的ORF侧翼区域。
52.如44项所述的方法,其中,所述的来自水痘-带状疱疹病毒基因组的序列与BAC载体序列相连接的部分位于水痘-带状疱疹病毒基因组的基因62的ORF区域内。
53.如44项所述的方法,其中,所述的BAC载体序列包含重组蛋白依赖的重组序列。
54.如44项所述的方法,其中,所述的BAC载体序列包含选择标记。
55.如第54项所述的方法,其中,所述的选择标记是药物选择标记。
56.如第54项所述的方法,其中,所述的选择标记是编码绿荧光蛋白的基因。
57.如44项所述的方法,其中,所述的水痘-带状疱疹病毒基因组来自野生株。
58.如44项所述的方法,其中,所述的水痘-带状疱疹病毒基因组来自突变型病毒株。
59.如44项所述的方法,其中,所述的水痘-带状疱疹病毒基因组来自Oka疫苗株。
60.如44项所述的方法,其中,前述水痘-带状疱疹病毒基因组具有基因62和基因6中的突变。
61.如第60项所述的方法,其中,所述的基因62在SEQ ID NO.5中至少包含下述(a)-(d)中的碱基取代:
(a)第2110位的G取代;
(b)第3100位的G取代;
(c)第3818位的C取代;和
(d)第4006位的G取代,
此外,基因6包含在SEQ ID NO.8的碱基序列中至少第5745位碱基是G的碱基取代。
62.如44项所述的方法,其中,是所述的BAC载体序列包含SEQID NO.:7所示的序列的载体。
63.如第44项所述方法所生产的病毒。
64.药物组合物,其包含如第63项所述的病毒。
65.如第64项所述的药物组合物,其中,所述的组合物是疫苗的形式。
66.向如第19项所述的载体中导入突变的方法,包括以下步骤:
将所述载体导入细菌宿主细胞的步骤;
将包含由水痘-带状疱疹病毒基因组的一部分组成的片段的质粒载体导入该细菌宿主细胞的步骤,其中,所述片段至少具有一个突变;
培养所述的细菌宿主细胞的步骤;
由经培养的细菌宿主细胞分离具有BAC序列的载体的步骤。
67.向如第19项所述的载体中导入突变的方法,包括以下步骤:
将所述载体导入细菌宿主细胞的步骤;
将包含由水痘-带状疱疹病毒基因组的一部分组成的第一片段的第一质粒载体导入所述细菌宿主细胞的步骤,其中,所述的第一片段至少具有一个变异;
将包含由水痘-带状疱疹病毒基因组的一部分组成的第二片段的第二质粒载体导入所述细菌宿主细胞中的步骤,其中,所述的第二片段至少具有一个变异,所述的第二片段与第一片段不同;
培养所述细菌宿主细胞的步骤;
从所培养的细菌宿主细胞分离具有BAC载体序列的载体的步骤。
68.核酸盒,其包含在细菌细胞内可以与水痘-带状疱疹病毒基因组同源重组的第一片段,BAC载体序列,和在细菌细胞内可以与水痘-带状疱疹病毒基因组同源重组的第二片段的核酸,其中,所述的BAC序列两端分别与第一片段和第二片段相连。
69.如第68项所述的核酸盒,其中,所述的第一片段和第二片段至少为1kb。
70.如第68项所述的核酸盒,其中,所述的第一片段和第二片段至少为1.5kb。
71.如第68项所述的核酸盒,其中,所述的第一片段和第二片段至少为2kb。
72.如第68项所述的核酸盒,其中,所述的第一片段和第二片段与水痘-带状疱疹病毒基因组序列至少80%同一。
73.如第68项所述的核酸盒,其中,所述的第一片段和第二片段与水痘-带状疱疹病毒基因组序列至少85%同一。
74.如第68项所述的核酸盒,其中,所述的第一片段和第二片段与水痘-带状疱疹病毒基因组序列至少90%同一。
75.如第68项所述的核酸盒,其中,所述的第一片段和第二片段与水痘-带状疱疹病毒基因组序列至少95%同一。
76.如第68项所述的核酸盒,其中,所述的第一片段和第二片段分别独立地来自选自下述的水痘-带状疱疹病毒基因组的区域:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因62的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因62的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域,基因70的ORF的侧翼区域。
77.如第68项所述的核酸盒,其中,所述的第一片段和第二片段分别独立地与选自下述的水痘-带状疱疹病毒基因组的区域至少80%同一,所述的区域为:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因62的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因62的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
78.如第68项所述的核酸盒,其中,所述的第一片段和第二片段分别独立地与选自下述的水痘-带状疱疹病毒基因组的区域至少85%同一,所述的区域为:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因62的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因62的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
79.如第68项所述的核酸盒,其中,所述的第一片段和第二片段分别独立地与选自下述的水痘-带状疱疹病毒基因组的区域至少90%同一,所述的区域为:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因62的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因62的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
80.如第68项所述的核酸盒,其中,所述的第一片段和第二片段分别独立地与选自下述的水痘-带状疱疹病毒基因组的区域至少95%同一,所述的区域为:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因62的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因62的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
81.如第68项所述的核酸盒,其中,所述的第一片段和第二片段来自不同的区域。
82.如第72项所述的核酸盒,其中,所述的第一片段和第二片段分别独立地来自于基因11的ORF的侧翼序列和基因12的ORF的侧翼序列。
83.如第68项所述的核酸盒,其中,所述的BAC载体序列包含重组蛋白依赖的重组序列。
84.如第68项所述的核酸盒,其中,所述的BAC载体序列包含选择标记。
85.如第84项所述的核酸盒,其中,所述的选择标记是药物选择标记。
86.如第68项所述的核酸盒,其中,所述的选择标记是编码绿荧光蛋白的基因。
87.如第68项所述的核酸盒,其中,所述的水痘-带状疱疹病毒基因组来自野生株。
88.如第68项所述的核酸盒,其中,所述的水痘-带状疱疹病毒基因组来自突变型病毒株。
89.如第68项所述的核酸盒,其中,所述的水痘-带状疱疹病毒基因组来自Oka疫苗株。
90.如第68项所述的核酸盒,其中,所述的BAC载体序列包含SEQ ID NO.:7所示的序列。
91.如第68项所述的核酸盒,其具有SEQ ID NO.:2所示的核酸序列。
本发明提供重组水痘-带状疱疹病毒及其生产方法。例如,本发明提供利用BAC(大肠杆菌人工染色体)由单个的病毒株生产重组水痘-带状疱疹病毒的方法,以及由该方法生产的重组水痘-带状疱疹病毒。此外,本发明还提供包含所述重组水痘-带状疱疹病毒的药物组合物。
本发明还提供包含水痘-带状疱疹病毒基因组基因和BAC载体序列的载体,包含所述载体的细胞,以及包含能与水痘-带状疱疹病毒基因组同源重组的片段以及BAC载体序列的核酸盒。
附图说明
图1是显示水痘-带状疱疹病毒Oka株基因组和重组水痘-带状疱疹病毒的结构的模式图。
图2是水痘-带状疱疹病毒Oka株(亲株)基因组与重组水痘-带状疱疹病毒(rV02)的体外增殖比较图。
优选实施方案的描述
以下,对本发明进行说明。在整个说明书中,只要没有特别说明,单数表达形式应当理解为也包含其复数形式的概念范畴。只要没有特别说明,本说明书中所使用的术语应理解为本领域通常所指的含义。因而在没有其它定义的情况下,本说明书中所使用的全部专业术语和科技用语,应理解为本发明所属技术领域的普通技术人员通常所理解的含义。当发生矛盾时,首先依照本说明书(包括定义)。
以下对本发明进行说明。在本说明书中,只要没有特别说明,单数表达形式应当理解为包含其复数形式的概念范畴。因此,单数形式的冠词(例如英语中的“a”,“an”,“the”)在没有特别说明的情况下也包含其复数形式的概念范畴。在没有特别说明的情况下,本说明书中所使用的术语应理解为本领域通常所指的含义。因而在没有其它定义的情况下,本说明书中所使用的全部专业术语和科技用语,应理解为与本发明所属技术领域的普通技术人员通常所理解的相同含义。发生矛盾时,优先依照本说明书中(包括定义)。
术语的定义
以下列举的是本说明书中特别使用的术语的定义。
在本说明书中使用时,水痘-带状疱疹病毒的“必需基因”是指水痘-带状疱疹病毒增殖所必需的基因。水痘-带状疱疹病毒的“非必需基因”指对于非水痘-带状疱疹病毒增殖所必需的基因,即使缺失了这些“非必需基因”,水痘-带状疱疹病毒仍然可以增殖的基因。作为水痘-带状疱疹病毒的非必需基因,例如包括但不限于:基因7,基因8,基因9,基因10,基因11,基因12,基因13,基因14,基因15,基因17,基因18,基因19,基因38,基因39,基因46,基因47,基因48,基因49,基因50,基因56,基因57,基因58,基因59,基因61,基因63,基因64,基因65,基因66,基因67,基因68,基因69和基因70。
当病毒基因组中的基因为必需基因时,由于该基因被破坏,病毒将不能增殖。因此,当病毒基因组中的任意基因被破坏后,通过检测病毒的增殖情况即可判断被破坏的基因是必需基因还是非必需基因。
本说明书中所述的水痘-带状疱疹病毒的“野生株”指未经人工修饰,从自然界中分离的水痘-带状疱疹病毒株。作为野生株例如,包括但不限于Davison,A.J.and Scott,J.E.(J.Gen.Virol.67(Pt 9),1759-1816(1986)鉴定的Dumas株。Dumas株的核酸序列如SEQ IDNO.:5所示。该Dumas株的ORF的编号及其位置如下:
ORF编号 阅读框的方 基因组中的位置 氨基酸残基数
向
ORF1 3’→5’方向 589到915 氨基酸1-108
ORF2 5’→3’方向 1134到1850 氨基酸1-238
ORF3 3’→5’方向 1908到2447 氨基酸1-179
ORF4 3’→5’方向 2783到4141 氨基酸1-452
ORF5 3’→5’方向 4252到5274 氨基酸1-340
ORF6 3’→5’方向 5326到8577 氨基酸1-1083
ORF7 5’→3’方向 8607到9386 氨基酸1-259
ORF8 3’→5’方向 9477到10667 氨基酸1-396
ORF9 5’→3’方向 11009到11917 氨基酸1-302
ORF9A 5’→3’方向 10642到10902 氨基酸1-87
ORF10 5’→3’方向 12160到13392 氨基酸1-410
ORF11 5’→3’方向 13590到16049 氨基酸1-819
ORF12 5’→3’方向 16214到18199 氨基酸1-661
ORF13 5’→3’方向 18441到19346 氨基酸1-301
ORF14 3’→5’方向 19431到21113 氨基酸1-560
ORF15 3’→5’方向 21258到22478 氨基酸1-406
ORF16 3’→5’方向 22568到23794 氨基酸1-408
ORF17 5’→3’方向 24149到25516 氨基酸1-455
ORF18 3’→5’方向 25573到26493 氨基酸1-306
ORF19 3’→5’方向 26518到28845 氨基酸1-775
ORF20 3’→5’方向 29024到30475 氨基酸1-483
ORF21 5’→3’方向 30759到33875 氨基酸1-1038
ORF22 5’→3’方向 34083到42374 氨基酸1-2763
ORF23 3’→5’方向 42431到43138 氨基酸1-235
ORF24 3’→5’方向 43212到44021 氨基酸1-269
ORF25 3’→5’方向 44148到44618 氨基酸1-156
ORF26 5’→3’方向 44506到46263 氨基酸1-585
ORF27 5’→3’方向 46127到47128 氨基酸1-333
ORF28 3’→5’方向 47052到50636 氨基酸1-1194
ORF29 5’→3’方向 50857到54471 氨基酸1-1204
ORF30 5’→3’方向 54651到56963 氨基酸1-770
ORF31 5’→3’方向 57008到59614 氨基酸1-868
ORF32 5’→3’方向 59766到60197 氨基酸1-143
ORF33 3’→5’方向 60321到62138 氨基酸1-605
ORF33.5 3’→5’方向 60321到61229 氨基酸1-301
ORF34 3’→5’方向 62171到63910 氨基酸1-579
ORF35 3’→5’方向 63977到64753 氨基酸1-258
ORF36 5’→3’方向 64807到65832 氨基酸1-341
ORF37 5’→3’方向 66074到68599 氨基酸1-841
ORF38 3’→5’方向 68668到70293 氨基酸1-541
ORF39 5’→3’方向 70633到71355 氨基酸1-240
ORF40 5’→3’方向 71540到75730 氨基酸1-1396
ORF41 5’→3’方向 75847到76797 氨基酸1-316
ORF42+45 3’→5’方向 76851到78038以及氨基酸1-747
81538到82593
ORF43 5’→3’方向 78170到80200 氨基酸1-676
ORF44 5’→3’方向 80360到81451 氨基酸1-363
ORF46 5’→3’方向 82719到83318 氨基酸1-199
ORF47 5’→3’方向 83168到84700 氨基酸1-510
ORF48 5’→3’方向 84667到86322 氨基酸1-551
ORF49 5’→3’方向 86226到86471 氨基酸1-81
ORF50 3’→5’方向 86575到87882 氨基酸1-435
ORF51 5’→3’方向 87881到90388 氨基酸1-835
ORF52 5’→3’方向 90493到92808 氨基酸1-771
ORF53 3’→5’方向 92855到93850 氨基酸1-331
ORF54 3’→5’方向 93675到95984 氨基酸1-769
ORF55 5’→3’方向 95996到98641 氨基酸1-881
ORF56 5’→3’方向 98568到99302 氨基酸1-244
ORF57 3’→5’方向 99411到99626 氨基酸1-71
ORF58 3’→5’方向 99607到100272 氨基酸1-221
ORF59 3’→5’方向 100302到101219 氨基酸1-305
ORF60 3’→5’方向 101170到101649 氨基酸1-159
ORF61 3’→5’方向 103082到104485 氨基酸1-467
ORF62 3’→5’方向 105201到109133 氨基酸1-1310
ORF63 5’→3’方向 110581到111417 氨基酸1-278
ORF64 5’→3’方向 111565到112107 氨基酸1-180
ORF65 3’→5’方向 112332到112640 氨基酸1-102
ORF66 5’→3’方向 113037到114218 氨基酸1-393
ORF67 5’→3’方向 114496到115560 氨基酸1-354
ORF68 5’→3’方向 115808到117679 氨基酸1-623
ORF69 3’→5’方向 117790到118332 氨基酸1-180
ORF70 3’→5’方向 118480到119316 氨基酸1-278
ORF71 5→3’方向 120764到124696 氨基酸1-1310
上表中“5’→3’方向”指ORF具有与SEQ ID NO.:5所示核酸序列相同的方向。“3’→5’方向”指ORF具有与SEQ ID NO.:5所示核酸序列相反的方向。通过鉴定与上述ORF的核酸序列和/或氨基酸序列同源的序列,本领域的技术人员可以容易地鉴定出来自Dumas株之外的基因组中的ORF。
本说明书中的“突变株”是指对野生株的病毒株经过诱发突变,多次传代培养等诱发突变的水痘-带状疱疹病毒株。在对水痘-带状疱疹病毒株诱发突变时,所述的诱发突变既可以是随机导入突变,也可以是导入位点特异性突变。
在本说明书中使用时,“减毒病毒”是病毒突变株的一种,其毒性比野生株有所减弱。关于确定病毒突变株的毒性是否比野生株减弱的方法,即试验水痘-带状疱疹病毒病源性的方法,已确立了两种方法。
作为应用动物模型的方法,已知有制备移植了人皮肤的重度联合免疫缺损(SCID)小鼠,使其感染水痘-带状疱疹病毒来对病源性进行评价的方法(J.Viro 1.1998 Feb;72(2):965-74)。
与此相反,作为在试管中对病源性进行评价的方法,已知有分别在经孔径3μm的trans-well分隔的双层孔的下层接入单层培养的人黑素瘤细胞,上层接入经水痘-带状疱疹病毒感染的脐带血单核细胞(CBMC),培养7-8天后,观察黑素瘤细胞的CPE(细胞变性效果)程度的方法(J.Virol.2000 Feb;74(4):1864-70)。
虽然不是直接确定病源性的方法,从本发明者等人迄今为止的研究结果(J Virol.2002 Nov;76(22):11447-59)可以理解的是病毒的病源性与增殖性密切相关,所以通过感染中心分析(infectious centerassay)研究细胞-细胞(cell-to-cell)的增殖性也可以间接地评价病源性。
通过人工的方式对病毒进行减毒的方法是公知的。例如可以把具有SEQ ID NO.5所示的基因62中至少以下(a)~(d)的碱基取代以及在SEQ ID NO.8中所示的基因6中,至少5745位碱基为G的碱基取代的水痘-带状疱疹病毒作为减毒的病毒使用:
(a)第2110位的G取代;
(b)第3100位的G取代;
(c)第3818位的C取代;和
(d)第4006位的G取代。
作为应用上述水痘-带状疱疹病毒的替代,除了(a)~(d)的碱基取代之外,还可以使用具有以下(e)~(g)中至少一个或一个以上碱基取代的减毒水痘病毒株:
(e)第1251位的G取代;
(f)第2226位的G取代;和
(g)第3657位的G取代。
本发明中,作为应用上述的水痘-带状疱疹病毒的替代,除了(a)~(g)的至少一个或一个以上之外,还可以使用具有下述(h)~(o)的至少一个或一个以上的减毒水痘病毒株:
(h)第162位的C取代;
(i)第225位的C取代;
(j)第523位的C取代;
(k)第1565位的C取代;
(l)第1763位的C取代;
(m)第2652位的C取代;
(n)第4052位的C取代;和
(o)第4193位的C取代。
或作为“减毒病毒”,例如可以使用在基因62中具有选自下述的至少一种碱基取代的病毒:
(a)第2110位的G取代;
(b)第3100位的G取代;
(c)第3818位的C取代;
(d)第4006位的G取代,
(e)第1251位的G取代;
(f)第2226位的G取代;
(g)第3657位的G取代;
(h)第162位的C取代;
(i)第225位的C取代;
(j)第523位的C取代;
(k)第1565位的C取代;
(l)第1763位的C取代;
(m)第2652位的C取代;
(n)第4052位的C取代;
(o)第4193位的C取代。
本说明书中使用的术语“蛋白质”“多肽”“寡肽”和“肽”具有相同的含义,均指具有任意长度的氨基酸聚合物。
本发明书中所用的术语“多核苷酸”、“寡核苷酸”和“核酸”在本说明书中,表示相同含义,指任意长度的核苷酸。除非特别说明,特定的核酸序列与所示的序列一样包含其经保守修饰的突变体(例如简并密码子取代)及其互补序列。具体而言,简并密码子取代可以通过制备一个或多个(或全部)所选择密码子的第三位由混合碱基和/或脱氧次黄嘌呤核苷残基取代的序列而完成(Batzer等人,Nucleic AcidRes.19:5081(1991);Ohtsuka等人,J.Biol.Chem.260:2605-2608(1985);Rossolini等人,Mol.Cell.Probes 8:91-98(1994))。
本说明书中所述的术语“基因”是指确定遗传性状的因子。通常在染色体上以一定的顺序排列。把确定蛋白质一级结构的基因称为结构基因。调节结构基因表达的基因称为调节基因。本说明书中所用的“基因”指“多核苷酸”、“寡核苷酸”、“核酸”、和/或“蛋白质”“多肽”“寡肽”和“肽”。本说明书中基因的“开放阅读框”或“ORF”指将基因的碱基序列以每3个碱基为一组进行分割时所得的三种读框中的一个,其具有起始密码子,内部没有终止密码子且具有一定的长度,实际上有可能编码蛋白质的读框。水痘-带状疱疹病毒基因组,其全序列已经得到鉴定,其中至少鉴定了71个基因,已知各基因分别具有开放阅读框(ORF)。
本说明书中,水痘-带状疱疹病毒基因组中基因的“ORF中的区域”是指位于水痘-带状疱疹病毒基因组内的基因中形成ORF的碱基所在的区域。
本说明书中,水痘-带状疱疹病毒基因组中基因的“ORF侧翼区域”是指位于水痘-带状疱疹病毒基因组内基因中,位于ORF附近的碱基所在的区域,其不代表该基因或其它基因的ORF中的区域。
本说明书中所用的术语基因的“同源性”指两个或两个以上的基因序列之间的同一性程度。某因此,两序列的同源性越高,则其序列的同一性或相似性越高。两种基因是否具有同源性可以通过将两个序列进行直接比较来判断,当所述序列是核酸时也可以通过在严格条件下的杂交方法进行判定。将两个基因直接进行比较时,所述基因的DNA序列,典型地具有至少50%的同一性,优选具有至少70%的同一性,更优选具有至少80%,90%,95%,96%,97%,98%,或99%同一性时,认为它们的基因具有同源性。
本说明书中所述的碱基序列同一性的比较和同源性的计算可以利用作为序列分析用工具的BLAST以其缺省参数进行计算。
本说明书中基因、多核苷酸,多肽等的“表达”指其基因等在体内受到某种作用而形成另外的形态。优选指基因,多核苷酸等经转录和翻译成为多肽,也可以是将基因转录为mRNA并表达。更优选地,所述的多肽可以经翻译后加工。
本说明书中氨基酸可以通过一般已知的三字符形式表示,也可以IUPAC-IUB生物化学命名委员会所推荐的单字符形式表示。类似地,对于核苷酸以其为公众所接受的单字符编码形式表示。
本说明书中所述的“片段”是相对于全长的多核苷酸或多肽(其长度为n),具有1到n-1序列长度的多肽或多核苷酸。片段的长度,根据目的不同可以适当地变化,例如作为片段长度的下限,当用于多肽时,可以列举3,4,5,6,7,8,9,10,15,20,25,30,40,50或其以上个氨基酸。用这里没有列举的整数表示的长度(例如,11等)作为下限,也是适宜的。当指多核苷酸时,可以列举5,6,7,8,9,10,15,20,25,30,40,50,75,100,200,300,400,500,600,600,700,800,900,1000或其以上个核苷酸,用这里没有具体列举的整数表示的长度(例如,11等)作为下限,也是适宜的。
BAC载体内的基因所编码的多肽只要其具有与天然的多肽基本上同一的作用即可。既可以是在其氨基酸序列中包含1个或以上(例如1个或几个)氨基酸的取代,添加和/或缺失,也可以是糖链的取代,添加和/或缺失。
本说明书中所用的“糖链”指连接1个或1个以上单元糖(单糖和/或其衍生物)的化合物。当连接两个或以上的单元糖时,各单元糖与单元糖之间,可以通过糖苷键脱水缩合连接。作为这种糖链,例如其包括但不限于,生物体中所含的多糖(葡萄糖、半乳糖、甘露糖、岩藻糖、木糖、N-乙酰葡糖胺、乙酰半乳糖胺、唾液酸及其复合物或衍生物),以及降解的多糖,糖蛋白,蛋白聚糖,粘多糖,糖脂等由复合生物分子分解或诱导产生的糖链等广义的糖。因此,本说明书中“糖链”可以与“多糖”,“含糖物质”和“碳水化合物”互换使用。除非特别说明,本说明书中所述的“糖链”包含“糖链”和“含有糖链的物质”两方面的含义。
用具有同样疏水性指数的其它氨基酸取代某氨基酸可以生成仍然具有相同生物学功能的蛋白质(例如具有等价的酶活性的蛋白质),在本领域是众所周知的。在这种氨基酸取代中优选疏水性指数在±2以内,进一步优选在±1以内,更优选在±0.5以内。基于疏水性的这种氨基酸取代是有效的,这在本领域中,是可以理解的。在制备变异体时还要考虑亲水性指标。如US4,554,101所示,对氨基酸残基分配了如下的亲水性指数:精氨酸(+3.0);赖氨酸(+3.0);天冬氨酸(+3.0±1);谷氨酸(+3.0±1);丝氨酸(+0.3);天冬酰胺(+0.2);谷氨酰胺(+0.2);甘氨酸(0);苏氨酸(-0.4);脯氨酸(-0.5±1);丙氨酸(-0.5);组氨酸(-0.5);半胱氨酸(-1.0);甲硫氨酸(-1.3);缬氨酸(-1.5);亮氨酸(-1.8);异亮氨酸(-1.8);酪氨酸(-2.3);苯丙氨酸(-2.5);和色氨酸(-3.4)。可以理解的是某种氨基酸可以被具有相同亲水性指数的其它氨基酸取代而提供生物学等价体。在这种氨基酸取代中,优选亲水性指数在±2以内,更优选在±1以内,进一步优选在±0.5以内。
本发明中的“保守取代”是指在氨基酸取代中,原来的氨基酸和被取代氨基酸的亲水性指数和/或疏水性指数类似的取代。例如,保守取代的例子是本领域已知的,如的下列各组内的取代:精氨酸和赖氨酸;谷氨酸和天冬氨酸;丝氨酸和苏氨酸;谷氨酰胺和天冬酰胺;缬氨酸,亮氨酸和异亮氨酸。
本说明书中所述的“变体”是指针对原来的多肽或多核苷酸等物质,使其一部分发生改变所得的物质。作为这样的变体可以列举:取代变体,添加变体,缺失变体,截短变体,等位基因变体等。等位基因(allele)是指属于同一基因座但相互之间存在差异的基因变体。因此,所谓的“等位基因变异体”指与某基因具有等位基因关系的变体。所述的“种同源物”或“同系物”指某种中具有在某基因和氨基酸水平或核苷酸水平上同源性(优选至少60%或以上同源,更优选至少80%或以上,85%或以上,90%或以上,95%或以上的同源性)的物质。本说明书中记载了这种种同源物的制备方法。所谓“直向同源物”也称作直向同源基因是指经过种分化,由共同的祖先分化形成的基因。例如,以具有多种基因结构的血红蛋白基因家族为例,人和小鼠的α-血红蛋白基因是直向同源物。人的α-血红蛋白基因和β-血红蛋白基因是种内同源物(由于基因发生重复所产生的基因)。直向同源物对于分子系统树的推定是有用的。本发明的直向同源物在本发明中也是有用的。
“保守性(保守性改变)变体”在氨基酸序列和核酸序列两方面均适用。对于特定的核酸序列,所谓经保守性改变所得的变体是可以编码同一或基本上同一的氨基酸序列的核酸。当核酸不编码氨基酸序列时,称为实质上同一的序列。由于遗传密码子的简并性,许多功能同一的核酸编码任意给定的蛋白质。例如密码子GCA,GCC,GCG和GCU都编码丙氨酸。因而在所有通过所述密码子表示丙氨酸的位置,在所编码的多肽不发生改变的情况下,该密码子可被上述对应的任意密码子所替换。这种核酸变化是保守性变异的一种,即“沉默改变(变异)”。本说明书中记录了编码多肽的所有核酸序列,以及该核酸的所有沉默变异。本领域中,核酸中的各密码子(通常唯一编码甲硫氨酸的AUG,通常为编码色氨酸的TGG除外)均可能为产生同样功能的分子而被改变。因此,所述的各序列也暗含了编码多肽的核酸的各沉默变异。优选地这样的改变要避免对多肽高级结构产生较大影响的氨基酸即半胱氨酸的替代。
在本说明书中为了制备包含编码功能等价多肽基因的BAC载体,除氨基酸的取代之外,也可以进行氨基酸的添加,缺失或修饰。所谓氨基酸替代,是指对原始的肽进行一个以上,例如1-10个,优选1-5个,更优选1-3个氨基酸的替代。所谓氨基酸的添加是指对原始的肽进行一个以上,例如1-10个,优选1-5个,更优选1-3个氨基酸的添加。所谓氨基酸的缺失,是指使原始肽进行一个或一个以上,例如1-10个,优选1-5个,更优选1-3个氨基酸的缺失。氨基酸的修饰包括但不限于酰胺化、羧基化、硫酸化、卤化、烷基化、糖基化作用、磷酸化作用、羟基化作用、酰化作用(例如、乙酰化)等等。替代或添加的氨基酸可以是天然氨基酸,也可以是非天然的氨基酸或氨基酸的类似物。优选天然氨基酸。
本说明书中使用的多肽的核酸的形式指能表达其多肽的蛋白质形式的核酸分子。对于所述的核酸分子,只要其表达的多肽具有与天然的多肽基本上同一的活性,则如上所述核酸分子中的一部分序列可以缺失,也可以被其它碱基所替代,也可以部分插入其它的核酸序列。也可以在5’端和/或3’端添加其它的核酸。还可以是对编码多肽的基因在严格条件下杂交、并且编码与这种多肽具有基本上相同功能多肽的核酸分子。这些基因是本领域所公知的,可以应用于本发明。
这样的核酸可以利用已知的PCR方法获得,也可以通过化学合成来获得。这些方法还可以与定点诱变,杂交等结合使用。
本说明书中所谓的多肽或多核苷酸的“替代,添加或缺失”是指相对于原始的多肽或多核苷酸各氨基酸或其替代物,或者各核苷酸或其替代物分别被替代、添加或缺失。这样的替代、添加或缺失的技术,在本领域是已知的,作为这种技术的例子,可以列举定点诱变技术等。替代,添加或缺失只要是一个或一个以上则可以是任意数量,所述的数量可以是任意多个,只要经替代,添加或缺失后所得的变体能够保持目的功能即可。例如,所述的数量可以是一个或几个,优选是全长的20%以内、10%以内,或100各以下,50个以下,25个以下等。
高分子的结构(例如多肽的结构)可以在各种水平结构上进行描述。对于该结构的一般性论述可以参照例如Alberts等人,MolecularBiology of the Cell(3rd Ed.,1994),和Cantor and Schimmel,BiophysicalChemistry Part I:The Conformation of Biological Macromolecules(1980)。本说明书中所应用的一般分子生物学技术,可以参照AusubelF.A.等人编著的(1988),Current Protocols in Molecular Biology,Wiley,New York,NY;Sambrook J.等人,(1987)Molecular Cloning:A Laboratory Manual,2nd Ed.,Cold Spring Harbor LaboratoryPress,Cold Spring Harbor,NY,等等只要是本行业的从业者,就很容易实施。
在本说明书中,当涉及基因时,所谓“载体”指可以将目的核苷酸序列递送到目的细胞中的物质。作为这样的载体可以利用能够在原核生物细胞、酵母、动物细胞、植物细胞、昆虫细胞、动物个体和植物个体等的宿主细胞中自我复制的,或者可以整合到染色体中,并且在适合本发明的多核苷酸转录的位置包含启动子。
“BAC载体”是以大肠杆菌的F质粒为基础制备的质粒,其是可以在大肠杆菌等细菌中稳定保持并增殖300kb以上的巨大DNA片段的载体。BAC载体至少包含BAC载体复制所必需的区域。作为所述的复制所必需的区域,例如可以列举F质粒复制起点的oriS,或其变体。
本说明书中所谓“BAC载体序列”是指包含作为BAC载体功能所必需序列的序列。需要时,BAC载体序列还可以进一步包含“重组蛋白依赖的重组序列”和/或“选择标记”。
本说明书中,所谓核酸的“重组”可以与术语“同源重组”互换使用。由两个不同的核酸分子的结合开始,发生交换,产生核酸的新组合。在本说明书中使用时,同源重组包含“重组蛋白-依赖的重组”与“重组蛋白-非依赖性重组”两方面的含义。所谓“重组蛋白-依赖性重组”是指在重组蛋白存在下发生,在重组蛋白不存在下不发生的同源重组。而所谓“重组蛋白非依赖性重组”是指与重组蛋白存在与否无关的同源重组。本说明书中的所谓“重组蛋白依赖的重组序列”是指产生重组蛋白-依赖性重组的序列。而所谓“重组蛋白-非依赖性重组序列”是产生重组蛋白-非依赖性重组的序列。重组蛋白依赖的重组序列在重组蛋白存在下产生重组,在重组蛋白不存在时不产生重组。优选重组蛋白与重组蛋白依赖的重组序列特异性作用,而对重组蛋白依赖的重组序列之外的序列不发生作用。
作为典型的重组蛋白依赖重组序列和重组蛋白包括但不限于:来自P1噬菌体的loxP(P1交换位点)序列和Cre(环化重组)蛋白的组合,Flp蛋白和FRT位点的组合,_C31和attB或attP的组合(Thorpe,HelenaM.;Wilson,Stuart E.;Smith,Margaret C.M.,Control of directionalityin the site-specific recombination system of the Streptomyces phage_C31.,Molecular Microbiology(2000),38(2),232-241.)解离酶和res位点的组合(Sadowski P.,Site-specific recombinases:changingpartners and doing the twist,J.Bacteriol.,February 1986;165(2)341-7)(一般参照Sauer B.,Site-specific recombination:developmentsand applications.,Curr.Opin.Biotechnol.,1994 Oct;5(5):521-7)。
本说明书中使用时,所谓“选择标记”是指作为筛选含有BAC载体的宿主细胞指标的功能性基因。作为选择标记,例如但不限于荧光标记,发光标记和药物选择标记。作为“荧光标记”例如编码绿荧光蛋白(GFP)样的荧光蛋白的基因。作为“发光基因”例如,但不限于,编码虫荧光蛋白样荧光蛋白的基因。作为“药物选择标记”包括但不限于编码选自下述蛋白质的基因:二氢叶酸还原酶基因,谷氨酰胺合酶基因、天冬氨酸转氨酶、金属硫蛋白(MT)、腺苷脱氨酶(ADA)、腺苷脱氨酶(AMPD1、2)、黄嘌呤-鸟嘌呤-磷酸核糖基转移酶、UMP合酶、P-糖蛋白、天冬酰胺合酶,以及鸟氨酸脱羧酶。药物筛选标记与所用药物的组合包括例如下述的:二氢叶酸还原酶基因(DHFR)和氨甲蝶呤(MTX)的组合,谷氨酰胺合酶(GS)基因与甲硫氨酸磺基肟(Msx)的组合,天冬氨酸转氨酶(CAD)基因与N-磷酸乙酰-L-天冬氨酸(PALA)的组合,MT基因和镉的组合,腺苷脱氨酶(ADA)基因和腺苷、亚硝基羟基丙氨酸、或2′脱氧柯福霉素的组合,腺苷脱氨酶(AMPD1、2)基因和腺嘌呤、重氮乙酰丝氨酸、或助间型霉素的组合,黄嘌呤-鸟嘌呤-磷酸核糖转移酶基因和霉酚酸的组合,UMP合酶基因和6-azaulysine或吡唑呋喃菌素的组合,P-糖蛋白(P-gp,MDR)基因和多种药物的组合,天冬酰胺合酶(AS)基因和β-天冬氨酰异羟肟酸或合欢氨酸的组合,以及鸟氨酸脱羧酶(ODC)基因和-α-二氟甲基-鸟氨酸(DFMO)的组合。
本说明书中使用时,“表达载体”指调节结构基因和其表达的启动子,加上各种调节元件以在宿主细胞中可以操纵的状态而连接的核酸序列。调节元件优选包含终止子、耐药性基因(例如卡那霉素抗性基因、潮霉素抗性基因)之类的选择标记,以及增强子等。生物(例如,植物)的表达载体的类型和所使用的调节元件的种类根据宿主细胞的不同而不同,这是本领域技术人员已知的。对于植物,本发明所使用的植物表达载体进一步含有T-DNA区域。在利用土壤杆菌转化植物时,T-DNA可以提高基因的导入效率。
本说明书中使用时,所谓“重组载体”指可以将目的多核苷酸序列导入目的细胞的载体。作为这种载体,例如能够在原核生物细胞、酵母、动物细胞、植物细胞、昆虫细胞、动物个体和植物个体等的宿主细胞中自我复制的,或者可以整合到染色体中的载体,并且在适合本发明的多核酸转录的位置包含启动子的载体。
“终止子”是位于基因中编码蛋白质的区域的下游,DNA转录为mRNA时的转录终点,与附加poly A序列相关的序列。已知终止子与mRNA的稳定性相关,并影响基因的表达量。作为终止子,例如但不限于CaMV35S终止子、胭脂氨酸合成酶基因的终止子(Tnos)、烟草PR1a基因的终止子等。
所谓本说明书中所用的“启动子”指确定基因转录的起始部位、并在DNA的ORF中直接调节转录频率的区域、与RNA聚合酶结合起始转录的碱基序列。启动子区域通常大多推定为编码蛋白质的区域的第一个外显子上游2kbp以内的区域,因此假如用DNA分析软件预测基因组碱基序列中编码蛋白质的区域,则可由此推定启动子区域。推定的启动子区域随每个结构基因变化,通常位于结构基因的上游,但并不局限于此,启动子有时候也位于结构基因的下游。优选推定启动子区域位于第一外显子翻译起始点上游约2kbp以内。
本说明书中,所谓本发明的启动子表达是“组成型”的,是指在生物的所有组织中,在生物发育的任意阶段启动子均以大致一定量表达的性质。具体而言,在与本说明书实施例同样的条件下,利用Northern印迹分析时,例如在任意时间点(例如两个或两个以上的时间点(例如第5天和第15天))在同一或对应部位均可以观察到几乎相同的表达量时则在本发明意义上,表达是组成型表达。一般认为组成型表达的启动子对通常位于发育环境下维持生物的恒定性起作用。这些特性可以通过下述方式确定,由生物的任意部位提取RNA,利用Northern印迹分析表达量,或者利用Western印迹分析定量所表达的蛋白质。
“增强子”可以用于提高目的基因的表达效率。在动物细胞中使用时,作为增强子优选包含SV40启动子内上游侧序列的增强子区域。增强子,可以使用多个,也可以使用1个,还可以不使用。
在本说明书中使用时,“有效连接”指将目的序列的表达(起作用)设置于某些转录翻译调节序列(例如启动子,终止子等)或翻译调节序列的控制之下。为了使启动子作用于基因的有效连接通常是在该基因的上游设置启动子,但启动子无需紧邻该基因。
在本说明书中使用时,所述的“转化”,“转导”和“转染”,在没有特别说明的情况下可以互换使用,指向宿主细胞中导入核酸。作为转化方法,只要是可以向宿主细胞中导入DNA的方法均可以使用,例如可以例举电穿孔法,粒子枪(基因枪)法,磷酸钙法等各种已知的方法。
所谓“转化体”指通过转化所制备的细胞等生物体的全部或一部分。作为转化体,例如原核细胞、酵母、动物细胞、植物细胞、昆虫细胞等。转化体根据对象的不同,可以指转化细胞、转化组织、转化宿主等。本说明书中包含上述所有的转化形态,但在具体的上下文中可能指某种特定形态。
作为原核生物细胞的例子,可以列举属于大肠杆菌属、沙雷氏菌属(Serratia)、芽胞杆菌属(Bacillus)、短杆菌属(Brevibacterium)、棒状杆菌属(Corynebacterium)、微杆菌属(Microbacterium)、假单胞菌属(Pseudomonas)等的原核细胞,例如大肠杆菌XL1-Blue,大肠杆菌XL2-Blue,大肠杆菌DH1,大肠杆菌MC1000,大肠杆菌KY3276,大肠杆菌W1485,大肠杆菌JM109,大肠杆菌HB101,大肠杆菌No.49,大肠杆菌W3110,大肠杆菌NY49,大肠杆菌BL21(DE3),大肠杆菌BL21(DE3)pLysS,大肠杆菌HMS174(DE3),大肠杆菌HMS174(DE3)pLysS,无花果沙雷氏菌(Serratia ficaria),居泉沙雷氏菌(Serratia fonticola),液化沙雷氏菌(Serratia liquefaciens),粘质沙雷氏菌(Serratia marcescens),枯草芽孢杆菌(Bacillussubtilis),解淀粉芽孢杆菌(Bacillus amyloliquefaciens),Brevibacteriumammmoniagenes,Brevibacterium immariophilum ATCC14068,解糖短杆菌(Brevibacterium saccharolyticum)ATCC14066,谷氨酸棒杆菌(Corynebacterium glutamicum)ATCC13032,谷氨酸棒杆菌ATCC14067,谷氨酸棒杆菌ATCC13869,嗜乙酰乙酸棒杆菌(Corynebacterium acetoacidophilum)ATCC13870,嗜氨微杆菌(Microbacterium ammoniaphilum)ATCC15354,假单胞菌某种(Pseudomonas sp.)D-0110,等等。
作为动物细胞,例如人MRC-5细胞,人HEL细胞,人WI-38细胞,小鼠骨髓瘤细胞,大鼠骨髓瘤细胞,人骨髓瘤细胞,小鼠杂交瘤细胞,中国仓鼠CHO细胞,BHK细胞,非洲绿猴肾细胞,人白血病细胞,HBT5637(特公昭63-299),人结肠癌细胞株等。作为小鼠骨髓瘤细胞包括ps20,NSO等。作为大鼠骨髓瘤细胞包括YB2/0等。作为人胎儿肾细胞例如HEK293(ATCC:CRL-1573)等。作为人白血病细胞包括BALL-1等。作为非洲绿猴肾细胞包括COS-1,COS-7,Vero细胞等。人结肠癌细胞株包括HCT-15等。
本说明书中的“动物”指本领域中最广义的意义,包含脊椎动物和无脊椎动物。作为动物例如但不限于哺乳动物纲、鸟纲、爬行纲、两栖纲、鱼纲、昆虫纲、蠕虫纲等。
本说明书中,所谓生物“组织”是细胞的集团,该集团中指具有一定的相同作用的细胞。组织可以是脏器(器官)的一部分。脏器(器官)中大多数细胞具有相同的功能,但其中也可以混有功能上具有微妙差异的细胞,本说明书中的组织,只要共同具有一定特性,也可以混有各种不同的细胞。
本说明书中的“器官(脏器)”指具有一种独立的形态,包含一种或一种以上的组织配合形成的行使特定功能的结构体。对于植物,例如但不限于愈伤组织、根、茎、植干、叶、花、种子、胚芽、胚、果实等。对于动物例如但不局限于胃、肝脏、肠、胰腺、肺、气管、鼻、心脏、动脉、静脉、淋巴结(淋巴系统)、胸腺、卵巢、眼、耳、舌、皮肤等等。
本说明书中,所谓“转基因”指将特定基因整合到某生物中或整合了这类基因的生物(例如,包括植物或动物(小鼠等))。
本发明的生物,指动物时,转基因生物可以利用微注射法(微量注射法),病毒载体法,干细胞(ES)法(胚胎干细胞法),精子载体法,染色体片段导入法(transsomic法),附加体法等转基因动物的制备技术来制备。这些转基因动物的制备技术在本领域中是已知的。
本说明书中使用时,所谓“筛选”指通过特定的操作和/或评价方法从众多的候选物中选择具有某种所需特定性质的物质或宿主细胞或病毒等。可以理解的是,本发明还包含通过筛选所获得的具有所需活性的病毒。
本说明书中的“芯片”或“微芯片”可以互换使用,指具有多种功能的、构成系统的一部分的微型集成电路。作为芯片例如但不限于DNA芯片,蛋白质芯片等。
本说明书中所谓的“阵列”是指含一个或一个以上(例如1000个或以上)目的物的组合物(例如DNA,蛋白质或细胞)的组合物阵列设置的模式或具有模式的基板(如芯片)。阵列中,小基板(例如10×10mm等)上模式化的阵列称为微阵列,在本说明书中微阵列和阵列可以互换使用。因此,即使在比上述基板大的物质上模式化的阵列也可称为微阵列。例如,阵列可由固定在其自身的固相表面或固定在膜表面的目的细胞组构成。阵列优选包括含有相同或不同病毒的至少102个,更优选至少103,更优选至少104,更优选至少105个细胞。这些细胞优选设置在125×80mm,更优选10×10mm的表面上。阵列的形式,从96-孔微滴定板,384-孔微滴定板等微型滴定板的尺寸考虑,最好是载玻片大小的板。含所固定的目的物质的组合物,既可以是一种,而可以是数种。这样的种类数量可以是达到一个斑点的任意数量。例如,可以固定为含约10种,约100种,约500种,和约1,000种目的物质的组合物。
基板的固相表面或膜上可以设置上述任意数量的目的物(如细胞的生物分子),一般每个基板上可以设置不多于108个生物分子,在另外的实施例中不多于107个生物分子,不多于106个生物分子,不多于105个生物分子,不多于104个生物分子,不多于103个生物分子,或不多于102个生物分子。也可以设置含有超过108个生物分子目的物质的组合物。这种情况下,优选基板的尺寸更小。特别地,含目的物质的组合物(如细胞等)的斑点尺寸可以与单一生物分子的尺寸一样小(例如1-2nm的数量级)。某些情况下,基板面积的下限在某些情况下取决于基板上生物分子的数量。
阵列上可以设置生物分子“斑点”。本说明书中的所谓“斑点”指含目的物质组合物的一定集合。本说明书中的“点斑点”指在某基板或板上制备含某种目的物质组合物的斑点。点斑点可以利用任意的方法进行,例如利用移液管等制备或利用自动装置制备。这些方法是本领域所公知的。
作为本说明书中所用的术语“地址(address)”指基板上独特的位置,其可与其它独特的位置相区别。地址适合于与带有该地址的斑点建立联系,地址可以采用任意的形状以使得在各地址中所存在的物质与其它地址中所存在的物质相区别识别(例如,光学的方式)。可将地址定义为例如但不限于,圆形,椭圆形,正方形,长方形或不规则的形状。因此,“地址”表示的是一种抽象的概念,而“斑点”是表示一种具体化的概念。当不必要区分二者时本说明书中“斑点”和“地址”可以互换使用。
预定各地址的尺寸依赖于基板的尺寸,特定基板上的地址数,含目的物质的组合物的数量和/或可利用的试剂、微粒的大小,以及为该阵列的任意方法所必需的分辨率程度。其大小例如可以在众1-2nm到数厘米的范围,可以是与应用该阵列一致的任意大小。
设定地址的空间配置和形状可设计成使该微阵列适用于特定的应用。地址可以设置得密集也可以设置得分散,或者可以为适合于特定型式分析物所需要的模式或亚组。
本说明书使用时,“支持物”指可以承载细胞,细菌,病毒,多核苷酸或多肽的物质。作为支持物的材料可以是具有与本发明中所使用的细胞等小共价键或非共价键地结合的特性的,或能衍生具有上述特性的任何固体材料。
作支持物所用的材料可以使用能够形成固体表面的任意物质,例如但不限于,玻璃、氧化硅、硅、陶瓷、二氧化硅、塑料、金属(包括合金),天然存在的以及合成的聚合体(例如聚苯乙烯、纤维素、脱乙酰壳多糖、右旋糖酐和尼龙)等等。优选支持物包含形成疏水键的部分。支持物可以由多层不同材料层构成。例如,可以使用如玻璃、石英玻璃、矾土、蓝宝石、镁橄榄石、金刚砂、二氧化硅、氮化硅等等无机材料。还可以使用聚乙烯、乙烯、聚丙烯、聚异丁烯、聚对苯二甲酸乙二醇酯、不饱和聚酯、含氟树脂、聚氯乙烯、聚偏二氯乙烯、聚醋酸乙烯酯、聚乙烯醇、聚乙烯醇缩醛、丙烯酸树脂、聚丙烯腈、聚苯乙烯、缩醛树脂、聚碳酸酯、聚酰胺、酚醛树脂、尿素树脂、环氧树脂、三聚氰胺树脂、苯乙烯-丙烯腈共聚物、丙烯腈-丁二烯-苯乙烯共聚物、硅酮树脂、聚苯撑氧化物、聚砜等等有机材料。另外,还可以使用用于印迹的硝化纤维素膜、聚偏二氟乙烯膜等等。
本发明的水痘-带状疱疹病毒可以作为用于处置,预防和/或治疗感染症的药物组合物成分使用。
本说明书中所谓药物的“有效量”指所述药剂能够发挥目的药效的量。本说明书中,对于所述的有效量中,将最小浓度称为最小有效量。这种最小有效量是本领域所公知的。通常药剂的最小有效量由本领域的技术人员确定,或可由本领域的技术人员适宜地确定。要确定有效量除了实际施用以外,还可以使用动物模型来确定。在本发明中在确定此种有效量时是有用的。
本说明书中“药学上可接受的载体”是指制造医药或如动物药的农药时所使用的物质,并对有效成分不产生不良影响的物质。作为这种药学上可接受的载体例如但不限于下述的:抗氧化剂、防腐剂、着色剂、调味剂、稀释剂、乳化剂、悬浮剂、溶剂、填料、填充剂、缓冲液、运载工具、赋形剂、和/或农业的或药学的佐剂。
本发明的处置方法中所使用的药剂的种类和用量,本领域的技术人员可以根据通过本发明方法所获得的信息(例如与疾病相关的信息)为依据,考虑使用目的,对象疾病(种类,严重程度等),患者的年龄,体重,性别,既往病史,施用被检部位的形态或种类等容易地确定。本行业从业人员考虑对被检体(或患者)施用的频率以及使用目的,对象疾病(种类,严重程度等),患者的年龄,体重,性别,既往病史和治疗过程等,可以很容易决定本发明的监测方法。作为监测疾病状态的频率例如每日到数月一次(例如一周一次到1个月一次)的频率。优选一边观察施用过程一边实施每周一次到每月一次的监测。
本说明书中所用的“指导说明书”针对实施本发明方法的医生,患者等而记述的材料。该指导说明书记载了在放射性治疗之前或之后(例如24小时之内等)施用本发明的药物的指示性文字说明。所述的说明书按照实施本发明的国家监督管理部门(例如,日本的厚生劳动省,美国的食品药品局(FDA)等)所规定的样式编写,并标明已得到所述监督管理部门的认可。本说明书即所谓的包装说明书,通常以纸为介质提供,但也并不局限于此,也可以通过例如电子媒体(例如通过互联网提供的主页或电子邮件)的形式提供。
必要时,本发明的治疗中,可以使用两种或两种以上的药剂。在使用两种以上的药剂时,可以使用由类似性质或来源的物质,也可以使用不同性质或来源的药剂。可以通过本发明的方法获得有关使用两种以上药剂的方法中疾病水平的相关信息。
当本发明中对于类似种类(例如相对人而言,小鼠)的生物、培养细胞、组织等,一旦确定了某种特定的糖链结构的分析结果与疾病水平的关系,就能够确定对应的糖链结构的分析结果与疾病水平的关系,本待业从业人员很容易理解这一点。这种情况可以得到例如下述文献的支持“Doubutsu Baiyosaibo Manual(Animal Culture CellManual),Seno等人编,Kyoritsu shuppan,1993,所述文献全文引入本说明书作为参考。
(本说明书中所用的一般技术)
本说明书中所使用的技术在没有特别说明的情况下属于该领域的技术范围内的糖链科学,微射流学、显微加工、有机化学、生物化学、遗传工程、分子生物学、微生物学、遗传学及其相关领域中的已知常用技术,这些技术例如在下述文献及本说明书中其它部分的引用文献中也进行了详细说明。
有关显微加工在例如Campbell,S.A.(1996),The Science andEngineering of Microelectronic Fabrication,Oxford University Press;Zaut,P.V.(1996),Micromicroarray Fabrication:a Practical Guideto Semiconductor Processing,Semiconductor Services;Madou,M.J.(1997),Fundamentals of Microfabrication,CRC1 5 Press;Rai-Choudhury,P.(1997),Handbook of Microlithography,Micromachining&Microfabrication:Microlithography;等文献中均有所记载,所述文献相关部分引入本说明书作为参考。
本说明书中所用的分子生物学方法,生物化学方法,微生物学方法,糖链科学方法是该领域所公知的惯用方法,可参见,例如Molecularbiology techniques,biochemistry techniques,and microbiologytechniques used herein are well known and commonly used in the art,and are described in,for example,Maniatis,T.等人(1989),MolecularCloning:A Laboratory Manual,Cold Spring Harbor and及其3rd Ed.(2001);Ausubel,F.M.等人eds,Current Protocols in MolecularBiology,John Wiley & Sons Inc.,NY,10158(2000);Innis,M.A.(1990),PCR Protocols:A Guide to Methods and Applications,Academic Press;Innis,M.A.等人(1995),PCR Strategies,AcademicPress;Sninsky,J.J.等人(1999),PCR Applications:Protocols forFunctional Genomics,Academic Press;Gait,M.J.(1985),Oligonucleotide Synthesis:A Practical Approach,IRL Press;Gait,M.J.(1990),Oligonucleotide Synthesis:A Practical Approach,IRLPress;Eckstein,F.(1991),Oligonucleotides and Analogues:A PracticalApproach,IRL Press;Adams,R.L.等人(1992),The Biochemistryof the Nucleic Acids,Chapman & Hall;Shabarova,Z.等人(1994),Advanced Organic Chemistry of Nucleic Acids,Weinheim;Blackburn,G.M.等人(1996),Nucleic Acids in Chemistry and Biology,OxfordUniversity Press;Hermanson,G.T.(1996),Bioconjugate Techniques,Academic Press;Method in Enzymology 230,242,247,AcademicPress,1994;Special issue,Jikken Igaku(Experimental Medicine)“Idenshi Donyu & Hatsugenkaiseki Jikkenho(Experimental Methodfor Gene introduction & Expression Analysis)”,Yodo-sha,1997;等文献中均有所记载,本说明书中作为参考引用其相关部分(也可以是全部)。
优选实施方案的描述
下面对本发明的优选实施方案进行描述。这些实施方案仅作为对本发明示例,而不应理解为对本发明范围的限定。本领域的技术人员应该理解参照下述优选的实施例,在本发明的范围内可以很容易地对本发明进行适当更改和变动。
本发明的一个方面在于提供重组水痘-带状疱疹病毒。优选地,所述水痘-带状疱疹病毒在其基因组序列中包含BAC载体序列。通过构建含有BAC载体序列的水痘-带状疱疹病毒基因组,可以在细菌内将水痘-带状疱疹病毒基因组作为BAC分子来操作。使用的BAC载体序列优选包含来自于F质粒的复制起点,也可以是来自F质粒复制起点之外的序列,只要具备300kb或以上的序列并能在细菌细胞中作为细菌人工染色保持并复制即可。本发明的BAC载体可以在细菌宿主细胞,优选大肠杆菌中保持和/或扩增。优选地,所述BAC载体的一部分插入到水痘-带状疱疹病毒基因组的非必需区域中,可以作为包含水痘-带状疱疹病毒基因组的BAC进行操作。当将含有水痘-带状疱疹病毒基因组的BAC引入到哺乳动物细胞中时,所述的重组水痘-带状疱疹病毒即可产生并增殖。作为重组水痘-带状疱疹病毒的宿主细胞,可以使用野生型水痘-带状疱疹病毒株能够增殖的任意哺乳动物细胞。优选该宿主细胞来源于人,但不限定于此,例如:人MRC-5细胞,人HEL细胞,和人WI-38细胞。
制备包含水痘-带状疱疹病毒基因组的BAC载体的方法
要利用水痘-带状疱疹病毒和BAC载体制备包含水痘-带状疱疹病毒基因组的BAC载体,可以使用同源重组方法等各种已知的方法。
作为使用同源重组的方法,例如应用下述核酸分子的方法,所述核酸分子具有与水痘-带状疱疹病毒基因组的同源碱基序列相连的环状BAC载体序列。
利用含有与水痘-带状疱疹病毒基因组的同源序列相连的环状BAC载体序列的核酸,制备包含水痘-带状疱疹病毒基因组的BAC载体的制备方法,该方法包括代表性步骤:(1)将所述核酸与水痘-带状疱疹病毒基因组一起导入宿主(例如,人株化细胞)内;(2)培养所述的宿主细胞,使与环状BAC载体序列连接的同源序列与水痘-带状疱疹病毒基因组序列之间进行同源重组;(3)筛选通过同源重组产生的,含有整合了BAC载体序列的水痘-带状疱疹病毒基因组序列的宿主细胞;(4)培养所述的宿主细胞,抽提环状病毒DNA的步骤。
另外,为了利用水痘-带状疱疹病毒基因组和BAC序列制备含水痘-带状疱疹病毒基因组的BAC,除利用同源重组之外,还可以利用核酸的限制酶片段等其它公知的方法。
水痘-带状疱疹病毒基因组中用于导入BAC载体序列的非必需区域选自以下区域:基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
优选地,所述的非必需区域是基因11的ORF侧翼区域或基因12的ORF侧翼区域。这是由于基因11和基因12是水痘-带状疱疹病毒基因组中连续的非必需基因,因此进行同源重组的核酸。或者也可以将BAC载体序列的一部分插入到水痘-带状疱疹病毒基因组中基因62的ORF内的区域中。
本发明使用的BAC载体序列优选包括重组蛋白依赖的重组序列和/或选择标记。优选地,所述选择标记序列是药物选择标记和/或编码绿荧光蛋白的基因。这是因为可以简便地确认所需基因的存在。
作为本发明的起始物质使用的水痘-带状疱疹病毒既可以来源于野生株也可以来源于突变株。优选作为本发明的起始物质的水痘-带状疱疹病毒是经减毒的病毒,例如Oka疫苗株或在基因62中带有变异的水痘-带状疱疹病毒。作为“经减毒的水痘-带状疱疹病毒”,可以列举具有一个,或2个或以上选自下述基因62的变异的组合的病毒:
(a)第2110位的G取代;
(b)第3100位的G取代;
(c)第3818位的C取代;
(d)第4006位的G取代,
(e)第1251位的G取代;
(f)第2226位的G取代;
(g)第3657位的G取代;
(h)第162位的C取代;
(i)第225位的C取代;
(j)第523位的C取代;
(k)第1565位的C取代;
(l)第1763位的C取代;
(m)第2652位的C取代;
(n)第4052位的C取代;和
(o)第4193位的C取代。
本发明的又一方面提供用于制备上述病毒的载体和用于制备上述病毒的方法。本发明的再一方面提供含有上述病毒的药物组合物以及疫苗形式的药物组合物。
本发明的重组水痘-带状疱疹病毒可用作疫苗。这是由于其包含大量的具有与野生型病毒相同结构的蛋白。
本发明的另一方面在于提供向为产生本发明疫苗的载体中引入突变的方法。该方法包括以下步骤:将该载体导入细菌宿主细胞的步骤;将包含由一部分水痘-带状疱疹病毒基因组组成的片段质粒载体导入到该细菌宿主细胞中的步骤,其中该片段至少包含一个变异的步骤;培养该细菌宿主细胞的步骤;从所培养的细菌宿主细胞中分离具有BAC载体序列的载体的步骤。在上述方法中,细菌宿主细胞内,为产生本发明疫苗的载体与包含由水痘-带状疱疹病毒基因组的一部分所组成的片段载体间发生同源重组,结果使为产生本发明疫苗的载体在含有一部分水痘-带状疱疹病毒基因组的片段上具有变异。
在上述的方法中,作为将载体导入细菌宿主细胞中的步骤,可以使用电穿孔等公知的方法。同样可以将包含由水痘-带状疱疹病毒基因组的一部分所组成的片段载体导入到细菌宿主细胞中。作为向该片段中导入变异的方法,使用PCR的变异导入方法是众所周知的,例如在四种核苷酸中有一种数量较少的条件下,使用不具备校正功能的耐热性聚合酶,就可以随机地导入变异。此外,使用带有变异碱基序列的引物进行PCR也可以在所需要的位置引入所需要的突变。通过培养该细菌细胞在为产生本发明疫苗的载体与包含由水痘-带状疱疹病毒基因组的一部分所组成的片段的载体之间发生同源重组,结果使用于生产本发明的疫苗的载体在由水痘-带状疱疹病毒基因组的一部分组成的片段上的具有变异。为了用细菌宿主细胞制备BAC载体序列可以使用多种已知的方法,例如碱法等,也可以使用市售的试剂盒。
本发明的另一方面在于提供为产生本发明疫苗的载体中引入突变的另一种方法。该方法包括以下步骤:将该载体导入到宿主细胞的步骤;将包含由水痘-带状疱疹病毒基因组的一部分组成的第一片段的第一质粒载体导入到该宿主细胞中并且其中该第一片段至少包含一个变异的步骤;将包含由水痘-带状疱疹病毒基因组的一部分组成第二片段的第二质粒载体导入该细菌宿主细胞的步骤,其中该第二片段至少包含一个变异,且第二片段和第一片段不同;培养该细菌宿主细胞的步骤;从所培养的细菌宿主细胞中分离具有BAC载体序列的载体的步骤。
本发明的另一方面在于提供用于生产本发明的疫苗的核酸盒。所述的核酸盒优选是包含在细菌细胞内可与水痘-带状疱疹病毒基因组发生同源重组的第一片段、BAC载体序列、在细菌细胞内可与水痘-带状疱疹病毒基因组发生同源重组的第二片段的核酸盒,其中,该BAC序列的两端分别与第一片段和第二片段相连。其中,第一片段和第二片段优选至少为1kb,至少1.5kb,或至少2kb的长度。该第一片段和第二片段与水痘-带状疱疹病毒基因组序列,优选具有至少80%同一,至少85%同一,至少90%同一,或至少95%同一。
优选地,所述第一片段和所述第二片段是各自独立的,来源于选自下述水痘-带状疱疹病毒基因组中的区域,或与选自下述区域的区域至少80%,85%,90%,或95%同一:基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因62的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因62的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因69的ORF的侧翼区域。
优选地,所述的第一片段和第二片段来自水痘-带状疱疹病毒基因组的不同区域。所述的第一片段和第二片段是各自独立的,既可以来源于基因11的ORF的侧翼区域或基因12的ORF侧翼区域。优选地,所述的BAC载体序列包含重组蛋白依赖的重组序列和/或选择标记以控制同源重组并易于检测目的基因。所述的选择标记既可以是药物选择标记,也可以是编码绿荧光蛋白样的荧光蛋白的基因。代表性地,所述的BAC载体序列具有如SEQ ID NO.:2中所示的核酸序列,所述的核酸盒具有SEQ ID No.:2中所述的核酸序列。
(变异型重组水痘-带状疱疹病毒的制备)
利用本发明的方法,可以简便地制备具有导入了变异的水痘-带状疱疹病毒基因组的变异性水痘-带状疱疹病毒。
所述变异的导入可以利用下述公知的方法进行。
向大肠杆菌中导入(a)VZV-BAC-DNA质粒和(b)作为变异核酸的、具有包含任意变异的水痘-带状疱疹病毒基因组部分序列的穿梭载体或PCR产物。通过VZV-BAC-DNA质粒与所述变异核酸之间发生同源重组,可以向VZV-BAC-DNA质粒中导入变异。另外也可以利用转座子随机导入突变。导入了变异的VZV-BAC-DNA质粒可以容易地在大肠杆菌中选择并扩增。通过由具有变异的VZV-BAC-DNA产生病毒,可以获得重组水痘-带状疱疹病毒(MarkusWagner,TRENDS in Microbilogy,Vol.10,No.7,July 2002)。以下列举具体例。
(1)利用包含变异的水痘-带状疱疹病毒基因的温度敏感型穿梭载体作为变异核酸的情况:
首先,对穿梭载体和VZV-BAC-DNA质粒通过第一同源区域重组,生成穿梭载体和VZV-BAC-DNA质粒相连的共整合体。接下来,由于穿梭载体的复制起点是温度敏感型的,所以去除了穿梭质粒。在第二个重组事件中去除了共整合的部分。当第二个重组事件通过第一同源区域介导发生时,生成用于重组的、具有与VZV-BAC-DNA相同序列的质粒。相反,当第二重组现象通过不同于第一同源区域的第二同源区域介导发生时,得到具有穿梭载体上变异的变异型VZV-BAC-DNA质粒。当第一同源区域和第二同源区域长度大致相同时,第二重组事件由第二同源区域介导发生的几率与第二重组事件在第一同源区域介导发生的几率几乎相同。由此,所得的约二分之一的VZV-BAC-DNA质粒是具有与用于重组的序列相同序列的质粒,另外约二分之一是具有导入到穿梭载体中的具有变异的质粒。
(2)利用线性DNA片段的情况:
该方法中,例如,利用来自原噬菌体recET的重组功能,或利用来自细菌噬菌体λ的redαβ的重组功能,由线性DNA片段向环状VZV-BAC-DNA分子导入变异。具体说来,将与靶序列相连的选择标记和含有同源序列的线性DNA片段,连同VZV-BAC-DNA一起导入能够进行同源重组的大肠杆菌中。为了避免线性DNA在大肠杆菌中的分解,要使用缺失了外切核酸酶的大肠杆菌,或使来自细菌噬菌体的外切核酸酶抑制剂redγ(gam)表达。线性DNA在其两端具有与VZV-BAC-DNA质粒同源的区域。通过该同源区域介导同源重组的发生,由此可以将线性DNA中的目的序列导入VZV-BAC-DNA中。当利用recET或red αβ的重组功能时,这些重组功能可以通过约25到50个核苷酸长度的同源序列发生同源重组。比recA介导的同源重组使用起来更简便。
(3)使用转座子的情况:
利用转座子元件可随机插入大肠杆菌内核酸的功能。例如,将转座子元件和VZV-BAC-DNA导入大肠杆菌,通过向VZV-BAC-DNA内随机插入转座子元件产生插入变异。
另外,例如利用诱变剂(例如亚硝基胍)对具有VZV-BAC-DNA样的重组水痘-带状疱疹病毒的宿主细胞本身进行处理,可以在重组水痘-带状疱疹病毒基因组内随机地导入变异。
(配方)
本发明提供利用将有效量的治疗剂、预防剂施用、接种于受试者,对疾病或障碍(例如感染性疾病)进行处置的方法。治疗剂、预防剂意味着与药学上可接受的载体形式(例如灭菌载体)组合形成的本发明组合物。
对于预防剂和治疗剂,应考虑到不同患者的临床状态(特别是预防剂和治疗剂单独使用时的副作用)、送达部位、施用方法、施用计划以及本领域技术人员所知的其它因素,根据符合医疗实施基准(GMP)的方式开处方以及用药。作为本说明书的目的的“有效量”是在进行过上述考虑后进行确定。
作为一般的方案,非经口施用的治疗剂/预防剂的药学总有效量,单位用量为患者体重的约1μg/kg/day到10mg/kg/day,但这也要根据上述的治疗判断进行。关于本发明的细胞生理活性物质,更优选其用量至少为0.01mg/kg/day,更优选对于人为约0.01-1mg/kg/day之间。连续施用时,典型的是以约1μg/kg/hour-约50μg/kg/hour的给药速度一天1-4次注射或连续皮下注入(例如应用迷你泵)的任意一种方式施用治疗剂/预防剂。也可以利用静脉内袋溶液(bag solution)。观察变化所必需的处置时间和产生反应处置后的间隔,根据所需效果的不同而变化。
所述的治疗剂/预防剂可以作为经口、直肠内、非经口的、intracistemal、阴道内、腹腔内、局部的(作为通过粉剂、软膏、凝胶剂、滴剂或经皮贴剂等)口内,或经口或鼻腔喷入法来施用。所谓“药学上可接受的载体”指非毒性的固体,半固体或液体的填充剂、稀释剂、包被材料或任意形式的制剂辅剂。本说明书中所述的“非经口的”包含静脉内、肌内、腹腔内、胸骨内、皮下和关节内注射和注入的施用方式。
本发明的治疗剂/预防剂可通过缓释性装置适当施用。缓释性治疗剂/预防剂可适宜地通过经口、直肠内、非经口的、intracistemal、阴道内的、腹腔内的、局部的(粉剂、软膏、凝胶剂、滴或经皮贴剂)的口内或经口或鼻腔喷入来施用。所谓“药学上可接受的载体”指非毒性的固体,半固体或液体的填充剂、稀释剂、包被材料或任意形式的制剂辅剂。本说明书中所述的“非经口的”包含静脉内、肌内、腹腔内、胸骨内、皮下和关节内注射和注入的施用方式。
对于肠胃外施用,在一个实施方案中治疗剂/预防剂以所需的纯度与药学上可接受的载体以单位给药量的可能注射形态(溶液、悬浊液或乳浊液)混合配置,其中,所述药学上可接受的载体在所使用的给药量和浓度条件下对受体没有毒性且与配方中的其它成分相匹配。例如,所述的配置物优选不含氧化剂和已知对治疗剂/预防剂有害的其它化合物。
通常,所述的配置物通过使治疗与剂/预防剂与液体载体或精细分割固体载体或这两者均一且紧密接触进行配置。然后,需要时,将所得产物制成所需配置物。优选载体是非经口的载体,更优选是与受体血液等渗的溶液。这种载体工具例如,水、生理盐水、Ringer’s溶液和蔗糖溶液。不挥发性油和油酸乙酯等非水性运载工具,或脂质体两样也可应用于本说明书中。
载体可适当含有微量添加剂,如等渗性和化学稳定性高的物质。这些物质在所用的给药量和浓度下对受体无毒性,这种物质如:磷酸盐、柠檬酸盐、琥珀酸盐、乙酸,及其他有机酸或其盐的缓冲剂;抗氧化剂比如抗坏血酸,低分子量(少于10个残基)的多肽、例如、聚精氨酸或三肽;蛋白、如血清白蛋白、明胶,或免疫球蛋白;亲水聚合物如聚乙烯吡咯烷酮;氨基酸、如甘氨酸、谷氨酸、天冬氨酸,或精氨酸;单糖、二糖,及其他碳水化合物包括纤维素或其衍生物、葡萄糖、甘露糖,或糊精;螯合剂如乙二胺四乙酸;糖醇如甘露醇或山梨糖醇;平衡离子如钠;和/或非离子型表面活性剂如聚山梨酸酯、泊洛沙姆、或PEG。
治疗上可施用的任何药剂可以是不含作为有效成分病毒之外的生物或病毒的状态,即无菌状态。通过灭菌过滤膜(例如0.2micron的膜)过滤可以容易地实现无菌状态。一般而言,治疗剂/预防剂均置于具有无菌入口的容器中,例如,可以用皮下注射针刺穿的带有瓶塞的静脉内用溶液袋,或带有瓶塞的小瓶中。
治疗剂/预防剂通常储存于单位用量或多用量容器,例如密封的安瓿瓶或小瓶中,作为水溶液或需要再构建的冷冻干燥制剂储存。作为冷冻干燥配置物的例子,如10-ml的小瓶中填充5ml经灭菌过滤的1%(w/v)治疗剂/预防剂水溶液,将所得的混合物冷冻干燥。冷冻干燥的治疗剂/预防剂用注射用无菌水再构建可以制备注入的溶液。
本发明提供具有充满本发明治疗剂/预防剂的一种或以上成分的一个或多个容器制药包装或试剂盒。上述容器还附有规范药物或生物制品的制备、使用或销售的政府机关所规定形式的通知,所述通知标明有关政府机关对于施用于人的制造、使用或销售予以认可。另外,所述的治疗剂/预防剂可于其它治疗用化合物组合使用。
本发明的治疗剂/预防剂可以单独施用,也可以与其它治疗剂/预防剂组合施用。作为可与本发明的治疗剂/预防剂组合施用的治疗剂/预防剂例如但不限于:化学治疗剂,抗生素,固醇或非固醇类抗炎药,现有的免疫治疗剂/预防剂,其它细胞因子和/或增殖因子等。所谓组合,例如作为组合物同时施用,分别地但同时或并行施用,或经时顺序施用。所谓的组合的药剂作为治疗用混合物共同施用包含所述组合药剂分别但同时施用的次序,例如对于同一个体通过不同位置的静脉内给药。所述的“组合”施用进一步包含分别施用第一个、接下来施用第二个所述化合物或药剂。
在特定的实施方案中,本发明的治疗剂/预防剂可以与抗反转录病毒剂,核苷酸反转录酶抑制剂,非核苷酸反转录酶抑制剂和/或蛋白酶抑制剂组合施用。
在另一种实施方案中本发明的治疗剂/预防剂与抗生素组合施用。作为可以使用的抗生素包括但不限于氨基糖苷类抗生素、多烯类抗菌素、青霉素类抗生素、头孢烯类抗生素、肽类抗生素、大环内酯类抗生素,和四环素类抗生素。
在另外的实施方案中,本发明的治疗剂/预防剂可单独地、或与抗炎症剂一起施用。可与本发明的治疗剂/预防剂一起施用的抗炎症剂例如但不限于:糖皮质激素和非固醇类抗炎症剂、氨基芳基羧酸衍生物、芳基乙酸酸衍生物、芳基丁酸衍生物、芳基羧酸衍生物、芳基丙酸衍生物、吡唑、吡唑啉酮、水杨酸衍生物、噻嗪酰胺、乙酰氨基己酸、S-腺苷甲硫氨酸、3-氨基-4-羟丁酸、阿米曲林、苄达酸、消炎灵、丁基环已基巴比妥、联苯吡胺、双苯唑醇、依莫法宗、愈创蓝油烃、萘丁美酮、尼美舒利、肝蛋白、奥沙西罗、瑞尼托林、哌立索唑、哌福肟、普罗喹宗、普罗沙唑和替尼达普。
在进一步的实施方案中本发明的治疗剂/预防剂与其它的治疗剂/预防剂(例如放射性治疗)结合施用。
以下通过实施例等对本发明进行详细说明,但下述实施例并不作为对本发明的限制。
实施例1
(重组水痘-带状疱疹病毒的制备)
(1:BAC质粒的制备)
质粒PHA-2使用由Markus Wagner和Ulrich H.Koszinowski(Adler等人,(2000),J.Virol.74:6964-74)获得。为了制备重组病毒,选择水痘-带状疱疹病毒基因11的ORF和基因12的ORF之间的区域作为BAC载体的插入位点。这是基于将外源基因插入所述非必需区域不会对水痘-带状疱疹病毒的增殖产生不良影响的设想。
以水痘-带状疱疹病毒Oka亲株的基因组DNA为模板,分别以引物VZ11F(SEQ ID NO.:1)、VZ11R(SEQ ID NO.:2)、引物-VZ12F(SEQ ID NO.:3)和VZ12R(SEQ ID NO.:4)对水痘-带状疱疹病毒Oka株基因11的ORF和基因12的ORF的片段进行扩增。
(2:用于制备重组质粒的引物的制备)
[表1]
用于制备重组质粒的引物
引物 | 序列 | 产物(碱基对)和质粒 |
VZ11FVZ11RVZ12FVZ12R | S-TATA ACTAGT GCGGCCGC TTACGAAAACGTGCATG-3′SpeI NotIS-CGCG ACCTGGT TTTATTTTACAAACTCCTTTGTGG-3′SexAIS-GCGC ACCAGGT CTCTGTTTAGACCTTAAAATTTG-3′SexAIS-TATA GCGGCCGC TTTTAATCTGGTTGTGGAAATG-3′NotI | VZ CRF11(2652) SK/VZ11-12VZ ORF12(2164) SK/VZ11-12 |
表中,寡核苷酸序列中的限制酶位点用下划线标出,以斜体标示的序列是在VZV序列中不存在的另外的碱基。
分别用SpeI/SexAI和NotI/SexAI消化PCR产物的基因11ORF和基因12ORF的片段。将两个PCR片段克隆到经SpeI和NotI消化的pBluescript SK-(Stratagene)中。把得到的质粒作为SK/VZ11-12。
用Pacl消化质粒pHA-2,然后利用T4 DNA聚合酶将该位点处理成平末端。将所得质粒克隆到SK/VZ11-12的平末端化的SexAI位点。把所得的质粒作为pHA-2/VZ11-12(图1C)。
如图1所示,VZV基因组(A)长约125kbp,包含末端重复(TR)DNA结构域,特有的长(UR)DNA结构域,中间重复(IR)DNA结构域,和特有的短(US)DNA结构域。为了构建重组质粒PHA-2/VZV11-12(C),如上所述,利用适宜的引物,通过PCR扩增VZV基因组中的ORF 11片段和ORF 12片段。所得的重组质粒pHA-2/VZ11-12包含比邻loxP位点(L)的约2.0kbp的侧翼同源序列,以及BAC载体。
(3:通过同源重组制备重组病毒)
制备的质粒pHA-2/VZ11-12包含作为选择标记的contains a鸟嘌呤磷酸核糖转移酶(gpt)基因。还含有夹在两个loxP序列之间的BAC载体序列。因此,通过使Cre重组酶的作用,夹在loxP序列之间的BAC载体序列可被高效地去除。另外通过绿荧光蛋白(GFP)的荧光可以很容易地确认出导入含有BAC载体序列质粒的细胞。
该质粒通过NotI消化线性化。利用Nucleofection unit(Amaxa)通过点穿孔将0.2μg线性化的pHA-2/VZ11-12转染在75-cm2塑料瓶中生长至铺满的HEL细胞。转染后一天,再用水痘-带状疱疹病毒Oka株感染经转染的细胞。
利用50μM的霉酚酸和200μM的黄嘌呤进行利用gpt基因的重组病毒筛选。在HEL细胞中可观察到由水痘-带状疱疹病毒引起的典型的细胞变性效果(CPE)。其中一些细胞可以在荧光显微镜下确认GFP表达。这一结果表明BAC载体已插入到水痘-带状疱疹病毒基因组中,并且使BAC载体中所含的GFP基因得以表达。
(4:重组病毒的富集并向大肠杆菌中导入)
利用gpt基因通过霉酚酸和黄嘌呤的药物筛选以及96孔板有限稀释法富集重组病毒。通过Hirt’s法(Hirt,(1967),J.M.Biol,26:365-9)从感染细胞中提取环状DNA。使用脉冲发生器(Bio-Rad),通过电穿孔的方法(0.2-cm杯,2.5kV)把提取的DNA导入到大肠杆菌中进行转化。利用含17μg/ml氯霉素的琼脂平板对此进行筛选得到含VZV-BAC-DNA的大肠杆菌。
( 5:大肠杆菌中VZV-BAC-DNA质粒的稳定性)
将包含BAC载体(VZV-BAC-DNA)的大肠杆菌在LB培养基上培养22-24小时,所述的BAC载体含有水痘-带状疱疹病毒基因组,用该方法传代3次,最后利用含氯霉素的琼脂平板进行选择。从传代大肠杆菌中获得5个克隆,将其分别在LB培养基上以相同的方法大量培养,提取DNA。根据试剂盒所附的说明书,使用Nucleobond PC100试剂盒(Macherey-Nagel)从菌体中提取VZV-BAC-DNA。将所得的5个克隆和原来的VZV-BAC-DNA分别用限制酶BamHI消化。通过琼脂糖凝胶电泳确认限制酶谱(结果未示出)。将所有5个克隆与原本的VZV BAC质粒相比较。结果琼脂糖凝胶上给出相同的限制酶谱。由此说明大肠杆菌中的VZV质粒具有高度的稳定性。
这些图中,原本的VZV-BAC-DNA和经传代3次的VZV-BAC-DNA给出相同的电泳图谱。这标明大肠杆菌中VZV-BAC-DNA质粒是稳定的。
(6:由VZV-BAC-DNA产生病毒)
通过重组质粒PHA-2/VZV11-12和VZV病毒的同源重组在HEL细胞中制备BAC克隆化的VZV rV01(图1D)。具体说来,用1μg VZV-BAC-DNA通过Nucleofector unit(Amaxa)转染在75-cm2的塑料烧瓶中培养至铺满的HEL细胞。2天后传代培养在75-cm2的塑料瓶中对培养至铺满的HEL细胞。2~3天后观察到水痘-带状疱疹病毒产生的典型CPE。在荧光显微镜下确认观察到CPE的细胞中GFP基因的表达。由此确认了可以利用VZV-BAC-DNA生产重组水痘-的带状疱疹病毒。把所产生的重组水痘-带状疱疹病毒命名为rV01(图1D)。通过将环状的BAC克隆化的基因组导入大肠杆菌中制备了VZVBAC质粒。
(7:切出BAC载体序列)
能表达Cre重组酶的重组腺病毒(AxCANCre)是由Nagoya大学的Yasushi Kawaguchi先生惠赠(Kanegae等人,(1995)Nucleic AcidsRes 23:3816-21)。利用该重组腺病毒通过BAC克隆化的VZV rV01和重组腺病毒(AxCANCre)的重复感染,制备VZV rV02(图1E;L表示loxP位点)。具体说来,以MOI(感染复数)100对HEL细胞感染重组腺病毒。吸附病毒2小时后,用PBS(-)清洗细胞,然后用含5%FCS的DMEM培养基培养。在重组腺病毒感染24小时后,用重组水痘-带状疱疹病毒rV01重复感染HEL细胞。通过使用对照的实验确认了通过重组腺病毒表达Cre重组酶,可以有效地从rV01基因组中切出BAC载体序列。把所得的水痘病毒称为rV02(图1,E)。DNA测序结果确认rV02是从rV01切除了BAC载体序列后的产物。
用限制酶BamHI消化从水痘-带状疱疹病毒Oka株的感染细胞中提取的DNA和来自大肠杆菌的VZV-BAC-DNA。由于残留片段一侧的loxP序列的缘故,来自重组水痘-带状疱疹病毒rV02 DNA的片段比水痘-带状疱疹病毒Oka株的DNA大。
从VZV-BAC-DNA电泳图谱看,由于插入了BAC序列,与亲株相比约8.1kbp的BamHI片段消失了,同时增加了约7.8kbp和约9.2kbp的BamHI片段。从重组水痘病毒rV02感染细胞提取的DNA的约8.2kbp的BamHI片段,因BAC载体序列切出时残留片段一侧的loxP序列的缘故,与水痘病毒亲株的约8.1kbp的BamHI片段相比,大小稍稍增大。
(实施例2)
(具有重组水痘-带状疱疹病毒的特征)
(1:重组病毒增殖性的比较)
利用感染中心分析方法比较水痘-带状疱疹病毒Oka株和所获得的重组水痘-带状疱疹病毒rV02在HEL细胞中的增殖性(Gomi等人,(2002)J.Virol 76:11447-59)。以0.01PFU/细胞的MOI感染35mm平皿中的HEL细胞,接着洗涤所感染的细胞。培养感染HEL细胞的水痘-带状疱疹病毒Oka株和rV02株0到5天,用胰蛋白酶收集后,用以感染新的HEL细胞,从而比较其增殖性。将感染细胞的数量规范为初期病毒滴度/皿。倍数增加,表示第0天从1个感染细胞开始传播的感染细胞数。结果如图2所示。由图2可知所获得的重组水痘病毒rV02在体外与水痘病毒Oka株(亲株)表现同等的增殖能力。
(实施例3)
(弱致病性的变异型重组水痘-带状疱疹病毒的制备)
根据本发明,利用以下的方法可以制备变异型重组水痘-带状疱疹病毒,并且从变异病毒中获得致病性减弱的变异水痘-带状疱疹病毒株。
(1:变异型重组水痘-带状疱疹病毒的制备)
作为变异型重组水痘-带状疱疹病毒的制备方法,例如可以列举通过在包含变异基因的核酸与VZV-BAC-DNA质粒之间引起同源重组,制备变异型重组水痘-带状疱疹病毒的方法。为了与VZV-BAC-DNA质粒进行同源重组的变异基因既可以具有随机变异,也可以具有位点特异性变异。分别使用这些基因可以获得具有随机变异的变异型重组水痘-带状疱疹病毒集团和具有位点特异性变异的变异型重组水痘-带状疱疹病毒集团。下面分别进行详细描述。
(1.1:具有随机变异的重组水痘-带状疱疹病毒的制备)
已知有几种在水痘-带状疱疹病毒基因组的基因62中包含变异的病毒是减毒病毒。由此,在本实施例中通过PCR向基因62中导入随机变异。利用PCR导入变异的方法是已知的,例如,在四种核苷酸中的一种数量较少的条件下利用没有校正功能的耐热性聚合酶可以导入随机变异。必要时,基因62中可以与耐药基因之类的标记基因相连。
按照实施例1(4:重组病毒的富集并将其导入大肠杆菌)将这样制备的变异型基因62与VZV-BAC-DNA质粒一起导入到大肠杆菌中。并且在变异型基因62与VZV-BAC-DNA之间发生同源重组。然后根据实施例1中所述的方法分离产生同源重组的水痘-带状疱疹病毒DNA,将其导入至大肠杆菌中获得引起同源重组的VZV-BAC-DNA。
所获得的多个大肠杆菌包含VZV-BAC-DNA,所述的VZV-BAC-DNA包含携带不同变异的基因62。通过下述的(2:检测水痘-带状疱疹病毒致病性的方法)对各大肠杆菌中所含的变异型VZV-BAC-DNA所产生的水痘-带状疱疹病毒的致病性程度进行筛选。
(1.2:具有位点特异性变异的变异型重组水痘-带状疱疹病毒的制备)
导入所需的位点特异性变异的方法也是本领域已知的。例如,用含有所需变异的引物通过PCR制备包含所需变异的基因片段,然后用该变异的基因片段,通过进一步的PCR,或通过限制酶等的酶处理,制备具有所需变异的全长基因。
有关这样制备的变异基因,按照上述(1.1.)中所述的顺序,制备具有位点特异性变异的变异型重组水痘-带状疱疹病毒。
(2:水痘-带状疱疹病毒致病性的试验方法)
对于试验水痘-带状疱疹病毒致病性的方法,已确立了两种方法。
作为使用动物模型的方法,制备移植了人皮肤的重症复合免疫缺陷(SCID)小鼠,使其感染水痘-带状疱疹病毒,来评价其致病性的方法是已知的(J.Viro1.1998 Feb;72(2):965-74)。
与此相反,对于在试管内进行致病性评价的方法,在由孔径3μm的通孔(trans-well)分隔开的双层孔的下层接入单层培养的人黑素瘤细胞,在上层接入经水痘-带状疱疹病毒感染的脐带血单核细胞(CBMC)培养7-8后观察黑素瘤细胞的CPE程度(细胞变性效果)的方法(J.Virol.2000 Feb;74(4):1864-70)也是公知的。
尽管不是直接确定致病性的方法,根据本发明人等目前所得到的结果(J.Virol.2002 Nov;76(22):11447-59),可以理解病毒的致病性与增殖性密切相关,通过感染中心试验研究细胞-对细胞的增殖性可以间接地评价致病性。
(实施例4)
(疫苗的制备)
将由实施例1获得的重组水痘-带状疱疹病毒接种到20个培养面积210cm2的Roux瓶中后进行培养。培养完毕后,弃去培养液,各Roux瓶内的感染细胞用200ml的PBS(-)洗2次。接下来,将20ml0.03%(w/v)EDTA-3Na叠覆于各Roux瓶的感染细胞,以使细胞从Roux瓶内壁剥离并悬浮。汇集各瓶中的感染细胞悬液,在4℃以2,000rpm离心10min收集感染细胞颗粒。再用100ml的PBS(-)重悬细胞后,冻结融解一次。然后在冰水浴中超声波处理(20KHz,150mA,0.3sec/ml)后,再在4℃下以3,000rpm离心20min。采集含有由细胞释放病毒的悬液作为活疫苗原液。从该原液中取30ml作为鉴定用样品,向剩余的70ml原液中添加混合作为疫苗稳定剂的、溶解于PBS(-)蔗糖和明胶水解物并使其最终浓度分别达到50%(w/v)和2.5%(w/v),制备140ml最终散装的活疫苗。从所述最终散装中取30ml作为鉴定用样品,将所余的以每瓶0.5ml的量分注到3ml容量的小瓶中,0.5ml每瓶,冷冻干燥后填充氮气,用胶塞将小瓶气密闭。将所得的活疫苗分装品在4℃下保存,使用前添加0.5ml注射用蒸馏水使干燥的内容物完全溶解使用。另一方面,作为样品的上述疫苗原液和最终散装品,以及20个小分装品用于鉴定试验。通过所述鉴定试验确认安全性、有效性和均质性。作为活疫苗的合格性根据Guidelines for BiologicalFormulations defined under Notice No.195 of Ministry of Health andWelfare(1989),以及“重组沉淀乙型肝炎疫苗(来自酵母)”实施。根据试验结果,上述小分装品品病毒含量为2×104PFU(噬菌斑形成单位)/0.5ml,并且当上述标准的各种试验合格时,提供具备合格性的活疫苗以备今后使用。
实施例5
(重组水痘-带状疱疹病毒疫苗免疫原性的确定)
利用豚鼠测定实施例4中制备的重组水痘-带状疱疹病毒疫苗的免疫原性。用Oka株活疫苗作为对照。将这些疫苗分别皮下接种到3只3周龄、平均体重250克的豚鼠上。疫苗接种,用PBS(-)稀释调整各疫苗并使得重组株和Oka活疫苗的接种量达到3,000PFU/豚鼠或2,000PFU/豚鼠。接种后第4、6、8周,从各接种豚鼠的大腿部静脉采血,测定其血液中抗体的效价。对于抗体效价的测定采用中和法(Journal of General Virology,61,255-269,1982)。确认重组水痘-带状疱疹病毒疫苗与Oka株同等程度地诱导抗-VZV抗体。根据这些结果选择免疫原性良好的重组水痘-带状疱疹病毒疫苗。
以上通过本发明的优选实施方案对本发明进行了示例性的说明,但应当理解只有权利要求范围可以用来解释本发明的范围。应该理解,本说明书中所引用的专利、专利申请和文献,其内容本身如本说明书说明的一样,其内容是作为对本说明书的参考而引用的。
产业上的实用性
本发明提供利用BAC(大肠杆菌人工染色体)由单一的病毒株生产重组水痘-带状疱疹病毒的方法,以及由该方法生产的重组水痘-带状疱疹病毒。此外,本发明还提供包含重组水痘-带状疱疹病毒的药物组合物。
此外,本发明还提供包含水痘-带状疱疹病毒基因组基因和BAC载体序列的载体,包含所述载体的细胞,以及包含能与水痘-带状疱疹病毒基因组进行同源重组的片段以及BAC载体序列的核酸盒。
(序列表文本)
SEQ ID NO.:1,VZ11F 引物
SEQ ID NO.:2,VZ11R 引物
SEQ ID NO.:3,VZ12F 引物
SEQ ID NO.:4,VZ12R 引物
SEQ ID NO.:5,基因62的序列
SEQ ID NO.:6,基因62的序列
SEQ ID NO.:7,质粒PHA-2的序列
SEQ ID NO.:8,水痘-带状疱疹病毒Dumas株
SEQ ID NO.:9,SEQ ID NO.:8的1134到1850位中以5’→3’方向编码的氨基酸序列(基因2)
SEQ ID NO.:10,SEQ ID NO.:8的8607到9386位以5’→3’方向编码的氨基酸序列(基因7)
SEQ ID NO.:11,SEQ ID NO.:8的10642到10902位以5’→3’方向编码的氨基酸序列(基因9A)
SEQ ID NO.:12,SEQ ID NO.:8的11009到11917位以5’→3’方向编码的氨基酸序列(基因9)
SEQ ID NO.:13,SEQ ID NO.:8的12160到13392位以5’→3’方向编码的氨基酸序列(基因10)
SEQ ID NO.:14,SEQ ID NO.:8的13590到16049位以5’→3’方向编码的氨基酸序列(基因11)
SEQ ID NO.:15,SEQ ID NO.:8的16214到18199位以5’→3’方向编码的氨基酸序列(基因12)
SEQ ID NO.:16,SEQ ID NO.:8的18441到19346位以5’→3’方向编码的氨基酸序列(基因13)
SEQ ID NO.:17,SEQ ID NO.:8的24149到25516位以5’→3’方向编码的氨基酸序列(基因17)
SEQ ID NO.:18,SEQ ID NO.:8的30759到33875位以5’→3’方向编码的氨基酸序列(基因21)
SEQ ID NO.:19,SEQ ID NO.:8的34083到42374位以5’→3’方向编码的氨基酸序列(基因22)
SEQ ID NO.:20,SEQ ID NO.:8的44506到46263位以5’→3’方向编码的氨基酸序列(基因26)
SEQ ID NO.:21,SEQ ID NO.:8的50857到54471位以5’→3’方向编码的氨基酸序列(基因29)
SEQ ID NO.:22,SEQ ID NO.:8的54651到56963位以5’→3’方向编码的氨基酸序列(基因30)
SEQ ID NO.:23,SEQ ID NO.:8的57008到59614位以5’→3’方向编码的氨基酸序列(基因31)
SEQ ID NO.:24,SEQ ID NO.:8的59766到60197位以5’→3’方向编码的氨基酸序列(基因32)
SEQ ID NO.:25,SEQ ID NO.:8的64807到65832位以5’→3’方向编码的氨基酸序列(基因36)
SEQ ID NO.:26,SEQ ID NO.:8的66074到68599位以5’→3’方向编码的氨基酸序列(基因37)
SEQ ID NO.:27,SEQ ID NO.:8的70633到71355位以5’→3’方向编码的氨基酸序列(基因39)
SEQ ID NO.:28,SEQ ID NO.:8的71540到75730位以5’→3’方向编码的氨基酸序列(基因40)
SEQ ID NO.:29,SEQ ID NO.:8的75847到76797位以5’→3’方向编码的氨基酸序列(基因41)
SEQ ID NO.:30,SEQ ID NO.:8的78170到80200位以5’→3’方向编码的氨基酸序列(基因43)
SEQ ID NO.:31,SEQ ID NO.:8的80360到81451位以5’→3’方向编码的氨基酸序列(基因44)
SEQ ID NO.:32,SEQ ID NO.:8的82719到83318位以5’→3’方向编码的氨基酸序列(基因46)
SEQ ID NO.:33,SEQ ID NO.:8的84667到86322位以5’→3’方向编码的氨基酸序列(基因48)
SEQ ID NO.:34,SEQ ID NO.:8的87881到90388位以5’→3’方向编码的氨基酸序列(基因51)
SEQ ID NO.:35,SEQ ID NO.:8的90493到92808位以5’→3’方向编码的氨基酸序列(基因52)
SEQ ID NO.:36,SEQ ID NO.:8的95996到98641位以5’→3’方向编码的氨基酸序列(基因55)
SEQ ID NO.:37,SEQ ID NO.:8的110581到111417位以5’→3’方向编码的氨基酸序列(基因63)
SEQ ID NO.:38,SEQ ID NO.:8的111565到112107位以5’→3’方向编码的氨基酸序列(基因64)
SEQ ID NO.:39,SEQ ID NO.:8的113037到114218位以5’→3’方向编码的氨基酸序列(基因66)
SEQ ID NO.:40,SEQ ID NO.:8的114496到115560位以5’→3’方向编码的氨基酸序列(基因67)
SEQ ID NO.:41,SEQ ID NO.:8的115808到117679位以5’→3’方向编码的氨基酸序列(基因68)
SEQ ID NO.:42,SEQ ID NO.:8的120764到124696位以5’→3’方向编码的氨基酸序列(基因71)
SEQ ID NO.:43SEQ ID No.:8的部分序列(基因27)
SEQ ID NO.:44,SEQ ID NO.:43的1到999位以5’→3’方向编码的氨基酸序列(基因27)
SEQ ID NO.:45,SEQ ID No.:8的部分序列(基因47)
SEQ ID NO.:46,SEQ ID NO.:45的1到1530位以5’→3’方向编码的氨基酸序列(基因47)
SEQ ID NO.:47,SEQ ID No.:8的部分序列
SEQ ID NO.:48,SEQ ID NO.:47的1到243以5’→3’方向编码的氨基酸序列(基因49)
SEQ ID NO.:49,SEQ ID No.:8的部分序列
SEQ ID NO.:50,SEQ ID NO.:49的1到732位以5’→3’方向编码的氨基酸序列(基因56)
SEQ ID NO.:51,SEQ ID No.:8的互补序列
SEQ ID NO.:52,SEQ ID NO.:8的118480到119316位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的5569到6405位)(基因70)
SEQ ID NO.:53,SEQ ID NO.:8的117790到118332位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的6553到7095位)
(基因69)
SEQ ID NO.:54,SEQ ID NO.:8的112332到112640位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的12245到12553位)
(基因65)
SEQ ID NO.:55,SEQ ID NO.:8的105201到109133位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的15752到19684位)
(基因62)
SEQ ID NO.:56,SEQ ID NO.:8的103082到104485位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的20400到21803位)
(基因61)
SEQ ID NO.:57,SEQ ID NO.:8的100302到101219位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的23666到24583位)
(基因59)
SEQ ID NO.:58,SEQ ID NO.:8的99411到99626位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的25259到25474位)
(基因57)
SEQ ID NO.:59,SEQ ID NO.:8的92855到93850位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的31035到32030位)
(基因53)
SEQ ID NO.:60,SEQ ID NO.:8的68668到70293位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的54592到56217位)
(基因38)
SEQ ID NO.:61,SEQ ID NO.:8的63977到64753位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的60132到60908位)
(基因35)
SEQ ID NO.:62,SEQ ID NO.:8的62171到63910位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的60975到62714位)
(基因34)
SEQ ID NO.:63,SEQ ID NO.:8的60321到62138位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的62747到64564位)
(基因33)
SEQ ID NO.:64,SEQ ID NO.:8的47052到50636位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的74249到77833位)
(基因28)
SEQ ID NO.:65,SEQ ID NO.:8的44148到44618位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的80267到80737位)
(基因25)
SEQ ID NO.:66,SEQ ID NO.:8的43212到44021位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的80864到81673位)
(基因24)
SEQ ID NO.:67,SEQ ID NO.:8的42431到43138位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的81747到82454位)
(基因23)
SEQ ID NO.:68,SEQ ID NO.:8的29024到30475位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的94410到95861位)
(基因20)
SEQ ID NO.:69,SEQ ID NO.:8的26518到28845位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的96040到98367位)
(基因19)
SEQ ID NO.:70,SEQ ID NO.:8的25573到26493位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的98392到99312位)
(基因18)
SEQ ID NO.:71,SEQ ID NO.:8的22568到23794位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的101091到102317位)(基因16)
SEQ ID NO.:72,SEQ ID NO.:8的21258到22478位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的102407到103627位)(基因15)
SEQ ID NO.:73,SEQ ID NO.:8的19431到21113位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的103772到105454位)(基因14)
SEQ ID NO.:74,SEQ ID NO.:8的9477到10667位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的114218到115408位)
(基因8)
SEQ ID NO.:75,SEQ ID NO.:8的5326到8577位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的116308到119559位)
(基因6)
SEQ ID NO.:76,SEQ ID NO.:8的4252到5274位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的119611到120633位)
(基因5)
SEQ ID NO.:77,SEQ ID NO.:8的2783到4141位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的120744到122102位)
(基因4)
SEQ ID NO.:78,SEQ ID NO.:8的1908到2447位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的122438到122977位)
(基因3)
SEQ ID NO.:79,SEQ ID NO.:8的589到915位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的123970到124296位)
(基因1)
SEQ ID NO.:80,SEQ ID No.:51的部分序列
SEQ ID NO.:81,SEQ ID NO.:80的1到1056和4556到5740位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的46847到48034和42292到43347位)(基因42和基因45)
SEQ ID NO.:82,SEQ ID No.:51的部分序列
SEQ ID NO.:83,SEQ ID NO.:82的1到1305位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的123580到124884位)
(基因50)
SEQ ID NO..84,SEQ ID No.:51的部分序列
SEQ ID NO.:85,SEQ ID NO.:84的1到2307位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的122578到124884位)
(基因54)
SEQ ID NO.:86,SEQ ID No.:51的部分序列
SEQ ID NO.:87,SEQ ID NO.:86的1到663位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的124222到124884位)(基因58)
SEQ ID NO.:88,SEQ ID No.:51的部分序列
SEQ ID NO.:89,SEQ ID NO.:88的1到427位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的124458到124884位)(基因60)
SEQ ID NO.:90,SEQ ID No.:51的部分序列
SEQ ID NO.:91,SEQ ID NO.:90的1到903位以3’→5’方向编码的氨基酸序列(相应于SEQ ID No.:51的60321到61229位)(基因33.5)
序列表
<110>The Research Foundation for Microbial Diseases of OsakaUniversity
<120>重组水痘-带状疱疹病毒
<130>F5-05PCT032 OBK004PCT
<140>PCT/JP2005/003652
<141>2005-03-05
<150>JP 2004-063277
<151>2004-03-05
<160>91
<170>PatentIn version 3.2
<210>1
<211>35
<212>DNA
<213>人工的
<220>
<223>引物
<400>1
tataactagt gcggccgctt acgaaaacgt gcatg 35
<210>2
<211>34
<212>DNA
<213>人工的
<220>
<223>引物
<400>2
cgcgacctgg tttattttac aaactccttt gtgg 34
<210>3
<211>34
<212>DNA
<213>人工的
<220>
<223>引物
<400>3
gcgcaccagg tctctgttta gaccttaaaa tttg 34
<210>4
<211>34
<212>DNA
<213>人工的
<220>
<223>引物
<400>4
tatagcggcc gcttttaatc tggttgtgga aatg 34
<210>5
<211>4226
<212>DNA
<213>水痘带状疱疹(Varicella zoster)
<220>
<221>CDS
<222>(229)..(4158)
<400>5
atactatggt ccatgaactt cccgcctcga gtctcgtcca atcactacat cgtcttatca 60
ttaagaatat ttacacggtg acgacacggg gaggaaatat gcggtcgagg ggggggcaca 120
acacgtttta agtactgttg gaactccctc accaaccgca aycgcaatcc tttgaaggct 180
gcgagagcgt ttggaaaact cgggtacgtc taaattcacc ccagygcg atg gat acg 237
Met Asp Thr
1
ccg ccg atg cag cgc tct aca ccc caa cgc gcg ggg tcg cct gat act 285
Pro Pro Met Gln Arg Ser Thr Pro Gln Arg Ala Gly Ser Pro Asp Thr
5 10 15
ttg gag tta atg gac ctg ttg gac gcg gcc gcc gcg gcc gcc gaa cac 333
Leu Glu Leu Met Asp Leu Leu Asp Ala Ala Ala Ala Ala Ala Glu His
20 25 30 35
agg gcc cgg gtg gtc acc tcg agt cag cct gac gat cta cta ttt gga 381
Arg Ala Arg Val Val Thr Ser Ser Gln Pro Asp Asp Leu Leu Phe Gly
40 45 50
gag aac ggg gtc atg gtg gga cgg gaa cat gag atc gtt tca att ccc 429
Glu Asn Gly Val Met Val Gly Arg Glu His Glu Ile Val Ser Ile Pro
55 60 65
tcc gta tcg gga ctt caa cca gaa ccc aga acg gaa gat gtt ggc gaa 477
Ser Val Ser Gly Leu Gln Pro Glu Pro Arg Thr Glu Asp Val Gly Glu
70 75 80
gag cta aca caa gac gac tac gta tgc gag gac ggt cag gat cta ayg 525
Glu Leu Thr Gln Asp Asp Tyr Val Cys Glu Asp Gly Gln Asp Leu Xaa
85 90 95
ggc tcg cct gta atc ccg ctg gcc gag gtc ttc cac acc cga ttc tcg 573
Gly Ser Pro Val Ile Pro Leu Ala Glu Val Phe His Thr Arg Phe Ser
100 105 110 115
gag gcc ggc gcg cga gaa cca aca gga gcc gat cgc tcc ctc gag aca 621
Glu Ala Gly Ala Arg Glu Pro Thr Gly Ala Asp Arg Ser Leu Glu Thr
120 125 130
gtc tct ctc gga acg aag ctt gct agg tct cca aaa cca ccg atg aac 669
Val Ser Leu Gly Thr Lys Leu Ala Arg Ser Pro Lys Pro Pro Met Asn
135 140 145
gat ggg gaa acg ggc aga ggt acg acc cct ccg ttc ccg cag gcc ttc 717
Asp Gly Glu Thr Gly Arg Gly Thr Thr Pro Pro Phe Pro Gln Ala Phe
150 155 160
tcc cct gta tcc ccc gcg tct cct gtt gga gac gcc gcc ggg aac gat 765
Ser Pro Val Ser Pro Ala Ser Pro Val Gly Asp Ala Ala Gly Asn Asp
165 170 175
caa cgg gaa gac cag cgg tct ata ccc cga caa acg acg aga gga aat 813
Gln Arg Glu Asp Gln Arg Ser Ile Pro Arg Gln Thr Thr Arg Gly Asn
180 185 190 195
tca cca ggt ttg ccg tcg gtg gtc cat cga gac aga caa act cag tcc 861
Ser Pro Gly Leu Pro Ser Val Val His Arg Asp Arg Gln Thr Gln Ser
200 205 210
atc tcg ggt aaa aag ccg ggc gat gag caa gcg ggt cat gcg cat gca 909
Ile Ser Gly Lys Lys Pro Gly Asp Glu Gln Ala Gly His Ala His Ala
215 220 225
tcg ggg gac gga gta gtt ctc cag aaa act caa cgg ccc gct cag gga 957
Ser Gly Asp Gly Val Val Leu Gln Lys Thr Gln Arg Pro Ala Gln Gly
230 235 240
aag agc ccg aag aaa aag act ttg aag gtt aag gtc cca ctc ccg gcg 1005
Lys Ser Pro Lys Lys Lys Thr Leu Lys Val Lys Val Pro Leu Pro Ala
245 250 255
cgg aaa ccc ggt gga cct gta ccc ggc ccg gtt gag caa ttg tac cac 1053
Arg Lys Pro Gly Gly Pro Val Pro Gly Pro Val Glu Gln Leu Tyr His
260 265 270 275
gtc ctt tcg gac agc gtt ccc gct aag ggg gca aag gcg gac ctg ccg 1101
Val Leu Ser Asp Ser Val Pro Ala Lys Gly Ala Lys Ala Asp Leu Pro
280 285 290
ttt gag acc gat gat acc cgc cca agg aaa cat gat gcc cgg ggt ata 1149
Phe Glu Thr Asp Asp Thr Arg Pro Arg Lys His Asp Ala Arg Gly Ile
295 300 305
aca cct cgc gtc cct gga cgt tcg tcg ggg ggc aaa cct aga gcg ttt 1197
Thr Pro Arg Val Pro Gly Arg Ser Ser Gly Gly Lys Pro Arg Ala Phe
310 315 320
ttg gcc ctg ccg gga aga tcc cac gca cca gac ccg att gag gat gac 1245
Leu Ala Leu Pro Gly Arg Ser His Ala Pro Asp Pro Ile Glu Asp Asp
325 330 335
agc ccg gtg gag aaa aag cca aag agt cgt gag ttt gtt tcg tct tca 1293
Ser Pro Val Glu Lys Lys Pro Lys Ser Arg Glu Phe Val Ser Ser Ser
340 345 350 355
tcc tct tcc tcg tcg tgg gga tcg tca tcg gag gat gaa gac gat gaa 1341
Ser Ser Ser Ser Ser Trp Gly Ser Ser Ser Glu Asp Glu Asp Asp Glu
360 365 370
ccc cgg cgc gtt tcg gtg gga agt gaa act aca ggc agc agg tcc gga 1389
Pro Arg Arg Val Ser Val Gly Ser Glu Thr Thr Gly Ser Arg Ser Gly
375 380 385
cgc gaa cac gcc cct tcc ccg tca aat tcg gat gat tcg gac tca aat 1437
Arg Glu His Ala Pro Ser Pro Ser Asn Ser Asp Asp Ser Asp Ser Asn
390 395 400
gat ggt ggg tcg acg aaa caa aat atc caa ccg gga tat cga tcc atc 1485
Asp Gly Gly Ser Thr Lys Gln Asn Ile Gln Pro Gly Tyr Arg Ser Ile
405 410 415
agc ggt ccc gat ccg agg att cgt aag acc aaa cgt ctt gcg ggg gaa 1533
Ser Gly Pro Asp Pro Arg Ile Arg Lys Thr Lys Arg Leu Ala Gly Glu
420 425 430 435
ccg ggg cgc cag aga cag aaa tca ttt tcc ctg ccg cga tcc aga acc 1581
Pro Gly Arg Gln Arg Gln Lys Ser Phe Ser Leu Pro Arg Ser Arg Thr
440 445 450
ccg ata att ccc ccg gtg tcg ggg ccg ctc atg atg ccc gac gga agc 1629
Pro Ile Ile Pro Pro Val Ser Gly Pro Leu Met Met Pro Asp Gly Ser
455 460 465
cct tgg ccc gga tcg gcg ccc ctc cca tcc aac agg gtg cgg ttt gga 1677
Pro Trp Pro Gly Ser Ala Pro Leu Pro Ser Asn Arg Val Arg Phe Gly
470 475 480
ccg tcc ggg gag acc aga gag ggt cac tgg gag gat gag gct gtg aga 1725
Pro Ser Gly Glu Thr Arg Glu Gly His Trp Glu Asp Glu Ala Val Arg
485 490 495
gcg gcg cgg gct cgt tac gag gcc tca act gaa ccc gyg ccg ctt tac 1773
Ala Ala Arg Ala Arg Tyr Glu Ala Ser Thr Glu Pro Xaa Pro Leu Tyr
500 505 510 515
gtg ccg gag ttg gga gat ccg gct aga cag tac cgc gcg ctg att aac 1821
Val Pro Glu Leu Gly Asp Pro Ala Arg Gln Tyr Arg Ala Leu Ile Asn
520 525 530
ctg atc tac tgt cca gac aga gac cct ata gca tgg ctc cag aac ccc 1869
Leu Ile Tyr Cys Pro Asp Arg Asp Pro Ile Ala Trp Leu Gln Asn Pro
535 540 545
aag ctg acc ggt gtc aac tcg gcc ctg aac cag ttc tac caa aag ctg 1917
Lys Leu Thr Gly Val Asn Ser Ala Leu Asn Gln Phe Tyr Gln Lys Leu
550 555 560
ttg cca ccg gga cgg gcg ggt acc gcc gtt acg ggg agc gta gcg tct 1965
Leu Pro Pro Gly Arg Ala Gly Thr Ala Val Thr Gly Ser Val Ala Ser
565 570 575
ccc gtt ccg cat gta ggc gaa gcc atg gcc acg ggg gag gcc ctc tgg 2013
Pro Val Pro His Val Gly Glu Ala Met Ala Thr Gly Glu Ala Leu Trp
580 585 590 595
gct ctc ccc cac gcg gcc gcg gcc gtg gct atg agc cgt cgg tac gac 2061
Ala Leu Pro His Ala Ala Ala Ala Val Ala Met Ser Arg Arg Tyr Asp
600 605 610
cgg gcc caa aaa cac ttt atc cta cag agt ctc cgc aga gcc ttt gcc 2109
Arg Ala Gln Lys His Phe Ile Leu Gln Ser Leu Arg Arg Ala Phe Ala
615 620 625
ggc atg gca tac ccc gag gca acg ggc tcc agt ccg gcg gcg cgg atc 2157
Gly Met Ala Tyr Pro Glu Ala Thr Gly Ser Ser Pro Ala Ala Arg Ile
630 635 640
tcc cgc ggt cac cct tct cca aca acc ccg gcc aca cag act ccc gac 2205
Ser Arg Gly His Pro Ser Pro Thr Thr Pro Ala Thr Gln Thr Pro Asp
645 650 655
cct cag ccg tcg gcc gcc gcg cgc tct ctt tct gtg tgt cca ccg gat 2253
Pro Gln Pro Ser Ala Ala Ala Arg Ser Leu Ser Val Cys Pro Pro Asp
660 665 670 675
gat cgt tta cga act ccg cgc aag cgc aag tcc cag cca gtc gag agc 2301
Asp Arg Leu Arg Thr Pro Arg Lys Arg Lys Ser Gln Pro Val Glu Ser
680 685 690
aga agc ctc ctc gac aag att agg gag aca ccc gtc gcg gac gcc cgg 2349
Arg Ser Leu Leu Asp Lys Ile Arg Glu Thr Pro Val Ala Asp Ala Arg
695 700 705
gtt gca gac gat cat gtg gtt tcc aag gcc aag agg cgg gta tcc gag 2397
Val Ala Asp Asp His Val Val Ser Lys Ala Lys Arg Arg Val Ser Glu
710 715 720
ccc gtg acc atc acc tcg ggc cct gtg gtg gat ccc ccc gcc gta ata 2445
Pro Val Thr Ile Thr Ser Gly Pro Val Val Asp Pro Pro Ala Val Ile
725 730 735
acg atg cca ctt gac gga ccg gcc cca aac ggg gga ttt cgg cgt att 2493
Thr Met Pro Leu Asp Gly Pro Ala Pro Asn Gly Gly Phe Arg Arg Ile
740 745 750 755
ccc cgg ggg gcc ctg cat acc ccg gtc ccg tcg gac cag gct cgc aag 2541
Pro Arg Gly Ala Leu His Thr Pro Val Pro Ser Asp Gln Ala Arg Lys
760 765 770
gcg tac tgt acc ccc gaa acc atc gcc cgt ctg gtc gac gac cca ttg 2589
Ala Tyr Cys Thr Pro Glu Thr Ile Ala Arg Leu Val Asp Asp Pro Leu
775 780 785
ttt ccc acg gcc tgg cgc cct gcg cta agc ttt gat ccc ggc gcc ttg 2637
Phe Pro Thr Ala Trp Arg Pro Ala Leu Ser Phe Asp Pro Gly Ala Leu
790 795 800
gcg gaa atc gcc gct cgg cgt ccg ggc gga gga gac cga cgg ttt ggt 2685
Ala Glu Ile Ala Ala Arg Arg Pro Gly Gly Gly Asp Arg Arg Phe Gly
805 810 815
cca ccc agc gga gtg gag gcg ctg cga cgg agg tgc gcc tgg atg cgg 2733
Pro Pro Ser Gly Val Glu Ala Leu Arg Arg Arg Cys Ala Trp Met Arg
820 825 830 835
cag atc cca gac ccg gag gat gtg agg ctt ctg atc atc tac gat ccg 2781
Gln Ile Pro Asp Pro Glu Asp Val Arg Leu Leu Ile Ile Tyr Asp Pro
840 845 850
ttg ccc gga gag gac atc aac ggc ccc ctc gag agc acc ctc gcg aca 2829
Leu Pro Gly Glu Asp Ile Asn Gly Pro Leu Glu Ser Thr Leu Ala Thr
855 860 865
gat ccg gga ccg tca tgg agt cca tcc cga ggg gga ctg tct gtg gtc 2877
Asp Pro Gly Pro Ser Trp Ser Pro Ser Arg Gly Gly Leu Ser Val Val
870 875 880
ctg gca gcc ctg agt aac cgg ttg tgc ctg ccg agc act cat gcc tgg 2925
Leu Ala Ala Leu Ser Asn Arg Leu Cys Leu Pro Ser Thr His Ala Trp
885 890 895
gcc ggg aac tgg acc ggc ccg ccg gac gtg tcc gct ttg aac gcc cgg 2973
Ala Gly Asn Trp Thr Gly Pro Pro Asp Val Ser Ala Leu Asn Ala Arg
900 905 910 915
ggc gtt tta tta ctg tcg acc cga gac ctg gcc ttt gcc ggg gcc gtc 3021
Gly Val Leu Leu Leu Ser Thr Arg Asp Leu Ala Phe Ala Gly Ala Val
920 925 930
gag tat cta ggc tcg cgg ttg gcc tct gcc cgg cgc cgg ttg ctg gtg 3069
Glu Tyr Leu Gly Ser Arg Leu Ala Ser Ala Arg Arg Arg Leu Leu Val
935 940 945
ttg gac gcg gtg gcc ctc gag agg tgg ccc ggg gat gga ccc gct ttg 3117
Leu Asp Ala Val Ala Leu Glu Arg Trp Pro Gly Asp Gly Pro Ala Leu
950 955 960
tct cag tat cac gtg tac gtc cgg gcc ccg gcg cga ccg gac gcc cag 3165
Ser Gln Tyr His Val Tyr Val Arg Ala Pro Ala Arg Pro Asp Ala Gln
965 970 975
gcc gtc gtc cga tgg cca gac tcg gcg gtc aca gaa gga ctc gcc cgg 3213
Ala Val Val Arg Trp Pro Asp Ser Ala Val Thr Glu Gly Leu Ala Arg
980 985 990 995
gcc gtg ttt gca tcg tcg cgc acc ttt ggg cca gcg agt ttt gct 3258
Ala Val Phe Ala Ser Ser Arg Thr Phe Gly Pro Ala Ser Phe Ala
1000 1005 1010
cgt atc gag act gcg ttt gcc aac ctg tac ccg ggc gaa caa ccc 3303
Arg Ile Glu Thr Ala Phe Ala Asn Leu Tyr Pro Gly Glu Gln Pro
1015 1020 1025
ctg tgt ttg tgc cgc ggt ggg aac gtc gca tac acc gtg tgt acc 3348
Leu Cys Leu Cys Arg Gly Gly Asn Val Ala Tyr Thr Val Cys Thr
1030 1035 1040
cgc gcg ggc ccc aag acc cgc gtc ccc ctg tcg ccc cgt gaa tac 3393
Arg Ala Gly Pro Lys Thr Arg Val Pro Leu Ser Pro Arg Glu Tyr
1045 1050 1055
cgg cag tac gtg ctg ccg ggt ttt gac ggt tgc aag gac ctc gcg 3438
Arg Gln Tyr Val Leu Pro Gly Phe Asp Gly Cys Lys Asp Leu Ala
1060 1065 1070
cga cag tct cgg ggt ctg ggg ctc ggg gca gcc gac ttt gtg gac 3483
Arg Gln Ser Arg Gly Leu Gly Leu Gly Ala Ala Asp Phe Val Asp
1075 1080 1085
gag gcg gca cat agc cac cgc gca gca aac cga tgg ggc ctg ggt 3528
Glu Ala Ala His Ser His Arg Ala Ala Asn Arg Trp Gly Leu Gly
1090 1095 1100
gcc gcg ctt cga ccc gtc ttc ctt ccc gag gga cgg aga ccg ggg 3573
Ala Ala Leu Arg Pro Val Phe Leu Pro Glu Gly Arg Arg Pro Gly
1105 1110 1115
gcc gcc ggg ccg gag gcc ggc gac gta ccc acc tgg gcg agg gtg 3618
Ala Ala Gly Pro Glu Ala Gly Asp Val Pro Thr Trp Ala Arg Val
1120 1125 1130
ttt tgc cgc cac gcc ctg ctg gaa ccc gac cct gcc gcg gaa cca 3663
Phe Cys Arg His Ala Leu Leu Glu Pro Asp Pro Ala Ala Glu Pro
1135 1140 1145
ctc gtg ctt cca ccc gtg gcc ggt cgg tcg gtg gcg ctg tat gcg 3708
Leu Val Leu Pro Pro Val Ala Gly Arg Ser Val Ala Leu Tyr Ala
1150 1155 1160
tcg gcg gac gag gct cgg aat gcc ctc ccc ccg att ccc aga gta 3753
Ser Ala Asp Glu Ala Arg Asn Ala Leu Pro Pro Ile Pro Arg Val
1165 1170 1175
atg tgg ccg ccc ggt ttt ggg gcc gcg gag acg gtg ttg gag ggg 3798
Met Trp Pro Pro Gly Phe Gly Ala Ala Glu Thr Val Leu Glu Gly
1180 1185 1190
agc gac gga aca cgg ttc gcg ttc gga cac cac ggg ggc tcg gaa 3843
Ser Asp Gly Thr Arg Phe Ala Phe Gly His His Gly Gly Ser Glu
1195 1200 1205
cgg ccg gca gaa acc cag gcg ggg cga cag cgg cgc acc gca gac 3888
Arg Pro Ala Glu Thr Gln Ala Gly Arg Gln Arg Arg Thr Ala Asp
1210 1215 1220
gac aga gaa cac gct ttg gag ccg gac gat tgg gag gtg ggg tgt 3933
Asp Arg Glu His Ala Leu Glu Pro Asp Asp Trp Glu Val Gly Cys
1225 1230 1235
gaa gac gcg tgg gac agc gag gag ggg ggc ggg gac gac ggg gac 3978
Glu Asp Ala Trp Asp Ser Glu Glu Gly Gly Gly Asp Asp Gly Asp
1240 1245 1250
gca ccg ggg tca tcc ttt ggg gtg agc gtc gtg tcg gtg gcc ccg 4023
Ala Pro Gly Ser Ser Phe Gly Val Ser Val Val Ser Val Ala Pro
1255 1260 1265
ggt gtg ctg cga gac cgc cgg gtg ggc tyg cgc ccg gcg gtc aag 4068
Gly Val Leu Arg Asp Arg Arg Val Gly Xaa Arg Pro Ala Val Lys
1270 1275 1280
gtg gag ctg ttg tcc tcg tcc tcg tcc tcc gag gac gag gac gat 4113
Val Glu Leu Leu Ser Ser Ser Ser Ser Ser Glu Asp Glu Asp Asp
1285 1290 1295
gtg tgg gga ggg cgc ggg ggg agg agc ccc ccg cag agt cgg ggg 4158
Val Trp Gly Gly Arg Gly Gly Arg Ser Pro Pro Gln Ser Arg Gly
1300 1305 1310
tgacggagtc ccctcctttt ctcgtgagcg ccacyggcgc gcggactgtt tgttgtttgt 4218
taataaaa 4226
<210>6
<211>1310
<212>PRT
<213>水痘带状疱疹
<220>
<221>misc_feature
<222>(99)..(99)
<223>第99位的’Xaa’代表Thr,或Met
<220>
<221>misc_feature
<222>(512)..(512)
<223>第512位的’Xaa’代表Ala,或Val
<220>
<221>misc_feature
<222>(1275)..(1275)
<223>第1275位的’Xaa’代表Ser,Leu,或Xaa
<400>6
Met Asp Thr Pro Pro Met Gln Arg Ser Thr Pro Gln Arg Ala Gly Ser
1 5 10 15
Pro Asp Thr Leu Glu Leu Met Asp Leu Leu Asp Ala Ala Ala Ala Ala
20 25 30
Ala Glu His Arg Ala Arg Val Val Thr Ser Ser Gln Pro Asp Asp Leu
35 40 45
Leu Phe Gly Glu Asn Gly Val Met Val Gly Arg Glu His Glu Ile Val
50 55 60
Ser Ile Pro Ser Val Ser Gly Leu Gln Pro Glu Pro Arg Thr Glu Asp
65 70 75 80
Val Gly Glu Glu Leu Thr Gln Asp Asp Tyr Val Cys Glu Asp Gly Gln
85 90 95
Asp Leu Xaa Gly Ser Pro Val Ile Pro Leu Ala Glu Val Phe His Thr
100 105 110
Arg Phe Ser Glu Ala Gly Ala Arg Glu Pro Thr Gly Ala Asp Arg Ser
115 120 125
Leu Glu Thr Val Ser Leu Gly Thr Lys Leu Ala Arg Ser Pro Lys Pro
130 135 140
Pro Met Asn Asp Gly Glu Thr Gly Arg Gly Thr Thr Pro Pro Phe Pro
145 150 155 160
Gln Ala Phe Ser Pro Val Ser Pro Ala Ser Pro Val Gly Asp Ala Ala
165 170 175
Gly Asn Asp Gln Arg Glu Asp Gln Arg Ser Ile Pro Arg Gln Thr Thr
180 185 190
Arg Gly Asn Ser Pro Gly Leu Pro Ser Val Val His Arg Asp Arg Gln
195 200 205
Thr Gln Ser Ile Ser Gly Lys Lys Pro Gly Asp Glu Gln Ala Gly His
210 215 220
Ala His Ala Ser Gly Asp Gly Val Val Leu Gln Lys Thr Gln Arg Pro
225 230 235 240
Ala Gln Gly Lys Ser Pro Lys Lys Lys Thr Leu Lys Val Lys Val Pro
245 250 255
Leu Pro Ala Arg Lys Pro Gly Gly Pro Val Pro Gly Pro Val Glu Gln
260 265 270
Leu Tyr His Val Leu Ser Asp Ser Val Pro Ala Lys Gly Ala Lys Ala
275 280 285
Asp Leu Pro Phe Glu Thr Asp Asp Thr Arg Pro Arg Lys His Asp Ala
290 295 300
Arg Gly Ile Thr Pro Arg Val Pro Gly Arg Ser Ser Gly Gly Lys Pro
305 310 315 320
Arg Ala Phe Leu Ala Leu Pro Gly Arg Ser His Ala Pro Asp Pro Ile
325 330 335
Glu Asp Asp Ser Pro Val Glu Lys Lys Pro Lys Ser Arg Glu Phe Val
340 345 350
Ser Ser Ser Ser Ser Ser Ser Ser Trp Gly Ser Ser Ser Glu Asp Glu
355 360 365
Asp Asp Glu Pro Arg Arg Val Ser Val Gly Ser Glu Thr Thr Gly Ser
370 375 380
Arg Ser Gly Arg Glu His Ala Pro Ser Pro Ser Asn Ser Asp Asp Ser
385 390 395 400
Asp Ser Asn Asp Gly Gly Ser Thr Lys Gln Asn Ile Gln Pro Gly Tyr
405 410 415
Arg Ser Ile Ser Gly Pro Asp Pro Arg Ile Arg Lys Thr Lys Arg Leu
420 425 430
Ala Gly Glu Pro Gly Arg Gln Arg Gln Lys Ser Phe Ser Leu Pro Arg
435 440 445
Ser Arg Thr Pro Ile Ile Pro Pro Val Ser Gly Pro Leu Met Met Pro
450 455 460
Asp Gly Ser Pro Trp Pro Gly Ser Ala Pro Leu Pro Ser Asn Arg Val
465 470 475 480
Arg Phe Gly Pro Ser Gly Glu Thr Arg Glu Gly His Trp Glu Asp Glu
485 490 495
Ala Val Arg Ala Ala Arg Ala Arg Tyr Glu Ala Ser Thr Glu Pro Xaa
500 505 510
Pro Leu Tyr Val Pro Glu Leu Gly Asp Pro Ala Arg Gln Tyr Arg Ala
515 520 525
Leu Ile Asn Leu Ile Tyr Cys Pro Asp Arg Asp Pro Ile Ala Trp Leu
530 535 540
Gln Asn Pro Lys Leu Thr Gly Val Asn Ser Ala Leu Asn Gln Phe Tyr
545 550 555 560
Gln Lys Leu Leu Pro Pro Gly Arg Ala Gly Thr Ala Val Thr Gly Ser
565 570 575
Val Ala Ser Pro Val Pro His Val Gly Glu Ala Met Ala Thr Gly Glu
580 585 590
Ala Leu Trp Ala Leu Pro His Ala Ala Ala Ala Val Ala Met Ser Arg
595 600 605
Arg Tyr Asp Arg Ala Gln Lys His Phe Ile Leu Gln Ser Leu Arg Arg
610 615 620
Ala Phe Ala Gly Met Ala Tyr Pro Glu Ala Thr Gly Ser Ser Pro Ala
625 630 635 640
Ala Arg Ile Ser Arg Gly His Pro Ser Pro Thr Thr Pro Ala Thr Gln
645 650 655
Thr Pro Asp Pro Gln Pro Ser Ala Ala Ala Arg Ser Leu Ser Val Cys
660 665 670
Pro Pro Asp Asp Arg Leu Arg Thr Pro Arg Lys Arg Lys Ser Gln Pro
675 680 685
Val Glu Ser Arg Ser Leu Leu Asp Lys Ile Arg Glu Thr Pro Val Ala
690 695 700
Asp Ala Arg Val Ala Asp Asp His Val Val Ser Lys Ala Lys Arg Arg
705 710 715 720
Val Ser Glu Pro Val Thr Ile Thr Ser Gly Pro Val Val Asp Pro Pro
725 730 735
Ala Val Ile Thr Met Pro Leu Asp Gly Pro Ala Pro Asn Gly Gly Phe
740 745 750
Arg Arg Ile Pro Arg Gly Ala Leu His Thr Pro Val Pro Ser Asp Gln
755 760 765
Ala Arg Lys Ala Tyr Cys Thr Pro Glu Thr Ile Ala Arg Leu Val Asp
770 775 780
Asp Pro Leu Phe Pro Thr Ala Trp Arg Pro Ala Leu Ser Phe Asp Pro
785 790 795 800
Gly Ala Leu Ala Glu Ile Ala Ala Arg Arg Pro Gly Gly Gly Asp Arg
805 810 815
Arg Phe Gly Pro Pro Ser Gly Val Glu Ala Leu Arg Arg Arg Cys Ala
820 825 830
Trp Met Arg Gln Ile Pro Asp Pro Glu Asp Val Arg Leu Leu Ile Ile
835 840 845
Tyr Asp Pro Leu Pro Gly Glu Asp Ile Asn Gly Pro Leu Glu Ser Thr
850 855 860
Leu Ala Thr Asp Pro Gly Pro Ser Trp Ser Pro Ser Arg Gly Gly Leu
865 870 875 880
Ser Val Val Leu Ala Ala Leu Ser Asn Arg Leu Cys Leu Pro Ser Thr
885 890 895
His Ala Trp Ala Gly Asn Trp Thr Gly Pro Pro Asp Val Ser Ala Leu
900 905 910
Asn Ala Arg Gly Val Leu Leu Leu Ser Thr Arg Asp Leu Ala Phe Ala
915 920 925
Gly Ala Val Glu Tyr Leu Gly Ser Arg Leu Ala Ser Ala Arg Arg Arg
930 935 940
Leu Leu Val Leu Asp Ala Val Ala Leu Glu Arg Trp Pro Gly Asp Gly
945 950 955 960
Pro Ala Leu Ser Gln Tyr His Val Tyr Val Arg Ala Pro Ala Arg Pro
965 970 975
Asp Ala Gln Ala Val Val Arg Trp Pro Asp Ser Ala Val Thr Glu Gly
980 985 990
Leu Ala Arg Ala Val Phe Ala Ser Ser Arg Thr Phe Gly Pro Ala Ser
995 1000 1005
Phe Ala Arg Ile Glu Thr Ala Phe Ala Asn Leu Tyr Pro Gly Glu
1010 1015 1020
Gln Pro Leu Cys Leu Cys Arg Gly Gly Asn Val Ala Tyr Thr Val
1025 1030 1035
Cys Thr Arg Ala Gly Pro Lys Thr Arg Val Pro Leu Ser Pro Arg
1040 1045 1050
Glu Tyr Arg Gln Tyr Val Leu Pro Gly Phe Asp Gly Cys Lys Asp
1055 1060 1065
Leu Ala Arg Gln Ser Arg Gly Leu Gly Leu Gly Ala Ala Asp Phe
1070 1075 1080
Val Asp Glu Ala Ala His Ser His Arg Ala Ala Asn Arg Trp Gly
1085 1090 1095
Leu Gly Ala Ala Leu Arg Pro Val Phe Leu Pro Glu Gly Arg Arg
1100 1105 1110
Pro Gly Ala Ala Gly Pro Glu Ala Gly Asp Val Pro Thr Trp Ala
1115 1120 1125
Arg Val Phe Cys Arg His Ala Leu Leu Glu Pro Asp Pro Ala Ala
1130 1135 1140
Glu Pro Leu Val Leu Pro Pro Val Ala Gly Arg Ser Val Ala Leu
1145 1150 1155
Tyr Ala Ser Ala Asp Glu Ala Arg Asn Ala Leu Pro Pro Ile Pro
1160 1165 1170
Arg Val Met Trp Pro Pro Gly Phe Gly Ala Ala Glu Thr Val Leu
1175 1180 1185
Glu Gly Ser Asp Gly Thr Arg Phe Ala Phe Gly His His Gly Gly
1190 1195 1200
Ser Glu Arg Pro Ala Glu Thr Gln Ala Gly Arg Gln Arg Arg Thr
1205 1210 1215
Ala Asp Asp Arg Glu His Ala Leu Glu Pro Asp Asp Trp Glu Val
1220 1225 1230
Gly Cys Glu Asp Ala Trp Asp Ser Glu Glu Gly Gly Gly Asp Asp
1235 1240 1245
Gly Asp Ala Pro Gly Ser Ser Phe Gly Val Ser Val Val Ser Val
1250 1255 1260
Ala Pro Gly Val Leu Arg Asp Arg Arg Val Gly Xaa Arg Pro Ala
1265 1270 1275
Val Lys Val Glu Leu Leu Ser Ser Ser Ser Ser Ser Glu Asp Glu
1280 1285 1290
Asp Asp Val Trp Gly Gly Arg Gly Gly Arg Ser Pro Pro Gln Ser
1295 1300 1305
Arg Gly
1310
<210>7
<211>8878
<212>DNA
<213>人工的
<220>
<223>plasmid
<400>7
ttaattaagg ccgcagcttc ctagataact tcgtatagca tacattatac gaagttatgg 60
atctcccgcc cagcgtcttg tcattggcga actcgaacac gcagatgcag tcggggcggc 120
gcggtcccag gtccacttcg catattaagg tgacacgcgc ggcctcgaac acagctgcag 180
gccatgagcg aaaaatacat cgtcacctgg gacatgttgc agatccatgc acgtaaactc 240
gcaagccgac tgatgccttc tgaacaatgg aaaggcatta ttgccgtaag ccgtggcggt 300
ctggtaccgg gtgcgttact ggcgcgtgaa ctgggtattc gtcatgtcga taccgtttgt 360
atttccagct acgatcacga caaccagcgc gagcttaaag tgctgaaacg cgcagaaggc 420
gatggcgaag gcttcatcgt tattgatgac ctggtggata ccggtggtac tgcggttgcg 480
attcgtgaaa tgtatccaaa agcgcacttt gtcaccatct tcgcaaaacc ggctggtcgt 540
ccgctggttg atgactatgt tgttgatatc ccgcaagata cctggattga acagccgtgg 600
gatatgggcg tcgtattcgt cccgccaatc tccggtcgct aaccggtagc ggatcatcta 660
gacccgggta ccgttaactt gtttattgca gcttataatg gttacaaata aagcaatagc 720
atcacaaatt tcacaaataa agcatttttt tcactgcatt ctagttgtgg tttgtccaaa 780
ctcatcaatg tatcttatca tgtctggatc cccattctca tgtttgacag cttatcatcg 840
aatttctgcc attcatccgc ttattatcac ttattcaggc gtagcaacca ggcgtttaag 900
ggcaccaata actgccttaa aaaaattacg ccccgccctg ccactcatcg cagtactgtt 960
gtaattcatt aagcattctg ccgacatgga agccatcaca gacggcatga tgaacctgaa 1020
tcgccagcgg catcagcacc ttgtcgcctt gcgtataata tttgcccatg gtgaaaacgg 1080
gggcgaagaa gttgtccata ttggccacgt ttaaatcaaa actggtgaaa ctcacccagg 1140
gattggctga gacgaaaaac atattctcaa taaacccttt agggaaatag gccaggtttt 1200
caccgtaaca cgccacatct tgcgaatata tgtgtagaaa ctgccggaaa tcgtcgtggt 1260
attcactcca gagcgatgaa aacgtttcag tttgctcatg gaaaacggtg taacaagggt 1320
gaacactatc ccatatcacc agctcaccgt ctttcattgc catacggaat tccggatgag 1380
cattcatcag gcgggcaaga atgtgaataa aggccggata aaacttgtgc ttatttttct 1440
ttacggtctt taaaaaggcc gtaatatcca gctgaacggt ctggttatag gtacattgag 1500
caactgactg aaatgcctca aaatgttctt tacgatgcca ttgggatata tcaacggtgg 1560
tatatccagt gatttttttc tccattttag cttccttagc tcctgaaaat ctcgataact 1620
caaaaaatac gcccggtagt gatcttattt cattatggtg aaagttggaa cctcttacgt 1680
gccgatcaac gtctcatttt cgccaaaagt tggcccaggg cttcccggta tcaacaggga 1740
caccaggatt tatttattct gcgaagtgat cttccgtcac aggtatttat tcgcgataag 1800
ctcatggagc ggcgtaaccg tcgcacagga aggacagaga aagcgcggat ctgggaagtg 1860
acggacagaa cggtcaggac ctggattggg gaggcggttg ccgccgctgc tgctgacggt 1920
gtgacgttct ctgttccggt cacaccacat acgttccgcc attcctatgc gatgcacatg 1980
ctgtatgccg gtataccgct gaaagttctg caaagcctga tgggacataa gtccatcagt 2040
tcaacggaag tctacacgaa ggtttttgcg ctggatgtgg ctgcccggca ccgggtgcag 2100
tttgcgatgc cggagtctga tgcggttgcg atgctgaaac aattatcctg agaataaatg 2160
ccttggcctt tatatggaaa tgtggaactg agtggatatg ctgtttttgt ctgttaaaca 2220
gagaagctgg ctgttatcca ctgagaagcg aacgaaacag tcgggaaaat ctcccattat 2280
cgtagagatc cgcattatta atctcaggag cctgtgtagc gtttatagga agtagtgttc 2340
tgtcatgatg cctgcaagcg gtaacgaaaa cgatttgaat atgccttcag gaacaataga 2400
aatcttcgtg cggtgttacg ttgaagtgga gcggattatg tcagcaatgg acagaacaac 2460
ctaatgaaca cagaaccatg atgtggtctg tccttttaca gccagtaggc tcgccgcagt 2520
cgagcgacgg cgaagccctc gagtgagcga ggaagcacca gggaacagca cttatatatt 2580
ctgcttacac acgatgcctg aaaaaacttc ccttggggtt atccacttat ccacggggat 2640
atttttataa ttattttttt tatagttttt agatcttctt ttttagagcg ccttgtaggc 2700
ctttatccat gctggttcta gagaaggtgt tgtgacaaat tgccctttca gtgtgacaaa 2760
tcaccctcaa atgacagtcc tgtctgtgac aaattgccct taaccctgtg acaaattgcc 2820
ctcagaagaa gctgtttttt cacaaagtta tccctgctta ttgactcttt tttatttagt 2880
gtgacaatct aaaaacttgt cacacttcac atggatctgt catggcggaa acagcggtta 2940
tcaatcacaa gaaacgtaaa aatagcccgc gaatcgtcca gtcaaacgac ctcactgagg 3000
cggcatatag tctctcccgg gatcaaaaac gtatgctgta tctgttcgtt gaccagatca 3060
gaaaatctga tggcacccta caggaacatg acggtatctg cgagatccat gttgctaaat 3120
atgctgaaat attcggattg acctctgcgg aagccagtaa ggatatacgg caggcattga 3180
agagtttcgc ggggaaggaa gtggtttttt atcgccctga agaggatgcc ggcgatgaaa 3240
aaggctatga atcttttcct tggtttatca aacgtgcgca cagtccatcc agagggcttt 3300
acagtgtaca tatcaaccca tatctcattc ccttctttat cgggttacag aaccggttta 3360
cgcagtttcg gcttagtgaa acaaaagaaa tcaccaatcc gtatgccatg cgtttatacg 3420
aatccctgtg tcagtatcgt aagccggatg gctcaggcat cgtctctctg aaaatcgact 3480
ggatcataga gcgttaccag ctgcctcaaa gttaccagcg tatgcctgac ttccgccgcc 3540
gcttcctgca ggtctgtgtt aatgagatca acagcagaac tccaatgcgc ctctcataca 3600
ttgagaaaaa gaaaggccgc cagacgactc atatcgtatt ttccttccgc gatatcactt 3660
ccatgacgac aggatagtct gagggttatc tgtcacagat ttgagggtgg ttcgtcacat 3720
ttgttctgac ctactgaggg taatttgtca cagttttgct gtttccttca gcctgcatgg 3780
attttctcat actttttgaa ctgtaatttt taaggaagcc aaatttgagg gcagtttgtc 3840
acagttgatt tccttctctt tcccttcgtc atgtgacctg atatcggggg ttagttcgtc 3900
atcattgatg agggttgatt atcacagttt attactctga attggctatc cgcgtgtgta 3960
cctctacctg gagtttttcc cacggtggat atttcttctt gcgctgagcg taagagctat 4020
ctgacagaac agttcttctt tgcttcctcg ccagttcgct cgctatgctc ggttacacgg 4080
ctgcggcgag cgctagtgat aataagtgac tgaggtatgt gctcttctta tctccttttg 4140
tagtgttgct cttattttaa acaactttgc ggttttttga tgactttgcg attttgttgt 4200
tgctttgcag taaattgcaa gatttaataa aaaaacgcaa agcaatgatt aaaggatgtt 4260
cagaatgaaa ctcatggaaa cacttaacca gtgcataaac gctggtcatg aaatgacgaa 4320
ggctatcgcc attgcacagt ttaatgatga cagcccggaa gcgaggaaaa taacccggcg 4380
ctggagaata ggtgaagcag cggatttagt tggggtttct tctcaggcta tcagagatgc 4440
cgagaaagca gggcgactac cgcacccgga tatggaaatt cgaggacggg ttgagcaacg 4500
tgttggttat acaattgaac aaattaatca tatgcgtgat gtgtttggta cgcgattgcg 4560
acgtgctgaa gacgtatttc caccggtgat cggggttgct gcccataaag gtggcgttta 4620
caaaacctca gtttctgttc atcttgctca ggatctggct ctgaaggggc tacgtgtttt 4680
gctcgtggaa ggtaacgacc cccagggaac agcctcaatg tatcacggat gggtaccaga 4740
tcttcatatt catgcagaag acactctcct gcctttctat cttggggaaa aggacgatgt 4800
cacttatgca ataaagccca cttgctggcc ggggcttgac attattcctt cctgtctggc 4860
tctgcaccgt attgaaactg agttaatggg caaatttgat gaaggtaaac tgcccaccga 4920
tccacacctg atgctccgac tggccattga aactgttgct catgactatg atgtcatagt 4980
tattgacagc gcgcctaacc tgggtatcgg cacgattaat gtcgtatgtg ctgctgatgt 5040
gctgattgtt cccacgcctg ctgagttgtt tgactacacc tccgcactgc agtttttcga 5100
tatgcttcgt gatctgctca agaacgttga tcttaaaggg ttcgagcctg atgtacgtat 5160
tttgcttacc aaatacagca atagtaatgg ctctcagtcc ccgtggatgg aggagcaaat 5220
tcgggatgcc tggggaagca tggttctaaa aaatgttgta cgtgaaacgg atgaagttgg 5280
taaaggtcag atccggatga gaactgtttt tgaacaggcc attgatcaac gctcttcaac 5340
tggtgcctgg agaaatgctc tttctatttg ggaacctgtc tgcaatgaaa ttttcgatcg 5400
tctgattaaa ccacgctggg agattagata atgaagcgtg cgcctgttat tccaaaacat 5460
acgctcaata ctcaaccggt tgaagatact tcgttatcga caccagctgc cccgatggtg 5520
gattcgttaa ttgcgcgcgt aggagtaatg gctcgcggta atgccattac tttgcctgta 5580
tgtggtcggg atgtgaagtt tactcttgaa gtgctccggg gtgatagtgt tgagaagacc 5640
tctcgggtat ggtcaggtaa tgaacgtgac caggagctgc ttactgagga cgcactggat 5700
gatctcatcc cttcttttct actgactggt caacagacac cggcgttcgg tcgaagagta 5760
tctggtgtca tagaaattgc cgatgggagt cgccgtcgta aagctgctgc acttaccgaa 5820
agtgattatc gtgttctggt tggcgagctg gatgatgagc agatggctgc attatccaga 5880
ttgggtaacg attatcgccc aacaagtgct tatgaacgtg gtcagcgtta tgcaagccga 5940
ttgcagaatg aatttgctgg aaatatttct gcgctggctg atgcggaaaa tatttcacgt 6000
aagattatta cccgctgtat caacaccgcc aaattgccta aatcagttgt tgctcttttt 6060
tctcaccccg gtgaactatc tgcccggtca ggtgatgcac ttcaaaaagc ctttacagat 6120
aaagaggaat tacttaagca gcaggcatct aaccttcatg agcagaaaaa agctggggtg 6180
atatttgaag ctgaagaagt tatcactctt ttaacttctg tgcttaaaac gtcatctgca 6240
tcaagaacta gtttaagctc acgacatcag tttgctcctg gagcgacagt attgtataag 6300
ggcgataaaa tggtgcttaa cctggacagg tctcgtgttc caactgagtg tatagagaaa 6360
attgaggcca ttcttaagga acttgaaaag ccagcaccct gatgcgacca cgttttagtc 6420
tacgtttatc tgtctttact taatgtcctt tgttacaggc cagaaagcat aactggcctg 6480
aatattctct ctgggcccac tgttccactt gtatcgtcgg tctgataatc agactgggac 6540
cacggtccca ctcgtatcgt cggtctgatt attagtctgg gaccacggtc ccactcgtat 6600
cgtcggtctg attattagtc tgggaccacg gtcccactcg tatcgtcggt ctgataatca 6660
gactgggacc acggtcccac tcgtatcgtc ggtctgatta ttagtctggg accatggtcc 6720
cactcgtatc gtcggtctga ttattagtct gggaccacgg tcccactcgt atcgtcggtc 6780
tgattattag tctggaacca cggtcccact cgtatcgtcg gtctgattat tagtctggga 6840
ccacggtccc actcgtatcg tcggtctgat tattagtctg ggaccacgat cccactcgtg 6900
ttgtcggtct gattatcggt ctgggaccac ggtcccactt gtattgtcga tcagactatc 6960
agcgtgagac tacgattcca tcaatgcctg tcaagggcaa gtattgacat gtcgtcgtaa 7020
cctgtagaac ggagtaacct cggtgtgcgg ttgtatgcct gctgtggatt gctgctgtgt 7080
cctgcttatc cacaacattt tgcgcacggt tatgtggaca aaatacctgg ttacccaggc 7140
cgtgccggca cgttaaccgg gctgcatccg atgcaagtgt gtcgctgtcg agtttaaaca 7200
tgcatagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca tatatggagt 7260
tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc 7320
cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac 7380
gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata 7440
tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc 7500
agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta 7560
ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg tttgactcac 7620
ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg caccaaaatc 7680
aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg ggcggtaggc 7740
gtgtacggtg ggaggtctat ataagcagag ctggtttagt gaaccgtcag atccgctagc 7800
gctaccggtc gccaccatgg tgagcaaggg cgaggagctg ttcaccgggg tggtgcccat 7860
cctggtcgag ctggacggcg acgtaaacgg ccacaagttc agcgtgtccg gcgagggcga 7920
gggcgatgcc acctacggca agctgaccct gaagttcatc tgcaccaccg gcaagctgcc 7980
cgtgccctgg cccaccctcg tgaccaccct gacctacggc gtgcagtgct tcagccgcta 8040
ccccgaccac atgaagcagc acgacttctt caagtccgcc atgcccgaag gctacgtcca 8100
ggagcgcacc atcttcttca aggacgacgg caactacaag acccgcgccg aggtgaagtt 8160
cgagggcgac accctggtga accgcatcga gctgaagggc atcgacttca aggaggacgg 8220
caacatcctg gggcacaagc tggagtacaa ctacaacagc cacaacgtct atatcatggc 8280
cgacaagcag aagaacggca tcaaggtgaa cttcaagatc cgccacaaca tcgaggacgg 8340
cagcgtgcag ctcgccgacc actaccagca gaacaccccc atcggcgacg gccccgtgct 8400
gctgcccgac aaccactacc tgagcaccca gtccgccctg agcaaagacc ccaacgagaa 8460
gcgcgatcac atggtcctgc tggagttcgt gaccgccgcc gggatcactc tcggcatgga 8520
cgagctgtac aagtccggac tcagatccac cggatctaga taactgatca taatcagcca 8580
taccacattt gtagaggttt tacttgcttt aaaaaacctc ccacacctcc ccctgaacct 8640
gaaacataaa atgaatgcaa ttgttgttgt taacttgttt attgcagctt ataatggtta 8700
caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag 8760
ttgtggtttg tccaaactca tcaatgtatc ttaaatggcc gcataacttc gtatagcata 8820
cattatacga agttatctag cagatctgaa ttcgatatca agctgcggcc ttaattaa 8878
<210>8
<211>124884
<212>DNA
<213>水痘带状疱疹
<220>
<221>CDS
<222>(1134)..(1850)
<220>
<221>CDS
<222>(8607)..(9386)
<220>
<221>CDS
<222>(10642)..(10902)
<220>
<221>CDS
<222>(11009)..(11917)
<220>
<221>CDS
<222>(12160)..(13392)
<220>
<221>CDS
<222>(13590)..(16049)
<220>
<221>CDS
<222>(16214)..(18199)
<220>
<221>CDS
<222>(18441)..(19346)
<220>
<221>CDS
<222>(24149)..(25516)
<220>
<221>CDS
<222>(30759)..(33875)
<220>
<221>CDS
<222>(34083)..(42374)
<220>
<221>CDS
<222>(44506)..(46263)
<220>
<221>CDS
<222>(50857)..(54471)
<220>
<221>CDS
<222>(54651)..(56963)
<220>
<221>CDS
<222>(57008)..(59614)
<220>
<221>CDS
<222>(59766)..(60197)
<220>
<221>CDS
<222>(64807)..(65832)
<220>
<221>CDS
<222>(66074)..(68599)
<220>
<221>CDS
<222>(70633)..(71355)
<220>
<221>CDS
<222>(71540)..(75730)
<220>
<221>CDS
<222>(75847)..(76797)
<220>
<221>CDS
<222>(78170)..(80200)
<220>
<221>CDS
<222>(80360)..(81451)
<220>
<221>CDS
<222>(82719)..(83318)
<220>
<221>CDS
<222>(84667)..(86322)
<220>
<221>CDS
<222>(87881)..(90388)
<220>
<221>CDS
<222>(90493)..(92808)
<220>
<221>CDS
<222>(95996)..(98641)
<220>
<221>CDS
<222>(110581)..(111417)
<220>
<221>CDS
<222>(111565)..(112107)
<220>
<221>CDS
<222>(113037)..(114218)
<220>
<221>CDS
<222>(114496)..(115560)
<220>
<221>CDS
<222>(115808)..(117679)
<220>
<221>CDS
<222>(120764)..(124696)
<400>8
aggccagccc tctcgcggcc ccctcgagag agaaaaaaaa aagcgacccc acctccccgc 60
gcgtttgcgg ggcgaccatc gggggggatg ggattttttg ccgggaaacc cccccccgcc 120
agcctttaac aaaacccgcg ccttttgcgt ccacccctcg tttactgctc ggatggcgac 180
cgtgcactac tcccgccgac ctgggacccc gccggtcacc ctcacgtcgt cccccagcat 240
ggatgacgtt gcgaccccca tcccctacct acccacatac gccgaggccg tggcagacgc 300
gcccccccct tacagaagcc gcgagagtct ggtgttctcc ccgcctcttt ttcctcacgt 360
ggagaatggc accacccaac agtcttacga ttgcctagac tgcgcttatg atggaatcca 420
cagacttcag ctggcttttc taagaattcg caaatgctgt gtaccggctt ttttaattct 480
ttttggtatt ctcaccctta ctgctgtcgt ggtcgccatt gttgccgttt ttcccgagga 540
acctcccaac tcaactacat gaaactactg tccggaaggg gaaggtattt attctcgctt 600
gcagcttgtc gcgcgtgtat gcacaacaaa agctatatat gtcaccaaag ccaacgtcgc 660
catctggagt actacaccca gtacgttgca taacctgtcc atttgcattt tcagttgcgc 720
ggacgccttt ctccgggatc gtggccttgg gacatcaacc agtggaataa gaaccgccgg 780
tggtcttgtt tgaacgacga gtggcgacgc gttgttctgc ataagctctg tatgctgata 840
cataaacaca gagtctgtat cgctatcaga ttcccgaaca ccttccggta ccccatactc 900
cgataccctg gacattgcgg atcccaaaaa tataatatta acaggatttg cttatacttt 960
gctacagctt atataaattt atgtgcgata catcttaagt gcatccgtac gttatttata 1020
cattgcctgt cacgtgaaaa gactgtgtta cccaataaag gttctacaaa aaatgcttta 1080
ttgggtgttt gtttaatagc tattatcgta acccaccccc gtaaaatcat aaa atg 1136
Met
1
cat gta att tct gag aca ctt gca tat ggg cat gtt ccc gca ttt att 1184
His Val Ile Ser Glu Thr Leu Ala Tyr Gly His Val Pro Ala Phe Ile
5 10 15
atg ggc tcc act ctg gtg cgt ccc agt tta aac gcc acc gcc gag gaa 1232
Met Gly Ser Thr Leu Val Arg Pro Ser Leu Asn Ala Thr Ala Glu Glu
20 25 30
aat ccc gcg tca gaa acg cga tgt tta tta cga gtg ctt gcg ggg aga 1280
Asn Pro Ala Ser Glu Thr Arg Cys Leu Leu Arg Val Leu Ala Gly Arg
35 40 45
act gta gac ctg cca ggc gga gga acg tta cac att acc tgt acc aaa 1328
Thr Val Asp Leu Pro Gly Gly Gly Thr Leu His Ile Thr Cys Thr Lys
50 55 60 65
acc tat gta att att ggc aaa tat agc aaa ccc ggc gaa cgt ctt agc 1376
Thr Tyr Val Ile Ile Gly Lys Tyr Ser Lys Pro Gly Glu Arg Leu Ser
70 75 80
ctt gcc cgt cta ata ggg cgt gca atg acg cct gga ggt gca agg aca 1424
Leu Ala Arg Leu Ile Gly Arg Ala Met Thr Pro Gly Gly Ala Arg Thr
85 90 95
ttt att att ttg gcg atg aag gaa aag cga tcc aca acg ctt ggg tat 1472
Phe Ile Ile Leu Ala Met Lys Glu Lys Arg Ser Thr Thr Leu Gly Tyr
100 105 110
gaa tgt ggt acg ggc ttg cat tta ctg gct cca tct atg ggt aca ttt 1520
Glu Cys Gly Thr Gly Leu His Leu Leu Ala Pro Ser Met Gly Thr Phe
115 120 125
ctc cgc aca cac ggt tta agt aac aga gat ctc tgt tta tgg cgg ggt 1568
Leu Arg Thr His Gly Leu Ser Asn Arg Asp Leu Cys Leu Trp Arg Gly
130 135 140 145
aat att tat gat atg cat atg caa cgt ctt atg ttt tgg gag aat atc 1616
Asn Ile Tyr Asp Met His Met Gln Arg Leu Met Phe Trp Glu Asn Ile
150 155 160
gcg caa aat acc act gaa aca cct tgt ata acg tcg acg tta aca tgc 1664
Ala Gln Asn Thr Thr Glu Thr Pro Cys Ile Thr Ser Thr Leu Thr Cys
165 170 175
aac ttg aca gaa gac tct ggt gaa gcc gca ctt acc acg tca gac cga 1712
Asn Leu Thr Glu Asp Ser Gly Glu Ala Ala Leu Thr Thr Ser Asp Arg
180 185 190
ccc act ctc cca acc cta aca gcc caa gga aga cca aca gtt tcc aac 1760
Pro Thr Leu Pro Thr Leu Thr Ala Gln Gly Arg Pro Thr Val Ser Asn
195 200 205
att cgt gga ata ttg aaa gga tcc ccc cgt caa cag ccg gtc tgt cac 1808
Ile Arg Gly Ile Leu Lys Gly Ser Pro Arg Gln Gln Pro Val Cys His
210 215 220 225
cgg gtt aga ttt gcc gaa cct acg gag ggc gta ttg atg taa 1850
Arg Val Arg Phe Ala Glu Pro Thr Glu Gly Val Leu Met
230 235
tcactaaata aaatacacct tttttcgatt gtacgtattt ttatttaaat gtgtagttca 1910
tagtccgccg acagccgctc gggcttttcc cccacataca acatgatcgt atgcctcgga 1970
tgcaccggtc caacactccg ccgagaaggg ggatttacaa tgacagtgat acccaatagc 2030
cgccagatgt acacccagct gtccggactc cagcatcatc tgctgagttg cggcgctgaa 2090
gggtgcatcg catagggtgt tataattagc catttccggt aacagtcgtt gggaatttag 2150
gaggctgcaa aacggctgta ggtcaacata cattggggat tcagatggtt tatctcgacg 2210
tccaagtcca atcaaaaaag cgtgtaaatc atcagcccgg ccgcatgttg ctcgaagagc 2270
acataacctc ttaacaccgt acagagggga tggcgtcggt gcatgtgagt tggcagggca 2330
tgtccacgtt gtttccaacg ccagtggcgg tataacttgt gtaaacgacg ccaacgggtc 2390
aggtttaaga ttcactcgga tgggttgact gctttcggaa gctcccgttg tatccattaa 2450
ttaaacgttc ggtacacgtc tggtgtgtgt tttacccgaa tcagagacgg aattgcaaag 2510
atattggttt gaaagcaatg taatcccgcc catatatccc caacgtcgcc ttaaaaactc 2570
ccacaatatt acatttttat tagtctttta ttaatataga atcacataaa caattgataa 2630
aatcaagggg tggtgtataa tgattaaaaa tataaattga tatgttttac aagcatgaaa 2690
taggtattta ctattctaac aggtaaatat gcttaatgat taaaaataca aattagtatg 2750
ttttgacaag catgaaaaag gtatttttta ttttagcagt taaaggtact acacttaaaa 2810
tatttaccgt atggacgggc gtcagaaaga tgcccggccc aagttgagag ggtacattca 2870
acacgaccac actcgcgttg gtgggtgatt agggcctcta aaacaccggc cagacatgac 2930
ccgggtgtat attcttgtaa cacttgaacg ttacaactga tatcatcata ttccacaaat 2990
ttagagccac ggacaactat attagcaatg cgggcaatca taacaaacat ataagtagta 3050
atacacgtga tatcactaaa acgttgctgg cgcaacagtt cggggagagt acgagacccc 3110
aaatcgttgt ccctgtttag aagaagacat cttacaaaag gccccagctt taactttaaa 3170
ttctccaaaa gtgacttcga ggttgcaaca atgggattat ttgtgtagat gggcaagttt 3230
tttgccgcta acattttaat ccacgttaac agttcatccg cagactccaa cgcttcaatc 3290
aaagattctc cacgtatgac tctctcacgc aacgcgcggg caatacgtga gtccatttta 3350
tatgactcaa aggtacgata aagttcatgt ccgtacaaca tcaactccgg ccaagatgtg 3410
ttttgtttta tccccggaaa acatccaccg gaagcccatg aatcaccctc ttgtattgtg 3470
gcatatcgga ctaccagttt ttcaattgtt tcatctaaat ggcgtaccga gtcaatggtc 3530
acgctggctc ccgcggtgga gacgacttca atagcacggc ccgtaattcg atcgaccggg 3590
atatcatact cttttcgaat acgctctcgg cgggcgtctc tcttggaaaa tcgcaacctg 3650
tacgattcgt catgtgtctg atcatttctt tctcccgtgg tcattgcagg aggcgttgta 3710
ggacgccgtc ttcgatttga cagggatcga tcacggtgtt ttcttgaact ttgagtgtta 3770
taagatctgg atgatcgtcg atgtccccgt tcgatgcgtg catatccagt ctccacgtct 3830
cttcctccat gatggtttga atcgggtaat acaacaacca aagttttcgg gcgattgtgg 3890
tggtagcttt cacgccttcc gtgccttcgt ttggaatacc gtggattata tgctgtatct 3950
gcagtacgct ccacatacac agttctagac gttgtggagt cctcgcctgg agtggagcca 4010
atagcttcat catttgccca atcggtgact tccaatgcaa agtcatccga aggttcgtct 4070
ggtagcaaat tcataaagtc ttcacaaata gtagacacgt ctgggtcggt tggaattgaa 4130
gcagaggcca tggctgcaaa atatctgaca attgcgtgtt tgcagttgcc tgtatcttcc 4190
gccaatgttg tagaatttat aggctcaccc aaccccgcaa tgggcgtgtt tagtcacatg 4250
attaatgctt ctgggagttt tcactttccc caaacaagct tacctgcacc ctttgttcgt 4310
aatgcataaa aataaccact gctatagcaa atatgacgat ataaaaacat tttatagcaa 4370
ggccggacat tactgtagcg caacatgttg tgcatatacc acgtattccc cccgtattga 4430
tatgatttaa atgattatcc ttggttggtt ttggtctaac ataagatata agctctacta 4490
tagcgagcgt gcatacaaca acccaggcca gaatccgaat gtatgtgggg tataataacg 4550
cgcatggtgt atatgcaacg ccaagcgtta aaagcacaat acatccagat gatatatgag 4610
cgataacctc caaaagcatc aataacgtaa cacctttatg catatataaa aaacttatag 4670
ggtcagcatt aaatacttta ctcataccat cccgtcgcat ggaaacatca cataacaacc 4730
ttgccaactt tgtatatggg taaccaagaa gaatgttcga aataacccgt gttacgtaat 4790
tcagtgaata tgatgtgggg gatattaact cacaggatga tcggaatggc ccaaacatac 4850
gacgtattcg tcgaaattgt aaatacatac catatacaaa ccatgcaaaa aaaatcattt 4910
ttagctgcac gcaccaaaaa taagcgtgac aattacgtgt tcccagaaca attcgaattt 4970
tgtcatgcaa aggtgtagaa atagcggttt ttaccatagt atctcctgat aatagatttt 5030
cccggcagct gtaatcgtat ccagataggc catccaaaaa cgttgagtgg tttacaaacg 5090
ttacatatat aagagagttg ttataagacc cccatacaac cggtccacca ttaatcaccg 5150
tggttgcata cacacactca tgttcaaact ttacacgagc ggtataccat agggtaaaaa 5210
cagcatgtcc gctaagtaga cacataatta taaaatgttc tgtcttgatt cctaaagcct 5270
gcatgacccg tggaagatgg caattcaagc acgatgtagt atcacacggt tggtgttaac 5330
tcgaagttaa atttggataa ttaggtactt ctagagtaaa gattgtatgc atgcgattgc 5390
tatcgcactt tgtagcaaaa cattgttgtg caagcgaaat acacaaacgg ttgtgatgat 5450
ccactcgcag agacacaaat gtccggggag ccgttcttcc tccgcgatgg ggatatcgaa 5510
gacaagtgaa cccttttgtt ccgcatatga gctgaaataa cacccagtcc cttttgatgg 5570
cgatacactt tgatgatgtt aaggtatatt cgcgatcacg cccggggaaa tgaacagcaa 5630
tatgctccac aatagattct aatattgtgc tgtcgacaaa ggcctccagt gtaaatgcgt 5690
ccagacaagt taccccgcgc tcttttagag cctttgttaa agatatttgc ggggagctaa 5750
atatttgttt attacgcgca accttacgtt caaaaaactc tgcgtattcc cccccaaggt 5810
tatgtaaaat aaattgcact ggaacattcg actgcggtct tgaatgaaaa tgaaagtttg 5870
ccgggtttct atgtgatgtc acaaacgcta atatatcaat acactgctca ggtacaacat 5930
aaaatgggag tagttgtcca accgccgtcc ctgtggttgt tactttggag aaaaaaggca 5990
gtcttaaact atgtccgtgg ctataaacac cagtatctat aaacgaaaag tcccgtaaat 6050
acggaccaat atattcaaca aattcccgtt ccagcaacac cgcttgctgt aatatttgtg 6110
caaacccctt taaagtggaa gaccccacta acgcataggg atttgggatt ggtacgcata 6170
ccctgaaacc tattttctct ttacagttac agggtagagt ttcatgcaag ttttcattgt 6230
ttgatacatc ggcgtgtgta tggacttcag acgttgtctg tgtatcaaaa aaccatacat 6290
cctctgtata attctcttct acacacgtgt ataattcgcc attttctatg taaaaatcga 6350
tgtcagaatg gctggttata tccaataaat tatcatcatc caacacctca acggtaggtt 6410
caggacatgc agttttataa aaataacatg ggtctttgtt agggtttacc acggcctttg 6470
gaaaaagtaa ttgcatggcc gttaaaatac catgacgaaa tgctcgcatg ccggcatgta 6530
aaatacccaa tgggatgggt tttcttatat gaaagtctac atcaagtatg aggtttgtga 6590
ttataagatt tgtattaaat agctcattcc tgtttatata aagctgatct ttgggtatgt 6650
ttgatgaaat tttagaaacg tttttaacag acgtagataa tagtaaagtc aactgcatat 6710
ctcgtagtga agcggcaaca aaattacatg gattaatttg tttaaggtcc tccgcaatta 6770
atcgagcctc gtgcggtaaa gtgtaacggt ttgttattga tgaccacgta tcattagcaa 6830
taacagcaaa tgcttgggcg ccgtgaggca aggctacccg atatacaggc attggtccag 6890
ttacctcaga atggccgatg agggcttcta atggagtttt ataactcagg atggatacat 6950
catgtgtggc tatcccagtg gcagcagaga aaaacagtaa tagttttgta atccccgggc 7010
tcgtatcaaa accagtacga ccactttggt taggtgtatc gtttgcaaag ttggctgctc 7070
gtaacgcctc cgcggaaaca cccgaatcct caaaattaga caattcgtca aaaccgggtg 7130
gatttgaggg aatagtggag gaccatccat atggactaaa ttgtttttca atgttttcca 7190
cacgacgagt tagcgttgta gctaggtcac atacgcctat aaacttgcta ggttttgcgg 7250
catacgtaag acttaaagta tatgttttag taattgtata tttatgtcca atctcaggtc 7310
caagttcagt gacatcacaa attacgttct tttttatata gtcacgcatg ttgagacgag 7370
aacgtacatg attaaaaaaa ttagcagtag ctctttttcc caggttggat gattttaaga 7430
ggaccggttt attcacaaaa tctgagtatg taaccgcttg taggtggtct gcgatctgtt 7490
tccgattgaa acattcaaaa tgtgccagat aaatataatc aacaaattca cggtctggaa 7550
ctttaaggcc ttttctatcg ttggtaatat actccgatac tgcgtgtatt tccgttgtgt 7610
ctgtatgtat tcgctgtaaa atgtacgata gagcattttt ggctgtcaaa cctcgtgtat 7670
atgttgagga acaacaaaac atggaaagtt tatcaaaaga caacaagtcc gaaatattgt 7730
acccactaca attaggtaat gccgggactt ggtaagttaa aaacaaatct ttaattgcct 7790
gtaagtcata taagggggtt tccaacgtat tgtaacttgt gtccgtttgt aacaagtaat 7850
agcgtgtagc caacactagc gttttttcag agggtccaaa tcgaacaata taccaaaacg 7910
gcgagcatcc atacccccag tagagtcgtc gatatgcagc caatacttga cgttcgtaat 7970
gggcatataa tgatgttagc tcctgacgac caacggattt tttaactaac ttgcagagtg 8030
ttgcctctgt gatgcatagg ccgttgtccg ataatccctt tcggtttaaa tggtgtgttg 8090
ttaccatcag agtttgtata acttccgagt gaatgtcaaa cgtctccgat atacataggg 8150
tatcagatat tatatgcgga tttaggggtg ctccatacca taacgcctta tataaagctt 8210
taaaatcagt ttgggtttta aaacaacaaa aaaatatagg ccagacccgg gatcgtacat 8270
ctccagttga aaatccacca attaaataaa aaataacgtt gacgtcccta ctacaaaata 8330
aatgcattat ttggttttct tcatcgtttt cagttacttc acgtgggcgt ttagttggga 8390
ttacttgcgt gatctcttcc ctcccatttt tgacaaagac gtcatctaag tcgggagtcc 8450
aagtataact caccacatac agaggttctg tgcttatctg cccggtaagc aacaacagcg 8510
agtgggagat tgcacatccc tttgtggcaa ataataaccg aatcgtcggt ttggaggatt 8570
tatccatagt tcaatacgtt ggaaagccag tcaatc atg cag acg gtg tgt gcc 8624
Met Gln Thr Val Cys Ala
240
agc tta tgt gga tat gct cga ata cca act gaa gag cca tct tat gaa 8672
Ser Leu Cys Gly Tyr Ala Arg Ile Pro Thr Glu Glu Pro Ser Tyr Glu
245 250 255 260
gag gtg cgt gta aac acg cac ccc caa gga gcc gcc ctg ctc cgc ctc 8720
Glu Val Arg Val Asn Thr His Pro Gln Gly Ala Ala Leu Leu Arg Leu
265 270 275
caa gag gct tta acc gct gtg aat gga tta ttg cct gca cct cta acg 8768
Gln Glu Ala Leu Thr Ala Val Asn Gly Leu Leu Pro Ala Pro Leu Thr
280 285 290
tta gaa gac gta gtc gct tct gca gat aat acc cgt cgt ttg gtc cgc 8816
Leu Glu Asp Val Val Ala Ser Ala Asp Asn Thr Arg Arg Leu Val Arg
295 300 305
gcc cag gct ttg gcg cga act tac gct gca tgt tct cgt aac att gaa 8864
Ala Gln Ala Leu Ala Arg Thr Tyr Ala Ala Cys Ser Arg Asn Ile Glu
310 315 320
tgt tta aaa cag cac cat ttt act gaa gat aac ccc ggt ctt aac gcc 8912
Cys Leu Lys Gln His His Phe Thr Glu Asp Asn Pro Gly Leu Asn Ala
325 330 335 340
gtg gtc cgt tca cac atg gaa aac tca aaa cgg ctt gct gat atg tgt 8960
Val Val Arg Ser His Met Glu Asn Ser Lys Arg Leu Ala Asp Met Cys
345 350 355
tta gct gca att acc cat ttg tat tta tcg gtt ggc gcg gtg gat gtt 9008
Leu Ala Ala Ile Thr His Leu Tyr Leu Ser Val Gly Ala Val Asp Val
360 365 370
act acg gat gat att gtc gat caa acc ctg aga atg acc gct gaa agt 9056
Thr Thr Asp Asp Ile Val Asp Gln Thr Leu Arg Met Thr Ala Glu Ser
375 380 385
gaa gtg gtc atg tct gat gtt gtt ctt ttg gag aaa act ctt ggg gtc 9104
Glu Val Val Met Ser Asp Val Val Leu Leu Glu Lys Thr Leu Gly Val
390 395 400
gtt gct aaa cct cag gca tcg ttt gat gtt tcc cac aac cat gaa tta 9152
Val Ala Lys Pro Gln Ala Ser Phe Asp Val Ser His Asn His Glu Leu
405 410 415 420
tct ata gct aaa ggg gaa aat gtg ggt tta aaa aca tca cct att aaa 9200
Ser Ile Ala Lys Gly Glu Asn Val Gly Leu Lys Thr Ser Pro Ile Lys
425 430 435
tcg gag gcg aca caa tta tct gaa att aaa ccc cca ctt ata gaa gta 9248
Ser Glu Ala Thr Gln Leu Ser Glu Ile Lys Pro Pro Leu Ile Glu Val
440 445 450
tcg gat aat aac aca tct aac cta aca aaa aaa acg tat ccg aca gaa 9296
Ser Asp Asn Asn Thr Ser Asn Leu Thr Lys Lys Thr Tyr Pro Thr Glu
455 460 465
act ctt cag ccc gtg ttg acc cca aaa cag acg caa gat gta caa cgc 9344
Thr Leu Gln Pro Val Leu Thr Pro Lys Gln Thr Gln Asp Val Gln Arg
470 475 480
aca acc ccc gcg atc aag aaa tcc cat gtt atg ctt gta taa 9386
Thr Thr Pro Ala Ile Lys Lys Ser His Val Met Leu Val
485 490 495
atattgaaat aaaaactaaa aacgtttctg gtgtatgttt ttattttgta tataaaatta 9446
aaacattgct ggctggcgtg gttattacat ttaatgtttt agtagaaaat cgacatcgtt 9506
tgtttcttta tcagttgaac caaatccacg cgttccccgt tcgctgggtg tggctattag 9566
atctaacgtt ttagtaaaat accattgtac acccggtatg ccacatttac cgcggatagc 9626
ataaggaaat gcaatattac ttaaaacgtt gtgttttaag tgtatttggg tgttgtgatc 9686
tattaacagg acctgtgcaa gacgatctcc cgtttttata cgtatgtcat cacccgtgag 9746
attatatacg tagaatttac agtgttctcc tgcaggccat gccgttggac acacgataat 9806
gcctgatcgg cttttcgatg atcttccaaa aatataagcg tttatactcg gatgttgtaa 9866
gtcccagtct cttataatcg gtaagacaat ttttataaat tcattccttt ttaaatatag 9926
gttatatggt acacaaatat catatcccgc gtcttcttgg cgttttggat tgatgatatg 9986
tttgtaggtt aagggaacat cgatatggta ttctgcagaa tccctatgta aaggttgccc 10046
ctgctgtacc gtggaaatat cagcaaattc aggtataacg ggtttttcat aatttgacgg 10106
cgagtttgat aagggttgaa cttgtatcga tttaaaaatt ggatccagat gtttaagaac 10166
gttttttggg agaaggcgac tttgtcttaa ttttaccggg aacaagtaga ttgttaaatg 10226
tccgggtaaa ataacggtta ctcctggccg gtaatacaaa agggctgaaa ttactcctct 10286
gtaacccgca tcaataactc cgttggcgac aaaaaaattg tcttcatcag caagggcagt 10346
atctttgcat tgaattaaca acagtgcgta ttcattggga ggcgccgact taaccaacag 10406
ctccaactgc tgcatataaa aaccgccccg tgttacagat ttttcagatg gcagttcgag 10466
tttcttgtgg ttccggagta acaacggttg atgtcgactt actttatcgt ctaacacgca 10526
ttgcagcgta tctgcacatt caggttgaac ttctattaaa attgtatctt ttaaacaccg 10586
attcggaata gtttggctac aaaacatatc acctgtattt actgccgttt ccaag atg 10644
Met
gga tca att acc gct tcg ttc ata tta ata acg atg caa att tta ttt 10692
Gly Ser Ile Thr Ala Ser Phe Ile Leu Ile Thr Met Gln Ile Leu Phe
500 505 510
ttt tgt gaa gac agc agt ggg gag cca aac ttt gca gaa cgg aat ttt 10740
Phe Cys Glu Asp Ser Ser Gly Glu Pro Asn Phe Ala Glu Arg Asn Phe
515 520 525 530
tgg cat gcc agc tgt tcg gct cgt gga gtt tat atc gac gga tca atg 10788
Trp His Ala Ser Cys Ser Ala Arg Gly Val Tyr Ile Asp Gly Ser Met
535 540 545
atc acc acc ctt ttc ttc tac gca tcc ctt ttg ggg gtg tgt gta gcc 10836
Ile Thr Thr Leu Phe Phe Tyr Ala Ser Leu Leu Gly Val Cys Val Ala
550 555 560
ctt att tcg tta gct tat cat gcg tgt ttc cgg tta ttt act cgt tct 10884
Leu Ile Ser Leu Ala Tyr His Ala Cys Phe Arg Leu Phe Thr Arg Ser
565 570 575
gta tta cgc agc acg tgg taaacccgtt tgcctataaa aggggcaggc 10932
Val Leu Arg Ser Thr Trp
580
gtgtataaga gggcccctgt ttaatacgcg gtctgccgtg tttggatatt tcacgaccct 10992
atcgtttatt tacgta atg gca tct tcc gac ggt gac aga ctt tgt cgc tct 11044
Met Ala Ser Ser Asp Gly Asp Arg Leu Cys Arg Ser
585 590 595
aat gca gtg cgt cgt aaa aca acg cct agt tat tcc gga caa tat cga 11092
Asn Ala Val Arg Arg Lys Thr Thr Pro Ser Tyr Ser Gly Gln Tyr Arg
600 605 610
acc gcg cgg cga agt gtg gtc gta gga ccc ccc gat gat tca gac gac 11140
Thr Ala Arg Arg Ser Val Val Val Gly Pro Pro Asp Asp Ser Asp Asp
615 620 625
tcg ttg ggt tac att acc aca gtt ggg gcc gat tct cct tct cca gtg 11188
Ser Leu Gly Tyr Ile Thr Thr Val Gly Ala Asp Ser Pro Ser Pro Val
630 635 640
tac gcg gat ctt tat ttt gaa cat aaa aat acg acc cct cgc gta cat 11236
Tyr Ala Asp Leu Tyr Phe Glu His Lys Asn Thr Thr Pro Arg Val His
645 650 655 660
caa cca aac gac tcc agc gga tcg gaa gat gac ttt gaa gac atc gat 11284
Gln Pro Asn Asp Ser Ser Gly Ser Glu Asp Asp Phe Glu Asp Ile Asp
665 670 675
gaa gta gtg gcc gcc ttt cgg gag gcc cgt ttg aga cat gaa ctg gtt 11332
Glu Val Val Ala Ala Phe Arg Glu Ala Arg Leu Arg His Glu Leu Val
680 685 690
gaa gat gct gta tat gaa aac ccg cta agt gta gaa aaa cca tct aga 11380
Glu Asp Ala Val Tyr Glu Asn Pro Leu Ser Val Glu Lys Pro Ser Arg
695 700 705
tct ttt act aaa aat gcg gcg gtt aaa cct aaa tta gag gat tca ccg 11428
Ser Phe Thr Lys Asn Ala Ala Val Lys Pro Lys Leu Glu Asp Ser Pro
710 715 720
aag cga gct ccc ccg gga gca ggc gca att gcc agc ggg aga cca att 11476
Lys Arg Ala Pro Pro Gly Ala Gly Ala Ile Ala Ser Gly Arg Pro Ile
725 730 735 740
tcc ttc agc act gca cca aaa acc gca aca agc tcg tgg tgc ggt cct 11524
Ser Phe Ser Thr Ala Pro Lys Thr Ala Thr Ser Ser Trp Cys Gly Pro
745 750 755
acg cca tca tat aac aaa cgc gtc ttt tgt gaa gcg gtc cgg cgc gta 11572
Thr Pro Ser Tyr Asn Lys Arg Val Phe Cys Glu Ala Val Arg Arg Val
760 765 770
gcc gcc atg cag gca caa aag gct gcc gaa gcg gct tgg aat agt aat 11620
Ala Ala Met Gln Ala Gln Lys Ala Ala Glu Ala Ala Trp Asn Ser Asn
775 780 785
ccc cca agg aat aac gcc gaa tta gac cgt ttg tta acc gga gcc gtt 11668
Pro Pro Arg Asn Asn Ala Glu Leu Asp Arg Leu Leu Thr Gly Ala Val
790 795 800
att cgt att acg gtg cat gag ggt tta aat tta ata caa gcc gct aat 11716
Ile Arg Ile Thr Val His Glu Gly Leu Asn Leu Ile Gln Ala Ala Asn
805 810 815 820
gaa gca gac cta ggt gaa gga gca tcg gta tcc aaa cgt gga cat aat 11764
Glu Ala Asp Leu Gly Glu Gly Ala Ser Val Ser Lys Arg Gly His Asn
825 830 835
cga aaa act gga gat tta cag ggg ggc atg ggt aat gaa cct atg tac 11812
Arg Lys Thr Gly Asp Leu Gln Gly Gly Met Gly Asn Glu Pro Met Tyr
840 845 850
gca caa gtt cgt aag cca aaa agt cga acg gat aca caa acg act ggg 11860
Ala Gln Val Arg Lys Pro Lys Ser Arg Thr Asp Thr Gln Thr Thr Gly
855 860 865
cgt ata act aat cga agt agg gcc cgt tct gca tca aga act gat acg 11908
Arg Ile Thr Asn Arg Ser Arg Ala Arg Ser Ala Ser Arg Thr Asp Thr
870 875 880
cga aaa tag ggatataatt acgcagtaac ggtttacccg gtattatgta 11957
Arg Lys
885
taataaataa acgtataaaa gacagtcgtg gtttgtgttt attataaatg tgtattatat 12017
gtcacatatt ataaactgtt taaatagtac cacgtggtat tatgaacagt ttataatcag 12077
ttgctaccaa acaaacccca ttagacggcg ggttttgata aagggaatcg cttatttaaa 12137
ctaaagattt tactctataa gt atg gag tgt aat tta gga acc gaa cat cct 12189
Met Glu Cys Asn Leu Gly Thr Glu His Pro
890 895
agt aca gat acg tgg aat cgt agt aaa acg gaa caa gcg gtt gtg gac 12237
Ser Thr Asp Thr Trp Asn Arg Ser Lys Thr Glu Gln Ala Val Val Asp
900 905 910
gca ttt gat gaa tcg ttg ttt ggt gat gta gca tcg gat att gga ttt 12285
Ala Phe Asp Glu Ser Leu Phe Gly Asp Val Ala Ser Asp Ile Gly Phe
915 920 925
gaa acg tcg tta tat tca cat gca gtt aaa act gct ccg tct ccg cct 12333
Glu Thr Ser Leu Tyr Ser His Ala Val Lys Thr Ala Pro Ser Pro Pro
930 935 940
tgg gta gct agc cct aaa att tta tat caa cag tta ata cgg gat ctt 12381
Trp Val Ala Ser Pro Lys Ile Leu Tyr Gln Gln Leu Ile Arg Asp Leu
945 950 955 960
gat ttt tca gaa ggg ccg cgt tta cta tca tgt ctt gaa acc tgg aac 12429
Asp Phe Ser Glu Gly Pro Arg Leu Leu Ser Cys Leu Glu Thr Trp Asn
965 970 975
gag gat tta ttc tca tgt ttt cct att aat gag gac cta tat tcc gat 12477
Glu Asp Leu Phe Ser Cys Phe Pro Ile Asn Glu Asp Leu Tyr Ser Asp
980 985 990
atg atg gtt tta tcc ccg gat cca gat gac gtt atc tca acc gtt tca 12525
Met Met Val Leu Ser Pro Asp Pro Asp Asp Val Ile Ser Thr Val Ser
995 1000 1005
acc aaa gac cat gtt gaa atg ttt aat tta aca acc cgg ggt tcc 12570
Thr Lys Asp His Val Glu Met Phe Asn Leu Thr Thr Arg Gly Ser
1010 1015 1020
gtt cga ttg cct agt cca cca aag caa ccg acg ggg ctt cca gct 12615
Val Arg Leu Pro Ser Pro Pro Lys Gln Pro Thr Gly Leu Pro Ala
1025 1030 1035
tac gtt cag gag gtc cag gat tcg ttt acc gta gaa cta cgc gcc 12660
Tyr Val Gln Glu Val Gln Asp Ser Phe Thr Val Glu Leu Arg Ala
1040 1045 1050
cgg gaa gaa gca tac aca aaa cta cta gtt act tat tgt aaa tcg 12705
Arg Glu Glu Ala Tyr Thr Lys Leu Leu Val Thr Tyr Cys Lys Ser
1055 1060 1065
att ata cgt tat ctc caa gga acg gcg aaa agg acg aca ata ggt 12750
Ile Ile Arg Tyr Leu Gln Gly Thr Ala Lys Arg Thr Thr Ile Gly
1070 1075 1080
ctt aat ata caa aac cct gac cag aaa gct tac acg caa ctc agg 12795
Leu Asn Ile Gln Asn Pro Asp Gln Lys Ala Tyr Thr Gln Leu Arg
1085 1090 1095
caa agt att cta ctt aga tat tat cgt gag gtg gca agt ttg gcg 12840
Gln Ser Ile Leu Leu Arg Tyr Tyr Arg Glu Val Ala Ser Leu Ala
1100 1105 1110
cgt ctt ctg tac cta cat tta tat tta acc gta acg cgt gaa ttt 12885
Arg Leu Leu Tyr Leu His Leu Tyr Leu Thr Val Thr Arg Glu Phe
1115 1120 1125
tcc tgg cgt ttg tac gcc agt caa tct gca cac ccg gac gtg ttt 12930
Ser Trp Arg Leu Tyr Ala Ser Gln Ser Ala His Pro Asp Val Phe
1130 1135 1140
gcg gct tta aaa ttc acc tgg acc gaa cgt cga cag ttc acg tgt 12975
Ala Ala Leu Lys Phe Thr Trp Thr Glu Arg Arg Gln Phe Thr Cys
1145 1150 1155
gcg ttt cat cct gta tta tgc aac cac ggc att gtg tta tta gaa 13020
Ala Phe His Pro Val Leu Cys Asn His Gly Ile Val Leu Leu Glu
1160 1165 1170
ggg aaa cca cta aca gcg tct gcc ttg agg gaa ata aat tac cgc 13065
Gly Lys Pro Leu Thr Ala Ser Ala Leu Arg Glu Ile Asn Tyr Arg
1175 1180 1185
cgc cga gaa ctg gga ctg cct cta gtt aga tgt ggt ctt gtt gaa 13110
Arg Arg Glu Leu Gly Leu Pro Leu Val Arg Cys Gly Leu Val Glu
1190 1195 1200
gaa aac aaa tct ccg ttg gtt caa caa ccc tca ttt tcg gtt cat 13155
Glu Asn Lys Ser Pro Leu Val Gln Gln Pro Ser Phe Ser Val His
1205 1210 1215
tta cca cgg tcg gtg ggt ttt ctt acc cac cac att aag cgt aag 13200
Leu Pro Arg Ser Val Gly Phe Leu Thr His His Ile Lys Arg Lys
1220 1225 1230
tta gac gca tat gcg gtc aaa cat cct caa gaa ccg aga cat gta 13245
Leu Asp Ala Tyr Ala Val Lys His Pro Gln Glu Pro Arg His Val
1235 1240 1245
cga gcg gat cat cct tac gca aaa gtt gtt gaa aat aga aac tac 13290
Arg Ala Asp His Pro Tyr Ala Lys Val Val Glu Asn Arg Asn Tyr
1250 1255 1260
ggt agt agc atc gaa gct atg att tta gca cct ccg tcc cca tcc 13335
Gly Ser Ser Ile Glu Ala Met Ile Leu Ala Pro Pro Ser Pro Ser
1265 1270 1275
gag atc ctg ccg ggg gac cca cca cgc cca ccc acg tgt ggg ttt 13380
Glu Ile Leu Pro Gly Asp Pro Pro Arg Pro Pro Thr Cys Gly Phe
1280 1285 1290
tta acg cgt taa acgtcattgg ggtagagggt gtaaataaat tacgaaaacg 13432
Leu Thr Arg
1295
tgcatgcgtt ttttattttt acaatgcgcc gtatatggta tgtctgtcat gtgctctaaa 13492
gtcccatata taaaagaagc cccaacgagt gtatgcgtat tgcgtaccgc gaccctggga 13552
tgttttacag gcgcgtttgt ttgtctcggt tataagt atg cag tcg ggt cat tat 13607
Met Gln Ser Gly His Tyr
1300
aac cgg agg caa tcc cgc cga cag cgg ata tcg tct aat acc aca 13652
Asn Arg Arg Gln Ser Arg Arg Gln Arg Ile Ser Ser Asn Thr Thr
1305 1310 1315
gac tcc ccc cgt cac aca cac gga aca cgt tat cgg tca acc aat 13697
Asp Ser Pro Arg His Thr His Gly Thr Arg Tyr Arg Ser Thr Asn
1320 1325 1330
tgg tat aca cac cca ccc cag ata ttg tcc aat tca gaa aca tta 13742
Trp Tyr Thr His Pro Pro Gln Ile Leu Ser Asn Ser Glu Thr Leu
1335 1340 1345
gtt gcg gtt caa gaa cta ctg aac tcc gag atg gat cag gac agc 13787
Val Ala Val Gln Glu Leu Leu Asn Ser Glu Met Asp Gln Asp Ser
1350 1355 1360
agt tct gac gca tcg gat gat ttt ccg gga tac gcc tta cat cat 13832
Ser Ser Asp Ala Ser Asp Asp Phe Pro Gly Tyr Ala Leu His His
1365 1370 1375
tct aca tat aat gga tcc gaa caa aat aca tca act tcc aga cat 13877
Ser Thr Tyr Asn Gly Ser Glu Gln Asn Thr Ser Thr Ser Arg His
1380 1385 1390
gaa aat cgc ata ttt aaa tta acg gag agg gaa gct aat gag gaa 13922
Glu Asn Arg Ile Phe Lys Leu Thr Glu Arg Glu Ala Asn Glu Glu
1395 1400 1405
atc aac atc aat acg gac gcg atc gac gac gag gga gag gcg gag 13967
Ile Asn Ile Asn Thr Asp Ala Ile Asp Asp Glu Gly Glu Ala Glu
1410 1415 1420
gag gga gag gcg gag gag gac gcg atc gac gac gag gga gag gcg 14012
Glu Gly Glu Ala Glu Glu Asp Ala Ile Asp Asp Glu Gly Glu Ala
1425 1430 1435
gag gag gga gag gcg gag gag gac gcg att gac gac gag gga gag 14057
Glu Glu Gly Glu Ala Glu Glu Asp Ala Ile Asp Asp Glu Gly Glu
1440 1445 1450
gcg gag gag gga gag gcg gag gag gac gcg att gac gac gag gga 14102
Ala Glu Glu Gly Glu Ala Glu Glu Asp Ala Ile Asp Asp Glu Gly
1455 1460 1465
gag gcg gag gag gga gag gcg gag gag gga gag gcg gag gag gga 14147
Glu Ala Glu Glu Gly Glu Ala Glu Glu Gly Glu Ala Glu Glu Gly
1470 1475 1480
gag gcg gag gag gac gcg atc gac gac gag gga gag gcg gag gag 14192
Glu Ala Glu Glu Asp Ala Ile Asp Asp Glu Gly Glu Ala Glu Glu
1485 1490 1495
gac gcg gcg gag gag gac gcg atc gac gac gag gga gag gcg gag 14237
Asp Ala Ala Glu Glu Asp Ala Ile Asp Asp Glu Gly Glu Ala Glu
1500 1505 1510
gag gat tat ttt tct gta agt caa gtt tgc agt cga gac gcg gat 14282
Glu Asp Tyr Phe Ser Val Ser Gln Val Cys Ser Arg Asp Ala Asp
1515 1520 1525
gag gtt tat ttt acg tta gac ccg gaa ata agt tac agt acc gat 14327
Glu Val Tyr Phe Thr Leu Asp Pro Glu Ile Ser Tyr Ser Thr Asp
1530 1535 1540
ctt cgc att gca aag gtt atg gag cct gcg gta tca aag gaa ctt 14372
Leu Arg Ile Ala Lys Val Met Glu Pro Ala Val Ser Lys Glu Leu
1545 1550 1555
aat gta tca aaa cgt tgt gtt gaa cct gtt acc cta aca ggc tct 14417
Asn Val Ser Lys Arg Cys Val Glu Pro Val Thr Leu Thr Gly Ser
1560 1565 1570
atg tta gcg cat aat ggg ttt gat gag tcc tgg ttt gct atg cgc 14462
Met Leu Ala His Asn Gly Phe Asp Glu Ser Trp Phe Ala Met Arg
1575 1580 1585
gaa tgt acc cgt cgc gaa tat att acg gtc caa gga tta tac gac 14507
Glu Cys Thr Arg Arg Glu Tyr Ile Thr Val Gln Gly Leu Tyr Asp
1590 1595 1600
cca att cat tta cgg tat cag ttt gat act tcc cgg atg aca ccc 14552
Pro Ile His Leu Arg Tyr Gln Phe Asp Thr Ser Arg Met Thr Pro
1605 1610 1615
cca cag att ttg aga act ata cca gcc ctt cct aac atg aca ctt 14597
Pro Gln Ile Leu Arg Thr Ile Pro Ala Leu Pro Asn Met Thr Leu
1620 1625 1630
ggt gaa ctt tta ttg att ttt cct att gaa ttt atg gcc cag cca 14642
Gly Glu Leu Leu Leu Ile Phe Pro Ile Glu Phe Met Ala Gln Pro
1635 1640 1645
att tct ata gaa cgt att tta gtt gaa gat gta ttt tta gat agg 14687
Ile Ser Ile Glu Arg Ile Leu Val Glu Asp Val Phe Leu Asp Arg
1650 1655 1660
cgg gct tcc agt aaa aca cat aaa tac ggc ccg cgt tgg aat tcc 14732
Arg Ala Ser Ser Lys Thr His Lys Tyr Gly Pro Arg Trp Asn Ser
1665 1670 1675
gtc tac gca ctt cca tat aat gcg ggt aaa atg tat gta caa cac 14777
Val Tyr Ala Leu Pro Tyr Asn Ala Gly Lys Met Tyr Val Gln His
1680 1685 1690
att cct ggg ttt tat gac gtg tcc tta cgt gct gtg ggc caa gga 14822
Ile Pro Gly Phe Tyr Asp Val Ser Leu Arg Ala Val Gly Gln Gly
1695 1700 1705
acg gcc att tgg cat cac atg ata tta tcc aca gca gca tgc gct 14867
Thr Ala Ile Trp His His Met Ile Leu Ser Thr Ala Ala Cys Ala
1710 1715 1720
gac gcg gca att cat att agc gca aac tgt att ttt ttg gga cgt
Asp Ala Ala Ile Arg Ile Ser Ala Asn Cys Ile Phe Leu Gly Arg
1740 1745 1750
aac gat aat ttt ggc gtg ggg gat cca tgt tgg tta gaa gac cat 15002
Asn Asp Asn Phe Gly Val Gly Asp Pro Cys Trp Leu Glu Asp His
1755 1760 1765
ctt gcc gga tta cca cga gaa gcc gta ccc gac gta ctc caa gtg 15047
Leu Ala Gly Leu Pro Arg Glu Ala Val Pro Asp Val Leu Gln Val
1770 1775 1780
aca cag ttg gtt ttg cca aat cgg ggt cca acg gtt gcc att atg 15092
Thr Gln Leu Val Leu Pro Asn Arg Gly Pro Thr Val Ala Ile Met
1785 1790 1795
cgt ggt ttt ttt ggg gcg ttg gca tat tgg ccc gaa cta aga att 15137
Arg Gly Phe Phe Gly Ala Leu Ala Tyr Trp Pro Glu Leu Arg Ile
1800 1805 1810
gct ata agt gaa cca tct aca tct ttg gtg cga tat gct acc ggt 15182
Ala Ile Ser Glu Pro Ser Thr Ser Leu Val Arg Tyr Ala Thr Gly
1815 1820 1825
cac atg gaa ctt gcc gaa tgg ttt tta ttt tca cgt aca cat agt 15227
His Met Glu Leu Ala Glu Trp Phe Leu Phe Ser Arg Thr His Ser
1830 1835 1840
tta aag cca caa ttt acc cca acg gaa cgg gaa atg tta gcg tca 15272
Leu Lys Pro Gln Phe Thr Pro Thr Glu Arg Glu Met Leu Ala Ser
1845 1850 1855
ttt ttt acg ttg tat gtt act ctt ggt gga gga atg ttg aac tgg 15317
Phe Phe Thr Leu Tyr Val Thr Leu Gly Gly Gly Met Leu Asn Trp
1860 1865 1870
atc tgt aga gca act gca atg tat tta gct gct cct tac cat tcc 15362
Ile Cys Arg Ala Thr Ala Met Tyr Leu Ala Ala Pro Tyr His Ser
1875 1880 1885
cgt tcg gct tac atc gcg gtc tgt gaa tct ctg ccc tat tac tat 15407
Arg Ser Ala Tyr Ile Ala Val Cys Glu Ser Leu Pro Tyr Tyr Tyr
1890 1895 1900
atc ccg gtt aat agt gac ctg tta tgt gat tta gag gta tta ctg 15452
Ile Pro Val Asn Ser Asp Leu Leu Cys Asp Leu Glu Val Leu Leu
1905 1910 1915
tta ggc gag gtc gac ctc cca act gtt tgt gaa tcc tac gca act 15497
Leu Gly Glu Val Asp Leu Pro Thr Val Cys Glu Ser Tyr Ala Thr
1920 1925 1930
att gca cac gaa tta acc gga tat gag gct gtt cgc aca gca gcc 15542
Ile Ala His Glu Leu Thr Gly Tyr Glu Ala Val Arg Thr Ala Ala
1935 1940 1945
aca aat ttt atg ata gag ttt gcc gat tgt tat aag gaa agt gag 15587
Thr Asn Phe Met Ile Glu Phe Ala Asp Cys Tyr Lys Glu Ser Glu
1950 1955 1960
acc gat tta atg gta agc gcg tac ctg ggg gcc gtt tta ttg tta 15632
Thr Asp Leu Met Val Ser Ala Tyr Leu Gly Ala Val Leu Leu Leu
1965 1970 1975
caa cgg gtg ttg ggt cat gca aat ctt ctt ttg ttg ctt ctc tcc 15677
Gln Arg Val Leu Gly His Ala Asn Leu Leu Leu Leu Leu Leu Ser
1980 1985 1990
ggt gct gcg ttg tac gga gga tgt tca att tac atc ccc cga ggt 15722
Gly Ala Ala Leu Tyr Gly Gly Cys Ser Ile Tyr Ile Pro Arg Gly
1995 2000 2005
att tta gat gca tat aat act tta atg ttg gca gca agt cct ctt 15767
Ile Leu Asp Ala Tyr Asn Thr Leu Met Leu Ala Ala Ser Pro Leu
2010 2015 2020
tac gct cac caa act tta aca tcc ttt tgg aaa gac cgc gat gat 15812
Tyr Ala His Gln Thr Leu Thr Ser Phe Trp Lys Asp Arg Asp Asp
2025 2030 2035
gca atg caa act ttg ggg att cga ccg aca acg gac gtt tta ccc 15857
Ala Met Gln Thr Leu Gly Ile Arg Pro Thr Thr Asp Val Leu Pro
2040 2045 2050
aaa gag caa gac agg ata gtt cag gca tca cct ata gag atg aac 15902
Lys Glu Gln Asp Arg Ile Val Gln Ala Ser Pro Ile Glu Met Asn
2055 2060 2065
ttc cgt ttt gtg gga ttg gag acc atc tat ccc cga gaa cag ccc 15947
Phe Arg Phe Val Gly Leu Glu Thr Ile Tyr Pro Arg Glu Gln Pro
2070 2075 2080
att ccc tcc gtg gac cta gcc gaa aat ctt atg caa tac agg aat 15992
Ile Pro Ser Val Asp Leu Ala Glu Asn Leu Met Gln Tyr Arg Asn
2085 20902095
gaa att ctg ggt ttg gat tgg aaa agc gta gcc atg cat tta cta 16037
Glu Ile Leu Gly Leu Asp Trp Lys Ser Val Ala Met His Leu Leu
2100 2105 2110
cga aaa tat taa gggttgtgat ttttttcatt aggatgaaaa gaacgtttcc 16089
Arg Lys Tyr
2115
tagccacacc cacaaaggag tttgtaaaat aaaatctctg tttagacctt aaaatttgtt 16149
gtgtgtgttg tgtggggggt ccgtgaggat cgacctttac aagatataat ttgtccatat 16209
cgca atg ttt tct cgg ttt gcg cgt tcc ttt tcc agc gat gat aga
Met Phe Ser Arg Phe Ala Arg Ser Phe Ser Ser Asp Asp Arg
2120 2125
acg cgt aaa tct tat gat ggt agt tac caa agt ttt aat gcc ggc 16300
Thr Arg Lys Ser Tyr Asp Gly Ser Tyr Gln Ser Phe Asn Ala Gly
gaa cgt gat ttg ccc aca cct acc cgg gac tgg tgt tct att tcc 16345
Glu Arg Asp Leu Pro Thr Pro Thr Arg Asp Trp Cys Ser Ile Ser
2145 2150 2155
caa cgc ata acc agc gag cgc gtg agg gat gga tgt ctt att cca 16390
Gln Arg Ile Thr Ser Glu Arg Val Arg Asp Gly Cys Leu Ile Pro
2160 2165 2170
acg ccc ggc gag gct ttg gag acg gcg gta aag gct tta tct gaa 16435
Thr Pro Gly Glu Ala Leu Glu Thr Ala Val Lys Ala Leu Ser Glu
2175 2180 2185
Lys Thr Asp Ser Leu Thr Ser Pro Val Leu Gln Ser Thr Glu Arg
2190 2195 2200
cac agt gtt ctg ctt gga tta cac cat aat aat gtt cct gaa tcg 16525
His Ser Val Leu Leu Gly Leu His His Asn Asn Val Pro Glu Ser
2205 2210 2215
ttg gtg gtc tcg tgt atg tct aac gat gtt cat gac ggg ttt atg 16570
Leu Val Val Ser Cys Met Ser Asn Asp Val His Asp Gly Phe Met
2220 2225 2230
cag cgt tat atg gaa aca att caa aga tgt ttg gat gac ctg aaa 16615
Gln Arg Tyr Met Glu Thr Ile Gln Arg Cys Leu Asp Asp Leu Lys
2235 2240 2245
ctt tct ggg gat gga ctt tgg tgg gtt tat gaa aat aca tat tgg 16660
Leu Ser Gly Asp Gly Leu Trp Trp Val Tyr Glu Asn Thr Tyr Trp
2250 2255 2260
cag tat ctc aaa tac acc aca gga gcc gag gta ccg gtg act tca 16705
Gln Tyr Leu Lys Tyr Thr Thr Gly Ala Glu Val Pro Val Thr Ser
2265 2270 2275
gag aag gta aat aaa aag tct aaa tcc acg gtt ttg ttg ttt tca 16750
Glu Lys Val Asn Lys Lys Ser Lys Ser Thr Val Leu Leu Phe Ser
2280 2285 2290
tcc gta gtt gcc aat aaa cca ata tcc aga cat cct ttt aaa tct 16795
Ser Val Val Ala Asn Lys Pro Ile Ser Arg His Pro Phe Lys Ser
2295 2300 2305
aaa gtt ata aat tcg gat tac cgg gga ata tgt cag gag cta cgt 16840
Lys Val Ile Asn Ser Asp Tyr Arg Gly Ile Cys Gln Glu Leu Arg
2310 2315 2320
gag gcg tta gga gct gtg caa aag tat atg tat ttt atg cgt cca 16885
Glu Ala Leu Gly Ala Val Gln Lys Tyr Met Tyr Phe Met Arg Pro
2325 2330 2335
gat gat cct aca aac ccc agc ccg gat aca aga ata cgt gta caa 16930
Asp Asp Pro Thr Asn Pro Ser Pro Asp Thr Arg Ile Arg Val Gln
2340 2345 2350
gaa att gcg gct tac acg gct act ggc tac ggg tgg atg tta tgg 16975
Glu Ile Ala Ala Tyr Thr Ala Thr Gly Tyr Gly Trp Met Leu Trp
2355 2360 2365
ttc ttg gac gtt gtg gac gcc agg gta tgt cgc cat ctc aaa ctt 17020
Phe Leu Asp Val Val Asp Ala Arg Val Cys Arg His Leu Lys Leu
2370 2375 2380
caa ttt cga cgg att cga ggg ccg cgc gcg tct gtt att cca gat 17065
Gln Phe Arg Arg Ile Arg Gly Pro Arg Ala Ser Val Ile Pro Asp
2385 2390 2395
gat ttg ctt aga cga cat tta aaa acg ggt cct gcg gtc tca gcg 17110
Asp Leu Leu Arg Arg His Leu Lys Thr Gly Pro Ala Val Ser Ala
2400 2405 2410
ggc aca gga gtt gcg ttt att tta gca gca aca act gcc agc gct 17155
Gly Thr Gly Val Ala Phe Ile Leu Ala Ala Thr Thr Ala Ser Ala
2415 2420 2425
ctt act gcg ctt ttg cgt att agt gta tta tgg cga aag gaa gag 17200
Leu Thr Ala Leu Leu Arg Ile Ser Val Leu Trp Arg Lys Glu Glu
2430 2435 2440
tgg cgg gat ggt tta aat gga acc gca gct gca att gtt gcg gcg 17245
Trp Arg Asp Gly Leu Asn Gly Thr Ala Ala Ala Ile Val Ala Ala
2445 2450 2455
gtt gaa ctt att acg ctt ttg cac cac cat ttt caa tac tta att 17290
Val Glu Leu Ile Thr Leu Leu His His His Phe Gln Tyr Leu Ile
2460 2465 2470
aat atg atg ctt att gga tat gca tgt tgg ggg gat ggg gga tta 17335
Asn Met Met Leu Ile Gly Tyr Ala Cys Trp Gly Asp Gly Gly Leu
2475 2480 2485
aac gat cct tat ata tta aag gcg cta cgt gcc cag gga cgg ttt 17380
Asn Asp Pro Tyr Ile Leu Lys Ala Leu Arg Ala Gln Gly Arg Phe
2490 2495 2500
tta tat ttt gcg ggt cag ttg gtc aga aca atg tca aca cac agt 17425
Leu Tyr Phe Ala Gly Gln Leu Val Arg Thr Met Ser Thr His Ser
2505 2510 2515
tgg gtt gtg tta gag acc agc acc cat atg tgg ttt tcc cgg gcc 17470
Trp Val Val Leu Glu Thr Ser Thr His Met Trp Phe Ser Arg Ala
2520 2525 2530
gtg gcg cag agt att tta gca cat ggg ggt aaa ccc aca aag tat 17515
Val Ala Gln Ser Ile Leu Ala His Gly Gly Lys Pro Thr Lys Tyr
2535 2540 2545
tat gct cag gtt ctt gcc gcc agt aaa cgg tat act ccg tta cat 17560
Tyr Ala Gln Val Leu Ala Ala Ser Lys Arg Tyr Thr Pro Leu His
2550 2555 2560
tta aga cgt ata tcc gaa cca tcg agt gtg tct gat cag ccg tat 17605
Leu Arg Arg Ile Ser Glu Pro Ser Ser Val Ser Asp Gln Pro Tyr
2565 2570 2575
att cgt ttt aat cga ctg gga tct cca ata ggg aca ggt ata ggg 17650
Ile Arg Phe Asn Arg Leu Gly Ser Pro Ile Gly Thr Gly Ile Gly
2580 2585 2590
aat ttg gaa tgt gtc tgt tta acg gga aat tat tta tct gac gac 17695
Asn Leu Glu Cys Val Cys Leu Thr Gly Asn Tyr Leu Ser Asp Asp
2595 2600 2605
gta aat gca agt tcg cat gta att aat aca gaa gca ccg tta aac 17740
Val Asn Ala Ser Ser His Val Ile Asn Thr Glu Ala Pro Leu Asn
2610 2615 2620
agt ata gca ccc gat aca aat aga cag cgg act tct cgc gtt tta 17785
Ser Ile Ala Pro Asp Thr Asn Arg Gln Arg Thr Ser Arg Val Leu
2625 2630 2635
gtt cgt cca gac acg ggt ttg gat gta act gtc cga aaa aac cac 17830
Val Arg Pro Asp Thr Gly Leu Asp Val Thr Val Arg Lys Asn His
2640 2645 2650
tgt ctg gac ata ggc cat acg gac ggt agt cca gtt gac cca acg 17875
Cys Leu Asp Ile Gly His Thr Asp Gly Ser Pro Val Asp Pro Thr
2655 2660 2665
tat cct gat cat tac acc cgg ata aag gcg gaa tat gaa ggt ccg 17920
Tyr Pro Asp His Tyr Thr Arg Ile Lys Ala Glu Tyr Glu Gly Pro
2670 2675 2680
gtt cgg gat gaa tca aac aca atg ttt gac caa aga tcg gat tta 17965
Val Arg Asp Glu Ser Asn Thr Met Phe Asp Gln Arg Ser Asp Leu
2685 2690 2695
cgt cac ata gaa acc caa gca tct tta aat gat cac gta tat gaa 18010
Arg His Ile Glu Thr Gln Ala Ser Leu Asn Asp His Val Tyr Glu
2700 2705 2710
aat ata cca ccc aag gaa gtg ggt ttt aac tca tct tca gac ctg 18055
Asn Ile Pro Pro Lys Glu Val Gly Phe Asn Ser Ser Ser Asp Leu
2715 2720 2725
gat gtg gat agc ctt aac ggg tac acc tcc gga gac atg cat aca 18100
Asp Val Asp Ser Leu Asn Gly Tyr Thr Ser Gly Asp Met His Thr
2730 2735 2740
gac gat gac tta tca cca gat ttt ata ccc aac gac gtt ccc gtt 18145
Asp Asp Asp Leu Ser Pro Asp Phe Ile Pro Asn Asp Val Pro Val
2745 2750 2755
aga tgt aaa acc acg gtt acg ttt agg aaa aat acg cct aag agt 18190
Arg Cys Lys Thr Thr Val Thr Phe Arg Lys Asn Thr Pro Lys Ser
2760 2765 2770
cat cat taa gtacagcggt taatagatag ttatggacta ggcactttgg 18239
His His
2775
cggtcatttc cacaaccagg ttaaaattgg gggatttggg agaaaatagt ctattgcgta 18299
ttttctgttc aataattgga ctgcgttatt taaaggtctg attggttgat tgggttataa 18359
aaggaattac tcctttaaat tttacttaat gtacccacaa tatcaagtgg tcgtttgtat 18419
ttaacgatta ttaccggtac c atg gga gac ttg tca tgt tgg aca aag gtg 18470
Met Gly Asp Leu Ser Cys Trp Thr Lys Val
2780 2785
ccg ggt ttt acg tta acc ggc gaa ctt cag tac tta aaa caa gtg 18515
Pro Gly Phe Thr Leu Thr Gly Glu Leu Gln Tyr Leu Lys Gln Val
2790 2795 2800
gat gat att tta agg tat gga gtt cgg aaa cgc gat cga aca gga 18560
Asp Asp Ile Leu Arg Tyr Gly Val Arg Lys Arg Asp Arg Thr Gly
2805 2810 2815
atc gga acg tta tct tta ttt gga atg caa gct cga tac aat ttg 18605
Ile Gly Thr Leu Ser Leu Phe Gly Met Gln Ala Arg Tyr Asn Leu
2820 2825 2830
cga aat gaa ttt cct ctt tta act aca aag cgt gtt ttt tgg agg 18650
Arg Asn Glu Phe Pro Leu Leu Thr Thr Lys Arg Val Phe Trp Arg
2835 2840 2845
gcc gtc gtg gaa gag ttg tta tgg ttt atc cgc ggg tca acc gat 18695
Ala Val Val Glu Glu Leu Leu Trp Phe Ile Arg Gly Ser Thr Asp
2850 2855 2860
tcc aaa gaa ctc gcc gct aaa gat ata cac ata tgg gat ata tac 18740
Ser Lys Glu Leu Ala Ala Lys Asp Ile His Ile Trp Asp Ile Tyr
2865 2870 2875
gga tcg agc aaa ttt cta aat agg aat ggc ttc cat aaa aga cac 18785
Gly Ser Ser Lys Phe Leu Asn Arg Asn Gly Phe His Lys Arg His
2880 2885 2890
acg ggg gac ctt ggc ccc att tac ggc ttc cag tgg aga cat ttt 18830
Thr Gly Asp Leu Gly Pro Ile Tyr Gly Phe Gln Trp Arg His Phe
2895 2900 2905
gga gcg gaa tat aaa gac tgt caa tca aac tat tta cag caa gga 18875
Gly Ala Glu Tyr Lys Asp Cys Gln Ser Asn Tyr Leu Gln Gln Gly
2910 2915 2920
atc gat cag ctg caa act gtt ata gat aca att aaa aca aac cca 18920
Ile Asp Gln Leu Gln Thr Val Ile Asp Thr Ile Lys Thr Asn Pro
2925 2930 2935
gaa agc cga cga atg att ata tcg tct tgg aat cca aag gat atc 18965
Glu Ser Arg Arg Met Ile Ile Ser Ser Trp Asn Pro Lys Asp Ile
2940 2945 2950
ccc tta atg gta cta cct cca tgt cac acg tta tgt cag ttt tac 19010
Pro Leu Met Val Leu Pro Pro Cys His Thr Leu Cys Gln Phe Tyr
2955 2960 2965
gtt gca aac ggt gaa tta tcc tgc caa gta tac cag aga tcg ggg 19055
Val Ala Asn Gly Glu Leu Ser Cys Gln Val Tyr Gln Arg Ser Gly
2970 2975 2980
gat atg ggc ctt ggg gta ccg ttc aac att gct gga tat gca ctt 19100
Asp Met Gly Leu Gly Val Pro Phe Asn Ile Ala Gly Tyr Ala Leu
2985 2990 2995
ctt acc tac ata gta gcg cat gtt aca gga ctt aaa acc gga gat 19145
Leu Thr Tyr Ile Val Ala His Val Thr Gly Leu Lys Thr Gly Asp
3000 3005 3010
tta att cat aca atg ggg gat gca cat att tac ttg aat cat ata 19190
Leu Ile His Thr Met Gly Asp Ala His Ile Tyr Leu Asn His Ile
3015 3020 3025
gat gct tta aaa gtg cag cta gct cga tcc cca aaa cct ttt cct 19235
Asp Ala Leu Lys Val Gln Leu Ala Arg Ser Pro Lys Pro Phe Pro
3030 3035 3040
tgc ctt aaa att att cga aat gta aca gat ata aac gac ttt aaa 19280
Cys Leu Lys Ile Ile Arg Asn Val Thr Asp Ile Asn Asp Phe Lys
3045 3050 3055
tgg gac gat ttt cag ctt gat gga tat aat cca cac ccc ccc cta 19325
Trp Asp Asp Phe Gln Leu Asp Gly Tyr Asn Pro His Pro Pro Leu
3060 3065 3070
aaa atg gaa atg gct ctt taa tggattttta aatgttgtca agacagtaga 19376
Lys Met Glu Met Ala Leu
3075
tgtgttgcga atgtaataaa atgatataca cagacgcgtt tggttggttt ctgtttatga 19436
acagcaacgg atgcataggg ttgcgataac tgcgataaga cccaatgtcc caaggataga 19496
tatcacacca attataactg ctacaacgga aaatgtagtg gcgtaggtag atgcatcgta 19556
ggtataaacg gccgaaaacg gagggaattt tttagggtaa ccatctagat gacacgaata 19616
ggtgataggt ccgtcgagtt ccgatgttgg acaagaactt tgcatgttta caaaccgttt 19676
gttttgatca cacaccccag taatctcact gttttcgtgg ttaatgggag aatcgttaac 19736
ccaccatacg aaatgtacaa cgccacgtgg cacacatttt gccgtacata ctatgtgtcc 19796
atcaataata cctatagaca cgttgggaaa tggatagacg tcaggggtaa cgacagcaga 19856
atatttcata ttagagacgc catcccgaat ccataaaaca ttacattgga tggctggggg 19916
tgggtaatcc atttgttttt gctgtggaat tcgtaccgcc gaaacataac taaataatcc 19976
attggcatat tcttgtattg catcggttat aaaatttttt ccgatgttac caaaccttga 20036
agtccaccga acacgtaccg agtgcggtgg ataatacttt gatacgttac agtaggctgc 20096
gtatgtctgt ccggttaaga ctggatcgcc gacaacggta atatttggac gataatacgt 20156
tgtaactgta atactgtgtt ccgatatgac gttcttagtt tttgtattaa cgactcgcca 20216
aatatacgtt ccctccgtgg tagcatccat agataaaatt gttacagaaa aatcagacgt 20276
tgttttaaca tctggtatta cataattttc cttagcgtgt gtaaatatct cagggttgtt 20336
tattaagttt aaatcggcac tgttgctata taacataacc ggtaaatctg gcatgcgtat 20396
taacgcattg cccagttgac ggtgcggatc tataaggtga cgcgtaaacc aaacttcaat 20456
atgaagatcg gggcgtataa gcgacttcca ccttgttata tttgaacctt ccggatctaa 20516
agaatattgt tcatatgttt tttgttgctg cttaaaggcc gcctgttgtc cggtcgttag 20576
acgcatgtaa caaggcatga taaatgtgtg aaaatagggt atggattgta ttccgccgtg 20636
aacgcattgt atattttcat atagaaaagg tggttgtgaa tgttgggtgt tggctgcggg 20696
atcgggcttt cgggaagcgg ccgaggtggg cgcgacggcg ggatcgggct ttcgggtagc 20756
ggccgaggtg ggcgcgacgg cgggatcggg ctttcgggaa gcggccgagg tgggcgcgac 20816
ggcgggatcg ggctttcggg tagcggccga ggtgggcgcg acggcgggat cgggctttcg 20876
ggaagcggcc gaggtgggcg cgacggcggg atcgggcttt cgggaagcgg ccgaggtggg 20936
cgcgacggcg ggatcgggct ttcgggaagc ggccgaggtg ggcgcgacgg cgggatcggg 20996
ctttcgggta gcggccgagg tatataattc agttatactt acgggtgtgg gttgagattc 21056
agtcgataat tgtatacacg cgatcgttaa aattaaattt atttgtatcc gcttcatcct 21116
ggtttttatt gacacatcca cgctcccctt aaataaaaga ttaaaacacc caccgcggaa 21176
tttaaatgat ggaaacgttt ttttcgacat tgggaataat aaaaacggct tttgcaactt 21236
taaaaacttt atttatctcg attacgatac atatgtacca catagatagc atagatttat 21296
tataatataa acacacacgt gatatacttt agtgatatga gatgccataa aacagtcaat 21356
aggtttaacg cttagtctca tcatctgaat acacgtcaaa cccgccgcaa ctgttgatgt 21416
tagaattata atagctcccc atgaaatgcc ggcaaatgtt acagctatac ccgtcaccga 21476
ggtcgttgta tataatacaa ttacccatag gttttttttt tcttgatata aaacggcaaa 21536
accctgtaac ccaaatgcta taatatgacc tcctattgaa actgctaacg ttacttgtgt 21596
aagtttgata aaatgattta atttaattat atgtgagatt gcccacatta atggggtaac 21656
tatatataac accgggggta taacagacat tatacgaatt cctttaaaca cgcgtttaag 21716
ggtccgggaa ctttctcgat ggtcacatac tctcccgcgg tcattttgtg tatatacaac 21776
ggcaaaacct aaatctgtat aagtgtttaa ttgcttatgg cgatttttac gatatataca 21836
cgtatcttgc aaatcggtgg cggcatcgac aattgaaact agtgtgacaa tagatataca 21896
caatccaata agaacctcat atttactgac atacatatat aaaataacgg ttagtaaacc 21956
tcccaaccca gttcccaaca tcataacata aaaataaata tgcggtccat tgaatgtcgt 22016
aacaaagttg tagtaatgga tatgcacagc agccactgtt ccggtaatcg cggatatgga 22076
aattcccagt aattctacaa atggaagatc ccgggatatt gggcaaccaa ccgcccataa 22136
cacagcaaaa cccaacacga ccaccgtctg caaacatcgt cccaattttg ctaatgtgcg 22196
tagaaatttc acggatgttg gccataaccc cgaaacgacg atcaacccca taatagttgc 22256
attgacggca gcttcgcaga cgtgatattg taaaattaac ccggacgtga taacgcttgc 22316
ttgtagtccc acgagaaaca accgcgatgc tgaggttatt gcacacgaat tacattcttg 22376
agggtttccg acacatcctt ggattgattg agcgcggatt aattctctgt ctaacacacc 22436
caggttttca tcatggacag ctctttcacc attcacggcc atgtcttaag tttaataatt 22496
caaaacaaat aaaaatgtgt tcatctatgg tacacacaag tttgtatgta aaatataagc 22556
aaaagttgca cttatttaac tgtacatatt acgtcagatt cacgtgataa ttcagaataa 22616
tccagggttc ctgcagggtc cactggagga gccacacaat attcgcgaat tccgattccc 22676
tcctgccatg tggtttcggg gagtttcccc cccattttat ttccggtatt tttttcgttt 22736
ctttttgtta ataaattgcg tctttttttt aatggtggtt catccttcac agattccatg 22796
ttcgcaaata attgcatcga ggttaatttt tctttaaggt ctttgggact taagaacgtt 22856
gcataaaaaa aagaatgcac gggtgcggaa cgttggatat acaatccaac catgggggag 22916
ttagttaagg cgagataaaa attaatataa cacgtctcat cccgtgttaa cttaagattt 22976
tgtacggcag aacggaatcc actgtgtgtt tccaataata ctccaaattc acgcatactc 23036
ccgctgccat aaacaacatt attaaggatc ctttttgaat ttgtgattga gcgtattaaa 23096
ttatatggtg taggcttgct tccgtttata tccaaggaaa cattaaatga gataaaacca 23156
cccccggcgg tctggatgta catatccgtg gctgttagaa tgaagcatgt tgtaaaccca 23216
aaagttttaa gtagtcgctg taaacgggtg aattgatcgc gttttaagca aatgcttata 23276
tctggagtta gatttggaaa catcattgta taacaagcga gttcacgttt tacaacttgt 23336
ttgtaacatt gtacttgatc atctggacca caatcacccg ggcgttgcca taccatcgtt 23396
tggataatac tccgctcggg gggttgtccg gtaaatttaa aatataaccg tgttggggtc 23456
gacggatctt ttgtatggcg aaacgcgtca ataagcgagg accgtccctc cgttgccgcg 23516
agtacaacca ttctcggccc agtccaatta tactggtcaa acatatttgc cggtatagga 23576
atatacagtt gttctgtttc caaactacag tgaataatta atccttcgtc gctgaatatt 23636
aaaatagaat cccttagtct attaaccaga ggtgatatag acgaaattaa accagtaagc 23696
gttttttccg ttaaaacagc tctggcgatt tctggggcgt caaaacccgc atgcaattcc 23756
atgtccaaag catcgtctgt acgcgacctc aaatccataa tttactactt aaaatgttta 23816
ctatagaaaa agtaatcata tgtaaacaca cgagtttcgt taatatgttt gtttaacccg 23876
atccggtgac ttaagtacat aaacaggcat gatatttgaa tagtacggcc catgggaggg 23936
aacatttcca cgtgttccaa tacagggggt gttccttaat agggactgtg caataaaata 23996
cgtaagaagt taccagattt gatgtaatgt ttgtcataaa aaatatgtac atcattatat 24056
acgtctgtaa ttaacacaag atcacatcga agaattactg aagccgctgt gaaacctttc 24116
acaagacgat ataaacttgg ttaagtgtat tg atg ggg ctc ttt gga ctg aca 24169
Met Gly Leu Phe Gly Leu Thr
3080
cgc ttt atc cat gaa cat aaa ctg gtt aaa ccc agc atc att tca 24214
Arg Phe Ile His Glu His Lys Leu Val Lys Pro Ser Ile Ile Ser
3085 3090 3095
acg cca ccc gga gtt tta acc ccc gtg gcg gta gac gta tgg aac 24259
Thr Pro Pro Gly Val Leu Thr Pro Val Ala Val Asp Val Trp Asn
3100 3105 3110
gtc atg tac aca ttg ttg gaa cgt tta tac cct gtg ggt aaa cgc 24304
Val Met Tyr Thr Leu Leu Glu Arg Leu Tyr Pro Val Gly Lys Arg
3115 3120 3125
gag aat tta cac gga cca tct gta acg ata cat tgt ctt gga gtc 24349
Glu Asn Leu His Gly Pro Ser Val Thr Ile His Cys Leu Gly Val
3130 3135 3140
tta ttg cgg cta tta aca caa cgg tca tac tat ccg ata ttt gta 24394
Leu Leu Arg Leu Leu Thr Gln Arg Ser Tyr Tyr Pro Ile Phe Val
3145 3150 3155
ttg gaa cgt tgt aca gac ggc cca tta tca cgt gga gcc aag gca 24439
Leu Glu Arg Cys Thr Asp Gly Pro Leu Ser Arg Gly Ala Lys Ala
3160 3165 3170
att atg tca cgg gcc atg aac cac gat gaa agg gga acc tcg gac 24484
Ile Met Ser Arg Ala Met Asn His Asp Glu Arg Gly Thr Ser Asp
3175 3180 3185
tta acc cgt gtt cta cta tca tcc aac aca tca tgt tct atc aag 24529
Leu Thr Arg Val Leu Leu Ser Ser Asn Thr Ser Cys Ser Ile Lys
3190 3195 3200
tat aac aaa aca tcg gaa aca tat gac agt gtg ttt cga aac tct 24574
Tyr Asn Lys Thr Ser Glu Thr Tyr Asp Ser Val Phe Arg Asn Ser
3205 3210 3215
tcc acg agt tgt att cct agc gaa gaa aac aaa tcc cag gat atg 24619
Ser Thr Ser Cys Ile Pro Ser Glu Glu Asn Lys Ser Gln Asp Met
3220 3225 3230
ttt ttg gac ggt tgt cca cga caa act gac aag acg atc tgc ctg 24664
Phe Leu Asp Gly Cys Pro Arg Gln Thr Asp Lys Thr Ile Cys Leu
3235 3240 3245
cgc gac caa aac gta tgc agt ctt acc tct aca atg cca tcc cga 24709
Arg Asp Gln Asn Val Cys Ser Leu Thr Ser Thr Met Pro Ser Arg
3250 3255 3260
gga cat cct aac cat cga tta tat cac aaa ttg tgt gca agt ctt 24754
Gly His Pro Asn His Arg Leu Tyr His Lys Leu Cys Ala Ser Leu
3265 3270 3275
att aga tgg atg ggg tat gca tac gtc gag gcg gtt gac att gag 24799
Ile Arg Trp Met Gly Tyr Ala Tyr Val Glu Ala Val Asp Ile Glu
3280 3285 3290
gcg gac gag gca tgt gca aac tta ttt cat acg cgt aca gtg gct 24844
Ala Asp Glu Ala Cys Ala Asn Leu Phe His Thr Arg Thr Val Ala
3295 3300 3305
ttg gtt tat acg aca gat act gat tta ctc ttc atg ggc tgt gat 24889
Leu Val Tyr Thr Thr Asp Thr Asp Leu Leu Phe Met Gly Cys Asp
3310 3315 3320
att ttg tta gat gca att cct atg ttt gct cca gta gta cga tgt 24934
Ile Leu Leu Asp Ala Ile Pro Met Phe Ala Pro Val Val Arg Cys
3325 3330 3335
cgc gat ttg ctt caa tat tta gga att aca tac cct gaa ttt ttg 24979
Arg Asp Leu Leu Gln Tyr Leu Gly Ile Thr Tyr Pro Glu Phe Leu
3340 3345 3350
gtt gcc ttt gtt cgc tgt cag acc gat ttg cat aca agt gac aac 25024
Val Ala Phe Val Arg Cys Gln Thr Asp Leu His Thr Ser Asp Asn
3355 3360 3365
cta aaa tct gtt cag caa gtt att cag gat acc ggc ctg aaa gtt 25069
Leu Lys Ser Val Gln Gln Val Ile Gln Asp Thr Gly Leu Lys Val
3370 3375 3380
cca cat caa atg gac act tca acg cgc tcc ccc act tac gac tcg 25114
Pro His Gln Met Asp Thr Ser Thr Arg Ser Pro Thr Tyr Asp Ser
3385 3390 3395
tgg aga cat ggc gag gtt ttc aaa agt ctt acc gta gcc acg tcg 25159
Trp Arg His Gly Glu Val Phe Lys Ser Leu Thr Val Ala Thr Ser
3400 3405 3410
ggt aaa aca gaa aac gga gtg tcc gtt tcc aaa tat gca tct aac 25204
Gly Lys Thr Glu Asn Gly Val Ser Val Ser Lys Tyr Ala Ser Asn
3415 3420 3425
cga tcg gag gtg aca gta gac gcc agt tgg gct tta aac ctt ctg 25249
Arg Ser Glu Val Thr Val Asp Ala Ser Trp Ala Leu Asn Leu Leu
3430 3435 3440
cca ccc tca tcc tcc cca ttg gat aat ttg gaa cgc gca ttt gtt 25294
Pro Pro Ser Ser Ser Pro Leu Asp Asn Leu Glu Arg Ala Phe Val
3445 3450 3455
gaa cat ata atc gcc gtg gta act cca ttg acc cgc ggt cgc cta 25339
Glu His Ile Ile Ala Val Val Thr Pro Leu Thr Arg Gly Arg Leu
3460 3465 3470
aag tta atg aaa cgt gta aat att atg caa aat acg gca gac cca 25384
Lys Leu Met Lys Arg Val Asn Ile Met Gln Asn Thr Ala Asp Pro
3475 3480 3485
tat atg gtt att aac acc tta tat cat aac tta aag ggg gaa aaa 25429
Tyr Met Val Ile Asn Thr Leu Tyr His Asn Leu Lys Gly Glu Lys
3490 3495 3500
atg gct cgc caa tac gca cgt att ttt aaa cag ttt att cct act 25474
Met Ala Arg Gln Tyr Ala Arg Ile Phe Lys Gln Phe Ile Pro Thr
3505 3510 3515
cca ctc cca cta aac act gta tta aca aaa tat tgg aat taa 25516
Pro Leu Pro Leu Asn Thr Val Leu Thr Lys Tyr Trp Asn
3520 3525 3530
aacacacata agagcgactt aatggttcat tgttttattt tgctcgtata tacatgttat 25576
aaatcgttta tcactgtgcc cgcataagat gtactgtgtc tctcaaaaaa atttgtgttt 25636
ttatctgcaa tcataaatgc aagtggaaag tccgaatcgg gaggtggggt gttaaatagt 25696
tttggtacat taatcgctga taaaagcctg tccgcgctga atttcacgta ttgtgtaatt 25756
gcatcgacgt tcaccaaacg ggttttgggt gcatgggatt ttaaaaacgc acactcgatt 25816
tcaacggctt ccgaaaacag ttgatgtatt ctggtgatag cgggtttttc gggtacatag 25876
ttattgtata tacaacacga tgcgctggta tgtatggctt catctcggct tataaggtcg 25936
ttaaattgac aagttacaac aaatagtccg ttattgcgta aatatgcaat agccgcgaac 25996
gatgatacaa aaaaaatgcc ctctataaga atcattagta tatatttttc tgcaacggat 26056
gggttgtccc gtaccttttc ttccaaccat tgtacttttt gttggatcga cggattatta 26116
atagtgacat ttacgtattg tacccgcaac gattcatccc ctctgaacaa cattagttga 26176
atttgactat agacacgcgc gtggacaacc tcgatgcact cttgttcaat gtagtaatgg 26236
tgaatatcct tttgggaaaa gagttgggtt agagagccca aattaacatt taccagatca 26296
tctgccgccg ataaaaatgt aaaaataaat ctgtagaata ttagttcatc ttccgttaaa 26356
cagtccaagt attgataatc atcttcaatg ataaaatcgc tttctaacca acgattcgaa 26416
atgctcaggg cacgtaaatt gtttatatct ggacactccg gcctgtaaaa aaaatgactg 26476
caatctttct gatccatttt ggaatagttt cccgtgtaaa tttataaagc acaactggta 26536
caggttaatt cgcctcccgc aaacagtccg ctgttcgtag ctttacgaat tttacagtag 26596
tacatacccg ttttaaggcc ggctttatag gcacgtataa gcaaattcat tattttggag 26656
gcgggaattg tcccgtctgg gcgttcctca ataaataaag tcattgattg actttggtca 26716
ataaatggcg ccctttctgc acacatatca acgagatcct cttgctcata ttcaaacgct 26776
gttttatatt ttaagagtgg gtgactatta gataaacagc caaacgaacg tattactgac 26836
cattggtttt tctcaagtat gtttataact tccagtcgtt tttcttcaca tgaatacata 26896
tctcttagtt cgtccataag gtctaagttg ggtctaagta actcacccga ggtggtgacc 26956
ttactaaaca tattattata aattggagag aaaccctcac tgcactccgt tacctgtgca 27016
gatgaaactg tgggcattaa cgctaagaac tgcgagttgt ataacccata agcgcaaata 27076
tcatctcgca gggtacacca tggtaaatct aaataactta tcgtagaaaa cccatcttgg 27136
tgtaaccatc ccttagcata tttactttcg gtaaaaccct taaacggggc taagccgcca 27196
atcttacaca tttccatgct tgttttcatt gtctcataca acattaactc cgctatttgt 27256
acatttaacc gtctagctgg ttgggaagtt aaatcaaatc ctaagcggag acaagttgta 27316
tgtaaccctt gtatgccaat gccaagtgat cggttgtttt ttacaccttt acatgatttt 27376
ttacatggaa agttcccagc cgccaggacc ccgtttaaaa aaataacagt cgttcttgct 27436
gtcaattgaa ggtcgtttaa attaaatgac actgggcctt tggataagca cgttgtaaga 27496
tttatgctgg caagattaca tacgccatgt tgatgagcgt ctgccttttg aacaatttcc 27556
gtacacaaat ttgaccccgt gatagcattt ccttgggtat tcatatgata attacgatta 27616
caggcatctt tgaacattaa aaaggggctt cctgttacag cagcactgcg tatgattgtg 27676
aatgcgatat cttgaatggg aacagaagaa acgcctaatc cttctctctc taaacgtaaa 27736
taggttgaag tgaatgcctc cccgtgtaat gttcgaagga tatcggctct gttatcaaaa 27796
agagtccact gaacattact agcccctttt agatagctta ggtatctttc aaaaaataaa 27856
tctggggtcc ataaacaaca aaatatgtta tcacatcgaa atatttcatc acgaaccaac 27916
attccacgtg tggccaaaac agtttgtaga tcgacgtgcc atggttctat gtaaacacaa 27976
actccagttg gtcgttcaca atcactgtta attgccataa ccatgcaatc taaaagtttt 28036
aaaactgcaa gaagaccttt cgtttgattt tccgtaggta ttaaattcag actctgtaga 28096
gaaattccca ctccacctcg actttgtaat accgttccca catcgcctgt gatagctcga 28156
acagctctcc caacagtgat ggattccggg tccattaaat aacaactggc cgttgccccg 28216
gtctctcgac ctaaaaacat cataaccggt gtagccggga caattttctg acatgccaac 28276
gctgtgaaaa atacccgaca gacatcagtc catgtataac catcatttat tccgggaata 28336
agagttgcga ttttaggcag gtttacgatt tctgttgtca cggtggccgc cagtcttaaa 28396
aagaattggc aaagcgactc taatttacct tcctctaact tagttaaata aaagtcttcg 28456
tactttaaag cagactgtag tccaagggta gctaaagcgg ggtattgatc tttcaaaaac 28516
ggttctaata tagcccgacg aatttcgtcc ctccgccctt caattgcttg gcggactcgg 28576
ggagttaaac agagaattgg ggaagtcaac cacgtttcca tggaaacgga tcgtaggtta 28636
atacggcaat ggataagttc tccacaacat cggtacactc gctcatcttg tcgcgtcacc 28696
gccttaagtt ttgagacgat agtgctaata tactccatta attccaccgg tgtggttgat 28756
tcgggcggaa tgatgtattc cttgtagcca tgttgacata atcggtttat aatgtcatga 28816
accgtattaa aaattctttt gaactccata acggataacg tatttaggct ccggaataaa 28876
cctttaaacc ctaaactcac agctgagtta gttctacaat attgtagact cccttatata 28936
tggttacgta cagcctgccc ctccccagta tataatatca cgcaaaaccc acgctatgtt 28996
aaattcagtt tattttacat acatgcttta ataataacat tcgttccatg tatttgtacc 29056
cccccacaca accccctcta accaaatagt tggcacgtta taacctccga accgttccat 29116
gcgtcttgta taacgcacag actctgatgg aattgttcca attaacgtat atgccgcata 29176
catgcaggat aattgtgtgg gaagtccccg aaaatcgccg gtccattgat acaatcgctg 29236
tctagccaag ttccaattta ctcctgtaat ttcgccaata ctacatcgag ggcttgtcgg 29296
gtcattggat aactgcacaa gcggcaacgc ccttgtgtta tatggctggt gggtatttgc 29356
aaccccttca gtcccccagg cggcattttc agctcgtatg cgtcctaaca ggaagccaat 29416
accacgacca aaacattgtt cgtttagttg gcttaatgca agatgcagtc ttacaccttc 29476
tcgttggcgt cgctgtgtat atacaaaaac caagaacaca tgcttcagtc cgtccgcgga 29536
aagatgtaaa tctttgtcaa cgtcccaaaa tacgcaggcc gggatgttgg ctgtgaccct 29596
gcgagttgaa gttttgtctg tacgtgcagc ttcttgggga cctttggcca cggcggttat 29656
attgcataaa ttatcctgaa tggtatattc cagcagggac ccaaaaaaac ttataaatcg 29716
atgtggaaat acatgacatt gtaccatcgc acgtaaacac tccgaaaacc ttatgagccg 29776
cgtttccata cgactgcatc cataggcaga aacaattgct gttctgttgg catccgctgc 29836
ctgtttatcc gtatattctt ctgcccggca tgcggcgatg aaacttaatg acgttacata 29896
tgctctaagc cccccacctt ctccaacggt ccaaggagcc gtgcaggcat tgaataggtt 29956
tcgtaaaccc tctagtagta catcggggtc acgtccagcc tgtgtaagtg tattagcttc 30016
tccaatcatg tcagatggat gacgaaggat taagacgatt gacccagcat gctcaatgtc 30076
cggacgaaaa aaatcggtta atgacacttg ttggattagc tgtgtcgttg atttaaaatt 30136
atttaacggg agtctaatgg taacttgcgg gttaccaatt gaagttggat ttatttgaat 30196
gttgttcata cgattaataa caattgaacg gggggttact tgaatagacg cggtttctgt 30256
acgttttggt ggtacatgta tcggttgttt gttcagacct ccaaagcgag ggccaattgt 30316
taaatcgcga ctccaatttc cgaagaagcc cggagcataa gtcatatgaa gcccgttccc 30376
tatttgaata aaacggttat ttcctaaaag actgatatta gttccacata gcgtttgttc 30436
gtttaaagta aaatgcgagt tggttggttg actccccata gctgaggggt taaattcaca 30496
caatgcaatc gtgacgtggt actatctgaa atgttgcctg gggtatgtgt acacattata 30556
cagtcgtagt accgtttata taatgttagg taggaggagc ctataaaaat attttgattg 30616
gcgttaaaag gttcttcaac ttaccgtgac gtccttttta ttaacatgcg tttttattga 30676
tgttacattt atgtcttttc attccggacg gatgtagctt tttcatatca cgttataaag 30736
ttaagtcagc gtagaatata cc atg gaa gaa cca att tgt tat gat aca 30785
Met Glu Glu Pro Ile Cys Tyr Asp Thr
3535 3540
caa aaa ctt ttg gat gat tta agt aac ttg aaa gta caa gaa gcg 30830
Gln Lys Leu Leu Asp Asp Leu Ser Asn Leu Lys Val Gln Glu Ala
3545 3550 3555
gac aac gaa aga cca tgg tca cca gag aaa aca gaa atc gcc aga 30875
Asp Asn Glu Arg Pro Trp Ser Pro Glu Lys Thr Glu Ile Ala Arg
3560 3565 3570
gtt aag gta gtt aag ttt tta cga tct acc cag aaa att cca gct 30920
Val Lys Val Val Lys Phe Leu Arg Ser Thr Gln Lys Ile Pro Ala
3575 3580 3585
aaa cat ttt att cag ata tgg gaa ccc ctg cat tct aat atc tgt 30965
Lys His Phe Ile Gln Ile Trp Glu Pro Leu His Ser Asn Ile Cys
3590 3595 3600
ttt gta tat tcc aat aca ttt ttg gcg gag gct gct ttc acg gcc 31010
Phe Val Tyr Ser Asn Thr Phe Leu Ala Glu Ala Ala Phe Thr Ala
3605 3610 3615
gaa aat tta ccc gga ctg ttg ttt tgg aga cta gat cta gac tgg 31055
Glu Asn Leu Pro Gly Leu Leu Phe Trp Arg Leu Asp Leu Asp Trp
3620 3625 3630
acg ata gag gag cca ggt aat agc tta aaa att tta acc cag cta 31100
Thr Ile Glu Glu Pro Gly Asn Ser Leu Lys Ile Leu Thr Gln Leu
3635 3640 3645
tca agt gta gta caa gat tcc gag acg tta cat cgt tta tcg gcc 31145
Ser Ser Val Val Gln Asp Ser Glu Thr Leu His Arg Leu Ser Ala
3650 3655 3660
aat aaa tta cga acc tcg tct aaa ttt gga ccc gtt tcg ata cac 31190
Asn Lys Leu Arg Thr Ser Ser Lys Phe Gly Pro Val Ser Ile His
3665 3670 3675
ttc att ata acg gac tgg ata aat atg tac gag gtc gcc tta aag 31235
Phe Ile Ile Thr Asp Trp Ile Asn Met Tyr Glu Val Ala Leu Lys
3680 3685 3690
gat gca aca aca gcc att gaa tca cca ttc act cac gct cgt att 31280
Asp Ala Thr Thr Ala Ile Glu Ser Pro Phe Thr His Ala Arg Ile
3695 3700 3705
gga atg ttg gaa agc gcc att gca gct tta aca caa cat aaa ttt 31325
Gly Met Leu Glu Ser Ala Ile Ala Ala Leu Thr Gln His Lys Phe
3710 3715 3720
gcg atc att tac gat atg cca ttt gtt caa gag ggg att cgt gtt 31370
Ala Ile Ile Tyr Asp Met Pro Phe Val Gln Glu Gly Ile Arg Val
3725 3730 3735
tta aca caa tat gca gga tgg ctt ctt ccg ttt aat gtt atg tgg 31415
Leu Thr Gln Tyr Ala Gly Trp Leu Leu Pro Phe Asn Val Met Trp
3740 3745 3750
aat cag att caa aat agc tca ctc act cct cta aca cga gcc ctt 31460
Asn Gln Ile Gln Asn Ser Ser Leu Thr Pro Leu Thr Arg Ala Leu
3755 3760 3765
ttt ata atc tgt atg att gat gaa tat ctc acg gaa acg cca gta 31505
Phe Ile Ile Cys Met Ile Asp Glu Tyr Leu Thr Glu Thr Pro Val
3770 3775 3780
cat agc ata tca gaa tta ttt gca gat act gta aat tta att aaa 31550
His Ser Ile Ser Glu Leu Phe Ala Asp Thr Val Asn Leu Ile Lys
3785 3790 3795
gat gag gcg ttc gta tcc atc gaa gaa gcg gta acg aat cca cga 31595
Asp Glu Ala Phe Val Ser Ile Glu Glu Ala Val Thr Asn Pro Arg
3800 3805 3810
acg gtg cac gag tca cga att tcc tca gct ctg gct tat cga gac 31640
Thr Val His Glu Ser Arg Ile Ser Ser Ala Leu Ala Tyr Arg Asp
3815 3820 3825
cct tat gtt ttt gag aca tcc ccg gga atg ctt gct agg aga ctt 31685
Pro Tyr Val Phe Glu Thr Ser Pro Gly Met Leu Ala Arg Arg Leu
3830 3835 3840
aga tta gac aat ggt ata tgg gaa agc aac ctc tta tcg ttg tcc 31730
Arg Leu Asp Asn Gly Ile Trp Glu Ser Asn Leu Leu Ser Leu Ser
3845 3850 3855
acc ccc gga att cat att gag gcg ctg tta cat tta cta aac tcc 31775
Thr Pro Gly Ile His Ile Glu Ala Leu Leu His Leu Leu Asn Ser
3860 3865 3870
gac ccg gaa gcg gaa acc aca tct gga agt aat gta gca gaa cac 31820
Asp Pro Glu Ala Glu Thr Thr Ser Gly Ser Asn Val Ala Glu His
3875 3880 3885
acc cgt ggc att tgg gaa aag gtt cag gct agt aca tcg cct agt 31865
Thr Arg Gly Ile Trp Glu Lys Val Gln Ala Ser Thr Ser Pro Ser
3890 3895 3900
atg tta ata agc acc ctt gcc gaa tcc ggg ttt aca aga ttt tca 31910
Met Leu Ile Ser Thr Leu Ala Glu Ser Gly Phe Thr Arg Phe Ser
3905 3910 3915
tgc aaa ttg cta cgt cgg ttt att gct cac cac aca ctc gcc ggt 31955
Cys Lys Leu Leu Arg Arg Phe Ile Ala His His Thr Leu Ala Gly
3920 3925 3930
ttt att cac gga agc gtt gta gca gac gag cat att aca gat ttc 32000
Phe Ile His Gly Ser Val Val Ala Asp Glu His Ile Thr Asp Phe
3935 3940 3945
caa caa aca cta gga tgt ctc gct tta gtg ggt gga ctg gca tac 32045
Gln Gln Thr Leu Gly Cys Leu Ala Leu Val Gly Gly Leu Ala Tyr
3950 3955 3960
caa tta gtg gaa acg tac gct cct act acc gag tat gtg tta aca 32090
Gln Leu Val Glu Thr Tyr Ala Pro Thr Thr Glu Tyr Val Leu Thr
3965 3970 3975
tat aca cgg aca gta aac gag acc gaa aaa cgg tat gaa acg cta 32135
Tyr Thr Arg Thr Val Asn Glu Thr Glu Lys Arg Tyr Glu Thr Leu
3980 3985 3990
tta ccc gcc tta gga tta cca ccg gga ggc ctg gga caa att atg 32180
Leu Pro Ala Leu Gly Leu Pro Pro Gly Gly Leu Gly Gln Ile Met
3995 4000 4005
cgg cgc tgt ttt gct cca cga ccc ctt att gaa agt ata caa gcg 32225
Arg Arg Cys Phe Ala Pro Arg Pro Leu Ile Glu Ser Ile Gln Ala
4010 4015 4020
aca cgc gta ata cta ctt aat gaa att tca cat gca gaa gct aga 32270
Thr Arg Val Ile Leu Leu Asn Glu Ile Ser His Ala Glu Ala Arg
4025 4030 4035
gag aca aca tat ttt aag caa aca cat aat caa tcc tca ggt gcg 32315
Glu Thr Thr Tyr Phe Lys Gln Thr His Asn Gln Ser Ser Gly Ala
4040 4045 4050
tta tta cca caa gca gga caa agt gcc gta cgc gaa gcc gta cta 32360
Leu Leu Pro Gln Ala Gly Gln Ser Ala Val Arg Glu Ala Val Leu
4055 4060 4065
acc tgg ttt gac cta cgt atg gat tca aga tgg ggt att act ccc 32405
Thr Trp Phe Asp Leu Arg Met Asp Ser Arg Trp Gly Ile Thr Pro
4070 4075 4080
ccg gtg gat gtg ggt atg aca cct cct att tgt gtt gat cca ccg 32450
Pro Val Asp Val Gly Met Thr Pro Pro Ile Cys Val Asp Pro Pro
4085 4090 4095
gct aca ggg ttg gaa gct gtc atg ata aca gaa gca cta aag att 32495
Ala Thr Gly Leu Glu Ala Val Met Ile Thr Glu Ala Leu Lys Ile
4100 4105 4110
gca tat cct acc gaa tat aat cgc tct agc gtg ttt gtg gaa ccg 32540
Ala Tyr Pro Thr Glu Tyr Asn Arg Ser Ser Val Phe Val Glu Pro
4115 4120 4125
tcg ttt gtg cct tat att att gca aca agc acg ctt gat gcc ctt 32585
Ser Phe Val Pro Tyr Ile Ile Ala Thr Ser Thr Leu Asp Ala Leu
4130 4135 4140
tcg gca aca ata gct ttg tct ttt gat aca cgg gga ata cag caa 32630
Ser Ala Thr Ile Ala Leu Ser Phe Asp Thr Arg Gly Ile Gln Gln
4145 4150 4155
gcc ttg tct att ctt cag tgg gct cgc gat tat gga tcc gga acc 32675
Ala Leu Ser Ile Leu Gln Trp Ala Arg Asp Tyr Gly Ser Gly Thr
4160 4165 4170
gtg ccc aat gca gat gga tat cgc aca aaa cta tct gct ctt ata 32720
Val Pro Asn Ala Asp Gly Tyr Arg Thr Lys Leu Ser Ala Leu Ile
4175 4180 4185
aca ata tta gaa cct ttt acc cgt aca cac ccc cca gta ctt tta 32765
Thr Ile Leu Glu Pro Phe Thr Arg Thr His Pro Pro Val Leu Leu
4190 4195 4200
cca tct cac gtt tct act ata gat tcc ctt ata tgc gaa ctt cat 32810
Pro Ser His Val Ser Thr Ile Asp Ser Leu Ile Cys Glu Leu His
4205 4210 4215
cgg act gtt ggc att gcc gtt gac ctg ctt ccc cag cac gtc cgt 32855
Arg Thr Val Gly Ile Ala Val Asp Leu Leu Pro Gln His Val Arg
4220 4225 4230
cct ttg gtt cct gac cgt cct tct att aca aat agc gtt ttt tta 32900
Pro Leu Val Pro Asp Arg Pro Ser Ile Thr Asn Ser Val Phe Leu
4235 4240 4245
gca act ctc tat tat gat gaa ctt tac ggt cgt tgg acc cga ctg 32945
Ala Thr Leu Tyr Tyr Asp Glu Leu Tyr Gly Arg Trp Thr Arg Leu
4250 4255 4260
gat aaa aca tcg cag gcg ttg gtt gaa aat ttt aca tcc aac gcg 32990
Asp Lys Thr Ser Gln Ala Leu Val Glu Asn Phe Thr Ser Asn Ala
4265 4270 4275
tta gtg gtt tct cgg tac atg tta atg tta caa aaa ttt ttt gcg 33035
Leu Val Val Ser Arg Tyr Met Leu Met Leu Gln Lys Phe Phe Ala
4280 4285 4290
tgt cgt ttt tat cca acg cca gat ctt cag gct gtt ggt atc tgt 33080
Cys Arg Phe Tyr Pro Thr Pro Asp Leu Gln Ala Val Gly Ile Cys
4295 4300 4305
aac cca aag gtt gaa cgc gat gaa caa ttt ggg gta tgg cgt tta 33125
Asn Pro Lys Val Glu Arg Asp Glu Gln Phe Gly Val Trp Arg Leu
4310 4315 4320
aac gat ctt gct gat gcg gtt ggt cat att gtt ggg aca ata caa 33170
Asn Asp Leu Ala Asp Ala Val Gly His Ile Val Gly Thr Ile Gln
4325 4330 4335
gga atc cga acg caa atg aga gtg gga ata tcc agc ctg cgc aca 33215
Gly Ile Arg Thr Gln Met Arg Val Gly Ile Ser Ser Leu Arg Thr
4340 4345 4350
att atg gcc gat gct tcc tca gcc ctt agg gaa tgt gaa aat tta 33260
Ile Met Ala Asp Ala Ser Ser Ala Leu Arg Glu Cys Glu Asn Leu
4355 4360 4365
atg act aaa acc tcc act tct gct att ggg cct ctt ttt tca acg 33305
Met Thr Lys Thr Ser Thr Ser Ala Ile Gly Pro Leu Phe Ser Thr
4370 4375 4380
atg gct tcc cgg tat gca cgg ttt aca cag gat caa atg gac att 33350
Met Ala Ser Arg Tyr Ala Arg Phe Thr Gln Asp Gln Met Asp Ile
4385 4390 4395
tta atg cgt gtt gac aaa cta aca aca gga gaa aat ata ccc ggt 33395
Leu Met Arg Val Asp Lys Leu Thr Thr Gly Glu Asn Ile Pro Gly
4400 4405 4410
ctt gca aat gta gag att ttt tta aat agg tgg gaa cga ata gca 33440
Leu Ala Asn Val Glu Ile Phe Leu Asn Arg Trp Glu Arg Ile Ala
4415 4420 4425
aca gct tgt agg cat gcc acg gca gtc ccg tcg gcc gaa tct att 33485
Thr Ala Cys Arg His Ala Thr Ala Val Pro Ser Ala Glu Ser Ile
4430 4435 4440
gca acc gtg tgt aat gaa ttg agg cgc ggt tta aaa aat ata caa 33530
Ala Thr Val Cys Asn Glu Leu Arg Arg Gly Leu Lys Asn Ile Gln
4445 4450 4455
gag gat cgt gta aat gcc cca acc tca tat atg agt cac gcc cga 33575
Glu Asp Arg Val Asn Ala Pro Thr Ser Tyr Met Ser His Ala Arg
4460 4465 4470
aat ctg gaa gat cac aag gca gca gtt tca ttc gtt atg gac tcc 33620
Asn Leu Glu Asp His Lys Ala Ala Val Ser Phe Val Met Asp Ser
4475 4480 4485
agg caa cag ttt att gtg gat tct gga cct cag atg ggc gcg gtt 33665
Arg Gln Gln Phe Ile Val Asp Ser Gly Pro Gln Met Gly Ala Val
4490 4495 4500
tta act tca caa tgt aat ata gga aca tgg gag aat gta aat gca 33710
Leu Thr Ser Gln Cys Asn Ile Gly Thr Trp Glu Asn Val Asn Ala
4505 4510 4515
acg ttt tta cat gat aat gtt aaa ata acg aca acg gtc aga gac 33755
Thr Phe Leu His Asp Asn Val Lys Ile Thr Thr Thr Val Arg Asp
4520 4525 4530
gta att tca gag gct ccg acg ctg ata ata gga caa aga tgg ctt 33800
Val Ile Ser Glu Ala Pro Thr Leu Ile Ile Gly Gln Arg Trp Leu
4535 4540 4545
cgt cca gat gag att tta tct aat gta gat ttg cgt ctt ggc gta 33845
Arg Pro Asp Glu Ile Leu Ser Asn Val Asp Leu Arg Leu Gly Val
4550 4555 4560
ccc ggg aat aca agt ggg agt gac cct taa tataaaacag gcgtgtttat 33895
Pro Gly Asn Thr Ser Gly Ser Asp Pro
4565 4570
gtacattaaa gtatttgtgg tttttattga ctgggcgttt cgtttgtata acgctgttgt 33955
tgctagtatt ttcataacct cctaggtttt tggagctaca cgtgcttatt caacgctctt 34015
tgggatttga atcatcgtaa acgtagcgtc cctaccagtt gagcgcgtaa ttttcgtaag 34075
caataaa atg gat ata att ccg cct ata gct gtc act gtt gcg gga gtg 34124
Met Asp Ile Ile Pro Pro Ile Ala Val Thr Val Ala Gly Val
4575 4580
gga agc cgt aat caa ttt gac ggt gcc ctg gga ccg gcg tca ggt 34169
Gly Ser Arg Asn Gln Phe Asp Gly Ala Leu Gly Pro Ala Ser Gly
4585 4590 4595
ctg tca tgt tta aga aca tct tta tcg ttt ttg cat atg aca tat 34214
Leu Ser Cys Leu Arg Thr Ser Leu Ser Phe Leu His Met Thr Tyr
4600 4605 4610
gcg cat gga att aat gca acc ctg tca tca gac atg att gat gga 34259
Ala His Gly Ile Asn Ala Thr Leu Ser Ser Asp Met Ile Asp Gly
4615 4620 4625
tgt tta caa gag ggt gca gca tgg act acg gat ctg tct aat atg 34304
Cys Leu Gln Glu Gly Ala Ala Trp Thr Thr Asp Leu Ser Asn Met
4630 4635 4640
ggg agg ggt gtc cca gat atg tgt gct ctt gtt gat ctc ccc aat 34349
Gly Arg Gly Val Pro Asp Met Cys Ala Leu Val Asp Leu Pro Asn
4645 4650 4655
cga att tca tat att aaa ctg ggg gac act acc agt acg tgc tgc 34394
Arg Ile Ser Tyr Ile Lys Leu Gly Asp Thr Thr Ser Thr Cys Cys
4660 4665 4670
gtt ttg tct aga ata tac ggc gat agc cat ttt ttt acc gtt cca 34439
Val Leu Ser Arg Ile Tyr Gly Asp Ser His Phe Phe Thr Val Pro
4675 4680 4685
gac gag ggt ttt atg tgc aca caa att ccc gct aga gcg ttt ttc 34484
Asp Glu Gly Phe Met Cys Thr Gln Ile Pro Ala Arg Ala Phe Phe
4690 4695 4700
gat gat gtg tgg atg gga cgt gaa gag tcg tat aca att ata act 34529
Asp Asp Val Trp Met Gly Arg Glu Glu Ser Tyr Thr IleIle Thr
4705 4710 4715
gta gac tca acg gga atg gcc atc tat cgt cag gga aac ata tct 34574
Val Asp Ser Thr Gly Met Ala Ile Tyr Arg Gln Gly Asn Ile Ser
4720 4725 4730
ttt att ttt gat cca cat ggc cat ggg act ata gga cag gct gta 34619
Phe Ile Phe Asp Pro His Gly His Gly Thr Ile Gly Gln Ala Val
4735 4740 4745
gtt gtt cgg gtg aat acc acg gat gtg tac tct tat atc gca tcg 34664
Val Val Arg Val Asn Thr Thr Asp Val Tyr Ser Tyr Ile Ala Ser
4750 4755 4760
gag tat acc cac cgc ccc gat aac gta gaa tcc caa tgg gcc gct 34709
Glu Tyr Thr His Arg Pro Asp Asn Val Glu Ser Gln Trp Ala Ala
4765 4770 4775
gca tta gtt ttt ttt gtc acc gca aac gac ggt ccc gta agc gaa 34754
Ala Leu Val Phe Phe Val Thr Ala Asn Asp Gly Pro Val Ser Glu
4780 4785 4790
gaa gcg cta tct tcg gca gta acg ctt ata tac gga agc tgt gat 34799
Glu Ala Leu Ser Ser Ala Val Thr Leu Ile Tyr Gly Ser Cys Asp
4795 4800 4805
aca tat ttt aca gat gaa caa tat tgc gaa aaa ctg gtt aca gct 34844
Thr Tyr Phe Thr Asp Glu Gln Tyr Cys Glu Lys Leu Val Thr Ala
4810 4815 4820
caa cat ccg ttg ctt ctt tca cct cct aat tcc acg aca att gtg 34889
Gln His Pro Leu Leu Leu Ser Pro Pro Asn Ser Thr Thr Ile Val
4825 4830 4835
ctt aat aaa tcg tct ata gta cct ctt cac caa aac gtt ggt gaa 34934
Leu Asn Lys Ser Ser Ile Val Pro Leu His Gln Asn Val Gly Glu
4840 4845 4850
agt gta tcc ttg gaa gca acc cta cat tca acg tta acc aac acg 34979
Ser Val Ser Leu Glu Ala Thr Leu His Ser Thr Leu Thr Asn Thr
4855 4860 4865
gtt gca ctg gac cct aga tgt agt tac agc gag gtt gat cct tgg 35024
Val Ala Leu Asp Pro Arg Cys Ser Tyr Ser Glu Val Asp Pro Trp
4870 4875 4880
cat gcg gtt cta gaa aca acc tcg act ggg tct ggc gtt ttg gat 35069
His Ala Val Leu Glu Thr Thr Ser Thr Gly Ser Gly Val Leu Asp
4885 4890 4895
tgt cgt cgt aga cgc cgt cct tca tgg act cct cct tca agc gag 35114
Cys Arg Arg Arg Arg Arg Pro Ser Trp Thr Pro Pro Ser Ser Glu
4900 4905 4910
gaa aat tta gct tgt atc gac gat ggc ttg gta aat aat aca cat 35159
Glu Asn Leu Ala Cys Ile Asp Asp Gly Leu Val Asn Asn Thr His
4915 4920 4925
tcc acg gat aat tta cat aaa ccc gct aaa aag gtt ctc aaa ttt 35204
Ser Thr Asp Asn Leu His Lys Pro Ala Lys Lys Val Leu Lys Phe
4930 4935 4940
aaa cca act gta gac gtg ccg gat aaa aca caa gtg gca cat gta 35249
Lys Pro Thr Val Asp Val Pro Asp Lys Thr Gln Val Ala His Val
4945 4950 4955
tta ccc cgc cta cga gaa gtt gct aac acc cca gac gtt gtg tta 35294
Leu Pro Arg Leu Arg Glu Val Ala Asn Thr Pro Asp Val Val Leu
4960 4965 4970
aat gta tcc aat gta gat acg cct gaa tcc agt ccc act ttt tca 35339
Asn Val Ser Asn Val Asp Thr Pro Glu Ser Ser Pro Thr Phe Ser
4975 4980 4985
cgg aac atg aat gta gga agc agt ttg aaa gat cgg aag cca ttt 35384
Arg Asn Met Asn Val Gly Ser Ser Leu Lys Asp Arg Lys Pro Phe
4990 4995 5000
cta ttt gaa cag agt ggt gat gtc aac atg gtt gtc gaa aaa cta 35429
Leu Phe Glu Gln Ser Gly Asp Val Asn Met Val Val Glu Lys Leu
5005 5010 5015
cta caa cat ggg cat gaa att agc aat gga tac gta caa aat gcg 35474
Leu Gln His Gly His Glu Ile Ser Asn Gly Tyr Val Gln Asn Ala
5020 5025 5030
gtg ggt acg ttg gat act gtt att acc ggt cat aca aat gtt ccc 35519
Val Gly Thr Leu Asp Thr Val Ile Thr Gly His Thr Asn Val Pro
5035 5040 5045
att tgg gta aca agg ccc ttg gtt atg cca gac gaa aag gat cca 35564
Ile Trp Val Thr Arg Pro Leu Val Met Pro Asp Glu Lys Asp Pro
5050 5055 5060
ttg gag ctt ttt att aac ctc acc att ttg cgt tta acg gga ttt 35609
Leu Glu Leu Phe Ile Asn Leu Thr Ile Leu Arg Leu Thr Gly Phe
5065 5070 5075
gtg gtg gaa aat gga aca cgt aca cat cat ggt gct aca agc gtt 35654
Val Val Glu Asn Gly Thr Arg Thr His His Gly Ala Thr Ser Val
5080 5085 5090
gta tca gac ttt ata ggt ccc ctt ggg gaa att tta aca gga ttt 35699
Val Ser Asp Phe Ile Gly Pro Leu Gly Glu Ile Leu Thr Gly Phe
5095 5100 5105
ccc tcc gcc gcg gaa ctt ata cgc gtt aca agt ttg ata tta aca 35744
Pro Ser Ala Ala Glu Leu Ile Arg Val Thr Ser Leu Ile Leu Thr
5110 5115 5120
aac atg ccg ggg gcg gaa tat gct att aaa act gtt ctc cgg aaa 35789
Asn Met Pro Gly Ala Glu Tyr Ala Ile Lys Thr Val Leu Arg Lys
5125 5130 5135
aaa tgt aca att ggc atg ctc att atc gct aag ttt ggt cta gtt 35834
Lys Cys Thr Ile Gly Met Leu Ile Ile Ala Lys Phe Gly Leu Val
5140 5145 5150
gcc atg cgg gtt cag gat aca acc ggc gct tta cat gcc gaa cta 35879
Ala Met Arg Val Gln Asp Thr Thr Gly Ala Leu His Ala Glu Leu
5155 5160 5165
gat gtg tta gaa gcg gat cta gga ggt tcg tcg ccc ata gac ctc 35924
Asp Val Leu Glu Ala Asp Leu Gly Gly Ser Ser Pro Ile Asp Leu
5170 5175 5180
tat tct aga ctg tcg aca ggt ctt ata agt ata cta aat tcg cct 35969
Tyr Ser Arg Leu Ser Thr Gly Leu Ile Ser Ile Leu Asn Ser Pro
5185 5190 5195
att att tct cat ccc gga ctt ttt gcc gag ctt att cca acc cgt 36014
Ile Ile Ser His Pro Gly Leu Phe Ala Glu Leu Ile Pro Thr Arg
5200 5205 5210
aca ggg tcc ctg tct gaa cga ata cgt ctt ctt tgt gaa tta gtc 36059
Thr Gly Ser Leu Ser Glu Arg Ile Arg Leu Leu Cys Glu Leu Val
5215 5220 5225
tcg gcc cgg gag aca cgc tat atg cgt gaa cac acc gcg ctt gtt 36104
Ser Ala Arg Glu Thr Arg Tyr Met Arg Glu His Thr Ala Leu Val
5230 5235 5240
tct agt gta aag gct tta gag aat gca tta cgg tct acc cgc aat 36149
Ser Ser Val Lys Ala Leu Glu Asn Ala Leu Arg Ser Thr Arg Asn
5245 5250 5255
aaa att gat gcc att caa ata cca gaa gtt ccc cag gaa ccc ccg 36194
Lys Ile Asp Ala Ile Gln Ile Pro Glu Val Pro Gln Glu Pro Pro
5260 5265 5270
gaa gaa acc gac att cca ccc gaa gag tta att cgg cgt gta tat 36239
Glu Glu Thr Asp Ile Pro Pro Glu Glu Leu Ile Arg Arg Val Tyr
5275 5280 5285
gag ata cga tcc gaa gtt aca atg cta ttg acc tcg gct gtt aca 36284
Glu Ile Arg Ser Glu Val Thr Met Leu Leu Thr Ser Ala Val Thr
5290 5295 5300
gaa tac ttc acc cgc gga gtg tta tat agc aca cgg gcc ttg atc 36329
Glu Tyr Phe Thr Arg Gly Val Leu Tyr Ser Thr Arg Ala Leu Ile
5305 5310 5315
gct gaa caa tcc cct agg cgt ttt cgg gtc gcg acc gca agt acg 36374
Ala Glu Gln Ser Pro Arg Arg Phe Arg Val Ala Thr Ala Ser Thr
5320 5325 5330
gca ccc att caa cgg ctt tta gat tct ctt ccg gaa ttc gac gct 36419
Ala Pro Ile Gln Arg Leu Leu Asp Ser Leu Pro Glu Phe Asp Ala
5335 5340 5345
aaa tta acg gca atc ata tcg tcc ctg tct ata cac cct cct cct 36464
Lys Leu Thr Ala Ile Ile Ser Ser Leu Ser Ile His Pro Pro Pro
5350 5355 5360
gag act ata caa aat ctc ccc gtc gta tct ctg tta aaa gag ctt 36509
Glu Thr Ile Gln Asn Leu Pro Val Val Ser Leu Leu Lys Glu Leu
5365 5370 5375
att aaa gaa ggg gaa gat tta aac aca gac acg gct ctc gta tcg 36554
Ile Lys Glu Gly Glu Asp Leu Asn Thr Asp Thr Ala Leu Val Ser
5380 5385 5390
tgg tta tct gta gtc ggg gaa gct caa acc gca ggt tac tta tcc 36599
Trp Leu Ser Val Val Gly Glu Ala Gln Thr Ala Gly Tyr Leu Ser
5395 5400 5405
aga cga gag ttc gat gaa tta tca cgt aca att aaa acc att aat 36644
Arg Arg Glu Phe Asp Glu Leu Ser Arg Thr Ile Lys Thr Ile Asn
5410 5415 5420
aca cgc gca acg caa cgg gct tcc gcg gaa gca gag ttg tct tgc 36689
Thr Arg Ala Thr Gln Arg Ala Ser Ala Glu Ala Glu Leu Ser Cys
5425 5430 5435
ttt aat acg cta agc gcg gcc gta gac caa gcc gta aag gac tat 36734
Phe Asn Thr Leu Ser Ala Ala Val Asp Gln Ala Val Lys Asp Tyr
5440 5445 5450
gaa aca tat aac aat ggt gag gtc aag tat cct gaa ata aca cgg 36779
Glu Thr Tyr Asn Asn Gly Glu Val Lys Tyr Pro Glu Ile Thr Arg
5455 5460 5465
gat gat tta tta gca aca att gta cgt gct aca gac gat ttg gtg 36824
Asp Asp Leu Leu Ala Thr Ile Val Arg Ala Thr Asp Asp Leu Val
5470 5475 5480
cga cag ata aaa att tta agt gat cca atg atc caa tcc ggt tta 36869
Arg Gln Ile Lys Ile Leu Ser Asp Pro Met Ile Gln Ser Gly Leu
5485 5490 5495
caa cct tcg att aaa aga cga ttg gaa aca agg ctt aaa gag gtt 36914
Gln Pro Ser Ile Lys Arg Arg Leu Glu Thr Arg Leu Lys Glu Val
5500 5505 5510
cag acg tat gca aac gag gcc cga acc aca cag gac aca ata aag 36959
Gln Thr Tyr Ala Asn Glu Ala Arg Thr Thr Gln Asp Thr Ile Lys
5515 5520 5525
agt cga aaa cag gcg gca tat aat aaa ctc ggg ggg tta ctt cgc 37004
Ser Arg Lys Gln Ala Ala Tyr Asn Lys Leu Gly Gly Leu Leu Arg
5530 5535 5540
ccg gta acc ggt ttt gtg gga ctt agg gct gca gta gat tta tta 37049
Pro Val Thr Gly Phe Val Gly Leu Arg Ala Ala Val Asp Leu Leu
5545 5550 5555
ccg gaa ctt gct tct gag tta gat gtc caa gga gcc ctg gta aat 37094
Pro Glu Leu Ala Ser Glu Leu Asp Val Gln Gly Ala Leu Val Asn
5560 5565 5570
ctc agg acc aaa gtc tta gag gcg ccg gta gag atc cgt tct caa 37139
Leu Arg Thr Lys Val Leu Glu Ala Pro Val Glu Ile Arg Ser Gln
5575 5580 5585
ctt acg ggt gat ttc tgg gcg tta ttt aac caa tat cga gac att 37184
Leu Thr Gly Asp Phe Trp Ala Leu Phe Asn Gln Tyr Arg Asp Ile
5590 5595 5600
tta gaa cat ccc gga aac gca cgc aca tct gtc tta gga gga ctg 37229
Leu Glu His Pro Gly Asn Ala Arg Thr Ser Val Leu Gly Gly Leu
5605 5610 5615
gga gct tgt ttt aca gct att atc gaa att gtg ccg ata cct acg 37274
Gly Ala Cys Phe Thr Ala Ile Ile Glu Ile Val Pro Ile Pro Thr
5620 5625 5630
gag tat aga cca tca ttg ctt gcg ttt ttt ggt gac gtg gca gat 37319
Glu Tyr Arg Pro Ser Leu Leu Ala Phe Phe Gly Asp Val Ala Asp
5635 5640 5645
gtg ctt gca tcc gac atc gcg acc gta tct act aac ccg gaa agt 37364
Val Leu Ala Ser Asp Ile Ala Thr Val Ser Thr Asn Pro Glu Ser
5650 5655 5660
gag tcc gcc ata aac gct gtt gtt gca act ctt agt aaa gcg acg 37409
Glu Ser Ala Ile Asn Ala Val Val Ala Thr Leu Ser Lys Ala Thr
5665 5670 5675
tta gtt tca tct aca gtg cca gcc tta tcc ttt gtg ttg tcg tta 37454
Leu Val Ser Ser Thr Val Pro Ala Leu Ser Phe Val Leu Ser Leu
5680 5685 5690
tat aaa aaa tat cag gct tta caa caa gaa att acg aat acc cat 37499
Tyr Lys Lys Tyr Gln Ala Leu Gln Gln Glu Ile Thr Asn Thr His
5695 5700 5705
aag ttg act gaa tta caa aaa caa ctt gga gat gac ttc tcc acc 37544
Lys Leu Thr Glu Leu Gln Lys Gln Leu Gly Asp Asp Phe Ser Thr
5710 5715 5720
cta gct gtc tca tct gga cac ttg aag ttt ata tca tct tca aat 37589
Leu Ala Val Ser Ser Gly His Leu Lys Phe Ile Ser Ser Ser Asn
5725 5730 5735
gta gat gat tat gaa ata aac gat gcg ata tta tca ata caa aca 37634
Val Asp Asp Tyr Glu Ile Asn Asp Ala Ile Leu Ser Ile Gln Thr
5740 5745 5750
aat gtg cac gcc cta atg gat acg gtt aaa ctt gtt gaa gtt gaa 37679
Asn Val His Ala Leu Met Asp Thr Val Lys Leu Val Glu Val Glu
5755 5760 5765
ctg caa aag cta ccc ccc cat tgt att gct ggg aca tct acc tta 37724
Leu Gln Lys Leu Pro Pro His Cys Ile Ala Gly Thr Ser Thr Leu
5770 5775 5780
tct cga gta gta aag gat ctt cat aaa ctc gtc aca atg gca cat 37769
Ser Arg Val Val Lys Asp Leu His Lys Leu Val Thr Met Ala His
5785 5790 5795
gag aag aag gaa cag gca aaa gtg tta att acc gat tgt gaa cgt 37814
Glu Lys Lys Glu Gln Ala Lys Val Leu Ile Thr Asp Cys Glu Arg
5800 5805 5810
gca cat aaa caa caa acg act cgg gtt ttg tat gag cgt tgg aca 37859
Ala His Lys Gln Gln Thr Thr Arg Val Leu Tyr Glu Arg Trp Thr
5815 5820 5825
cgt gat att ata gca tgt ctg gag gca atg gaa acg cgc cat ata 37904
Arg Asp Ile Ile Ala Cys Leu Glu Ala Met Glu Thr Arg His Ile
5830 5835 5840
ttt aac ggg aca gaa ctg gca cgg ttg cga gat atg gcc gct gcg 37949
Phe Asn Gly Thr Glu Leu Ala Arg Leu Arg Asp Met Ala Ala Ala
5845 5850 5855
gga ggg ttt gat ata cac gca gtt tac cca caa gca cgt cag gtt 37994
Gly Gly Phe Asp Ile His Ala Val Tyr Pro Gln Ala Arg Gln Val
5860 5865 5870
gta gcg gca tgt gaa act aca gcc gtt acg gca tta gat act gtg 38039
Val Ala Ala Cys Glu Thr Thr Ala Val Thr Ala Leu Asp Thr Val
5875 5880 5885
ttt cgc cac aat cca tat acc ccc gaa aat aca aat att cca cca 38084
Phe Arg His Asn Pro Tyr Thr Pro Glu Asn Thr Asn Ile Pro Pro
5890 5895 5900
cct ttg gct ttg tta aga ggg tta aca tgg ttt gat gat ttt tcg 38129
Pro Leu Ala Leu Leu Arg Gly Leu Thr Trp Phe Asp Asp Phe Ser
5905 5910 5915
att acg gct ccc gta ttc acc gtt atg ttt cca ggt gtt agt att 38174
Ile Thr Ala Pro Val Phe Thr Val Met Phe Pro Gly Val Ser Ile
5920 5925 5930
gag gga ctc ctt ctg ctt atg cgt att cgc gcg gtt gtg tta tta 38219
Glu Gly Leu Leu Leu Leu Met Arg Ile Arg Ala Val Val Leu Leu
5935 5940 5945
tcc gcc gat acg tct att aat gga ata cct aac tac cga gat atg 38264
Ser Ala Asp Thr Ser Ile Asn Gly Ile Pro Asn Tyr Arg Asp Met
5950 5955 5960
ata tta cga acc tcg ggg gat cta tta caa ata ccc gca ttg gct 38309
Ile Leu Arg Thr Ser Gly Asp Leu Leu Gln Ile Pro Ala Leu Ala
5965 5970 5975
ggg tat gtt gat ttt tac aca cgg tct tat gat cag ttt ata acc 38354
Gly Tyr Val Asp Phe Tyr Thr Arg Ser Tyr Asp Gln Phe Ile Thr
5980 5985 5990
gaa agt gta acg tta agt gaa ctt aga gca gac atc aga cag gct 38399
Glu Ser Val Thr Leu Ser Glu Leu Arg Ala Asp Ile Arg Gln Ala
5995 6000 6005
gcc ggg gct aaa ctt aca gaa gca aat aag gct ttg gag gaa gta 38444
Ala Gly Ala Lys Leu Thr Glu Ala Asn Lys Ala Leu Glu Glu Val
6010 6015 6020
act cat gtt cgg gca cac gaa acg gct aaa ctt gca ctt aaa gaa 38489
Thr His Val Arg Ala His Glu Thr Ala Lys Leu Ala Leu Lys Glu
6025 6030 6035
ggt gtc ttc att aca tta cca agc gaa ggt tta ttg att cgg gct 38534
Gly Val Phe Ile Thr Leu Pro Ser Glu Gly Leu Leu Ile Arg Ala
6040 6045 6050
ata gag tat ttt aca act ttc gat cat aaa cga ttt ata gga acg 38579
Ile Glu Tyr Phe Thr Thr Phe Asp His Lys Arg Phe Ile Gly Thr
6055 6060 6065
gca tat gaa aga gtt tta caa aca atg gta gac cgc gat cta aag 38624
Ala Tyr Glu Arg Val Leu Gln Thr Met Val Asp Arg Asp Leu Lys
6070 6075 6080
gag gcc aac gca gag ctt gca cag ttt cgt atg gtg tgt cag gca 38669
Glu Ala Asn Ala Glu Leu Ala Gln Phe Arg Met Val Cys Gln Ala
6085 6090 6095
aca aag aac cgt gca ata caa att tta caa aac att gtt gat acg 38714
Thr Lys Asn Arg Ala Ile Gln Ile Leu Gln Asn Ile Val Asp Thr
6100 6105 6110
gcc aat gcc act gag caa caa gaa gac gtg gat ttc act aac ctg 38759
Ala Asn Ala Thr Glu Gln Gln Glu Asp Val Asp Phe Thr Asn Leu
6115 6120 6125
aag acg tta tta aaa cta acc ccc cct ccc aaa aca att gca ttg 38804
Lys Thr Leu Leu Lys Leu Thr Pro Pro Pro Lys Thr Ile Ala Leu
6130 6135 6140
gcc att gat aga tct act tcc gtt cag gac att gtc acg cag ttt 38849
Ala Ile Asp Arg Ser Thr Ser Val Gln Asp Ile Val Thr Gln Phe
6145 6150 6155
gca ttg ctg tta ggg cgt ctg gaa gaa gaa act ggt acg ttg gac 38894
Ala Leu Leu Leu Gly Arg Leu Glu Glu Glu Thr Gly Thr Leu Asp
6160 6165 6170
att cag gcg gtt gac tgg atg tac caa gct cgc aat att att gac 38939
Ile Gln Ala Val Asp Trp Met Tyr Gln Ala Arg Asn Ile Ile Asp
6175 6180 6185
tcc cat cca cta agt gtg cgt ata gac ggt acc ggc ccc ctg cat 38984
Ser His Pro Leu Ser Val Arg Ile Asp Gly Thr Gly Pro Leu His
6190 6195 6200
act tat aaa gat agg gtg gat aaa ctt tat gcg tta cga act aaa 39029
Thr Tyr Lys Asp Arg Val Asp Lys Leu Tyr Ala Leu Arg Thr Lys
6205 6210 6215
tta gat ctc cta cga cga cga ata gaa acc ggt gag gtt acg tgg 39074
Leu Asp Leu Leu Arg Arg Arg Ile Glu Thr Gly Glu Val Thr Trp
6220 6225 6230
gac gat gca tgg aca aca ttt aaa aga gaa acg ggg gat atg ttg 39119
Asp Asp Ala Trp Thr Thr Phe Lys Arg Glu Thr Gly Asp Met Leu
6235 6240 6245
gca tcg ggg gac acg tac gct act tcc gta gat agt ata aag gca 39164
Ala Ser Gly Asp Thr Tyr Ala Thr Ser Val Asp Ser Ile Lys Ala
6250 6255 6260
ctc cag gca tcg gcg tct gtg gtt gac atg ctt tgt tcc gaa ccc 39209
Leu Gln Ala Ser Ala Ser Val Val Asp Met Leu Cys Ser Glu Pro
6265 6270 6275
gaa ttt ttt tta ttg cct gtg gaa acg aaa aac cgt ctc caa aaa 39254
Glu Phe Phe Leu Leu Pro Val Glu Thr Lys Asn Arg Leu Gln Lys
6280 6285 6290
aag caa cag gaa cgt aaa acg gcg ttg gat gtt gtg ttg caa aaa 39299
Lys Gln Gln Glu Arg Lys Thr Ala Leu Asp Val Val Leu Gln Lys
6295 6300 6305
caa aga cag ttt gaa gag acc gcg tct cgc tta cga gct tta att 39344
Gln Arg Gln Phe Glu Glu Thr Ala Ser Arg Leu Arg Ala Leu Ile
6310 6315 6320
gaa cgt att cca acg gag agt gac cat gac gtt ctt cgt atg tta 39389
Glu Arg Ile Pro Thr Glu Ser Asp His Asp Val Leu Arg Met Leu
6325 6330 6335
tta cgt gat ttc gat caa ttt aca cat ttg cct ata tgg ata aaa 39434
Leu Arg Asp Phe Asp Gln Phe Thr His Leu Pro Ile Trp Ile Lys
6340 6345 6350
aca cag tat atg aca ttt cga aat tta ctc atg gta cgg tta ggc 39479
Thr Gln Tyr Met Thr Phe Arg Asn Leu Leu Met Val Arg Leu Gly
6355 6360 6365
ttg tat gca agt tat gct gag att ttt cca ccc gcg tct cca aac 39524
Leu Tyr Ala Ser Tyr Ala Glu Ile Phe Pro Pro Ala Ser Pro Asn
6370 6375 6380
gga gta ttt gct cct att ccc gcc atg tcg ggt gta tgt cta gaa 39569
Gly Val Phe Ala Pro Ile Pro Ala Met Ser Gly Val Cys Leu Glu
6385 6390 6395
gac caa tcc cga tgc att cgc gcg cgg gtg gcc gcg ttt atg ggg 39614
Asp Gln Ser Arg Cys Ile Arg Ala Arg Val Ala Ala Phe Met Gly
6400 6405 6410
gag gcg tct gtg gtg caa acg ttt agg gaa gcc aga tct tct ata 39659
Glu Ala Ser Val Val Gln Thr Phe Arg Glu Ala Arg Ser Ser Ile
6415 6420 6425
gac gct ttg ttt gga aaa aat tta acc ttt tac ttg gat act gat 39704
Asp Ala Leu Phe Gly Lys Asn Leu Thr Phe Tyr Leu Asp Thr Asp
6430 6435 6440
ggg gtt cca ctt cga tat aga gtg tgt tat aaa tca gtt ggg gtt 39749
Gly Val Pro Leu Arg Tyr Arg Val Cys Tyr Lys Ser Val Gly Val
6445 6450 6455
aaa ctt gga acc atg cta tgc agt cag ggt gga tta tct tta cga 39794
Lys Leu Gly Thr Met Leu Cys Ser Gln Gly Gly Leu Ser Leu Arg
6460 6465 6470
ccg gca ctt ccc gat gaa ggt att gtg gaa gaa act aca cta tcg 39839
Pro Ala Leu Pro Asp Glu Gly Ile Val Glu Glu Thr Thr Leu Ser
6475 6480 6485
gca tta cgc gtg gcc aat gag gtc aat gag cta cgc att gaa tac 39884
Ala Leu Arg Val Ala Asn Glu Val Asn Glu Leu Arg Ile Glu Tyr
6490 6495 6500
gaa tcc gct ata aaa tcc ggg ttt tct gcc ttt tcc acc ttt gtt 39929
Glu Ser Ala Ile Lys Ser Gly Phe Ser Ala Phe Ser Thr Phe Val
6505 6510 6515
agg cat cgc cac gcc gaa tgg ggt aaa acc aac gca cgc aga gcc 39974
Arg His Arg His Ala Glu Trp Gly Lys Thr Asn Ala Arg Arg Ala
6520 6525 6530
att gca gag ata tac gcc ggc ctt ata aca aca aca ttg aca cga 40019
Ile Ala Glu Ile Tyr Ala Gly Leu Ile Thr Thr Thr Leu Thr Arg
6535 6540 6545
caa tac ggg gtt cat tgg gac aag ctt att tat tct ttt gaa aaa 40064
Gln Tyr Gly Val His Trp Asp Lys Leu Ile Tyr Ser Phe Glu Lys
6550 6555 6560
cac cac cta act tct gta atg ggc aat gga cta act aaa cca atc 40109
His His Leu Thr Ser Val Met Gly Asn Gly Leu Thr Lys Pro Ile
6565 6570 6575
cag aga agg ggt gat gta cgc gta tta gag tta acc cta tct gat 40154
Gln Arg Arg Gly Asp Val Arg Val Leu Glu Leu Thr Leu Ser Asp
6580 6585 6590
att gta act att ttg gtt gcc aca acc ccg gta cat ctt ctc aat 40199
Ile Val Thr Ile Leu Val Ala Thr Thr Pro Val His Leu Leu Asn
6595 6600 6605
ttt gct aga ttg gat tta att aaa cag cat gag tat atg gcc cgt 40244
Phe Ala Arg Leu Asp Leu Ile Lys Gln His Glu Tyr Met Ala Arg
6610 6615 6620
acc ctc aga ccc gta atc gag gcc gca ttt aga ggt cgt tta ctc 40289
Thr Leu Arg Pro Val Ile Glu Ala Ala Phe Arg Gly Arg Leu Leu
6625 6630 6635
gtt cgc tca ttg gat gga gac ccg aaa ggc aat gcc cgg gcc ttt 40334
Val Arg Ser Leu Asp Gly Asp Pro Lys Gly Asn Ala Arg Ala Phe
6640 6645 6650
ttt aat gcc gcc cca tcc aaa cat aaa ctc ccg tta gct ctt gga 40379
Phe Asn Ala Ala Pro Ser Lys His Lys Leu Pro Leu Ala Leu Gly
6655 6660 6665
tca aac caa gat cct acc ggc ggg aga ata ttt gca ttt cgg atg 40424
Ser Asn Gln Asp Pro Thr Gly Gly Arg Ile Phe Ala Phe Arg Met
6670 6675 6680
gca gat tgg aaa ctt gtt aaa atg cca cag aaa ata acg gat cct 40469
Ala Asp Trp Lys Leu Val Lys Met Pro Gln Lys Ile Thr Asp Pro
6685 6690 6695
ttt gcg cca tgg caa ctt tcc ccc ccc ccc ggg gta aag gcc aat 40514
Phe Ala Pro Trp Gln Leu Ser Pro Pro Pro Gly Val Lys Ala Asn
6700 6705 6710
gtc gat gca gtt acc cgt ata atg gca aca gat cgt ctt gcg acc 40559
Val Asp Ala Val Thr Arg Ile Met Ala Thr Asp Arg Leu Ala Thr
6715 6720 6725
att act gta ctt ggg cgc atg tgt ctc ccg cca att tcc tta gtg 40604
Ile Thr Val Leu Gly Arg Met Cys Leu Pro Pro Ile Ser Leu Val
6730 6735 6740
tca atg tgg aat acg ctg caa ccg gag gaa ttc gca tac aga aca 40649
Ser Met Trp Asn Thr Leu Gln Pro Glu Glu Phe Ala Tyr Arg Thr
6745 6750 6755
caa gat gat gtg gac att ata gtt gat gcg aga ctg gat ttg tca 40694
Gln Asp Asp Val Asp Ile Ile Val Asp Ala Arg Leu Asp Leu Ser
6760 6765 6770
tcc acg ctt aat gca aga ttt gat acc gct ccc agc aat acc acg 40739
Ser Thr Leu Asn Ala Arg Phe Asp Thr Ala Pro Ser Asn Thr Thr
6775 6780 6785
tta gag tgg aat aca gac cgt aaa gta att aca gat gct tat att 40784
Leu Glu Trp Asn Thr Asp Arg Lys Val Ile Thr Asp Ala Tyr Ile
6790 6795 6800
caa acc ggg gca acg aca gtt ttt aca gta acg ggg gcg gca cca 40829
Gln Thr Gly Ala Thr Thr Val Phe Thr Val Thr Gly Ala Ala Pro
6805 6810 6815
act cac gtt tct aat gta aca gcg ttt gac ata gca act acg gct 40874
Thr His Val Ser Asn Val Thr Ala Phe Asp Ile Ala Thr Thr Ala
6820 6825 6830
att tta ttt ggg gct cct ttg gtt att gcc atg gaa ctt aca tcc 40919
Ile Leu Phe Gly Ala Pro Leu Val Ile Ala Met Glu Leu Thr Ser
6835 6840 6845
gtt ttt tca caa aat tcc gga ctt act ttg ggg tta aaa tta ttc 40964
Val Phe Ser Gln Asn Ser Gly Leu Thr Leu Gly Leu Lys Leu Phe
6850 6855 6860
gat tcc cgg cat atg gct aca gat tcg ggt ata tcc tca gcc gta 41009
Asp Ser Arg His Met Ala Thr Asp Ser Gly Ile Ser Ser Ala Val
6865 6870 6875
tct ccc gat att gtt tct tgg ggg tta cgt tta ctg cat atg gat 41054
Ser Pro Asp Ile Val Ser Trp Gly Leu Arg Leu Leu His Met Asp
6880 6885 6890
cct cac cca att gaa aat gca tgt tta att gtc caa cta gaa aaa 41099
Pro His Pro Ile Glu Asn Ala Cys Leu Ile Val Gln Leu Glu Lys
6895 6900 6905
ctg tcc gcg ctc att gca aac aaa cct ctt aca aac aat ccc ccg 41144
Leu Ser Ala Leu Ile Ala Asn Lys Pro Leu Thr Asn Asn Pro Pro
6910 6915 6920
tgt tta ctg cta ttg gac gaa cat atg aat ccc tct tat gtt tta 41189
Cys Leu Leu Leu Leu Asp Glu His Met Asn Pro Ser Tyr Val Leu
6925 6930 6935
tgg gaa cga aaa gac tcg att cca gct ccg gat tat gtg gtc ttt 41234
Trp Glu Arg Lys Asp Ser Ile Pro Ala Pro Asp Tyr Val Val Phe
6940 6945 6950
tgg ggg cca gaa tct ctt att gat ttg ccg tac atc gac tcc gat 41279
Trp Gly Pro Glu Ser Leu Ile Asp Leu Pro Tyr Ile Asp Ser Asp
6955 6960 6965
gag gac tct ttc ccc tcg tgt ccc gat gat cca ttt tac tcg caa 41324
Glu Asp Ser Phe Pro Ser Cys Pro Asp Asp Pro Phe Tyr Ser Gln
6970 6975 6980
att att gcc ggt tat gcg ccc caa ggc ccc cca aac ctc gac aca 41369
Ile Ile Ala Gly Tyr Ala Pro Gln Gly Pro Pro Asn Leu Asp Thr
6985 6990 6995
act gat ttt tac cca acg gag cca cta ttt aag tct ccc gtt caa 41414
Thr Asp Phe Tyr Pro Thr Glu Pro Leu Phe Lys Ser Pro Val Gln
7000 7005 7010
gtt gtt aga agt tcc aaa tgt aaa aaa atg ccc gtc cgg ccc gcg 41459
Val Val Arg Ser Ser Lys Cys Lys Lys Met Pro Val Arg Pro Ala
7015 7020 7025
cag ccc gcg cag ccc gcg cag ccc gcg cag ccc gcg cag acc gtc 41504
Gln Pro Ala Gln Pro Ala Gln Pro Ala Gln Pro Ala Gln Thr Val
7030 7035 7040
cag ccc gcg cag ccc ata gaa ccg ggc aca caa ata gtg gta caa 41549
Gln Pro Ala Gln Pro Ile Glu Pro Gly Thr Gln Ile Val Val Gln
7045 7050 7055
aat ttt aag aaa ccc caa agc gta aaa aca acc ctt agc caa aaa 41594
Asn Phe Lys Lys Pro Gln Ser Val Lys Thr Thr Leu Ser Gln Lys
7060 7065 7070
gat att ccc ttg tat gtg gaa acc gaa tca gaa acg gct gtg ctt 41639
Asp Ile Pro Leu Tyr Val Glu Thr Glu Ser Glu Thr Ala Val Leu
7075 7080 7085
ata cct aag caa tta acc acc tcc att aaa aca acc gtt tgt aaa 41684
Ile Pro Lys Gln Leu Thr Thr Ser Ile Lys Thr Thr Val Cys Lys
7090 7095 7100
agt att acc cca cca aat aac caa ttg tcg gat tgg aaa aat aat 41729
Ser Ile Thr Pro Pro Asn Asn Gln Leu Ser Asp Trp Lys Asn Asn
7105 7110 7115
cca cag caa aac caa acg tta aac caa gcg ttc agt aaa cca ata 41774
Pro Gln Gln Asn Gln Thr Leu Asn Gln Ala Phe Ser Lys Pro Ile
7120 7125 7130
ctt gag att acc tcc att ccg aca gat gac tcg ata tct tac cgg 41819
Leu Glu Ile Thr Ser Ile Pro Thr Asp Asp Ser Ile Ser Tyr Arg
7135 7140 7145
act tgg att gaa aaa tca aat caa aca caa aaa cgg cat caa aat 41864
Thr Trp Ile Glu Lys Ser Asn Gln Thr Gln Lys Arg His Gln Asn
7150 7155 7160
gac cct cga atg tat aac tcc aaa aca gta ttc cac cct gta aat 41909
Asp Pro Arg Met Tyr Asn Ser Lys Thr Val Phe His Pro Val Asn
7165 7170 7175
aac caa tta cct tct tgg gtt gac acg gca gcc gat gcc ccc caa 41954
Asn Gln Leu Pro Ser Trp Val Asp Thr Ala Ala Asp Ala Pro Gln
7180 7185 7190
acg gac cta ttg aca aac tat aaa aca aga cag ccg tcg cca aac 41999
Thr Asp Leu Leu Thr Asn Tyr Lys Thr Arg Gln Pro Ser Pro Asn
7195 7200 7205
ttt ccg cgg gac gta cac aca tgg ggc gta tct tct aac ccg ttt 42044
Phe Pro Arg Asp Val His Thr Trp Gly Val Ser Ser Asn Pro Phe
7210 7215 7220
aac tca ccg aac aga gac cta tat caa agt gat ttt agt gaa cct 42089
Asn Ser Pro Asn Arg Asp Leu Tyr Gln Ser Asp Phe Ser Glu Pro
7225 7230 7235
tct gac ggc tat agc agt gag agt gaa aat tct atc gta cta agt 42134
Ser Asp Gly Tyr Ser Ser Glu Ser Glu Asn Ser Ile Val Leu Ser
7240 7245 7250
ctc gac gaa cat cgg tca tgt cgc gtt cct agg cac gta cgc gtt 42179
Leu Asp Glu His Arg Ser Cys Arg Val Pro Arg His Val Arg Val
7255 7260 7265
gtt aat gcc gat gta gtc acc ggt cga cgt tat gtc cga ggg acc 42224
Val Asn Ala Asp Val Val Thr Gly Arg Arg Tyr Val Arg Gly Thr
7270 7275 7280
gcc ttg gga gca ctg gca ctg tta agc cag gca tgt cgg cgt atg 42269
Ala Leu Gly Ala Leu Ala Leu Leu Ser Gln Ala Cys Arg Arg Met
7285 7290 7295
atc gac aac gtt aga tat aca cgt aaa ctt tta atg gac cac acg 42314
Ile Asp Asn Val Arg Tyr Thr Arg Lys Leu Leu Met Asp His Thr
7300 7305 7310
gaa gat ata ttt caa ggc ctg ggg tat gtt aaa ttg tta tta gat 42359
Glu Asp Ile Phe Gln Gly Leu Gly Tyr Val Lys Leu Leu Leu Asp
7315 7320 7325
gga aca tat ata taa agtagcgcct attaaagaaa aaaaaaaaac aacgattatt 42414
Gly Thr Tyr Ile
7330
ttctgtgtat ttttatttac accctacgac ttcttgaagc gtttccagat tgtcccgtgt 42474
gtgacaaggt ctgtccctta cccccctggg gggtattttg ggttgggggc ggggtagact 42534
gtggcacgcc ttgggccgcg ggcggtgatc cggttgttgg ctggacagtg cttgactgtg 42594
ctccctgttg cggttgttgt ccagaagacc ccgacaccac gtgttgctgt tgtccaacgg 42654
atgccgacgt cgtttgaggt ggggggtgtt gcggggatga tcccgaaaac gccaacgcgg 42714
cgggctgttg taaagcagac tgatcggcgc tctgtgtttt ttgcggcaat atagtaggcc 42774
ccgagattcc caaactcatg gatggatttg ggggttgtgg tcgtataata cgcgggttaa 42834
acgtacgttt taagccaacc gttggtctta accatgtcat agggtcagtc tcggcaaaca 42894
tggccgttcg gcgtatcgta tttgcattat ggttagcgcg tgcacgcgcg gcactggccg 42954
cggctcccac ggtgtaaatg cttctggcat cagcgatgtc cacacggtga ccaggttgca 43014
aaggtccact ggcgtttaaa agtcgtatta aagcaacggg ggtgtaagcc gcaattgctt 43074
ccaccgaaaa tgtggtgggg ttgctgggat caaagactac acgagacgat gcgggttgtg 43134
tcatcgttta ttagtttacg ggacaatcga taacagcata cacgtacatc tgcgcaggat 43194
atgtacggaa aggcaattta tttccagaaa agcaccgccc ctaatacaac taccagtaca 43254
attacaatga acagggcata tgtcacgtta gctacgggta gagcaagttt ccagacacgc 43314
gtagtttggg tatcgggtaa cgcaggttta atgtcacttt gcatttgaac agacgtgttt 43374
ggacttccgt tctcgggtgg ggatctgaat gaaggccgcc agcgtatata ttcatccaaa 43434
ttattgccag tttccttata catgtatgca tccgtggcgc gggccataag tttaatggtg 43494
cgagatggat cttccggtcc cataaaacga aaggataact gaacatatgg cattcgcaca 43554
aagcagttca cccacattaa agcctggaga ggtcggcggt caataccccc acctcgttta 43614
attgattcca aagcagatag gttgataccg gtacttaacg ttgaactaag aatcacgtta 43674
ttactgtcaa tggacacttc agccactggt gcgttagtcg gacgaaaaaa aaaaccttga 43734
aatagcacag acacccccgt attttgaatt tttatgtaag ggtcacaatc tacttgcgcc 43794
caattcgcca ttaaacgcat aatatactct accggaaagg cttcggatac gttgtcttcg 43854
ccgttaaact gaaaaacaca acgggcgggg gggcgttgtg gatcaaatat tggaagatcc 43914
ccatcgcaac attgaagagc gcttggtacc accaaccgaa tacgttgtaa aagattatct 43974
ccgcaacccc tcctgcgttc actccgtaca tacgttctcc gtgacatatt gatctaaggt 44034
tgcaaaccaa ggcacacgcg tgaagtattt agaccattta tcgtgggata taggaggagt 44094
ttggagtgat ccaccccctg acgacttatt aatgcgttta ttttccccat gtattaagca 44154
tccttcaata tttcatgcaa atctagaaat ttggccatga ctcccgcaaa gcgttcacgg 44214
cgacgggtca cgctggcact atgttcacat ggaacaacat aagcagattt ttctgaatcg 44274
ttactttctt tatgttttaa aacggacgcc aggcgactgg taaatgatat ataatttaat 44334
tgagcgtcag ttgtaggtag aattgcttct atttccgggg gaattaaatt ttcaaaccaa 44394
acggaaagag taaaggtgct atcagcagga aaatactttg actccagtgc atcgatattt 44454
aatagattaa catcggtgtc tgtaattaaa tcgcgggccc tcatcccaga g atg gat 44511
Met Asp
7335
cgg gta gaa tca gaa gaa ccc atg gat gga ttc gaa tcg ccc gta 44556
Arg Val Glu Ser Glu Glu Pro Met Asp Gly Phe Glu Ser Pro Val
7340 7345 7350
ttc tcc gaa aat aca tct tct aat tcc gga tgg tgt tcc gac gca 44601
Phe Ser Glu Asn Thr Ser Ser Asn Ser Gly Trp Cys Ser Asp Ala
7355 7360 7365
ttt tcc gat tcg tac atc gct tat aat cca gcc ctt ctg cta aaa 44646
Phe Ser Asp Ser Tyr Ile Ala Tyr Asn Pro Ala Leu Leu Leu Lys
7370 7375 7380
aac gat ttg tta ttt tca gaa ttg tta ttt gcc tcc cac tta ata 44691
Asn Asp Leu Leu Phe Ser Glu Leu Leu Phe Ala Ser His Leu Ile
7385 7390 7395
aat gtt ccc cgt gca ata gaa aac aac gtc act tat gag gcc tct 44736
Asn Val Pro Arg Ala Ile Glu Asn Asn Val Thr Tyr Glu Ala Ser
7400 7405 7410
tcg gcg gta ggt gtg gat aat gaa atg acc tca agt acc act gaa 44781
Ser Ala Val Gly Val Asp Asn Glu Met Thr Ser Ser Thr Thr Glu
7415 7420 7425
ttt ata gaa gaa att gga gac gtt ttg gcg tta gac aga gcc tgt 44826
Phe Ile Glu Glu Ile Gly Asp Val Leu Ala Leu Asp Arg Ala Cys
7430 7435 7440
ttg gtc tgc aga acg ctt gat ttg tat aaa cgt aaa ttt gga ctg 44871
Leu Val Cys Arg Thr Leu Asp Leu Tyr Lys Arg Lys Phe Gly Leu
7445 7450 7455
aca ccg gaa tgg gtt gcg gac tac gcc atg tta tgt atg aaa agt 44916
Thr Pro Glu Trp Val Ala Asp Tyr Ala Met Leu Cys Met Lys Ser
7460 7465 7470
ctg gca tcc ccg ccc tgt gca gtt gtc act ttt agc gct gcc ttt 44961
Leu Ala Ser Pro Pro Cys Ala Val Val Thr Phe Ser Ala Ala Phe
7475 7480 7485
gaa ttt gtg tat ctt atg gat cgt tac tac ctg tgc cgt tat aac 45006
Glu Phe Val Tyr Leu Met Asp Arg Tyr Tyr Leu Cys Arg Tyr Asn
7490 7495 7500
gtt act ttg gtt ggg tcc ttt gcc agg cgc acg ctt tcc ctg tta 45051
Val Thr Leu Val Gly Ser Phe Ala Arg Arg Thr Leu Ser Leu Leu
7505 7510 7515
gat ata caa aga cat ttt ttt ttg cat gta tgt ttt cgt acc gat 45096
Asp Ile Gln Arg His Phe Phe Leu His Val Cys Phe Arg Thr Asp
7520 7525 7530
gga ggg tta cca ggt ata cga ccg ccc ccc ggt aag gaa atg gcc 45141
Gly Gly Leu Pro Gly Ile Arg Pro Pro Pro Gly Lys Glu Met Ala
7535 7540 7545
aac aaa gta aga tat tcc aat tac tcc ttt ttt gta cag gcg gta 45186
Asn Lys Val Arg Tyr Ser Asn Tyr Ser Phe Phe Val Gln Ala Val
7550 7555 7560
gtt agg gct gca tta cta tcg atc agc acg tct cgt tta gac gaa 45231
Val Arg Ala Ala Leu Leu Ser Ile Ser Thr Ser Arg Leu Asp Glu
7565 7570 7575
acc gaa acg cgt aag tca ttt tac ttt aat cag gac gga ctg act 45276
Thr Glu Thr Arg Lys Ser Phe Tyr Phe Asn Gln Asp Gly Leu Thr
7580 7585 7590
gga ggc cct caa cct tta gcg gcc gcc ttg gct aat tgg aaa gat 45321
Gly Gly Pro Gln Pro Leu Ala Ala Ala Leu Ala Asn Trp Lys Asp
7595 7600 7605
tgc gcg cgg atg gtt gac tgt tca tca tcg gaa cat cgc aca agt 45366
Cys Ala Arg Met Val Asp Cys Ser Ser Ser Glu His Arg Thr Ser
7610 7615 7620
ggg atg att acc tgc gcg gaa cgt gca tta aaa gag gat ata gag 45411
Gly Met Ile Thr Cys Ala Glu Arg Ala Leu Lys Glu Asp Ile Glu
7625 7630 7635
ttt gaa gat ata tta ata gac aaa ctt aaa aaa tcg tct tac gta 45456
Phe Glu Asp Ile Leu Ile Asp Lys Leu Lys Lys Ser Ser Tyr Val
7640 7645 7650
gaa gca gct tgg ggt tac gca gac ttg gct tta tta tta ctg agt 45501
Glu Ala Ala Trp Gly Tyr Ala Asp Leu Ala Leu Leu Leu Leu Ser
7655 7660 7665
ggg gtt gct act tgg aat gta gac gag cgt aca aat tgt gct ata 45546
Gly Val Ala Thr Trp Asn Val Asp Glu Arg Thr Asn Cys AlaIle
7670 7675 7680
gaa act cgc gtt gga tgt gtt aaa tca tac tgg cag gcg aac cgg 45591
Glu Thr Arg Val Gly Cys Val Lys Ser Tyr Trp Gln Ala Asn Arg
7685 7690 7695
att gaa aac tcc agg gac gtt cca aaa caa ttt tcc aaa ttt acg 45636
Ile Glu Asn Ser Arg Asp Val Pro Lys Gln Phe Ser Lys Phe Thr
7700 7705 7710
agc gag gat gcc tgt ccc gaa gta gca ttt ggg cct att ttg tta 45681
Ser Glu Asp Ala Cys Pro Glu Val Ala Phe Gly Pro Ile Leu Leu
7715 7720 7725
act acc tta aaa aac gca aag tgc cgt ggt cgc acg aat acc gaa 45726
Thr Thr Leu Lys Asn Ala Lys Cys Arg Gly Arg Thr Asn Thr Glu
7730 7735 7740
tgc atg tta tgt tgt tta tta acc ata ggg cac tat tgg atc gct 45771
Cys Met Leu Cys Cys Leu Leu Thr Ile Gly His Tyr Trp Ile Ala
7745 7750 7755
ttg cgg cag ttt aaa agg gat ata tta gca tac tca gca aat aac 45816
Leu Arg Gln Phe Lys Arg Asp Ile Leu Ala Tyr Ser Ala Asn Asn
7760 7765 7770
aca agt tta ttt gac tgt atc gaa cct gta atc aat gca tgg agc 45861
Thr Ser Leu Phe Asp Cys Ile Glu Pro Val Ile Asn Ala Trp Ser
7775 7780 7785
cta gat aac ccc att aaa ctt aaa ttt cca ttt aat gat gag ggt 45906
Leu Asp Asn Pro Ile Lys Leu Lys Phe Pro Phe Asn Asp Glu Gly
7790 7795 7800
cga ttc ata acc att gta aaa gca gca ggt tcc gag gcc gta tat 45951
Arg Phe Ile Thr Ile Val Lys Ala Ala Gly Ser Glu Ala Val Tyr
7805 7810 7815
aaa cat tta ttt tgc gat ctc cta tgc gct ctc tcg gaa tta cag 45996
Lys His Leu Phe Cys Asp Leu Leu Cys Ala Leu Ser Glu Leu Gln
7820 7825 7830
aca aac cct aaa att tta ttt gcc cat cct aca acc gcg gat aag 46041
Thr Asn Pro Lys Ile Leu Phe Ala His Pro Thr Thr Ala Asp Lys
7835 7840 7845
gaa gtg ttg gag tta tat aaa gcc caa ctg gct gca caa aac aga 46086
Glu Val Leu Glu Leu Tyr Lys Ala Gln Leu Ala Ala Gln Asn Arg
7850 7855 7860
ttt gaa ggt cgt gta tgt gct ggc ctg tgg aca ttg gcg tat gca 46131
Phe Glu Gly Arg Val Cys Ala Gly Leu Trp Thr Leu Ala Tyr Ala
7865 7870 7875
ttt aaa gcc tac cag att ttt cca cgc aaa cca acc gcc aat gcc 46176
Phe Lys Ala Tyr Gln Ile Phe Pro Arg Lys Pro Thr Ala Asn Ala
7880 7885 7890
gca ttc ata cga gat gga gga ctt atg ctt cga cga cat gca ata 46221
Ala Phe Ile Arg Asp Gly Gly Leu Met Leu Arg Arg His Ala Ile
7895 7900 7905
tcg ctg gtc tcc ctc gaa cac acc cta tcg aag tat gtc tag 46263
Ser Leu Val Ser Leu Glu His Thr Leu Ser Lys Tyr Val
7910 7915
gcgatataaa tccgtatctc ggagcgggcc ttcgatgcgt gtacgctcca gaacgccatg 46323
ccgccgtcaa accattcgag gaaaacttat gtcaaaggag cggtctgtgt accgccatta 46383
ttttaattac atcgcaaggt cccccccaga agaactagct accgttagag gcttaatcgt 46443
gccaattatt aagacgaccc ctgtcaccct tccgtttaac ttgggtcaga cagtggcgga 46503
taactgcctg tcgttatccg gaatgggtta tcatttaggt ctcggaggtt attgtccgac 46563
atgcactgca tctggagaac cgcgtctatg tcgaaccgat cgggcggctc tgatactagc 46623
atatgttcag cagcttaaca acatatacga atatcgtgtg tttcttgcat ccattttggc 46683
gctatcagac cgagccaaca tgcaagcagc gtccgctgaa cccctattgt cgagcgtatt 46743
ggcacaaccg gaattatttt ttatgtatca tattatgagg gaggggggca tgcgagatat 46803
acgcgtactt ttttatcgtg atggagatgc cggagggttt atgatgtatg ttatatttcc 46863
ggggaaatct gttcacctcc attacagact aatcgatcat atacaggccg cgtgtcgggg 46923
gtataaaata gtcgcacacg tttggcagac aacattttta ctgtcggtat gtcgcaaccc 46983
agaacaacaa acagagactg tggtgccatc cattggaaca tcggacgttt actgtaaaat 47043
gtgtgacctt aactttgatg gagaattgct tttggaatac aaaagactct acgcattatt 47103
tgatgacttt gttcctcctc ggtgatttca gcttcagtgt tcattttatt atcccagcac 47163
ggggcgtgta tacaaacaaa gcctgccgcc tgcaagcggt ttagcatttt aacgttaaca 47223
actcgtgtct ctggaataaa acgttttaaa agccgttctg tgagtttagt gtcgtttcca 47283
aataacgcct taaaagttac actcgccgtc ccaatgagat gagaaaaata atagtcaatg 47343
tttaaagaca gcccgtgtga tgttacgtga atgggatctt ccgctaagtc agatattatt 47403
aacttacgct ttgcttcccc acaccgttta cctgcggtat tctgtaaagg atctccacgt 47463
agcaaagcta cactttttgc atcagcctcc acttcgtctg tgggggccac aataacataa 47523
gggatgcgtt ctcgaacgtt tgggatttga ccctgtctca ttactaattt ataatatact 47583
gttaagtgag ccaagcgacg gtttatgtag gcggatggtg gacgactaag ctcggccgtc 47643
ataacaaact tattaatatc caatttgggt gatgtaatct ggcgatgtgc atctgcaatt 47703
atgcgtccaa acccggccat cccagacggc atggcccgtc tattccattc agcaatggaa 47763
acacacgacg cctccgccgc agcacgcgag acggtgtcgt catataacaa cagttctaca 47823
agtttgcggg cataatcgtt aataaattga cagttgtttt ttctaaccaa gtcgactccc 47883
ttcattaaaa cctttccgcc gtaaattacc ccaatgtact ttttctttgt tataagcaaa 47943
agttttataa aagttttttc acactccaac tttataggag gacaaaacag agccgttgaa 48003
attatatgtg ccattttctc gccgatttta gctatcccct caacactaac acccttgaat 48063
cggataaaca cagaatccgt atctccatat ataaccttta cctcgtacgc tttttgggag 48123
agaacgctac tttcaatgtc tggaaacgct gtaataaaac gttcaaatgc ggcccagtta 48183
ttatgaatat aatctctggt acttaataac atttgacggc caattgtagt gacagtggcc 48243
gctacgtata aacatggcag aaatccctgc gcaactccag taaaaccgta cacggaatta 48303
caaactactt ttatcgcggc ttgttgtttg tctaataaca ctgcttcatc tgaagaactt 48363
ccgggtatgc gcgctctaat agccttgcgc atagccaacc agtcttttaa aagaacaccc 48423
agcagacttt ctcgaacgtt agagcgcaca aaaaaaagac gttttcctcc aactgtaaag 48483
gtggcataat cggatggatt caaacgttta accgtctcaa aatttaacgt tagcgtggta 48543
aaacataagt tatgggcctg aattatactt ggatataaac ttgcaaaatc caatacgacc 48603
accggatcga tataaaatcc cgtatcaggg tcaaaaaccc tggctccttt atatcctaca 48663
tttcgcccac ttgacgtacc agtgggagaa acgctctcgt cttcatccat ctcttcctca 48723
acatccccga catcgggaat aacatcctta tattcaaaag tagctgggta tcccccatcg 48783
ggtaaaataa atcctcgaga cgaagccagt cctaataaac aggtgtaaat cctaacctgc 48843
tgtccgtcgt aaatagcctt ggttaaagta attctagcta gccttgcaac cgcggataac 48903
tcaaggtgtg gtaaatattt aaaaaacagt ttccccacaa gagccgagtc ttgtatacaa 48963
tattcaccaa taattcctcg tgtattcggt ccactagcgt aatatcccgg aatgtctttg 49023
tagggcaaat ctctcttgga ctcatttaga gcttcacgtg caaccgaatc taatttataa 49083
ctcgagagtt ttaatttttc agttgcaatt gcatacatat ccagagatat gagaccgttg 49143
atctttacct tgcttcgtcg ctgaaatccg gatttgccaa catcccatat cttaaacaga 49203
cccccacggt ttatactgcc ataaccatca agcttgagac tgtatataga attaagtttc 49263
tccataataa acgcccaatc aaaattaaca atgttataac ctgtggcaaa ctcgggagcg 49323
tactgtttta cgagggtcat aaatgcaatt aatagctcga attcactatc aaactccagc 49383
acagtcggct ccggtaaccc cgcgtccttc atttcttgta catacctttg tggtaagtca 49443
caagagccaa gggaaaacag taaaatgtgt tctaaagact gtcgagggat tgaatataat 49503
agacaagaaa tttggattac aagatcctcc agatgtgttg catcgggaaa cgccagctca 49563
ttagatcctc ctgatttaca ttcaatatcg aaacataaca acttgtagtc aggccatgag 49623
tcatcgtttg gtatagcctg cagattatcc gacatgcagt caatttcaac gtcgcttaac 49683
gttaattggc gacttgccgg tcgaactcga acacgttccc catcaactcc aggttttagt 49743
tgataccaac caaaactaac aaagccggga ttatccatta gaaaacgagt ggtagcgtct 49803
acccgacctt catacttttt caactccggg tgaaagttat cacaaagata atttgtaaat 49863
ttagatgagg gagaatacac cctgtaaaac gcacatggct gtgtatcgta gtaataaaca 49923
tctgtgcgct caataacctc aacgcgaaag ctttctggag atgcgctttt aaacgaggta 49983
ccatgaaaag cgttcttgtc tccatttaac gttgcatcat tttgtgttat catagaactg 50043
cgtaaacact cggcaagtaa tacagataac tcgctaccgg aacgtatgcc acaagcggta 50103
tccacctcgg ctttgtttat ataaaaatat tgacagatgc cgtatacatg aactgccacc 50163
ctttttccac atcgggacat gccaagtaaa gtaataacgg taccaagcgg tcgtgttgca 50223
gttgcaaacc gggatacatc tccattagac gcggcttctg ttgtttcgac aatatcatat 50283
acatggaatg tgttaaagcg ggggtcaaac ttatccccac gaaagtcgat ttccccccaa 50343
atattcacgc gtctaggcca ggggctggaa caacgaaaat ccagaatcgg aacttctttt 50403
ccattacagt aaactttagg cggtcgacta agtgtaccga cgtgaacccc ctttcgttct 50463
tccatgggca catcttcatc taaacattta ggggccaaaa attgaaacga tgacatggta 50523
gttttgtaac tatgaagaaa ttctctgtta ctaccgcgcc cggttcttgg gttatattta 50583
atccctgatg cttgggttaa aaagggatta caaaaccccg ttctgatcgc cattttatgt 50643
taacgattga taatcttgta aaaagccagt gttactgagt aacacaaccc cacgcccttc 50703
taatacataa agtgtaatca cgtgatttgt tgtggtttcc gcatatgtaa tacccgttta 50763
aaagcctctc ttcttaatgt atcgacagac tgggttttgg gtggtcattt gaccctgcca 50823
acaacccccc attattacga gtacttcacc aaa atg gaa aat act cag aag 50874
Met Glu Asn Thr Gln Lys
7920
act gtg aca gtg ccc acg ggg ccc ctg ggt tac gtt tat gcg tgc 50919
Thr Val Thr Val Pro Thr Gly Pro Leu Gly Tyr Val Tyr Ala Cys
7925 7930 7935
cgg gtt gaa gat ttg gat ctg gag gaa att tca ttt ttg gcc gct 50964
Arg Val Glu Asp Leu Asp Leu Glu Glu Ile Ser Phe Leu Ala Ala
7940 7945 7950
cgt agc acg gac tct gat ttg gct tta tta cct ttg atg cgt aat 51009
Arg Ser Thr Asp Ser Asp Leu Ala Leu Leu Pro Leu Met Arg Asn
7955 7960 7965
ttg acc gtg gaa aaa act ttt aca tcc agc ctg gcg gtg gtt tct 51054
Leu Thr Val Glu Lys Thr Phe Thr Ser Ser Leu Ala Val Val Ser
7970 7975 7980
gga gca cgc act acg ggt ctt gcc gga gct ggt att acc tta aaa 51099
Gly Ala Arg Thr Thr Gly Leu Ala Gly Ala Gly Ile Thr Leu Lys
7985 7990 7995
ctc act acc agt cat ttc tat cca tct gtc ttt gtc ttt cac gga 51144
Leu Thr Thr Ser His Phe Tyr Pro Ser Val Phe Val Phe His Gly
8000 8005 8010
ggc aaa cac gtt tta ccc agc tcc gcg gcc cca aat ctc aca cgc 51189
Gly Lys His Val Leu Pro Ser Ser Ala Ala Pro Asn Leu Thr Arg
8015 8020 8025
gcg tgt aac gcg gct cga gaa cgg ttt ggg ttt tca cgc tgc caa 51234
Ala Cys Asn Ala Ala Arg Glu Arg Phe Gly Phe Ser Arg Cys Gln
8030 8035 8040
ggg cct cct gtt gac ggt gct gtt gag acg acc ggc gct gag ata 51279
Gly Pro Pro Val Asp Gly Ala Val Glu Thr Thr Gly Ala Glu Ile
8045 8050 8055
tgc acc cgc ctt gga tta gag cca gaa aat aca ata tta tac ttg 51324
Cys Thr Arg Leu Gly Leu Glu Pro Glu Asn Thr Ile Leu Tyr Leu
8060 8065 8070
gtg gtc acg gca ttg ttt aag gaa gcc gta ttt atg tgc aac gtg 51369
Val Val Thr Ala Leu Phe Lys Glu Ala Val Phe Met Cys Asn Val
8075 8080 8085
ttt ctg cat tat gga gga ctc gat att gtt cat att aac cat ggg 51414
Phe Leu His Tyr Gly Gly Leu Asp Ile Val His Ile Asn His Gly
8090 8095 8100
gat gtt ata cgt ata ccg tta ttt ccg gta caa ctt ttc atg ccc 51459
Asp Val Ile Arg Ile Pro Leu Phe Pro Val Gln Leu Phe Met Pro
8105 8110 8115
gat gtt aac cgt ctg gta ccc gac cca ttc aac act cat cac agg 51504
Asp Val Asn Arg Leu Val Pro Asp Pro Phe Asn Thr His His Arg
8120 8125 8130
tct atc gga gag ggt ttt gta tac cca aca ccc ttt tat aac acc 51549
Ser Ile Gly Glu Gly Phe Val Tyr Pro Thr Pro Phe Tyr Asn Thr
8135 8140 8145
ggg ttg tgc cat tta ata cat gac tgt gtt att gct ccc atg gcc 51594
Gly Leu Cys His Leu Ile His Asp Cys Val Ile Ala Pro Met Ala
8150 8155 8160
gtt gcc ttg cgc gtc aga aat gta act gcc gtc gcc cga gga gcg 51639
Val Ala Leu Arg Val Arg Asn Val Thr Ala Val Ala Arg Gly Ala
8165 8170 8175
gcc cac ctt gct ttt gat gaa aat cac gag ggg gca gta ctc ccc 51684
Ala His Leu Ala Phe Asp Glu Asn His Glu Gly Ala Val Leu Pro
8180 8185 8190
cct gac att acg tac acg tat ttt cag tcc tct tca agt gga acc 51729
Pro Asp Ile Thr Tyr Thr Tyr Phe Gln Ser Ser Ser Ser Gly Thr
8195 8200 8205
act acc gcc cgt gga gcg cgt cga aac gat gtc aac tcc acg tct 51774
Thr Thr Ala Arg Gly Ala Arg Arg Asn Asp Val Asn Ser Thr Ser
8210 8215 8220
aag cct agc cca tcg ggg ggg ttt gaa aga cgg ttg gcg tct att 51819
Lys Pro Ser Pro Ser Gly Gly Phe Glu Arg Arg Leu Ala Ser Ile
8225 8230 8235
atg gcc gct gac aca gcc ttg cac gca gaa gtt ata ttc aac act 51864
Met Ala Ala Asp Thr Ala Leu His Ala Glu Val Ile Phe Asn Thr
8240 8245 8250
gga att tac gaa gaa act cca aca gat atc aaa gaa tgg cca atg 51909
Gly Ile Tyr Glu Glu Thr Pro Thr Asp Ile Lys Glu Trp Pro Met
8255 8260 8265
ttt ata ggc atg gag ggc act ttg cca agg cta aac gct ctg ggg 51954
Phe Ile Gly Met Glu Gly Thr Leu Pro Arg Leu Asn Ala Leu Gly
8270 8275 8280
tca tat acc gct cgt gtg gcc ggg gtc att ggt gcg atg gtt ttc 51999
Ser Tyr Thr Ala Arg Val Ala Gly Val Ile Gly Ala Met Val Phe
8285 8290 8295
agc cca aat tct gcg ttg tat cta act gag gtg gag gat agc ggg 52044
Ser Pro Asn Ser Ala Leu Tyr Leu Thr Glu Val Glu Asp Ser Gly
8300 8305 8310
atg acc gaa gcc aag gat ggg gga ccg ggt cca tca ttt aat cga 52089
Met Thr Glu Ala Lys Asp Gly Gly Pro Gly Pro Ser Phe Asn Arg
8315 8320 8325
ttt tac cag ttt gcc gga cct cat tta gct gcg aat ccc caa aca 52134
Phe Tyr Gln Phe Ala Gly Pro His Leu Ala Ala Asn Pro Gln Thr
8330 8335 8340
gat cga gat ggc cac gtt cta tcc agt cag tct acg ggt tca tca 52179
Asp Arg Asp Gly His Val Leu Ser Ser Gln Ser Thr Gly Ser Ser
8345 8350 8355
aac aca gag ttt agc gtg gat tat ttg gca ctc att tgt gga ttt 52224
Asn Thr Glu Phe Ser Val Asp Tyr Leu Ala Leu Ile Cys Gly Phe
8360 8365 8370
gga gca ccc ctg ttg gcg cga ctg ctt ttt tat cta gaa cgc tgt 52269
Gly Ala Pro Leu Leu Ala Arg Leu Leu Phe Tyr Leu Glu Arg Cys
8375 8380 8385
gac gct ggt gcg ttt aca ggg ggt cac ggg gat gcg tta aaa tat 52314
Asp Ala Gly Ala Phe Thr Gly Gly His Gly Asp Ala Leu Lys Tyr
8390 8395 8400
gtt acg ggg acc ttt gac tct gaa att cca tgt agt tta tgt gaa 52359
Val Thr Gly Thr Phe Asp Ser Glu Ile Pro Cys Ser Leu Cys Glu
8405 8410 8415
aaa cac acg cgg ccg gta tgc gct cac aca aca gta cac cga ctt 52404
Lys His Thr Arg Pro Val Cys Ala His Thr Thr Val His Arg Leu
8420 8425 8430
aga caa cgc atg ccg cga ttt gga caa gcc acc cgt caa cct att 52449
Arg Gln Arg Met Pro Arg Phe Gly Gln Ala Thr Arg Gln Pro Ile
8435 8440 8445
ggg gtg ttt gga aca atg aac agc caa tat agc gac tgc gat cct 52494
Gly Val Phe Gly Thr Met Asn Ser Gln Tyr Ser Asp Cys Asp Pro
8450 8455 8460
cta gga aac tat gct cca tat tta atc ctt cga aaa ccc ggg gat 52539
Leu Gly Asn Tyr Ala Pro Tyr Leu Ile Leu Arg Lys Pro Gly Asp
8465 8470 8475
caa acg gaa gca gca aag gca acc atg cag gac act tat agg gct 52584
Gln Thr Glu Ala Ala Lys Ala Thr Met Gln Asp Thr Tyr Arg Ala
8480 8485 8490
aca cta gaa cgc ttg ttt atc gat cta gaa caa gag cga cta ctg 52629
Thr Leu Glu Arg Leu Phe Ile Asp Leu Glu Gln Glu Arg Leu Leu
8495 8500 8505
gat cgc ggt gcc cca tgt tct tcc gag gga cta tcg tct gtc att 52674
Asp Arg Gly Ala Pro Cys Ser Ser Glu Gly Leu Ser Ser Val Ile
8510 8515 8520
gtg gat cat cca acg ttt cgt cgc ata tta gac aca ctg cgt gcg 52719
Val Asp His Pro Thr Phe Arg Arg Ile Leu Asp Thr Leu Arg Ala
8525 8530 8535
cgt ata gaa cag aca aca aca caa ttt atg aaa gtg ttg gtt gag 52764
Arg Ile Glu Gln Thr Thr Thr Gln Phe Met Lys Val Leu Val Glu
8540 8545 8550
acc cgc gat tat aag atc cgt gaa gga tta tcc gaa gcc acc cat 52809
Thr Arg Asp Tyr Lys Ile Arg Glu Gly Leu Ser Glu Ala Thr His
8555 8560 8565
tca atg gcg tta acg ttt gat cca tac tca gga gca ttt tgt ccc 52854
Ser Met Ala Leu Thr Phe Asp Pro Tyr Ser Gly Ala Phe Cys Pro
8570 8575 8580
att acc aat ttt tta gtt aaa cga aca cac cta gcc gtg gta caa 52899
Ile Thr Asn Phe Leu Val Lys Arg Thr His Leu Ala Val Val Gln
8585 8590 8595
gac tta gca tta agc caa tgt cat tgt gta ttt tac gga cag caa 52944
Asp Leu Ala Leu Ser Gln Cys His Cys Val Phe Tyr Gly Gln Gln
8600 8605 8610
gtt gag ggg cgg aac ttt cgt aac caa ttc caa cct gtt ttg cgg 52989
Val Glu Gly Arg Asn Phe Arg Asn Gln Phe Gln Pro Val Leu Arg
8615 8620 8625
cgg cgt ttt gtt gac ctg ttt aat ggg ggg ttt ata tca aca cgc 53034
Arg Arg Phe Val Asp Leu Phe Asn Gly Gly Phe Ile Ser Thr Arg
8630 8635 8640
tct ata acc gta aca tta tct gaa ggt cct gta tcc gcc cca aat 53079
Ser Ile Thr Val Thr Leu Ser Glu Gly Pro Val Ser Ala Pro Asn
8645 8650 8655
ccg aca ttg gga caa gac gcg ccc gcg ggg cgt acc ttt gat ggg 53124
Pro Thr Leu Gly Gln Asp Ala Pro Ala Gly Arg Thr Phe Asp Gly
8660 8665 8670
gat tta gcg cgc gta agc gtg gaa gtt att cgg gat ata cga gtt 53169
Asp Leu Ala Arg Val Ser Val Glu Val Ile Arg Asp Ile Arg Val
8675 8680 8685
aaa aat agg gtc gtt ttt tca ggt aac tgt aca aat ctc tct gag 53214
Lys Asn Arg Val Val Phe Ser Gly Asn Cys Thr Asn Leu Ser Glu
8690 8695 8700
gca gcc cgg gca agg ctt gta ggc ctt gca agt gcg tac caa cgc 53259
Ala Ala Arg Ala Arg Leu Val Gly Leu Ala Ser Ala Tyr Gln Arg
8705 8710 8715
caa gaa aaa aga gtg gat atg tta cac ggg gcc cta ggg ttt ttg 53304
Gln Glu Lys Arg Val Asp Met Leu His Gly Ala Leu Gly Phe Leu
8720 8725 8730
ctt aaa cag ttt cac ggc ctg tta ttt cct cgg ggt atg cca cca 53349
Leu Lys Gln Phe His Gly Leu Leu Phe Pro Arg Gly Met Pro Pro
8735 8740 8745
aac agt aaa tcc ccc aac ccg cag tgg ttt tgg acc ctg tta caa 53394
Asn Ser Lys Ser Pro Asn Pro Gln Trp Phe Trp Thr Leu Leu Gln
8750 8755 8760
cgc aac cag atg ccg gca gat aaa ctt aca cac gaa gag att acc 53439
Arg Asn Gln Met Pro Ala Asp Lys Leu Thr His Glu GluIle Thr
8765 8770 8775
act att gca gct gtt aaa cgg ttt acc gag gaa tat gca gca ata 53484
Thr Ile Ala Ala Val Lys Arg Phe Thr Glu Glu Tyr Ala Ala Ile
8780 8785 8790
aac ttt att aat cta ccc cca acc tgc ata gga gaa tta gcc cag 53529
Asn Phe Ile Asn Leu Pro Pro Thr Cys Ile Gly Glu Leu Ala Gln
8795 8800 8805
ttt tat atg gca aat ctt att ctt aaa tac tgc gat cat tca cag 53574
Phe Tyr Met Ala Asn Leu Ile Leu Lys Tyr Cys Asp His Ser Gln
8810 8815 8820
tac ctt ata aat acc tta act tct ata att acg ggt gcc agg cgc 53619
Tyr Leu Ile Asn Thr Leu Thr Ser Ile Ile Thr Gly Ala Arg Arg
8825 8830 8835
ccg cgt gac cca tca tcc gtt ttg cat tgg att cgt aaa gat gtc 53664
Pro Arg Asp Pro Ser Ser Val Leu His Trp Ile Arg Lys Asp Val
8840 8845 8850
acg tcc gcc gcg gac ata gaa acc caa gca aag gcg ctt ctt gaa 53709
Thr Ser Ala Ala Asp Ile Glu Thr Gln Ala Lys Ala Leu Leu Glu
8855 8860 8865
aaa acg gaa aac tta ccg gaa tta tgg act acg gct ttt act tca 53754
Lys Thr Glu Asn Leu Pro Glu Leu Trp Thr Thr Ala Phe Thr Ser
8870 8875 8880
act cat tta gtc cgc gcg gcc atg aat caa cgt ccc atg gtc gtt 53799
Thr His Leu Val Arg Ala Ala Met Asn Gln Arg Pro Met Val Val
8885 8890 8895
tta gga ata agc att agt aaa tat cac gga gcg gca gga aac aac 53844
Leu Gly Ile Ser Ile Ser Lys Tyr His Gly Ala Ala Gly Asn Asn
8900 8905 8910
cgc gtc ttt cag gca ggg aat tgg agc ggt tta aac ggg ggt aaa 53889
Arg Val Phe Gln Ala Gly Asn Trp Ser Gly Leu Asn Gly Gly Lys
8915 8920 8925
aat gta tgc ccg cta ttt aca ttt gat cgc act cgc cgt ttt ata 53934
Asn Val Cys Pro Leu Phe Thr Phe Asp Arg Thr Arg Arg Phe Ile
8930 8935 8940
ata gca tgt cct aga gga ggt ttt atc tgc ccc gta aca ggt ccc 53979
Ile Ala Cys Pro Arg Gly Gly Phe Ile Cys Pro Val Thr Gly Pro
8945 8950 8955
tcg tcg gga aat cga gaa acc acc cta tcc gac caa gtt cgc ggt 54024
Ser Ser Gly Asn Arg Glu Thr Thr Leu Ser Asp Gln Val Arg Gly
8960 8965 8970
ata att gtc agt ggc ggg gcc atg gtt caa tta gcc ata tac gcc 54069
Ile Ile Val Ser Gly Gly Ala Met Val Gln Leu Ala Ile Tyr Ala
8975 8980 8985
acg gtt gtg cgt gca gtg ggc gct cga gca caa cat atg gca ttt 54114
Thr Val Val Arg Ala Val Gly Ala Arg Ala Gln His Met Ala Phe
8990 8995 9000
gac gac tgg tta agt ctt aca gac gat gag ttt tta gcc aga gac 54159
Asp Asp Trp Leu Ser Leu Thr Asp Asp Glu Phe Leu Ala Arg Asp
9005 9010 9015
ttg gag gag tta cac gac cag att atc caa acc ctg gaa acg ccc 54204
Leu Glu Glu Leu His Asp Gln Ile Ile Gln Thr Leu Glu Thr Pro
9020 9025 9030
tgg acc gta gaa ggc gct cta gaa gca gta aag att cta gat gaa 54249
Trp Thr Val Glu Gly Ala Leu Glu Ala Val Lys Ile Leu Asp Glu
9035 9040 9045
aaa acg aca gcg gga gat ggg gaa acc ccc aca aac cta gca ttt 54294
Lys Thr Thr Ala Gly Asp Gly Glu Thr Pro Thr Asn Leu Ala Phe
9050 9055 9060
aat ttt gat tct tgt gaa cca agc cat gac acc aca tct aac gta 54339
Asn Phe Asp Ser Cys Glu Pro Ser His Asp Thr Thr Ser Asn Val
9065 9070 9075
tta aac att tca ggg tca aac att tca ggg tca act gtc cct ggt 54384
Leu Asn Ile Ser Gly Ser Asn Ile Ser Gly Ser Thr Val Pro Gly
9080 9085 9090
ctt aaa cga ccc ccc gaa gat gac gaa ctc ttt gat ctt agt ggt 54429
Leu Lys Arg Pro Pro Glu Asp Asp Glu Leu Phe Asp Leu Ser Gly
9095 9100 9105
att ccc ata aaa cat ggg aac att aca atg gaa atg att taa 54471
Ile Pro Ile Lys His Gly Asn Ile Thr Met Glu Met Ile
9110 9115 9120
cctccctctt tatccaatta aagcccacac gcgggtgagt gtacgtaata aacaagtcaa 54531
tattacatat tctgttgtgt tttctttttt tgtgtgtagt ccttacccat atgacctgta 54591
atatagtgtg tctccaacca ttcagcttac agtccagtgg acagtaacag cccgataac 54650
atg gaa ttg gat att aat cga aca ttg ttg gtt cta ctg ggt caa 54695
Met Glu Leu Asp Ile Asn Arg Thr Leu Leu Val Leu Leu Gly Gln
9125 9130 9135
gtt tat acg tac atc ttt cag gtt gaa ctg cta cgt cga tgt gat 54740
Val Tyr Thr Tyr Ile Phe Gln Val Glu Leu Leu Arg Arg Cys Asp
9140 9145 9150
cca agg gtg gcg tgt cgc ttt tta tat cgg tta gcg gct aac tgt 54785
Pro Arg Val Ala Cys Arg Phe Leu Tyr Arg Leu Ala Ala Asn Cys
9155 9160 9165
ttg aca gtt cgt tat tta tta aag ctg ttt ctc cgg gga ttt aat 54830
Leu Thr Val Arg Tyr Leu Leu Lys Leu Phe Leu Arg Gly Phe Asn
9170 9175 9180
acc cag cta aaa ttt gga aac act ccc acg gtt tgt gca ctg cat 54875
Thr Gln Leu Lys Phe Gly Asn Thr Pro Thr Val Cys Ala Leu His
9185 9190 9195
tgg gca tta tgt tat gta aag gga gaa ggt gag cgt ttg ttt gag 54920
Trp Ala Leu Cys Tyr Val Lys Gly Glu Gly Glu Arg Leu Phe Glu
9200 9205 9210
ttg cta caa cat ttt aaa acg cgt ttt gtt tat ggt gag act aaa 54965
Leu Leu Gln His Phe Lys Thr Arg Phe Val Tyr Gly Glu Thr Lys
9215 9220 9225
gac tca aac tgt atc aaa gat tac ttt gtc tca gcg ttt aac tta 55010
Asp Ser Asn Cys Ile Lys Asp Tyr Phe Val Ser Ala Phe Asn Leu
9230 9235 9240
aaa acc tgc caa tat cac cat gag ctg tcg tta aca aca tac gga 55055
Lys Thr Cys Gln Tyr His His Glu Leu Ser Leu Thr Thr Tyr Gly
9245 9250 9255
ggt tac gta tcg agt gaa att cag ttt tta cac gac att gag aat 55100
Gly Tyr Val Ser Ser Glu Ile Gln Phe Leu His Asp Ile Glu Asn
9260 9265 9270
ttt tta aaa cag ctt aat tac tgc tat att atc acg tct tct cgt 55145
Phe Leu Lys Gln Leu Asn Tyr Cys Tyr Ile Ile Thr Ser Ser Arg
9275 9280 9285
gag gcg cta aac aca ttg gaa acc gtg acg cgg ttt atg aca gat 55190
Glu Ala Leu Asn Thr Leu Glu Thr Val Thr Arg Phe Met Thr Asp
9290 9295 9300
act ata gga agc ggt cta ata cca ccc gtg gag ttg ttt gat ccg 55235
Thr Ile Gly Ser Gly Leu Ile Pro Pro Val Glu Leu Phe Asp Pro
9305 9310 9315
gcg cat cca tgt gct ata tgt ttt gaa gaa tta tgt ata aca gct 55280
Ala His Pro Cys Ala Ile Cys Phe Glu Glu Leu Cys Ile Thr Ala
9320 9325 9330
aac caa ggt gag acc tta cat cgt aga tta tta gga tgt atc tgc 55325
Asn Gln Gly Glu Thr Leu His Arg Arg Leu Leu Gly Cys Ile Cys
9335 9340 9345
gat cac gtt act aag caa gtt cgg gtt aac gtg gat gtt gac gat 55370
Asp His Val Thr Lys Gln Val Arg Val Asn Val Asp Val Asp Asp
9350 9355 9360
att att cgg tgt tta cca tat atc cct gat gta ccg gat atc aaa 55415
Ile Ile Arg Cys Leu Pro Tyr Ile Pro Asp Val Pro Asp Ile Lys
9365 9370 9375
cgt caa tcc gcc gtt gaa gcg tta cga aca ctt caa acc aag acg 55460
Arg Gln Ser Ala Val Glu Ala Leu Arg Thr Leu Gln Thr Lys Thr
9380 9385 9390
gta gtc aat ccc atg gga gca aag aac gat acg ttt gac caa aca 55505
Val Val Asn Pro Met Gly Ala Lys Asn Asp Thr Phe Asp Gln Thr
9395 9400 9405
tac gaa att gcg agc acc atg ctt gat tct tat aat gtt ttt aaa 55550
Tyr Glu Ile Ala Ser Thr Met Leu Asp Ser Tyr Asn Val Phe Lys
9410 9415 9420
cct gcc cct cgg tgt atg tac gcc atc agc gag ctt aaa ttc tgg 55595
Pro Ala Pro Arg Cys Met Tyr Ala Ile Ser Glu Leu Lys Phe Trp
9425 9430 9435
tta acg tct aat tcc act gaa gga ccc caa cgt act tta gac gtg 55640
Leu Thr Ser Asn Ser Thr Glu Gly Pro Gln Arg Thr Leu Asp Val
9440 9445 9450
ttt gtt gat aat ttg gat gta tta aac gaa cat gaa aaa cac gca 55685
Phe Val Asp Asn Leu Asp Val Leu Asn Glu His Glu Lys His Ala
9455 9460 9465
gaa ctt aca gcc gta acg gtt gag ttg gcg tta ttt gga aaa act 55730
Glu Leu Thr Ala Val Thr Val Glu Leu Ala Leu Phe Gly Lys Thr
9470 9475 9480
ccc ata cac ttt gat agg gcg ttt tct gaa gaa ctc gga tct ctg 55775
Pro Ile His Phe Asp Arg Ala Phe Ser Glu Glu Leu Gly Ser Leu
9485 9490 9495
gat gca att gat agt att ttg gtt ggc aat cgc tca tcc tca cca 55820
Asp Ala Ile Asp Ser Ile Leu Val Gly Asn Arg Ser Ser Ser Pro
9500 9505 9510
gac agt cag ata gaa gca tta att aaa gcc tgt tat gcc cat cat 55865
Asp Ser Gln Ile Glu Ala Leu Ile Lys Ala Cys Tyr Ala His His
9515 9520 9525
cta tcg tcg cct ctc atg cgt cac att tct aac ccg agt cat gat 55910
Leu Ser Ser Pro Leu Met Arg His Ile Ser Asn Pro Ser His Asp
9530 9535 9540
aac gaa gcc gcc tta cgc caa ctt tta gaa aga gtt ggg tgt gag 55955
Asn Glu Ala Ala Leu Arg Gln Leu Leu Glu Arg Val Gly Cys Glu
9545 9550 9555
gat gat tta acc aaa gag gcg agt gac agc gct aca gca tcc gaa 56000
Asp Asp Leu Thr Lys Glu Ala Ser Asp Ser Ala Thr Ala Ser Glu
9560 9565 9570
tgt gat ctg aac gat gat agt agc ata act ttt gct gtt cat gga 56045
Cys Asp Leu Asn Asp Asp Ser Ser Ile Thr Phe Ala Val His Gly
9575 9580 9585
tgg gaa aac ctg tta tcc aaa gca aaa att gac gct gcg gaa aga 56090
Trp Glu Asn Leu Leu Ser Lys Ala Lys Ile Asp Ala Ala Glu Arg
9590 9595 9600
aaa cga gta tat ctt gaa cat ctg tct aag cgc tct cta acc agc 56135
Lys Arg Val Tyr Leu Glu His Leu Ser Lys Arg Ser Leu Thr Ser
9605 9610 9615
ctc ggt aga tgt atc cgc gaa cag cgc caa gag cta gaa aaa aca 56180
Leu Gly Arg Cys Ile Arg Glu Gln Arg Gln Glu Leu Glu Lys Thr
9620 9625 9630
ctc agg gta aac gtt tat gga gag gcc tta ttg cag aca ttt gtt 56225
Leu Arg Val Asn Val Tyr Gly Glu Ala Leu Leu Gln Thr Phe Val
9635 9640 9645
tcg atg caa aat ggg ttt ggg gca cga aac gtg ttt tta gct aag 56270
Ser Met Gln Asn Gly Phe Gly Ala Arg Asn Val Phe Leu Ala Lys
9650 9655 9660
gtt tcc cag gca ggg tgt att atc gac aat cgc att cag gaa gcg 56315
Val Ser Gln Ala Gly Cys Ile Ile Asp Asn Arg Ile Gln Glu Ala
9665 9670 9675
gcc ttt gat gca cat aga ttt ata agg aat acc tta gtt cga cat 56360
Ala Phe Asp Ala His Arg Phe Ile Arg Asn Thr Leu Val Arg His
9680 9685 9690
aca gta gat gcg gct atg tta cct gca ctt aca cat aaa ttt ttt 56405
Thr Val Asp Ala Ala Met Leu Pro Ala Leu Thr His Lys Phe Phe
9695 9700 9705
gag ttg gtc aac ggc cca ttg ttt aat cac gat gaa cac cgt ttt 56450
Glu Leu Val Asn Gly Pro Leu Phe Asn His Asp Glu His Arg Phe
9710 9715 9720
gca caa ccc cct aac acc gcc tta ttt ttt acc gtg gaa aac gtt 56495
Ala Gln Pro Pro Asn Thr Ala Leu Phe Phe Thr Val Glu Asn Val
9725 9730 9735
ggc cta ttt ccg cac tta aaa gag gaa ttg gca aag ttt atg ggc 56540
Gly Leu Phe Pro His Leu Lys Glu Glu Leu Ala Lys Phe Met Gly
9740 9745 9750
ggt gtc gtt ggt tcc aac tgg ctt ctc agt cca ttt agg ggc ttt 56585
Gly Val Val Gly Ser Asn Trp Leu Leu Ser Pro Phe Arg Gly Phe
9755 9760 9765
tat tgc ttt tct ggg gta gaa ggc gtt act ttt gca cag aga ctt 56630
Tyr Cys Phe Ser Gly Val Glu Gly Val Thr Phe Ala Gln Arg Leu
9770 9775 9780
gcc tgg aaa tat att agg gag ctt gtg ttt gca acc aca cta ttc 56675
Ala Trp Lys Tyr Ile Arg Glu Leu Val Phe Ala Thr Thr Leu Phe
9785 9790 9795
acc tct gtt ttc cat tgt ggg gag gtg cgg tta tgt cgc gtt gac 56720
Thr Ser Val Phe His Cys Gly Glu Val Arg Leu Cys Arg Val Asp
9800 9805 9810
cgt cta ggt aag gat cca cgc ggg tgc acg tct caa cct aaa ggt 56765
Arg Leu Gly Lys Asp Pro Arg Gly Cys Thr Ser Gln Pro Lys Gly
9815 9820 9825
ata ggc agt tcc cac gga ccc tta gac ggc att tat tta acg tac 56810
Ile Gly Ser Ser His Gly Pro Leu Asp Gly Ile Tyr Leu Thr Tyr
9830 9835 9840
gaa gaa aca tgt ccc ctt gtg gct att att caa agt gga gaa aca 56855
Glu Glu Thr Cys Pro Leu Val Ala Ile Ile Gln Ser Gly Glu Thr
9845 9850 9855
ggg atc gac cag aat acc gtc gta atc tac gat tca gac gtt ttt 56900
Gly Ile Asp Gln Asn Thr Val Val Ile Tyr Asp Ser Asp Val Phe
9860 9865 9870
tct ctt cta tac acc cta atg cag cgg ctg gct ccg gat tca acg 56945
Ser Leu Leu Tyr Thr Leu Met Gln Arg Leu Ala Pro Asp Ser Thr
9875 9880 9885
gac ccg gcg ttt tca taa cctccgttac gggggtgtgg ttatgctttt 56993
Asp Pro Ala Phe Ser
9890
tatgcatatt ttct atg ttt gtt acg gcg gtt gtg tcg gtc tct cca agc 57043
Met Phe Val Thr Ala Val Val Ser Val Ser Pro Ser
9895 9900
tcg ttt tat gag agt tta caa gta gag ccc aca caa tca gaa gat 57088
Ser Phe Tyr Glu Ser Leu Gln Val Glu Pro Thr Gln Ser Glu Asp
9905 9910 9915
ata acc cgg tct gct cat ctg ggc gat ggt gat gaa atc aga gaa 57133
Ile Thr Arg Ser Ala His Leu Gly Asp Gly Asp Glu Ile Arg Glu
9920 9925 9930
gct ata cac aag tcc cag gac gcc gaa aca aaa ccc acg ttt tac 57178
Ala Ile His Lys Ser Gln Asp Ala Glu Thr Lys Pro Thr Phe Tyr
9935 9940 9945
gtc tgc cca ccg cca aca ggc tcc aca atc gta cga tta gaa cca 57223
Val Cys Pro Pro Pro Thr Gly Ser Thr Ile Val Arg Leu Glu Pro
9950 9955 9960
act cgg aca tgt ccg gat tat cac ctt ggt aaa aac ttt aca gag 57268
Thr Arg Thr Cys Pro Asp Tyr His Leu Gly Lys Asn Phe Thr Glu
9965 9970 9975
ggt att gct gtt gtt tat aaa gaa aac att gca gcg tac aag ttt 57313
Gly Ile Ala Val Val Tyr Lys Glu AsnIle Ala Ala Tyr Lys Phe
9980 9985 9990
aag gcg acg gta tat tac aaa gat gtt atc gtt agc acg gcg tgg 57358
Lys Ala Thr Val Tyr Tyr Lys Asp Val Ile Val Ser Thr Ala Trp
9995 10000 10005
gcc gga agt tct tat acg caa att act aat aga tat gcg gat agg 57403
Ala Gly Ser Ser Tyr Thr Gln Ile Thr Asn Arg Tyr Ala Asp Arg
10010 10015 10020
gta cca att ccc gtt tca gag atc acg gac acc att gat aag ttt 57448
Val Pro Ile Pro Val Ser Glu Ile Thr Asp Thr Ile Asp Lys Phe
10025 10030 10035
ggc aag tgt tct tct aaa gca acg tac gta cga aat aac cac aaa 57493
Gly Lys Cys Ser Ser Lys Ala Thr Tyr Val Arg Asn Asn His Lys
10040 10045 10050
gtt gaa gcc ttt aat gag gat aaa aat cca cag gat atg cct cta 57538
Val Glu Ala Phe Asn Glu Asp Lys Asn Pro Gln Asp Met Pro Leu
10055 10060 10065
atc gca tca aaa tat aat tct gtg gga tcc aaa gca tgg cat act 57583
Ile Ala Ser Lys Tyr Asn Ser Val Gly Ser Lys Ala Trp His Thr
10070 10075 10080
acc aat gac acg tac atg gtt gcc gga acc ccc gga aca tat agg 57628
Thr Asn Asp Thr Tyr Met Val Ala Gly Thr Pro Gly Thr Tyr Arg
10085 10090 10095
acg ggc acg tcg gtg aat tgc atc att gag gaa gtt gaa gcc aga 57673
Thr Gly Thr Ser Val Asn Cys IleIle Glu Glu Val Glu Ala Arg
10100 10105 10110
tca ata ttc cct tat gat agt ttt gga ctt tcc acg gga gat ata 57718
Ser Ile Phe Pro Tyr Asp Ser Phe Gly Leu Ser Thr Gly Asp Ile
10115 10120 10125
ata tac atg tcc ccg ttt ttt ggc cta cgg gat ggt gca tac aga 57763
Ile Tyr Met Ser Pro Phe Phe Gly Leu Arg Asp Gly Ala Tyr Arg
10130 10135 10140
gaa cat tcc aat tat gca atg gat cgt ttt cac cag ttt gag ggt 57808
Glu His Ser Asn Tyr Ala Met Asp Arg Phe His Gln Phe Glu Gly
10145 10150 10155
tat aga caa agg gat ctt gac act aga gca tta ctg gaa cct gca 57853
Tyr Arg Gln Arg Asp Leu Asp Thr Arg Ala Leu Leu Glu Pro Ala
10160 10165 10170
gcg cgg aac ttt tta gtc acg cct cat tta acg gtt ggt tgg aac 57898
Ala Arg Asn Phe Leu Val Thr Pro His Leu Thr Val Gly Trp Asn
10175 10180 10185
tgg aag cca aaa cga acg gaa gtt tgt tcg ctt gtc aag tgg cgt 57943
Trp Lys Pro Lys Arg Thr Glu Val Cys Ser Leu Val Lys Trp Arg
10190 10195 10200
gag gtt gaa gac gta gtt cgc gat gag tat gca cac aat ttt cgc 57988
Glu Val Glu Asp Val Val Arg Asp Glu Tyr Ala His Asn Phe Arg
10205 10210 10215
ttt aca atg aaa aca ctt tct acc acg ttt ata agt gaa aca aac 58033
Phe Thr Met Lys Thr Leu Ser Thr Thr Phe Ile Ser Glu Thr Asn
10220 10225 10230
gag ttt aat ctt aac caa atc cat ctc agt caa tgt gta aag gag 58078
Glu Phe Asn Leu Asn Gln Ile His Leu Ser Gln Cys Val Lys Glu
10235 10240 10245
gaa gcc cgg gct att att aac cgg atc tat aca acc aga tac aac 58123
Glu Ala Arg Ala Ile Ile Asn Arg Ile Tyr Thr Thr Arg Tyr Asn
10250 10255 10260
tca tct cat gtt aga acc ggg gat atc cag acc tac ctt gcc aga 58168
Ser Ser His Val Arg Thr Gly Asp Ile Gln Thr Tyr Leu Ala Arg
10265 10270 10275
ggg ggg ttt gtt gtg gtg ttt caa ccc ctg ctg agc aat tcc ctc 58213
Gly Gly Phe Val Val Val Phe Gln Pro Leu Leu Ser Asn Ser Leu
10280 10285 10290
gcc cgt ctc tat ctc caa gaa ttg gtc cgt gaa aac act aat cat 58258
Ala Arg Leu Tyr Leu Gln Glu Leu Val Arg Glu Asn Thr Asn His
10295 10300 10305
tca cca caa aaa cac ccg act cga aat acc aga tcc cga cga agc 58303
Ser Pro Gln Lys His Pro Thr Arg Asn Thr Arg Ser Arg Arg Ser
10310 10315 10320
gtg cca gtt gag ttg cgt gcc aat aga aca ata aca acc acc tca 58348
Val Pro Val Glu Leu Arg Ala Asn Arg Thr Ile Thr Thr Thr Ser
10325 10330 10335
tcg gtg gaa ttt gct atg ctc cag ttt aca tat gac cac att caa 58393
Ser Val Glu Phe Ala Met Leu Gln Phe Thr Tyr Asp HisIle Gln
10340 10345 10350
gag cat gtt aat gaa atg ttg gca cgt atc tcc tcg tcg tgg tgc 58438
Glu His Val Asn Glu Met Leu Ala Arg Ile Ser Ser Ser Trp Cys
10355 10360 10365
cag cta caa aat cgc gaa cgc gcc ctt tgg agc gga cta ttt cca 58483
Gln Leu Gln Asn Arg Glu Arg Ala Leu Trp Ser Gly Leu Phe Pro
10370 10375 10380
att aac cca agt gct tta gcg agc acc att ttg gat caa cgt gtt 58528
Ile Asn Pro Ser Ala Leu Ala Ser Thr Ile Leu Asp Gln Arg Val
10385 10390 10395
aaa gct cgt att ctc ggc gac gtt atc tcc gtt tct aat tgt cca 58573
Lys Ala Arg Ile Leu Gly Asp Val Ile Ser Val Ser Asn Cys Pro
10400 10405 10410
gaa ctg gga tca gat aca cgc att ata ctt caa aac tct atg agg 58618
Glu Leu Gly Ser Asp Thr Arg Ile Ile Leu Gln Asn Ser Met Arg
10415 10420 10425
gta tct ggt agt act acg cgt tgt tat agc cgt cct tta att tca 58663
Val Ser Gly Ser Thr Thr Arg Cys Tyr Ser Arg Pro Leu Ile Ser
10430 10435 10440
ata gtt agt tta aat ggg tcc ggg acg gtg gag ggc cag ctt gga 58708
Ile Val Ser Leu Asn Gly Ser Gly Thr Val Glu Gly Gln Leu Gly
10445 10450 10455
aca gat aac gag tta att atg tcc aga gat ctg tta gaa cca tgc 58753
Thr Asp Asn Glu Leu Ile Met Ser Arg Asp Leu Leu Glu Pro Cys
10460 10465 10470
gtg gct aat cac aag cga tat ttt cta ttt ggg cat cac tac gta 58798
Val Ala Asn His Lys Arg Tyr Phe Leu Phe Gly His His Tyr Val
10475 10480 10485
tat tat gag gat tat cgt tac gtc cgt gaa atc gca gtc cat gat 58843
Tyr Tyr Glu Asp Tyr Arg Tyr Val Arg Glu Ile Ala Val His Asp
10490 10495 10500
gtg gga atg att agc act tac gta gat tta aac tta aca ctt ctt 58888
Val Gly Met Ile Ser Thr Tyr Val Asp Leu Asn Leu Thr Leu Leu
10505 10510 10515
aaa gat aga gag ttt atg ccg ctg caa gta tat aca aga gac gag 58933
Lys Asp Arg Glu Phe Met Pro Leu Gln Val Tyr Thr Arg Asp Glu
10520 10525 10530
ctg cgg gat aca gga tta cta gac tac agt gaa att caa cgc cga 58978
Leu Arg Asp Thr Gly Leu Leu Asp Tyr Ser Glu Ile Gln Arg Arg
10535 10540 10545
aat caa atg cat tcg ctg cgt ttt tat gac ata gac aag gtt gtg 59023
Asn Gln Met His Ser Leu Arg Phe Tyr Asp Ile Asp Lys Val Val
10550 10555 10560
caa tat gat agc gga acg gcc att atg cag ggc atg gct cag ttt 59068
Gln Tyr Asp Ser Gly Thr Ala Ile Met Gln Gly Met Ala Gln Phe
10565 10570 10575
ttc cag gga ctt ggg acc gcg ggc cag gcc gtt gga cat gtg gtt 59113
Phe Gln Gly Leu Gly Thr Ala Gly Gln Ala Val Gly His Val Val
10580 10585 10590
ctt ggg gcc acg gga gcg ctg ctt tcc acc gta cac gga ttt acc 59158
Leu Gly Ala Thr Gly Ala Leu Leu Ser Thr Val His Gly Phe Thr
10595 10600 10605
acg ttt tta tct aac cca ttt ggg gca ttg gcc gtg gga tta ttg 59203
Thr Phe Leu Ser Asn Pro Phe Gly Ala Leu Ala Val Gly Leu Leu
10610 10615 10620
gtt ttg gcg gga ctg gta gcg gcc ttt ttt gcg tac cgg tac gtg 59248
Val Leu Ala Gly Leu Val Ala Ala Phe Phe Ala Tyr Arg Tyr Val
10625 10630 10635
ctt aaa ctt aaa aca agc ccg atg aag gca tta tat cca ctc aca 59293
Leu Lys Leu Lys Thr Ser Pro Met Lys Ala Leu Tyr Pro Leu Thr
10640 10645 10650
acc aag ggg tta aaa cag tta ccg gaa gga atg gat ccc ttt gcc 59338
Thr Lys Gly Leu Lys Gln Leu Pro Glu Gly Met Asp Pro Phe Ala
10655 10660 10665
gag aaa ccc aac gct act gat acc cca ata gaa gaa att ggc gac 59383
Glu Lys Pro Asn Ala Thr Asp Thr Pro Ile Glu Glu Ile Gly Asp
10670 10675 10680
tca caa aac act gaa ccg tcg gta aat agc ggg ttt gat ccc gat 59428
Ser Gln Asn Thr Glu Pro Ser Val Asn Ser Gly Phe Asp Pro Asp
10685 10690 10695
aaa ttt cga gaa gcc cag gaa atg att aaa tat atg acg tta gta 59473
Lys Phe Arg Glu Ala Gln Glu Met Ile Lys Tyr Met Thr Leu Val
10700 10705 10710
tct gcg gct gag cgc caa gaa tct aaa gcc cgc aaa aaa aat aag 59518
Ser Ala Ala Glu Arg Gln Glu Ser Lys Ala Arg Lys Lys Asn Lys
10715 10720 10725
act agc gcc ctt tta act tca cgt ctt acc ggc ctt gct tta cga 59563
Thr Ser Ala Leu Leu Thr Ser Arg Leu Thr Gly Leu Ala Leu Arg
10730 10735 10740
aat cgc cga gga tac tcc cgt gtt cgc acc gag aat gta acg ggg 59608
Asn Arg Arg Gly Tyr Ser Arg Val Arg Thr Glu Asn Val Thr Gly
10745 10750 10755
gtg taa atagccaggg ggtttgtttt aatttattaa taaaaatgtg tattacgtta 59664
Val
10760
ctcatgtgtc tccattacgc atcacagggg gtatttatac ccgataatat acaaaacgcg 59724
ttttgtacct ctaccgcacc cgatatctta acggggttat t atg gaa tcg tct 59777
Met Glu Ser Ser
aac att aac gcg cta caa caa ccg tcg tct atc gca cat cat ccg 59822
Asn Ile Asn Ala Leu Gln Gln Pro Ser Ser Ile Ala His His Pro
10765 10770 10775
tcc aaa cag tgc gct tca agt ctc aat gaa aca gta aaa gat tct 59867
Ser Lys Gln Cys Ala Ser Ser Leu Asn Glu Thr Val Lys Asp Ser
10780 10785 10790
ccc ccc gcg att tat gaa gat agg tta gaa cac acg ccg gta caa 59912
Pro Pro Ala Ile Tyr Glu Asp Arg Leu Glu His Thr Pro Val Gln
10795 10800 10805
tta ccc cgc gac ggt aca ccc cga gac gta tgt tct gtg gga cag 59957
Leu Pro Arg Asp Gly Thr Pro Arg Asp Val Cys Ser Val Gly Gln
10810 10815 10820
cta acc tgt cga gca tgt gca acg aaa cct ttt cgc ctt aac cgc 60002
Leu Thr Cys Arg Ala Cys Ala Thr Lys Pro Phe Arg Leu Asn Arg
10825 10830 10835
gac agc caa tac gac tac tta aac aca tgt cca ggg ggc cgt cat 60047
Asp Ser Gln Tyr Asp Tyr Leu Asn Thr Cys Pro Gly Gly Arg His
10840 10845 10850
att tca ctg gca ctg gag att ata acg ggt cga tgg gtt tgc atc 60092
Ile Ser Leu Ala Leu Glu Ile Ile Thr Gly Arg Trp Val Cys Ile
10855 10860 10865
ccg cgt gtg ttt ccg gat acc cca gag gaa aaa tgg atg gcg cca 60137
Pro Arg Val Phe Pro Asp Thr Pro Glu Glu Lys Trp Met Ala Pro
10870 10875 10880
tat att att cca gac cga gaa caa cca tca tca ggg gat gaa gat 60182
Tyr Ile Ile Pro Asp Arg Glu Gln Pro Ser Ser Gly Asp Glu Asp
10885 10890 10895
tct gac acc gat taa atttaactta aataaaacct taccacccat aaaaacgcct 60237
Ser Asp Thr Asp
10900
tctgtttgtt taacacgaca ccgcttaaca aaaaaaaaaa aaccaaacac gccttttatg 60297
aatgtaatac ttttatttgt tggttaacac cgccccacca tcatctgatt tgcaaacata 60357
tcggcgtcgt ctgccgtgga cccctgtatt aaaggggcct tggaactcgc ctccactgca 60417
tttacatctt gtccaactgt atctgtatgt ggggtgcttg ttgtattttg ggatgagcat 60477
agacccgaaa cgctttgaag ctgttttaat aaaatcgata ttcgaggatc ccgtgtcccc 60537
tctggtatat ttgtatggtg cgacaaaggc atttgtgtcc cattttgtga ttttagctct 60597
gtaacctcct gttgcagttt tgccacaacc ccagcaagct cttcgtgctg accattagaa 60657
actctgtgtc tcctctgcca atatgatgga gaaactcgac gtctccgatg cgttatatac 60717
gttggttcac cgggaaaata tatatttgag ggaaactctc cgtccatttg agactcccca 60777
ctataaaaag aatccaattc cctttgatcc atgctcttga aatcccgttt tcctggacga 60837
cggacatcgg ttttgtctgg aaaatttaca cacggggtct gcaagtcaat accccgttcg 60897
gcggccaatg cgttcataaa tgcggacatt tgcatttcca aacgattggg tggtggatat 60957
cccggaaacc cgtacggtcc cccgaagtgt cccggagggc aaccataacc ccctgtatta 61017
ggtgggaagg caggcgggtg tggagatcca tatggcccga cgatatactg tccgttattt 61077
ggagctccaa ttgatacctg cggattttta gtctgcccgg ttaacagctg tgaataatac 61137
gcggtaggta tcagtacaaa ttcccctccg gttggaacgc ccgacggggg ctgtggtgag 61197
atattactag cgttacctgc tacagaagcc atatcgctgt cgttcctaca caactgcgta 61257
acctttaaat gcggaacagt cttttcacaa tcttcatttg attccccaac acccaacgcg 61317
agatcgtata tgggcccgcc ggggtggaat gtggcgttta taacacccgc gttgggtaat 61377
ttagactcca ccccattaac gttggttatc cgagcaagtc catatccggt gctagcctga 61437
agataaacgt gacccataat tccggcttcg cgtctacgtt ttgcaaccac gtcccatcta 61497
tctcttaaaa gcatattgtt cacggctgtg gataataaca ccttggcgag tttatcttcg 61557
ctaaccttcc atactttatt taaacccgcg tagtctttaa ccagcgacaa taaccgcgct 61617
ttactttcca tcgataaaac ccggaatggt tcaattgaag attccggggt acagtcataa 61677
ttgaccactg ttccaacgcg tcttccaaca acacataacg caacatgggt aaaaaaatta 61737
ccgtctggta tctcattcgg ggacaatcgt tttgaagaca gggatacgga gggtaagtaa 61797
tttgtgacca agtataacgc acgttctagc ggagataata cagaatctct atttccaaaa 61857
aaattcgaat gggccgcttc aaacagcacc gcatgtagtt gagggcatct aacgataccc 61917
aaaaaaaaag gtccgcgtat gtcctcaatg attgcgatta cttcacccac gacacagtct 61977
tttcgatgat cgatgtttat tggtatttta ctagtaggcg gcaaagcgga ccgcacaatc 62037
tctggggtaa tatttaattc cccttcgtcc tttgaatata aggctaaata cccagccacg 62097
tataacgctt cacagttctc ttcgtcagct tcagcagcca ttataaacac cccacggacc 62157
ggatagtgaa tactcacggt gtggaggcaa actgaggaat gacacccaaa cagacaaaat 62217
atagaagatc atagtcactg ttaacgttga actgcgcaag gcggcgactt tcttccaatg 62277
ccgcccttac acgcggttgg tgcattaaca ttccaagtcc ccgttcatat tgcaacataa 62337
cactgtcatg tattgatacc acggcggcta tgggtaggga tgtaacattt tgtcggcggt 62397
gttctaattc caatgcaatt aagcttatga gccgatcttg gtactgtcca gaagaaatat 62457
ctattacggt tcttcctaaa cttccacgac taagctgggt atgcgcgtct aaacaaagag 62517
caactaatcc aggaaacatt tcagtcagct ctgtggtccg atttaacgta tacagtggtg 62577
ctatatatcg ttcacataaa aattgaaagt tattattacc gcttttaaac ttcccatcaa 62637
accccgtcgc tccgcgcaag attacattgt tggtaggggt tcctgttgct tctgacacaa 62697
tcaaacccag ttgaaaatta ttttttagtt tatctccgta tacgttcccg ttccataata 62757
agcgccttaa taataataac gccgtaatcg tgtcaattgt taaccttaat agagtttggt 62817
cttccataag aaacacgttt tgggcccgtt ctaaatacgc cgcggccgcc tgttgaatct 62877
tgtccacata tgcggtatga ttgcgatcaa taatgtcatt aaccccagga ttaaactgtc 62937
caggtgcagg cggtaggacc tgcaaccgta taagcgcatc cataacagaa tgtgacgtta 62997
aggcgccttg atcataccgc cccccacgag catgaaactg gtcgcgtggt agacgatcat 63057
agcaaaattg ataactgttt ttattttcgt gtgttgtcat ataattcaca aatgtctcag 63117
tatattccgg taggtgctct ataaggttcc cgaaggacga aacttgaggt tcgtggacac 63177
tattagatgt cctatacatt aaatataaac ataataccgc acactcgaac gcggagtacg 63237
ctctatctcc aacatacatt ctcccggcgg actgtagaca tgttaccgtt gtgttcataa 63297
acgtacggga aatgcgcccg tctttacaat caactccgcg tgcagctacg ggcctatcta 63357
acacaagccg ttcctgcaga gtacgatacc atggcccgaa aacaatccct ggagagttat 63417
tgccccttgc ccttcccaag tacaccaggg tgataaaatc cacttgaaag tttgtatcgt 63477
actgcaacgg tgcatcattt ttggcaatct gtacctcggg gtgtatagac tcattgcgta 63537
ttatttctgt acgtgtacat tcctcagatt gtgcatctgc ttcttccgcc tcggcagcag 63597
ccgtctccag ggaatccaaa accttggcca tgcgcgttag ttgttcttcg aggggcttta 63657
aacgacgatc tatttccgtt ggtaacgtaa tcgtttcccc gcgaaggttg tctaatgcgg 63717
caacggccgc cgcatttttt aacgttaacg tatttttttc caaatcggga ttcatacgcc 63777
ctcttaactc aaacgcggga gccgtccagt agtgtatggg gaagttgggg gctataaagt 63837
tcttagtggt agacaaaaat atcccacatt tattcggaaa cgagatagat ccgaacccat 63897
atctcgccgt catggtgtct gcagcaaaca aagtcaactg gcgtgaatat aaaccggtac 63957
tgctttaaaa gctgttttct tacccatggg aaaacatccc ggttatactt tgtaaaattc 64017
caccacaagc acctaaagaa ggccttctaa ggggtaaatc caccccacaa gctgcatttt 64077
cttcaaactt tgttaaagcg gaacgatggc atgatttcgc acgctttttc gcaagagaac 64137
atacgtgaat tttctttttg catagacgtc ttcgctctct aacggacctt atcggggggg 64197
tatattccgc tacattctcc aaatgcgacg ctagcataac aaggtttcca tgaatcacct 64257
ttgggggtaa ccgagttacc tgtaacaggt tcagaccccg ttgagataca aacacaagga 64317
ggggggtcac cattatttca tcagatcccg tgggtgtggt ttcctttatt aaagccatgg 64377
tatccctcag ctggcgcata ccctcgcaaa actggtgata cttagtaggg gtatgtatat 64437
tagcgctaaa acggcaagat tttaattcca ctataaaaca aacggtcttt ccggcaccac 64497
tggattccgt ttgtataata caaacacaat cggggcgtcg gcgtcccaaa tttacttcaa 64557
acgacattga tatgcgtaca gccctttgaa catccacgtg ggataacggc gacaggagtt 64617
ttgccagcct cgggttgaac gcgtccgcga aacctcgacg tacgttatca atatcctttt 64677
tgagtacatc gtaaaaacga gtgtggcaac gttgtcccaa acgaaaacac ttggcccgaa 64737
ttcgactagc ggacatattt gaagttccgt cccagaagat aacctaagac gcgtttgtct 64797
acaataaac atg tca acg gat aaa acc gat gta aaa atg ggc gtt 64842
Met Ser Thr Asp Lys Thr Asp Val Lys Met Gly Val
10905 10910 10915
ttg cgt att tat ttg gac ggg gcg tat gga att gga aaa aca acc 64887
Leu Arg Ile Tyr Leu Asp Gly Ala Tyr Gly Ile Gly Lys Thr Thr
10920 10925 10930
gcc gcc gaa gaa ttt tta cac cac ttt gca ata aca cca aac cgg 64932
Ala Ala Glu Glu Phe Leu His His Phe Ala Ile Thr Pro Asn Arg
10935 10940 10945
atc tta ctc att ggg gag ccc ctg tcg tat tgg cgt aac ctt gca 64977
Ile Leu Leu Ile Gly Glu Pro Leu Ser Tyr Trp Arg Asn Leu Ala
10950 10955 10960
ggg gag gac gcc att tgc gga att tac gga aca caa act cgc cgt 65022
Gly Glu Asp Ala Ile Cys Gly Ile Tyr Gly Thr Gln Thr Arg Arg
10965 10970 10975
ctt aat gga gac gtt tcg cct gaa gac gca caa cgc ctc acg gct 65067
Leu Asn Gly Asp Val Ser Pro Glu Asp Ala Gln Arg Leu Thr Ala
10980 10985 10990
cat ttt cag agc ctg ttc tgt tct ccg cat gca att atg cat gcg 65112
His Phe Gln Ser Leu Phe Cys Ser Pro His Ala Ile Met His Ala
10995 11000 11005
aaa atc tcg gca ttg atg gac aca agt aca tcg gat ctc gta caa 65157
Lys Ile Ser Ala Leu Met Asp Thr Ser Thr Ser Asp Leu Val Gln
11010 11015 11020
gta aat aag gag ccg tat aaa att atg tta tcc gac cga cac cca 65202
Val Asn Lys Glu Pro Tyr Lys Ile Met Leu Ser Asp Arg His Pro
11025 11030 11035
atc gcc tca act ata tgt ttt ccc ttg tcc aga tac tta gtg gga 65247
Ile Ala Ser Thr Ile Cys Phe Pro Leu Ser Arg Tyr Leu Val Gly
11040 11045 11050
gat atg tcc cca gcg gcg ctt cct ggg tta ttg ttt acg ctt ccc 65292
Asp Met Ser Pro Ala Ala Leu Pro Gly Leu Leu Phe Thr Leu Pro
11055 11060 11065
gct gaa ccc ccc ggg acc aac ttg gta gtt tgt acc gtt tca ctc 65337
Ala Glu Pro Pro Gly Thr Asn Leu Val Val Cys Thr Val Ser Leu
11070 11075 11080
ccc agt cat tta tcc aga gta agc aaa cgg gcc aga ccg gga gaa 65382
Pro Ser His Leu Ser Arg Val Ser Lys Arg Ala Arg Pro Gly Glu
11085 11090 11095
acg gtt aat ctg ccg ttt gtt atg gtt ctg aga aat gta tat ata 65427
Thr Val Asn Leu Pro Phe Val Met Val Leu Arg Asn Val Tyr Ile
11100 11105 11110
atg ctt att aat aca att ata ttt ctt aaa act aac aac tgg cac 65472
Met Leu Ile Asn Thr Ile Ile Phe Leu Lys Thr Asn Asn Trp His
11115 11120 11125
gcg ggc tgg aac aca ctg tca ttt tgt aat gat gta ttt aaa cag 65517
Ala Gly Trp Asn Thr Leu Ser Phe Cys Asn Asp Val Phe Lys Gln
11130 11135 11140
aaa tta caa aaa tcc gag tgt ata aaa cta cgc gaa gta cct ggg 65562
Lys Leu Gln Lys Ser Glu Cys Ile Lys Leu Arg Glu Val Pro Gly
11145 11150 11155
att gaa gac acg tta ttc gcc gtg ctt aaa ctt ccg gag ctt tgc 65607
Ile Glu Asp Thr Leu Phe Ala Val Leu Lys Leu Pro Glu Leu Cys
11160 11165 11170
gga gag ttt gga aat att ctg ccg tta tgg gca tgg gga atg gag 65652
Gly Glu Phe Gly Asn Ile Leu Pro Leu Trp Ala Trp Gly Met Glu
11175 11180 11185
acc ctt tca aac tgc tca cga agc atg tct ccg ttc gta tta tcg 65697
Thr Leu Ser Asn Cys Ser Arg Ser Met Ser Pro Phe Val Leu Ser
11190 11195 11200
tta gaa cag aca ccc cag cat gcg gca caa gaa cta aaa act ctg 65742
Leu Glu Gln Thr Pro Gln His Ala Ala Gln Glu Leu Lys Thr Leu
11205 11210 11215
cta ccc cag atg acc ccg gca aac atg tcc tcc ggt gca tgg aat 65787
Leu Pro Gln Met Thr Pro Ala Asn Met Ser Ser Gly Ala Trp Asn
11220 11225 11230
ata ttg aaa gag ctt gtt aat gcc gtt cag gac aac act tcc taa 65832
Ile Leu Lys Glu Leu Val Asn Ala Val Gln Asp Asn Thr Ser
11235 11240
atatacctag tatttacgta tgtaccagta aaaagatgat acacattgtc atactcgcgt 65892
gtacgtgttt ttctttttta tatatgcgtc atttattacc acatccttta atcccgcctt 65952
tatctcccta aaacggagtg gtaatattaa aagccgccaa gcctgttggt gggtgaggag 66012
gggtaaaggc acgctgtgtg cataacgttg cggtgatatt gtagcgcaag taacagcgac 66072
t atg ttt gcg cta gtt tta gcg gtg gta att ctt cct ctt tgg 66115
Met Phe Ala Leu Val Leu Ala Val Val Ile Leu Pro Leu Trp
11245 11250 11255
acc acg gct aat aaa tct tac gta aca cca acc cct gcg act cgc 66160
Thr Thr Ala Asn Lys Ser Tyr Val Thr Pro Thr Pro Ala Thr Arg
11260 11265 11270
tct atc gga cat atg tct gct ctt cta cga gaa tat tcc gac cgt 66205
Ser Ile Gly His Met Ser Ala Leu Leu Arg Glu Tyr Ser Asp Arg
11275 11280 11285
aat atg tct ctg aaa tta gaa gcc ttt tat cct act ggt ttc gat 66250
Asn Met Ser Leu Lys Leu Glu Ala Phe Tyr Pro Thr Gly Phe Asp
11290 11295 11300
gaa gaa ctc att aaa tca ctt cac tgg gga aat gat aga aaa cac 66295
Glu Glu Leu Ile Lys Ser Leu His Trp Gly Asn Asp Arg Lys His
11305 11310 11315
gtt ttc ttg gtt att gtt aag gtt aac cct aca aca cac gaa gga 66340
Val Phe Leu Val Ile Val Lys Val Asn Pro Thr Thr His Glu Gly
11320 11325 11330
gac gtc ggg ctg gtt ata ttt cca aaa tac ttg tta tcg cca tac 66385
Asp Val Gly Leu Val Ile Phe Pro Lys Tyr Leu Leu Ser Pro Tyr
11335 11340 11345
cat ttc aaa gca gaa cat cga gca ccg ttt cct gct gga cgt ttt 66430
His Phe Lys Ala Glu His Arg Ala Pro Phe Pro Ala Gly Arg Phe
11350 11355 11360
gga ttt ctt agt cac cct gtg aca ccc gac gtg agc ttc ttt gac 66475
Gly Phe Leu Ser His Pro Val Thr Pro Asp Val Ser Phe Phe Asp
11365 11370 11375
agt tcg ttt gcg ccg tat tta act acg caa cat ctt gtt gcg ttt 66520
Ser Ser Phe Ala Pro Tyr Leu Thr Thr Gln His Leu Val Ala Phe
11380 11385 11390
act acg ttc cca cca aac ccc ctt gta tgg cat ttg gaa aga gct 66565
Thr Thr Phe Pro Pro Asn Pro Leu Val Trp His Leu Glu Arg Ala
11395 11400 11405
gag acc gca gca act gca gaa agg ccg ttt ggg gta agt ctt tta 66610
Glu Thr Ala Ala Thr Ala Glu Arg Pro Phe Gly Val Ser Leu Leu
11410 11415 11420
ccc gct cgc cca aca gtc ccc aag aat act att ctg gaa cat aaa 66655
Pro Ala Arg Pro Thr Val Pro Lys Asn Thr Ile Leu Glu His Lys
11425 11430 11435
gcg cat ttt gct aca tgg gat gcc ctt gcc cga cat act ttt ttt 66700
Ala His Phe Ala Thr Trp Asp Ala Leu Ala Arg His Thr Phe Phe
11440 11445 11450
tct gcc gaa gca att atc acc aac tca acg ttg aga ata cac gtt 66745
Ser Ala Glu Ala Ile Ile Thr Asn Ser Thr Leu Arg Ile His Val
11455 11460 11465
ccc ctt ttt ggg tcg gta tgg cca att cga tac tgg gcc acc ggt 66790
Pro Leu Phe Gly Ser Val Trp Pro Ile Arg Tyr Trp Ala Thr Gly
11470 11475 11480
tcg gtg ctt ctc aca agc gac tcg ggt cgt gtg gaa gta aat att 66835
Ser Val Leu Leu Thr Ser Asp Ser Gly Arg Val Glu Val Asn Ile
11485 11490 11495
ggt gta gga ttt atg agc tcg ctc att tct tta tcc tct gga cca 66880
Gly Val Gly Phe Met Ser Ser Leu Ile Ser Leu Ser Ser Gly Pro
11500 11505 11510
ccg ata gaa tta att gtt gta cca cat aca gta aaa ctg aac gcg 66925
Pro Ile Glu Leu Ile Val Val Pro His Thr Val Lys Leu Asn Ala
11515 11520 11525
gtt aca agc gac acc aca tgg ttc cag cta aat cca ccg ggt ccg 66970
Val Thr Ser Asp Thr Thr Trp Phe Gln Leu Asn Pro Pro Gly Pro
11530 11535 11540
gat ccg ggg cca tct tat cga gtt tat tta ctt gga cgt ggg ttg 67015
Asp Pro Gly Pro Ser Tyr Arg Val Tyr Leu Leu Gly Arg Gly Leu
11545 11550 11555
gat atg aat ttt tca aag cat gct acg gtc gat ata tgc gca tat 67060
Asp Met Asn Phe Ser Lys His Ala Thr Val Asp Ile Cys Ala Tyr
11560 11565 11570
ccc gaa gag agt ttg gat tac cgc tat cat tta tcc atg gcc cac 67105
Pro Glu Glu Ser Leu Asp Tyr Arg Tyr His Leu Ser Met Ala His
11575 11580 11585
acg gag gct ctg cgg atg aca acg aag gcg gat caa cat gac ata 67150
Thr Glu Ala Leu Arg Met Thr Thr Lys Ala Asp Gln His Asp Ile
11590 11595 11600
aac gag gaa agc tat tac cat atc gcc gca aga ata gcc aca tca 67195
Asn Glu Glu Ser Tyr Tyr His Ile Ala Ala Arg Ile Ala Thr Ser
11605 11610 11615
att ttt gcg ttg tcg gaa atg ggc cgt acc aca gaa tat ttt ctg 67240
Ile Phe Ala Leu Ser Glu Met Gly Arg Thr Thr Glu Tyr Phe Leu
11620 11625 11630
tta gat gag atc gta gat gtt cag tat caa tta aaa ttc ctt aat 67285
Leu Asp Glu Ile Val Asp Val Gln Tyr Gln Leu Lys Phe Leu Asn
11635 11640 11645
tac att tta atg cgg ata gga gca gga gct cat ccc aac act ata 67330
Tyr Ile Leu Met Arg Ile Gly Ala Gly Ala His Pro Asn Thr Ile
11650 11655 11660
tcc gga acc tcg gat ctg atc ttt gcc gat cca tcg cag ctt cat 67375
Ser Gly Thr Ser Asp Leu Ile Phe Ala Asp Pro Ser Gln Leu His
11665 11670 11675
gac gaa ctt tca ctt ctt ttt ggt cag gta aaa ccc gca aat gtc 67420
Asp Glu Leu Ser Leu Leu Phe Gly Gln Val Lys Pro Ala Asn Val
11680 11685 11690
gat tat ttt att tca tat gat gaa gcc cgt gat caa cta aag acc 67465
Asp Tyr Phe Ile Ser Tyr Asp Glu Ala Arg Asp Gln Leu Lys Thr
11695 11700 11705
gca tac gcg ctt tcc cgt ggt caa gac cat gtg aat gca ctt tct 67510
Ala Tyr Ala Leu Ser Arg Gly Gln Asp His Val Asn Ala Leu Ser
11710 11715 11720
ctc gcc agg cgt gtt ata atg agc ata tac aag ggg ctg ctt gtg 67555
Leu Ala Arg Arg Val Ile Met Ser Ile Tyr Lys Gly Leu Leu Val
11725 11730 11735
aag caa aat tta aat gct aca gag agg cag gct tta ttt ttt gcc 67600
Lys Gln Asn Leu Asn Ala Thr Glu Arg Gln Ala Leu Phe Phe Ala
11740 11745 11750
tca atg att tta tta aat ttc cgc gaa gga cta gaa aat tca tct 67645
Ser Met Ile Leu Leu Asn Phe Arg Glu Gly Leu Glu Asn Ser Ser
11755 11760 11765
cgg gta tta gac ggt cgc aca act ttg ctt tta atg aca tcc atg 67690
Arg Val Leu Asp Gly Arg Thr Thr Leu Leu Leu Met Thr Ser Met
11770 11775 11780
tgt acg gca gct cac gcc acg caa gca gca ctt aac ata caa gaa 67735
Cys Thr Ala Ala His Ala Thr Gln Ala Ala Leu Asn Ile Gln Glu
11785 11790 11795
ggc ctg gca tac tta aat cct tca aaa cac atg ttt aca ata cca 67780
Gly Leu Ala Tyr Leu Asn Pro Ser Lys His Met Phe ThrIle Pro
11800 11805 11810
aac gta tac agt cct tgt atg ggt tcc ctt cgt aca gac ctc acg 67825
Asn Val Tyr Ser Pro Cys Met Gly Ser Leu Arg Thr Asp Leu Thr
11815 11820 11825
gaa gag att cat gtt atg aat ctc ctg tcg gca ata cca aca cgc 67870
Glu Glu Ile His Val Met Asn Leu Leu Ser Ala Ile Pro Thr Arg
11830 11835 11840
cca gga ctt aac gag gta ttg cat acc caa cta gac gaa tct gaa 67915
Pro Gly Leu Asn Glu Val Leu His Thr Gln Leu Asp Glu Ser Glu
11845 11850 11855
ata ttc gac gcg gca ttt aaa acc atg atg att ttt acc aca tgg 67960
Ile Phe Asp Ala Ala Phe Lys Thr Met Met Ile Phe Thr Thr Trp
11860 11865 11870
act gcc aaa gat ttg cat ata ctc cac acc cat gta cca gaa gta 68005
Thr Ala Lys Asp Leu His Ile Leu His Thr His Val Pro Glu Val
11875 11880 11885
ttt acg tgt caa gat gca gcc gcg cgt aac gga gaa tat gtg ctc 68050
Phe Thr Cys Gln Asp Ala Ala Ala Arg Asn Gly Glu Tyr Val Leu
11890 11895 11900
att ctt cca gct gtc cag gga cac agt tat gtg att aca cga aac 68095
Ile Leu Pro Ala Val Gln Gly His Ser Tyr Val Ile Thr Arg Asn
11905 11910 11915
aaa cct caa agg ggt ttg gta tat tcc ctg gca gat gtg gat gta 68140
Lys Pro Gln Arg Gly Leu Val Tyr Ser Leu Ala Asp Val Asp Val
11920 11925 11930
tat aac ccc ata tcc gtt gtt tat tta agc agg gat act tgc gtg 68185
Tyr Asn Pro Ile Ser Val Val Tyr Leu Ser Arg Asp Thr Cys Val
11935 11940 11945
tct gaa cat ggt gtc ata gag acg gtc gca ctg ccc cat ccg gac 68230
Ser Glu His Gly Val Ile Glu Thr Val Ala Leu Pro His Pro Asp
11950 11955 11960
aat tta aaa gaa tgt ttg tat tgc gga agt gtt ttt ctt agg tat 68275
Asn Leu Lys Glu Cys Leu Tyr Cys Gly Ser Val Phe Leu Arg Tyr
11965 11970 11975
cta acc acg ggg gcg att atg gat ata att att att gac agc aaa 68320
Leu Thr Thr Gly Ala Ile Met Asp Ile Ile Ile Ile Asp Ser Lys
11980 11985 11990
gat aca gaa cga caa cta gcc gct atg gga aac tcc aca att cca 68365
Asp Thr Glu Arg Gln Leu Ala Ala Met Gly Asn Ser Thr Ile Pro
11995 12000 12005
ccc ttc aat cca gac atg cac ggg gat gac tct aag gct gtg ttg 68410
Pro Phe Asn Pro Asp Met His Gly Asp Asp Ser Lys Ala Val Leu
12010 12015 12020
ttg ttt cca aac gga act gtg gta acg ctt cta gga ttc gaa cga 68455
Leu Phe Pro Asn Gly Thr Val Val Thr Leu Leu Gly Phe Glu Arg
12025 12030 12035
cga caa gcc ata cga atg tcg gga caa tac ctt ggg gcc tct tta 68500
Arg Gln Ala Ile Arg Met Ser Gly Gln Tyr Leu Gly Ala Ser Leu
12040 12045 12050
gga ggg gcg ttt ctg gcg gta gtg ggg ttt ggt att atc gga tgg 68545
Gly Gly Ala Phe Leu Ala Val Val Gly Phe Gly Ile Ile Gly Trp
12055 12060 12065
atg tta tgt gga aat tcc cgc ctt cga gaa tat aat aaa ata cct 68590
Met Leu Cys Gly Asn Ser Arg Leu Arg Glu Tyr Asn LysIle Pro
12070 12075 12080
ctg aca taa aaaacatgta taataaaaag tcactataaa cgtattctct 68639
Leu Thr
12085
acaatacttt attcgcgaat aatacacact acctttgggt ttttttcccg tccccaaatg 68699
gtgtttggtg cactctacca aaaaatagag cgcctaaata tgctatataa cgcctcccag 68759
caaaatacgg ttcaaaggca ttacccgata ttgtattgta gtacagggca atgggaattg 68819
atgatcccaa taaacggcat agacgcacag cgccgttata gcaggggtct ccagagtaca 68879
gggtatctaa gtaccgggat atctcatact catgcctttc cgtgacagaa acatcaaccg 68939
gaacagtatc cgataaacca actcctgttt ttgcaaggcg taaaattcgc acaccttcct 68999
tttttgcaag atgtgacgtt tccttgtaac agggaagctg ggggagtggt aagaacaaca 69059
aagtttcagc caacgtgcca ataaagccca cttccctcaa gaggctgttt gctgtatcca 69119
caatggtccg tattaaatct tgagcaactt gatccgtgtc atcatcactg ggtaacgcgt 69179
taacataact acgcgttaaa tcttcaataa cggcataaca attaaacgct tcccaccgag 69239
acagtatata ttgaacaatc acgaaccgtt gacaggacgt cagatcacgt ccgtaagcat 69299
gcccgaaaaa tggaagttcc ccccgttcgc catataccgc aacaactgca gtatatatcg 69359
tctcacgggc ttcattaagt tcatcttcaa gtccaggcca ttttctggct ttaaatataa 69419
cctcgtccgc aaaaaaaacc gcacatgata acgcgcggat acaatgagta gtggctttat 69479
ggcgaggatc ccaaatgtcc attacccggg ggatggtcct aatctgtaca aagttactta 69539
gtgtaatatg atcggacttc ttacgccgtc taggctgttt ctcagaatac ggttcacccg 69599
aaatcggcac atcatctgct tttacgtctt ccgtaaccac atcagcagcg cgccgactaa 69659
caattatact tgttttttca tcgtcgttac ttccgttaag cgcgtctcgt atctcgggcg 69719
tcccgtcgaa taatccactc actagctcct gcaaactttc tggtaactcc aacatacgca 69779
tatacaccaa tgaaaaactg gcttcgtttg gtacgtacat aaagccattt gtggtattaa 69839
tggcggtggg tgttggaaac aattttagct tattctcgcg cgtaacatct acccccgcca 69899
ccaatgttaa atgcgtcacg gggagggaca cgagataatc tgcgagcgta gggtcctcca 69959
cttcaacatc aaatgttccg caaaggtcgc gatccaccgc ccccgatccc gctgcaagta 70019
aggccactcg atccaaaaac acgcagttat tattggatga taccgcccat gtcttcccgg 70079
tgcgattgag ctcacttcga acgtaactgg caacagatct gtcaccgggt ccgaccccgc 70139
gaacaacatg tccaaatttt gcgatctcgc ctccatgttt gcggggtatg gaaattaagc 70199
atcccccgca tataaaatac gccctggtag cacgctcgtt aaaataaaac gttacgccgt 70259
tataagatac ggttgaatga tatggaaatt ccatattaaa gcgtttatcg gaacattaac 70319
ctcgaacttg ccgtcccgtg atcgtgtgat cgccaacctt aggtccacac cgaatatgag 70379
aaatatataa ctacacgcaa acattcaaaa caccgtggta tcattaacgt catatgaaaa 70439
gatccaatca atccaatcaa ccacacctcc taccgtttag cacgtcagct atgtgacatg 70499
ctccaaacat acgtaaacat ttagagaggg tgttataaca gtctgtcagg cggggtatat 70559
tctacataat acaaggatcg gctttaactt tgtcaacatt tttactttgg actataaact 70619
gcgactgaac gtt atg aac cca ccc caa gcc cgc gtc tcg gaa cag 70665
Met Asn Pro Pro Gln Ala Arg Val Ser Glu Gln
12090 12095
aca aag gac ttg ctt agc gtt atg gtt aac cag cac ccc gaa gag 70710
Thr Lys Asp Leu Leu Ser Val Met Val Asn Gln His Pro Glu Glu
12100 12105 12110
gac gca aaa gtg tgt aaa tcc agt gat aat tca ccg ctt tat aac 70755
Asp Ala Lys Val Cys Lys Ser Ser Asp Asn Ser Pro Leu Tyr Asn
12115 12120 12125
acc atg gtt atg tta tcg tat ggg ggt gat acg gac tta cta tta 70800
Thr Met Val Met Leu Ser Tyr Gly Gly Asp Thr Asp Leu Leu Leu
12130 12135 12140
agc tct gca tgt acc cgc aca tct acc gta aac agg tcg gcg ttt 70845
Ser Ser Ala Cys Thr Arg Thr Ser Thr Val Asn Arg Ser Ala Phe
12145 12150 12155
acg caa cac tcc gtg ttt tat att ata tcc acg gtg ttg att caa 70890
Thr Gln His Ser Val Phe Tyr Ile Ile Ser Thr Val Leu Ile Gln
12160 12165 12170
cca ata tgt tgt atc ttc ttt ttt ttt tac tat aaa gcg aca cgc 70935
Pro Ile Cys Cys Ile Phe Phe Phe Phe Tyr Tyr Lys Ala Thr Arg
12175 12180 12185
tgt atg ctc tta ttc aca gcc ggg tta ctt ctg acg att cta cat 70980
Cys Met Leu Leu Phe Thr Ala Gly Leu Leu Leu Thr Ile Leu His
12190 12195 12200
cac ttt cga ctt att att atg tta ttg tgt gtc tac aga aat ata 71025
His Phe Arg Leu Ile Ile Met Leu Leu Cys Val Tyr Arg Asn Ile
12205 12210 12215
cga tca gac ctg cta ccc tta tct aca tcc cag caa ctg ctg ctt 71070
Arg Ser Asp Leu Leu Pro Leu Ser Thr Ser Gln Gln Leu Leu Leu
12220 12225 12230
gga att att gtt gtg act cga aca atg cta ttt tgt att acg gcg 71115
Gly Ile Ile Val Val Thr Arg Thr Met Leu Phe Cys Ile Thr Ala
12235 12240 12245
tat tat act ctt ttt ata gac acc cgg gtg ttc ttt ttg att acc 71160
Tyr Tyr Thr Leu Phe Ile Asp Thr Arg Val Phe Phe Leu Ile Thr
12250 12255 12260
gga cac ttg caa agt gag gtt att ttt cca gat agc gtt tca aaa 71205
Gly His Leu Gln Ser Glu Val Ile Phe Pro Asp Ser Val Ser Lys
12265 12270 12275
ata ctt cct gtg tcg tgg ggt cca agt cca gcc gtg tta ctg gta 71250
Ile Leu Pro Val Ser Trp Gly Pro Ser Pro Ala Val Leu Leu Val
12280 12285 12290
atg gcg gca gtt att tac gct atg gac tgt ttg gtg gac acg gta 71295
Met Ala Ala Val Ile Tyr Ala Met Asp Cys Leu Val Asp Thr Val
12295 12300 12305
tcc ttt att ggg cca agg gtg tgg gtc cgt gtt atg tta aaa aca 71340
Ser Phe Ile Gly Pro Arg Val Trp Val Arg Val Met Leu Lys Thr
12310 12315 12320
tct att tcg ttt tag tccatttcaa taaatgtact ataattgttc agtctaaaaa 71395
Ser Ile Ser Phe
12325
taatgttggg tatttataat taccgccccc gtgttacttg gaaacaccca tacatatgtt 71455
ccactctaca tcaaacttct cgcagttttc ttgttcccgc acacgtttac acgtccggat 71515
tcaagtcgca acgctgctga caaa atg aca acg gtt tca tgt ccc gct aac 71566
Met Thr Thr Val Ser Cys Pro Ala Asn
12330
gtg att act aca acg gaa tct gat cgt att gct ggg tta ttt aac 71611
Val Ile Thr Thr Thr Glu Ser Asp Arg Ile Ala Gly Leu Phe Asn
12335 12340 12345
atc cca gcg ggg atc att cca act gga aat gtg ctg tca acc ata 71656
Ile Pro Ala Gly Ile Ile Pro Thr Gly Asn Val Leu Ser Thr Ile
12350 12355 12360
gag gtg tgt gca cac cgt tgc att ttt gat ttt ttt aaa caa ata 71701
Glu Val Cys Ala His Arg Cys Ile Phe Asp Phe Phe Lys Gln Ile
12365 12370 12375
cga tca gat gat aac agc ctt tac tcg gct caa ttc gat att ctt 71746
Arg Ser Asp Asp Asn Ser Leu Tyr Ser Ala Gln Phe Asp Ile Leu
12380 12385 12390
ttg ggg aca tac tgc aat aca tta aac ttt gtg cgt ttt cta gaa 71791
Leu Gly Thr Tyr Cys Asn Thr Leu Asn Phe Val Arg Phe Leu Glu
12395 12400 12405
ctt gga ctg tct gtc gct tgc atc tgt act aaa ttt ccg gag ctg 71836
Leu Gly Leu Ser Val Ala Cys Ile Cys Thr Lys Phe Pro Glu Leu
12410 12415 12420
gct tac gtg cga gat ggc gtt att caa ttt gag gta caa caa ccc 71881
Ala Tyr Val Arg Asp Gly Val Ile Gln Phe Glu Val Gln Gln Pro
12425 12430 12435
atg ata gca cgt gat ggc cca cat ccc gtc gat cag cct gtt cat 71926
Met Ile Ala Arg Asp Gly Pro His Pro Val Asp Gln Pro Val His
12440 12445 12450
aat tat atg gtt aag cgg ata cac aag cgt tcg tta agc gct gcg 71971
Asn Tyr Met Val Lys Arg Ile His Lys Arg Ser Leu Ser Ala Ala
12455 12460 12465
ttt gca att gca tcg gaa gcg ttg agt ttg tta agt aac aca tat 72016
Phe Ala Ile Ala Ser Glu Ala Leu Ser Leu Leu Ser Asn Thr Tyr
12470 12475 12480
gtc gat ggg aca gag att gac tca tcg tta cgt ata aga gct atc 72061
Val Asp Gly Thr Glu Ile Asp Ser Ser Leu Arg Ile Arg Ala Ile
12485 12490 12495
caa cag atg gct cgt aat tta cgc acc gtt ttg gac tca ttt gaa 72106
Gln Gln Met Ala Arg Asn Leu Arg Thr Val Leu Asp Ser Phe Glu
12500 12505 12510
cga ggc act gcc gat caa ctt ctt ggt gtt cta ttg gag aaa gcc 72151
Arg Gly Thr Ala Asp Gln Leu Leu Gly Val Leu Leu Glu Lys Ala
12515 12520 12525
cca ccg cta tcg ctg ctt tca cca att aat aaa ttc caa ccc gag 72196
Pro Pro Leu Ser Leu Leu Ser Pro Ile Asn Lys Phe Gln Pro Glu
12530 12535 12540
gga cat cta aat cgt gtt gca cgc gcg gcc cta ctt tcg gac ctc 72241
Gly His Leu Asn Arg Val Ala Arg Ala Ala Leu Leu Ser Asp Leu
12545 12550 12555
aaa cgt aga gtc tgt gcg gat atg ttt ttt atg acc cga cac gcc 72286
Lys Arg Arg Val Cys Ala Asp Met Phe Phe Met Thr Arg His Ala
12560 12565 12570
agg gaa cct agg ctg atc tct gcg tat ctg tcg gat atg gtt tcg 72331
Arg Glu Pro Arg Leu Ile Ser Ala Tyr Leu Ser Asp Met Val Ser
12575 12580 12585
tgc acc caa cca tcg gtg atg gta tca cga ata act cat aca aac 72376
Cys Thr Gln Pro Ser Val Met Val Ser Arg Ile Thr His Thr Asn
12590 12595 12600
act cgc gga cgg cag gtt gac ggt gtg ttg gta aca aca gca acc 72421
Thr Arg Gly Arg Gln Val Asp Gly Val Leu Val Thr Thr Ala Thr
12605 12610 12615
tta aaa cgg caa cta tta cag gga att tta caa att gac gac acc 72466
Leu Lys Arg Gln Leu Leu Gln Gly Ile Leu Gln Ile Asp Asp Thr
12620 12625 12630
gcc gct gac gta cca gta aca tat ggc gaa atg gtt cta cag ggg 72511
Ala Ala Asp Val Pro Val Thr Tyr Gly Glu Met Val Leu Gln Gly
12635 12640 12645
aca aac ttg gta acc gcc ctt gtg atg gga aag gcc gtc cgc gga 72556
Thr Asn Leu Val Thr Ala Leu Val Met Gly Lys Ala Val Arg Gly
12650 12655 12660
atg gat gat gta gcc cgc cat ctc ctt gat ata acc gac cct aac 72601
Met Asp Asp Val Ala Arg His Leu Leu AspIle Thr Asp Pro Asn
12665 12670 12675
acg tta aac ata ccg tct ata ccc cca caa tcc aac tcc gat tca 72646
Thr Leu Asn Ile Pro Ser Ile Pro Pro Gln Ser Asn Ser Asp Ser
12680 12685 12690
acg aca gct ggg ctt ccg gtt aac gcc cgt gtt cct gcg gat tta 72691
Thr Thr Ala Gly Leu Pro Val Asn Ala Arg Val Pro Ala Asp Leu
12695 12700 12705
gtg att gtt ggg gat aaa ctt gta ttc tta gaa gca tta gaa cgg 72736
Val Ile Val Gly Asp Lys Leu Val Phe Leu Glu Ala Leu Glu Arg
12710 12715 12720
cgg gtc tac caa gct acg cgc gtt gcc tac cct ctt att gga aat 72781
Arg Val Tyr Gln Ala Thr Arg Val Ala Tyr Pro Leu Ile Gly Asn
12725 12730 12735
ata gat att acg ttt atc atg cca atg gga gtg ttt cag gca aac 72826
Ile Asp Ile Thr Phe Ile Met Pro Met Gly Val Phe Gln Ala Asn
12740 12745 12750
tcc atg gac aga tat aca cga cac gcc ggc gat ttt tca act gta 72871
Ser Met Asp Arg Tyr Thr Arg His Ala Gly Asp Phe Ser Thr Val
12755 12760 12765
tcc gaa cag gat cca cgt caa ttt cca ccc caa ggg att ttt ttt 72916
Ser Glu Gln Asp Pro Arg Gln Phe Pro Pro Gln Gly Ile Phe Phe
12770 12775 12780
tat aat aaa gat ggg ata tta aca cag ttg act ctt cgt gat gca 72961
Tyr Asn Lys Asp Gly Ile Leu Thr Gln Leu Thr Leu Arg Asp Ala
12785 12790 12795
atg ggt acc atc tgc cac agt tca ttg ctt gat gtc gag gcc aca 73006
Met Gly Thr Ile Cys His Ser Ser Leu Leu Asp Val Glu Ala Thr
12800 12805 12810
ctt gtt gcc ctc cgc caa caa cat tta gat cgt cag tgt tat ttt 73051
Leu Val Ala Leu Arg Gln Gln His Leu Asp Arg Gln Cys Tyr Phe
12815 12820 12825
ggt gta tac gtg gcc gag ggt aca gag gac aca ttg gat gtt caa 73096
Gly Val Tyr Val Ala Glu Gly Thr Glu Asp Thr Leu Asp Val Gln
12830 12835 12840
atg ggg agg ttt atg gaa acg tgg gca gat atg atg cct cat cac 73141
Met Gly Arg Phe Met Glu Thr Trp Ala Asp Met Met Pro His His
12845 12850 12855
cct cat tgg gta aac gaa cat tta aca att cta cag ttt ata gct 73186
Pro His Trp Val Asn Glu His Leu Thr Ile Leu Gln PheIle Ala
12860 12865 12870
ccg agc aac ccg cgt cta agg ttt gaa tta aac ccc gcc ttt gat 73231
Pro Ser Asn Pro Arg Leu Arg Phe Glu Leu Asn Pro Ala Phe Asp
12875 12880 12885
ttt ttt gtt gca ccg ggg gac gta gac ctt ccc gga ccg cag cgt 73276
Phe Phe Val Ala Pro Gly Asp Val Asp Leu Pro Gly Pro Gln Arg
12890 12895 12900
ccc ccg gaa gcc atg cca acc gtt aac gca aca tta cgg att atc 73321
Pro Pro Glu Ala Met Pro Thr Val Asn Ala Thr Leu Arg Ile Ile
12905 12910 12915
aac gga aac att ccc gtg cct cta tgt ccc att tca ttt cga gac 73366
Asn Gly Asn Ile Pro Val Pro Leu Cys Pro Ile Ser Phe Arg Asp
12920 12925 12930
tgt cgc gga acc caa ctc ggt ttg gga aga cat aca atg acc ccg 73411
Cys Arg Gly Thr Gln Leu Gly Leu Gly Arg His Thr Met Thr Pro
12935 12940 12945
gca acc att aaa gcc gta aag gat aca ttt gaa gac cgc gca tac 73456
Ala Thr Ile Lys Ala Val Lys Asp Thr Phe Glu Asp Arg Ala Tyr
12950 12955 12960
cca act att ttc tac atg cta gag gct gtt att cat gga aac gaa 73501
Pro Thr Ile Phe Tyr Met Leu Glu Ala Val Ile His Gly Asn Glu
12965 12970 12975
aga aac ttc tgt gcg tta ctg cga ctg tta aca cag tgt att cgc 73546
Arg Asn Phe Cys Ala Leu Leu Arg Leu Leu Thr Gln Cys Ile Arg
12980 12985 12990
ggg tat tgg gag caa tcc cac agg gtg gca ttt gta aat aac ttt 73591
Gly Tyr Trp Glu Gln Ser His Arg Val Ala Phe Val Asn Asn Phe
12995 13000 13005
cac atg tta atg tac ata act aca tat ctc gga aac ggt gag ctt 73636
His Met Leu Met Tyr Ile Thr Thr Tyr Leu Gly Asn Gly Glu Leu
13010 13015 13020
ccc gaa gtc tgt att aat ata tat cgg gat tta ctg cag cat gta 73681
Pro Glu Val Cys Ile Asn Ile Tyr Arg Asp Leu Leu Gln His Val
13025 13030 13035
aga gca tta cgc caa act ata acc gat ttt aca ata caa gga gag 73726
Arg Ala Leu Arg Gln Thr Ile Thr Asp Phe Thr Ile Gln Gly Glu
13040 13045 13050
ggc cat aac ggc gag acc tcg gaa gcg cta aat aac atc ctt acg 73771
Gly His Asn Gly Glu Thr Ser Glu Ala Leu Asn Asn Ile Leu Thr
13055 13060 13065
gat gac acg ttt att gca cct att cta tgg gat tgt gat gcg tta 73816
Asp Asp Thr Phe Ile Ala Pro Ile Leu Trp Asp Cys Asp Ala Leu
13070 13075 13080
ata tac cgt gat gaa gcc gcc cga gac cga ctc ccc gca att cgt 73861
Ile Tyr Arg Asp Glu Ala Ala Arg Asp Arg Leu Pro Ala Ile Arg
13085 13090 13095
gta agc ggg cga aac gga tac caa gcc ctt cac ttt gtg gat atg 73906
Val Ser Gly Arg Asn Gly Tyr Gln Ala Leu His Phe Val Asp Met
13100 13105 13110
gcc ggg cat aac ttc caa cga cgc gat aat gtg tta atc cac ggg 73951
Ala Gly His Asn Phe Gln Arg Arg Asp Asn Val Leu Ile His Gly
13115 13120 13125
aga ccc gtt cgg gga gac acg ggt cag ggt att ccc att act cca 73996
Arg Pro Val Arg Gly Asp Thr Gly Gln Gly Ile Pro Ile Thr Pro
13130 13135 13140
cac cat gac cgt gaa tgg ggt att ctc tcc aag att tac tac tat 74041
His His Asp Arg Glu Trp Gly Ile Leu Ser Lys Ile Tyr Tyr Tyr
13145 13150 13155
att gtc att cct gca ttt tcc cgc ggt tcc tgt tgt aca atg ggc 74086
Ile Val Ile Pro Ala Phe Ser Arg Gly Ser Cys Cys Thr Met Gly
13160 13165 13170
gtg cgt tat gat cgc cta tac cct gcg tta cag gca gtt atc gtt 74131
Val Arg Tyr Asp Arg Leu Tyr Pro Ala Leu Gln Ala ValIle Val
13175 13180 13185
ccg gaa att ccc gct gat gaa gaa gcc cca act acc cca gaa gat 74176
Pro Glu Ile Pro Ala Asp Glu Glu Ala Pro Thr Thr Pro Glu Asp
13190 13195 13200
cca aga cac cct ctt cac gca cac caa ctc gtt ccg aac tct ctt 74221
Pro Arg His Pro Leu His Ala His Gln Leu Val Pro Asn Ser Leu
13205 13210 13215
aac gtt tac ttc cat aat gca cac cta acc gtt gat ggt gat gca 74266
Asn Val Tyr Phe His Asn Ala His Leu Thr Val Asp Gly Asp Ala
13220 13225 13230
ttg ctc aca cta caa gag tta atg gga gat atg gct gaa cga acg 74311
Leu Leu Thr Leu Gln Glu Leu Met Gly Asp Met Ala Glu Arg Thr
13235 13240 13245
acg gcc att tta gta tca agc gcc ccc gat gcg gga gcc gcc acg 74356
Thr Ala Ile Leu Val Ser Ser Ala Pro Asp Ala Gly Ala Ala Thr
13250 13255 13260
gca aca acc aga aat atg aga ata tat gac gga gcg ctt tac cat 74401
Ala Thr Thr Arg Asn Met Arg Ile Tyr Asp Gly Ala Leu Tyr His
13265 13270 13275
ggc ctt att atg atg gca tat cag gcg tac gat gaa acc att gca 74446
Gly Leu Ile Met Met Ala Tyr Gln Ala Tyr Asp Glu Thr Ile Ala
13280 13285 13290
acg ggt act ttt ttt tat ccc gtt ccg gtc aac cct ctg ttt gca 74491
Thr Gly Thr Phe Phe Tyr Pro Val Pro Val Asn Pro Leu Phe Ala
13295 13300 13305
tgt ccg gaa cat ttg gca tca ttg cgt gga atg aca aat gct agg 74536
Cys Pro Glu His Leu Ala Ser Leu Arg Gly Met Thr Asn Ala Arg
13310 13315 13320
cgg gtt ttg gca aaa atg gta cca cca atc cct cct ttt ctg gga 74581
Arg Val Leu Ala Lys Met Val Pro Pro Ile Pro Pro Phe Leu Gly
13325 13330 13335
gcc aac cac cac gca act ata cgc caa ccc gtt gcc tac cat gta 74626
Ala Asn His His Ala Thr Ile Arg Gln Pro Val Ala Tyr His Val
13340 13345 13350
acg cat agt aag tcg gat ttt aat act ctt aca tat tct ctt ctt 74671
Thr His Ser Lys Ser Asp Phe Asn Thr Leu Thr Tyr Ser Leu Leu
13355 13360 13365
gga ggg tat ttt aag ttt aca cca ata tct ctt aca cat caa cta 74716
Gly Gly Tyr Phe Lys Phe Thr Pro Ile Ser Leu Thr His Gln Leu
13370 13375 13380
cga acg gga ttt cac ccc ggg att gcc ttt acc gta gtg cgc cag 74761
Arg Thr Gly Phe His Pro Gly Ile Ala Phe Thr Val Val Arg Gln
13385 13390 13395
gat cgc ttt gcc aca gag caa ctt tta tat gcc gag cgt gct tct 74806
Asp Arg Phe Ala Thr Glu Gln Leu Leu Tyr Ala Glu Arg Ala Ser
13400 13405 13410
gaa tcg tac ttt gtc gga caa atc caa gta cac cat cat gat gct 74851
Glu Ser Tyr Phe Val Gly Gln Ile Gln Val His His His Asp Ala
13415 13420 13425
att ggg ggg gta aac ttt acc cta acc caa ccc aga gct cac gtg 74896
Ile Gly Gly Val Asn Phe Thr Leu Thr Gln Pro Arg Ala His Val
13430 13435 13440
gac ctg gga gtc ggg tat aca gct gta tgt gcc aca gca gcc ctg 74941
Asp Leu Gly Val Gly Tyr Thr Ala Val Cys Ala Thr Ala Ala Leu
13445 13450 13455
cga tgc cct ctc acg gat atg ggc aat act gcc caa aat ctt ttt 74986
Arg Cys Pro Leu Thr Asp Met Gly Asn Thr Ala Gln Asn Leu Phe
13460 13465 13470
ttt tca cga gga gga gtg cca atg tta cat gat aac gtt acc gaa 75031
Phe Ser Arg Gly Gly Val Pro Met Leu His Asp Asn Val Thr Glu
13475 13480 13485
tcg ttg cgt cgt ata aca gca tcg ggg ggt cgc tta aat ccc acc 75076
Ser Leu Arg Arg Ile Thr Ala Ser Gly Gly Arg Leu Asn Pro Thr
13490 13495 13500
gaa ccc cta ccc atc ttc ggc gga cta cgt cct gct aca tcg gca 75121
Glu Pro Leu Pro Ile Phe Gly Gly Leu Arg Pro Ala Thr Ser Ala
13505 13510 13515
gga att gca cga ggg caa gcc tct gtg tgt gag ttt gtg gcc atg 75166
Gly Ile Ala Arg Gly Gln Ala Ser Val Cys Glu Phe Val Ala Met
13520 13525 13530
ccg gtg tcc act gac cta caa tat ttt aga act gca tgc aat cct 75211
Pro Val Ser Thr Asp Leu Gln Tyr Phe Arg Thr Ala Cys Asn Pro
13535 13540 13545
aga ggt cga gca tct gga atg tta tat atg ggt gac cgt gac gcc 75256
Arg Gly Arg Ala Ser Gly Met Leu Tyr Met Gly Asp Arg Asp Ala
13550 13555 13560
gac ata gag gct ata atg ttt gat cac aca caa tcg gat gtt gct 75301
Asp Ile Glu Ala Ile Met Phe Asp His Thr Gln Ser Asp Val Ala
13565 13570 13575
tat aca gat cga gca act ctt aac cca tgg gca tca caa aaa cat 75346
Tyr Thr Asp Arg Ala Thr Leu Asn Pro Trp Ala Ser Gln Lys His
13580 13585 13590
tca tac ggt gac agg cta tac aac gga aca tac aac ctt aca ggc 75391
Ser Tyr Gly Asp Arg Leu Tyr Asn Gly Thr Tyr Asn Leu Thr Gly
13595 13600 13605
gct tct cct atc tac agc cca tgc ttt aag ttt ttt aca cca gcg 75436
Ala Ser Pro Ile Tyr Ser Pro Cys Phe Lys Phe Phe Thr Pro Ala
13610 13615 13620
gag gtt aac act aat tgt aat aca ctg gat cgg ctt cta atg gag 75481
Glu Val Asn Thr Asn Cys Asn Thr Leu Asp Arg Leu Leu Met Glu
13625 13630 13635
gca aag gct gtg gcg tcg caa agc tcc acc gac act gaa tat caa 75526
Ala Lys Ala Val Ala Ser Gln Ser Ser Thr Asp Thr Glu Tyr Gln
13640 13645 13650
ttt aaa cgc cct ccc ggt tct acc gaa atg aca cag gat ccg tgt 75571
Phe Lys Arg Pro Pro Gly Ser Thr Glu Met Thr Gln Asp Pro Cys
13655 13660 13665
ggc ctt ttt caa gaa gca tat cca cca cta tgc tca agc gat gcg 75616
Gly Leu Phe Gln Glu Ala Tyr Pro Pro Leu Cys Ser Ser Asp Ala
13670 13675 13680
gcc atg tta cga acg gct cac gcg gga gaa acc ggg gca gat gaa 75661
Ala Met Leu Arg Thr Ala His Ala Gly Glu Thr Gly Ala Asp Glu
13685 13690 13695
gtt cac tta gcc caa tat ctg att cga gac gcg tcg ccc ctt agg 75706
Val His Leu Ala Gln Tyr Leu Ile Arg Asp Ala Ser Pro Leu Arg
13700 13705 13710
gga tgt ctt cct ctt ccg cga taa tttcaccacg cccacatacc 75750
Gly Cys Leu Pro Leu Pro Arg
13715 13720
cactcccaat aaaagccctg tagagcgcat tggcatctta cttgagattt ggatacgctc 75810
ggccgacttg gtctgtttca cgcttcctta aacaac atg gct atg cca ttt gag 75864
Met Ala Met Pro Phe Glu
13725
ata gag gta ttg tta cca gga gaa cta tcc ccg gcg gaa aca tct 75909
Ile Glu Val Leu Leu Pro Gly Glu Leu Ser Pro Ala Glu Thr Ser
13730 13735 13740
gca tta cag aaa tgt gag gga aaa att att acc ttc tca acc ctg 75954
Ala Leu Gln Lys Cys Glu Gly Lys Ile Ile Thr Phe Ser Thr Leu
13745 13750 13755
cgt cat cga gct tca ctg gtg gat ata gcg ctg tcg tca tat tac 75999
Arg His Arg Ala Ser Leu Val Asp Ile Ala Leu Ser Ser Tyr Tyr
13760 13765 13770
att aac ggt gct cca cca gac acg ctc tcg ctg tta gag gca tac 76044
Ile Asn Gly Ala Pro Pro Asp Thr Leu Ser Leu Leu Glu Ala Tyr
13775 13780 13785
cga atg cga ttc gcg gca gtt ata aca cgg gtc atc ccg gga aag 76089
Arg Met Arg Phe Ala Ala Val Ile Thr Arg Val Ile Pro Gly Lys
13790 13795 13800
ttg ttg gcg cat gcc att ggc gtg ggt act cct aca ccc ggg ttg 76134
Leu Leu Ala His Ala Ile Gly Val Gly Thr Pro Thr Pro Gly Leu
13805 13810 13815
ttt att caa aat aca tcc ccc gtt gat ctt tgt aat ggc gat tac 76179
Phe Ile Gln Asn Thr Ser Pro Val Asp Leu Cys Asn Gly Asp Tyr
13820 13825 13830
atc tgc tta ctt cct ccg gtt ttc ggg tcc gca gac tca att cgc 76224
Ile Cys Leu Leu Pro Pro Val Phe Gly Ser Ala Asp Ser Ile Arg
13835 13840 13845
ttg gac tct gta gga ctg gaa att gtt ttc cct tta acc atc ccc 76269
Leu Asp Ser Val Gly Leu Glu Ile Val Phe Pro Leu Thr Ile Pro
13850 13855 13860
cag acc tta atg cga gaa atc atc gcc aaa gtg gtt gca cgg gcc 76314
Gln Thr Leu Met Arg Glu Ile Ile Ala Lys Val Val Ala Arg Ala
13865 13870 13875
gtt gag cgc acg gcc gcg ggt gct caa att tta ccc cac gaa gtt 76359
Val Glu Arg Thr Ala Ala Gly Ala Gln Ile Leu Pro His Glu Val
13880 13885 13890
cta cga ggc gcg gat gtc att tgt tac aat gga agg cgt tat gaa 76404
Leu Arg Gly Ala Asp Val Ile Cys Tyr Asn Gly Arg Arg Tyr Glu
13895 13900 13905
ctc gaa aca aat tta caa cat cgg gac gga tcg gat gcg gct att 76449
Leu Glu Thr Asn Leu Gln His Arg Asp Gly Ser Asp Ala Ala Ile
13910 13915 13920
cgc aca ttg gtt tta aat cta atg ttt tcc ata aac gag gga tgt 76494
Arg Thr Leu Val Leu Asn Leu Met Phe Ser Ile Asn Glu Gly Cys
13925 13930 13935
ctg ctt tta ttg gcg ctg att cca act ttg tta gtc caa gga gca 76539
Leu Leu Leu Leu Ala Leu Ile Pro Thr Leu Leu Val Gln Gly Ala
13940 13945 13950
cac gac ggt tat gta aat tta ttg ata caa acg gcc aat tgc gtt 76584
His Asp Gly Tyr Val Asn Leu Leu Ile Gln Thr Ala Asn Cys Val
13955 13960 13965
aga gaa acc ggc cag tta att aat ata ccg cca atg ccg cgg att 76629
Arg Glu Thr Gly Gln Leu Ile Asn Ile Pro Pro Met Pro Arg Ile
13970 13975 13980
caa gac ggc cat cgc cga ttt ccc ata tat gaa act att tca tct 76674
Gln Asp Gly His Arg Arg Phe Pro Ile Tyr Glu Thr Ile Ser Ser
13985 13990 13995
tgg ata tca aca tca tct aga ctg ggg gat acc ttg gga act cgc 76719
Trp Ile Ser Thr Ser Ser Arg Leu Gly Asp Thr Leu Gly Thr Arg
14000 14005 14010
gca att tta cgc gtc tgt gtg ttt gat gga ccc tct act gtt cat 76764
Ala Ile Leu Arg Val Cys Val Phe Asp Gly Pro Ser Thr Val His
14015 14020 14025
ccg gga gac cgc acg gcc gtg att caa gtg taa acaggtgtta 76807
Pro Gly Asp Arg Thr Ala Val Ile Gln Val
14030 14035
ataaaaacac aaccagtcta gttacatttc acgcgtcttg tttttattta ataggcataa 76867
acacggaatc cggtatacat gaactgccaa tatacacgga cataattaat gcaaccatca 76927
gatcatctga cattgttccc gtggtacctt tacccgtgta agtttttgtg tctagattac 76987
ccataccgcc tttaattacc tctgtcaggt tatccaactg tttacataga tactccacgg 77047
ggtctacacc taactttact gttagggata caagctcctg tgaggctatt atatttccgg 77107
agttaaatcg tttaacaaaa tagtctacgg ccggcgtttt ttgtttttgt aataaaaaaa 77167
aagggtacgc cacgctacat ccgggaggta tggaatgata aaacagtaac actggagcgg 77227
aagatagcac gtttcccttt tcgaggacag caaactgttg tgctatagcc aacgatatgg 77287
caactgcaga atcctggctg ctgtttccct ctatagaaac gtgtacgttt gtaaatgtat 77347
tggggtgtaa agcgagtatg tggcctaagc attgagtaac gcaacgccct atctcactgg 77407
aagacgtgcc agttaaagct ctaagaaaaa agtgctccaa tccaaatata atccaatccg 77467
acttataacg accaacaatc gctacaccag taccagacgc tcgtgtattt gaggtaaatg 77527
cagggtctac gtaaacgtac aacactgacg ataatatagc acaattcgca acggttgacg 77587
gccgatataa aataaacctc tcacgggcag tttttgtaaa taatggccgg tcaaacccca 77647
cacccccaga attctgttta cgcccaccta caatttcctg cacgaaggag tcggccataa 77707
ataaatctgc agtgcgccgc atggctccat ccattgtgat gaaaaccggc ttatttaata 77767
cataacacga acaagctgtg acatcgctat gtgctaaaac acgcggcatg tgatcgtcgc 77827
atacatatgt aacaacgttt aacaactgat ccgacgatcc acgtaagtta tacaaaaaac 77887
ttgtacttgc ttttccggta tttgttgatg aaacaaaaat aattttacaa ttggtttgat 77947
ttaaaaatcc gactatagtt tgtacagcat caggtcgaat aaaattagct tcatccacaa 78007
acagaagatt aaaatcttga cctcggatac cctggaacga tagaaagata tatagttacc 78067
ccaccaaagt ttaaatgtat ccttaaatac cacgtacgta aaaaatgttt gaatacgtac 78127
atatttcttt tttttttcca gtacaaccat atccggtgta ta atg gaa gcc cat 78181
Met Glu Ala His
14040
ttg gca aat gaa acc aaa cat gca ctt tgg cat aat gat cac aca 78226
Leu Ala Asn Glu Thr Lys His Ala Leu Trp His Asn Asp His Thr
14045 14050 14055
aaa gga tta cta cac gtt gtg ata cct aac gcg ggg ctt att gcg 78271
Lys Gly Leu Leu His Val Val Ile Pro Asn Ala Gly Leu Ile Ala
14060 14065 14070
gcc gga ata gat ccc gca tta ctg att tta aag aaa ccc gga caa 78316
Ala Gly Ile Asp Pro Ala Leu Leu Ile Leu Lys Lys Pro Gly Gln
14075 14080 14085
cgc ttc aag gtt gaa gta caa aca aga tat cat gct aca ggt caa 78361
Arg Phe Lys Val Glu Val Gln Thr Arg Tyr His Ala Thr Gly Gln
14090 14095 14100
tgc gaa ccg tgg tgt caa gtt ttc gcc gcg tac att ccc gat aac 78406
Cys Glu Pro Trp Cys Gln Val Phe Ala Ala Tyr Ile Pro Asp Asn
14105 14110 14115
gcc tta aca aat ctc tta ata cca aaa acg gaa cca ttt gtt tca 78451
Ala Leu Thr Asn Leu Leu Ile Pro Lys Thr Glu Pro Phe Val Ser
14120 14125 14130
cac gtt ttt tcg gcc acg cat aat tca ggg gga ttg att tta tca 78496
His Val Phe Ser Ala Thr His Asn Ser Gly Gly Leu Ile Leu Ser
14135 14140 14145
ttg cct gtt tat ctt agc ccc ggt tta ttc ttt gat gca ttt aac 78541
Leu Pro Val Tyr Leu Ser Pro Gly Leu Phe Phe Asp Ala Phe Asn
14150 14155 14160
gtt gta gcg ata cga ata aat act gga aac cgc aag cac cgt gat 78586
Val Val Ala Ile Arg Ile Asn Thr Gly Asn Arg Lys His Arg Asp
14165 14170 14175
att tgt att atg tat gca gaa cta atc cca aac gga acg cgt tat 78631
Ile Cys Ile Met Tyr Ala Glu Leu Ile Pro Asn Gly Thr Arg Tyr
14180 14185 14190
ttt gct gat gga caa cgg gta ctt tta tta tgc aaa cag ctg att 78676
Phe Ala Asp Gly Gln Arg Val Leu Leu Leu Cys Lys Gln Leu Ile
14195 14200 14205
gcg tat atc cga tgc acc cct cgt ctt gca tcg tct ata aaa ata 78721
Ala Tyr Ile Arg Cys Thr Pro Arg Leu Ala Ser Ser Ile Lys Ile
14210 14215 14220
tac gca gag cat atg gtg gca gcc atg ggt gaa tca cac acg tca 78766
Tyr Ala Glu His Met Val Ala Ala Met Gly Glu Ser His Thr Ser
14225 14230 14235
aat ggg gac aat att gga ccc gtt tca tcc ata atc gat ctt gat 78811
Asn Gly Asp Asn Ile Gly Pro Val Ser Ser Ile Ile Asp Leu Asp
14240 14245 14250
cga cag tta act tct gga ggt att gat gac tcc cct gct gaa aca 78856
Arg Gln Leu Thr Ser Gly Gly Ile Asp Asp Ser Pro Ala Glu Thr
14255 14260 14265
cgc ata cag gaa aat aat cgg gac gtc ctt gag cta ata aaa cgg 78901
Arg Ile Gln Glu Asn Asn Arg Asp Val Leu Glu LeuIle Lys Arg
14270 14275 14280
gcc gta aac att gtt aac tcc agg cac ccc gtc cga cct tct agt 78946
Ala Val Asn Ile Val Asn Ser Arg His Pro Val Arg Pro Ser Ser
14285 14290 14295
tcc cgc gtt gca tct ggg ttg ctt caa agt gca aag ggc cac gga 78991
Ser Arg Val Ala Ser Gly Leu Leu Gln Ser Ala Lys Gly His Gly
14300 14305 14310
gcg caa act tcc aac aca gat ccg atc aat aac ggt tcc ttt gat 79036
Ala Gln Thr Ser Asn Thr Asp Pro Ile Asn Asn Gly Ser Phe Asp
14315 14320 14325
ggc gtc ctt gag ccg cct gga caa ggg cga ttt acg gga aag aaa 79081
Gly Val Leu Glu Pro Pro Gly Gln Gly Arg Phe Thr Gly Lys Lys
14330 14335 14340
aac aat tcg tcc gcc agc atc cca cct tta caa gac gtt cta ttg 79126
Asn Asn Ser Ser Ala Ser Ile Pro Pro Leu Gln Asp Val Leu Leu
14345 14350 14355
ttt acc cca gct tcg aca gaa ccc caa agt ctt atg gaa tgg ttc 79171
Phe Thr Pro Ala Ser Thr Glu Pro Gln Ser Leu Met Glu Trp Phe
14360 14365 14370
gac atc tgt tat gcc caa tta gtt agc ggg gac act cca gca gat 79216
Asp Ile Cys Tyr Ala Gln Leu Val Ser Gly Asp Thr Pro Ala Asp
14375 14380 14385
ttc tgg aaa cgg cgt ccc cta tca att gta ccg cga cat tac gca 79261
Phe Trp Lys Arg Arg Pro Leu Ser Ile Val Pro Arg His Tyr Ala
14390 14395 14400
gaa tcc ccc agt ccg ttg att gta gta tct tac aac gga tcc tct 79306
Glu Ser Pro Ser Pro Leu Ile Val Val Ser Tyr Asn Gly Ser Ser
14405 14410 14415
gcc tgg gga gga cgt att acc gga agt cca att tta tat cac tct 79351
Ala Trp Gly Gly Arg Ile Thr Gly Ser Pro Ile Leu Tyr His Ser
14420 14425 14430
gca cag gct att att gat gct gcg tgt ata aat gcc cgg gtt gac 79396
Ala Gln Ala Ile Ile Asp Ala Ala Cys Ile Asn Ala Arg Val Asp
14435 14440 14445
aat ccc caa agc cta cat gtg aca gct cgc caa gag cta gtc gcg 79441
Asn Pro Gln Ser Leu His Val Thr Ala Arg Gln Glu Leu Val Ala
14450 14455 14460
cgt tta ccg ttt ttg gct aac gtc cta aat aat caa acc ccc tta 79486
Arg Leu Pro Phe Leu Ala Asn Val Leu Asn Asn Gln Thr Pro Leu
14465 14470 14475
ccc gcc ttt aaa cca ggc gcc gaa atg ttt tta aac cag gta ttt 79531
Pro Ala Phe Lys Pro Gly Ala Glu Met Phe Leu Asn Gln Val Phe
14480 14485 14490
aaa caa gcg tgt gtg aca tcg cta acc caa ggt ctt ata acg gag 79576
Lys Gln Ala Cys Val Thr Ser Leu Thr Gln Gly Leu Ile Thr Glu
14495 14500 14505
tta caa acg aac ccg act cta caa caa ctc atg gaa tat gat att 79621
Leu Gln Thr Asn Pro Thr Leu Gln Gln Leu Met Glu Tyr Asp Ile
14510 14515 14520
gca gat tct tcc caa acg gtt att gat gaa att gta gcc cgc aca 79666
Ala Asp Ser Ser Gln Thr Val Ile Asp Glu Ile Val Ala Arg Thr
14525 14530 14535
cca gac ctg att cag act ata gtt tcg gtg tta acg gaa atg tca 79711
Pro Asp Leu Ile Gln Thr Ile Val Ser Val Leu Thr Glu Met Ser
14540 14545 14550
atg gat gcg ttt tat aac agc tcc ttg atg tat gcg gtt ttg gcg 79756
Met Asp Ala Phe Tyr Asn Ser Ser Leu Met Tyr Ala Val Leu Ala
14555 14560 14565
tat ctg tca tct gta tat aca cga cca caa ggt ggg ggg tat ata 79801
Tyr Leu Ser Ser Val Tyr Thr Arg Pro Gln Gly Gly Gly Tyr Ile
14570 14575 14580
ccc tac ctt cac gct tcc ttc cca tgc tgg tta ggt aat cgt tct 79846
Pro Tyr Leu His Ala Ser Phe Pro Cys Trp Leu Gly Asn Arg Ser
14585 14590 14595
ata tat tta ttt gac tat tat aat tca gga ggg gaa ata ctt aag 79891
Ile Tyr Leu Phe Asp Tyr Tyr Asn Ser Gly Gly Glu Ile Leu Lys
14600 14605 14610
ctt tcc aag gtc ccc gtt ccc gta gcc tta gaa aag gtt ggt att 79936
Leu Ser Lys Val Pro Val Pro Val Ala Leu Glu Lys Val Gly Ile
14615 14620 14625
ggt aat tcc aca caa ctg agg ggt aaa ttt ata cgc agc gcg gat 79981
Gly Asn Ser Thr Gln Leu Arg Gly Lys Phe Ile Arg Ser Ala Asp
14630 14635 14640
att gtt gat att gga att tgt tct aag tat tta ccc ggt caa tgt 80026
Ile Val Asp Ile Gly Ile Cys Ser Lys Tyr Leu Pro Gly Gln Cys
14645 14650 14655
tac gcg tac att tgt cta gga ttt aac cag caa tta caa tcc att 80071
Tyr Ala Tyr Ile Cys Leu Gly Phe Asn Gln Gln Leu Gln Ser Ile
14660 14665 14670
tta gtt tta ccg ggg gga ttt gcg gca tgt ttt tgt att acc gat 80116
Leu Val Leu Pro Gly Gly Phe Ala Ala Cys Phe Cys Ile Thr Asp
14675 14680 14685
acc cta cag gca gca cta cct gca tcg tta atc gga cct att cta 80161
Thr Leu Gln Ala Ala Leu Pro Ala Ser Leu Ile Gly Pro Ile Leu
14690 14695 14700
gac aga ttc tgc ttc tct att ccc aac ccc cat aaa taa 80200
Asp Arg Phe Cys Phe Ser Ile Pro Asn Pro His Lys
14705 14710
attagtgtca ctataaaaac ataacaccag aatctcttca tatgtaattt tacgtcattt 80260
ctcccgtttc caccccctct taaaatataa aataaccggg tgggtggcat taaacccaca 80320
agtacccggg cggcaatccg ctagactgtt tttctgctc atg gaa tta caa cgc 80374
Met Glu Leu Gln Arg
14715
ata ttt ccg ctg tac acc gct acg ggt gca gcg cgc aaa tta acc 80419
Ile Phe Pro Leu Tyr Thr Ala Thr Gly Ala Ala Arg Lys Leu Thr
14720 14725 14730
ccc gag gca gtt cag aga ctc tgc gat gca tta acg ctg gat atg 80464
Pro Glu Ala Val Gln Arg Leu Cys Asp Ala Leu Thr Leu Asp Met
14735 14740 14745
gga tta tgg aag tcc atc ctg acc gat ccc cgg gtg aaa ata atg 80509
Gly Leu Trp Lys Ser Ile Leu Thr Asp Pro Arg Val Lys Ile Met
14750 14755 14760
cga tca act gct ttt ata act tta agg atc gct ccg ttt atc ccc 80554
Arg Ser Thr Ala PheIle Thr Leu Arg Ile Ala Pro PheIle Pro
14765 14770 14775
ctt caa acg gat act act aat att gcc gtt gtt gta gcc aca att 80599
Leu Gln Thr Asp Thr Thr Asn Ile Ala Val Val Val Ala Thr Ile
14780 14785 14790
tac atc acg cgc cca cgt cag atg aac tta cct ccg aag act ttt 80644
Tyr Ile Thr Arg Pro Arg Gln Met Asn Leu Pro Pro Lys Thr Phe
14795 14800 14805
cat gta att gta aat ttt aat tac gag gtc tcg tac gca atg acg 80689
His Val Ile Val Asn Phe Asn Tyr Glu Val Ser Tyr Ala Met Thr
14810 14815 14820
gcg act tta aga att tat ccg gtt gaa aac ata gac cat gtt ttt 80734
Ala Thr Leu Arg Ile Tyr Pro Val Glu Asn Ile Asp His Val Phe
14825 14830 14835
gga gca acg ttt aag aac ccg atc gcg tac ccc ctt cca aca tct 80779
Gly Ala Thr Phe Lys Asn Pro Ile Ala Tyr Pro Leu Pro Thr Ser
14840 14845 14850
att ccg gat cct cga gca gat ccc acc ccc gca gat ctt aca cca 80824
Ile Pro Asp Pro Arg Ala Asp Pro Thr Pro Ala Asp Leu Thr Pro
14855 14860 14865
acg cca aac tta agc aac tac tta caa ccc ccg cgg ctt ccg aaa 80869
Thr Pro Asn Leu Ser Asn Tyr Leu Gln Pro Pro Arg Leu Pro Lys
14870 14875 14880
aat cca tac gca tgt aaa gtt att tct ccg gga gtg tgg tgg tca 80914
Asn Pro Tyr Ala Cys Lys Val Ile Ser Pro Gly Val Trp Trp Ser
14885 14890 14895
gac gaa cga agg cgt tta tat gta ctg gct atg gaa cct aat tta 80959
Asp Glu Arg Arg Arg Leu Tyr Val Leu Ala Met Glu Pro Asn Leu
14900 14905 14910
ata ggg cta tgt ccc gcc gga tgg cat gct cgg ata ctt ggc tct 81004
Ile Gly Leu Cys Pro Ala Gly Trp His Ala Arg Ile Leu Gly Ser
14915 14920 14925
gta tta aat cga ctc ctc agc cat gcg gac gga tgt gat gaa tgt 81049
Val Leu Asn Arg Leu Leu Ser His Ala Asp Gly Cys Asp Glu Cys
14930 14935 14940
aat cat aga gtt cac gtg ggg gca ctg tat gcg tta ccc cat gtc 81094
Asn His Arg Val His Val Gly Ala Leu Tyr Ala Leu Pro His Val
14945 14950 14955
aca aat cat gcg gaa ggt tgt gtg tgt tgg gct ccg tgt atg tgg 81139
Thr Asn His Ala Glu Gly Cys Val Cys Trp Ala Pro Cys Met Trp
14960 14965 14970
aga aag gcc ggt cag cgg gaa tta aaa gtg gag gta gac att ggc 81184
Arg Lys Ala Gly Gln Arg Glu Leu Lys Val Glu Val Asp Ile Gly
14975 14980 14985
gcc acg cag gtt ctt ttt gta gat gtc acc acc tgc att cga att 81229
Ala Thr Gln Val Leu Phe Val Asp Val Thr Thr Cys Ile Arg Ile
14990 14995 15000
acg agt act aaa aat cct cgc att acc gca aat ctt ggc gac gtt 81274
Thr Ser Thr Lys Asn Pro Arg Ile Thr Ala Asn Leu Gly Asp Val
15005 15010 15015
ata gcg gga acc aac gcc agt ggt ctc tct gta cca gta aat tca 81319
Ile Ala Gly Thr Asn Ala Ser Gly Leu Ser Val Pro Val Asn Ser
15020 15025 15030
tct ggg tgg cag ctt tat atg ttt gga gaa aca tta agc cgg gct 81364
Ser Gly Trp Gln Leu Tyr Met Phe Gly Glu Thr Leu Ser Arg Ala
15035 15040 15045
att att aac ggc tgt ggt ctg ctt cag cga att tgc ttc ccc gag 81409
Ile Ile Asn Gly Cys Gly Leu Leu Gln Arg Ile Cys Phe Pro Glu
15050 15055 15060
aca caa aga tta tcg ggt gaa ccg gaa cct aca acc acc tag 81451
Thr Gln Arg Leu Ser Gly Glu Pro Glu Pro Thr Thr Thr
15065 15070 15075
tataccttaa ctcaaccgcc gttgtggaaa ggtatatgtc aacatttaca gtaatatatt 81511
aaaggttaaa tttataaaac actcacgttt gtgttgtgac ttgacgcgaa caccgctgtg 81571
ctgtaagacc cgtcggtaaa tgaaaacgta atagattcgc cttttacatg atccacgtaa 81631
tttgccccaa accactgttc caggcgagac ttgataccct caaacacggg ttccgttgct 81691
ttgcgtatat gagccgtata acccacttta attcctctaa acgtggccat tactaaagct 81751
attaatggta caagaaacca tgttttccca tgtctacgtg gtaccaaaaa cacagttgat 81811
ttttgtttga agtgttctaa aacactgtca gaaacacttg gcgtgttaaa cactgtacgc 81871
agaaagcagt caactctgtc ggcatgatcg cccaatagca ccgatgaaat aaaatgcgtg 81931
gtgtgcatga ggatcatttt ttgaaacagt tccaacgtcc ccttatatct gccatagatt 81991
ggaacgtcaa cctttgcgcg tttgccatga cttccacact cttcaatact ctcaaaagat 82051
gtttccacaa ggtacgaaaa ccgttgtgta aaggtagaca actgacagaa actatccgac 82111
agagaaaacg cgcgaaatgt gttcataaca ccgctatacg catttcgatg aggtgctgct 82171
tcttccggtg aatattcata aaactgtaca ctactgacag ccttttttaa ttcagggctt 82231
acgtttgcat ttaccgaata tcgccatggt ttcaaaacta cattgggggt acagttgtac 82291
cctgttgacg atagaaacgc gccaaacatt gcccgtcgag cagtagccga gaacagtgga 82351
atatattcac aacagttgtg aagcgttcca attccgggaa taacggcctg atgacgtcgg 82411
gttacatcta tagcaaaatt cagaaacggg atttgggttg cgtttcccag agacccttgc 82471
cgcgtggaac acggggtagg ggactccaac gtcccaaagc gttcatccct acgacgcttt 82531
agacgttcaa aatatcttac agattcttca ccaagcgtac gaccaaacat tatcaatgac 82591
atttaacatc aattcacgga atccgcctca tctcttgtaa gcagtaaaac aggaagccgc 82651
gtcatcttac gtactcgtta cgtatatatc ataaacattt tcagggccgc attcattcac 82711
tttggtc atg tca ggc cac act cca acc tac gct tct cat agg cgt 82757
Met Ser Gly His Thr Pro Thr Tyr Ala Ser His Arg Arg
15080 15085
aac cgt gtc aaa cta gtt gag gcg cat aac cgc gcg ggg tta ttt 82802
Asn Arg Val Lys Leu Val Glu Ala His Asn Arg Ala Gly Leu Phe
15090 15095 15100
aaa gaa cgg acc ctc gat cta atc cgt ggg ggt gcg agt gta caa 82847
Lys Glu Arg Thr Leu Asp Leu Ile Arg Gly Gly Ala Ser Val Gln
15105 15110 15115
gat cca gca ttt gtg tat gcc ttt act gct gca aaa gag gcc tgc 82892
Asp Pro Ala Phe Val Tyr Ala Phe Thr Ala Ala Lys Glu Ala Cys
15120 15125 15130
gcc gat tta aat aac cag ctc cgc tct gca gct cgc ata gct tca 82937
Ala Asp Leu Asn Asn Gln Leu Arg Ser Ala Ala Arg Ile Ala Ser
15135 15140 15145
gtt gaa cag aag att cgt gat ata caa tcc aag gtt gag gaa caa 82982
Val Glu Gln Lys Ile Arg Asp Ile Gln Ser Lys Val Glu Glu Gln
15150 15155 15160
aca agt att caa cag att tta aat aca aac aga cgc tat ata gca 83027
Thr Ser Ile Gln Gln Ile Leu Asn Thr Asn Arg Arg Tyr Ile Ala
15165 15170 15175
ccc gat ttt att cgc ggt ttg gat aaa aca gaa gac gat aat acc 83072
Pro Asp Phe Ile Arg Gly Leu Asp Lys Thr Glu Asp Asp Asn Thr
15180 15185 15190
gat aat ata gac aga ctg gaa gac gcg gta gga ccg aac atc gaa 83117
Asp Asn Ile Asp Arg Leu Glu Asp Ala Val Gly Pro AsnIle Glu
15195 15200 15205
cac gaa aat cat act tgg ttt gga gaa gac gac gaa gcg tta ctt 83162
His Glu Asn His Thr Trp Phe Gly Glu Asp Asp Glu Ala Leu Leu
15210 15215 15220
aca caa tgg atg ctg acg aca cac ccc cca acc tcc aaa tat ctc 83207
Thr Gln Trp Met Leu Thr Thr His Pro Pro Thr Ser Lys Tyr Leu
15225 15230 15235
caa ctg cag gac ctt tgc gtt ccc acc aca ata ccg acg gac atg 83252
Gln Leu Gln Asp Leu Cys Val Pro Thr Thr Ile Pro Thr Asp Met
15240 15245 15250
aac caa atg caa ccg cag ccg atc agc aag aac gag aat cca cca 83297
Asn Gln Met Gln Pro Gln Pro Ile Ser Lys Asn Glu Asn Pro Pro
15255 15260 15265
acc cca cac acg gat gtg taa atcatccatg ggccaatccg tcaactgcaa 83348
Thr Pro His Thr Asp Val
15270 15275
catgcatgga atcaccagaa cgatcacaac agacaagctt atttttatta aagcacggct 83408
taacgagaga tccaatacat caacgcgaaa gggtggacgt ttttccacaa tttaacaaac 83468
ccccatgggt ttttagaatt tccaaattat cccgtttaat tgtacccatc ttcacgctca 83528
atgaacagtt atgtttttct aaattacaga ttcgagatag acccaggttt gcgggacggg 83588
gaacgtatgg gcgtgttcat atatacccat cgtcaaaaat agctgtaaaa accatggaca 83648
gtcgtgtttt taatagagag ttaattaacg cgattttagc gagtgagggt tctatacgag 83708
caggggaaag gctaggtatt tctagcatag tttgcctttt aggtttttcg ttacaaacca 83768
aacagctact gtttccggca tacgacatgg atatggatga atacattgtt cgcctgtcca 83828
gacggttgac aatacctgat cacatagaca gaaaaattgc ccatgtattt ttagatttgg 83888
ctcaagcgtt gacgttttta aatcgaacgt gcggcctgac ccacctagat gtgaaatgtg 83948
gcaatatttt tcttaacgtc gacaactttg cctcgttgga aataaccaca gcagtaatcg 84008
gagactatag cctagtaaca ttaaatacgt attccctttg tactcgagcg atatttgaag 84068
ttggaaatcc atcccacccg gagcacgtac tacgcgtacc ccgggatgca tcgcagatgt 84128
catttcgttt ggtgttgagt catggaacaa accaaccccc tgaaatcttg cttgattata 84188
ttaatggaac gggccttact aaatatactg gaaccttgcc ccaaagagtt ggacttgcga 84248
ttgatcttta tgcattgggc caagcactct tagaagttat cctgctagga cgtcttcccg 84308
gacaactgcc catttcagta catcggaccc cgcattatca ctactacggt cataagttat 84368
caccagattt ggcgcttgat acgctggcat atcgatgtgt cctggcgcca tatatactcc 84428
catctgacat ccccggggac ttaaattata atccctttat acacgccgga gagctgaaca 84488
cccgtatttc ccggaattct ttacgccgga tattccagtg tcacgcagtg cgttacggcg 84548
taacgcactc aaagcttttc gaaggcatac gcattccggc ctcattatac ccagccactg 84608
ttgttacatc gttgttgtgt cacgataatt cagaaatacg ctcggatcac cctttatt 84666
atg gca cga tcg gga ttg gat agg atc gac ata agc ccc cag cca 84711
Met Ala Arg Ser Gly Leu Asp Arg Ile Asp Ile Ser Pro Gln Pro
15280 15285 15290
gcc aaa aaa att gcc cgt gtg gga ggt cta cag cac cct ttt gta 84756
Ala Lys Lys Ile Ala Arg Val Gly Gly Leu Gln His Pro Phe Val
15295 15300 15305
aaa acg gat att aac acg att aac gtt gaa cac cat ttt ata gac 84801
Lys Thr Asp Ile Asn Thr Ile Asn Val Glu His His Phe Ile Asp
15310 15315 15320
acg cta cag aag aca tca ccg aac atg gac tgt cgc ggg atg aca 84846
Thr Leu Gln Lys Thr Ser Pro Asn Met Asp Cys Arg Gly Met Thr
15325 15330 15335
gcg ggt att ttt att cgt tta tcc cac atg tat aaa att cta aca 84891
Ala Gly Ile Phe Ile Arg Leu Ser His Met Tyr LysIle Leu Thr
15340 15345 15350
act ctg gag tct cca aat gat gta acc tac aca aca ccc ggt tct 84936
Thr Leu Glu Ser Pro Asn Asp Val Thr Tyr Thr Thr Pro Gly Ser
15355 15360 15365
acc aac gca ctg ttc ttt aag acg tcc aca cag cct cag gag ccg 84981
Thr Asn Ala Leu Phe Phe Lys Thr Ser Thr Gln Pro Gln Glu Pro
15370 15375 15380
cgt ccg gaa gag tta gca tcc aaa tta acc caa gac gac att aaa 85026
Arg Pro Glu Glu Leu Ala Ser Lys Leu Thr Gln Asp Asp Ile Lys
15385 15390 15395
cgt att cta tta aca ata gaa tcg gag act cgt ggt cag ggc gac 85071
Arg Ile Leu Leu Thr Ile Glu Ser Glu Thr Arg Gly Gln Gly Asp
15400 15405 15410
aat gcc att tgg aca cta ctc aga cga aat tta atc acc gca tca 85116
Asn Ala Ile Trp Thr Leu Leu Arg Arg Asn Leu Ile Thr Ala Ser
15415 15420 15425
act ctt aaa tgg agt gta tct gga ccc gtc att cca cct cag tgg 85161
Thr Leu Lys Trp Ser Val Ser Gly Pro Val Ile Pro Pro Gln Trp
15430 15435 15440
ttt tac cac cat aac act aca gac aca tac ggt gat gcg gcg gca 85206
Phe Tyr His His Asn Thr Thr Asp Thr Tyr Gly Asp Ala Ala Ala
15445 15450 15455
atg gcg ttt gga aaa acc aac gaa ccg gcg gca cga gcg ata gtt 85251
Met Ala Phe Gly Lys Thr Asn Glu Pro Ala Ala Arg Ala Ile Val
15460 15465 15470
gaa gca ttg ttt ata gat ccg gct gat atc cgt act cct gat cat 85296
Glu Ala Leu Phe Ile Asp Pro Ala Asp Ile Arg Thr Pro Asp His
15475 15480 15485
tta acg cca gaa gct aca act aag ttt ttt aat ttt gac atg ctc 85341
Leu Thr Pro Glu Ala Thr Thr Lys Phe Phe Asn Phe Asp Met Leu
15490 15495 15500
aat acc aaa tct cca agt ctc ctt gtg ggt aca cca aga atc gga 85386
Asn Thr Lys Ser Pro Ser Leu Leu Val Gly Thr Pro Arg Ile Gly
15505 15510 15515
acg tat gaa tgt gga ctt tta atc gac gtt cga acg gga ctt ata 85431
Thr Tyr Glu Cys Gly Leu Leu Ile Asp Val Arg Thr Gly Leu Ile
15520 15525 15530
ggc gcg tcg ttg gac gtt ctt gta tgt gac agg gac cct tta act 85476
Gly Ala Ser Leu Asp Val Leu Val Cys Asp Arg Asp Pro Leu Thr
15535 15540 15545
ggc acc cta aat ccc cac cct gca gaa acc gac att tca ttt ttt 85521
Gly Thr Leu Asn Pro His Pro Ala Glu Thr Asp Ile Ser Phe Phe
15550 15555 15560
gaa att aaa tgt cgt gct aaa tac ctc ttt gat cca gat gac aaa 85566
Glu Ile Lys Cys Arg Ala Lys Tyr Leu Phe Asp Pro Asp Asp Lys
15565 15570 15575
aat aac ccg ctc ggt cgg acg tac acc acg tta ata aat aga cct 85611
Asn Asn Pro Leu Gly Arg Thr Tyr Thr Thr Leu Ile Asn Arg Pro
15580 15585 15590
aca atg gca aat cta cgg gac ttt tta tat act ata aaa aac cca 85656
Thr Met Ala Asn Leu Arg Asp Phe Leu Tyr Thr Ile Lys Asn Pro
15595 15600 15605
tgt gta agc ttc ttt gga ccc tca gca aac cca agt aca cgc gag 85701
Cys Val Ser Phe Phe Gly Pro Ser Ala Asn Pro Ser Thr Arg Glu
15610 15615 15620
gcc tta ata acg gat cac gtt gaa tgg aaa cgt tta gga ttt aaa 85746
Ala Leu Ile Thr Asp His Val Glu Trp Lys Arg Leu Gly Phe Lys
15625 15630 15635
ggt ggg agg gcc ctt aca gaa ctc gac gcc cat cat ttg ggc ctc 85791
Gly Gly Arg Ala Leu Thr Glu Leu Asp Ala His His Leu Gly Leu
15640 15645 15650
aat cgg aca atc tca tcc cga gtg tgg gta ttt aat gat ccg gac 85836
Asn Arg Thr Ile Ser Ser Arg Val Trp Val Phe Asn Asp Pro Asp
15655 15660 15665
ata caa aag ggg aca att aca acc att gca tgg gcc act gga gat 85881
Ile Gln Lys Gly Thr Ile Thr Thr Ile Ala Trp Ala Thr Gly Asp
15670 15675 15680
acg gct ctt caa att cct gta ttt gcc aat ccg cgg cac gct aac 85926
Thr Ala Leu Gln Ile Pro Val Phe Ala Asn Pro Arg His Ala Asn
15685 15690 15695
ttt aaa caa att gcc gta caa acc tat gta tta tcc ggt tac ttt 85971
Phe Lys Gln Ile Ala Val Gln Thr Tyr Val Leu Ser Gly Tyr Phe
15700 15705 15710
cca gcg cta aaa cta cgg ccc ttc ctt gtc acc ttt ata gga cgt 86016
Pro Ala Leu Lys Leu Arg Pro Phe Leu Val Thr Phe Ile Gly Arg
15715 15720 15725
gtg cgc cga cca cac gag gtg gga gtc cca ttg cgc gtc gat aca 86061
Val Arg Arg Pro His Glu Val Gly Val Pro Leu Arg Val Asp Thr
15730 15735 15740
caa gcg gct gcc att tac gaa tat aac tgg ccg act atc cca ccc 86106
Gln Ala Ala Ala Ile Tyr Glu Tyr Asn Trp Pro Thr Ile Pro Pro
15745 15750 15755
cac tgt gcg gtt ccg gtt ata gcc gtt cta acg cct atc gaa gtt 86151
His Cys Ala Val Pro Val Ile Ala Val Leu Thr Pro Ile Glu Val
15760 15765 15770
gat gtg cct aga gtg aca caa ata ctt aaa gac aca gga aac aac 86196
Asp Val Pro Arg Val Thr Gln Ile Leu Lys Asp Thr Gly Asn Asn
15775 15780 15785
gcg att aca tca gca ttg cgg tca ttg cga tgg gac aat ctt cat 86241
Ala Ile Thr Ser Ala Leu Arg Ser Leu Arg Trp Asp Asn Leu His
15790 15795 15800
cca gcg gtc gag gag gaa tct gtg gat tgt gca aac ggt aca acg 86286
Pro Ala Val Glu Glu Glu Ser Val Asp Cys Ala Asn Gly Thr Thr
15805 15810 15815
agc ttg tta cgt gca acg gag aaa ccg ttg ctt tga actcagagtt 86332
Ser Leu Leu Arg Ala Thr Glu Lys Pro Leu Leu
15820 15825
ctttgaagac tttgactttg atgagaatgt aacagaggac gccgataaat ccacacaacg 86392
ccgcccacga gtgatcgatg taacaccaaa acgaaaacct tcgggaaaga gctcccattc 86452
caaatgcgca aaatgttaaa ccctgataaa ccctgataaa cgttctaata aaaacatcaa 86512
atcatggttg gttactgtga atgtttgttt tattgcttgg gggtttacaa gtacaaccca 86572
cgctactccc acccactgtt tgatcgctcg tataacagct catcctcgcg gtccgtttca 86632
tatgttgagt cattttcata gacgtagccg tagccttgtg atgggtaatt tgtgcggcga 86692
gaatttctat gtgcaggttt tacttttcgt atgtatcccc gtacccgctc gggtactctt 86752
cttacggcac cgtagaaccg actgcgtttc tgtcgatgat acacatatgc acgcatcaat 86812
ctgagaagca acatgacaac ggaaaacacg gccaggcaag ccaaggttcc ccgagttgtg 86872
ggaattaacc gtggagattg aaccgatata gggtcatata atcggtccat atacgagtgc 86932
gcggcggttc ccaacgtagc acaggccacg agcgttccca gggacggtcc tattaacacg 86992
tgtatataat gcgccaaaat taattctgat actataagat atacaactga caatgtacta 87052
aatgtagaca tggccacgga caccgatgac cacagtcccg tatgtagatg attcgccacc 87112
acaagttcca gcattaatga tacaaatagg atacatatcg ccatcaacgc agccatcaaa 87172
ttcacgaaca ctgcgcgcgt aggccccgca aggcgatata aaaagacgct ctgctgtcgt 87232
aaatttgcga ccgcttttat gttcgtttcg tccaattttc cgcgtccaca aaaatacgtt 87292
gtaaatatta cacttgtcgc aaaatgtcca agatataatg tagcagccac gccgatttgc 87352
ttgtaagcta ataataacac aacggcgttt aataaccaca atgacaaaag accccaaaaa 87412
agtgttgtgg gatctacaac taaccatgca acaccggagc tttgccggac acgttgattt 87472
ttcgtttctc ggtgtataat cgcggccgtg atcagtgtat ataccgccat ggccattgcc 87532
gttaaagccg tgtagtaagt aaatgccaca acgctatgtg gttccaaaaa caaaaccggg 87592
gcgctgtatc cacctctatt tccggaccat acccccccat ctagggtggc gttaaataac 87652
tcataatcaa ctacggcagc ataaaaacaa gggatcccgg tatattcaga agaggcggca 87712
attaacgtag ccaggagcat taccgcaccc aaagtgaaca tcatcacctg aattatccaa 87772
attcgccaat taagcgtatc catttgatga tctaacgctt ccacctcggg tgtcgtggtg 87832
tcgtacggcg agactttttc agaacgcggc cccttctttt gagttccc atg tct ccc 87889
Met Ser Pro
aac acc ggg gag agc aac gcc gcc gtc tat gcg tcc agt aca cag 87934
Asn Thr Gly Glu Ser Asn Ala Ala Val Tyr Ala Ser Ser Thr Gln
15830 15835 15840
ctc gcg cgg gcg tta tat gga ggg gat ctg gtt tcg tgg att aaa 87979
Leu Ala Arg Ala Leu Tyr Gly Gly Asp Leu Val Ser Trp Ile Lys
15845 15850 15855
cac acc cac ccg gga att agc ctg gaa ctg caa ttg gat gtt cca 88024
His Thr His Pro Gly Ile Ser Leu Glu Leu Gln Leu Asp Val Pro
15860 15865 15870
gta aaa cta ata aaa cct ggt atg tca caa act cgc ccg gta acc 88069
Val Lys Leu Ile Lys Pro Gly Met Ser Gln Thr Arg Pro Val Thr
15875 15880 15885
gtc gta cgt gcc cct atg ggc tct ggt aaa aca aca gcc ttg ctt 88114
Val Val Arg Ala Pro Met Gly Ser Gly Lys Thr Thr Ala Leu Leu
15890 15895 15900
gag tgg ctt caa cac gcg tta aag gca gat att agc gta ctg gtt 88159
Glu Trp Leu Gln His Ala Leu Lys Ala Asp Ile Ser Val Leu Val
15905 15910 15915
gtc tca tgt cgc cgt agc ttt acc cag acg ttg att caa cgg ttt 88204
Val Ser Cys Arg Arg Ser Phe Thr Gln Thr Leu Ile Gln Arg Phe
15920 15925 15930
aac gat gca ggc ctc tcc gga ttc gta aca tat ttg aca tcc gag 88249
Asn Asp Ala Gly Leu Ser Gly Phe Val Thr Tyr Leu Thr Ser Glu
15935 15940 15945
aca tat att atg ggt ttt aaa cgt ttg att gtg caa ctt gaa agc 88294
Thr Tyr Ile Met Gly Phe Lys Arg Leu Ile Val Gln Leu Glu Ser
15950 15955 15960
cta cac cgc gta tcc agc gaa gct atc gac agc tac gac gta tta 88339
Leu His Arg Val Ser Ser Glu Ala Ile Asp Ser Tyr Asp Val Leu
15965 15970 15975
ata ctg gat gag gta atg tca gtg att gga caa tta tac tcc ccc 88384
Ile Leu Asp Glu Val Met Ser Val Ile Gly Gln Leu Tyr Ser Pro
15980 15985 15990
aca atg aga cgt ctt tcc gcg gtt gat agc cta tta tat cgt ctt 88429
Thr Met Arg Arg Leu Ser Ala Val Asp Ser Leu Leu Tyr Arg Leu
15995 16000 16005
tta aat cgc tgt tct caa att atc gcg atg gat gct aca gta aac 88474
Leu Asn Arg Cys Ser Gln Ile Ile Ala Met Asp Ala Thr Val Asn
16010 16015 16020
tcg cag ttt att gat tta atc tcc gga ttg cgt gga gat gaa aac 88519
Ser Gln Phe Ile Asp Leu Ile Ser Gly Leu Arg Gly Asp Glu Asn
16025 16030 16035
ata cac aca att gtg tgt aca tac gcg gga gtt ggg ttc tcc gga 88564
Ile His Thr Ile Val Cys Thr Tyr Ala Gly Val Gly Phe Ser Gly
16040 16045 16050
aga act tgc acg atc ctg cgt gat atg ggc atc gac acg ctt gtg 88609
Arg Thr Cys Thr Ile Leu Arg Asp Met Gly Ile Asp Thr Leu Val
16055 16060 16065
cga gtc att aaa cga tct cct gaa cac gag gat gta cgt acc ata 88654
Arg Val Ile Lys Arg Ser Pro Glu His Glu Asp Val Arg Thr Ile
16070 16075 16080
cac caa cta cgt gga aca ttt ttt gac gaa cta gca cta cga tta 88699
His Gln Leu Arg Gly Thr Phe Phe Asp Glu Leu Ala Leu Arg Leu
16085 16090 16095
caa tgt ggg cat aac atc tgt ata ttt tca tca act tta tcg ttt 88744
Gln Cys Gly His Asn Ile Cys Ile Phe Ser Ser Thr Leu Ser Phe
16100 16105 16110
tcg gag cta gtt gct cag ttt tgt gca ata ttt aca gac tct att 88789
Ser Glu Leu Val Ala Gln Phe Cys Ala Ile Phe Thr Asp Ser Ile
16115 16120 16125
ctt att tta aac tca act cgg ccc cta tgt aat gta aac gaa tgg 88834
Leu Ile Leu Asn Ser Thr Arg Pro Leu Cys Asn Val Asn Glu Trp
16130 16135 16140
aaa cat ttt cgc gtg ttg gtg tac act acc gtc gtg acc gtt gga 88879
Lys His Phe Arg Val Leu Val Tyr Thr Thr Val Val Thr Val Gly
16145 16150 16155
ttg agt ttt gac atg gct cat ttt cat agc atg ttt gct tac ata 88924
Leu Ser Phe Asp Met Ala His Phe His Ser Met Phe Ala Tyr Ile
16160 16165 16170
aag cca atg tca tat ggg ccg gat atg gta tcg gtc tac cag tca 88969
Lys Pro Met Ser Tyr Gly Pro Asp Met Val Ser Val Tyr Gln Ser
16175 16180 16185
tta ggg cgt gta cgt tta ttg cta ctt aat gaa gtt ttg atg tac 89014
Leu Gly Arg Val Arg Leu Leu Leu Leu Asn Glu Val Leu Met Tyr
16190 16195 16200
gtc gat ggc tca agg acc aga tgc gga ccc ctg ttc tcg cca atg 89059
Val Asp Gly Ser Arg Thr Arg Cys Gly Pro Leu Phe Ser Pro Met
16205 16210 16215
tta cta aac ttt acc atc gca aat aaa ttt caa tgg ttt cct aca 89104
Leu Leu Asn Phe Thr Ile Ala Asn Lys Phe Gln Trp Phe Pro Thr
16220 16225 16230
cac acc caa ata act aac aaa ctg tgc tgt gca ttt agg caa cga 89149
His Thr Gln Ile Thr Asn Lys Leu Cys Cys Ala Phe Arg Gln Arg
16235 16240 16245
tgt gca aat gca ttt aca cgc tcg aac acc cat ctc ttc tca aga 89194
Cys Ala Asn Ala Phe Thr Arg Ser Asn Thr His Leu Phe Ser Arg
16250 16255 16260
ttt aaa tac aaa cac ctt ttc gag aga tgc tct ctt tgg agt tta 89239
Phe Lys Tyr Lys His Leu Phe Glu Arg Cys Ser Leu Trp Ser Leu
16265 16270 16275
gcc gat agc att aat atc tta caa act ctt ttg gcc tct aac caa 89284
Ala Asp Ser Ile Asn Ile Leu Gln Thr Leu Leu Ala Ser Asn Gln
16280 16285 16290
att ttg gtt gta ttg gat ggc atg ggt cca ata acg gac gtt tcc 89329
Ile Leu Val Val Leu Asp Gly Met Gly Pro Ile Thr Asp Val Ser
16295 16300 16305
cca gtt caa ttt tgt gca ttt ata cac gat ctc aga cat agc gct 89374
Pro Val Gln Phe Cys Ala Phe Ile His Asp Leu Arg His Ser Ala
16310 16315 16320
aac gcc gta gct tcc tgt atg cgt tct ctt aga cag gac aat gac 89419
Asn Ala Val Ala Ser Cys Met Arg Ser Leu Arg Gln Asp Asn Asp
16325 16330 16335
agc tgc ttg acc gat ttt ggc cct tcc gga ttt atg gcc gat aac 89464
Ser Cys Leu Thr Asp Phe Gly Pro Ser Gly Phe Met Ala Asp Asn
16340 16345 16350
att acc gcg ttt atg gaa aag tat ctt atg gag tca att aat acc 89509
Ile Thr Ala Phe Met Glu Lys Tyr Leu Met Glu Ser Ile Asn Thr
16355 16360 16365
gaa gaa caa att aaa gta ttt aaa gcc ctt gca tgt cca ata gaa 89554
Glu Glu Gln Ile Lys Val Phe Lys Ala Leu Ala Cys Pro Ile Glu
16370 16375 16380
cag cct aga cta gtc aat acg gca ata ttg ggg gcg tgt ata cga 89599
Gln Pro Arg Leu Val Asn Thr Ala Ile Leu Gly Ala CysIle Arg
16385 16390 16395
ata cct gaa gcg ttg gaa gca ttt gac gta ttt caa aaa ata tac 89644
Ile Pro Glu Ala Leu Glu Ala Phe Asp Val Phe Gln Lys Ile Tyr
16400 16405 16410
acg cac tac gct tcc ggt tgg ttt ccc gtc ctg gac aaa acc ggg 89689
Thr His Tyr Ala Ser Gly Trp Phe Pro Val Leu Asp Lys Thr Gly
16415 16420 16425
gaa ttt agc atc gcg act ata act acc gcc cca aat tta acc aca 89734
Glu Phe Ser Ile Ala Thr Ile Thr Thr Ala Pro Asn Leu Thr Thr
16430 16435 16440
cat tgg gag ctg ttt cgc cgt tgt gcc tat att gca aaa aca ctc 89779
His Trp Glu Leu Phe Arg Arg Cys Ala Tyr Ile Ala Lys Thr Leu
16445 16450 16455
aag tgg aat ccg tcc acc gaa ggc tgt gta aca caa gtt ttg gat 89824
Lys Trp Asn Pro Ser Thr Glu Gly Cys Val Thr Gln Val Leu Asp
16460 16465 16470
acg gac att aat aca ctt ttc aat caa cac ggg gat tcg ctg gct 89869
Thr Asp Ile Asn Thr Leu Phe Asn Gln His Gly Asp Ser Leu Ala
16475 16480 16485
caa cta ata ttt gag gtt atg cgc tgt aac gtt act gac gct aag 89914
Gln Leu Ile Phe Glu Val Met Arg Cys Asn Val Thr Asp Ala Lys
16490 16495 16500
att ata tta aac cgc ccg gtt tgg cga aca acc gga ttc tta gat 89959
Ile Ile Leu Asn Arg Pro Val Trp Arg Thr Thr Gly Phe Leu Asp
16505 16510 16515
gga tgc cat aat caa tgc ttc cgt cca atc cct aca aaa cac gaa 90004
Gly Cys His Asn Gln Cys Phe Arg Pro Ile Pro Thr Lys His Glu
16520 16525 16530
tat aac att gct cta ttt cgt tta att tgg gaa caa tta ttt ggc 90049
Tyr Asn Ile Ala Leu Phe Arg Leu Ile Trp Glu Gln Leu Phe Gly
16535 16540 16545
gcc cgc gta act aaa agt acc cag acc ttt ccg gga agt act cgt 90094
Ala Arg Val Thr Lys Ser Thr Gln Thr Phe Pro Gly Ser Thr Arg
16550 16555 16560
gtg aaa aac cta aaa aaa aaa gat cta gaa act tta ctt gat tca 90139
Val Lys Asn Leu Lys Lys Lys Asp Leu Glu Thr Leu Leu Asp Ser
16565 16570 16575
att aac gtg gat cgt tct gca tgt cgt acc tac cgc cag ttg tat 90184
Ile Asn Val Asp Arg Ser Ala Cys Arg Thr Tyr Arg Gln Leu Tyr
16580 16585 16590
aac ctg ctt atg agc cag cgc cat tcg ttc tct caa cag cgt tac 90229
Asn Leu Leu Met Ser Gln Arg His Ser Phe Ser Gln Gln Arg Tyr
16595 16600 16605
aaa att act gcc ccc gct tgg gca cgc cac gtg tat ttt caa gca 90274
Lys Ile Thr Ala Pro Ala Trp Ala Arg His Val Tyr Phe Gln Ala
16610 16615 16620
cat caa atg cac ttg gcc ccg cat gcc gaa gcc atg cta caa tta 90319
His Gln Met His Leu Ala Pro His Ala Glu Ala Met Leu Gln Leu
16625 16630 16635
gcg cta tcg gaa ctg tcc ccg gga tcg tgg ccg cgg ata aac ggg 90364
Ala Leu Ser Glu Leu Ser Pro Gly Ser Trp Pro Arg Ile Asn Gly
16640 16645 16650
gcg gta aat ttt gaa agt tta taa cccgttaata ccatatatgg 90408
Ala Val Asn Phe Glu Ser Leu
16655 16660
acatccatag ggggggttac ataaatacta agcctctgta caacacaaag ggcctctaac 90468
aatgcactga accacaacca agct atg gac gca acg cag att acc ttg 90516
Met Asp Ala Thr Gln Ile Thr Leu
16665
gtt aga gaa agc gga cac att tgt gcc gca agc ata tac aca tcc 90561
Val Arg Glu Ser Gly His Ile Cys Ala Ala Ser Ile Tyr Thr Ser
16670 16675 16680
tgg aca cag tcc gga caa tta aca cag aac ggt ctt tcc gtg tta 90606
Trp Thr Gln Ser Gly Gln Leu Thr Gln Asn Gly Leu Ser Val Leu
16685 16690 16695
tac tac tta tta tgc aaa aac tca tgt ggg aaa tac gtc cct aag 90651
Tyr Tyr Leu Leu Cys Lys Asn Ser Cys Gly Lys Tyr Val Pro Lys
16700 16705 16710
ttt gcc gaa att acc gta caa caa gag gat tta tgt cgc tac tcc 90696
Phe Ala Glu Ile Thr Val Gln Gln Glu Asp Leu Cys Arg Tyr Ser
16715 16720 16725
agg cat ggg ggg agt gtt tct gcg gca acg ttt gcg tct atc tgc 90741
Arg His Gly Gly Ser Val Ser Ala Ala Thr Phe Ala SerIle Cys
16730 16735 16740
agg gcg gcg tcc tcg gct gcg tta gac gcc tgg ccc ctt gaa cca 90786
Arg Ala Ala Ser Ser Ala Ala Leu Asp Ala Trp Pro Leu Glu Pro
16745 16750 16755
ctg ggt aac gca gac acc tgg cgt tgt ctc cat ggc act gcc ctg 90831
Leu Gly Asn Ala Asp Thr Trp Arg Cys Leu His Gly Thr Ala Leu
16760 16765 16770
gcc act tta cgg cgc gta tta ggg ttt aaa tcg ttt tat tcg cca 90876
Ala Thr Leu Arg Arg Val Leu Gly Phe Lys Ser Phe Tyr Ser Pro
16775 16780 16785
gta aca ttc gag act gat acg aat aca ggt ctt ctg tta aaa aca 90921
Val Thr Phe Glu Thr Asp Thr Asn Thr Gly Leu Leu Leu Lys Thr
16790 16795 16800
atc ccc gat gaa cac gcg ttg aat aat gac aac acg cca tct acc 90966
Ile Pro Asp Glu His Ala Leu Asn Asn Asp Asn Thr Pro Ser Thr
16805 16810 16815
gga gta ttg agg gct aat ttt ccc gtg gcc att gat gtt tca gca 91011
Gly Val Leu Arg Ala Asn Phe Pro Val Ala Ile Asp Val Ser Ala
16820 16825 16830
gtc agc gca tgt aac gcc cac acg caa ggt acg tcg cta gcc tac 91056
Val Ser Ala Cys Asn Ala His Thr Gln Gly Thr Ser Leu Ala Tyr
16835 16840 16845
gcc cgc ctg acc gca ctt aaa tct aac ggt gac acc cag caa caa 91101
Ala Arg Leu Thr Ala Leu Lys Ser Asn Gly Asp Thr Gln Gln Gln
16850 16855 16860
aca cct tta gac gtg gag gta att aca cca aag gcc tac ata cgt 91146
Thr Pro Leu Asp Val Glu Val Ile Thr Pro Lys Ala Tyr Ile Arg
16865 16870 16875
cgg aaa tat aag tct acg ttt tcc ccc cct ata gag cgg gaa ggc 91191
Arg Lys Tyr Lys Ser Thr Phe Ser Pro Pro Ile Glu Arg Glu Gly
16880 16885 16890
caa acc tcc gat ttg ttt aac ctt gaa gaa cgc cgc ttg gtt ctt 91236
Gln Thr Ser Asp Leu Phe Asn Leu Glu Glu Arg Arg Leu Val Leu
16895 16900 16905
agt ggc aat cgc gca att gtg gta agg gta ctc tta ccg tgt tat 91281
Ser Gly Asn Arg Ala Ile Val Val Arg Val Leu Leu Pro Cys Tyr
16910 16915 16920
ttt gac tgt tta aca acg gat tcc acc gtt aca tct tcc ctt tca 91326
Phe Asp Cys Leu Thr Thr Asp Ser Thr Val Thr Ser Ser Leu Ser
16925 16930 16935
ata tta gca aca tat aga ctg tgg tac gcg gcg gcg ttt gga aaa 91371
Ile Leu Ala Thr Tyr Arg Leu Trp Tyr Ala Ala Ala Phe Gly Lys
16940 16945 16950
ccc ggg gtt gtc cgt cca atc ttt gcg tat tta ggc ccg gaa ctc 91416
Pro Gly Val Val Arg Pro Ile Phe Ala Tyr Leu Gly Pro Glu Leu
16955 16960 16965
aat ccg aag ggt gaa gac aga gac tac ttt tgt act gtc gga ttt 91461
Asn Pro Lys Gly Glu Asp Arg Asp Tyr Phe Cys Thr Val Gly Phe
16970 16975 16980
ccc gga tgg acc act ctt cgg aca caa act cca gcc gtc gaa tct 91506
Pro Gly Trp Thr Thr Leu Arg Thr Gln Thr Pro Ala Val Glu Ser
16985 16990 16995
att cgc acg gct acg gag atg tac atg gaa acg gat ggg ttg tgg 91551
Ile Arg Thr Ala Thr Glu Met Tyr Met Glu Thr Asp Gly Leu Trp
17000 17005 17010
cca gta acc ggt att cag gcc ttt cat tat cta gcc ccc tgg gga 91596
Pro Val Thr Gly Ile Gln Ala Phe His Tyr Leu Ala Pro Trp Gly
17015 17020 17025
cag cat ccc ccc tta cct ccg cgg gtg cag gat ctt att ggg caa 91641
Gln His Pro Pro Leu Pro Pro Arg Val Gln Asp Leu Ile Gly Gln
17030 17035 17040
atc cct caa gat act gga cat gca gat gca act gtc aat tgg gac 91686
Ile Pro Gln Asp Thr Gly His Ala Asp Ala Thr Val Asn Trp Asp
17045 17050 17055
gcg ggc cgg ata tct acc gtc ttc aaa cag cct gta caa cta caa 91731
Ala Gly Arg Ile Ser Thr Val Phe Lys Gln Pro Val Gln Leu Gln
17060 17065 17070
gat cgt tgg atg gca aag ttt gat ttc agc gcc ttt ttt ccc acg 91776
Asp Arg Trp Met Ala Lys Phe Asp Phe Ser Ala Phe Phe Pro Thr
17075 17080 17085
ata tac tgc gct atg ttc ccc atg cat ttt aga tta ggc aaa atc 91821
Ile Tyr Cys Ala Met Phe Pro Met His Phe Arg Leu Gly Lys Ile
17090 17095 17100
gtc ctg gct aga atg cgt cga gga atg ggg tgc cta aaa ccc gcg 91866
Val Leu Ala Arg Met Arg Arg Gly Met Gly Cys Leu Lys Pro Ala
17105 17110 17115
ttg gtg tct ttt ttt ggg ggg tta cgg cac ata ctc ccg agt ata 91911
Leu Val Ser Phe Phe Gly Gly Leu Arg His Ile Leu Pro Ser Ile
17120 17125 17130
tac aaa gct att att ttt ata gcc aat gaa att agc ctt tgc gtc 91956
Tyr Lys Ala Ile Ile Phe Ile Ala Asn Glu Ile Ser Leu Cys Val
17135 17140 17145
gaa caa acg gcc ttg gaa cag ggc ttt gct ata tgt act tat ata 92001
Glu Gln Thr Ala Leu Glu Gln Gly Phe Ala Ile Cys Thr Tyr Ile
17150 17155 17160
aaa gat gga ttt tgg gga atc ttc acc gat tta cat acg cgc aat 92046
Lys Asp Gly Phe Trp Gly Ile Phe Thr Asp Leu His Thr Arg Asn
17165 17170 17175
gta tgt tca gat cag gca cgt tgt tcg gcc tta aat tta gcg gcc 92091
Val Cys Ser Asp Gln Ala Arg Cys Ser Ala Leu Asn Leu Ala Ala
17180 17185 17190
acc tgc gaa aga gca gtc acg ggc tta tta cga att caa cta ggt 92136
Thr Cys Glu Arg Ala Val Thr Gly Leu Leu Arg Ile Gln Leu Gly
17195 17200 17205
ctt aac ttt aca ccc gcc atg gaa ccg gta ctc cgg gtc gag ggt 92181
Leu Asn Phe Thr Pro Ala Met Glu Pro Val Leu Arg Val Glu Gly
17210 17215 17220
gtg tac act cac gca ttt acc tgg tgt acc acg gga agc tgg ctg 92226
Val Tyr Thr His Ala Phe Thr Trp Cys Thr Thr Gly Ser Trp Leu
17225 17230 17235
tgg aat tta caa aca aac acg cct ccg gat tta gtt ggc gtg cca 92271
Trp Asn Leu Gln Thr Asn Thr Pro Pro Asp Leu Val Gly Val Pro
17240 17245 17250
tgg cga agt cag gcg gcg cga gat tta aag gag cgt ctt tca gga 92316
Trp Arg Ser Gln Ala Ala Arg Asp Leu Lys Glu Arg Leu Ser Gly
17255 17260 17265
ctc cta tgt acc gca aca aaa att cga gaa cgg ata cag gaa aat 92361
Leu Leu Cys Thr Ala Thr Lys Ile Arg Glu Arg Ile Gln Glu Asn
17270 17275 17280
tgc ata tgg gac cat gtc cta tac gac ata tgg gcc gga caa gtt 92406
Cys Ile Trp Asp His Val Leu Tyr Asp Ile Trp Ala Gly Gln Val
17285 17290 17295
gtg gag gct gcc aga aaa aca tac gtc gat ttt ttt gaa cat gtt 92451
Val Glu Ala Ala Arg Lys Thr Tyr Val Asp Phe Phe Glu His Val
17300 17305 17310
ttt gat cgc cgt tat act ccg gta tac tgg agt ctt cag gag caa 92496
Phe Asp Arg Arg Tyr Thr Pro Val Tyr Trp Ser Leu Gln Glu Gln
17315 17320 17325
aat tcg gaa aca aaa gca ata ccg gca tct tat ctg aca tac gga 92541
Asn Ser Glu Thr Lys Ala Ile Pro Ala Ser Tyr Leu Thr Tyr Gly
17330 17335 17340
cac atg caa gat aag gat tat aaa cca aga cag ata att atg gtt 92586
His Met Gln Asp Lys Asp Tyr Lys Pro Arg Gln Ile Ile Met Val
17345 17350 17355
cgt aat ccc aac cca cat gga cct cct act gtt gtt tac tgg gaa 92631
Arg Asn Pro Asn Pro His Gly Pro Pro Thr Val Val Tyr Trp Glu
17360 17365 17370
ttg cta cca tcg tgt gcc tgt att ccc ccc ata gac tgc gct gct 92676
Leu Leu Pro Ser Cys Ala Cys Ile Pro Pro Ile Asp Cys Ala Ala
17375 17380 17385
cat ctc aag ccc ctt ata cac acg ttt gtc act att att aac cat 92721
His Leu Lys Pro Leu Ile His Thr Phe Val Thr Ile Ile Asn His
17390 17395 17400
ctt cta gat gct cat aat gat ttt tca agt cca tca ttg aaa ttt 92766
Leu Leu Asp Ala His Asn Asp Phe Ser Ser Pro Ser Leu Lys Phe
17405 17410 17415
act gac gat ccc ctt gct tca tat aac ttc ttg ttt tta tga 92808
Thr Asp Asp Pro Leu Ala Ser Tyr Asn Phe Leu Phe Leu
17420 17425 17430
caaaaaaaca cgccgcaaca acccatcctt aaaataaaag gtttatttac tttacaaccc 92868
gtggtgaatt tttatacgtt tcaaataact gaacattttt cggtgttacc atggtgcgat 92928
ttaaccacca aaaatatacg ctcttctgat attccgaatc tcgtaaaggt ccatttaaca 92988
atcccggggg tacttgcacc acaccatctg gacagggggg ggttccgtgg ggcaggtcaa 93048
aacgctgacc caccccacat gaatatatag cctttataat attgggggcc gttccaggct 93108
gagggttcag taacttaaca aacatataat gcggcaatac gcgggttttt gtaaaggggt 93168
tgttatcaac gacatacatt agagtgttta acaaccataa aactccctca tataaaaacc 93228
gacgcatttt ttccaaaggt cctatttgac actcaacgcg tctaagatat acagacaatt 93288
gtacaaacag cgatggagat gccccggagg gcccaatgcc ttccagatac attaaaataa 93348
cacataaggt aaaatctagg acattatccg ggcggaatag agtcatccga tagattaaca 93408
ggcgcggagg cacccccacc gtatacaccc tatcttcaac cgcagttaat acggaaaaaa 93468
taaatccgcg gaacgctggt tgagtaacac actccatgta gtaacgatca caggacacct 93528
cacttgaatc accattcaac actactaaaa cggtctcttg gtgttccggt tttacgcgca 93588
gtgatacaac agagtttgcc aaaaagcgtg gcttcaaacc ggttacctcc cgcgcctcgc 93648
atacgaatct tggtattgct tgtattctaa gatcttcgat cacgtcgctc acatccaacc 93708
cctcttcggc tcgtgttagt aagttgtcga tcgttacgct gcaacctaaa atgctgggta 93768
tatttattcc ggacatccca tcggccatcc ccgcgcctcc ggtttgctcg aattttatcc 93828
agtaaggtcg aatccgctgc atttaccttg tgtacccgta acctctcagg ggggtgtcct 93888
ttcataaaat gggataggtt tttatatcca acatgcatgt attggttatt tattttattg 93948
ggttccggga ttctttcgtc atcttctgta gggtcaggca aaccccagga aggacttggt 94008
gttctccgtg ggccccgttt tattacctct gcgcgaacct gcatttcata taatattcgg 94068
atttgggata aataggactc tgttctcgcc tttttaaaaa tagcctggca taactcttcc 94128
tctgacctat gtacctcgct ttgagttacc aagaatccta atcgggtggc ccgtaatatg 94188
aatgaaaaat acggcgcaac tagtaatgag attgacgcat ttgaatatga tacagaaatt 94248
tcctggcctt gattattgtt tacccggtga agcttaaaac agcgaacaag ttcctgtttc 94308
catagctcag acaaacgttt tatatcatct ccataagggg ggatataacg agattgaaaa 94368
ctattggcaa tatatgcatc atcccctatt atgccggtaa gatctataac ctcgtgattt 94428
aaatcggcaa tacgtgtttc ttctgccatt gtaatatgtg accctttaga tggctttatt 94488
tttaccctct cttcccgtaa ccgtttcagc tctccttctt tgaactggag cctttcggtc 94548
agatcgctgt tcacatcctt gagaccctca atggttttga ataaattatt cacataaccc 94608
tcgagcatgc cgttgatact gttaaccacc gaagttttaa acgcactttg aacgtttgtt 94668
gttccggaca ttgccccccc gttaaaggat tggttggcct tgccaaaccc cggttgtgat 94728
gtgtccaccg atccacttcc ttccagaatg tgattgcccg tttcttctag ataggaacgt 94788
acggtttcgg taatatctcc aacatgtctc atgtttttta agttaactat tagctttaca 94848
agtctagacg cggccgatcc agcccgtgtt gtatcgttct cgcccattat acgatcaacc 94908
gcacgtgtgc tgtgagatct atcatcttca ttccggcgac ctattaacac gcgcaaaggg 94968
gctgtattta aaacttggca gacgcgagca tgttcacgta atgcataaca ggccaacacc 95028
tccccagaaa gccgctgtaa gggtgagtca aatactacac cctccccaca tacaacgggc 95088
ggccacacga ccaaacactc tcccttcatg cccgttacat catcctttgc cataattaat 95148
cttcggttat aattataata aagacgcgtc ctatcataat ccataatagc aacattttgc 95208
atacactcaa ctaggcttgt gacaaccgcc gctcctctgg ccaacgttgc atcggcaact 95268
tttaacatct gggacagttc tgccgcttga cccatatacg tatttaatgg tgcaggggtt 95328
ccattctgtt ctgatcgtac ctttcttaca acgggcacaa tacctacaca ggctatccag 95388
tccacgtatt tggcaaaacc gacccttcca tttaaaccac tggtatagag acaaccggtt 95448
attccacgca gaaactcaag taacgatgac tgtaatgttt gacgccaggt ttcaaaaacc 95508
tgatgtgcaa gccgtacggc ttctgattct ccacatagcc cataacgttc cgctagagcc 95568
ccggcatgca ggttacattg ttggatgtgg tgttcccaat ctgctgctag gtcctcatac 95628
cgagttgcat ccaacgcgtt catcaaaacg gttgcctgaa cttggcgaat tacagtttcc 95688
gtagaccgta cagcgctata tatgccttgt ccatcggtat atccaaagtc accggctagg 95748
atttttcgaa acaacatact ttgcgtggtt gggtgtatta acatccagcc atcttcctcc 95808
ggaaatgtac aaaaccctat atccggggcg tactcattcc agtatatatc gaacatgttc 95868
ttgtattggt catttgggtt acttccattc aagccctggt caatagaaac agaacttgct 95928
atcctttttt cttcactacc ggaactgtta ttaaaaagag acgttatttc ggccattgaa 95988
aaccacg atg aaa aga tca att tct gta gac agt tct tca ccc aaa 96034
Met Lys Arg Ser Ile Ser Val Asp Ser Ser Ser Pro Lys
17435 17440 17445
aac gtt ttt aat cca gag acg ccc aat gga ttt gat gac agt gta 96079
Asn Val Phe Asn Pro Glu Thr Pro Asn Gly Phe Asp Asp Ser Val
17450 17455 17460
tat tta aac ttc acc tct atg cat agc att caa cct atc ctc tca 96124
Tyr Leu Asn Phe Thr Ser Met His Ser Ile Gln Pro Ile Leu Ser
17465 17470 17475
cgg att cga gaa ctt gcc gca att acg att cca aaa gaa cgt gtt 96169
Arg Ile Arg Glu Leu Ala Ala Ile Thr Ile Pro Lys Glu Arg Val
17480 17485 17490
ccg cgg ttg tgt tgg ttt aaa cag tta ctc gaa ctg caa gcg cct 96214
Pro Arg Leu Cys Trp Phe Lys Gln Leu Leu Glu Leu Gln Ala Pro
17495 17500 17505
cct gaa atg cag agg aat gag ctc ccc ttc tcc gtt tat tta att 96259
Pro Glu Met Gln Arg Asn Glu Leu Pro Phe Ser Val Tyr Leu Ile
17510 17515 17520
agc gga aat gcc ggc tcc gga aaa agc acg tgt atc caa acg ctt 96304
Ser Gly Asn Ala Gly Ser Gly Lys Ser Thr Cys Ile Gln Thr Leu
17525 17530 17535
aac gaa gct atc gat tgc att att acc gga tcc acc agg gtt gct 96349
Asn Glu Ala Ile Asp Cys Ile Ile Thr Gly Ser Thr Arg Val Ala
17540 17545 17550
gcc caa aat gtt cat gct aag tta tca acg gct tat gcg agt cgt 96394
Ala Gln Asn Val His Ala Lys Leu Ser Thr Ala Tyr Ala Ser Arg
17555 17560 17565
ccg ata aac aca atc ttt cat gaa ttt ggt ttt cgc gga aat cac 96439
Pro Ile Asn Thr Ile Phe His Glu Phe Gly Phe Arg Gly Asn His
17570 17575 17580
att cag gct cag ctg ggc cgt tac gca tat aac tgg act acg acc 96484
Ile Gln Ala Gln Leu Gly Arg Tyr Ala Tyr Asn Trp Thr Thr Thr
17585 17590 17595
ccc cct tct att gag gac ctg caa aaa aga gat att gta tac tac 96529
Pro Pro Ser Ile Glu Asp Leu Gln Lys Arg Asp Ile Val Tyr Tyr
17600 17605 17610
tgg gaa gtt tta att gat ata aca aaa cga gtg ttt caa atg ggg 96574
Trp Glu Val Leu Ile Asp Ile Thr Lys Arg Val Phe Gln Met Gly
17615 17620 17625
gac gac ggt cgc gga gga aca tcg aca ttt aaa acc ctg tgg gca 96619
Asp Asp Gly Arg Gly Gly Thr Ser Thr Phe Lys Thr Leu Trp Ala
17630 17635 17640
att gaa cgt ttg ctt aat aaa cct aca ggc tca atg tcc gga acc 96664
Ile Glu Arg Leu Leu Asn Lys Pro Thr Gly Ser Met Ser Gly Thr
17645 17650 17655
gcg ttt atc gca tgc ggt tcc ctt ccg gct ttt acc cgg agc aac 96709
Ala Phe Ile Ala Cys Gly Ser Leu Pro Ala Phe Thr Arg Ser Asn
17660 17665 17670
gtt att gtt att gat gaa gca gga ttg cta ggg cgt cat att ctc 96754
Val Ile Val Ile Asp Glu Ala Gly Leu Leu Gly Arg His Ile Leu
17675 17680 17685
acg gcc gtt gtt tac tgt tgg tgg ctt ttg aat gct ata tat caa 96799
Thr Ala Val Val Tyr Cys Trp Trp Leu Leu Asn Ala Ile Tyr Gln
17690 17695 17700
agc cct cag tac ata aac ggt cga aaa ccg gtc ata gta tgc gtc 96844
Ser Pro Gln Tyr Ile Asn Gly Arg Lys Pro Val Ile Val Cys Val
17705 17710 17715
ggt tcg ccc acc caa act gac tcg tta gaa tct cat ttt caa cat 96889
Gly Ser Pro Thr Gln Thr Asp Ser Leu Glu Ser His Phe Gln His
17720 17725 17730
gac atg cag cgt tca cac gta act cct agt gaa aat ata ctc acg 96934
Asp Met Gln Arg Ser His Val Thr Pro Ser Glu Asn Ile Leu Thr
17735 17740 17745
tat ata atc tgc aat caa act ctg cgt caa tat act aac atc tca 96979
Tyr Ile Ile Cys Asn Gln Thr Leu Arg Gln Tyr Thr Asn Ile Ser
17750 17755 17760
cat aac tgg gca atc ttt att aat aac aaa cga tgt caa gag gac 97024
His Asn Trp Ala Ile Phe Ile Asn Asn Lys Arg Cys Gln Glu Asp
17765 17770 17775
gat ttt gga aat ctt tta aaa acg ctt gag tac ggg cta cct att 97069
Asp Phe Gly Asn Leu Leu Lys Thr Leu Glu Tyr Gly Leu Pro Ile
17780 17785 17790
acc gaa gca cat gcg cgt ctg gtc gat aca ttt gtt gta cct gca 97114
Thr Glu Ala His Ala Arg Leu Val Asp Thr Phe Val Val Pro Ala
17795 17800 17805
tcc tat att aac aat cct gct aat ctt ccc gga tgg acg cgt ctg 97159
Ser Tyr Ile Asn Asn Pro Ala Asn Leu Pro Gly Trp Thr Arg Leu
17810 17815 17820
tat tcg tcg cat aag gag gtg agc gcg tat atg agt aag tta cac 97204
Tyr Ser Ser His Lys Glu Val Ser Ala Tyr Met Ser Lys Leu His
17825 17830 17835
gcg cat tta aaa cta tcg aaa aat gac cat ttt tct gtg ttt gcc 97249
Ala His Leu Lys Leu Ser Lys Asn Asp His Phe Ser Val Phe Ala
17840 17845 17850
tta ccg act tat aca ttc atc cgg cta acg gca ttt gat gaa tac 97294
Leu Pro Thr Tyr Thr Phe Ile Arg Leu Thr Ala Phe Asp Glu Tyr
17855 17860 17865
cgc aaa tta acg gga caa ccc gga ctt tct gtt gaa cat tgg ata 97339
Arg Lys Leu Thr Gly Gln Pro Gly Leu Ser Val Glu His Trp Ile
17870 17875 17880
cgg gca aac tcc ggt cgt ttg cac aat tat tcc caa agc cga gat 97384
Arg Ala Asn Ser Gly Arg Leu His Asn Tyr Ser Gln Ser Arg Asp
17885 17890 17895
cat gac atg gga aca gtt aaa tac gaa aca cat tca aat cgc gac 97429
His Asp Met Gly Thr Val Lys Tyr Glu Thr His Ser Asn Arg Asp
17900 17905 17910
tta att gta gcc cgt aca gac atc act tac gtg cta aat agt ctc 97474
Leu Ile Val Ala Arg Thr Asp Ile Thr Tyr Val Leu Asn Ser Leu
17915 17920 17925
gta gtt gta acc aca aga cta cgt aag tta gtt att gga ttc agt 97519
Val Val Val Thr Thr Arg Leu Arg Lys Leu Val Ile Gly Phe Ser
17930 17935 17940
ggt aca ttt caa tcg ttt gca aag gtt tta cgt gac gac tcc ttt 97564
Gly Thr Phe Gln Ser Phe Ala Lys Val Leu Arg Asp Asp Ser Phe
17945 17950 17955
gtg aag gct cga gga gag aca tcc atc gaa tat gct tac cgg ttt 97609
Val Lys Ala Arg Gly Glu Thr Ser Ile Glu Tyr Ala Tyr Arg Phe
17960 17965 17970
ctg tca aac cta atc ttt gga ggc ttg att aac ttt tac aat ttt 97654
Leu Ser Asn Leu Ile Phe Gly Gly Leu Ile Asn Phe Tyr Asn Phe
17975 17980 17985
ttg tta aat aaa aac cta cat ccc gat aag gta tcg tta gca tac 97699
Leu Leu Asn Lys Asn Leu His Pro Asp Lys Val Ser Leu Ala Tyr
17990 17995 18000
aaa cgg tta gct gcc tta acc ctg gag tta ttg tct gga aca aac 97744
Lys Arg Leu Ala Ala Leu Thr Leu Glu Leu Leu Ser Gly Thr Asn
18005 18010 18015
aaa gcc ccc tta cac gaa gca gcg gtt aat ggg gcg ggt gcc ggg 97789
Lys Ala Pro Leu His Glu Ala Ala Val Asn Gly Ala Gly Ala Gly
18020 18025 18030
att gac tgt gat ggt gca gct act tct gcc gat aaa gcc ttc tgc 97834
Ile Asp Cys Asp Gly Ala Ala Thr Ser Ala Asp Lys Ala Phe Cys
18035 18040 18045
ttt acc aaa gcc ccc gag tcc aaa gta acg gcc tcc ata ccc gaa 97879
Phe Thr Lys Ala Pro Glu Ser Lys Val Thr Ala Ser Ile Pro Glu
18050 18055 18060
gac ccg gat gat gta att ttt acg gca ctt aac gac gag gtt att 97924
Asp Pro Asp Asp Val Ile Phe Thr Ala Leu Asn Asp Glu Val Ile
18065 18070 18075
gac ttg gta tac tgc cag tac gaa ttt tcc tat ccc aaa tca tcc 97969
Asp Leu Val Tyr Cys Gln Tyr Glu Phe Ser Tyr Pro Lys Ser Ser
18080 18085 18090
aat gag gtc cat gct cag ttt ctg tta atg aaa gct att tac gat 98014
Asn Glu Val His Ala Gln Phe Leu Leu Met Lys Ala Ile Tyr Asp
18095 18100 18105
ggt cga tat gcc ata tta gca gag ctt ttc gaa agc agc ttt aca 98059
Gly Arg Tyr Ala Ile Leu Ala Glu Leu Phe Glu Ser Ser Phe Thr
18110 18115 18120
acc gcc ccc ttt agc gcg tat gtc gat aat gtt aat ttc aac gga 98104
Thr Ala Pro Phe Ser Ala Tyr Val Asp Asn Val Asn Phe Asn Gly
18125 18130 18135
agc gag ctt ttg atc ggc aat gtg cgg ggg ggg ctg tta tct ttg 98149
Ser Glu Leu Leu Ile Gly Asn Val Arg Gly Gly Leu Leu Ser Leu
18140 18145 18150
gca tta caa aca gat acg tat acc ctt ttg ggg tat act ttt gca 98194
Ala Leu Gln Thr Asp Thr Tyr Thr Leu Leu Gly Tyr Thr Phe Ala
18155 18160 18165
ccc gtg cca gtc ttt gta gag gaa ctg acc cga aaa aag ctg tac 98239
Pro Val Pro Val Phe Val Glu Glu Leu Thr Arg Lys Lys Leu Tyr
18170 18175 18180
cgc gaa act acc gaa atg tta tat gct cta cac gta cct ctt atg 98284
Arg Glu Thr Thr Glu Met Leu Tyr Ala Leu His Val Pro Leu Met
18185 18190 18195
gtc tta cag gat caa cat ggg ttt gtg tcc atc gta aac gct aac 98329
Val Leu Gln Asp Gln His Gly Phe Val Ser Ile Val Asn Ala Asn
18200 18205 18210
gta tgt gaa ttt acc gag tct ata gag gat gca gaa ttg gca atg 98374
Val Cys Glu Phe Thr Glu Ser Ile Glu Asp Ala Glu Leu Ala Met
18215 18220 18225
gcc acc acg gtg gac tat ggc ctt agt tct aaa cta gcc atg aca 98419
Ala Thr Thr Val Asp Tyr Gly Leu Ser Ser Lys Leu Ala Met Thr
18230 18235 18240
att gca cgc tca cag ggt ctg agt tta gag aag gta gct atc tgt 98464
Ile Ala Arg Ser Gln Gly Leu Ser Leu Glu Lys Val Ala Ile Cys
18245 18250 18255
ttt acg gcg gat aaa ctg cgc cta aat agt gtg tat gtt gcc atg 98509
Phe Thr Ala Asp Lys Leu Arg Leu Asn Ser Val Tyr Val Ala Met
18260 18265 18270
tcg cgt acg gtc tcc tct agg ttc tta aaa atg aat cta aac cct 98554
Ser Arg Thr Val Ser Ser Arg Phe Leu Lys Met Asn Leu Asn Pro
18275 18280 18285
cta cgg gaa cga tat gaa aaa tcc gca gaa att agc gat cac att 98599
Leu Arg Glu Arg Tyr Glu Lys Ser Ala Glu Ile Ser Asp His Ile
18290 18295 18300
ctt gcc gct cta cgt gat ccc aac gta cac gtt gtg tat taa 98641
Leu Ala Ala Leu Arg Asp Pro Asn Val His Val Val Tyr
18305 18310
agcattgtat aaaaacacgc atgcgggctt gctgttctca tttctaggtt ttgtcttaaa 98701
tacacccgcc atgagcatct ctggaccccc aacgacgttt attttatata ggttacatgg 98761
ggttaggcgg gttcttcact ggactttacc ggatcatgaa caaacactct acgcatttac 98821
gggtgggtca agatcaatgg cggtgaagac ggacgctcga tgtgatacaa tgagcggtgg 98881
tatgatcgtc cttcaacaca cccatacagt gaccctgcta accatagact gttctactga 98941
cttttcatca tacgcattta cgcaccggga tttccactta caggacaaac cccacgcaac 99001
atttgcgatg ccgtttatgt cctgggtcgg ttctgaccca acatctcagc tgtacagtaa 99061
tgtggggggg gtactatccg taataacgga agatgaccta tccatgtgta tctcaattgt 99121
tatatacggt ttacgggtaa acagacctga cgatcagacc acaccaacac caaccccgca 99181
ccagtataca tcgcaaaggc ggcagcctga aaccaactgt ccttcttcac cacaaccggc 99241
ctttttcaca tcagacgacg acgttctttc gttaatatta cgggacgccg caaacgcgta 99301
aagacagatt caagactaac atttatccca actgattaca tttcatacgc gaataaacga 99361
cacaaaaaat ttatatttaa cggcttttaa tttgaagaca cctatcctct taacgttgat 99421
gagccttgca ggttgggtgc cgcgcttcac cggtattata cataaccgat ttaccgtgtt 99481
tacggcagtc tgaccattta ccagtgtatg tctgtaatac gacgttgttg tgtcccgaca 99541
aaattaactc gcgtacaaat ttctgatgtt cccccggcgt ggcaacgctg gcatttccaa 99601
acacattacg ttctcgtacg tccatgaccg ctattttcag tattaattgg ttggtcggtc 99661
aaagtatttt ccttatgtaa aaggacacga tctaaagccg taaactcata cacaaacact 99721
ggtaccaacg gacgcgattt tccgtccgtt gagcgggtgt aatatcggcg aggtcttctt 99781
gcacgaatac tctcgtacag taggtttctg acacggggtg catgggtttt ttgacacaac 99841
acaaacattt gcaggctctt atgactggat ggattgaatt tatttttaga tagggtcacg 99901
tgtttttgtc gtgacacgcc tcgaccagaa aaggctgcgg ttttcgtaca cgcgaccgtt 99961
atttcacagg cgttcataac caagctgcgg cggatggtgt cggttaattg tctccgccca 100021
agttcgtcaa tagatgatac catgaacaac gtatcaaatg gtacatagtc gtctttggtt 100081
ttctcaatac agcccgcgtg cccaatcgga aatttttcat ttgcatcaac gctattttct 100141
gtaaaatcgt tctgaacact gtgttggctg gctacctgtt taaaatttgg gatcgaacac 100201
ggtccacgat gcaatcccca accccattga agcaatgccg tcggtacgga aggaggcaac 100261
tccgaaaaca ttatggtacg caagagggtc gattggagtg ttatataaca ctccaatcga 100321
tctcgggttc gcctttacgc gtaaaatact cattggcttg aacgaaatgt cgacaattcc 100381
gaaatggaac acgggacaat ggcgacggat gcgcgtgtgt tagcaccaga tgacatcttg 100441
aattcggttg ggttgtcttc tgtgcatgcg caccccacag cataaaaact aaccctgtac 100501
ggttctcgca taacctctgt agcacgcgtt gcaccagccg cccccagcct aagtatacat 100561
gcgaccccgg agtcccgcga cgaaccgtaa gcgtggtatt cagcaataac accccctgcc 100621
ttgcccaact ctccaggcat ccgtgagtgg gcggagtcat atttgggtat gattccatga 100681
gggccgcaaa aatattttta agactagacg gtggtgttat gccacgtttt acactaaacg 100741
ctagcccatg tgcatgtccc gcggtagggt atggatcttg accaataatt acaacgcgaa 100801
tgctctgggg tccgcaaaat cgcgtccatg caaaaatatc gcctgtagat ggaagtattt 100861
cttcccctga atttaaaaga cgattgtatt ctaaaaaaat acctttcgcg tacggctctt 100921
taagttcgtc cgacaacagg tcataccact caggggaaat gttaaacttg ctgaaaactt 100981
caaccgaatc cagttgcgaa gagacggggg tgaacgtttc cgtgtcgtaa tgatgtgaca 101041
tgttatttaa cttgaaggtt ggggggtcta gcttaacccc caaaggcagc ccgcggggtc 101101
gcttgcgggt ttttttggta accggatggg ccaaaacata aatgtccttt gaatccgata 101161
gtttcatttc attggcatac gcgttggaac aaacggtcgg ctccccagac acatccattt 101221
tccgggatat ttgtggaaga tggagtagag tctacccata caccggaaag ggcatccaac 101281
aaagcatcgc gtatgtcccc gcttttatgt tcttcaccaa cagattgtgc cagccccttt 101341
aaggtgacgt atggatttgt ccagtacgcc atttgtttgt ctttaaacca aagtataact 101401
tccggtactg gacattttgt cttaaccacg attcccgata gcgcctcgct gaggtttgat 101461
accgggggtg ccgcatagtc ccacgcctca tataccgatg acacgcacgg ttccgttata 101521
atcaaactca catccgatag cggtttggct ccaaaaaaca acggagtgtc gtcttggaga 101581
tgaagacaat acgcgattgt gatagttttt aaaaaaacta tctgcagtaa ccatttatgt 101641
gatgccatga cgcttgtgtt ttcccttcac tacgacgttg tcgtatcctt tgaaaaactt 101701
gaccactcta atggaagcat ggacaagtat gagttttata tatacagttg gcctttagtt 101761
aaactcttgg tgtcatatct cattttccta aaaagggcga tcttaatatg tcaaacgtca 101821
cggcgtgccg acaaagcgaa tttccatgca agatttggat gtagtattta tacacccaat 101881
cacatgtcac gtattaagct ttacagtccc ccgttatctg atataatcac ttttcttaac 101941
acgtcatcgg gaaaacagat gtttatatta tacctctcgc ggtcatttac ggcaaatact 102001
tagaccgttt tcaagcggac tgaaaacgct caaattgcct tttggaggcc tgcccaacgg 102061
ccattatccc ttggatctaa gattgatttg cggtaacgtt tgccaatcaa gctttaaaaa 102121
cgtaccccaa acttaaaacg ctcaaattgc cttttggagg cctgcccaac ggccattatc 102181
ccttggatct gagattgatt tacggtaacg tttgccaaac ccacgcattt cagtttaaat 102241
atttctaagc attcttagtg cgtacttggc agcgtgctta aaatatcaac caatatccat 102301
tatgctacac gtttccttct atccgtttca atccattaaa agtccattaa caaaaatgat 102361
gcatcatacc taattcacct aaaaacctga ctcattgcag cagcgtttcc tccttgcaga 102421
ctatccagtt ggcattttaa acgggtccgg ctgcctaaac cgaaaacacc gttgccttta 102481
ctgtaagtac aaaactaaaa tttatatttg cgtgcgtatt ttgtaacata tatgcctttt 102541
atccccccgc aagtttgctt taccctcgcc ttcaccaccc ccgccacctt ccggccattt 102601
taataacttt aattgctata agacataccc aaaccggatg atttttgccg ctggaaaaac 102661
agcttctaat tttcccgtct caactcggcc ttggttgcat ctccaagtat acctttagtt 102721
tgctcccgta gaggtgtata aatacaaacg gtgacaagta ttgagcgtaa tctcaaattt 102781
ttgtaattta gggcggagcg cttacgacag cacatgcgta ctgttagact gttatgttta 102841
ttgtatttgc agagcaggat gccccggtta ctccgagacc ggattgcggg cattccgaat 102901
cgtgtacgga cttaccaggg ggcagtattt acaccttggg ttccagatat accaaccctt 102961
acgaccaata gcaacactca ggtattttta aaatgcacgt ttaatgatca taatttacat 103021
acagttggta ataaagcaga ctgtggatgt ttaaggcatt tccttccccc tcccaacaaa 103081
ctaggacttc ttcatcttgt ttggaatacc tttacccgct ttaccggcag agcttttttt 103141
ggtaaggtgt ttcagtgaac ctgatgttga tccggaggtg gagggggtat tggactcccc 103201
ctgtggagag gcaactttgc gggttttact tcccttacat gccgaatcag actcagatgt 103261
caggtctatt gttaagcatc gtttaacgtc tctgccggta tgaaataaac ggcgcttagc 103321
accccttgcg cttcccggtt taatccccgg taacacagaa aaaagcctga ctttttgggg 103381
tgtatttacc aatcgggtat ccctttcatc gccacgagag gtctccccgg ttgaggtggt 103441
ttctggtctt acaattggac ctgtaattag ttggatggct gtatctttcc aggtccaggt 103501
ttgcatggtt aggcgggttg gatcggtaca tcgatccaac aagaataaca tgtttgttac 103561
aaacggtcct gttgaatcat gcaaaagaca acgcagggat gtttttaatc ccgcctcatc 103621
acgcccgtaa atacctatat agtttaatat caacattttt gtaggctcta caatttcggg 103681
ttgatacagt tccgcaagtt gatcatcaag ccatccgagt aaaggttgca tgtaacacgg 103741
gaatctcgcg tttccctctg ttcctctatc cgtggctcga aaaggcagtc tgtccatggt 103801
tcgtgggtct tgattaattc ccacagatac tggacgatca cggtagtcct gccccccggt 103861
ccggggttgc tgtgcagatt caatcgagcc atacaccacc ggggtcgccg atcgaacagc 103921
aggttggtct ttaaaaaata ccttccgtaa aaatgatgcg gtagagcatg ttttggttac 103981
accagggctc gagtctcggg tcggtggttg tatagaatcc tgttgagagt cacttggtga 104041
ctctgctgtg ggctctctag ccgacgattg aaggggccca gggtttggtg attgaatggg 104101
ctcccgactc gatcttgatg ttggctgttg gatggactcc cgactcggtc ctgggcttgg 104161
tggcagaaga tctatgacat ctcccggtag gatgtcgatg gaatcttcaa atgacggctc 104221
agaaaaacca tcgtcgtcgg atgggtgcac ttcatattcc ttgtaacttg tatcacttac 104281
gatcttatgc aggatggatt gcactggaca ccggcagaga ggacactgga cgctggtgga 104341
ggtccatgcc cgaatacaaa caaagcagaa gtcgtgcaaa cacggcatgg tttttccgag 104401
atcggaaacg gtgctcatgc atatggtgca ggtattatcc gaagcgtcgg aggtgccgct 104461
accgcccgct aatatggtat ccatggtaac aactggctgt attctaatgt ccgggcatcc 104521
aaacacgtag cagaactgcc atgcgttcta aattgtgagt tgtggcgagt acatttttat 104581
aattggtacc aacgaagaca cacccctata tccctccacc catttctttt aagtcccacc 104641
cactaaaacg tgggtataaa atgtgtattg gggtaggcgg acagtcccaa caaacaggga 104701
agttgattgg tataaccttg ggccgggtat acagctaagt gacattttag attctgtctt 104761
tatttagata aagagcgata cgaagacatt tctccacccc cctgtaatac ccgtaaataa 104821
aggtaagtcc acaaacaaaa gcactgtata taggaagtcg ggtgtattgg gacagttact 104881
ccattagagg cgtacaaaca atactgggat agggtaatgc aagtcccccc cgatggtcgc 104941
cccgcaaacg cgcggggagg tggggtcgct tttttttttc tctctcgagg gggccgcgag 105001
agggctggcc tcctctcccg gggtccgccg ggcgcccaga aaccgggggg gggttatttt 105061
cggggggggg tccgaccagc ccgcccgtcg cccgcccgca cagacagaca gacacttttt 105121
tcataaaaac cgttccgctt ttattaacaa caaacagtcc gcgcgccagt ggcgctcacg 105181
agaaaaggag gggactccgt cacccccgac tctgcggggg gctcctcccc ccgcgccctc 105241
cccacacatc gtcctcgtcc tcggaggacg aggacgagga caacagctcc accttgaccg 105301
ccgggcgcaa acccacccgg cggtctcgca gcacacccgg ggccaccgac acgatgctca 105361
ccccaaagga tgaccccggt gcgtccccgt cgtccccgcc cccctcctcg ctgtcccacg 105421
cgtcttcaca ccccacctcc caatcgtcca gctccaaagc gtgttctctg tcgtctgcgg 105481
tgcgccgctg tcgccccgcc tgggtttctg acggccgttc cgagcccccg tggtgtccga 105541
acacgaaccg tgttccgtcg ctcccctcca acaccgtctc cgcggcccca aaaccgggcg 105601
gccacattac tctgggaatc ggggggaggg cattccgagc ctcgtccgcc gacgcataca 105661
gcgccaccga ccgaccggcc acgggtggaa gcacgagtgg ttctgcggca gggtcgggtt 105721
ccagcagggc gtggcggcaa aacaccctcg cccaggtggg tacgtcgccg gcctccggcc 105781
cggcggcccc cggtctccgt ccctcgggaa ggaagacggg tcgaagcgcg gcacccaggc 105841
cccatcggtt tgctgcgcgg tggctatgtg ccgcctcgtc cacaaagtcg gctgccccga 105901
gccccagacc ccgagactgt cgcgcgaggt ccttgcaacc gtcaaaaccc ggcagcacgt 105961
actgccggta ttcacggggc gacaggggga cgcgggtctt ggggcccgcg cgggtacaca 106021
cggtgtatgc gacgttccca ccgcggcaca aacacagggg ttgttcgccc gggtacaggt 106081
tggcaaacgc agtctcgata cgagcaaaac tcgctggccc aaaggtgcgc gacgatgcaa 106141
acacggcccg ggcgagtcct tctgtgaccg ccgagtctgg ccatcggacg acggcctggg 106201
cgtccggtcg cgccggggcc cggacgtaca cgtgatactg agacaaagcg ggtccatccc 106261
tgggccacct ctcgagggcc accgcgtcca acaccagcaa ccggcgccgg gcagaggcca 106321
accgcgagcc tagatactcg acggccccgg caaaggccag gtctcgggtc gacagtaata 106381
aaacgccccg ggcgttcaaa gcggacacgt ccggcgggcc ggtccagttc ccggcccagg 106441
catgagtgct cggcaggcac aaccggttac tcagggctgc caggaccaca gacagtcccc 106501
ctcgggatgg actccatgac ggtcccggat ctgtcgcgag ggtgctctcg agggggccgt 106561
tgatgtcctc tccgggcaac ggatcgtaga tgatcagaag cctcacatcc tccgggtctg 106621
ggatctgccg catccaggcg cacctccgtc gcagcgcctc cactccgctg ggtggaccaa 106681
accgtcggtc tcctccgccc ggacgccgag cggcgatttc cgccaaggcg ccgggatcaa 106741
agcttagcgc agggcgccag gccgtgggaa acaatgggtc gtcgaccaga cgggcgatgg 106801
tttcgggggt acagtacgcc ttgcgagcct ggtccgacgg gaccggggta tgcagggccc 106861
cccggggaat acgccgaaat cccccgtttg gggccggtcc gtcaagtggc atcgttatta 106921
cggcgggggg atccaccaca gggcccgagg tgatggtcac gggctcggat acccgcctct 106981
tggccttgga aaccacatga tcgtctgcaa cccgggcgtc cgcgacgggt gtctccctaa 107041
tcttgtcgag gaggcttctg ctctcgactg gctgggactt gcgcttgcgc ggagttcgta 107101
aacgatcatc cggtggacac acagaaagag agcgtgcggc ggccgacggc tgagggtcgg 107161
gagcctgtgt ggccggggtt gttggagaag ggtgaccgcg ggagatccgc gccgccggac 107221
tggagcccgt tgcctcgggg tatgccatgc tggcaaaggc tctgcggaga ctctgtagga 107281
taaagtgttt ttgggcccgg tcgtatcgac ggctcatagc cacggccgcg gccgcgtggg 107341
ggagagccca gagggcctcc cccgtggcca tggcttcgcc tacatgcgga acgggagacg 107401
ctacgctccc cgtaacggcg gtacccgccc gtcccggtgg caacagcttt tggtagaact 107461
ggttcagggc cgagttgaca ccggtcagct tggggttctg gagccatgct atagggtctc 107521
tgtctggaca gtagatcagg ttaatcagcg cgcggtactg tctagccgga tctcccaact 107581
ccggcacgta aagcggcacg ggttccgttg aggcctcgta acgagcccgc gccgctctca 107641
cagcctcatc ctcccagtga ccctctctgg tctccccgga cggtccaaac cgcaccctgt 107701
tggatgggag gggtgccgat ccgggccaag ggcttccgtc gggcatcatg agcggccccg 107761
acaccggggg aattatcggg gttctggatc gcggcaggga aaatgatttc tgtctctggc 107821
gccccggttc ccccgcaaga cgtttggtct tacgaatcct cggatcggga ccgctgatgg 107881
atcgatatcc cggttggata ttttgtttcg tcgacccacc atcatttgag tccgaatcat 107941
ccgaatttga cggggaaggg gcgtgttcgc gtccggacct gctgcctgta gtttcacttc 108001
ccaccgaaac gcgccggggt tcatcgtctt catcctccga tgacgatccc cacgacgagg 108061
aagaggatga agacgaaaca aactcacgac tctttggctt tttctccact gggctgtcat 108121
cctcaatcgg gtctggtgcg tgggatcttc ccggcagggc caaaaacgct ctaggtttgc 108181
cccccgacga acgtccaggg acgcgaggtg ttataccccg ggcatcatgt ttccttgggc 108241
gggtatcatc ggtctcaaac ggcaggtccg cctttgcccc cttagcggga acgctgtccg 108301
aaaggacgtg gtacaattgc tcaaccgggc cgggtacagg tccaccgggt ttccgcgccg 108361
ggagtgggac cttaaccttc aaagtctttt tcttcgggct ctttccctga gcgggccgtt 108421
gagttttctg gagaactact ccgtcccccg atgcatgcgc atgacccgct tgctcatcgc 108481
ccggcttttt acccgagatg gactgagttt gtctgtctcg atggaccacc gacggcaaac 108541
ctggtgaatt tcctctcgtc gtttgtcggg gtatagaccg ctggtcttcc cgttgatcgt 108601
tcccggcggc gtctccaaca ggagacgcgg gggatacagg ggagaaggcc tgcgggaacg 108661
gaggggtcgt acctctgccc gtttccccat cgttcatcgg tggttttgga gacctagcaa 108721
gcttcgttcc gagagagact gtctcaaggg agcgatcggc tcctgttggt tctcgcgcgc 108781
cggcctccga gaatcgggtg tggaagacct cggccagcgg gattacaggc gagcccatta 108841
gatcctgacc gtcctcgcat acgtagtcgt cttgtgttag ctcttcgcca acatcttccg 108901
ttctgggttc tggttgaagt cccgatacgg agggaattga aacgatctcg tgttcccgtc 108961
ccaccatgac cccgttctct ccaaatagta gatcgtcagg ctgactcgag gtgaccaccc 109021
gggccctgtg ttcggcggcc gccgcggccg cgtccaacag gtccattaac tccaaagtat 109081
caggcgaccc cgcgcgttgg ggtgtagagc gctgcatcgg cggcgtatcc atcgcactgg 109141
ggtgaattta gacgtacccg agttttccaa acgctctcgc agccttcaaa ggattgcgat 109201
tgcggttggt gagggagttc caacagtact taaaacgtgt tgtgcccccc cctcgaccgc 109261
atatttcctc cccgtgtcgt caccgtgtaa atattcttaa tgataagacg atgtagtgat 109321
tggacgagac tcgaggcggg aagttcatgg accatagtat gcgtttaagg agagaccgct 109381
ggttggcgat gtacgcccgg tgtctatttc cgcatacctt acaacatcat aacaagggat 109441
accagacatg tgaatttcat ttacatatgt ttaaataaca accaatcatc gtgtgtctac 109501
agacgatata taatatacat aaacacaatt ggggttgtct cacatgcaaa acatcttata 109561
taacacgggt tgtttccacc catccggcat ctagttaatc aaatgcacgt cgacggtgtg 109621
tttgggtccc tctccgtcgt cattacgttc gcgcaatcaa caagcgtata caccaccacc 109681
cctcccaacg attatgtcag gcggcacgaa gcccgcgata acccataaaa tacacacggg 109741
gttgtggtgt tcacgtaacc ccccgccgat ggggaggggg cgcggtaccc cgccgatggg 109801
gagggggcgc ggtaccccgc cgatggggag ggggcgcggt accccgccga tggggagggg 109861
gcgcggtacc ccgccgatgg ggagggggcg cggtaccccg ccgatgttta taaccataat 109921
tctctaaacc gttgtagaaa atcacaaaaa aatttattca aaaacaagtc gaagaacttc 109981
atatctgagg catgtaaacc cgttcgcact tcctggggtg gaatggggtg gggtgggggg 110041
gtgaaaaagg gggggggtta aattgggcgt ccgcatgtct gtggtgtacg ccaatcggat 110101
acactctttt gatctgcatt cgcacttccc gttttttcac tgtatgggtt ttcatgtttt 110161
ggcatgtgtc caaccaccgt tcgcactttc tttctatata tatatatata tatatatata 110221
tatatagaga aagagagaga gtttcttgtt cgcgcgtgtt cccgcgatgt cgcggtttta 110281
tggggtgtgg gcgggctttt cacagaatat atatattcca aatggagcgg caggcttttt 110341
aaaatcgatt tgacgtgata aaaaaaaaca cacggggccc cccccttttt ttggtgttat 110401
aaaggcaacc caatcgaagg tctcccgccc cggaatcccc cattgccatt ttacccaagt 110461
agccttattc atagatgtaa acgtttgggt gtgtgttttg ttgtgcaggg ttcgtccgat 110521
tcataacgcg acagcgtcga gtcggtttta agggaaaagg ttactacggc cccaaggac 110580
atg ttt tgc acc tca ccg gct acg cgg ggc gac tcg tcc gag tca 110625
Met Phe Cys Thr Ser Pro Ala Thr Arg Gly Asp Ser Ser Glu Ser
18315 18320 18325
aaa ccc ggg gca tcg gtt gat gtt aac gga aag atg gaa tat gga 110670
Lys Pro Gly Ala Ser Val Asp Val Asn Gly Lys Met Glu Tyr Gly
18330 18335 18340
tct gca cca gga ccc ctg aac ggc cgg gat acg tcg cgg ggc ccc 110715
Ser Ala Pro Gly Pro Leu Asn Gly Arg Asp Thr Ser Arg Gly Pro
18345 18350 18355
ggc gcg ttt tgt act ccg ggt tgg gag atc cac ccg gcc agg ctc 110760
Gly Ala Phe Cys Thr Pro Gly Trp Glu Ile His Pro Ala Arg Leu
18360 18365 18370
gtt gag gac atc aac cgt gtt ttt tta tgt att gca cag tcg tcg 110805
Val Glu Asp Ile Asn Arg Val Phe Leu Cys Ile Ala Gln Ser Ser
18375 18380 18385
gga cgc gtc acg cga gat tca cga aga ttg cgg cgc ata tgc ctc 110850
Gly Arg Val Thr Arg Asp Ser Arg Arg Leu Arg Arg Ile Cys Leu
18390 18395 18400
gac ttt tat cta atg ggt cgc acc aga cag cgt ccc acg tta gcg 110895
Asp Phe Tyr Leu Met Gly Arg Thr Arg Gln Arg Pro Thr Leu Ala
18405 18410 18415
tgc tgg gag gaa ttg tta cag ctt caa ccc acc cag acg cag tgc 110940
Cys Trp Glu Glu Leu Leu Gln Leu Gln Pro Thr Gln Thr Gln Cys
18420 18425 18430
tta cgc gct act tta atg gaa gtg tcc cat cga ccc cct cgg ggg 110985
Leu Arg Ala Thr Leu Met Glu Val Ser His Arg Pro Pro Arg Gly
18435 18440 18445
gaa gac ggg ttc att gag gcg ccg aat gtt cct ttg cat agg agc 111030
Glu Asp Gly Phe Ile Glu Ala Pro Asn Val Pro Leu His Arg Ser
18450 18455 18460
gca ctg gaa tgt gac gta tct gat gat ggt ggt gaa gac gat agc 111075
Ala Leu Glu Cys Asp Val Ser Asp Asp Gly Gly Glu Asp Asp Ser
18465 18470 18475
gac gat gat ggg tct acg cca tcg gat gta att gaa ttt cgg gat 111120
Asp Asp Asp Gly Ser Thr Pro Ser Asp Val Ile Glu Phe Arg Asp
18480 18485 18490
tcc gac gcg gaa tca tcg gac ggg gaa gac ttt ata gtg gaa gaa 111165
Ser Asp Ala Glu Ser Ser Asp Gly Glu Asp Phe Ile Val Glu Glu
18495 18500 18505
gaa tca gag gag agc acc gat tct tgt gaa cca gac ggg gta ccc 111210
Glu Ser Glu Glu Ser Thr Asp Ser Cys Glu Pro Asp Gly Val Pro
18510 18515 18520
ggc gat tgt tat cga gac ggg gat ggg tgc aac acc ccg tcc cca 111255
Gly Asp Cys Tyr Arg Asp Gly Asp Gly Cys Asn Thr Pro Ser Pro
18525 18530 18535
aag aga ccc cag cgt gcc atc gag cga tac gcg ggt gca gaa acc 111300
Lys Arg Pro Gln Arg Ala Ile Glu Arg Tyr Ala Gly Ala Glu Thr
18540 18545 18550
gcg gaa tat aca gcc gcg aaa gcg ctc acc gcg ttg ggc gag ggg 111345
Ala Glu Tyr Thr Ala Ala Lys Ala Leu Thr Ala Leu Gly Glu Gly
18555 18560 18565
ggt gta gat tgg aag cga cgt cga cac gaa gcc ccg cgc cgg cat 111390
Gly Val Asp Trp Lys Arg Arg Arg His Glu Ala Pro Arg Arg His
18570 18575 18580
gat ata ccg ccc ccc cat ggc gtg tag tctttataaa taaatacaat 111437
Asp Ile Pro Pro Pro His Gly Val
18585 18590
ggtttggctc gtgtcttttt ttgatgtctg tctgtggggg agtggggtgt tgtggatatt 111497
agagggtaga gggtgctggt ttgaacgtct ccattaaccc acggggtccc cacacgggcc 111557
gtgtggt atg aat ctc tgc gga tcc cgc ggt gag cac ccg ggc ggt 111603
Met Asn Leu Cys Gly Ser Arg Gly Glu His Pro Gly Gly
18595 18600
gaa tat gcc gga ctt tac tgc aca cga cac gat acc ccc gcg cac 111648
Glu Tyr Ala Gly Leu Tyr Cys Thr Arg His Asp Thr Pro Ala His
18605 18610 18615
cag gct ctc atg aac gac gcc gaa cgg tac ttc gcc gcc gcg cta 111693
Gln Ala Leu Met Asn Asp Ala Glu Arg Tyr Phe Ala Ala Ala Leu
18620 18625 18630
tgc gcc ata tct acc gag gcc tac gag gct ttt ata cac agc ccc 111738
Cys Ala Ile Ser Thr Glu Ala Tyr Glu Ala Phe Ile His Ser Pro
18635 18640 18645
tcc gag aga ccg tgc gcg agt ttg tgg ggg agg gca aag gac gcc 111783
Ser Glu Arg Pro Cys Ala Ser Leu Trp Gly Arg Ala Lys Asp Ala
18650 18655 18660
ttc gga cgg atg tgc ggg gag ctc gca gcg gat aga caa cgt cca 111828
Phe Gly Arg Met Cys Gly Glu Leu Ala Ala Asp Arg Gln Arg Pro
18665 18670 18675
ccc tcg gtt ccg ccg atc cgc aga gcg gtg tta tcg tta tta cgc 111873
Pro Ser Val Pro Pro Ile Arg Arg Ala Val Leu Ser Leu Leu Arg
18680 18685 18690
gag caa tgc atg ccg gat cca caa tcg cat ctg gag ctc agc gag 111918
Glu Gln Cys Met Pro Asp Pro Gln Ser His Leu Glu Leu Ser Glu
18695 18700 18705
cgg ctg ata ttg atg gca tat tgg tgc tgt ttg gga cac gcc gga 111963
Arg Leu Ile Leu Met Ala Tyr Trp Cys Cys Leu Gly His Ala Gly
18710 18715 18720
ctt ccg act att gga ttg tcg ccc gat aat aaa tgc atc cgc gcc 112008
Leu Pro Thr Ile Gly Leu Ser Pro Asp Asn Lys Cys Ile Arg Ala
18725 18730 18735
gaa tta tat gac cgc ccc ggg gga att tgt cac agg ctt ttt gac 112053
Glu Leu Tyr Asp Arg Pro Gly Gly Ile Cys His Arg Leu Phe Asp
18740 18745 18750
gcg tac ctg ggc tgc ggg tcc ctt gga gtc cca aga acc tac gag 112098
Ala Tyr Leu Gly Cys Gly Ser Leu Gly Val Pro Arg Thr Tyr Glu
18755 18760 18765
aga tcc tga caccccatcc ctttatatag aaaaaaaaaa taaatttaaa 112147
Arg Ser
18770
acatacaccg gataaaagcg tactgttttt tatttaaatt tacacgctcg gcgttgcccc 112207
ggttcggtga tcaccgggtc ttatctatat acaccgtgta actcgaaccc ccgtgactcc 112267
ctccaatcgc gttaccaaac tcttcttccg tatccgtaga ttccgagtcc tcgaaatcgt 112327
ccacttatcc aacaaattgt gacgttatat atcccaaggc aaaggccgct cccgtcatag 112387
caaatacaaa gacaattatt agcgtaatat aacagaattt tttacgatga tatattttat 112447
gttgatattt tccaattcga cgcaaaaatt catctgccgt ttcattttcg ctatcactat 112507
aataacactt ttcagccgaa cggctcggtt gtatggctgt tatcgttgta ttatttggtt 112567
gcgctcgcgg ggttaccacc gcttccatca gtaaggccac ggcctcaccc tccatggtgt 112627
tttgtccggc catagaaatc cagattgtaa ggccagcagg ctagtttaaa agtgtttaat 112687
accacacctt ttgatattta tatacatgca agattctaga ttattcatca ataggtcgtt 112747
taaagcgcgt tttcataaac gttgtcagct ataccgacat tctcacaaag aggtaaagtt 112807
accttacgtt attattaaat aaaacatgta gacattatta ataatcctag gaacaatcaa 112867
atccatattt gtaagttatg tttaacccct cccctttttg tcattatctc cgccctctta 112927
taatcggatc actttataag tgtgtcggtg agtatatttt gtacagttgt tggacaacag 112987
gtttttggtt cattaacact atcaacataa gtcggggtat acaagtata atg aac gac 113045
Met Asn Asp
gtt gat gca aca gac acc ttt gtt gga caa gga aag ttc cgt ggc 113090
Val Asp Ala Thr Asp Thr Phe Val Gly Gln Gly Lys Phe Arg Gly
18775 18780 18785
gcc atc tca aca tca ccg tca cat att atg caa aca tgt ggg ttt 113135
Ala Ile Ser Thr Ser Pro Ser His Ile Met Gln Thr Cys Gly Phe
18790 18795 18800
ata caa cag atg ttt cca gtt gaa atg tcg ccc ggc ata gaa tct 113180
Ile Gln Gln Met Phe Pro Val Glu Met Ser Pro Gly Ile Glu Ser
18805 18810 18815
gag gat gat ccc aat tat gac gtt aac atg gat ata cag tct ttt 113225
Glu Asp Asp Pro Asn Tyr Asp Val Asn Met Asp Ile Gln Ser Phe
18820 18825 18830
aat ata ttt gat ggt gta cac gaa act gaa gcc gaa gcc tct gtg 113270
Asn Ile Phe Asp Gly Val His Glu Thr Glu Ala Glu Ala Ser Val
18835 18840 18845
gca ttg tgc gca gaa gca cgc gtt gga att aat aaa gcg gga ttt 113315
Ala Leu Cys Ala Glu Ala Arg Val Gly Ile Asn Lys Ala Gly Phe
18850 18855 18860
gta ata tta aaa acg ttt aca cca ggg gcg gaa ggt ttt gcg ttt 113360
Val Ile Leu Lys Thr Phe Thr Pro Gly Ala Glu Gly Phe Ala Phe
18865 18870 18875
gcg tgt atg gac agt aaa aca tgt gaa cat gtg gtc att aaa gcg 113405
Ala Cys Met Asp Ser Lys Thr Cys Glu His Val Val Ile Lys Ala
18880 18885 18890
ggt caa cgt caa gga acg gcc acc gag gca acc gtg tta aga gcg 113450
Gly Gln Arg Gln Gly Thr Ala Thr Glu Ala Thr Val Leu Arg Ala
18895 18900 18905
tta acc cac cca tcc gtt gta cag ctt aaa gga acg ttt acg tat 113495
Leu Thr His Pro Ser Val Val Gln Leu Lys Gly Thr Phe Thr Tyr
18910 18915 18920
aac aaa atg aca tgt ctt ata tta cca cgt tac cga aca gat tta 113540
Asn Lys Met Thr Cys Leu Ile Leu Pro Arg Tyr Arg Thr Asp Leu
18925 18930 18935
tac tgc tat cta gct gca aag cgc aac ctc ccc ata tgt gac att 113585
Tyr Cys Tyr Leu Ala Ala Lys Arg Asn Leu Pro Ile Cys Asp Ile
18940 18945 18950
tta gca att cag cga tct gta tta cgc gcg tta cag tat ctt cat 113630
Leu Ala Ile Gln Arg Ser Val Leu Arg Ala Leu Gln Tyr Leu His
18955 18960 18965
aat aac agt att att cac cgt gat ata aaa tct gaa aat ata ttt 113675
Asn Asn Ser Ile Ile His Arg Asp Ile Lys Ser Glu AsnIle Phe
18970 18975 18980
att aac cac cca ggt gat gtt tgt gtg gga gac ttt gga gca gcg 113720
Ile Asn His Pro Gly Asp Val Cys Val Gly Asp Phe Gly Ala Ala
18985 18990 18995
tgt ttc ccc gtg gat att aat gcc aac agg tat tat ggc tgg gct 113765
Cys Phe Pro Val Asp Ile Asn Ala Asn Arg Tyr Tyr Gly Trp Ala
19000 19005 19010
gga aca atc gcc aca aac tct cct gag tta ttg gct aga gat cca 113810
Gly Thr Ile Ala Thr Asn Ser Pro Glu Leu Leu Ala Arg Asp Pro
19015 19020 19025
tat gga cct gcc gtg gac ata tgg agt gcc ggg att gta tta ttt 113855
Tyr Gly Pro Ala Val Asp Ile Trp Ser Ala Gly Ile Val Leu Phe
19030 19035 19040
gaa atg gct aca gga cag aac tcg tta ttt gaa cga gac ggt tta 113900
Glu Met Ala Thr Gly Gln Asn Ser Leu Phe Glu Arg Asp Gly Leu
19045 19050 19055
gat ggc aat tgt gac agt gag cgt caa att aaa ctt att ata cga 113945
Asp Gly Asn Cys Asp Ser Glu Arg Gln Ile Lys Leu Ile Ile Arg
19060 19065 19070
cga tct gga act cat ccc aat gaa ttt ccc att aac cct aca tca 113990
Arg Ser Gly Thr His Pro Asn Glu Phe Pro Ile Asn Pro Thr Ser
19075 19080 19085
aat ctt cgt cga caa tac att ggt ttg gca aaa cgg tct tct cga 114035
Asn Leu Arg Arg Gln Tyr Ile Gly Leu Ala Lys Arg Ser Ser Arg
19090 19095 19100
aaa ccc gga tcc agg cca ttg tgg aca aat cta tat gag ttg cca 114080
Lys Pro Gly Ser Arg Pro Leu Trp Thr Asn Leu Tyr Glu Leu Pro
19105 19110 19115
att gat ttg gag tat ttg ata tgt aag atg tta tcg ttt gac gca 114125
Ile Asp Leu Glu Tyr Leu Ile Cys Lys Met Leu Ser Phe Asp Ala
19120 19125 19130
cgt cat cga cca tca gca gag gtg ttg ctt aac cac tct gtt ttc 114170
Arg His Arg Pro Ser Ala Glu Val Leu Leu Asn His Ser Val Phe
19135 19140 19145
caa act ctt ccc gat cca tat cca aat cca atg gaa gtt gga gat 114215
Gln Thr Leu Pro Asp Pro Tyr Pro Asn Pro Met Glu Val Gly Asp
19150 19155 19160
taa aattcattaa gcctgttaat aaaatattgt ataaattgtg tttataacgt 114268
ataacccgtt aaggcaaata gggtacaaac gcgcaatgtt ttgaaatact aatataaata 114328
acataaccaa tagaaactta atacagagtc acgccccatt acaacaagga taaaacacgg 114388
gatcattttc ttaacattgt agtagcgctg aaaagcgtcc cctcccccgg ctcacagagc 114448
tgctcttcgg tgtagttggg tatactggtg cgcctcattt aatcgcg atg ttt tta 114504
Met Phe Leu
19165
atc caa tgt ttg ata tcg gcc gtt ata ttt tac ata caa gtg acc 114549
Ile Gln Cys Leu Ile Ser Ala Val Ile Phe Tyr Ile Gln Val Thr
19170 19175 19180
aac gct ttg atc ttc aag ggc gac cac gtg agc ttg caa gtt aac 114594
Asn Ala Leu Ile Phe Lys Gly Asp His Val Ser Leu Gln Val Asn
19185 19190 19195
agc agt ctc acg tct atc ctt att ccc atg caa aat gat aat tat 114639
Ser Ser Leu Thr Ser Ile Leu Ile Pro Met Gln Asn Asp Asn Tyr
19200 19205 19210
aca gag ata aaa gga cag ctt gtc ttt att gga gag caa cta cct 114684
Thr Glu Ile Lys Gly Gln Leu Val Phe Ile Gly Glu Gln Leu Pro
19215 19220 19225
acc ggg aca aac tat agc gga aca ctg gaa ctg tta tac gcg gat 114729
Thr Gly Thr Asn Tyr Ser Gly Thr Leu Glu Leu Leu Tyr Ala Asp
19230 19235 19240
acg gtg gcg ttt tgt ttc cgg tca gta caa gta ata aga tac gac 114774
Thr Val Ala Phe Cys Phe Arg Ser Val Gln Val Ile Arg Tyr Asp
19245 19250 19255
gga tgt ccc cgg att aga acg agc gct ttt att tcg tgt agg tac 114819
Gly Cys Pro Arg Ile Arg Thr Ser Ala Phe Ile Ser Cys Arg Tyr
19260 19265 19270
aaa cat tcg tgg cat tat ggt aac tca acg gat cgg ata tca aca 114864
Lys His Ser Trp His Tyr Gly Asn Ser Thr Asp Arg Ile Ser Thr
19275 19280 19285
gag ccg gat gct ggt gta atg ttg aaa att acc aaa ccg gga ata 114909
Glu Pro Asp Ala Gly Val Met Leu Lys Ile Thr Lys Pro Gly Ile
19290 19295 19300
aat gat gct ggt gtg tat gta ctt ctt gtt cgg tta gac cat agc 114954
Asn Asp Ala Gly Val Tyr Val Leu Leu Val Arg Leu Asp His Ser
19305 19310 19315
aga tcc acc gat ggt ttc att ctt ggt gta aat gta tat aca gcg 114999
Arg Ser Thr Asp Gly Phe Ile Leu Gly Val Asn Val Tyr Thr Ala
19320 19325 19330
ggc tcg cat cac aac att cac ggg gtt atc tac act tct ccg tct 115044
Gly Ser His His Asn Ile His Gly Val Ile Tyr Thr Ser Pro Ser
19335 19340 19345
cta cag aat gga tat tct aca aga gcc ctt ttt caa caa gct cgt 115089
Leu Gln Asn Gly Tyr Ser Thr Arg Ala Leu Phe Gln Gln Ala Arg
19350 19355 19360
ttg tgt gat tta ccc gcg aca ccc aaa ggg tcc ggt acc tcc ctg 115134
Leu Cys Asp Leu Pro Ala Thr Pro Lys Gly Ser Gly Thr Ser Leu
19365 19370 19375
ttt caa cat atg ctt gat ctt cgt gcc ggt aaa tcg tta gag gat 115179
Phe Gln His Met Leu Asp Leu Arg Ala Gly Lys Ser Leu Glu Asp
19380 19385 19390
aac cct tgg tta cat gag gac gtt gtt acg aca gaa act aag tcc 115224
Asn Pro Trp Leu His Glu Asp Val Val Thr Thr Glu Thr Lys Ser
19395 19400 19405
gtt gtt aag gag ggg ata gaa aat cac gta tat cca acg gat atg 115269
Val Val Lys Glu Gly Ile Glu Asn His Val Tyr Pro Thr Asp Met
19410 19415 19420
tcc acg tta ccc gaa aag tcc ctt aat gat cct cca gaa aat cta 115314
Ser Thr Leu Pro Glu Lys Ser Leu Asn Asp Pro Pro Glu Asn Leu
19425 19430 19435
ctt ata att att cct ata gta gcg tct gtc atg atc ctc acc gcc 115359
Leu Ile Ile Ile Pro Ile Val Ala Ser Val Met Ile Leu Thr Ala
19440 19445 19450
atg gtt att gtt att gta ata agc gtt aag cga cgt aga att aaa 115404
Met Val Ile Val Ile Val Ile Ser Val Lys Arg Arg Arg Ile Lys
19455 19460 19465
aaa cat cca att tat cgc cca aat aca aaa aca aga agg ggc ata 115449
Lys His Pro Ile Tyr Arg Pro Asn Thr Lys Thr Arg Arg Gly Ile
19470 19475 19480
caa aat gcg aca cca gaa tcc gat gtg atg ttg gag gcc gcc att 115494
Gln Asn Ala Thr Pro Glu Ser Asp Val Met Leu Glu Ala Ala Ile
19485 19490 19495
gca caa cta gca acg att cgc gaa gaa tcc ccc cca cat tcc gtt 115539
Ala Gln Leu Ala Thr Ile Arg Glu Glu Ser Pro Pro His Ser Val
19500 19505 19510
gta aac ccg ttt gtt aaa tag aactaattat cccggatttt atattaaata 115590
Val Asn Pro Phe Val Lys
19515
aactatatgc gttttattta gcgttttgat tacgcgttgt gatatgaggg gaaggattaa 115650
gaatctccta actataagtt aacacgccca catttgggcg gggatgtttt atgaagcctt 115710
aaaggccgag ctggtataca cgagagcagt ccatggtttt agacctcggg cgaattgcgt 115770
ggttttaagt gactatattc cgagggtcgc ctgtaat atg ggg aca gtt aat 115822
Met Gly Thr Val Asn
19520
aaa cct gtg gtg ggg gta ttg atg ggg ttc gga att atc acg gga 115867
Lys Pro Val Val Gly Val Leu Met Gly Phe Gly Ile Ile Thr Gly
19525 19530 19535
acg ttg cgt ata acg aat ccg gtc aga gca tcc gtc ttg cga tac 115912
Thr Leu Arg Ile Thr Asn Pro Val Arg Ala Ser Val Leu Arg Tyr
19540 19545 19550
gat gat ttt cac acc gat gaa gac aaa ctg gat aca aac tcc gta 115957
Asp Asp Phe His Thr Asp Glu Asp Lys Leu Asp Thr Asn Ser Val
19555 19560 19565
tat gag cct tac tac cat tca gat cat gcg gag tct tca tgg gta 116002
Tyr Glu Pro Tyr Tyr His Ser Asp His Ala Glu Ser Ser Trp Val
19570 19575 19580
aat cgg gga gag tct tcg cga aaa gcg tac gat cat aac tca cct 116047
Asn Arg Gly Glu Ser Ser Arg Lys Ala Tyr Asp His Asn Ser Pro
19585 19590 19595
tat ata tgg cca cgt aat gat tat gat gga ttt tta gag aac gca 116092
Tyr Ile Trp Pro Arg Asn Asp Tyr Asp Gly Phe Leu Glu Asn Ala
19600 19605 19610
cac gaa cac cat ggg gtg tat aat cag ggc cgt ggt atc gat agc 116137
His Glu His His Gly Val Tyr Asn Gln Gly Arg Gly Ile Asp Ser
19615 19620 19625
ggg gaa cgg tta atg caa ccc aca caa atg tct gca cag gag gat 116182
Gly Glu Arg Leu Met Gln Pro Thr Gln Met Ser Ala Gln Glu Asp
19630 19635 19640
ctt ggg gac gat acg ggc atc cac gtt atc cct acg tta aac ggc 116227
Leu Gly Asp Asp Thr Gly Ile His Val Ile Pro Thr Leu Asn Gly
19645 19650 19655
gat gac aga cat aaa att gta aat gtg gac caa cgt caa tac ggt 116272
Asp Asp Arg His Lys Ile Val Asn Val Asp Gln Arg Gln Tyr Gly
19660 19665 19670
gac gtg ttt aaa gga gat ctt aat cca aaa ccc caa ggc caa aga 116317
Asp Val Phe Lys Gly Asp Leu Asn Pro Lys Pro Gln Gly Gln Arg
19675 19680 19685
ctc att gag gtg tca gtg gaa gaa aat cac ccg ttt act tta cgc 116362
Leu Ile Glu Val Ser Val Glu Glu Asn His Pro Phe Thr Leu Arg
19690 19695 19700
gca ccg att cag cgg att tat gga gtc cgg tac acc gag act tgg 116407
Ala Pro Ile Gln Arg Ile Tyr Gly Val Arg Tyr Thr Glu Thr Trp
19705 19710 19715
agc ttt ttg ccg tca tta acc tgt acg gga gac gca gcg ccc gcc 116452
Ser Phe Leu Pro Ser Leu Thr Cys Thr Gly Asp Ala Ala Pro Ala
19720 19725 19730
atc cag cat ata tgt tta aaa cat aca aca tgc ttt caa gac gtg 116497
Ile Gln His Ile Cys Leu Lys His Thr Thr Cys Phe Gln Asp Val
19735 19740 19745
gtg gtg gat gtg gat tgc gcg gaa aat act aaa gag gat cag ttg 116542
Val Val Asp Val Asp Cys Ala Glu Asn Thr Lys Glu Asp Gln Leu
19750 19755 19760
gcc gaa atc agt tac cgt ttt caa ggt aag aag gaa gcg gac caa 116587
Ala Glu Ile Ser Tyr Arg Phe Gln Gly Lys Lys Glu Ala Asp Gln
19765 19770 19775
ccg tgg att gtt gta aac acg agc aca ctg ttt gat gaa ctc gaa 116632
Pro Trp Ile Val Val Asn Thr Ser Thr Leu Phe Asp Glu Leu Glu
19780 19785 19790
tta gac ccc ccc gag att gaa ccg ggt gtc ttg aaa gta ctt cgg 116677
Leu Asp Pro Pro Glu Ile Glu Pro Gly Val Leu Lys Val Leu Arg
19795 19800 19805
aca gaa aaa caa tac ttg ggt gtg tac att tgg aac atg cgc ggc 116722
Thr Glu Lys Gln Tyr Leu Gly Val Tyr Ile Trp Asn Met Arg Gly
19810 19815 19820
tcc gat ggt acg tct acc tac gcc acg ttt ttg gtc acc tgg aaa 116767
Ser Asp Gly Thr Ser Thr Tyr Ala Thr Phe Leu Val Thr Trp Lys
19825 19830 19835
ggg gat gaa aaa aca aga aac cct acg ccc gca gta act cct caa 116812
Gly Asp Glu Lys Thr Arg Asn Pro Thr Pro Ala Val Thr Pro Gln
19840 19845 19850
cca aga ggg gct gag ttt cat atg tgg aat tac cac tcg cat gta 116857
Pro Arg Gly Ala Glu Phe His Met Trp Asn Tyr His Ser His Val
19855 19860 19865
ttt tca gtt ggt gat acg ttt agc ttg gca atg cat ctt cag tat 116902
Phe Ser Val Gly Asp Thr Phe Ser Leu Ala Met His Leu Gln Tyr
19870 19875 19880
aag ata cat gaa gcg cca ttt gat ttg ctg tta gag tgg ttg tat 116947
Lys Ile His Glu Ala Pro Phe Asp Leu Leu Leu Glu Trp Leu Tyr
19885 19890 19895
gtc ccc atc gat cct aca tgt caa cca atg cgg tta tat tct acg 116992
Val Pro Ile Asp Pro Thr Cys Gln Pro Met Arg Leu Tyr Ser Thr
19900 19905 19910
tgt ttg tat cat ccc aac gca ccc caa tgc ctc tct cat atg aat 117037
Cys Leu Tyr His Pro Asn Ala Pro Gln Cys Leu Ser His Met Asn
19915 19920 19925
tcc ggt tgt aca ttt acc tcg cca cat tta gcc cag cgt gtt gca 117082
Ser Gly Cys Thr Phe Thr Ser Pro His Leu Ala Gln Arg Val Ala
19930 19935 19940
agc aca gtg tat caa aat tgt gaa cat gca gat aac tac acc gca 117127
Ser Thr Val Tyr Gln Asn Cys Glu His Ala Asp Asn Tyr Thr Ala
19945 19950 19955
tat tgt ctg gga ata tct cat atg gag cct agc ttt ggt cta atc 117172
Tyr Cys Leu Gly Ile Ser His Met Glu Pro Ser Phe Gly Leu Ile
19960 19965 19970
tta cac gac ggg ggc acc acg tta aag ttt gta gat aca ccc gag 117217
Leu His Asp Gly Gly Thr Thr Leu Lys Phe Val Asp Thr Pro Glu
19975 19980 19985
agt ttg tcg gga tta tac gtt ttt gtg gtg tat ttt aac ggg cat 117262
Ser Leu Ser Gly Leu Tyr Val Phe Val Val Tyr Phe Asn Gly His
19990 19995 20000
gtt gaa gcc gta gca tac act gtt gta tcc aca gta gat cat ttt 117307
Val Glu Ala Val Ala Tyr Thr Val Val Ser Thr Val Asp His Phe
20005 20010 20015
gta aac gca att gaa gag cgt gga ttt ccg cca acg gcc ggt cag 117352
Val Asn Ala Ile Glu Glu Arg Gly Phe Pro Pro Thr Ala Gly Gln
20020 20025 20030
cca ccg gcg act act aaa ccc aag gaa att acc ccc gta aac ccc 117397
Pro Pro Ala Thr Thr Lys Pro Lys Glu Ile Thr Pro Val Asn Pro
20035 20040 20045
gga acg tca cca ctt cta cga tat gcc gca tgg acc gga ggg ctt 117442
Gly Thr Ser Pro Leu Leu Arg Tyr Ala Ala Trp Thr Gly Gly Leu
20050 20055 20060
gca gca gta gta ctt tta tgt ctc gta ata ttt tta atc tgt acg 117487
Ala Ala Val Val Leu Leu Cys Leu Val Ile Phe Leu Ile Cys Thr
20065 20070 20075
gct aaa cga atg agg gtt aaa gcc tat agg gta gac aag tcc ccg 117532
Ala Lys Arg Met Arg Val Lys Ala Tyr Arg Val Asp Lys Ser Pro
20080 20085 20090
tat aac caa agc atg tat tac gct ggc ctt cca gtg gac gat ttc 117577
Tyr Asn Gln Ser Met Tyr Tyr Ala Gly Leu Pro Val Asp Asp Phe
20095 20100 20105
gag gac tcg gaa tct acg gat acg gaa gaa gag ttt ggt aac gcg 117622
Glu Asp Ser Glu Ser Thr Asp Thr Glu Glu Glu Phe Gly Asn Ala
20110 20115 20120
att gga ggg agt cac ggg ggt tcg agt tac acg gtg tat ata gat 117667
Ile Gly Gly Ser His Gly Gly Ser Ser Tyr Thr Val Tyr Ile Asp
20125 20130 20135
aag acc cgg tga tcaccgaacc ggggcaacgc cgagcgtgta aatttaaata 117719
Lys Thr Arg
20140
aaaaacagta cgcttttatc cggtgtatgt tttaaattta tttttttttt ctatataaag 117779
ggatggggtg tcaggatctc tcgtaggttc ttgggactcc aagggacccg cagcccaggt 117839
acgcgtcaaa aagcctgtga caaattcccc cggggcggtc atataattcg gcgcggatgc 117899
atttattatc gggcgacaat ccaatagtcg gaagtccggc gtgtcccaaa cagcaccaat 117959
atgccatcaa tatcagccgc tcgctgagct ccagatgcga ttgtggatcc ggcatgcatt 118019
gctcgcgtaa taacgataac accgctctgc ggatcggcgg aaccgagggt ggacgttgtc 118079
tatccgctgc gagctccccg cacatccgtc cgaaggcgtc ctttgccctc ccccacaaac 118139
tcgcgcacgg tctctcggag gggctgtgta taaaagcctc gtaggcctcg gtagatatgg 118199
cgcatagcgc ggcggcgaag taccgttcgg cgtcgttcat gagagcctgg tgcgcggggg 118259
tatcgtgtcg tgtgcagtaa agtccggcat attcaccgcc cgggtgctca ccgcgggatc 118319
cgcagagatt cataccacac ggcccgtgtg gggaccccgt gggttaatgg agacgttcaa 118379
accagcaccc tctaccctct aatatccaca acaccccact cccccacaga cagacatcaa 118439
aaaaagacac gagccaaacc attgtattta tttataaaga ctacacgcca tgggggggcg 118499
gtatatcatg ccggcgcggg gcttcgtgtc gacgtcgctt ccaatctaca cccccctcgc 118559
ccaacgcggt gagcgctttc gcggctgtat attccgcggt ttctgcaccc gcgtatcgct 118619
cgatggcacg ctggggtctc tttggggacg gggtgttgca cccatccccg tctcgataac 118679
aatcgccggg taccccgtct ggttcacaag aatcggtgct ctcctctgat tcttcttcca 118739
ctataaagtc ttccccgtcc gatgattccg cgtcggaatc ccgaaattca attacatccg 118799
atggcgtaga cccatcatcg tcgctatcgt cttcaccacc atcatcagat acgtcacatt 118859
ccagtgcgct cctatgcaaa ggaacattcg gcgcctcaat gaacccgtct tccccccgag 118919
ggggtcgatg ggacacttcc attaaagtag cgcgtaagca ctgcgtctgg gtgggttgaa 118979
gctgtaacaa ttcctcccag cacgctaacg tgggacgctg tctggtgcga cccattagat 119039
aaaagtcgag gcatatgcgc cgcaatcttc gtgaatctcg cgtgacgcgt cccgacgact 119099
gtgcaataca taaaaaaaca cggttgatgt cctcaacgag cctggccggg tggatctccc 119159
aacccggagt acaaaacgcg ccggggcccc gcgacgtatc ccggccgttc aggggtcctg 119219
gtgcagatcc atattccatc tttccgttaa catcaaccga tgccccgggt tttgactcgg 119279
acgagtcgcc ccgcgtagcc ggtgaggtgc aaaacatgtc cttggggccg tagtaacctt 119339
ttcccttaaa accgactcga cgctgtcgcg ttatgaatcg gacgaaccct gcacaacaaa 119399
acacacaccc aaacgtttac atctatgaat aaggctactt gggtaaaatg gcaatggggg 119459
attccggggc gggagacctt cgattgggtt gcctttataa caccaaaaaa aggggggggc 119519
cccgtgtgtt tttttttatc acgtcaaatc gattttaaaa agcctgccgc tccatttgga 119579
atatatatat tctgtgaaaa gcccgcccac accccataaa accgcgacat cgcgggaaca 119639
cgcgcgaaca agaaactctc tctctttctc tatatatata tatatatata tatatatata 119699
tagaaagaaa gtgcgaacgg tggttggaca catgccaaaa catgaaaacc catacagtga 119759
aaaaacggga agtgcgaatg cagatcaaaa gagtgtatcc gattggcgta caccacagac 119819
atgcggacgc ccaatttaac cccccccctt tttcaccccc ccaccccacc ccattccacc 119879
ccaggaagtg cgaacgggtt tacatgcctc agatatgaag ttcttcgact tgtttttgaa 119939
taaatttttt tgtgattttc tacaacggtt tagagaatta tggttataaa catcggcggg 119999
gtaccgcgcc ccctccccat cggcggggta ccgcgccccc tccccatcgg cggggtaccg 120059
cgccccctcc ccatcggcgg ggtaccgcgc cccctcccca tcggcggggt accgcgcccc 120119
ctccccatcg gcggggggtt acgtgaacac cacaaccccg tgtgtatttt atgggttatc 120179
gcgggcttcg tgccgcctga cataatcgtt gggaggggtg gtggtgtata cgcttgttga 120239
ttgcgcgaac gtaatgacga cggagaggga cccaaacaca ccgtcgacgt gcatttgatt 120299
aactagatgc cggatgggtg gaaacaaccc gtgttatata agatgttttg catgtgagac 120359
aaccccaatt gtgtttatgt atattatata tcgtctgtag acacacgatg attggttgtt 120419
atttaaacat atgtaaatga aattcacatg tctggtatcc cttgttatga tgttgtaagg 120479
tatgcggaaa tagacaccgg gcgtacatcg ccaaccagcg gtctctcctt aaacgcatac 120539
tatggtccat gaacttcccg cctcgagtct cgtccaatca ctacatcgtc ttatcattaa 120599
gaatatttac acggtgacga cacggggagg aaatatgcgg tcgagggggg ggcacaacac 120659
gttttaagta ctgttggaac tccctcacca accgcaatcg caatcctttg aaggctgcga 120719
gagcgtttgg aaaactcggg tacgtctaaa ttcaccccag tgcg atg gat acg 120772
Met Asp Thr
ccg ccg atg cag cgc tct aca ccc caa cgc gcg ggg tcg cct gat 120817
Pro Pro Met Gln Arg Ser Thr Pro Gln Arg Ala Gly Ser Pro Asp
20145 20150 20155
act ttg gag tta atg gac ctg ttg gac gcg gcc gcg gcg gcc gcc 120862
Thr Leu Glu Leu Met Asp Leu Leu Asp Ala Ala Ala Ala Ala Ala
20160 20165 20170
gaa cac agg gcc cgg gtg gtc acc tcg agt cag cct gac gat cta 120907
Glu His Arg Ala Arg Val Val Thr Ser Ser Gln Pro Asp Asp Leu
20175 20180 20185
cta ttt gga gag aac ggg gtc atg gtg gga cgg gaa cac gag atc 120952
Leu Phe Gly Glu Asn Gly Val Met Val Gly Arg Glu His Glu Ile
20190 20195 20200
gtt tca att ccc tcc gta tcg gga ctt caa cca gaa ccc aga acg 120997
Val Ser Ile Pro Ser Val Ser Gly Leu Gln Pro Glu Pro Arg Thr
20205 20210 20215
gaa gat gtt ggc gaa gag cta aca caa gac gac tac gta tgc gag 121042
Glu Asp Val Gly Glu Glu Leu Thr Gln Asp Asp Tyr Val Cys Glu
20220 20225 20230
gac ggt cag gat cta atg ggc tcg cct gta atc ccg ctg gcc gag 121087
Asp Gly Gln Asp Leu Met Gly Ser Pro Val Ile Pro Leu Ala Glu
20235 20240 20245
gtc ttc cac acc cga ttc tcg gag gcc ggc gcg cga gaa cca aca 121132
Val Phe His Thr Arg Phe Ser Glu Ala Gly Ala Arg Glu Pro Thr
20250 20255 20260
gga gcc gat cgc tcc ctt gag aca gtc tct ctc gga acg aag ctt 121177
Gly Ala Asp Arg Ser Leu Glu Thr Val Ser Leu Gly Thr Lys Leu
20265 20270 20275
gct agg tct cca aaa cca ccg atg aac gat ggg gaa acg ggc aga 121222
Ala Arg Ser Pro Lys Pro Pro Met Asn Asp Gly Glu Thr Gly Arg
20280 20285 20290
ggt acg acc cct ccg ttc ccg cag gcc ttc tcc cct gta tcc ccc 121267
Gly Thr Thr Pro Pro Phe Pro Gln Ala Phe Ser Pro Val Ser Pro
20295 20300 20305
gcg tct cct gtt gga gac gcc gcc ggg aac gat caa cgg gaa gac 121312
Ala Ser Pro Val Gly Asp Ala Ala Gly Asn Asp Gln Arg Glu Asp
20310 20315 20320
cag cgg tct ata ccc cga caa acg acg aga gga aat tca cca ggt 121357
Gln Arg Ser Ile Pro Arg Gln Thr Thr Arg Gly Asn Ser Pro Gly
20325 20330 20335
ttg ccg tcg gtg gtc cat cga gac aga caa act cag tcc atc tcg 121402
Leu Pro Ser Val Val His Arg Asp Arg Gln Thr Gln Ser Ile Ser
20340 20345 20350
ggt aaa aag ccg ggc gat gag caa gcg ggt cat gcg cat gca tcg 121447
Gly Lys Lys Pro Gly Asp Glu Gln Ala Gly His Ala His Ala Ser
20355 20360 20365
ggg gac gga gta gtt ctc cag aaa act caa cgg ccc gct cag gga 121492
Gly Asp Gly Val Val Leu Gln Lys Thr Gln Arg Pro Ala Gln Gly
20370 20375 20380
aag agc ccg aag aaa aag act ttg aag gtt aag gtc cca ctc ccg 121537
Lys Ser Pro Lys Lys Lys Thr Leu Lys Val Lys Val Pro Leu Pro
20385 20390 20395
gcg cgg aaa ccc ggt gga cct gta ccc ggc ccg gtt gag caa ttg 121582
Ala Arg Lys Pro Gly Gly Pro Val Pro Gly Pro Val Glu Gln Leu
20400 20405 20410
tac cac gtc ctt tcg gac agc gtt ccc gct aag ggg gca aag gcg 121627
Tyr His Val Leu Ser Asp Ser Val Pro Ala Lys Gly Ala Lys Ala
20415 20420 20425
gac ctg ccg ttt gag acc gat gat acc cgc cca agg aaa cat gat 121672
Asp Leu Pro Phe Glu Thr Asp Asp Thr Arg Pro Arg Lys His Asp
20430 20435 20440
gcc cgg ggt ata aca cct cgc gtc cct gga cgt tcg tcg ggg ggc 121717
Ala Arg Gly Ile Thr Pro Arg Val Pro Gly Arg Ser Ser Gly Gly
20445 20450 20455
aaa cct aga gcg ttt ttg gcc ctg ccg gga aga tcc cac gca cca 121762
Lys Pro Arg Ala Phe Leu Ala Leu Pro Gly Arg Ser His Ala Pro
20460 20465 20470
gac ccg att gag gat gac agc cca gtg gag aaa aag cca aag agt 121807
Asp Pro Ile Glu Asp Asp Ser Pro Val Glu Lys Lys Pro Lys Ser
20475 20480 20485
cgt gag ttt gtt tcg tct tca tcc tct tcc tcg tcg tgg gga tcg 121852
Arg Glu Phe Val Ser Ser Ser Ser Ser Ser Ser Ser Trp Gly Ser
20490 20495 20500
tca tcg gag gat gaa gac gat gaa ccc cgg cgc gtt tcg gtg gga 121897
Ser Ser Glu Asp Glu Asp Asp Glu Pro Arg Arg Val Ser Val Gly
20505 20510 20515
agt gaa act aca ggc agc agg tcc gga cgc gaa cac gcc cct tcc 121942
Ser Glu Thr Thr Gly Ser Arg Ser Gly Arg Glu His Ala Pro Ser
20520 20525 20530
ccg tca aat tcg gat gat tcg gac tca aat gat ggt ggg tcg acg 121987
Pro Ser Asn Ser Asp Asp Ser Asp Ser Asn Asp Gly Gly Ser Thr
20535 20540 20545
aaa caa aat atc caa ccg gga tat cga tcc atc agc ggt ccc gat 122032
Lys Gln Asn Ile Gln Pro Gly Tyr Arg Ser Ile Ser Gly Pro Asp
20550 20555 20560
ccg agg att cgt aag acc aaa cgt ctt gcg ggg gaa ccg ggg cgc 122077
Pro Arg Ile Arg Lys Thr Lys Arg Leu Ala Gly Glu Pro Gly Arg
20565 20570 20575
cag aga cag aaa tca ttt tcc ctg ccg cga tcc aga acc ccg ata 122122
Gln Arg Gln Lys Ser Phe Ser Leu Pro Arg Ser Arg Thr Pro Ile
20580 20585 20590
att ccc ccg gtg tcg ggg ccg ctc atg atg ccc gac gga agc cct 122167
Ile Pro Pro Val Ser Gly Pro Leu Met Met Pro Asp Gly Ser Pro
20595 20600 20605
tgg ccc gga tcg gca ccc ctc cca tcc aac agg gtg cgg ttt gga 122212
Trp Pro Gly Ser Ala Pro Leu Pro Ser Asn Arg Val Arg Phe Gly
20610 20615 20620
ccg tcc ggg gag acc aga gag ggt cac tgg gag gat gag gct gtg 122257
Pro Ser Gly Glu Thr Arg Glu Gly His Trp Glu Asp Glu Ala Val
20625 20630 20635
aga gcg gcg cgg gct cgt tac gag gcc tca acg gaa ccc gtg ccg 122302
Arg Ala Ala Arg Ala Arg Tyr Glu Ala Ser Thr Glu Pro Val Pro
20640 20645 20650
ctt tac gtg ccg gag ttg gga gat ccg gct aga cag tac cgc gcg 122347
Leu Tyr Val Pro Glu Leu Gly Asp Pro Ala Arg Gln Tyr Arg Ala
20655 20660 20665
ctg att aac ctg atc tac tgt cca gac aga gac cct ata gca tgg 122392
Leu Ile Asn Leu Ile Tyr Cys Pro Asp Arg Asp Pro Ile Ala Trp
20670 20675 20680
ctc cag aac ccc aag ctg acc ggt gtc aac tcg gcc ctg aac cag 122437
Leu Gln Asn Pro Lys Leu Thr Gly Val Asn Ser Ala Leu Asn Gln
20685 20690 20695
ttc tac caa aag ctg ttg cca ccg gga cgg gcg ggt acc gcc gtt 122482
Phe Tyr Gln Lys Leu Leu Pro Pro Gly Arg Ala Gly Thr Ala Val
20700 20705 20710
acg ggg agc gta gcg tct ccc gtt ccg cat gta ggc gaa gcc atg 122527
Thr Gly Ser Val Ala Ser Pro Val Pro His Val Gly Glu Ala Met
20715 20720 20725
gcc acg ggg gag gcc ctc tgg gct ctc ccc cac gcg gcc gcg gcc 122572
Ala Thr Gly Glu Ala Leu Trp Ala Leu Pro His Ala Ala Ala Ala
20730 20735 20740
gtg gct atg agc cgt cga tac gac cgg gcc caa aaa cac ttt atc 122617
Val Ala Met Ser Arg Arg Tyr Asp Arg Ala Gln Lys His Phe Ile
20745 20750 20755
cta cag agt ctc cgc aga gcc ttt gcc agc atg gca tac ccc gag 122662
Leu Gln Ser Leu Arg Arg Ala Phe Ala Ser Met Ala Tyr Pro Glu
20760 20765 20770
gca acg ggc tcc agt ccg gcg gcg cgg atc tcc cgc ggt cac cct 122707
Ala Thr Gly Ser Ser Pro Ala Ala Arg Ile Ser Arg Gly His Pro
20775 20780 20785
tct cca aca acc ccg gcc aca cag gct ccc gac cct cag ccg tcg 122752
Ser Pro Thr Thr Pro Ala Thr Gln Ala Pro Asp Pro Gln Pro Ser
20790 20795 20800
gcc gcc gca cgc tct ctt tct gtg tgt cca ccg gat gat cgt tta 122797
Ala Ala Ala Arg Ser Leu Ser Val Cys Pro Pro Asp Asp Arg Leu
20805 20810 20815
cga act ccg cgc aag cgc aag tcc cag cca gtc gag agc aga agc 122842
Arg Thr Pro Arg Lys Arg Lys Ser Gln Pro Val Glu Ser Arg Ser
20820 20825 20830
ctc ctc gac aag att agg gag aca ccc gtc gcg gac gcc cgg gtt 122887
Leu Leu Asp Lys Ile Arg Glu Thr Pro Val Ala Asp Ala Arg Val
20835 20840 20845
gca gac gat cat gtg gtt tcc aag gcc aag agg cgg gta tcc gag 122932
Ala Asp Asp His Val Val Ser Lys Ala Lys Arg Arg Val Ser Glu
20850 20855 20860
ccc gtg acc atc acc tcg ggc cct gtg gtg gat ccc ccc gcc gta 122977
Pro Val Thr Ile Thr Ser Gly Pro Val Val Asp Pro Pro Ala Val
20865 20870 20875
ata acg atg cca ctt gac gga ccg gcc cca aac ggg gga ttt cgg 123022
Ile Thr Met Pro Leu Asp Gly Pro Ala Pro Asn Gly Gly Phe Arg
20880 20885 20890
cgt att ccc cgg ggg gcc ctg cat acc ccg gtc ccg tcg gac cag 123067
Arg Ile Pro Arg Gly Ala Leu His Thr Pro Val Pro Ser Asp Gln
20895 20900 20905
gct cgc aag gcg tac tgt acc ccc gaa acc atc gcc cgt ctg gtc 123112
Ala Arg Lys Ala Tyr Cys Thr Pro Glu Thr Ile Ala Arg Leu Val
20910 20915 20920
gac gac cca ttg ttt ccc acg gcc tgg cgc cct gcg cta agc ttt 123157
Asp Asp Pro Leu Phe Pro Thr Ala Trp Arg Pro Ala Leu Ser Phe
20925 20930 20935
gat ccc ggc gcc ttg gcg gaa atc gcc gct cgg cgt ccg ggc gga 123202
Asp Pro Gly Ala Leu Ala Glu Ile Ala Ala Arg Arg Pro Gly Gly
20940 20945 20950
gga gac cga cgg ttt ggt cca ccc agc gga gtg gag gcg ctg cga 123247
Gly Asp Arg Arg Phe Gly Pro Pro Ser Gly Val Glu Ala Leu Arg
20955 20960 20965
cgg agg tgc gcc tgg atg cgg cag atc cca gac ccg gag gat gtg 123292
Arg Arg Cys Ala Trp Met Arg Gln Ile Pro Asp Pro Glu Asp Val
20970 20975 20980
agg ctt ctg atc atc tac gat ccg ttg ccc gga gag gac atc aac 123337
Arg Leu Leu Ile Ile Tyr Asp Pro Leu Pro Gly Glu AspIle Asn
20985 20990 20995
ggc ccc ctc gag agc acc ctc gcg aca gat ccg gga ccg tca tgg 123382
Gly Pro Leu Glu Ser Thr Leu Ala Thr Asp Pro Gly Pro Ser Trp
21000 21005 21010
agt cca tcc cga ggg gga ctg tct gtg gtc ctg gca gcc ctg agt 123427
Ser Pro Ser Arg Gly Gly Leu Ser Val Val Leu Ala Ala Leu Ser
21015 21020 21025
aac cgg ttg tgc ctg ccg agc act cat gcc tgg gcc ggg aac tgg 123472
Asn Arg Leu Cys Leu Pro Ser Thr His Ala Trp Ala Gly Asn Trp
21030 21035 21040
acc ggc ccg ccg gac gtg tcc gct ttg aac gcc cgg ggc gtt tta 123517
Thr Gly Pro Pro Asp Val Ser Ala Leu Asn Ala Arg Gly Val Leu
21045 21050 21055
tta ctg tcg acc cga gac ctg gcc ttt gcc ggg gcc gtc gag tat 123562
Leu Leu Ser Thr Arg Asp Leu Ala Phe Ala Gly Ala Val Glu Tyr
21060 21065 21070
cta ggc tcg cgg ttg gcc tct gcc cgg cgc cgg ttg ctg gtg ttg 123607
Leu Gly Ser Arg Leu Ala Ser Ala Arg Arg Arg Leu Leu Val Leu
21075 21080 21085
gac gcg gtg gcc ctc gag agg tgg ccc agg gat gga ccc gct ttg 123652
Asp Ala Val Ala Leu Glu Arg Trp Pro Arg Asp Gly Pro Ala Leu
21090 21095 21100
tct cag tat cac gtg tac gtc cgg gcc ccg gcg cga ccg gac gcc 123697
Ser Gln Tyr His Val Tyr Val Arg Ala Pro Ala Arg Pro Asp Ala
21105 21110 21115
cag gcc gtc gtc cga tgg cca gac tcg gcg gtc aca gaa gga ctc 123742
Gln Ala Val Val Arg Trp Pro Asp Ser Ala Val Thr Glu Gly Leu
21120 21125 21130
gcc cgg gcc gtg ttt gca tcg tcg cgc acc ttt ggg cca gcg agt 123787
Ala Arg Ala Val Phe Ala Ser Ser Arg Thr Phe Gly Pro Ala Ser
21135 21140 21145
ttt gct cgt atc gag act gcg ttt gcc aac ctg tac ccg ggc gaa 123832
Phe Ala Arg Ile Glu Thr Ala Phe Ala Asn Leu Tyr Pro Gly Glu
21150 21155 21160
caa ccc ctg tgt ttg tgc cgc ggt ggg aac gtc gca tac acc gtg 123877
Gln Pro Leu Cys Leu Cys Arg Gly Gly Asn Val Ala Tyr Thr Val
21165 21170 21175
tgt acc cgc gcg ggc ccc aag acc cgc gtc ccc ctg tcg ccc cgt 123922
Cys Thr Arg Ala Gly Pro Lys Thr Arg Val Pro Leu Ser Pro Arg
21180 21185 21190
gaa tac cgg cag tac gtg ctg ccg ggt ttt gac ggt tgc aag gac 123967
Glu Tyr Arg Gln Tyr Val Leu Pro Gly Phe Asp Gly Cys Lys Asp
21195 21200 21205
ctc gcg cga cag tct cgg ggt ctg ggg ctc ggg gca gcc gac ttt 124012
Leu Ala Arg Gln Ser Arg Gly Leu Gly Leu Gly Ala Ala Asp Phe
21210 21215 21220
gtg gac gag gcg gca cat agc cac cgc gca gca aac cga tgg ggc 124057
Val Asp Glu Ala Ala His Ser His Arg Ala Ala Asn Arg Trp Gly
21225 21230 21235
ctg ggt gcc gcg ctt cga ccc gtc ttc ctt ccc gag gga cgg aga 124102
Leu Gly Ala Ala Leu Arg Pro Val Phe Leu Pro Glu Gly Arg Arg
21240 21245 21250
ccg ggg gcc gcc ggg ccg gag gcc ggc gac gta ccc acc tgg gcg 124147
Pro Gly Ala Ala Gly Pro Glu Ala Gly Asp Val Pro Thr Trp Ala
21255 21260 21265
agg gtg ttt tgc cgc cac gcc ctg ctg gaa ccc gac cct gcc gca 124192
Arg Val Phe Cys Arg His Ala Leu Leu Glu Pro Asp Pro Ala Ala
21270 21275 21280
gaa cca ctc gtg ctt cca ccc gtg gcc ggt cgg tcg gtg gcg ctg 124237
Glu Pro Leu Val Leu Pro Pro Val Ala Gly Arg Ser Val Ala Leu
21285 21290 21295
tat gcg tcg gcg gac gag gct cgg aat gcc ctc ccc ccg att ccc 124282
Tyr Ala Ser Ala Asp Glu Ala Arg Asn Ala Leu Pro Pro Ile Pro
21300 21305 21310
aga gta atg tgg ccg ccc ggt ttt ggg gcc gcg gag acg gtg ttg 124327
Arg Val Met Trp Pro Pro Gly Phe Gly Ala Ala Glu Thr Val Leu
21315 21320 21325
gag ggg agc gac gga aca cgg ttc gtg ttc gga cac cac ggg ggc 124372
Glu Gly Ser Asp Gly Thr Arg Phe Val Phe Gly His His Gly Gly
21330 21335 21340
tcg gaa cgg ccg tca gaa acc cag gcg ggg cga cag cgg cgc acc 124417
Ser Glu Arg Pro Ser Glu Thr Gln Ala Gly Arg Gln Arg Arg Thr
21345 21350 21355
gca gac gac aga gaa cac gct ttg gag ctg gac gat tgg gag gtg 124462
Ala Asp Asp Arg Glu His Ala Leu Glu Leu Asp Asp Trp Glu Val
21360 21365 21370
ggg tgt gaa gac gcg tgg gac agc gag gag ggg ggc ggg gac gac 124507
Gly Cys Glu Asp Ala Trp Asp Ser Glu Glu Gly Gly Gly Asp Asp
21375 21380 21385
ggg gac gca ccg ggg tca tcc ttt ggg gtg agc atc gtg tcg gtg 124552
Gly Asp Ala Pro Gly Ser Ser Phe Gly Val Ser Ile Val Ser Val
21390 21395 21400
gcc ccg ggt gtg ctg cga gac cgc cgg gtg ggt ttg cgc ccg gcg 124597
Ala Pro Gly Val Leu Arg Asp Arg Arg Val Gly Leu Arg Pro Ala
21405 21410 21415
gtc aag gtg gag ctg ttg tcc tcg tcc tcg tcc tcc gag gac gag 124642
Val Lys Val Glu Leu Leu Ser Ser Ser Ser Ser Ser Glu Asp Glu
21420 21425 21430
gac gat gtg tgg gga ggg cgc ggg ggg agg agc ccc ccg cag agt 124687
Asp Asp Val Trp Gly Gly Arg Gly Gly Arg Ser Pro Pro Gln Ser
21435 21440 21445
cgg ggg tga cggagtcccc tccttttctc gtgagcgcca ctggcgcgcg 124736
Arg Gly
21450
gactgtttgt tgttaataaa agcggaacgg tttttatgaa aaaagtgtct gtctgtctgt 124796
gcgggcgggc gacgggcggg ctggtcggac ccccccccga aaataacccc cccccggttt 124856
ctgggcgccc ggcggacccc gggagagg 124884
<210>9
<211>238
<212>PRT
<213>水痘带状疱疹
<400>9
Met His Val Ile Ser Glu Thr Leu Ala Tyr Gly His Val Pro Ala Phe
1 5 10 15
Ile Met Gly Ser Thr Leu Val Arg Pro Ser Leu Asn Ala Thr Ala Glu
20 25 30
Glu Asn Pro Ala Ser Glu Thr Arg Cys Leu Leu Arg Val Leu Ala Gly
35 40 45
Arg Thr Val Asp Leu Pro Gly Gly Gly Thr Leu His Ile Thr Cys Thr
50 55 60
Lys Thr Tyr Val Ile Ile Gly Lys Tyr Ser Lys Pro Gly Glu Arg Leu
65 70 75 80
Ser Leu Ala Arg Leu Ile Gly Arg Ala Met Thr Pro Gly Gly Ala Arg
85 90 95
Thr Phe Ile Ile Leu Ala Met Lys Glu Lys Arg Ser Thr Thr Leu Gly
100 105 110
Tyr Glu Cys Gly Thr Gly Leu His Leu Leu Ala Pro Ser Met Gly Thr
115 120 125
Phe Leu Arg Thr His Gly Leu Ser Asn Arg Asp Leu Cys Leu Trp Arg
130 135 140
Gly Asn Ile Tyr Asp Met His Met Gln Arg Leu Met Phe Trp Glu Asn
145 150 155 160
Ile Ala Gln Asn Thr Thr Glu Thr Pro Cys Ile Thr Ser Thr Leu Thr
165 170 175
Cys Asn Leu Thr Glu Asp Ser Gly Glu Ala Ala Leu Thr Thr Ser Asp
180 185 190
Arg Pro Thr Leu Pro Thr Leu Thr Ala Gln Gly Arg Pro Thr Val Ser
195 200 205
Asn Ile Arg Gly Ile Leu Lys Gly Ser Pro Arg Gln Gln Pro Val Cys
210 215 220
His Arg Val Arg Phe Ala Glu Pro Thr Glu Gly Val Leu Met
225 230 235
<210>10
<211>259
<212>PRT
<213>水痘带状疱疹
<400>10
Met Gln Thr Val Cys Ala Ser Leu Cys Gly Tyr Ala Arg Ile Pro Thr
1 5 10 15
Glu Glu Pro Ser Tyr Glu Glu Val Arg Val Asn Thr His Pro Gln Gly
20 25 30
Ala Ala Leu Leu Arg Leu Gln Glu Ala Leu Thr Ala Val Asn Gly Leu
35 40 45
Leu Pro Ala Pro Leu Thr Leu Glu Asp Val Val Ala Ser Ala Asp Asn
50 55 60
Thr Arg Arg Leu Val Arg Ala Gln Ala Leu Ala Arg Thr Tyr Ala Ala
65 70 75 80
Cys Ser Arg Asn Ile Glu Cys Leu Lys Gln His His Phe Thr Glu Asp
85 90 95
Asn Pro Gly Leu Asn Ala Val Val Arg Ser His Met Glu Asn Ser Lys
100 105 110
Arg Leu Ala Asp Met Cys Leu Ala Ala Ile Thr His Leu Tyr Leu Ser
115 120 125
Val Gly Ala Val Asp Val Thr Thr Asp Asp Ile Val Asp Gln Thr Leu
130 135 140
Arg Met Thr Ala Glu Ser Glu Val Val Met Ser Asp Val Val Leu Leu
145 150 155 160
Glu Lys Thr Leu Gly Val Val Ala Lys Pro Gln Ala Ser Phe Asp Val
165 170 175
Ser His Asn His Glu Leu Ser Ile Ala Lys Gly Glu Asn Val Gly Leu
180 185 190
Lys Thr Ser Pro Ile Lys Ser Glu Ala Thr Gln Leu Ser Glu Ile Lys
195 200 205
Pro Pro Leu Ile Glu Val Ser Asp Asn Asn Thr Ser Asn Leu Thr Lys
210 215 220
Lys Thr Tyr Pro Thr Glu Thr Leu Gln Pro Val Leu Thr Pro Lys Gln
225 230 235 240
Thr Gln Asp Val Gln Arg Thr Thr Pro Ala Ile Lys Lys Ser His Val
245 250 255
Met Leu Val
<210>11
<211>87
<212>PRT
<213>水痘带状疱疹
<400>11
Met Gly Ser Ile Thr Ala Ser Phe Ile Leu Ile Thr Met Gln Ile Leu
1 5 10 15
Phe Phe Cys Glu Asp Ser Ser Gly Glu Pro Asn Phe Ala Glu Arg Asn
20 25 30
Phe Trp His Ala Ser Cys Ser Ala Arg Gly Val Tyr Ile Asp Gly Ser
35 40 45
Met Ile Thr Thr Leu Phe Phe Tyr Ala Ser Leu Leu Gly Val Cys Val
50 55 60
Ala Leu Ile Ser Leu Ala Tyr His Ala Cys Phe Arg Leu Phe Thr Arg
65 70 75 80
Ser Val Leu Arg Ser Thr Trp
85
<210>12
<211>302
<212>PRT
<213>水痘带状疱疹
<400>12
Met Ala Ser Ser Asp Gly Asp Arg Leu Cys Arg Ser Asn Ala Val Arg
1 5 10 15
Arg Lys Thr Thr Pro Ser Tyr Ser Gly Gln Tyr Arg Thr Ala Arg Arg
20 25 30
Ser Val Val Val Gly Pro Pro Asp Asp Ser Asp Asp Ser Leu Gly Tyr
35 40 45
Ile Thr Thr Val Gly Ala Asp Ser Pro Ser Pro Val Tyr Ala Asp Leu
50 55 60
Tyr Phe Glu His Lys Asn Thr Thr Pro Arg Val His Gln Pro Asn Asp
65 70 75 80
Ser Ser Gly Ser Glu Asp Asp Phe Glu Asp Ile Asp Glu Val Val Ala
85 90 95
Ala Phe Arg Glu Ala Arg Leu Arg His Glu Leu Val Glu Asp Ala Val
100 105 110
Tyr Glu Asn Pro Leu Ser Val Glu Lys Pro Ser Arg Ser Phe Thr Lys
115 120 125
Asn Ala Ala Val Lys Pro Lys Leu Glu Asp Ser Pro Lys Arg Ala Pro
130 135 140
Pro Gly Ala Gly Ala Ile Ala Ser Gly Arg Pro Ile Ser Phe Ser Thr
145 150 155 160
Ala Pro Lys Thr Ala Thr Ser Ser Trp Cys Gly Pro Thr Pro Ser Tyr
165 170 175
Asn Lys Arg Val Phe Cys Glu Ala Val Arg Arg Val Ala Ala Met Gln
180 185 190
Ala Gln Lys Ala Ala Glu Ala Ala Trp Asn Ser Asn Pro Pro Arg Asn
195 200 205
Asn Ala Glu Leu Asp Arg Leu Leu Thr Gly Ala Val Ile Arg Ile Thr
210 215 220
Val His Glu Gly Leu Asn Leu Ile Gln Ala Ala Asn Glu Ala Asp Leu
225 230 235 240
Gly Glu Gly Ala Ser Val Ser Lys Arg Gly His Asn Arg Lys Thr Gly
245 250 255
Asp Leu Gln Gly Gly Met Gly Asn Glu Pro Met Tyr Ala Gln Val Arg
260 265 270
Lys Pro Lys Ser Arg Thr Asp Thr Gln Thr Thr Gly Arg Ile Thr Asn
275 280 285
Arg Ser Arg Ala Arg Ser Ala Ser Arg Thr Asp Thr Arg Lys
290 295 300
<210>13
<211>410
<212>PRT
<213>水痘带状疱疹
<400>13
Met Glu Cys Asn Leu Gly Thr Glu His Pro Ser Thr Asp Thr Trp Asn
1 5 10 15
Arg Ser Lys Thr Glu Gln Ala Val Val Asp Ala Phe Asp Glu Ser Leu
20 25 30
Phe Gly Asp Val Ala Ser Asp Ile Gly Phe Glu Thr Ser Leu Tyr Ser
35 40 45
His Ala Val Lys Thr Ala Pro Ser Pro Pro Trp Val Ala Ser Pro Lys
50 55 60
Ile Leu Tyr Gln Gln Leu Ile Arg Asp Leu Asp Phe Ser Glu Gly Pro
65 70 75 80
Arg Leu Leu Ser Cys Leu Glu Thr Trp Asn Glu Asp Leu Phe Ser Cys
85 90 95
Phe Pro Ile Asn Glu Asp Leu Tyr Ser Asp Met Met Val Leu Ser Pro
100 105 110
Asp Pro Asp Asp Val Ile Ser Thr Val Ser Thr Lys Asp His Val Glu
115 120 125
Met Phe Asn Leu Thr Thr Arg Gly Ser Val Arg Leu Pro Ser Pro Pro
130 135 140
Lys Gln Pro Thr Gly Leu Pro Ala Tyr Val Gln Glu Val Gln Asp Ser
145 150 155 160
Phe Thr Val Glu Leu Arg Ala Arg Glu Glu Ala Tyr Thr Lys Leu Leu
165 170 175
Val Thr Tyr Cys Lys Ser Ile Ile Arg Tyr Leu Gln Gly Thr Ala Lys
180 185 190
Arg Thr Thr Ile Gly Leu Asn Ile Gln Asn Pro Asp Gln Lys Ala Tyr
195 200 205
Thr Gln Leu Arg Gln Ser Ile Leu Leu Arg Tyr Tyr Arg Glu Val Ala
210 215 220
Ser Leu Ala Arg Leu Leu Tyr Leu His Leu Tyr Leu Thr Val Thr Arg
225 230 235 240
Glu Phe Ser Trp Arg Leu Tyr Ala Ser Gln Ser Ala His Pro Asp Val
245 250 255
Phe Ala Ala Leu Lys Phe Thr Trp Thr Glu Arg Arg Gln Phe Thr Cys
260 265 270
Ala Phe His Pro Val Leu Cys Asn His Gly Ile Val Leu Leu Glu Gly
275 280 285
Lys Pro Leu Thr Ala Ser Ala Leu Arg Glu Ile Asn Tyr Arg Arg Arg
290 295 300
Glu Leu Gly Leu Pro Leu Val Arg Cys Gly Leu Val Glu Glu Asn Lys
305 310 315 320
Ser Pro Leu Val Gln Gln Pro Ser Phe Ser Val His Leu Pro Arg Ser
325 330 335
Val Gly Phe Leu Thr His His Ile Lys Arg Lys Leu Asp Ala Tyr Ala
340 345 350
Val Lys His Pro Gln Glu Pro Arg His Val Arg Ala Asp His Pro Tyr
355 360 365
Ala Lys Val Val Glu Asn Arg Asn Tyr Gly Ser Ser Ile Glu Ala Met
370 375 380
Ile Leu Ala Pro Pro Ser Pro Ser Glu Ile Leu Pro Gly Asp Pro Pro
385 390 395 400
Arg Pro Pro Thr Cys Gly Phe Leu Thr Arg
405 410
<210>14
<211>819
<212>PRT
<213>水痘带状疱疹
<400>14
Met Gln Ser Gly His Tyr Asn Arg Arg Gln Ser Arg Arg Gln Arg Ile
1 5 10 15
Ser Ser Asn Thr Thr Asp Ser Pro Arg His Thr His Gly Thr Arg Tyr
20 25 30
Arg Ser Thr Asn Trp Tyr Thr His Pro Pro Gln Ile Leu Ser Asn Ser
35 40 45
Glu Thr Leu Val Ala Val Gln Glu Leu Leu Asn Ser Glu Met Asp Gln
50 55 60
Asp Ser Ser Ser Asp Ala Ser Asp Asp Phe Pro Gly Tyr Ala Leu His
65 70 75 80
His Ser Thr Tyr Asn Gly Ser Glu Gln Asn Thr Ser Thr Ser Arg His
85 90 95
Glu Asn Arg Ile Phe Lys Leu Thr Glu Arg Glu Ala Asn Glu Glu Ile
100 105 110
Asn Ile Asn Thr Asp Ala Ile Asp Asp Glu Gly Glu Ala Glu Glu Gly
115 120 125
Glu Ala Glu Glu Asp Ala Ile Asp Asp Glu Gly Glu Ala Glu Glu Gly
130 135 140
Glu Ala Glu Glu Asp Ala Ile Asp Asp Glu Gly Glu Ala Glu Glu Gly
145 150 155 160
Glu Ala Glu Glu Asp Ala Ile Asp Asp Glu Gly Glu Ala Glu Glu Gly
165 170 175
Glu Ala Glu Glu Gly Glu Ala Glu Glu Gly Glu Ala Glu Glu Asp Ala
180 185 190
Ile Asp Asp Glu Gly Glu Ala Glu Glu Asp Ala Ala Glu Glu Asp Ala
195 200 205
Ile Asp Asp Glu Gly Glu Ala Glu Glu Asp Tyr Phe Ser Val Ser Gln
210 215 220
Val Cys Ser Arg Asp Ala Asp Glu Val Tyr Phe Thr Leu Asp Pro Glu
225 230 235 240
Ile Ser Tyr Ser Thr Asp Leu Arg Ile Ala Lys Val Met Glu Pro Ala
245 250 255
Val Ser Lys Glu Leu Asn Val Ser Lys Arg Cys Val Glu Pro Val Thr
260 265 270
Leu Thr Gly Ser Met Leu Ala His Asn Gly Phe Asp Glu Ser Trp Phe
275 280 285
Ala Met Arg Glu Cys Thr Arg Arg Glu Tyr Ile Thr Val Gln Gly Leu
290 295 300
Tyr Asp Pro Ile His Leu Arg Tyr Gln Phe Asp Thr Ser Arg Met Thr
305 310 315 320
Pro Pro Gln Ile Leu Arg Thr Ile Pro Ala Leu Pro Asn Met Thr Leu
325 330 335
Gly Glu Leu Leu Leu Ile Phe Pro Ile Glu Phe Met Ala Gln Pro Ile
340 345 350
Ser Ile Glu Arg Ile Leu Val Glu Asp Val Phe Leu Asp Arg Arg Ala
355 360 365
Ser Ser Lys Thr His Lys Tyr Gly Pro Arg Trp Asn Ser Val Tyr Ala
370 375 380
Leu Pro Tyr Asn Ala Gly Lys Met Tyr Val Gln His Ile Pro Gly Phe
385 390 395 400
Tyr Asp Val Ser Leu Arg Ala Val Gly Gln Gly Thr Ala Ile Trp His
405 410 415
His Met Ile Leu Ser Thr Ala Ala Cys Ala Ile Ser Asn Arg Ile Ser
420 425 430
His Gly Asp Gly Leu Gly Phe Leu Leu Asp Ala Ala Ile Arg Ile Ser
435 440 445
Ala Asn Cys Ile Phe Leu Gly Arg Asn Asp Asn Phe Gly Val Gly Asp
450 455 460
Pro Cys Trp Leu Glu Asp His Leu Ala Gly Leu Pro Arg Glu Ala Val
465 470 475 480
Pro Asp Val Leu Gln Val Thr Gln Leu Val Leu Pro Asn Arg Gly Pro
485 490 495
Thr Val Ala Ile Met Arg Gly Phe Phe Gly Ala Leu Ala Tyr Trp Pro
500 505 510
Glu Leu Arg Ile Ala Ile Ser Glu Pro Ser Thr Ser Leu Val Arg Tyr
515 520 525
Ala Thr Gly His Met Glu Leu Ala Glu Trp Phe Leu Phe Ser Arg Thr
530 535 540
His Ser Leu Lys Pro Gln Phe Thr Pro Thr Glu Arg Glu Met Leu Ala
545 550 555 560
Ser Phe Phe Thr Leu Tyr Val Thr Leu Gly Gly Gly Met Leu Asn Trp
565 570 575
Ile Cys Arg Ala Thr Ala Met Tyr Leu Ala Ala Pro Tyr His Ser Arg
580 585 590
Ser Ala Tyr Ile Ala Val Cys Glu Ser Leu Pro Tyr Tyr Tyr Ile Pro
595 600 605
Val Asn Ser Asp Leu Leu Cys Asp Leu Glu Val Leu Leu Leu Gly Glu
610 615 620
Val Asp Leu Pro Thr Val Cys Glu Ser Tyr Ala Thr Ile Ala His Glu
625 630 635 640
Leu Thr Gly Tyr Glu Ala Val Arg Thr Ala Ala Thr Asn Phe Met Ile
645 650 655
Glu Phe Ala Asp Cys Tyr Lys Glu Ser Glu Thr Asp Leu Met Val Ser
660 665 670
Ala Tyr Leu Gly Ala Val Leu Leu Leu Gln Arg Val Leu Gly His Ala
675 680 685
Asn Leu Leu Leu Leu Leu Leu Ser Gly Ala Ala Leu Tyr Gly Gly Cys
690 695 700
Ser Ile Tyr Ile Pro Arg Gly Ile Leu Asp Ala Tyr Asn Thr Leu Met
705 710 715 720
Leu Ala Ala Ser Pro Leu Tyr Ala His Gln Thr Leu Thr Ser Phe Trp
725 730 735
Lys Asp Arg Asp Asp Ala Met Gln Thr Leu Gly Ile Arg Pro Thr Thr
740 745 750
Asp Val Leu Pro Lys Glu Gln Asp Arg Ile Val Gln Ala Ser Pro Ile
755 760 765
Glu Met Asn Phe Arg Phe Val Gly Leu Glu Thr Ile Tyr Pro Arg Glu
770 775 780
Gln Pro Ile Pro Ser Val Asp Leu Ala Glu Asn Leu Met Gln Tyr Arg
785 790 795 800
Asn Glu Ile Leu Gly Leu Asp Trp Lys Ser Val Ala Met His Leu Leu
805 810 815
Arg Lys Tyr
<210>15
<211>661
<212>PRT
<213>水痘带状疱疹
<400>15
Met Phe Ser Arg Phe Ala Arg Ser Phe Ser Ser Asp Asp Arg Thr Arg
1 5 10 15
Lys Ser Tyr Asp Gly Ser Tyr Gln Ser Phe Asn Ala Gly Glu Arg Asp
20 25 30
Leu Pro Thr Pro Thr Arg Asp Trp Cys Ser Ile Ser Gln Arg Ile Thr
35 40 45
Ser Glu Arg Val Arg Asp Gly Cys Leu Ile Pro Thr Pro Gly Glu Ala
50 55 60
Leu Glu Thr Ala Val Lys Ala Leu Ser Glu Lys Thr Asp Ser Leu Thr
65 70 75 80
Ser Pro Val Leu Gln Ser Thr Glu Arg His Ser Val Leu Leu Gly Leu
85 90 95
His His Asn Asn Val Pro Glu Ser Leu Val Val Ser Cys Met Ser Asn
100 105 110
Asp Val His Asp Gly Phe Met Gln Arg Tyr Met Glu Thr Ile Gln Arg
115 120 125
Cys Leu Asp Asp Leu Lys Leu Ser Gly Asp Gly Leu Trp Trp Val Tyr
130 135 140
Glu Asn Thr Tyr Trp Gln Tyr Leu Lys Tyr Thr Thr Gly Ala Glu Val
145 150 155 160
Pro Val Thr Ser Glu Lys Val Asn Lys Lys Ser Lys Ser Thr Val Leu
165 170 175
Leu Phe Ser Ser Val Val Ala Asn Lys Pro Ile Ser Arg His Pro Phe
180 185 190
Lys Ser Lys Val Ile Asn Ser Asp Tyr Arg Gly Ile Cys Gln Glu Leu
195 200 205
Arg Glu Ala Leu Gly Ala Val Gln Lys Tyr Met Tyr Phe Met Arg Pro
210 215 220
Asp Asp Pro Thr Asn Pro Ser Pro Asp Thr Arg Ile Arg Val Gln Glu
225 230 235 240
Ile Ala Ala Tyr Thr Ala Thr Gly Tyr Gly Trp Met Leu Trp Phe Leu
245 250 255
Asp Val Val Asp Ala Arg Val Cys Arg His Leu Lys Leu Gln Phe Arg
260 265 270
Arg Ile Arg Gly Pro Arg Ala Ser Val Ile Pro Asp Asp Leu Leu Arg
275 280 285
Arg His Leu Lys Thr Gly Pro Ala Val Ser Ala Gly Thr Gly Val Ala
290 295 300
Phe Ile Leu Ala Ala Thr Thr Ala Ser Ala Leu Thr Ala Leu Leu Arg
305 310 315 320
Ile Ser Val Leu Trp Arg Lys Glu Glu Trp Arg Asp Gly Leu Asn Gly
325 330 335
Thr Ala Ala Ala Ile Val Ala Ala Val Glu Leu Ile Thr Leu Leu His
340 345 350
His His Phe Gln Tyr Leu Ile Asn Met Met Leu Ile Gly Tyr Ala Cys
355 360 365
Trp Gly Asp Gly Gly Leu Asn Asp Pro Tyr Ile Leu Lys Ala Leu Arg
370 375 380
Ala Gln Gly Arg Phe Leu Tyr Phe Ala Gly Gln Leu Val Arg Thr Met
385 390 395 400
Ser Thr His Ser Trp Val Val Leu Glu Thr Ser Thr His Met Trp Phe
405 410 415
Ser Arg Ala Val Ala Gln Ser Ile Leu Ala His Gly Gly Lys Pro Thr
420 425 430
Lys Tyr Tyr Ala Gln Val Leu Ala Ala Ser Lys Arg Tyr Thr Pro Leu
435 440 445
His Leu Arg Arg Ile Ser Glu Pro Ser Ser Val Ser Asp Gln Pro Tyr
450 455 460
Ile Arg Phe Asn Arg Leu Gly Ser Pro Ile Gly Thr Gly Ile Gly Asn
465 470 475 480
Leu Glu Cys Val Cys Leu Thr Gly Asn Tyr Leu Ser Asp Asp Val Asn
485 490 495
Ala Ser Ser His Val Ile Asn Thr Glu Ala Pro Leu Asn Ser Ile Ala
500 505 510
Pro Asp Thr Asn Arg Gln Arg Thr Ser Arg Val Leu Val Arg Pro Asp
515 520 525
Thr Gly Leu Asp Val Thr Val Arg Lys Asn His Cys Leu Asp Ile Gly
530 535 540
His Thr Asp Gly Ser Pro Val Asp Pro Thr Tyr Pro Asp His Tyr Thr
545 550 555 560
Arg Ile Lys Ala Glu Tyr Glu Gly Pro Val Arg Asp Glu Ser Asn Thr
565 570 575
Met Phe Asp Gln Arg Ser Asp Leu Arg His Ile Glu Thr Gln Ala Ser
580 585 590
Leu Asn Asp His Val Tyr Glu Asn Ile Pro Pro Lys Glu Val Gly Phe
595 600 605
Asn Ser Ser Ser Asp Leu Asp Val Asp Ser Leu Asn Gly Tyr Thr Ser
610 615 620
Gly Asp Met His Thr Asp Asp Asp Leu Ser Pro Asp Phe Ile Pro Asn
625 630 635 640
Asp Val Pro Val Arg Cys Lys Thr Thr Val Thr Phe Arg Lys Asn Thr
645 650 655
Pro Lys Ser His His
660
<210>16
<211>301
<212>PRT
<213>水痘带状疱疹
<400>16
Met Gly Asp Leu Ser Cys Trp Thr Lys Val Pro Gly Phe Thr Leu Thr
1 5 10 15
Gly Glu Leu Gln Tyr Leu Lys Gln Val Asp Asp Ile Leu Arg Tyr Gly
20 25 30
Val Arg Lys Arg Asp Arg Thr Gly Ile Gly Thr Leu Ser Leu Phe Gly
35 40 45
Met Gln Ala Arg Tyr Asn Leu Arg Asn Glu Phe Pro Leu Leu Thr Thr
50 55 60
Lys Arg Val Phe Trp Arg Ala Val Val Glu Glu Leu Leu Trp Phe Ile
65 70 75 80
Arg Gly Ser Thr Asp Ser Lys Glu Leu Ala Ala Lys Asp Ile His Ile
85 90 95
Trp Asp Ile Tyr Gly Ser Ser Lys Phe Leu Asn Arg Asn Gly Phe His
100 105 110
Lys Arg His Thr Gly Asp Leu Gly Pro Ile Tyr Gly Phe Gln Trp Arg
115 120 125
His Phe Gly Ala Glu Tyr Lys Asp Cys Gln Ser Asn Tyr Leu Gln Gln
130 135 140
Gly Ile Asp Gln Leu Gln Thr Val Ile Asp Thr Ile Lys Thr Asn Pro
145 150 155 160
Glu Ser Arg Arg Met Ile Ile Ser Ser Trp Asn Pro Lys Asp Ile Pro
165 170 175
Leu Met Val Leu Pro Pro Cys His Thr Leu Cys Gln Phe Tyr Val Ala
180 185 190
Asn Gly Glu Leu Ser Cys Gln Val Tyr Gln Arg Ser Gly Asp Met Gly
195 200 205
Leu Gly Val Pro Phe Asn Ile Ala Gly Tyr Ala Leu Leu Thr Tyr Ile
210 215 220
Val Ala His Val Thr Gly Leu Lys Thr Gly Asp Leu Ile His Thr Met
225 230 235 240
Gly Asp Ala His Ile Tyr Leu Asn His Ile Asp Ala Leu Lys Val Gln
245 250 255
Leu Ala Arg Ser Pro Lys Pro Phe Pro Cys Leu Lys Ile Ile Arg Asn
260 265 270
Val Thr Asp Ile Asn Asp Phe Lys Trp Asp Asp Phe Gln Leu Asp Gly
275 280 285
Tyr Asn Pro His Pro Pro Leu Lys Met Glu Met Ala Leu
290 295 300
<210>17
<211>455
<212>PRT
<213>水痘带状疱疹
<400>17
Met Gly Leu Phe Gly Leu Thr Arg Phe Ile His Glu His Lys Leu Val
1 5 10 15
Lys Pro Ser Ile Ile Ser Thr Pro Pro Gly Val Leu Thr Pro Val Ala
20 25 30
Val Asp Val Trp Asn Val Met Tyr Thr Leu Leu Glu Arg Leu Tyr Pro
35 40 45
Val Gly Lys Arg Glu Asn Leu His Gly Pro Ser Val Thr Ile His Cys
50 55 60
Leu Gly Val Leu Leu Arg Leu Leu Thr Gln Arg Ser Tyr Tyr Pro Ile
65 70 75 80
Phe Val Leu Glu Arg Cys Thr Asp Gly Pro Leu Ser Arg Gly Ala Lys
85 90 95
Ala Ile Met Ser Arg Ala Met Asn His Asp Glu Arg Gly Thr Ser Asp
100 105 110
Leu Thr Arg Val Leu Leu Ser Ser Asn Thr Ser Cys Ser Ile Lys Tyr
115 120 125
Asn Lys Thr Ser Glu Thr Tyr Asp Ser Val Phe Arg Asn Ser Ser Thr
130 135 140
Ser Cys Ile Pro Ser Glu Glu Asn Lys Ser Gln Asp Met Phe Leu Asp
145 150 155 160
Gly Cys Pro Arg Gln Thr Asp Lys Thr Ile Cys Leu Arg Asp Gln Asn
165 170 175
Val Cys Ser Leu Thr Ser Thr Met Pro Ser Arg Gly His Pro Asn His
180 185 190
Arg Leu Tyr His Lys Leu Cys Ala Ser Leu Ile Arg Trp Met Gly Tyr
195 200 205
Ala Tyr Val Glu Ala Val Asp Ile Glu Ala Asp Glu Ala Cys Ala Asn
210 215 220
Leu Phe His Thr Arg Thr Val Ala Leu Val Tyr Thr Thr Asp Thr Asp
225 230 235 240
Leu Leu Phe Met Gly Cys Asp Ile Leu Leu Asp Ala Ile Pro Met Phe
245 250 255
Ala Pro Val Val Arg Cys Arg Asp Leu Leu Gln Tyr Leu Gly Ile Thr
260 265 270
Tyr Pro Glu Phe Leu Val Ala Phe Val Arg Cys Gln Thr Asp Leu His
275 280 285
Thr Ser Asp Asn Leu Lys Ser Val Gln Gln Val Ile Gln Asp Thr Gly
290 295 300
Leu Lys Val Pro His Gln Met Asp Thr Ser Thr Arg Ser Pro Thr Tyr
305 310 315 320
Asp Ser Trp Arg His Gly Glu Val Phe Lys Ser Leu Thr Val Ala Thr
325 330 335
Ser Gly Lys Thr Glu Asn Gly Val Ser Val Ser Lys Tyr Ala Ser Asn
340 345 350
Arg Ser Glu Val Thr Val Asp Ala Ser Trp Ala Leu Asn Leu Leu Pro
355 360 365
Pro Ser Ser Ser Pro Leu Asp Asn Leu Glu Arg Ala Phe Val Glu His
370 375 380
Ile Ile Ala Val Val Thr Pro Leu Thr Arg Gly Arg Leu Lys Leu Met
385 390 395 400
Lys Arg Val Asn Ile Met Gln Asn Thr Ala Asp Pro Tyr Met Val Ile
405 410 415
Asn Thr Leu Tyr His Asn Leu Lys Gly Glu Lys Met Ala Arg Gln Tyr
420 425 430
Ala Arg Ile Phe Lys Gln Phe Ile Pro Thr Pro Leu Pro Leu Asn Thr
435 440 445
Val Leu Thr Lys Tyr Trp Asn
450 455
<210>18
<211>1038
<212>PRT
<213>水痘带状疱疹
<400>18
Met Glu Glu Pro Ile Cys Tyr Asp Thr Gln Lys Leu Leu Asp Asp Leu
1 5 10 15
Ser Asn Leu Lys Val Gln Glu Ala Asp Asn Glu Arg Pro Trp Ser Pro
20 25 30
Glu Lys Thr Glu Ile Ala Arg Val Lys Val Val Lys Phe Leu Arg Ser
35 40 45
Thr Gln Lys Ile Pro Ala Lys His Phe Ile Gln Ile Trp Glu Pro Leu
50 55 60
His Ser Asn Ile Cys Phe Val Tyr Ser Asn Thr Phe Leu Ala Glu Ala
65 70 75 80
Ala Phe Thr Ala Glu Asn Leu Pro Gly Leu Leu Phe Trp Arg Leu Asp
85 90 95
Leu Asp Trp Thr Ile Glu Glu Pro Gly Asn Ser Leu Lys Ile Leu Thr
100 105 110
Gln Leu Ser Ser Val Val Gln Asp Ser Glu Thr Leu His Arg Leu Ser
115 120 125
Ala Asn Lys Leu Arg Thr Ser Ser Lys Phe Gly Pro Val Ser Ile His
130 135 140
Phe Ile Ile Thr Asp Trp Ile Asn Met Tyr Glu Val Ala Leu Lys Asp
145 150 155 160
Ala Thr Thr Ala Ile Glu Ser Pro Phe Thr His Ala Arg Ile Gly Met
165 170 175
Leu Glu Ser Ala Ile Ala Ala Leu Thr Gln His Lys Phe Ala Ile Ile
180 185 190
Tyr Asp Met Pro Phe Val Gln Glu Gly Ile Arg Val Leu Thr Gln Tyr
195 200 205
Ala Gly Trp Leu Leu Pro Phe Asn Val Met Trp Asn Gln Ile Gln Asn
210 215 220
Ser Ser Leu Thr Pro Leu Thr Arg Ala Leu Phe Ile Ile Cys Met Ile
225 230 235 240
Asp Glu Tyr Leu Thr Glu Thr Pro Val His Ser Ile Ser Glu Leu Phe
245 250 255
Ala Asp Thr Val Asn Leu Ile Lys Asp Glu Ala Phe Val Ser Ile Glu
260 265 270
Glu Ala Val Thr Asn Pro Arg Thr Val His Glu Ser Arg Ile Ser Ser
275 280 285
Ala Leu Ala Tyr Arg Asp Pro Tyr Val Phe Glu Thr Ser Pro Gly Met
290 295 300
Leu Ala Arg Arg Leu Arg Leu Asp Asn Gly Ile Trp Glu Ser Asn Leu
305 310 315 320
Leu Ser Leu Ser Thr Pro Gly Ile His Ile Glu Ala Leu Leu His Leu
325 330 335
Leu Asn Ser Asp Pro Glu Ala Glu Thr Thr Ser Gly Ser Asn Val Ala
340 345 350
Glu His Thr Arg Gly Ile Trp Glu Lys Val Gln Ala Ser Thr Ser Pro
355 360 365
Ser Met Leu Ile Ser Thr Leu Ala Glu Ser Gly Phe Thr Arg Phe Ser
370 375 380
Cys Lys Leu Leu Arg Arg Phe Ile Ala His His Thr Leu Ala Gly Phe
385 390 395 400
Ile His Gly Ser Val Val Ala Asp Glu His Ile Thr Asp Phe Gln Gln
405 410 415
Thr Leu Gly Cys Leu Ala Leu Val Gly Gly Leu Ala Tyr Gln Leu Val
420 425 430
Glu Thr Tyr Ala Pro Thr Thr Glu Tyr Val Leu Thr Tyr Thr Arg Thr
435 440 445
Val Asn Glu Thr Glu Lys Arg Tyr Glu Thr Leu Leu Pro Ala Leu Gly
450 455 460
Leu Pro Pro Gly Gly Leu Gly Gln Ile Met Arg Arg Cys Phe Ala Pro
465 470 475 480
Arg Pro Leu Ile Glu Ser Ile Gln Ala Thr Arg Val Ile Leu Leu Asn
485 490 495
Glu Ile Ser His Ala Glu Ala Arg Glu Thr Thr Tyr Phe Lys Gln Thr
500 505 510
His Asn Gln Ser Ser Gly Ala Leu Leu Pro Gln Ala Gly Gln Ser Ala
515 520 525
Val Arg Glu Ala Val Leu Thr Trp Phe Asp Leu Arg Met Asp Ser Arg
530 535 540
Trp Gly Ile Thr Pro Pro Val Asp Val Gly Met Thr Pro Pro Ile Cys
545 550 555 560
Val Asp Pro Pro Ala Thr Gly Leu Glu Ala Val Met Ile Thr Glu Ala
565 570 575
Leu Lys Ile Ala Tyr Pro Thr Glu Tyr Asn Arg Ser Ser Val Phe Val
580 585 590
Glu Pro Ser Phe Val Pro Tyr Ile Ile Ala Thr Ser Thr Leu Asp Ala
595 600 605
Leu Ser Ala Thr Ile Ala Leu Ser Phe Asp Thr Arg Gly Ile Gln Gln
610 615 620
Ala Leu Ser Ile Leu Gln Trp Ala Arg Asp Tyr Gly Ser Gly Thr Val
625 630 635 640
Pro Asn Ala Asp Gly Tyr Arg Thr Lys Leu Ser Ala Leu Ile Thr Ile
645 650 655
Leu Glu Pro Phe Thr Arg Thr His Pro Pro Val Leu Leu Pro Ser His
660 665 670
Val Ser Thr Ile Asp Ser Leu Ile Cys Glu Leu His Arg Thr Val Gly
675 680 685
Ile Ala Val Asp Leu Leu Pro Gln His Val Arg Pro Leu Val Pro Asp
690 695 700
Arg Pro Ser Ile Thr Asn Ser Val Phe Leu Ala Thr Leu Tyr Tyr Asp
705 710 715 720
Glu Leu Tyr Gly Arg Trp Thr Arg Leu Asp Lys Thr Ser Gln Ala Leu
725 730 735
Val Glu Asn Phe Thr Ser Asn Ala Leu Val Val Ser Arg Tyr Met Leu
740 745 750
Met Leu Gln Lys Phe Phe Ala Cys Arg Phe Tyr Pro Thr Pro Asp Leu
755 760 765
Gln Ala Val Gly Ile Cys Asn Pro Lys Val Glu Arg Asp Glu Gln Phe
770 775 780
Gly Val Trp Arg Leu Asn Asp Leu Ala Asp Ala Val Gly His Ile Val
785 790 795 800
Gly Thr Ile Gln Gly Ile Arg Thr Gln Met Arg Val Gly Ile Ser Ser
805 810 815
Leu Arg Thr Ile Met Ala Asp Ala Ser Ser Ala Leu Arg Glu Cys Glu
820 825 830
Asn Leu Met Thr Lys Thr Ser Thr Ser Ala Ile Gly Pro Leu Phe Ser
835 840 845
Thr Met Ala Ser Arg Tyr Ala Arg Phe Thr Gln Asp Gln Met Asp Ile
850 855 860
Leu Met Arg Val Asp Lys Leu Thr Thr Gly Glu Asn Ile Pro Gly Leu
865 870 875 880
Ala Asn Val Glu Ile Phe Leu Asn Arg Trp Glu Arg Ile Ala Thr Ala
885 890 895
Cys Arg His Ala Thr Ala Val Pro Ser Ala Glu Ser Ile Ala Thr Val
900 905 910
Cys Asn Glu Leu Arg Arg Gly Leu Lys Asn Ile Gln Glu Asp Arg Val
915 920 925
Asn Ala Pro Thr Ser Tyr Met Ser His Ala Arg Asn Leu Glu Asp His
930 935 940
Lys Ala Ala Val Ser Phe Val Met Asp Ser Arg Gln Gln Phe Ile Val
945 950 955 960
Asp Ser Gly Pro Gln Met Gly Ala Val Leu Thr Ser Gln Cys Asn Ile
965 970 975
Gly Thr Trp Glu Asn Val Asn Ala Thr Phe Leu His Asp Asn Val Lys
980 985 990
Ile Thr Thr Thr Val Arg Asp Val Ile Ser Glu Ala Pro Thr Leu Ile
995 1000 1005
Ile Gly Gln Arg Trp Leu Arg Pro Asp Glu Ile Leu Ser Asn Val
1010 1015 1020
Asp Leu Arg Leu Gly Val Pro Gly Asn Thr Ser Gly Ser Asp Pro
1025 1030 1035
<210>19
<211>2763
<212>PRT
<213>水痘带状疱疹
<400>19
Met Asp Ile Ile Pro Pro Ile Ala Val Thr Val Ala Gly Val Gly Ser
1 5 10 15
Arg Asn Gln Phe Asp Gly Ala Leu Gly Pro Ala Ser Gly Leu Ser Cys
20 25 30
Leu Arg Thr Ser Leu Ser Phe Leu His Met Thr Tyr Ala His Gly Ile
35 40 45
Asn Ala Thr Leu Ser Ser Asp Met Ile Asp Gly Cys Leu Gln Glu Gly
50 55 60
Ala Ala Trp Thr Thr Asp Leu Ser Asn Met Gly Arg Gly Val Pro Asp
65 70 75 80
Met Cys Ala Leu Val Asp Leu Pro Asn Arg Ile Ser Tyr Ile Lys Leu
85 90 95
Gly Asp Thr Thr Ser Thr Cys Cys Val Leu Ser Arg Ile Tyr Gly Asp
100 105 110
Ser His Phe Phe Thr Val Pro Asp Glu Gly Phe Met Cys Thr Gln Ile
115 120 125
Pro Ala Arg Ala Phe Phe Asp Asp Val Trp Met Gly Arg Glu Glu Ser
130 135 140
Tyr Thr Ile Ile Thr Val Asp Ser Thr Gly Met Ala Ile Tyr Arg Gln
145 150 155 160
Gly Asn Ile Ser Phe Ile Phe Asp Pro His Gly His Gly Thr Ile Gly
165 170 175
Gln Ala Val Val Val Arg Val Asn Thr Thr Asp Val Tyr Ser Tyr Ile
180 185 190
Ala Ser Glu Tyr Thr His Arg Pro Asp Asn Val Glu Ser Gln Trp Ala
195 200 205
Ala Ala Leu Val Phe Phe Val Thr Ala Asn Asp Gly Pro Val Ser Glu
210 215 220
Glu Ala Leu Ser Ser Ala Val Thr Leu Ile Tyr Gly Ser Cys Asp Thr
225 230 235 240
Tyr Phe Thr Asp Glu Gln Tyr Cys Glu Lys Leu Val Thr Ala Gln His
245 250 255
Pro Leu Leu Leu Ser Pro Pro Asn Ser Thr Thr Ile Val Leu Asn Lys
260 265 270
Ser Ser Ile Val Pro Leu His Gln Asn Val Gly Glu Ser Val Ser Leu
275 280 285
Glu Ala Thr Leu His Ser Thr Leu Thr Asn Thr Val Ala Leu Asp Pro
290 295 300
Arg Cys Ser Tyr Ser Glu Val Asp Pro Trp His Ala Val Leu Glu Thr
305 310 315 320
Thr Ser Thr Gly Ser Gly Val Leu Asp Cys Arg Arg Arg Arg Arg Pro
325 330 335
Ser Trp Thr Pro Pro Ser Ser Glu Glu Asn Leu Ala Cys Ile Asp Asp
340 345 350
Gly Leu Val Asn Asn Thr His Ser Thr Asp Asn Leu His Lys Pro Ala
355 360 365
Lys Lys Val Leu Lys Phe Lys Pro Thr Val Asp Val Pro Asp Lys Thr
370 375 380
Gln Val Ala His Val Leu Pro Arg Leu Arg Glu Val Ala Asn Thr Pro
385 390 395 400
Asp Val Val Leu Asn Val Ser Asn Val Asp Thr Pro Glu Ser Ser Pro
405 410 415
Thr Phe Ser Arg Asn Met Asn Val Gly Ser Ser Leu Lys Asp Arg Lys
420 425 430
Pro Phe Leu Phe Glu Gln Ser Gly Asp Val Asn Met Val Val Glu Lys
435 440 445
Leu Leu Gln His Gly His Glu Ile Ser Asn Gly Tyr Val Gln Asn Ala
450 455 460
Val Gly Thr Leu Asp Thr Val Ile Thr Gly His Thr Asn Val Pro Ile
465 470 475 480
Trp Val Thr Arg Pro Leu Val Met Pro Asp Glu Lys Asp Pro Leu Glu
485 490 495
Leu Phe Ile Asn Leu Thr Ile Leu Arg Leu Thr Gly Phe Val Val Glu
500 505 510
Asn Gly Thr Arg Thr His His Gly Ala Thr Ser Val Val Ser Asp Phe
515 520 525
Ile Gly Pro Leu Gly Glu Ile Leu Thr Gly Phe Pro Ser Ala Ala Glu
530 535 540
Leu Ile Arg Val Thr Ser Leu Ile Leu Thr Asn Met Pro Gly Ala Glu
545 550 555 560
Tyr Ala Ile Lys Thr Val Leu Arg Lys Lys Cys Thr Ile Gly Met Leu
565 570 575
Ile Ile Ala Lys Phe Gly Leu Val Ala Met Arg Val Gln Asp Thr Thr
580 585 590
Gly Ala Leu His Ala Glu Leu Asp Val Leu Glu Ala Asp Leu Gly Gly
595 600 605
Ser Ser Pro Ile Asp Leu Tyr Ser Arg Leu Ser Thr Gly Leu Ile Ser
610 615 620
Ile Leu Asn Ser Pro Ile Ile Ser His Pro Gly Leu Phe Ala Glu Leu
625 630 635 640
Ile Pro Thr Arg Thr Gly Ser Leu Ser Glu Arg Ile Arg Leu Leu Cys
645 650 655
Glu Leu Val Ser Ala Arg Glu Thr Arg Tyr Met Arg Glu His Thr Ala
660 665 670
Leu Val Ser Ser Val Lys Ala Leu Glu Asn Ala Leu Arg Ser Thr Arg
675 680 685
Asn Lys Ile Asp Ala Ile Gln Ile Pro Glu Val Pro Gln Glu Pro Pro
690 695 700
Glu Glu Thr Asp Ile Pro Pro Glu Glu Leu Ile Arg Arg Val Tyr Glu
705 710 715 720
Ile Arg Ser Glu Val Thr Met Leu Leu Thr Ser Ala Val Thr Glu Tyr
725 730 735
Phe Thr Arg Gly Val Leu Tyr Ser Thr Arg Ala Leu Ile Ala Glu Gln
740 745 750
Ser Pro Arg Arg Phe Arg Val Ala Thr Ala Ser Thr Ala Pro Ile Gln
755 760 765
Arg Leu Leu Asp Ser Leu Pro Glu Phe Asp Ala Lys Leu Thr Ala Ile
770 775 780
Ile Ser Ser Leu Ser Ile His Pro Pro Pro Glu Thr Ile Gln Asn Leu
785 790 795 800
Pro Val Val Ser Leu Leu Lys Glu Leu Ile Lys Glu Gly Glu Asp Leu
805 810 815
Asn Thr Asp Thr Ala Leu Val Ser Trp Leu Ser Val Val Gly Glu Ala
820 825 830
Gln Thr Ala Gly Tyr Leu Ser Arg Arg Glu Phe Asp Glu Leu Ser Arg
835 840 845
Thr Ile Lys Thr Ile Asn Thr Arg Ala Thr Gln Arg Ala Ser Ala Glu
850 855 860
Ala Glu Leu Ser Cys Phe Asn Thr Leu Ser Ala Ala Val Asp Gln Ala
865 870 875 880
Val Lys Asp Tyr Glu Thr Tyr Asn Asn Gly Glu Val Lys Tyr Pro Glu
885 890 895
Ile Thr Arg Asp Asp Leu Leu Ala Thr Ile Val Arg Ala Thr Asp Asp
900 905 910
Leu Val Arg Gln Ile Lys Ile Leu Ser Asp Pro Met Ile Gln Ser Gly
915 920 925
Leu Gln Pro Ser Ile Lys Arg Arg Leu Glu Thr Arg Leu Lys Glu Val
930 935 940
Gln Thr Tyr Ala Asn Glu Ala Arg Thr Thr Gln Asp Thr Ile Lys Ser
945 950 955 960
Arg Lys Gln Ala Ala Tyr Asn Lys Leu Gly Gly Leu Leu Arg Pro Val
965 970 975
Thr Gly Phe Val Gly Leu Arg Ala Ala Val Asp Leu Leu Pro Glu Leu
980 985 990
Ala Ser Glu Leu Asp Val Gln Gly Ala Leu Val Asn Leu Arg Thr Lys
995 1000 1005
Val Leu Glu Ala Pro Val Glu Ile Arg Ser Gln Leu Thr Gly Asp
1010 1015 1020
Phe Trp Ala Leu Phe Asn Gln Tyr Arg Asp Ile Leu Glu His Pro
1025 1030 1035
Gly Asn Ala Arg Thr Ser Val Leu Gly Gly Leu Gly Ala Cys Phe
1040 1045 1050
Thr Ala Ile Ile Glu Ile Val Pro Ile Pro Thr Glu Tyr Arg Pro
1055 1060 1065
Ser Leu Leu Ala Phe Phe Gly Asp Val Ala Asp Val Leu Ala Ser
1070 1075 1080
Asp Ile Ala Thr Val Ser Thr Asn Pro Glu Ser Glu Ser Ala Ile
1085 1090 1095
Asn Ala Val Val Ala Thr Leu Ser Lys Ala Thr Leu Val Ser Ser
1100 1105 1110
Thr Val Pro Ala Leu Ser Phe Val Leu Ser Leu Tyr Lys Lys Tyr
1115 1120 1125
Gln Ala Leu Gln Gln Glu Ile Thr Asn Thr His Lys Leu Thr Glu
1130 1135 1140
Leu Gln Lys Gln Leu Gly Asp Asp Phe Ser Thr Leu Ala Val Ser
1145 1150 1155
Ser Gly His Leu Lys Phe Ile Ser Ser Ser Asn Val Asp Asp Tyr
1160 1165 1170
Glu Ile Asn Asp Ala Ile Leu Ser Ile Gln Thr Asn Val His Ala
1175 1180 1185
Leu Met Asp Thr Val Lys Leu Val Glu Val Glu Leu Gln Lys Leu
1190 1195 1200
Pro Pro His Cys Ile Ala Gly Thr Ser Thr Leu Ser Arg Val Val
1205 1210 1215
Lys Asp Leu His Lys Leu Val Thr Met Ala His Glu Lys Lys Glu
1220 1225 1230
Gln Ala Lys Val Leu Ile Thr Asp Cys Glu Arg Ala His Lys Gln
1235 1240 1245
Gln Thr Thr Arg Val Leu Tyr Glu Arg Trp Thr Arg Asp Ile Ile
1250 1255 1260
Ala Cys Leu Glu Ala Met Glu Thr Arg His Ile Phe Asn Gly Thr
1265 1270 1275
Glu Leu Ala Arg Leu Arg Asp Met Ala Ala Ala Gly Gly Phe Asp
1280 1285 1290
Ile His Ala Val Tyr Pro Gln Ala Arg Gln Val Val Ala Ala Cys
1295 1300 1305
Glu Thr Thr Ala Val Thr Ala Leu Asp Thr Val Phe Arg His Asn
1310 1315 1320
Pro Tyr Thr Pro Glu Asn Thr Asn Ile Pro Pro Pro Leu Ala Leu
1325 1330 1335
Leu Arg Gly Leu Thr Trp Phe Asp Asp Phe Ser Ile Thr Ala Pro
1340 1345 1350
Val Phe Thr Val Met Phe Pro Gly Val Ser Ile Glu Gly Leu Leu
1355 1360 1365
Leu Leu Met Arg Ile Arg Ala Val Val Leu Leu Ser Ala Asp Thr
1370 1375 1380
Ser Ile Asn Gly Ile Pro Asn Tyr Arg Asp Met Ile Leu Arg Thr
1385 1390 1395
Ser Gly Asp Leu Leu Gln Ile Pro Ala Leu Ala Gly Tyr Val Asp
1400 1405 1410
Phe Tyr Thr Arg Ser Tyr Asp Gln Phe Ile Thr Glu Ser Val Thr
1415 1420 1425
Leu Ser Glu Leu Arg Ala Asp Ile Arg Gln Ala Ala Gly Ala Lys
1430 1435 1440
Leu Thr Glu Ala Asn Lys Ala Leu Glu Glu Val Thr His Val Arg
1445 1450 1455
Ala His Glu Thr Ala Lys Leu Ala Leu Lys Glu Gly Val Phe Ile
1460 1465 1470
Thr Leu Pro Ser Glu Gly Leu Leu Ile Arg Ala Ile Glu Tyr Phe
1475 1480 1485
Thr Thr Phe Asp His Lys Arg Phe Ile Gly Thr Ala Tyr Glu Arg
1490 1495 1500
Val Leu Gln Thr Met Val Asp Arg Asp Leu Lys Glu Ala Asn Ala
1505 1510 1515
Glu Leu Ala Gln Phe Arg Met Val Cys Gln Ala Thr Lys Asn Arg
1520 1525 1530
Ala Ile Gln Ile Leu Gln Asn Ile Val Asp Thr Ala Asn Ala Thr
1535 1540 1545
Glu Gln Gln Glu Asp Val Asp Phe Thr Asn Leu Lys Thr Leu Leu
1550 1555 1560
Lys Leu Thr Pro Pro Pro Lys Thr Ile Ala Leu Ala Ile Asp Arg
1565 1570 1575
Ser Thr Ser Val Gln Asp Ile Val Thr Gln Phe Ala Leu Leu Leu
1580 1585 1590
Gly Arg Leu Glu Glu Glu Thr Gly Thr Leu Asp Ile Gln Ala Val
1595 1600 1605
Asp Trp Met Tyr Gln Ala Arg Asn Ile Ile Asp Ser His Pro Leu
1610 1615 1620
Ser Val Arg Ile Asp Gly Thr Gly Pro Leu His Thr Tyr Lys Asp
1625 1630 1635
Arg Val Asp Lys Leu Tyr Ala Leu Arg Thr Lys Leu Asp Leu Leu
1640 1645 1650
Arg Arg Arg Ile Glu Thr Gly Glu Val Thr Trp Asp Asp Ala Trp
1655 1660 1665
Thr Thr Phe Lys Arg Glu Thr Gly Asp Met Leu Ala Ser Gly Asp
1670 1675 1680
Thr Tyr Ala Thr Ser Val Asp Ser Ile Lys Ala Leu Gln Ala Ser
1685 1690 1695
Ala Ser Val Val Asp Met Leu Cys Ser Glu Pro Glu Phe Phe Leu
1700 1705 1710
Leu Pro Val Glu Thr Lys Asn Arg Leu Gln Lys Lys Gln Gln Glu
1715 1720 1725
Arg Lys Thr Ala Leu Asp Val Val Leu Gln Lys Gln Arg Gln Phe
1730 1735 1740
Glu Glu Thr Ala Ser Arg Leu Arg Ala Leu Ile Glu Arg Ile Pro
1745 1750 1755
Thr Glu Ser Asp His Asp Val Leu Arg Met Leu Leu Arg Asp Phe
1760 1765 1770
Asp Gln Phe Thr His Leu Pro Ile Trp Ile Lys Thr Gln Tyr Met
1775 1780 1785
Thr Phe Arg Asn Leu Leu Met Val Arg Leu Gly Leu Tyr Ala Ser
1790 1795 1800
Tyr Ala Glu Ile Phe Pro Pro Ala Ser Pro Asn Gly Val Phe Ala
1805 1810 1815
Pro Ile Pro Ala Met Ser Gly Val Cys Leu Glu Asp Gln Ser Arg
1820 1825 1830
Cys Ile Arg Ala Arg Val Ala Ala Phe Met Gly Glu Ala Ser Val
1835 1840 1845
Val Gln Thr Phe Arg Glu Ala Arg Ser Ser Ile Asp Ala Leu Phe
1850 1855 1860
Gly Lys Asn Leu Thr Phe Tyr Leu Asp Thr Asp Gly Val Pro Leu
1865 1870 1875
Arg Tyr Arg Val Cys Tyr Lys Ser Val Gly Val Lys Leu Gly Thr
1880 1885 1890
Met Leu Cys Ser Gln Gly Gly Leu Ser Leu Arg Pro Ala Leu Pro
1895 1900 1905
Asp Glu Gly Ile Val Glu Glu Thr Thr Leu Ser Ala Leu Arg Val
1910 1915 1920
Ala Asn Glu Val Asn Glu Leu Arg Ile Glu Tyr Glu Ser Ala Ile
1925 1930 1935
Lys Ser Gly Phe Ser Ala Phe Ser Thr Phe Val Arg His Arg His
1940 1945 1950
Ala Glu Trp Gly Lys Thr Asn Ala Arg Arg Ala Ile Ala Glu Ile
1955 1960 1965
Tyr Ala Gly Leu Ile Thr Thr Thr Leu Thr Arg Gln Tyr Gly Val
1970 1975 1980
His Trp Asp Lys Leu Ile Tyr Ser Phe Glu Lys His His Leu Thr
1985 1990 1995
Ser Val Met Gly Asn Gly Leu Thr Lys Pro Ile Gln Arg Arg Gly
2000 2005 2010
Asp Val Arg Val Leu Glu Leu Thr Leu Ser Asp Ile Val Thr Ile
2015 2020 2025
Leu Val Ala Thr Thr Pro Val His Leu Leu Asn Phe Ala Arg Leu
2030 2035 2040
Asp Leu Ile Lys Gln His Glu Tyr Met Ala Arg Thr Leu Arg Pro
2045 2050 2055
Val Ile Glu Ala Ala Phe Arg Gly Arg Leu Leu Val Arg Ser Leu
2060 2065 2070
Asp Gly Asp Pro Lys Gly Asn Ala Arg Ala Phe Phe Asn Ala Ala
2075 2080 2085
Pro Ser Lys His Lys Leu Pro Leu Ala Leu Gly Ser Asn Gln Asp
2090 2095 2100
Pro Thr Gly Gly Arg Ile Phe Ala Phe Arg Met Ala Asp Trp Lys
2105 2110 2115
Leu Val Lys Met Pro Gln Lys Ile Thr Asp Pro Phe Ala Pro Trp
2120 2125 2130
Gln Leu Ser Pro Pro Pro Gly Val Lys Ala Asn Val Asp Ala Val
2135 2140 2145
Thr Arg Ile Met Ala Thr Asp Arg Leu Ala Thr Ile Thr Val Leu
2150 2155 2160
Gly Arg Met Cys Leu Pro Pro Ile Ser Leu Val Ser Met Trp Asn
2165 2170 2175
Thr Leu Gln Pro Glu Glu Phe Ala Tyr Arg Thr Gln Asp Asp Val
2180 2185 2190
Asp Ile Ile Val Asp Ala Arg Leu Asp Leu Ser Ser Thr Leu Asn
2195 2200 2205
Ala Arg Phe Asp Thr Ala Pro Ser Asn Thr Thr Leu Glu Trp Asn
2210 2215 2220
Thr Asp Arg Lys Val Ile Thr Asp Ala Tyr Ile Gln Thr Gly Ala
2225 2230 2235
Thr Thr Val Phe Thr Val Thr Gly Ala Ala Pro Thr His Val Ser
2240 2245 2250
Asn Val Thr Ala Phe Asp Ile Ala Thr Thr Ala Ile Leu Phe Gly
2255 2260 2265
Ala Pro Leu Val Ile Ala Met Glu Leu Thr Ser Val Phe Ser Gln
2270 2275 2280
Asn Ser Gly Leu Thr Leu Gly Leu Lys Leu Phe Asp Ser Arg His
2285 2290 2295
Met Ala Thr Asp Ser Gly Ile Ser Ser Ala Val Ser Pro Asp Ile
2300 2305 2310
Val Ser Trp Gly Leu Arg Leu Leu His Met Asp Pro His Pro Ile
2315 2320 2325
Glu Asn Ala Cys Leu Ile Val Gln Leu Glu Lys Leu Ser Ala Leu
2330 2335 2340
Ile Ala Asn Lys Pro Leu Thr Asn Asn Pro Pro Cys Leu Leu Leu
2345 2350 2355
Leu Asp Glu His Met Asn Pro Ser Tyr Val Leu Trp Glu Arg Lys
2360 2365 2370
Asp Ser Ile Pro Ala Pro Asp Tyr Val Val Phe Trp Gly Pro Glu
2375 2380 2385
Ser Leu Ile Asp Leu Pro Tyr Ile Asp Ser Asp Glu Asp Ser Phe
2390 2395 2400
Pro Ser Cys Pro Asp Asp Pro Phe Tyr Ser Gln Ile Ile Ala Gly
2405 2410 2415
Tyr Ala Pro Gln Gly Pro Pro Asn Leu Asp Thr Thr Asp Phe Tyr
2420 2425 2430
Pro Thr Glu Pro Leu Phe Lys Ser Pro Val Gln Val Val Arg Ser
2435 2440 2445
Ser Lys Cys Lys Lys Met Pro Val Arg Pro Ala Gln Pro Ala Gln
2450 2455 2460
Pro Ala Gln Pro Ala Gln Pro Ala Gln Thr Val Gln Pro Ala Gln
2465 2470 2475
Pro Ile Glu Pro Gly Thr Gln Ile Val Val Gln Asn Phe Lys Lys
2480 2485 2490
Pro Gln Ser Val Lys Thr Thr Leu Ser Gln Lys Asp Ile Pro Leu
2495 2500 2505
Tyr Val Glu Thr Glu Ser Glu Thr Ala Val Leu Ile Pro Lys Gln
2510 2515 2520
Leu Thr Thr Ser Ile Lys Thr Thr Val Cys Lys Ser Ile Thr Pro
2525 2530 2535
Pro Asn Asn Gln Leu Ser Asp Trp Lys Asn Asn Pro Gln Gln Asn
2540 2545 2550
Gln Thr Leu Asn Gln Ala Phe Ser Lys Pro Ile Leu Glu Ile Thr
2555 2560 2565
Ser Ile Pro Thr Asp Asp Ser Ile Ser Tyr Arg Thr Trp Ile Glu
2570 2575 2580
Lys Ser Asn Gln Thr Gln Lys Arg His Gln Asn Asp Pro Arg Met
2585 2590 2595
Tyr Asn Ser Lys Thr Val Phe His Pro Val Asn Asn Gln Leu Pro
2600 2605 2610
Ser Trp Val Asp Thr Ala Ala Asp Ala Pro Gln Thr Asp Leu Leu
2615 2620 2625
Thr Asn Tyr Lys Thr Arg Gln Pro Ser Pro Asn Phe Pro Arg Asp
2630 2635 2640
Val His Thr Trp Gly Val Ser Ser Asn Pro Phe Asn Ser Pro Asn
2645 2650 2655
Arg Asp Leu Tyr Gln Ser Asp Phe Ser Glu Pro Ser Asp Gly Tyr
2660 2665 2670
Ser Ser Glu Ser Glu Asn Ser Ile Val Leu Ser Leu Asp Glu His
2675 2680 2685
Arg Ser Cys Arg Val Pro Arg His Val Arg Val Val Asn Ala Asp
2690 2695 2700
Val Val Thr Gly Arg Arg Tyr Val Arg Gly Thr Ala Leu Gly Ala
2705 2710 2715
Leu Ala Leu Leu Ser Gln Ala Cys Arg Arg Met Ile Asp Asn Val
2720 2725 2730
Arg Tyr Thr Arg Lys Leu Leu Met Asp His Thr Glu Asp Ile Phe
2735 2740 2745
Gln Gly Leu Gly Tyr Val Lys Leu Leu Leu Asp Gly Thr Tyr Ile
2750 2755 2760
<210>20
<211>585
<212>PRT
<213>水痘带状疱疹
<400>20
Met Asp Arg Val Glu Ser Glu Glu Pro Met Asp Gly Phe Glu Ser Pro
1 5 10 15
Val Phe Ser Glu Asn Thr Ser Ser Asn Ser Gly Trp Cys Ser Asp Ala
20 25 30
Phe Ser Asp Ser Tyr Ile Ala Tyr Asn Pro Ala Leu Leu Leu Lys Asn
35 40 45
Asp Leu Leu Phe Ser Glu Leu Leu Phe Ala Ser His Leu Ile Asn Val
50 55 60
Pro Arg Ala Ile Glu Asn Asn Val Thr Tyr Glu Ala Ser Ser Ala Val
65 70 75 80
Gly Val Asp Asn Glu Met Thr Ser Ser Thr Thr Glu Phe Ile Glu Glu
85 90 95
Ile Gly Asp Val Leu Ala Leu Asp Arg Ala Cys Leu Val Cys Arg Thr
100 105 110
Leu Asp Leu Tyr Lys Arg Lys Phe Gly Leu Thr Pro Glu Trp Val Ala
115 120 125
Asp Tyr Ala Met Leu Cys Met Lys Ser Leu Ala Ser Pro Pro Cys Ala
130 135 140
Val Val Thr Phe Ser Ala Ala Phe Glu Phe Val Tyr Leu Met Asp Arg
145 150 155 160
Tyr Tyr Leu Cys Arg Tyr Asn Val Thr Leu Val Gly Ser Phe Ala Arg
165 170 175
Arg Thr Leu Ser Leu Leu Asp Ile Gln Arg His Phe Phe Leu His Val
180 185 190
Cys Phe Arg Thr Asp Gly Gly Leu Pro Gly Ile Arg Pro Pro Pro Gly
195 200 205
Lys Glu Met Ala Asn Lys Val Arg Tyr Ser Asn Tyr Ser Phe Phe Val
210 215 220
Gln Ala Val Val Arg Ala Ala Leu Leu Ser Ile Ser Thr Ser Arg Leu
225 230 235 240
Asp Glu Thr Glu Thr Arg Lys Ser Phe Tyr Phe Asn Gln Asp Gly Leu
245 250 255
Thr Gly Gly Pro Gln Pro Leu Ala Ala Ala Leu Ala Asn Trp Lys Asp
260 265 270
Cys Ala Arg Met Val Asp Cys Ser Ser Ser Glu His Arg Thr Ser Gly
275 280 285
Met Ile Thr Cys Ala Glu Arg Ala Leu Lys Glu Asp Ile Glu Phe Glu
290 295 300
Asp Ile Leu Ile Asp Lys Leu Lys Lys Ser Ser Tyr Val Glu Ala Ala
305 310 315 320
Trp Gly Tyr Ala Asp Leu Ala Leu Leu Leu Leu Ser Gly Val Ala Thr
325 330 335
Trp Asn Val Asp Glu Arg Thr Asn Cys Ala Ile Glu Thr Arg Val Gly
340 345 350
Cys Val Lys Ser Tyr Trp Gln Ala Asn Arg Ile Glu Asn Ser Arg Asp
355 360 365
Val Pro Lys Gln Phe Ser Lys Phe Thr Ser Glu Asp Ala Cys Pro Glu
370 375 380
Val Ala Phe Gly Pro Ile Leu Leu Thr Thr Leu Lys Asn Ala Lys Cys
385 390 395 400
Arg Gly Arg Thr Asn Thr Glu Cys Met Leu Cys Cys Leu Leu Thr Ile
405 410 415
Gly His Tyr Trp Ile Ala Leu Arg Gln Phe Lys Arg Asp Ile Leu Ala
420 425 430
Tyr Ser Ala Asn Asn Thr Ser Leu Phe Asp Cys Ile Glu Pro Val Ile
435 440 445
Asn Ala Trp Ser Leu Asp Asn Pro Ile Lys Leu Lys Phe Pro Phe Asn
450 455 460
Asp Glu Gly Arg Phe Ile Thr Ile Val Lys Ala Ala Gly Ser Glu Ala
465 470 475 480
Val Tyr Lys His Leu Phe Cys Asp Leu Leu Cys Ala Leu Ser Glu Leu
485 490 495
Gln Thr Asn Pro Lys Ile Leu Phe Ala His Pro Thr Thr Ala Asp Lys
500 505 510
Glu Val Leu Glu Leu Tyr Lys Ala Gln Leu Ala Ala Gln Asn Arg Phe
515 520 525
Glu Gly Arg Val Cys Ala Gly Leu Trp Thr Leu Ala Tyr Ala Phe Lys
530 535 540
Ala Tyr Gln Ile Phe Pro Arg Lys Pro Thr Ala Asn Ala Ala Phe Ile
545 550 555 560
Arg Asp Gly Gly Leu Met Leu Arg Arg His Ala Ile Ser Leu Val Ser
565 570 575
Leu Glu His Thr Leu Ser Lys Tyr Val
580 585
<210>21
<211>1204
<212>PRT
<213>水痘带状疱疹
<400>21
Met Glu Asn Thr Gln Lys Thr Val Thr Val Pro Thr Gly Pro Leu Gly
1 5 10 15
Tyr Val Tyr Ala Cys Arg Val Glu Asp Leu Asp Leu Glu Glu Ile Ser
20 25 30
Phe Leu Ala Ala Arg Ser Thr Asp Ser Asp Leu Ala Leu Leu Pro Leu
35 40 45
Met Arg Asn Leu Thr Val Glu Lys Thr Phe Thr Ser Ser Leu Ala Val
50 55 60
Val Ser Gly Ala Arg Thr Thr Gly Leu Ala Gly Ala Gly Ile Thr Leu
65 70 75 80
Lys Leu Thr Thr Ser His Phe Tyr Pro Ser Val Phe Val Phe His Gly
85 90 95
Gly Lys His Val Leu Pro Ser Ser Ala Ala Pro Asn Leu Thr Arg Ala
100 105 110
Cys Asn Ala Ala Arg Glu Arg Phe Gly Phe Ser Arg Cys Gln Gly Pro
115 120 125
Pro Val Asp Gly Ala Val Glu Thr Thr Gly Ala Glu Ile Cys Thr Arg
130 135 140
Leu Gly Leu Glu Pro Glu Asn Thr Ile Leu Tyr Leu Val Val Thr Ala
145 150 155 160
Leu Phe Lys Glu Ala Val Phe Met Cys Asn Val Phe Leu His Tyr Gly
165 170 175
Gly Leu Asp Ile Val His Ile Asn His Gly Asp Val Ile Arg Ile Pro
180 185 190
Leu Phe Pro Val Gln Leu Phe Met Pro Asp Val Asn Arg Leu Val Pro
195 200 205
Asp Pro Phe Asn Thr His His Arg Ser Ile Gly Glu Gly Phe Val Tyr
210 215 220
Pro Thr Pro Phe Tyr Asn Thr Gly Leu Cys His Leu Ile His Asp Cys
225 230 235 240
Val Ile Ala Pro Met Ala Val Ala Leu Arg Val Arg Asn Val Thr Ala
245 250 255
Val Ala Arg Gly Ala Ala His Leu Ala Phe Asp Glu Asn His Glu Gly
260 265 270
Ala Val Leu Pro Pro Asp Ile Thr Tyr Thr Tyr Phe Gln Ser Ser Ser
275 280 285
Ser Gly Thr Thr Thr Ala Arg Gly Ala Arg Arg Asn Asp Val Asn Ser
290 295 300
Thr Ser Lys Pro Ser Pro Ser Gly Gly Phe Glu Arg Arg Leu Ala Ser
305 310 315 320
Ile Met Ala Ala Asp Thr Ala Leu His Ala Glu Val Ile Phe Asn Thr
325 330 335
Gly Ile Tyr Glu Glu Thr Pro Thr Asp Ile Lys Glu Trp Pro Met Phe
340 345 350
Ile Gly Met Glu Gly Thr Leu Pro Arg Leu Asn Ala Leu Gly Ser Tyr
355 360 365
Thr Ala Arg Val Ala Gly Val Ile Gly Ala Met Val Phe Ser Pro Asn
370 375 380
Ser Ala Leu Tyr Leu Thr Glu Val Glu Asp Ser Gly Met Thr Glu Ala
385 390 395 400
Lys Asp Gly Gly Pro Gly Pro Ser Phe Asn Arg Phe Tyr Gln Phe Ala
405 410 415
Gly Pro His Leu Ala Ala Asn Pro Gln Thr Asp Arg Asp Gly His Val
420 425 430
Leu Ser Ser Gln Ser Thr Gly Ser Ser Asn Thr Glu Phe Ser Val Asp
435 440 445
Tyr Leu Ala Leu Ile Cys Gly Phe Gly Ala Pro Leu Leu Ala Arg Leu
450 455 460
Leu Phe Tyr Leu Glu Arg Cys Asp Ala Gly Ala Phe Thr Gly Gly His
465 470 475 480
Gly Asp Ala Leu Lys Tyr Val Thr Gly Thr Phe Asp Ser Glu Ile Pro
485 490 495
Cys Ser Leu Cys Glu Lys His Thr Arg Pro Val Cys Ala His Thr Thr
500 505 510
Val His Arg Leu Arg Gln Arg Met Pro Arg Phe Gly Gln Ala Thr Arg
515 520 525
Gln Pro Ile Gly Val Phe Gly Thr Met Asn Ser Gln Tyr Ser Asp Cys
530 535 540
Asp Pro Leu Gly Asn Tyr Ala Pro Tyr Leu Ile Leu Arg Lys Pro Gly
545 550 555 560
Asp Gln Thr Glu Ala Ala Lys Ala Thr Met Gln Asp Thr Tyr Arg Ala
565 570 575
Thr Leu Glu Arg Leu Phe Ile Asp Leu Glu Gln Glu Arg Leu Leu Asp
580 585 590
Arg Gly Ala Pro Cys Ser Ser Glu Gly Leu Ser Ser Val Ile Val Asp
595 600 605
His Pro Thr Phe Arg Arg Ile Leu Asp Thr Leu Arg Ala Arg Ile Glu
610 615 620
Gln Thr Thr Thr Gln Phe Met Lys Val Leu Val Glu Thr Arg Asp Tyr
625 630 635 640
Lys Ile Arg Glu Gly Leu Ser Glu Ala Thr His Ser Met Ala Leu Thr
645 650 655
Phe Asp Pro Tyr Ser Gly Ala Phe Cys Pro Ile Thr Asn Phe Leu Val
660 665 670
Lys Arg Thr His Leu Ala Val Val Gln Asp Leu Ala Leu Ser Gln Cys
675 680 685
His Cys Val Phe Tyr Gly Gln Gln Val Glu Gly Arg Asn Phe Arg Asn
690 695 700
Gln Phe Gln Pro Val Leu Arg Arg Arg Phe Val Asp Leu Phe Asn Gly
705 710 715 720
Gly Phe Ile Ser Thr Arg Ser Ile Thr Val Thr Leu Ser Glu Gly Pro
725 730 735
Val Ser Ala Pro Asn Pro Thr Leu Gly Gln Asp Ala Pro Ala Gly Arg
740 745 750
Thr Phe Asp Gly Asp Leu Ala Arg Val Ser Val Glu Val Ile Arg Asp
755 760 765
Ile Arg Val Lys Asn Arg Val Val Phe Ser Gly Asn Cys Thr Asn Leu
770 775 780
Ser Glu Ala Ala Arg Ala Arg Leu Val Gly Leu Ala Ser Ala Tyr Gln
785 790 795 800
Arg Gln Glu Lys Arg Val Asp Met Leu His Gly Ala Leu Gly Phe Leu
805 810 815
Leu Lys Gln Phe His Gly Leu Leu Phe Pro Arg Gly Met Pro Pro Asn
820 825 830
Ser Lys Ser Pro Asn Pro Gln Trp Phe Trp Thr Leu Leu Gln Arg Asn
835 840 845
Gln Met Pro Ala Asp Lys Leu Thr His Glu Glu Ile Thr Thr Ile Ala
850 855 860
Ala Val Lys Arg Phe Thr Glu Glu Tyr Ala Ala Ile Asn Phe Ile Asn
865 870 875 880
Leu Pro Pro Thr Cys Ile Gly Glu Leu Ala Gln Phe Tyr Met Ala Asn
885 890 895
Leu Ile Leu Lys Tyr Cys Asp His Ser Gln Tyr Leu Ile Asn Thr Leu
900 905 910
Thr Ser Ile Ile Thr Gly Ala Arg Arg Pro Arg Asp Pro Ser Ser Val
915 920 925
Leu His Trp Ile Arg Lys Asp Val Thr Ser Ala Ala Asp Ile Glu Thr
930 935 940
Gln Ala Lys Ala Leu Leu Glu Lys Thr Glu Asn Leu Pro Glu Leu Trp
945 950 955 960
Thr Thr Ala Phe Thr Ser Thr His Leu Val Arg Ala Ala Met Asn Gln
965 970 975
Arg Pro Met Val Val Leu Gly Ile Ser Ile Ser Lys Tyr His Gly Ala
980 985 990
Ala Gly Asn Asn Arg Val Phe Gln Ala Gly Asn Trp Ser Gly Leu Asn
995 1000 1005
Gly Gly Lys Asn Val Cys Pro Leu Phe Thr Phe Asp Arg Thr Arg
1010 1015 1020
Arg Phe Ile Ile Ala Cys Pro Arg Gly Gly Phe Ile Cys Pro Val
1025 1030 1035
Thr Gly Pro Ser Ser Gly Asn Arg Glu Thr Thr Leu Ser Asp Gln
1040 1045 1050
Val Arg Gly Ile Ile Val Ser Gly Gly Ala Met Val Gln Leu Ala
1055 1060 1065
Ile Tyr Ala Thr Val Val Arg Ala Val Gly Ala Arg Ala Gln His
1070 1075 1080
Met Ala Phe Asp Asp Trp Leu Ser Leu Thr Asp Asp Glu Phe Leu
1085 1090 1095
Ala Arg Asp Leu Glu Glu Leu His Asp Gln Ile Ile Gln Thr Leu
1100 1105 1110
Glu Thr Pro Trp Thr Val Glu Gly Ala Leu Glu Ala Val Lys Ile
1115 1120 1125
Leu Asp Glu Lys Thr Thr Ala Gly Asp Gly Glu Thr Pro Thr Asn
1130 1135 1140
Leu Ala Phe Asn Phe Asp Ser Cys Glu Pro Ser His Asp Thr Thr
1145 1150 1155
Ser Asn Val Leu Asn Ile Ser Gly Ser Asn Ile Ser Gly Ser Thr
1160 1165 1170
Val Pro Gly Leu Lys Arg Pro Pro Glu Asp Asp Glu Leu Phe Asp
1175 1180 1185
Leu Ser Gly Ile Pro Ile Lys His Gly Asn Ile Thr Met Glu Met
1190 1195 1200
Ile
<210>22
<211>770
<212>PRT
<213>水痘带状疱疹
<400>22
Met Glu Leu Asp Ile Asn Arg Thr Leu Leu Val Leu Leu Gly Gln Val
1 5 10 15
Tyr Thr Tyr Ile Phe Gln Val Glu Leu Leu Arg Arg Cys Asp Pro Arg
20 25 30
Val Ala Cys Arg Phe Leu Tyr Arg Leu Ala Ala Asn Cys Leu Thr Val
35 40 45
Arg Tyr Leu Leu Lys Leu Phe Leu Arg Gly Phe Asn Thr Gln Leu Lys
50 55 60
Phe Gly Asn Thr Pro Thr Val Cys Ala Leu His Trp Ala Leu Cys Tyr
65 70 75 80
Val Lys Gly Glu Gly Glu Arg Leu Phe Glu Leu Leu Gln His Phe Lys
85 90 95
Thr Arg Phe Val Tyr Gly Glu Thr Lys Asp Ser Asn Cys Ile Lys Asp
100 105 110
Tyr Phe Val Ser Ala Phe Asn Leu Lys Thr Cys Gln Tyr His His Glu
115 120 125
Leu Ser Leu Thr Thr Tyr Gly Gly Tyr Val Ser Ser Glu Ile Gln Phe
130 135 140
Leu His Asp Ile Glu Asn Phe Leu Lys Gln Leu Asn Tyr Cys Tyr Ile
145 150 155 160
Ile Thr Ser Ser Arg Glu Ala Leu Asn Thr Leu Glu Thr Val Thr Arg
165 170 175
Phe Met Thr Asp Thr Ile Gly Ser Gly Leu Ile Pro Pro Val Glu Leu
180 185 190
Phe Asp Pro Ala His Pro Cys Ala Ile Cys Phe Glu Glu Leu Cys Ile
195 200 205
Thr Ala Asn Gln Gly Glu Thr Leu His Arg Arg Leu Leu Gly Cys Ile
210 215 220
Cys Asp His Val Thr Lys Gln Val Arg Val Asn Val Asp Val Asp Asp
225 230 235 240
Ile Ile Arg Cys Leu Pro Tyr Ile Pro Asp Val Pro Asp Ile Lys Arg
245 250 255
Gln Ser Ala Val Glu Ala Leu Arg Thr Leu Gln Thr Lys Thr Val Val
260 265 270
Asn Pro Met Gly Ala Lys Asn Asp Thr Phe Asp Gln Thr Tyr Glu Ile
275 280 285
Ala Ser Thr Met Leu Asp Ser Tyr Asn Val Phe Lys Pro Ala Pro Arg
290 295 300
Cys Met Tyr Ala Ile Ser Glu Leu Lys Phe Trp Leu Thr Ser Asn Ser
305 310 315 320
Thr Glu Gly Pro Gln Arg Thr Leu Asp Val Phe Val Asp Asn Leu Asp
325 330 335
Val Leu Asn Glu His Glu Lys His Ala Glu Leu Thr Ala Val Thr Val
340 345 350
Glu Leu Ala Leu Phe Gly Lys Thr Pro Ile His Phe Asp Arg Ala Phe
355 360 365
Ser Glu Glu Leu Gly Ser Leu Asp Ala Ile Asp Ser Ile Leu Val Gly
370 375 380
Asn Arg Ser Ser Ser Pro Asp Ser Gln Ile Glu Ala Leu Ile Lys Ala
385 390 395 400
Cys Tyr Ala His His Leu Ser Ser Pro Leu Met Arg His Ile Ser Asn
405 410 415
Pro Ser His Asp Asn Glu Ala Ala Leu Arg Gln Leu Leu Glu Arg Val
420 425 430
Gly Cys Glu Asp Asp Leu Thr Lys Glu Ala Ser Asp Ser Ala Thr Ala
435 440 445
Ser Glu Cys Asp Leu Asn Asp Asp Ser Ser Ile Thr Phe Ala Val His
450 455 460
Gly Trp Glu Asn Leu Leu Ser Lys Ala Lys Ile Asp Ala Ala Glu Arg
465 470 475 480
Lys Arg Val Tyr Leu Glu His Leu Ser Lys Arg Ser Leu Thr Ser Leu
485 490 495
Gly Arg Cys Ile Arg Glu Gln Arg Gln Glu Leu Glu Lys Thr Leu Arg
500 505 510
Val Asn Val Tyr Gly Glu Ala Leu Leu Gln Thr Phe Val Ser Met Gln
515 520 525
Asn Gly Phe Gly Ala Arg Asn Val Phe Leu Ala Lys Val Ser Gln Ala
530 535 540
Gly Cys Ile Ile Asp Asn Arg Ile Gln Glu Ala Ala Phe Asp Ala His
545 550 555 560
Arg Phe Ile Arg Asn Thr Leu Val Arg His Thr Val Asp Ala Ala Met
565 570 575
Leu Pro Ala Leu Thr His Lys Phe Phe Glu Leu Val Asn Gly Pro Leu
580 585 590
Phe Asn His Asp Glu His Arg Phe Ala Gln Pro Pro Asn Thr Ala Leu
595 600 605
Phe Phe Thr Val Glu Asn Val Gly Leu Phe Pro His Leu Lys Glu Glu
610 615 620
Leu Ala Lys Phe Met Gly Gly Val Val Gly Ser Asn Trp Leu Leu Ser
625 630 635 640
Pro Phe Arg Gly Phe Tyr Cys Phe Ser Gly Val Glu Gly Val Thr Phe
645 650 655
Ala Gln Arg Leu Ala Trp Lys Tyr Ile Arg Glu Leu Val Phe Ala Thr
660 665 670
Thr Leu Phe Thr Ser Val Phe His Cys Gly Glu Val Arg Leu Cys Arg
675 680 685
Val Asp Arg Leu Gly Lys Asp Pro Arg Gly Cys Thr Ser Gln Pro Lys
690 695 700
Gly Ile Gly Ser Ser His Gly Pro Leu Asp Gly Ile Tyr Leu Thr Tyr
705 710 715 720
Glu Glu Thr Cys Pro Leu Val Ala Ile Ile Gln Ser Gly Glu Thr Gly
725 730 735
Ile Asp Gln Asn Thr Val Val Ile Tyr Asp Ser Asp Val Phe Ser Leu
740 745 750
Leu Tyr Thr Leu Met Gln Arg Leu Ala Pro Asp Ser Thr Asp Pro Ala
755 760 765
Phe Ser
770
<210>23
<211>868
<212>PRT
<213>水痘带状疱疹
<400>23
Met Phe Val Thr Ala Val Val Ser Val Ser Pro Ser Ser Phe Tyr Glu
1 5 10 15
Ser Leu Gln Val Glu Pro Thr Gln Ser Glu Asp Ile Thr Arg Ser Ala
20 25 30
His Leu Gly Asp Gly Asp Glu Ile Arg Glu Ala Ile His Lys Ser Gln
35 40 45
Asp Ala Glu Thr Lys Pro Thr Phe Tyr Val Cys Pro Pro Pro Thr Gly
50 55 60
Ser Thr Ile Val Arg Leu Glu Pro Thr Arg Thr Cys Pro Asp Tyr His
65 70 75 80
Leu Gly Lys Asn Phe Thr Glu Gly Ile Ala Val Val Tyr Lys Glu Asn
85 90 95
Ile Ala Ala Tyr Lys Phe Lys Ala Thr Val Tyr Tyr Lys Asp Val Ile
100 105 110
Val Ser Thr Ala Trp Ala Gly Ser Ser Tyr Thr Gln Ile Thr Asn Arg
115 120 125
Tyr Ala Asp Arg Val Pro Ile Pro Val Ser Glu Ile Thr Asp Thr Ile
130 135 140
Asp Lys Phe Gly Lys Cys Ser Ser Lys Ala Thr Tyr Val Arg Asn Asn
145 150 155 160
His Lys Val Glu Ala Phe Asn Glu Asp Lys Asn Pro Gln Asp Met Pro
165 170 175
Leu Ile Ala Ser Lys Tyr Asn Ser Val Gly Ser Lys Ala Trp His Thr
180 185 190
Thr Asn Asp Thr Tyr Met Val Ala Gly Thr Pro Gly Thr Tyr Arg Thr
195 200 205
Gly Thr Ser Val Asn Cys Ile Ile Glu Glu Val Glu Ala Arg Ser Ile
210 215 220
Phe Pro Tyr Asp Ser Phe Gly Leu Ser Thr Gly Asp Ile Ile Tyr Met
225 230 235 240
Ser Pro Phe Phe Gly Leu Arg Asp Gly Ala Tyr Arg Glu His Ser Asn
245 250 255
Tyr Ala Met Asp Arg Phe His Gln Phe Glu Gly Tyr Arg Gln Arg Asp
260 265 270
Leu Asp Thr Arg Ala Leu Leu Glu Pro Ala Ala Arg Asn Phe Leu Val
275 280 285
Thr Pro His Leu Thr Val Gly Trp Asn Trp Lys Pro Lys Arg Thr Glu
290 295 300
Val Cys Ser Leu Val Lys Trp Arg Glu Val Glu Asp Val Val Arg Asp
305 310 315 320
Glu Tyr Ala His Asn Phe Arg Phe Thr Met Lys Thr Leu Ser Thr Thr
325 330 335
Phe Ile Ser Glu Thr Asn Glu Phe Asn Leu Asn Gln Ile His Leu Ser
340 345 350
Gln Cys Val Lys Glu Glu Ala Arg Ala Ile Ile Asn Arg Ile Tyr Thr
355 360 365
Thr Arg Tyr Asn Ser Ser His Val Arg Thr Gly Asp Ile Gln Thr Tyr
370 375 380
Leu Ala Arg Gly Gly Phe Val Val Val Phe Gln Pro Leu Leu Ser Asn
385 390 395 400
Ser Leu Ala Arg Leu Tyr Leu Gln Glu Leu Val Arg Glu Asn Thr Asn
405 410 415
His Ser Pro Gln Lys His Pro Thr Arg Asn Thr Arg Ser Arg Arg Ser
420 425 430
Val Pro Val Glu Leu Arg Ala Asn Arg Thr Ile Thr Thr Thr Ser Ser
435 440 445
Val Glu Phe Ala Met Leu Gln Phe Thr Tyr Asp His Ile Gln Glu His
450 455 460
Val Asn Glu Met Leu Ala Arg Ile Ser Ser Ser Trp Cys Gln Leu Gln
465 470 475 480
Asn Arg Glu Arg Ala Leu Trp Ser Gly Leu Phe Pro Ile Asn Pro Ser
485 490 495
Ala Leu Ala Ser Thr Ile Leu Asp Gln Arg Val Lys Ala Arg Ile Leu
500 505 510
Gly Asp Val Ile Ser Val Ser Asn Cys Pro Glu Leu Gly Ser Asp Thr
515 520 525
Arg Ile Ile Leu Gln Asn Ser Met Arg Val Ser Gly Ser Thr Thr Arg
530 535 540
Cys Tyr Ser Arg Pro Leu Ile Ser Ile Val Ser Leu Asn Gly Ser Gly
545 550 555 560
Thr Val Glu Gly Gln Leu Gly Thr Asp Asn Glu Leu Ile Met Ser Arg
565 570 575
Asp Leu Leu Glu Pro Cys Val Ala Asn His Lys Arg Tyr Phe Leu Phe
580 585 590
Gly His His Tyr Val Tyr Tyr Glu Asp Tyr Arg Tyr Val Arg Glu Ile
595 600 605
Ala Val His Asp Val Gly Met Ile Ser Thr Tyr Val Asp Leu Asn Leu
610 615 620
Thr Leu Leu Lys Asp Arg Glu Phe Met Pro Leu Gln Val Tyr Thr Arg
625 630 635 640
Asp Glu Leu Arg Asp Thr Gly Leu Leu Asp Tyr Ser Glu Ile Gln Arg
645 650 655
Arg Asn Gln Met His Ser Leu Arg Phe Tyr Asp Ile Asp Lys Val Val
660 665 670
Gln Tyr Asp Ser Gly Thr Ala Ile Met Gln Gly Met Ala Gln Phe Phe
675 680 685
Gln Gly Leu Gly Thr Ala Gly Gln Ala Val Gly His Val Val Leu Gly
690 695 700
Ala Thr Gly Ala Leu Leu Ser Thr Val His Gly Phe Thr Thr Phe Leu
705 710 715 720
Ser Asn Pro Phe Gly Ala Leu Ala Val Gly Leu Leu Val Leu Ala Gly
725 730 735
Leu Val Ala Ala Phe Phe Ala Tyr Arg Tyr Val Leu Lys Leu Lys Thr
740 745 750
Ser Pro Met Lys Ala Leu Tyr Pro Leu Thr Thr Lys Gly Leu Lys Gln
755 760 765
Leu Pro Glu Gly Met Asp Pro Phe Ala Glu Lys Pro Asn Ala Thr Asp
770 775 780
Thr Pro Ile Glu Glu Ile Gly Asp Ser Gln Asn Thr Glu Pro Ser Val
785 790 795 800
Asn Ser Gly Phe Asp Pro Asp Lys Phe Arg Glu Ala Gln Glu Met Ile
805 810 815
Lys Tyr Met Thr Leu Val Ser Ala Ala Glu Arg Gln Glu Ser Lys Ala
820 825 830
Arg Lys Lys Asn Lys Thr Ser Ala Leu Leu Thr Ser Arg Leu Thr Gly
835 840 845
Leu Ala Leu Arg Asn Arg Arg Gly Tyr Ser Arg Val Arg Thr Glu Asn
850 855 860
Val Thr Gly Val
865
<210>24
<211>143
<212>PRT
<213>水痘带状疱疹
<400>24
Met Glu Ser Ser Asn Ile Asn Ala Leu Gln Gln Pro Ser Ser Ile Ala
1 5 10 15
His His Pro Ser Lys Gln Cys Ala Ser Ser Leu Asn Glu Thr Val Lys
20 25 30
Asp Ser Pro Pro Ala Ile Tyr Glu Asp Arg Leu Glu His Thr Pro Val
35 40 45
Gln Leu Pro Arg Asp Gly Thr Pro Arg Asp Val Cys Ser Val Gly Gln
50 55 60
Leu Thr Cys Arg Ala Cys Ala Thr Lys Pro Phe Arg Leu Asn Arg Asp
65 70 75 80
Ser Gln Tyr Asp Tyr Leu Asn Thr Cys Pro Gly Gly Arg His Ile Ser
85 90 95
Leu Ala Leu Glu Ile Ile Thr Gly Arg Trp Val Cys Ile Pro Arg Val
100 105 110
Phe Pro Asp Thr Pro Glu Glu Lys Trp Met Ala Pro Tyr Ile Ile Pro
115 120 125
Asp Arg Glu Gln Pro Ser Ser Gly Asp Glu Asp Ser Asp Thr Asp
130 135 140
<210>25
<211>341
<212>PRT
<213>水痘带状疱疹
<400>25
Met Ser Thr Asp Lys Thr Asp Val Lys Met Gly Val Leu Arg Ile Tyr
1 5 10 15
Leu Asp Gly Ala Tyr Gly Ile Gly Lys Thr Thr Ala Ala Glu Glu Phe
20 25 30
Leu His His Phe Ala Ile Thr Pro Asn Arg Ile Leu Leu Ile Gly Glu
35 40 45
Pro Leu Ser Tyr Trp Arg Asn Leu Ala Gly Glu Asp Ala Ile Cys Gly
50 55 60
Ile Tyr Gly Thr Gln Thr Arg Arg Leu Asn Gly Asp Val Ser Pro Glu
65 70 75 80
Asp Ala Gln Arg Leu Thr Ala His Phe Gln Ser Leu Phe Cys Ser Pro
85 90 95
His Ala Ile Met His Ala Lys Ile Ser Ala Leu Met Asp Thr Ser Thr
100 105 110
Ser Asp Leu Val Gln Val Asn Lys Glu Pro Tyr Lys Ile Met Leu Ser
115 120 125
Asp Arg His Pro Ile Ala Ser Thr Ile Cys Phe Pro Leu Ser Arg Tyr
130 135 140
Leu Val Gly Asp Met Ser Pro Ala Ala Leu Pro Gly Leu Leu Phe Thr
145 150 155 160
Leu Pro Ala Glu Pro Pro Gly Thr Asn Leu Val Val Cys Thr Val Ser
165 170 175
Leu Pro Ser His Leu Ser Arg Val Ser Lys Arg Ala Arg Pro Gly Glu
180 185 190
Thr Val Asn Leu Pro Phe Val Met Val Leu Arg Asn Val Tyr Ile Met
195 200 205
Leu Ile Asn Thr Ile Ile Phe Leu Lys Thr Asn Asn Trp His Ala Gly
210 215 220
Trp Asn Thr Leu Ser Phe Cys Asn Asp Val Phe Lys Gln Lys Leu Gln
225 230 235 240
Lys Ser Glu Cys Ile Lys Leu Arg Glu Val Pro Gly Ile Glu Asp Thr
245 250 255
Leu Phe Ala Val Leu Lys Leu Pro Glu Leu Cys Gly Glu Phe Gly Asn
260 265 270
Ile Leu Pro Leu Trp Ala Trp Gly Met Glu Thr Leu Ser Asn Cys Ser
275 280 285
Arg Ser Met Ser Pro Phe Val Leu Ser Leu Glu Gln Thr Pro Gln His
290 295 300
Ala Ala Gln Glu Leu Lys Thr Leu Leu Pro Gln Met Thr Pro Ala Asn
305 310 315 320
Met Ser Ser Gly Ala Trp Asn Ile Leu Lys Glu Leu Val Asn Ala Val
325 330 335
Gln Asp Asn Thr Ser
340
<210>26
<211>841
<212>PRT
<213>水痘带状疱疹
<400>26
Met Phe Ala Leu Val Leu Ala Val Val Ile Leu Pro Leu Trp Thr Thr
1 5 10 15
Ala Asn Lys Ser Tyr Val Thr Pro Thr Pro Ala Thr Arg Ser Ile Gly
20 25 30
His Met Ser Ala Leu Leu Arg Glu Tyr Ser Asp Arg Asn Met Ser Leu
35 40 45
Lys Leu Glu Ala Phe Tyr Pro Thr Gly Phe Asp Glu Glu Leu Ile Lys
50 55 60
Ser Leu His Trp Gly Asn Asp Arg Lys His Val Phe Leu Val Ile Val
65 70 75 80
Lys Val Asn Pro Thr Thr His Glu Gly Asp Val Gly Leu Val Ile Phe
85 90 95
Pro Lys Tyr Leu Leu Ser Pro Tyr His Phe Lys Ala Glu His Arg Ala
100 105 110
Pro Phe Pro Ala Gly Arg Phe Gly Phe Leu Ser His Pro Val Thr Pro
115 120 125
Asp Val Ser Phe Phe Asp Ser Ser Phe Ala Pro Tyr Leu Thr Thr Gln
130 135 140
His Leu Val Ala Phe Thr Thr Phe Pro Pro Asn Pro Leu Val Trp His
145 150 155 160
Leu Glu Arg Ala Glu Thr Ala Ala Thr Ala Glu Arg Pro Phe Gly Val
165 170 175
Ser Leu Leu Pro Ala Arg Pro Thr Val Pro Lys Asn Thr Ile Leu Glu
180 185 190
His Lys Ala His Phe Ala Thr Trp Asp Ala Leu Ala Arg His Thr Phe
195 200 205
Phe Ser Ala Glu Ala Ile Ile Thr Asn Ser Thr Leu Arg Ile His Val
210 215 220
Pro Leu Phe Gly Ser Val Trp Pro Ile Arg Tyr Trp Ala Thr Gly Ser
225 230 235 240
Val Leu Leu Thr Ser Asp Ser Gly Arg Val Glu Val Asn Ile Gly Val
245 250 255
Gly Phe Met Ser Ser Leu Ile Ser Leu Ser Ser Gly Pro Pro Ile Glu
260 265 270
Leu Ile Val Val Pro His Thr Val Lys Leu Asn Ala Val Thr Ser Asp
275 280 285
Thr Thr Trp Phe Gln Leu Asn Pro Pro Gly Pro Asp Pro Gly Pro Ser
290 295 300
Tyr Arg Val Tyr Leu Leu Gly Arg Gly Leu Asp Met Asn Phe Ser Lys
305 310 315 320
His Ala Thr Val Asp Ile Cys Ala Tyr Pro Glu Glu Ser Leu Asp Tyr
325 330 335
Arg Tyr His Leu Ser Met Ala His Thr Glu Ala Leu Arg Met Thr Thr
340 345 350
Lys Ala Asp Gln His Asp Ile Asn Glu Glu Ser Tyr Tyr His Ile Ala
355 360 365
Ala Arg Ile Ala Thr Ser Ile Phe Ala Leu Ser Glu Met Gly Arg Thr
370 375 380
Thr Glu Tyr Phe Leu Leu Asp Glu Ile Val Asp Val Gln Tyr Gln Leu
385 390 395 400
Lys Phe Leu Asn Tyr Ile Leu Met Arg Ile Gly Ala Gly Ala His Pro
405 410 415
Asn Thr Ile Ser Gly Thr Ser Asp Leu Ile Phe Ala Asp Pro Ser Gln
420 425 430
Leu His Asp Glu Leu Ser Leu Leu Phe Gly Gln Val Lys Pro Ala Asn
435 440 445
Val Asp Tyr Phe Ile Ser Tyr Asp Glu Ala Arg Asp Gln Leu Lys Thr
450 455 460
Ala Tyr Ala Leu Ser Arg Gly Gln Asp His Val Asn Ala Leu Ser Leu
465 470 475 480
Ala Arg Arg Val Ile Met Ser Ile Tyr Lys Gly Leu Leu Val Lys Gln
485 490 495
Asn Leu Asn Ala Thr Glu Arg Gln Ala Leu Phe Phe Ala Ser Met Ile
500 505 510
Leu Leu Asn Phe Arg Glu Gly Leu Glu Asn Ser Ser Arg Val Leu Asp
515 520 525
Gly Arg Thr Thr Leu Leu Leu Met Thr Ser Met Cys Thr Ala Ala His
530 535 540
Ala Thr Gln Ala Ala Leu Asn Ile Gln Glu Gly Leu Ala Tyr Leu Asn
545 550 555 560
Pro Ser Lys His Met Phe Thr Ile Pro Asn Val Tyr Ser Pro Cys Met
565 570 575
Gly Ser Leu Arg Thr Asp Leu Thr Glu Glu Ile His Val Met Asn Leu
580 585 590
Leu Ser Ala Ile Pro Thr Arg Pro Gly Leu Asn Glu Val Leu His Thr
595 600 605
Gln Leu Asp Glu Ser Glu Ile Phe Asp Ala Ala Phe Lys Thr Met Met
610 615 620
Ile Phe Thr Thr Trp Thr Ala Lys Asp Leu His Ile Leu His Thr His
625 630 635 640
Val Pro Glu Val Phe Thr Cys Gln Asp Ala Ala Ala Arg Asn Gly Glu
645 650 655
Tyr Val Leu Ile Leu Pro Ala Val Gln Gly His Ser Tyr Val Ile Thr
660 665 670
Arg Asn Lys Pro Gln Arg Gly Leu Val Tyr Ser Leu Ala Asp Val Asp
675 680 685
Val Tyr Asn Pro Ile Ser Val Val Tyr Leu Ser Arg Asp Thr Cys Val
690 695 700
Ser Glu His Gly Val Ile Glu Thr Val Ala Leu Pro His Pro Asp Asn
705 710 715 720
Leu Lys Glu Cys Leu Tyr Cys Gly Ser Val Phe Leu Arg Tyr Leu Thr
725 730 735
Thr Gly Ala Ile Met Asp Ile Ile Ile Ile Asp Ser Lys Asp Thr Glu
740 745 750
Arg Gln Leu Ala Ala Met Gly Asn Ser Thr Ile Pro Pro Phe Asn Pro
755 760 765
Asp Met His Gly Asp Asp Ser Lys Ala Val Leu Leu Phe Pro Asn Gly
770 775 780
Thr Val Val Thr Leu Leu Gly Phe Glu Arg Arg Gln Ala Ile Arg Met
785 790 795 800
Ser Gly Gln Tyr Leu Gly Ala Ser Leu Gly Gly Ala Phe Leu Ala Val
805 810 815
Val Gly Phe Gly Ile Ile Gly Trp Met Leu Cys Gly Asn Ser Arg Leu
820 825 830
Arg Glu Tyr Asn Lys Ile Pro Leu Thr
835 840
<210>27
<211>240
<212>PRT
<213>水痘带状疱疹
<400>27
Met Asn Pro Pro Gln Ala Arg Val Ser Glu Gln Thr Lys Asp Leu Leu
1 5 10 15
Ser Val Met Val Asn Gln His Pro Glu Glu Asp Ala Lys Val Cys Lys
20 25 30
Ser Ser Asp Asn Ser Pro Leu Tyr Asn Thr Met Val Met Leu Ser Tyr
35 40 45
Gly Gly Asp Thr Asp Leu Leu Leu Ser Ser Ala Cys Thr Arg Thr Ser
50 55 60
Thr Val Asn Arg Ser Ala Phe Thr Gln His Ser Val Phe Tyr Ile Ile
65 70 75 80
Ser Thr Val Leu Ile Gln Pro Ile Cys Cys Ile Phe Phe Phe Phe Tyr
85 90 95
Tyr Lys Ala Thr Arg Cys Met Leu Leu Phe Thr Ala Gly Leu Leu Leu
100 105 110
Thr Ile Leu His His Phe Arg Leu Ile Ile Met Leu Leu Cys Val Tyr
115 120 125
Arg Asn Ile Arg Ser Asp Leu Leu Pro Leu Ser Thr Ser Gln Gln Leu
130 135 140
Leu Leu Gly Ile Ile Val Val Thr Arg Thr Met Leu Phe Cys Ile Thr
145 150 155 160
Ala Tyr Tyr Thr Leu Phe Ile Asp Thr Arg Val Phe Phe Leu Ile Thr
165 170 175
Gly His Leu Gln Ser Glu Val Ile Phe Pro Asp Ser Val Ser Lys Ile
180 185 190
Leu Pro Val Ser Trp Gly Pro Ser Pro Ala Val Leu Leu Val Met Ala
195 200 205
Ala Val Ile Tyr Ala Met Asp Cys Leu Val Asp Thr Val Ser Phe Ile
210 215 220
Gly Pro Arg Val Trp Val Arg Val Met Leu Lys Thr Ser Ile Ser Phe
225 230 235 240
<210>28
<211>1396
<212>PRT
<213>水痘带状疱疹
<400>28
Met Thr Thr Val Ser Cys Pro Ala Asn Val Ile Thr Thr Thr Glu Ser
1 5 10 15
Asp Arg Ile Ala Gly Leu Phe Asn Ile Pro Ala Gly Ile Ile Pro Thr
20 25 30
Gly Asn Val Leu Ser Thr Ile Glu Val Cys Ala His Arg Cys Ile Phe
35 40 45
Asp Phe Phe Lys Gln Ile Arg Ser Asp Asp Asn Ser Leu Tyr Ser Ala
50 55 60
Gln Phe Asp Ile Leu Leu Gly Thr Tyr Cys Asn Thr Leu Asn Phe Val
65 70 75 80
Arg Phe Leu Glu Leu Gly Leu Ser Val Ala Cys Ile Cys Thr Lys Phe
85 90 95
Pro Glu Leu Ala Tyr Val Arg Asp Gly Val Ile Gln Phe Glu Val Gln
100 105 110
Gln Pro Met Ile Ala Arg Asp Gly Pro His Pro Val Asp Gln Pro Val
115 120 125
His Asn Tyr Met Val Lys Arg Ile His Lys Arg Ser Leu Ser Ala Ala
130 135 140
Phe Ala Ile Ala Ser Glu Ala Leu Ser Leu Leu Ser Asn Thr Tyr Val
145 150 155 160
Asp Gly Thr Glu Ile Asp Ser Ser Leu Arg Ile Arg Ala Ile Gln Gln
165 170 175
Met Ala Arg Asn Leu Arg Thr Val Leu Asp Ser Phe Glu Arg Gly Thr
180 185 190
Ala Asp Gln Leu Leu Gly Val Leu Leu Glu Lys Ala Pro Pro Leu Ser
195 200 205
Leu Leu Ser Pro Ile Asn Lys Phe Gln Pro Glu Gly His Leu Asn Arg
210 215 220
Val Ala Arg Ala Ala Leu Leu Ser Asp Leu Lys Arg Arg Val Cys Ala
225 230 235 240
Asp Met Phe Phe Met Thr Arg His Ala Arg Glu Pro Arg Leu Ile Ser
245 250 255
Ala Tyr Leu Ser Asp Met Val Ser Cys Thr Gln Pro Ser Val Met Val
260 265 270
Ser Arg Ile Thr His Thr Asn Thr Arg Gly Arg Gln Val Asp Gly Val
275 280 285
Leu Val Thr Thr Ala Thr Leu Lys Arg Gln Leu Leu Gln Gly Ile Leu
290 295 300
GlnIle Asp Asp Thr Ala Ala Asp Val Pro Val Thr Tyr Gly Glu Met
305 310 315 320
Val Leu Gln Gly Thr Asn Leu Val Thr Ala Leu Val Met Gly Lys Ala
325 330 335
Val Arg Gly Met Asp Asp Val Ala Arg His Leu Leu Asp Ile Thr Asp
340 345 350
Pro Asn Thr Leu Asn Ile Pro Ser Ile Pro Pro Gln Ser Asn Ser Asp
355 360 365
Ser Thr Thr Ala Gly Leu Pro Val Asn Ala Arg Val Pro Ala Asp Leu
370 375 380
Val Ile Val Gly Asp Lys Leu Val Phe Leu Glu Ala Leu Glu Arg Arg
385 390 395 400
Val Tyr Gln Ala Thr Arg Val Ala Tyr Pro Leu Ile Gly Asn Ile Asp
405 410 415
Ile Thr Phe Ile Met Pro Met Gly Val Phe Gln Ala Asn Ser Met Asp
420 425 430
Arg Tyr Thr Arg His Ala Gly Asp Phe Ser Thr Val Ser Glu Gln Asp
435 440 445
Pro Arg Gln Phe Pro Pro Gln Gly Ile Phe Phe Tyr Asn Lys Asp Gly
450 455 460
Ile Leu Thr Gln Leu Thr Leu Arg Asp Ala Met Gly Thr Ile Cys His
465 470 475 480
Ser Ser Leu Leu Asp Val Glu Ala Thr Leu Val Ala Leu Arg Gln Gln
485 490 495
His Leu Asp Arg Gln Cys Tyr Phe Gly Val Tyr Val Ala Glu Gly Thr
500 505 510
Glu Asp Thr Leu Asp Val Gln Met Gly Arg Phe Met Glu Thr Trp Ala
515 520 525
Asp Met Met Pro His His Pro His Trp Val Asn Glu His Leu Thr Ile
530 535 540
Leu Gln Phe Ile Ala Pro Ser Asn Pro Arg Leu Arg Phe Glu Leu Asn
545 550 555 560
Pro Ala Phe Asp Phe Phe Val Ala Pro Gly Asp Val Asp Leu Pro Gly
565 570 575
Pro Gln Arg Pro Pro Glu Ala Met Pro Thr Val Asn Ala Thr Leu Arg
580 585 590
Ile Ile Asn Gly Asn Ile Pro Val Pro Leu Cys Pro Ile Ser Phe Arg
595 600 605
Asp Cys Arg Gly Thr Gln Leu Gly Leu Gly Arg His Thr Met Thr Pro
610 615 620
Ala Thr Ile Lys Ala Val Lys Asp Thr Phe Glu Asp Arg Ala Tyr Pro
625 630 635 640
ThrIle Phe Tyr Met Leu Glu Ala Val Ile His Gly Asn Glu Arg Asn
645 650 655
Phe Cys Ala Leu Leu Arg Leu Leu Thr Gln Cys Ile Arg Gly Tyr Trp
660 665 670
Glu Gln Ser His Arg Val Ala Phe Val Asn Asn Phe His Met Leu Met
675 680 685
Tyr Ile Thr Thr Tyr Leu Gly Asn Gly Glu Leu Pro Glu Val Cys Ile
690 695 700
Asn Ile Tyr Arg Asp Leu Leu Gln His Val Arg Ala Leu Arg Gln Thr
705 710 715 720
Ile Thr Asp Phe Thr Ile Gln Gly Glu Gly His Asn Gly Glu Thr Ser
725 730 735
Glu Ala Leu Asn Asn Ile Leu Thr Asp Asp Thr Phe Ile Ala Pro Ile
740 745 750
Leu Trp Asp Cys Asp Ala Leu Ile Tyr Arg Asp Glu Ala Ala Arg Asp
755 760 765
Arg Leu Pro Ala Ile Arg Val Ser Gly Arg Asn Gly Tyr Gln Ala Leu
770 775 780
His Phe Val Asp Met Ala Gly His Asn Phe Gln Arg Arg Asp Asn Val
785 790 795 800
Leu Ile His Gly Arg Pro Val Arg Gly Asp Thr Gly Gln Gly Ile Pro
805 810 815
Ile Thr Pro His His Asp Arg Glu Trp Gly Ile Leu Ser Lys Ile Tyr
820 825 830
Tyr Tyr Ile Val Ile Pro Ala Phe Ser Arg Gly Ser Cys Cys Thr Met
835 840 845
Gly Val Arg Tyr Asp Arg Leu Tyr Pro Ala Leu Gln Ala Val Ile Val
850 855 860
Pro Glu Ile Pro Ala Asp Glu Glu Ala Pro Thr Thr Pro Glu Asp Pro
865 870 875 880
Arg His Pro Leu His Ala His Gln Leu Val Pro Asn Ser Leu Asn Val
885 890 895
Tyr Phe His Asn Ala His Leu Thr Val Asp Gly Asp Ala Leu Leu Thr
900 905 910
Leu Gln Glu Leu Met Gly Asp Met Ala Glu Arg Thr Thr Ala Ile Leu
915 920 925
Val Ser Ser Ala Pro Asp Ala Gly Ala Ala Thr Ala Thr Thr Arg Asn
930 935 940
Met Arg Ile Tyr Asp Gly Ala Leu Tyr His Gly Leu Ile Met Met Ala
945 950 955 960
Tyr Gln Ala Tyr Asp Glu Thr Ile Ala Thr Gly Thr Phe Phe Tyr Pro
965 970 975
Val Pro Val Asn Pro Leu Phe Ala Cys Pro Glu His Leu Ala Ser Leu
980 985 990
Arg Gly Met Thr Asn Ala Arg Arg Val Leu Ala Lys Met Val Pro Pro
995 1000 1005
Ile Pro Pro Phe Leu Gly Ala Asn His His Ala Thr Ile Arg Gln
1010 1015 1020
Pro Val Ala Tyr His Val Thr His Ser Lys Ser Asp Phe Asn Thr
1025 1030 1035
Leu Thr Tyr Ser Leu Leu Gly Gly Tyr Phe Lys Phe Thr Pro Ile
1040 1045 1050
Ser Leu Thr His Gln Leu Arg Thr Gly Phe His Pro Gly Ile Ala
1055 1060 1065
Phe Thr Val Val Arg Gln Asp Arg Phe Ala Thr Glu Gln Leu Leu
1070 1075 1080
Tyr Ala Glu Arg Ala Ser Glu Ser Tyr Phe Val Gly Gln Ile Gln
1085 1090 1095
Val His His His Asp Ala Ile Gly Gly Val Asn Phe Thr Leu Thr
1100 1105 1110
Gln Pro Arg Ala His Val Asp Leu Gly Val Gly Tyr Thr Ala Val
1115 1120 1125
Cys Ala Thr Ala Ala Leu Arg Cys Pro Leu Thr Asp Met Gly Asn
1130 1135 1140
Thr Ala Gln Asn Leu Phe Phe Ser Arg Gly Gly Val Pro Met Leu
1145 1150 1155
His Asp Asn Val Thr Glu Ser Leu Arg Arg Ile Thr Ala Ser Gly
1160 1165 1170
Gly Arg Leu Asn Pro Thr Glu Pro Leu Pro Ile Phe Gly Gly Leu
1175 1180 1185
Arg Pro Ala Thr Ser Ala Gly Ile Ala Arg Gly Gln Ala Ser Val
1190 1195 1200
Cys Glu Phe Val Ala Met Pro Val Ser Thr Asp Leu Gln Tyr Phe
1205 1210 1215
Arg Thr Ala Cys Asn Pro Arg Gly Arg Ala Ser Gly Met Leu Tyr
1220 1225 1230
Met Gly Asp Arg Asp Ala Asp Ile Glu Ala Ile Met Phe Asp His
1235 1240 1245
Thr Gln Ser Asp Val Ala Tyr Thr Asp Arg Ala Thr Leu Asn Pro
1250 1255 1260
Trp Ala Ser Gln Lys His Ser Tyr Gly Asp Arg Leu Tyr Asn Gly
1265 1270 1275
Thr Tyr Asn Leu Thr Gly Ala Ser Pro Ile Tyr Ser Pro Cys Phe
1280 1285 1290
Lys Phe Phe Thr Pro Ala Glu Val Asn Thr Asn Cys Asn Thr Leu
1295 1300 1305
Asp Arg Leu Leu Met Glu Ala Lys Ala Val Ala Ser Gln Ser Ser
1310 1315 1320
Thr Asp Thr Glu Tyr Gln Phe Lys Arg Pro Pro Gly Ser Thr Glu
1325 1330 1335
Met Thr Gln Asp Pro Cys Gly Leu Phe Gln Gln Ala Tyr Pro Pro
1340 1345 1350
Leu Cys Ser Ser Asp Ala Ala Met Leu Arg Thr Ala His Ala Gly
1355 1360 1365
Glu Thr Gly Ala Asp Glu Val His Leu Ala Gln Tyr Leu Ile Arg
1370 1375 1380
Asp Ala Ser Pro Leu Arg Gly Cys Leu Pro Leu Pro Arg
1385 1390 1395
<210>29
<211>316
<212>PRT
<213>水痘带状疱疹
<400>29
Met Ala Met Pro Phe Glu Ile Glu Val Leu Leu Pro Gly Glu Leu Ser
1 5 10 15
Pro Ala Glu Thr Ser Ala Leu Gln Lys Cys Glu Gly Lys Ile Ile Thr
20 25 30
Phe Ser Thr Leu Arg His Arg Ala Ser Leu Val Asp Ile Ala Leu Ser
35 40 45
Ser Tyr Tyr Ile Asn Gly Ala Pro Pro Asp Thr Leu Ser Leu Leu Glu
50 55 60
Ala Tyr Arg Met Arg Phe Ala Ala Val Ile Thr Arg Val Ile Pro Gly
65 70 75 80
Lys Leu Leu Ala His Ala Ile Gly Val Gly Thr Pro Thr Pro Gly Leu
85 90 95
Phe Ile Gln Asn Thr Ser Pro Val Asp Leu Cys Asn Gly Asp Tyr Ile
100 105 110
Cys Leu Leu Pro Pro Val Phe Gly Ser Ala Asp Ser Ile Arg Leu Asp
115 120 125
Ser Val Gly Leu Glu Ile Val Phe Pro Leu Thr Ile Pro Gln Thr Leu
130 135 140
Met Arg Glu Ile Ile Ala Lys Val Val Ala Arg Ala Val Glu Arg Thr
145 150 155 160
Ala Ala Gly Ala Gln Ile Leu Pro His Glu Val Leu Arg Gly Ala Asp
165 170 175
Val Ile Cys Tyr Asn Gly Arg Arg Tyr Glu Leu Glu Thr Asn Leu Gln
180 185 190
His Arg Asp Gly Ser Asp Ala Ala Ile Arg Thr Leu Val Leu Asn Leu
195 200 205
Met Phe Ser Ile Asn Glu Gly Cys Leu Leu Leu Leu Ala Leu Ile Pro
210 215 220
Thr Leu Leu Val Gln Gly Ala His Asp Gly Tyr Val Asn Leu Leu Ile
225 230 235 240
Gln Thr Ala Asn Cys Val Arg Glu Thr Gly Gln Leu Ile Asn Ile Pro
245 250 255
Pro Met Pro Arg Ile Gln Asp Gly His Arg Arg Phe Pro Ile Tyr Glu
260 265 270
Thr Ile Ser Ser Trp Ile Ser Thr Ser Ser Arg Leu Gly Asp Thr Leu
275 280 285
Gly Thr Arg Ala Ile Leu Arg Val Cys Val Phe Asp Gly Pro Ser Thr
290 295 300
Val His Pro Gly Asp Arg Thr Ala Val Ile Gln Val
305 310 315
<210>30
<211>676
<212>PRT
<213>水痘带状疱疹
<400>30
Met Glu Ala His Leu Ala Asn Glu Thr Lys His Ala Leu Trp His Asn
1 5 10 15
Asp His Thr Lys Gly Leu Leu His Val Val Ile Pro Asn Ala Gly Leu
20 25 30
Ile Ala Ala Gly Ile Asp Pro Ala Leu Leu Ile Leu Lys Lys Pro Gly
35 40 45
Gln Arg Phe Lys Val Glu Val Gln Thr Arg Tyr His Ala Thr Gly Gln
50 55 60
Cys Glu Pro Trp Cys Gln Val Phe Ala Ala Tyr Ile Pro Asp Asn Ala
65 70 75 80
Leu Thr Asn Leu Leu Ile Pro Lys Thr Glu Pro Phe Val Ser His Val
85 90 95
Phe Ser Ala Thr His Asn Ser Gly Gly Leu Ile Leu Ser Leu Pro Val
100 105 110
Tyr Leu Ser Pro Gly Leu Phe Phe Asp Ala Phe Asn Val Val Ala Ile
115 120 125
Arg Ile Asn Thr Gly Asn Arg Lys His Arg Asp Ile Cys Ile Met Tyr
130 135 140
Ala Glu Leu Ile Pro Asn Gly Thr Arg Tyr Phe Ala Asp Gly Gln Arg
145 150 155 160
Val Leu Leu Leu Cys Lys Gln Leu Ile Ala Tyr Ile Arg Cys Thr Pro
165 170 175
Arg Leu Ala Ser Ser Ile Lys Ile Tyr Ala Glu His Met Val Ala Ala
180 185 190
Met Gly Glu Ser His Thr Ser Asn Gly Asp Asn Ile Gly Pro Val Ser
195 200 205
Ser Ile Ile Asp Leu Asp Arg Gln Leu Thr Ser Gly Gly Ile Asp Asp
210 215 220
Ser Pro Ala Glu Thr Arg Ile Gln Glu Asn Asn Arg Asp Val Leu Glu
225 230 235 240
Leu Ile Lys Arg Ala Val Asn Ile Val Asn Ser Arg His Pro Val Arg
245 250 255
Pro Ser Ser Ser Arg Val Ala Ser Gly Leu Leu Gln Ser Ala Lys Gly
260 265 270
His Gly Ala Gln Thr Ser Asn Thr Asp Pro Ile Asn Asn Gly Ser Phe
275 280 285
Asp Gly Val Leu Glu Pro Pro Gly Gln Gly Arg Phe Thr Gly Lys Lys
290 295 300
Asn Asn Ser Ser Ala Ser Ile Pro Pro Leu Gln Asp Val Leu Leu Phe
305 310 315 320
Thr Pro Ala Ser Thr Glu Pro Gln Ser Leu Met Glu Trp Phe Asp Ile
325 330 335
Cys Tyr Ala Gln Leu Val Ser Gly Asp Thr Pro Ala Asp Phe Trp Lys
340 345 350
Arg Arg Pro Leu Ser Ile Val Pro Arg His Tyr Ala Glu Ser Pro Ser
355 360 365
Pro Leu Ile Val Val Ser Tyr Asn Gly Ser Ser Ala Trp Gly Gly Arg
370 375 380
Ile Thr Gly Ser Pro Ile Leu Tyr His Ser Ala Gln Ala Ile Ile Asp
385 390 395 400
Ala Ala Cys Ile Asn Ala Arg Val Asp Asn Pro Gln Ser Leu His Val
405 410 415
Thr Ala Arg Gln Glu Leu Val Ala Arg Leu Pro Phe Leu Ala Asn Val
420 425 430
Leu Asn Asn Gln Thr Pro Leu Pro Ala Phe Lys Pro Gly Ala Glu Met
435 440 445
Phe Leu Asn Gln Val Phe Lys Gln Ala Cys Val Thr Ser Leu Thr Gln
450 455 460
Gly Leu Ile Thr Glu Leu Gln Thr Asn Pro Thr Leu Gln Gln Leu Met
465 470 475 480
Glu Tyr Asp Ile Ala Asp Ser Ser Gln Thr Val Ile Asp Glu Ile Val
485 490 495
Ala Arg Thr Pro Asp Leu Ile Gln Thr Ile Val Ser Val Leu Thr Glu
500 505 510
Met Ser Met Asp Ala Phe Tyr Asn Ser Ser Leu Met Tyr Ala Val Leu
515 520 525
Ala Tyr Leu Ser Ser Val Tyr Thr Arg Pro Gln Gly Gly Gly Tyr Ile
530 535 540
Pro Tyr Leu His Ala Ser Phe Pro Cys Trp Leu Gly Asn Arg Ser Ile
545 550 555 560
Tyr Leu Phe Asp Tyr Tyr Asn Ser Gly Gly Glu Ile Leu Lys Leu Ser
565 570 575
Lys Val Pro Val Pro Val Ala Leu Glu Lys Val Gly Ile Gly Asn Ser
580 585 590
Thr Gln Leu Arg Gly Lys Phe Ile Arg Ser Ala Asp Ile Val Asp Ile
595 600 605
Gly Ile Cys Ser Lys Tyr Leu Pro Gly Gln Cys Tyr Ala Tyr Ile Cys
610 615 620
Leu Gly Phe Asn Gln Gln Leu Gln Ser Ile Leu Val Leu Pro Gly Gly
625 630 635 640
Phe Ala Ala Cys Phe Cys Ile Thr Asp Thr Leu Gln Ala Ala Leu Pro
645 650 655
Ala Ser Leu Ile Gly Pro Ile Leu Asp Arg Phe Cys Phe Ser Ile Pro
660 665 670
Asn Pro His Lys
675
<210>31
<211>363
<212>PRT
<213>水痘带状疱疹
<400>31
Met Glu Leu Gln Arg Ile Phe Pro Leu Tyr Thr Ala Thr Gly Ala Ala
1 5 10 15
Arg Lys Leu Thr Pro Glu Ala Val Gln Arg Leu Cys Asp Ala Leu Thr
20 25 30
Leu Asp Met Gly Leu Trp Lys Ser Ile Leu Thr Asp Pro Arg Val Lys
35 40 45
Ile Met Arg Ser Thr Ala Phe Ile Thr Leu Arg Ile Ala Pro Phe Ile
50 55 60
Pro Leu Gln Thr Asp Thr Thr Asn Ile Ala Val Val Val Ala Thr Ile
65 70 75 80
Tyr Ile Thr Arg Pro Arg Gln Met Asn Leu Pro Pro Lys Thr Phe His
85 90 95
Val Ile Val Asn Phe Asn Tyr Glu Val Ser Tyr Ala Met Thr Ala Thr
100 105 110
Leu Arg Ile Tyr Pro Val Glu Asn Ile Asp His Val Phe Gly Ala Thr
115 120 125
Phe Lys Asn Pro Ile Ala Tyr Pro Leu Pro Thr Ser Ile Pro Asp Pro
130 135 140
Arg Ala Asp Pro Thr Pro Ala Asp Leu Thr Pro Thr Pro Asn Leu Ser
145 150 155 160
Asn Tyr Leu Gln Pro Pro Arg Leu Pro Lys Asn Pro Tyr Ala Cys Lys
165 170 175
Val Ile Ser Pro Gly Val Trp Trp Ser Asp Glu Arg Arg Arg Leu Tyr
180 185 190
Val Leu Ala Met Glu Pro Asn Leu Ile Gly Leu Cys Pro Ala Gly Trp
195 200 205
His Ala Arg Ile Leu Gly Ser Val Leu Asn Arg Leu Leu Ser His Ala
210 215 220
Asp Gly Cys Asp Glu Cys Asn His Arg Val His Val Gly Ala Leu Tyr
225 230 235 240
Ala Leu Pro His Val Thr Asn His Ala Glu Gly Cys Val Cys Trp Ala
245 250 255
Pro Cys Met Trp Arg Lys Ala Gly Gln Arg Glu Leu Lys Val Glu Val
260 265 270
Asp Ile Gly Ala Thr Gln Val Leu Phe Val Asp Val Thr Thr Cys Ile
275 280 285
Arg Ile Thr Ser Thr Lys Asn Pro Arg Ile Thr Ala Asn Leu Gly Asp
290 295 300
Val Ile Ala Gly Thr Asn Ala Ser Gly Leu Ser Val Pro Val Asn Ser
305 310 315 320
Ser Gly Trp Gln Leu Tyr Met Phe Gly Glu Thr Leu Ser Arg Ala Ile
325 330 335
Ile Asn Gly Cys Gly Leu Leu Gln Arg Ile Cys Phe Pro Glu Thr Gln
340 345 350
Arg Leu Ser Gly Glu Pro Glu Pro Thr Thr Thr
355 360
<210>32
<211>199
<212>PRT
<213>水痘带状疱疹
<400>32
Met Ser Gly His Thr Pro Thr Tyr Ala Ser His Arg Arg Asn Arg Val
1 5 10 15
Lys Leu Val Glu Ala His Asn Arg Ala Gly Leu Phe Lys Glu Arg Thr
20 25 30
Leu Asp Leu Ile Arg Gly Gly Ala Ser Val Gln Asp Pro Ala Phe Val
35 40 45
Tyr Ala Phe Thr Ala Ala Lys Glu Ala Cys Ala Asp Leu Asn Asn Gln
50 55 60
Leu Arg Ser Ala Ala Arg Ile Ala Ser Val Glu Gln Lys Ile Arg Asp
65 70 75 80
Ile Gln Ser Lys Val Glu Glu Gln Thr Ser Ile Gln Gln Ile Leu Asn
85 90 95
Thr Asn Arg Arg Tyr Ile Ala Pro Asp Phe Ile Arg Gly Leu Asp Lys
100 105 110
Thr Glu Asp Asp Asn Thr Asp Asn Ile Asp Arg Leu Glu Asp Ala Val
115 120 125
Gly Pro Asn Ile Glu His Glu Asn His Thr Trp Phe Gly Glu Asp Asp
130 135 140
Glu Ala Leu Leu Thr Gln Trp Met Leu Thr Thr His Pro Pro Thr Ser
145 150 155 160
Lys Tyr Leu Gln Leu Gln Asp Leu Cys Val Pro Thr Thr Ile Pro Thr
165 170 175
Asp Met Asn Gln Met Gln Pro Gln Pro Ile Ser Lys Asn Glu Asn Pro
180 185 190
Pro Thr Pro His Thr Asp Val
195
<210>33
<211>551
<212>PRT
<213>水痘带状疱疹
<400>33
Met Ala Arg Ser Gly Leu Asp Arg Ile Asp Ile Ser Pro Gln Pro Ala
1 5 10 15
Lys Lys Ile Ala Arg Val Gly Gly Leu Gln His Pro Phe Val Lys Thr
20 25 30
Asp Ile Asn Thr Ile Asn Val Glu His His PheIle Asp Thr Leu Gln
35 40 45
Lys Thr Ser Pro Asn Met Asp Cys Arg Gly Met Thr Ala Gly Ile Phe
50 55 60
Ile Arg Leu Ser His Met Tyr Lys Ile Leu Thr Thr Leu Glu Ser Pro
65 70 75 80
Asn Asp Val Thr Tyr Thr Thr Pro Gly Ser Thr Asn Ala Leu Phe Phe
85 90 95
Lys Thr Ser Thr Gln Pro Gln Glu Pro Arg Pro Glu Glu Leu Ala Ser
100 105 110
Lys Leu Thr Gln Asp Asp Ile Lys Arg Ile Leu Leu Thr Ile Glu Ser
115 120 125
Glu Thr Arg Gly Gln Gly Asp Asn Ala Ile Trp Thr Leu Leu Arg Arg
130 135 140
Asn Leu Ile Thr Ala Ser Thr Leu Lys Trp Ser Val Ser Gly Pro Val
145 150 155 160
Ile Pro Pro Gln Trp Phe Tyr His His Asn Thr Thr Asp Thr Tyr Gly
165 170 175
Asp Ala Ala Ala Met Ala Phe Gly Lys Thr Asn Glu Pro Ala Ala Arg
180 185 190
Ala Ile Val Glu Ala Leu Phe Ile Asp Pro Ala Asp Ile Arg Thr Pro
195 200 205
Asp His Leu Thr Pro Glu Ala Thr Thr Lys Phe Phe Asn Phe Asp Met
210 215 220
Leu Asn Thr Lys Ser Pro Ser Leu Leu Val Gly Thr Pro Arg Ile Gly
225 230 235 240
Thr Tyr Glu Cys Gly Leu Leu Ile Asp Val Arg Thr Gly Leu Ile Gly
245 250 255
Ala Ser Leu Asp Val Leu Val Cys Asp Arg Asp Pro Leu Thr Gly Thr
260 265 270
Leu Asn Pro His Pro Ala Glu Thr Asp Ile Ser Phe Phe Glu Ile Lys
275 280 285
Cys Arg Ala Lys Tyr Leu Phe Asp Pro Asp Asp Lys Asn Asn Pro Leu
290 295 300
Gly Arg Thr Tyr Thr Thr Leu Ile Asn Arg Pro Thr Met Ala Asn Leu
305 310 315 320
Arg Asp Phe Leu Tyr Thr Ile Lys Asn Pro Cys Val Ser Phe Phe Gly
325 330 335
Pro Ser Ala Asn Pro Ser Thr Arg Glu Ala Leu Ile Thr Asp His Val
340 345 350
Glu Trp Lys Arg Leu Gly Phe Lys Gly Gly Arg Ala Leu Thr Glu Leu
355 360 365
Asp Ala His His Leu Gly Leu Asn Arg Thr Ile Ser Ser Arg Val Trp
370 375 380
Val Phe Asn Asp Pro Asp Ile Gln Lys Gly Thr Ile Thr Thr Ile Ala
385 390 395 400
Trp Ala Thr Gly Asp Thr Ala Leu Gln Ile Pro Val Phe Ala Asn Pro
405 410 415
Arg His Ala Asn Phe Lys Gln Ile Ala Val Gln Thr Tyr Val Leu Ser
420 425 430
Gly Tyr Phe Pro Ala Leu Lys Leu Arg Pro Phe Leu Val Thr Phe Ile
435 440 445
Gly Arg Val Arg Arg Pro His Glu Val Gly Val Pro Leu Arg Val Asp
450 455 460
Thr Gln Ala Ala Ala Ile Tyr Glu Tyr Asn Trp Pro Thr Ile Pro Pro
465 470 475 480
His Cys Ala Val Pro Val Ile Ala Val Leu Thr Pro Ile Glu Val Asp
485 490 495
Val Pro Arg Val Thr Gln Ile Leu Lys Asp Thr Gly Asn Asn Ala Ile
500 505 510
Thr Ser Ala Leu Arg Ser Leu Arg Trp Asp Asn Leu His Pro Ala Val
515 520 525
Glu Glu Glu Ser Val Asp Cys Ala Asn Gly Thr Thr Ser Leu Leu Arg
530 535 540
Ala Thr Glu Lys Pro Leu Leu
545 550
<210>34
<211>835
<212>PRT
<213>水痘带状疱疹
<400>34
Met Ser Pro Asn Thr Gly Glu Ser Asn Ala Ala Val Tyr Ala Ser Ser
1 5 10 15
Thr Gln Leu Ala Arg Ala Leu Tyr Gly Gly Asp Leu Val Ser Trp Ile
20 25 30
Lys His Thr His Pro Gly Ile Ser Leu Glu Leu Gln Leu Asp Val Pro
35 40 45
Val Lys Leu Ile Lys Pro Gly Met Ser Gln Thr Arg Pro Val Thr Val
50 55 60
Val Arg Ala Pro Met Gly Ser Gly Lys Thr Thr Ala Leu Leu Glu Trp
65 70 75 80
Leu Gln His Ala Leu Lys Ala Asp Ile Ser Val Leu Val Val Ser Cys
85 90 95
Arg Arg Ser Phe Thr Gln Thr Leu Ile Gln Arg Phe Asn Asp Ala Gly
100 105 110
Leu Ser Gly Phe Val Thr Tyr Leu Thr Ser Glu Thr Tyr Ile Met Gly
115 120 125
Phe Lys Arg Leu Ile Val Gln Leu Glu Ser Leu His Arg Val Ser Ser
130 135 140
Glu Ala Ile Asp Ser Tyr Asp Val Leu Ile Leu Asp Glu Val Met Ser
145 150 155 160
Val Ile Gly Gln Leu Tyr Ser Pro Thr Met Arg Arg Leu Ser Ala Val
165 170 175
Asp Ser Leu Leu Tyr Arg Leu Leu Asn Arg Cys Ser Gln Ile Ile Ala
180 185 190
Met Asp Ala Thr Val Asn Ser Gln Phe Ile Asp Leu Ile Ser Gly Leu
195 200 205
Arg Gly Asp Glu Asn Ile His Thr Ile Val Cys Thr Tyr Ala Gly Val
210 215 220
Gly Phe Ser Gly Arg Thr Cys Thr Ile Leu Arg Asp Met Gly Ile Asp
225 230 235 240
Thr Leu Val Arg Val Ile Lys Arg Ser Pro Glu His Glu Asp Val Arg
245 250 255
Thr Ile His Gln Leu Arg Gly Thr Phe Phe Asp Glu Leu Ala Leu Arg
260 265 270
Leu Gln Cys Gly His Asn Ile Cys Ile Phe Ser Ser Thr Leu Ser Phe
275 280 285
Ser Glu Leu Val Ala Gln Phe Cys Ala Ile Phe Thr Asp Ser Ile Leu
290 295 300
Ile Leu Asn Ser Thr Arg Pro Leu Cys Asn Val Asn Glu Trp Lys His
305 310 315 320
Phe Arg Val Leu Val Tyr Thr Thr Val Val Thr Val Gly Leu Ser Phe
325 330 335
Asp Met Ala His Phe His Ser Met Phe Ala Tyr Ile Lys Pro Met Ser
340 345 350
Tyr Gly Pro Asp Met Val Ser Val Tyr Gln Ser Leu Gly Arg Val Arg
355 360 365
Leu Leu Leu Leu Asn Glu Val Leu Met Tyr Val Asp Gly Ser Arg Thr
370 375 380
Arg Cys Gly Pro Leu Phe Ser Pro Met Leu Leu Asn Phe Thr Ile Ala
385 390 395 400
Asn Lys Phe Gln Trp Phe Pro Thr His Thr Gln Ile Thr Asn Lys Leu
405 410 415
Cys Cys Ala Phe Arg Gln Arg Cys Ala Asn Ala Phe Thr Arg Ser Asn
420 425 430
Thr His Leu Phe Ser Arg Phe Lys Tyr Lys His Leu Phe Glu Arg Cys
435 440 445
Ser Leu Trp Ser Leu Ala Asp Ser Ile Asn Ile Leu Gln Thr Leu Leu
450 455 460
Ala Ser Asn Gln Ile Leu Val Val Leu Asp Gly Met Gly Pro Ile Thr
465 470 475 480
Asp Val Ser Pro Val Gln Phe Cys Ala Phe Ile His Asp Leu Arg His
485 490 495
Ser Ala Asn Ala Val Ala Ser Cys Met Arg Ser Leu Arg Gln Asp Asn
500 505 510
Asp Ser Cys Leu Thr Asp Phe Gly Pro Ser Gly Phe Met Ala Asp Asn
515 520 525
Ile Thr Ala Phe Met Glu Lys Tyr Leu Met Glu Ser Ile Asn Thr Glu
530 535 540
Glu Gln Ile Lys Val Phe Lys Ala Leu Ala Cys Pro Ile Glu Gln Pro
545 550 555 560
Arg Leu Val Asn Thr Ala Ile Leu Gly Ala Cys Ile Arg Ile Pro Glu
565 570 575
Ala Leu Glu Ala Phe Asp Val Phe Gln Lys Ile Tyr Thr His Tyr Ala
580 585 590
Ser Gly Trp Phe Pro Val Leu Asp Lys Thr Gly Glu Phe Ser Ile Ala
595 600 605
Thr Ile Thr Thr Ala Pro Asn Leu Thr Thr His Trp Glu Leu Phe Arg
610 615 620
Arg Cys Ala Tyr Ile Ala Lys Thr Leu Lys Trp Asn Pro Ser Thr Glu
625 630 635 640
Gly Cys Val Thr Gln Val Leu Asp Thr Asp Ile Asn Thr Leu Phe Asn
645 650 655
Gln His Gly Asp Ser Leu Ala Gln Leu Ile Phe Glu Val Met Arg Cys
660 665 670
Asn Val Thr Asp Ala Lys Ile Ile Leu Asn Arg Pro Val Trp Arg Thr
675 680 685
Thr Gly Phe Leu Asp Gly Cys His Asn Gln Cys Phe Arg Pro Ile Pro
690 695 700
Thr Lys His Glu Tyr Asn Ile Ala Leu Phe Arg Leu Ile Trp Glu Gln
705v 710 715 720
Leu Phe Gly Ala Arg Val Thr Lys Ser Thr Gln Thr Phe Pro Gly Ser
725 730 735
Thr Arg Val Lys Asn Leu Lys Lys Lys Asp Leu Glu Thr Leu Leu Asp
740 745 750
Ser Ile Asn Val Asp Arg Ser Ala Cys Arg Thr Tyr Arg Gln Leu Tyr
755 760 765
Asn Leu Leu Met Ser Gln Arg His Ser Phe Ser Gln Gln Arg Tyr Lys
770 775 780
Ile Thr Ala Pro Ala Trp Ala Arg His Val Tyr Phe Gln Ala His Gln
785 790 795 800
Met His Leu Ala Pro His Ala Glu Ala Met Leu Gln Leu Ala Leu Ser
805 810 815
Glu Leu Ser Pro Gly Ser Trp Pro Arg Ile Asn Gly Ala Val Asn Phe
820 825 830
Glu Ser Leu
835
<210>35
<211>771
<212>PRT
<213>水痘带状疱疹
<400>35
Met Asp Ala Thr Gln Ile Thr Leu Val Arg Glu Ser Gly His Ile Cys
1 5 10 15
Ala Ala Ser Ile Tyr Thr Ser Trp Thr Gln Ser Gly Gln Leu Thr Gln
20 25 30
Asn Gly Leu Ser Val Leu Tyr Tyr Leu Leu Cys Lys Asn Ser Cys Gly
35 40 45
Lys Tyr Val Pro Lys Phe Ala Glu Ile Thr Val Gln Gln Glu Asp Leu
50 55 60
Cys Arg Tyr Ser Arg His Gly Gly Ser Val Ser Ala Ala Thr Phe Ala
65 70 75 80
Ser Ile Cys Arg Ala Ala Ser Ser Ala Ala Leu Asp Ala Trp Pro Leu
85 90 95
Glu Pro Leu Gly Asn Ala Asp Thr Trp Arg Cys Leu His Gly Thr Ala
100 105 110
Leu Ala Thr Leu Arg Arg Val Leu Gly Phe Lys Ser Phe Tyr Ser Pro
115 120 125
Val Thr Phe Glu Thr Asp Thr Asn Thr Gly Leu Leu Leu Lys Thr Ile
130 135 140
Pro Asp Glu His Ala Leu Asn Asn Asp Asn Thr Pro Ser Thr Gly Val
145 150 155 160
Leu Arg Ala Asn Phe Pro Val Ala Ile Asp Val Ser Ala Val Ser Ala
165 170 175
Cys Asn Ala His Thr Gln Gly Thr Ser Leu Ala Tyr Ala Arg Leu Thr
180 185 190
Ala Leu Lys Ser Asn Gly Asp Thr Gln Gln Gln Thr Pro Leu Asp Val
195 200 205
Glu Val Ile Thr Pro Lys Ala Tyr Ile Arg Arg Lys Tyr Lys Ser Thr
210 215 220
Phe Ser Pro Pro Ile Glu Arg Glu Gly Gln Thr Ser Asp Leu Phe Asn
225 230 235 240
Leu Glu Glu Arg Arg Leu Val Leu Ser Gly Asn Arg Ala Ile Val Val
245 250 255
Arg Val Leu Leu Pro Cys Tyr Phe Asp Cys Leu Thr Thr Asp Ser Thr
260 265 270
Val Thr Ser Ser Leu Ser Ile Leu Ala Thr Tyr Arg Leu Trp Tyr Ala
275 280 285
Ala Ala Phe Gly Lys Pro Gly Val Val Arg Pro Ile Phe Ala Tyr Leu
290 295 300
Gly Pro Glu Leu Asn Pro Lys Gly Glu Asp Arg Asp Tyr Phe Cys Thr
305 310 315 320
Val Gly Phe Pro Gly Trp Thr Thr Leu Arg Thr Gln Thr Pro Ala Val
325 330 335
Glu Ser Ile Arg Thr Ala Thr Glu Met Tyr Met Glu Thr Asp Gly Leu
340 345 350
Trp Pro Val Thr Gly Ile Gln Ala Phe His Tyr Leu Ala Pro Trp Gly
355 360 365
Gln His Pro Pro Leu Pro Pro Arg Val Gln Asp Leu Ile Gly Gln Ile
370 375 380
Pro Gln Asp Thr Gly His Ala Asp Ala Thr Val Asn Trp Asp Ala Gly
385 390 395 400
Arg Ile Ser Thr Val Phe Lys Gln Pro Val Gln Leu Gln Asp Arg Trp
405 410 415
Met Ala Lys Phe Asp Phe Ser Ala Phe Phe Pro Thr Ile Tyr Cys Ala
420 425 430
Met Phe Pro Met His Phe Arg Leu Gly Lys Ile Val Leu Ala Arg Met
435 440 445
Arg Arg Gly Met Gly Cys Leu Lys Pro Ala Leu Val Ser Phe Phe Gly
450 455 460
Gly Leu Arg His Ile Leu Pro Ser Ile Tyr Lys Ala Ile Ile Phe Ile
465 470 475 480
Ala Asn Glu Ile Ser Leu Cys Val Glu Gln Thr Ala Leu Glu Gln Gly
485 490 495
Phe Ala Ile Cys Thr Tyr Ile Lys Asp Gly Phe Trp Gly Ile Phe Thr
500 505 510
Asp Leu His Thr Arg Asn Val Cys Ser Asp Gln Ala Arg Cys Ser Ala
515 520 525
Leu Asn Leu Ala Ala Thr Cys Glu Arg Ala Val Thr Gly Leu Leu Arg
530 535 540
Ile Gln Leu Gly Leu Asn Phe Thr Pro Ala Met Glu Pro Val Leu Arg
545 550 555 560
Val Glu Gly Val Tyr Thr His Ala Phe Thr Trp Cys Thr Thr Gly Ser
565 570 575
Trp Leu Trp Asn Leu Gln Thr Asn Thr Pro Pro Asp Leu Val Gly Val
580 585 590
Pro Trp Arg Ser Gln Ala Ala Arg Asp Leu Lys Glu Arg Leu Ser Gly
595 600 605
Leu Leu Cys Thr Ala Thr Lys Ile Arg Glu Arg Ile Gln Glu Asn Cys
610 615 620
Ile Trp Asp His Val Leu Tyr Asp Ile Trp Ala Gly Gln Val Val Glu
625 630 635 640
Ala Ala Arg Lys Thr Tyr Val Asp Phe Phe Glu His Val Phe Asp Arg
645 650 655
Arg Tyr Thr Pro Val Tyr Trp Ser Leu Gln Glu Gln Asn Ser Glu Thr
660 665 670
Lys Ala Ile Pro Ala Ser Tyr Leu Thr Tyr Gly His Met Gln Asp Lys
675 680 685
Asp Tyr Lys Pro Arg Gln Ile Ile Met Val Arg Asn Pro Asn Pro His
690 695 700
Gly Pro Pro Thr Val Val Tyr Trp Glu Leu Leu Pro Ser Cys Ala Cys
705 710 715 720
Ile Pro Pro Ile Asp Cys Ala Ala His Leu Lys Pro Leu Ile His Thr
725 730 735
Phe Val Thr Ile Ile Asn His Leu Leu Asp Ala His Asn Asp Phe Ser
740 745 750
Ser Pro Ser Leu Lys Phe Thr Asp Asp Pro Leu Ala Ser Tyr Asn Phe
755 760 765
Leu Phe Leu
770
<210>36
<211>881
<212>PRT
<213>水痘带状疱疹
<400>36
Met Lys Arg Ser Ile Ser Val Asp Ser Ser Ser Pro Lys Asn Val Phe
1 5 10 15
Asn Pro Glu Thr Pro Asn Gly Phe Asp Asp Ser Val Tyr Leu Asn Phe
20 25 30
Thr Ser Met His Ser Ile Gln Pro Ile Leu Ser Arg Ile Arg Glu Leu
35 40 45
Ala Ala Ile Thr Ile Pro Lys Glu Arg Val Pro Arg Leu Cys Trp Phe
50 55 60
Lys Gln Leu Leu Glu Leu Gln Ala Pro Pro Glu Met Gln Arg Asn Glu
65 70 75 80
Leu Pro Phe Ser Val Tyr Leu Ile Ser Gly Asn Ala Gly Ser Gly Lys
85 90 95
Ser Thr Cys Ile Gln Thr Leu Asn Glu Ala Ile Asp Cys Ile Ile Thr
100 105 110
Gly Ser Thr Arg Val Ala Ala Gln Asn Val His Ala Lys Leu Ser Thr
115 120 125
Ala Tyr Ala Ser Arg Pro Ile Asn Thr Ile Phe His Glu Phe Gly Phe
130 135 140
Arg Gly Asn His Ile Gln Ala Gln Leu Gly Arg Tyr Ala Tyr Asn Trp
145 150 155 160
Thr Thr Thr Pro Pro Ser Ile Glu Asp Leu Gln Lys Arg Asp Ile Val
165 170 175
Tyr Tyr Trp Glu Val Leu Ile Asp Ile Thr Lys Arg Val Phe Gln Met
180 185 190
Gly Asp Asp Gly Arg Gly Gly Thr Ser Thr Phe Lys Thr Leu Trp Ala
195 200 205
Ile Glu Arg Leu Leu Asn Lys Pro Thr Gly Ser Met Ser Gly Thr Ala
210 215 220
Phe Ile Ala Cys Gly Ser Leu Pro Ala Phe Thr Arg Ser Asn Val Ile
225 230 235 240
Val Ile Asp Glu Ala Gly Leu Leu Gly Arg His Ile Leu Thr Ala Val
245 250 255
Val Tyr Cys Trp Trp Leu Leu Asn Ala Ile Tyr Gln Ser Pro Gln Tyr
260 265 270
Ile Asn Gly Arg Lys Pro Val Ile Val Cys Val Gly Ser Pro Thr Gln
275 280 285
Thr Asp Ser Leu Glu Ser His Phe Gln His Asp Met Gln Arg Ser His
290 295 300
Val Thr Pro Ser Glu Asn Ile Leu Thr Tyr Ile Ile Cys Asn Gln Thr
305 310 315 320
Leu Arg Gln Tyr Thr Asn Ile Ser His Asn Trp Ala Ile Phe Ile Asn
325 330 335
Asn Lys Arg Cys Gln Glu Asp Asp Phe Gly Asn Leu Leu Lys Thr Leu
340 345 350
Glu Tyr Gly Leu Pro Ile Thr Glu Ala His Ala Arg Leu Val Asp Thr
355 360 365
Phe Val Val Pro Ala Ser Tyr Ile Asn Asn Pro Ala Asn Leu Pro Gly
370 375 380
Trp Thr Arg Leu Tyr Ser Ser His Lys Glu Val Ser Ala Tyr Met Ser
385 390 395 400
Lys Leu His Ala His Leu Lys Leu Ser Lys Asn Asp His Phe Ser Val
405 410 415
Phe Ala Leu Pro Thr Tyr Thr Phe Ile Arg Leu Thr Ala Phe Asp Glu
420 425 430
Tyr Arg Lys Leu Thr Gly Gln Pro Gly Leu Ser Val Glu His Trp Ile
435 440 445
Arg Ala Asn Ser Gly Arg Leu His Asn Tyr Ser Gln Ser Arg Asp His
450 455 460
Asp Met Gly Thr Val Lys Tyr Glu Thr His Ser Asn Arg Asp Leu Ile
465 470 475 480
Val Ala Arg Thr Asp Ile Thr Tyr Val Leu Asn Ser Leu Val Val Val
485 490 495
Thr Thr Arg Leu Arg Lys Leu Val Ile Gly Phe Ser Gly Thr Phe Gln
500 505 510
Ser Phe Ala Lys Val Leu Arg Asp Asp Ser Phe Val Lys Ala Arg Gly
515 520 525
Glu Thr Ser Ile Glu Tyr Ala Tyr Arg Phe Leu Ser Asn Leu Ile Phe
530 535 540
Gly Gly Leu Ile Asn Phe Tyr Asn Phe Leu Leu Asn Lys Asn Leu His
545 550 555 560
Pro Asp Lys Val Ser Leu Ala Tyr Lys Arg Leu Ala Ala Leu Thr Leu
565 570 575
Glu Leu Leu Ser Gly Thr Asn Lys Ala Pro Leu His Glu Ala Ala Val
580 585 590
Asn Gly Ala Gly Ala Gly Ile Asp Cys Asp Gly Ala Ala Thr Ser Ala
595 600 605
Asp Lys Ala Phe Cys Phe Thr Lys Ala Pro Glu Ser Lys Val Thr Ala
610 615 620
Ser Ile Pro Glu Asp Pro Asp Asp Val Ile Phe Thr Ala Leu Asn Asp
625 630 635 640
Glu Val Ile Asp Leu Val Tyr Cys Gln Tyr Glu Phe Ser Tyr Pro Lys
645 650 655
Ser Ser Asn Glu Val His Ala Gln Phe Leu Leu Met Lys Ala Ile Tyr
660 665 670
Asp Gly Arg Tyr Ala Ile Leu Ala Glu Leu Phe Glu Ser Ser Phe Thr
675 680 685
Thr Ala Pro Phe Ser Ala Tyr Val Asp Asn Val Asn Phe Asn Gly Ser
690 695 700
Glu Leu Leu Ile Gly Asn Val Arg Gly Gly Leu Leu Ser Leu Ala Leu
705 710 715 720
Gln Thr Asp Thr Tyr Thr Leu Leu Gly Tyr Thr Phe Ala Pro Val Pro
725 730 735
Val Phe Val Glu Glu Leu Thr Arg Lys Lys Leu Tyr Arg Glu Thr Thr
740 745 750
Glu Met Leu Tyr Ala Leu His Val Pro Leu Met Val Leu Gln Asp Gln
755 760 765
His Gly Phe Val Ser Ile Val Asn Ala Asn Val Cys Glu Phe Thr Glu
770 775 780
Ser Ile Glu Asp Ala Glu Leu Ala Met Ala Thr Thr Val Asp Tyr Gly
785 790 795 800
Leu Ser Ser Lys Leu Ala Met Thr Ile Ala Arg Ser Gln Gly Leu Ser
805 810 815
Leu Glu Lys Val Ala Ile Cys Phe Thr Ala Asp Lys Leu Arg Leu Asn
820 825 830
Ser Val Tyr Val Ala Met Ser Arg Thr Val Ser Ser Arg Phe Leu Lys
835 840 845
Met Asn Leu Asn Pro Leu Arg Glu Arg Tyr Glu Lys Ser Ala Glu Ile
850 855 860
Ser Asp His Ile Leu Ala Ala Leu Arg Asp Pro Asn Val His Val Val
865 870 875 880
Tyr
<210>37
<211>278
<212>PRT
<213>水痘带状疱疹
<400>37
Met Phe Cys Thr Ser Pro Ala Thr Arg Gly Asp Ser Ser Glu Ser Lys
1 5 10 15
Pro Gly Ala Ser Val Asp Val Asn Gly Lys Met Glu Tyr Gly Ser Ala
20 25 30
Pro Gly Pro Leu Asn Gly Arg Asp Thr Ser Arg Gly Pro Gly Ala Phe
35 40 45
Cys Thr Pro Gly Trp Glu Ile His Pro Ala Arg Leu Val Glu Asp Ile
50 55 60
Asn Arg Val Phe Leu Cys Ile Ala Gln Ser Ser Gly Arg Val Thr Arg
65 70 75 80
Asp Ser Arg Arg Leu Arg Arg Ile Cys Leu Asp Phe Tyr Leu Met Gly
85 90 95
Arg Thr Arg Gln Arg Pro Thr Leu Ala Cys Trp Glu Glu Leu Leu Gln
100 105 110
Leu Gln Pro Thr Gln Thr Gln Cys Leu Arg Ala Thr Leu Met Glu Val
115 120 125
Ser His Arg Pro Pro Arg Gly Glu Asp Gly Phe Ile Glu Ala Pro Asn
130 135 140
Val Pro Leu His Arg Ser Ala Leu Glu Cys Asp Val Ser Asp Asp Gly
145 150 155 160
Gly Glu Asp Asp Ser Asp Asp Asp Gly Ser Thr Pro Ser Asp Val Ile
165 170 175
Glu Phe Arg Asp Ser Asp Ala Glu Ser Ser Asp Gly Glu Asp Phe Ile
180 185 190
Val Glu Glu Glu Ser Glu Glu Ser Thr Asp Ser Cys Glu Pro Asp Gly
195 200 205
Val Pro Gly Asp Cys Tyr Arg Asp Gly Asp Gly Cys Asn Thr Pro Ser
210 215 220
Pro Lys Arg Pro Gln Arg Ala Ile Glu Arg Tyr Ala Gly Ala Glu Thr
225 230 235 240
Ala Glu Tyr Thr Ala Ala Lys Ala Leu Thr Ala Leu Gly Glu Gly Gly
245 250 255
Val Asp Trp Lys Arg Arg Arg His Glu Ala Pro Arg Arg His Asp Ile
260 265 270
Pro Pro Pro His Gly Val
275
<210>38
<211>180
<212>PRT
<213>水痘带状疱疹
<400>38
Met Asn Leu Cys Gly Ser Arg Gly Glu His Pro Gly Gly Glu Tyr Ala
1 5 10 15
Gly Leu Tyr Cys Thr Arg His Asp Thr Pro Ala His Gln Ala Leu Met
20 25 30
Asn Asp Ala Glu Arg Tyr Phe Ala Ala Ala Leu Cys Ala Ile Ser Thr
35 40 45
Glu Ala Tyr Glu Ala Phe Ile His Ser Pro Ser Glu Arg Pro Cys Ala
50 55 60
Ser Leu Trp Gly Arg Ala Lys Asp Ala Phe Gly Arg Met Cys Gly Glu
65 70 75 80
Leu Ala Ala Asp Arg Gln Arg Pro Pro Ser Val Pro Pro Ile Arg Arg
85 90 95
Ala Val Leu Ser Leu Leu Arg Glu Gln Cys Met Pro Asp Pro Gln Ser
100 105 110
His Leu Glu Leu Ser Glu Arg Leu Ile Leu Met Ala Tyr Trp Cys Cys
115 120 125
Leu Gly His Ala Gly Leu Pro Thr Ile Gly Leu Ser Pro Asp Asn Lys
130 135 140
Cys Ile Arg Ala Glu Leu Tyr Asp Arg Pro Gly Gly Ile Cys His Arg
145 150 155 160
Leu Phe Asp Ala Tyr Leu Gly Cys Gly Ser Leu Gly Val Pro Arg Thr
165 170 175
Tyr Glu Arg Ser
180
<210>39
<211>393
<212>PRT
<213>水痘带状疱疹
<400>39
Met Asn Asp Val Asp Ala Thr Asp Thr Phe Val Gly Gln Gly Lys Phe
1 5 10 15
Arg Gly Ala Ile Ser Thr Ser Pro Ser His Ile Met Gln Thr Cys Gly
20 25 30
Phe Ile Gln Gln Met Phe Pro Val Glu Met Ser Pro Gly Ile Glu Ser
35 40 45
Glu Asp Asp Pro Asn Tyr Asp Val Asn Met Asp Ile Gln Ser Phe Asn
50 55 60
Ile Phe Asp Gly Val His Glu Thr Glu Ala Glu Ala Ser Val Ala Leu
65 70 75 80
Cys Ala Glu Ala Arg Val Gly Ile Asn Lys Ala Gly Phe Val Ile Leu
85 90 95
Lys Thr Phe Thr Pro Gly Ala Glu Gly Phe Ala Phe Ala Cys Met Asp
100 105 110
Ser Lys Thr Cys Glu His Val Val Ile Lys Ala Gly Gln Arg Gln Gly
115 120 125
Thr Ala Thr Glu Ala Thr Val Leu Arg Ala Leu Thr His Pro Ser Val
130 135 140
Val Gln Leu Lys Gly Thr Phe Thr Tyr Asn Lys Met Thr Cys Leu Ile
145 150 155 160
Leu Pro Arg Tyr Arg Thr Asp Leu Tyr Cys Tyr Leu Ala Ala Lys Arg
165 170 175
Asn Leu Pro Ile Cys Asp Ile Leu Ala Ile Gln Arg Ser Val Leu Arg
180 185 190
Ala Leu Gln Tyr Leu His Asn Asn Ser Ile Ile His Arg Asp Ile Lys
195 200 205
Ser Glu Asn Ile Phe Ile Asn His Pro Gly Asp Val Cys Val Gly Asp
210 215 220
Phe Gly Ala Ala Cys Phe Pro Val Asp Ile Asn Ala Asn Arg Tyr Tyr
225 230 235 240
Gly Trp Ala Gly Thr Ile Ala Thr Asn Ser Pro Glu Leu Leu Ala Arg
245 250 255
Asp Pro Tyr Gly Pro Ala Val Asp Ile Trp Ser Ala Gly Ile Val Leu
260 265 270
Phe Glu Met Ala Thr Gly Gln Asn Ser Leu Phe Glu Arg Asp Gly Leu
275 280 285
Asp Gly Asn Cys Asp Ser Glu Arg Gln Ile Lys Leu Ile Ile Arg Arg
290 295 300
Ser Gly Thr His Pro Asn Glu Phe Pro Ile Asn Pro Thr Ser Asn Leu
305 310 315 320
Arg Arg Gln Tyr Ile Gly Leu Ala Lys Arg Ser Ser Arg Lys Pro Gly
325 330 335
Ser Arg Pro Leu Trp Thr Asn Leu Tyr Glu Leu Pro Ile Asp Leu Glu
340 345 350
Tyr Leu Ile Cys Lys Met Leu Ser Phe Asp Ala Arg His Arg Pro Ser
355 360 365
Ala Glu Val Leu Leu Asn His Ser Val Phe Gln Thr Leu Pro Asp Pro
370 375 380
Tyr Pro Asn Pro Met Glu Val Gly Asp
385 390
<210>40
<211>354
<212>PRT
<213>水痘带状疱疹
<400>40
Met Phe Leu Ile Gln Cys Leu Ile Ser Ala Val Ile Phe Tyr Ile Gln
1 5 10 15
Val Thr Asn Ala Leu Ile Phe Lys Gly Asp His Val Ser Leu Gln Val
20 25 30
Asn Ser Ser Leu Thr Ser Ile Leu Ile Pro Met Gln Asn Asp Asn Tyr
35 40 45
Thr Glu Ile Lys Gly Gln Leu Val Phe Ile Gly Glu Gln Leu Pro Thr
50 55 60
Gly Thr Asn Tyr Ser Gly Thr Leu Glu Leu Leu Tyr Ala Asp Thr Val
65 70 75 80
Ala Phe Cys Phe Arg Ser Val Gln Val Ile Arg Tyr Asp Gly Cys Pro
85 90 95
Arg Ile Arg Thr Ser Ala Phe Ile Ser Cys Arg Tyr Lys His Ser Trp
100 105 110
His Tyr Gly Asn Ser Thr Asp Arg Ile Ser Thr Glu Pro Asp Ala Gly
115 120 125
Val Met Leu Lys Ile Thr Lys Pro Gly Ile Asn Asp Ala Gly Val Tyr
130 135 140
Val Leu Leu Val Arg Leu Asp His Ser Arg Ser Thr Asp Gly Phe Ile
145 150 155 160
Leu Gly Val Asn Val Tyr Thr Ala Gly Ser His His Asn Ile His Gly
165 170 175
Val Ile Tyr Thr Ser Pro Ser Leu Gln Asn Gly Tyr Ser Thr Arg Ala
180 185 190
Leu Phe Gln Gln Ala Arg Leu Cys Asp Leu Pro Ala Thr Pro Lys Gly
195 200 205
Ser Gly Thr Ser Leu Phe Gln His Met Leu Asp Leu Arg Ala Gly Lys
210 215 220
Ser Leu Glu Asp Asn Pro Trp Leu His Glu Asp Val Val Thr Thr Glu
225 230 235 240
Thr Lys Ser Val Val Lys Glu Gly Ile Glu Asn His Val Tyr Pro Thr
245 250 255
Asp Met Ser Thr Leu Pro Glu Lys Ser Leu Asn Asp Pro Pro Glu Asn
260 265 270
Leu Leu Ile Ile Ile Pro Ile Val Ala Ser Val Met Ile Leu Thr Ala
275 280 285
Met Val Ile Val Ile Val Ile Ser Val Lys Arg Arg Arg Ile Lys Lys
290 295 300
His Pro Ile Tyr Arg Pro Asn Thr Lys Thr Arg Arg Gly Ile Gln Asn
305 310 315 320
Ala Thr Pro Glu Ser Asp Val Met Leu Glu Ala Ala Ile Ala Gln Leu
325 330 335
Ala Thr Ile Arg Glu Glu Ser Pro Pro His Ser Val Val Asn Pro Phe
340 345 350
Val Lys
<210>41
<211>623
<212>PRT
<213>水痘带状疱疹
<400>41
Met Gly Thr Val Asn Lys Pro Val Val Gly Val Leu Met Gly Phe Gly
1 5 10 15
Ile Ile Thr Gly Thr Leu Arg Ile Thr Asn Pro Val Arg Ala Ser Val
20 25 30
Leu Arg Tyr Asp Asp Phe His Thr Asp Glu Asp Lys Leu Asp Thr Asn
35 40 45
Ser Val Tyr Glu Pro Tyr Tyr His Ser Asp His Ala Glu Ser Ser Trp
50 55 60
Val Asn Arg Gly Glu Ser Ser Arg Lys Ala Tyr Asp His Asn Ser Pro
65 70 75 80
Tyr Ile Trp Pro Arg Asn Asp Tyr Asp Gly Phe Leu Glu Asn Ala His
85 90 95
Glu His His Gly Val Tyr Asn Gln Gly Arg Gly Ile Asp Ser Gly Glu
100 105 110
Arg Leu Met Gln Pro Thr Gln Met Ser Ala Gln Glu Asp Leu Gly Asp
115 120 125
Asp Thr Gly Ile His Val Ile Pro Thr Leu Asn Gly Asp Asp Arg His
130 135 140
Lys Ile Val Asn Val Asp Gln Arg Gln Tyr Gly Asp Val Phe Lys Gly
145 150 155 160
Asp Leu Asn Pro Lys Pro Gln Gly Gln Arg Leu Ile Glu Val Ser Val
165 170 175
Glu Glu Asn His Pro Phe Thr Leu Arg Ala Pro Ile Gln Arg Ile Tyr
180 185 190
Gly Val Arg Tyr Thr Glu Thr Trp Ser Phe Leu Pro Ser Leu Thr Cys
195 200 205
Thr Gly Asp Ala Ala Pro Ala Ile Gln His Ile Cys Leu Lys His Thr
210 215 220
Thr Cys Phe Gln Asp Val Val Val Asp Val Asp Cys Ala Glu Asn Thr
225 230 235 240
Lys Glu Asp Gln Leu Ala Glu Ile Ser Tyr Arg Phe Gln Gly Lys Lys
245 250 255
Glu Ala Asp Gln Pro Trp Ile Val Val Asn Thr Ser Thr Leu Phe Asp
260 265 270
Glu Leu Glu Leu Asp Pro Pro Glu Ile Glu Pro Gly Val Leu Lys Val
275 280 285
Leu Arg Thr Glu Lys Gln Tyr Leu Gly Val Tyr Ile Trp Asn Met Arg
290 295 300
Gly Ser Asp Gly Thr Ser Thr Tyr Ala Thr Phe Leu Val Thr Trp Lys
305 310 315 320
Gly Asp Glu Lys Thr Arg Asn Pro Thr Pro Ala Val Thr Pro Gln Pro
325 330 335
Arg Gly Ala Glu Phe His Met Trp Asn Tyr His Ser His Val Phe Ser
340 345 350
Val Gly Asp Thr Phe Ser Leu Ala Met His Leu Gln Tyr Lys Ile His
355 360 365
Glu Ala Pro Phe Asp Leu Leu Leu Glu Trp Leu Tyr Val Pro Ile Asp
370 375 380
Pro Thr Cys Gln Pro Met Arg Leu Tyr Ser Thr Cys Leu Tyr His Pro
385 390 395 400
Asn Ala Pro Gln Cys Leu Ser His Met Asn Ser Gly Cys Thr Phe Thr
405 410 415
Ser Pro His Leu Ala Gln Arg Val Ala Ser Thr Val Tyr Gln Asn Cys
420 425 430
Glu His Ala Asp Asn Tyr Thr Ala Tyr Cys Leu Gly Ile Ser His Met
435 440 445
Glu Pro Ser Phe Gly Leu Ile Leu His Asp Gly Gly Thr Thr Leu Lys
450 455 460
Phe Val Asp Thr Pro Glu Ser Leu Ser Gly Leu Tyr Val Phe Val Val
465 470 475 480
Tyr Phe Asn Gly His Val Glu Ala Val Ala Tyr Thr Val Val Ser Thr
485 490 495
Val Asp His Phe Val Asn Ala Ile Glu Glu Arg Gly Phe Pro Pro Thr
500 505 510
Ala Gly Gln Pro Pro Ala Thr Thr Lys Pro Lys Glu Ile Thr Pro Val
515 520 525
Asn Pro Gly Thr Ser Pro Leu Leu Arg Tyr Ala Ala Trp Thr Gly Gly
530 535 540
Leu Ala Ala Val Val Leu Leu Cys Leu Val Ile Phe Leu Ile Cys Thr
545 550 555 560
Ala Lys Arg Met Arg Val Lys Ala Tyr Arg Val Asp Lys Ser Pro Tyr
565 570 575
Asn Gln Ser Met Tyr Tyr Ala Gly Leu Pro Val Asp Asp Phe Glu Asp
580 585 590
Ser Glu Ser Thr Asp Thr Glu Glu Glu Phe Gly Asn Ala Ile Gly Gly
595 600 605
Ser His Gly Gly Ser Ser Tyr Thr Val Tyr Ile Asp Lys Thr Arg
610 615 620
<210>42
<211>1310
<212>PRT
<213>水痘带状疱疹
<400>42
Met Asp Thr Pro Pro Met Gln Arg Ser Thr Pro Gln Arg Ala Gly Ser
1 5 10 15
Pro Asp Thr Leu Glu Leu Met Asp Leu Leu Asp Ala Ala Ala Ala Ala
20 25 30
Ala Glu His Arg Ala Arg Val Val Thr Ser Ser Gln Pro Asp Asp Leu
35 40 45
Leu Phe Gly Glu Asn Gly Val Met Val Gly Arg Glu His Glu Ile Val
50 55 60
Ser Ile Pro Ser Val Ser Gly Leu Gln Pro Glu Pro Arg Thr Glu Asp
65 70 75 80
Val Gly Glu Glu Leu Thr Gln Asp Asp Tyr Val Cys Glu Asp Gly Gln
85 90 95
Asp Leu Met Gly Ser Pro Val Ile Pro Leu Ala Glu Val Phe His Thr
100 105 110
Arg Phe Ser Glu Ala Gly Ala Arg Glu Pro Thr Gly Ala Asp Arg Ser
115 120 125
Leu Glu Thr Val Ser Leu Gly Thr Lys Leu Ala Arg Ser Pro Lys Pro
130 135 140
Pro Met Asn Asp Gly Glu Thr Gly Arg Gly Thr Thr Pro Pro Phe Pro
145 150 155 160
Gln Ala Phe Ser Pro Val Ser Pro Ala Ser Pro Val Gly Asp Ala Ala
165 170 175
Gly Asn Asp Gln Arg Glu Asp Gln Arg Ser Ile Pro Arg Gln Thr Thr
180 185 190
Arg Gly Asn Ser Pro Gly Leu Pro Ser Val Val His Arg Asp Arg Gln
195 200 205
Thr Gln Ser Ile Ser Gly Lys Lys Pro Gly Asp Glu Gln Ala Gly His
210 215 220
Ala His Ala Ser Gly Asp Gly Val Val Leu Gln Lys Thr Gln Arg Pro
225 230 235 240
Ala Gln Gly Lys Ser Pro Lys Lys Lys Thr Leu Lys Val Lys Val Pro
245 250 255
Leu Pro Ala Arg Lys Pro Gly Gly Pro Val Pro Gly Pro Val Glu Gln
260 265 270
Leu Tyr His Val Leu Ser Asp Ser Val Pro Ala Lys Gly Ala Lys Ala
275 280 285
Asp Leu Pro Phe Glu Thr Asp Asp Thr Arg Pro Arg Lys His Asp Ala
290 295 300
Arg Gly Ile Thr Pro Arg Val Pro Gly Arg Ser Ser Gly Gly Lys Pro
305 310 315 320
Arg Ala Phe Leu Ala Leu Pro Gly Arg Ser His Ala Pro Asp Pro Ile
325 330 335
Glu Asp Asp Ser Pro Val Glu Lys Lys Pro Lys Ser Arg Glu Phe Val
340 345 350
Ser Ser Ser Ser Ser Ser Ser Ser Trp Gly Ser Ser Ser Glu Asp Glu
355 360 365
Asp Asp Glu Pro Arg Arg Val Ser Val Gly Ser Glu Thr Thr Gly Ser
370 375 380
Arg Ser Gly Arg Glu His Ala Pro Ser Pro Ser Asn Ser Asp Asp Ser
385 390 395 400
Asp Ser Asn Asp Gly Gly Ser Thr Lys Gln Asn Ile Gln Pro Gly Tyr
405 410 415
Arg Ser Ile Ser Gly Pro Asp Pro Arg Ile Arg Lys Thr Lys Arg Leu
420 425 430
Ala Gly Glu Pro Gly Arg Gln Arg Gln Lys Ser Phe Ser Leu Pro Arg
435 440 445
Ser Arg Thr Pro Ile Ile Pro Pro Val Ser Gly Pro Leu Met Met Pro
450 455 460
Asp Gly Ser Pro Trp Pro Gly Ser Ala Pro Leu Pro Ser Asn Arg Val
465 470 475 480
Arg Phe Gly Pro Ser Gly Glu Thr Arg Glu Gly His Trp Glu Asp Glu
485 490 495
Ala Val Arg Ala Ala Arg Ala Arg Tyr Glu Ala Ser Thr Glu Pro Val
500 505 510
Pro Leu Tyr Val Pro Glu Leu Gly Asp Pro Ala Arg Gln Tyr Arg Ala
515 520 525
Leu Ile Asn Leu Ile Tyr Cys Pro Asp Arg Asp Pro Ile Ala Trp Leu
530 535 540
Gln Asn Pro Lys Leu Thr Gly Val Asn Ser Ala Leu Asn Gln Phe Tyr
545 550 555 560
Gln Lys Leu Leu Pro Pro Gly Arg Ala Gly Thr Ala Val Thr Gly Ser
565 570 575
Val Ala Ser Pro Val Pro His Val Gly Glu Ala Met Ala Thr Gly Glu
580 585 590
Ala Leu Trp Ala Leu Pro His Ala Ala Ala Ala Val Ala Met Ser Arg
595 600 605
Arg Tyr Asp Arg Ala Gln Lys His Phe Ile Leu Gln Ser Leu Arg Arg
610 615 620
Ala Phe Ala Ser Met Ala Tyr Pro Glu Ala Thr Gly Ser Ser Pro Ala
625 630 635 640
Ala Arg Ile Ser Arg Gly His Pro Ser Pro Thr Thr Pro Ala Thr Gln
645 650 655
Ala Pro Asp Pro Gln Pro Ser Ala Ala Ala Arg Ser Leu Ser Val Cys
660 665 670
Pro Pro Asp Asp Arg Leu Arg Thr Pro Arg Lys Arg Lys Ser Gln Pro
675 680 685
Val Glu Ser Arg Ser Leu Leu Asp Lys Ile Arg Glu Thr Pro Val Ala
690 695 700
Asp Ala Arg Val Ala Asp Asp His Val Val Ser Lys Ala Lys Arg Arg
705 710 715 720
Val Ser Glu Pro Val Thr Ile Thr Ser Gly Pro Val Val Asp Pro Pro
725 730 735
Ala Val Ile Thr Met Pro Leu Asp Gly Pro Ala Pro Asn Gly Gly Phe
740 745 750
Arg Arg Ile Pro Arg Gly Ala Leu His Thr Pro Val Pro Ser Asp Gln
755 760 765
Ala Arg Lys Ala Tyr Cys Thr Pro Glu Thr Ile Ala Arg Leu Val Asp
770 775 780
Asp Pro Leu Phe Pro Thr Ala Trp Arg Pro Ala Leu Ser Phe Asp Pro
785 790 795 800
Gly Ala Leu Ala Glu Ile Ala Ala Arg Arg Pro Gly Gly Gly Asp Arg
805 810 815
Arg Phe Gly Pro Pro Ser Gly Val Glu Ala Leu Arg Arg Arg Cys Ala
820 825 830
Trp Met Arg Gln Ile Pro Asp Pro Glu Asp Val Arg Leu Leu Ile Ile
835 840 845
Tyr Asp Pro Leu Pro Gly Glu Asp Ile Asn Gly Pro Leu Glu Ser Thr
850 855 860
Leu Ala Thr Asp Pro Gly Pro Ser Trp Ser Pro Ser Arg Gly Gly Leu
865 870 875 880
Ser Val Val Leu Ala Ala Leu Ser Asn Arg Leu Cys Leu Pro Ser Thr
885 890 895
His Ala Trp Ala Gly Asn Trp Thr Gly Pro Pro Asp Val Ser Ala Leu
900 905 910
Asn Ala Arg Gly Val Leu Leu Leu Ser Thr Arg Asp Leu Ala Phe Ala
915 920 925
Gly Ala Val Glu Tyr Leu Gly Ser Arg Leu Ala Ser Ala Arg Arg Arg
930 935 940
Leu Leu Val Leu Asp Ala Val Ala Leu Glu Arg Trp Pro Arg Asp Gly
945 950 955 960
Pro Ala Leu Ser Gln Tyr His Val Tyr Val Arg Ala Pro Ala Arg Pro
965 970 975
Asp Ala Gln Ala Val Val Arg Trp Pro Asp Ser Ala Val Thr Glu Gly
980 985 990
Leu Ala Arg Ala Val Phe Ala Ser Ser Arg Thr Phe Gly Pro Ala Ser
995 1000 1005
Phe Ala Arg Ile Glu Thr Ala Phe Ala Asn Leu Tyr Pro Gly Glu
1010 1015 1020
Gln Pro Leu Cys Leu Cys Arg Gly Gly Asn Val Ala Tyr Thr Val
1025 1030 1035
Cys Thr Arg Ala Gly Pro Lys Thr Arg Val Pro Leu Ser Pro Arg
1040 1045 1050
Glu Tyr Arg Gln Tyr Val Leu Pro Gly Phe Asp Gly Cys Lys Asp
1055 1060 1065
Leu Ala Arg Gln Ser Arg Gly Leu Gly Leu Gly Ala Ala Asp Phe
1070 1075 1080
Val Asp Glu Ala Ala His Ser His Arg Ala Ala Asn Arg Trp Gly
1085 1090 1095
Leu Gly Ala Ala Leu Arg Pro Val Phe Leu Pro Glu Gly Arg Arg
1100 1105 1110
Pro Gly Ala Ala Gly Pro Glu Ala Gly Asp Val Pro Thr Trp Ala
1115 1120 1125
Arg Val Phe Cys Arg His Ala Leu Leu Glu Pro Asp Pro Ala Ala
1130 1135 1140
Glu Pro Leu Val Leu Pro Pro Val Ala Gly Arg Ser Val Ala Leu
1145 1150 1155
Tyr Ala Ser Ala Asp Glu Ala Arg Asn Ala Leu Pro Pro Ile Pro
1160 1165 1170
Arg Val Met Trp Pro Pro Gly Phe Gly Ala Ala Glu Thr Val Leu
1175 1180 1185
Glu Gly Ser Asp Gly Thr Arg Phe Val Phe Gly His His Gly Gly
1190 1195 1200
Ser Glu Arg Pro Ser Glu Thr Gln Ala Gly Arg Gln Arg Arg Thr
1205 1210 1215
Ala Asp Asp Arg Glu His Ala Leu Glu Leu Asp Asp Trp Glu Val
1220 1225 1230
Gly Cys Glu Asp Ala Trp Asp Ser Glu Glu Gly Gly Gly Asp Asp
1235 1240 1245
Gly Asp Ala Pro Gly Ser Ser Phe Gly Val Ser Ile Val Ser Val
1250 1255 1260
Ala Pro Gly Val Leu Arg Asp Arg Arg Val Gly Leu Arg Pro Ala
1265 1270 1275
Val Lys Val Glu Leu Leu Ser Ser Ser Ser Ser Ser Glu Asp Glu
1280 1285 1290
Asp Asp Val Trp Gly Gly Arg Gly Gly Arg Ser Pro Pro Gln Ser
1295 1300 1305
Arg Gly
1310
<210>43
<211>1002
<212>DNA
<213>水痘带状疱疹
<220>
<221>CDS
<222>(1)..(1002)
<400>43
atg cat tta aag cct acc aga ttt ttc cac gca aac caa ccg cca atg 48
Met His Leu Lys Pro Thr Arg Phe Phe His Ala Asn Gln Pro Pro Met
1 5 10 15
ccg cat tca tac gag atg gag gac tta tgc ttc gac gac atg caa tat 96
Pro His Ser Tyr Glu Met Glu Asp Leu Cys Phe Asp Asp Met Gln Tyr
20 25 30
cgc tgg tct ccc tcg aac aca ccc tat cga agt atg tct agg cga tat 144
Arg Trp Ser Pro Ser Asn Thr Pro Tyr Arg Ser Met Ser Arg Arg Tyr
35 40 45
aaa tcc gta tct cgg agc ggg cct tcg atg cgt gta cgc tcc aga acg 192
Lys Ser Val Ser Arg Ser Gly Pro Ser Met Arg Val Arg Ser Arg Thr
50 55 60
cca tgc cgc cgt caa acc att cga gga aaa ctt atg tca aag gag cgg 240
Pro Cys Arg Arg Gln Thr Ile Arg Gly Lys Leu Met Ser Lys Glu Arg
65 70 75 80
tct gtg tac cgc cat tat ttt aat tac atc gca agg tcc ccc cca gaa 288
Ser Val Tyr Arg His Tyr Phe Asn Tyr Ile Ala Arg Ser Pro Pro Glu
85 90 95
gaa cta gct acc gtt aga ggc tta atc gtg cca att att aag acg acc 336
Glu Leu Ala Thr Val Arg Gly Leu Ile Val Pro Ile Ile Lys Thr Thr
100 105 110
cct gtc acc ctt ccg ttt aac ttg ggt cag aca gtg gcg gat aac tgc 384
Pro Val Thr Leu Pro Phe Asn Leu Gly Gln Thr Val Ala Asp Asn Cys
115 120 125
ctg tcg tta tcc gga atg ggt tat cat tta ggt ctc gga ggt tat tgt 432
Leu Ser Leu Ser Gly Met Gly Tyr His Leu Gly Leu Gly Gly Tyr Cys
130 135 140
ccg aca tgc act gca tct gga gaa ccg cgt cta tgt cga acc gat cgg 480
Pro Thr Cys Thr Ala Ser Gly Glu Pro Arg Leu Cys Arg Thr Asp Arg
145 150 155 160
gcg gct ctg ata cta gca tat gtt cag cag ctt aac aac ata tac gaa 528
Ala Ala Leu Ile Leu Ala Tyr Val Gln Gln Leu Asn Asn Ile Tyr Glu
165 170 175
tat cgt gtg ttt ctt gca tcc att ttg gcg cta tca gac cga gcc aac 576
Tyr Arg Val Phe Leu Ala Ser Ile Leu Ala Leu Ser Asp Arg Ala Asn
180 185 190
atg caa gca gcg tcc gct gaa ccc cta ttg tcg agc gta ttg gca caa 624
Met Gln Ala Ala Ser Ala Glu Pro Leu Leu Ser Ser Val Leu Ala Gln
195 200 205
ccg gaa tta ttt ttt atg tat cat att atg agg gag ggg ggc atg cga 672
Pro Glu Leu Phe Phe Met Tyr His Ile Met Arg Glu Gly Gly Met Arg
210 215 220
gat ata cgc gta ctt ttt tat cgt gat gga gat gcc gga ggg ttt atg 720
Asp Ile Arg Val Leu Phe Tyr Arg Asp Gly Asp Ala Gly Gly Phe Met
225 230 235 240
atg tat gtt ata ttt ccg ggg aaa tct gtt cac ctc cat tac aga cta 768
Met Tyr Val Ile Phe Pro Gly Lys Ser Val His Leu His Tyr Arg Leu
245 250 255
atc gat cat ata cag gcc gcg tgt cgg ggg tat aaa ata gtc gca cac 816
Ile Asp His Ile Gln Ala Ala Cys Arg Gly Tyr Lys Ile Val Ala His
260 265 270
gtt tgg cag aca aca ttt tta ctg tcg gta tgt cgc aac cca gaa caa 864
Val Trp Gln Thr Thr Phe Leu Leu Ser Val Cys Arg Asn Pro Glu Gln
275 280 285
caa aca gag act gtg gtg cca tcc att gga aca tcg gac gtt tac tgt 912
Gln Thr Glu Thr Val Val Pro Ser Ile Gly Thr Ser Asp Val Tyr Cys
290 295 300
aaa atg tgt gac ctt aac ttt gat gga gaa ttg ctt ttg gaa tac aaa 960
Lys Met Cys Asp Leu Asn Phe Asp Gly Glu Leu Leu Leu Glu Tyr Lys
305 310 315 320
aga ctc tac gca tta ttt gat gac ttt gtt cct cct cgg tga 1002
Arg Leu Tyr Ala Leu Phe Asp Asp Phe Val Pro Pro Arg
325 330
<210>44
<211>333
<212>PRT
<213>水痘带状疱疹
<400>44
Met His Leu Lys Pro Thr Arg Phe Phe His Ala Asn Gln Pro Pro Met
1 5 10 15
Pro His Ser Tyr Glu Met Glu Asp Leu Cys Phe Asp Asp Met Gln Tyr
20 25 30
Arg Trp Ser Pro Ser Asn Thr Pro Tyr Arg Ser Met Ser Arg Arg Tyr
35 40 45
Lys Ser Val Ser Arg Ser Gly Pro Ser Met Arg Val Arg Ser Arg Thr
50 55 60
Pro Cys Arg Arg Gln Thr Ile Arg Gly Lys Leu Met Ser Lys Glu Arg
65 70 75 80
Ser Val Tyr Arg His Tyr Phe Asn Tyr Ile Ala Arg Ser Pro Pro Glu
85 90 95
Glu Leu Ala Thr Val Arg Gly Leu Ile Val Pro Ile Ile Lys Thr Thr
100 105 110
Pro Val Thr Leu Pro Phe Asn Leu Gly Gln Thr Val Ala Asp Asn Cys
115 120 125
Leu Ser Leu Ser Gly Met Gly Tyr His Leu Gly Leu Gly Gly Tyr Cys
130 135 140
Pro Thr Cys Thr Ala Ser Gly Glu Pro Arg Leu Cys Arg Thr Asp Arg
145 150 155 160
Ala Ala Leu Ile Leu Ala Tyr Val Gln Gln Leu Asn Asn Ile Tyr Glu
165 170 175
Tyr Arg Val Phe Leu Ala Ser Ile Leu Ala Leu Ser Asp Arg Ala Asn
180 185 190
Met Gln Ala Ala Ser Ala Glu Pro Leu Leu Ser Ser Val Leu Ala Gln
195 200 205
Pro Glu Leu Phe Phe Met Tyr His Ile Met Arg Glu Gly Gly Met Arg
210 215 220
Asp Ile Arg Val Leu Phe Tyr Arg Asp Gly Asp Ala Gly Gly Phe Met
225 230 235 240
Met Tyr Val Ile Phe Pro Gly Lys Ser Val His Leu His Tyr Arg Leu
245 250 255
Ile Asp His Ile Gln Ala Ala Cys Arg Gly Tyr Lys Ile Val Ala His
260 265 270
Val Trp Gln Thr Thr Phe Leu Leu Ser Val Cys Arg Asn Pro Glu Gln
275 280 285
Gln Thr Glu Thr Val Val Pro Ser Ile Gly Thr Ser Asp Val Tyr Cys
290 295 300
Lys Met Cys Asp Leu Asn Phe Asp Gly Glu Leu Leu Leu Glu Tyr Lys
305 310 315 320
Arg Leu Tyr Ala Leu Phe Asp Asp Phe Val Pro Pro Arg
325 330
<210>45
<211>1533
<212>DNA
<213>水痘带状疱疹
<220>
<221>CDS
<222>(1)..(1533)
<400>45
atg gat gct gac gac aca ccc ccc aac ctc caa ata tct cca act gca 48
Met Asp Ala Asp Asp Thr Pro Pro Asn Leu Gln Ile Ser Pro Thr Ala
1 5 10 15
gga cct ttg cgt tcc cac cac aat acc gac gga cat gaa cca aat gca 96
Gly Pro Leu Arg Ser His His Asn Thr Asp Gly His Glu Pro Asn Ala
20 25 30
acc gca gcc gat cag caa gaa cga gaa tcc acc aac ccc aca cac gga 144
Thr Ala Ala Asp Gln Gln Glu Arg Glu Ser Thr Asn Pro Thr His Gly
35 40 45
tgt gta aat cat cca tgg gcc aat ccg tca act gca aca tgc atg gaa 192
Cys Val Asn His Pro Trp Ala Asn Pro Ser Thr Ala Thr Cys Met Glu
50 55 60
tca cca gaa cga tca caa cag aca agc tta ttt tta tta aag cac ggc 240
Ser Pro Glu Arg Ser Gln Gln Thr Ser Leu Phe Leu Leu Lys His Gly
65 70 75 80
tta acg aga gat cca ata cat caa cgc gaa agg gtg gac gtt ttt cca 288
Leu Thr Arg Asp Pro Ile His Gln Arg Glu Arg Val Asp Val Phe Pro
85 90 95
caa ttt aac aaa ccc cca tgg gtt ttt aga att tcc aaa tta tcc cgt 336
Gln Phe Asn Lys Pro Pro Trp Val Phe Arg Ile Ser Lys Leu Ser Arg
100 105 110
tta att gta ccc atc ttc acg ctc aat gaa cag tta tgt ttt tct aaa 384
Leu Ile Val Pro Ile Phe Thr Leu Asn Glu Gln Leu Cys Phe Ser Lys
115 120 125
tta cag att cga gat aga ccc agg ttt gcg gga cgg gga acg tat ggg 432
Leu Gln Ile Arg Asp Arg Pro Arg Phe Ala Gly Arg Gly Thr Tyr Gly
130 135 140
cgt gtt cat ata tac cca tcg tca aaa ata gct gta aaa acc atg gac 480
Arg Val His Ile Tyr Pro Ser Ser Lys Ile Ala Val Lys Thr Met Asp
145 150 155 160
agt cgt gtt ttt aat aga gag tta att aac gcg att tta gcg agt gag 528
Ser Arg Val Phe Asn Arg Glu Leu Ile Asn Ala Ile Leu Ala Ser Glu
165 170 175
ggt tct ata cga gca ggg gaa agg cta ggt att tct agc ata gtt tgc 576
Gly Ser Ile Arg Ala Gly Glu Arg Leu Gly Ile Ser Ser Ile Val Cys
180 185 190
ctt tta ggt ttt tcg tta caa acc aaa cag cta ctg ttt ccg gca tac 624
Leu Leu Gly Phe Ser Leu Gln Thr Lys Gln Leu Leu Phe Pro Ala Tyr
195 200 205
gac atg gat atg gat gaa tac att gtt cgc ctg tcc aga cgg ttg aca 672
Asp Met Asp Met Asp Glu Tyr Ile Val Arg Leu Ser Arg Arg Leu Thr
210 215 220
ata cct gat cac ata gac aga aaa att gcc cat gta ttt tta gat ttg 720
Ile Pro Asp His Ile Asp Arg Lys Ile Ala His Val Phe Leu Asp Leu
225 230 235 240
gct caa gcg ttg acg ttt tta aat cga acg tgc ggc ctg acc cac cta 768
Ala Gln Ala Leu Thr Phe Leu Asn Arg Thr Cys Gly Leu Thr His Leu
245 250 255
gat gtg aaa tgt ggc aat att ttt ctt aac gtc gac aac ttt gcc tcg 816
Asp Val Lys Cys Gly Asn Ile Phe Leu Asn Val Asp Asn Phe Ala Ser
260 265 270
ttg gaa ata acc aca gca gta atc gga gac tat agc cta gta aca tta 864
Leu Glu Ile Thr Thr Ala Val Ile Gly Asp Tyr Ser Leu Val Thr Leu
275 280 285
aat acg tat tcc ctt tgt act cga gcg ata ttt gaa gtt gga aat cca 912
Asn Thr Tyr Ser Leu Cys Thr Arg Ala Ile Phe Glu Val Gly Asn Pro
290 295 300
tcc cac ccg gag cac gta cta cgc gta ccc cgg gat gca tcg cag atg 960
Ser His Pro Glu His Val Leu Arg Val Pro Arg Asp Ala Ser Gln Met
305 310 315 320
tca ttt cgt ttg gtg ttg agt cat gga aca aac caa ccc cct gaa atc 1008
Ser Phe Arg Leu Val Leu Ser His Gly Thr Asn Gln Pro Pro Glu Ile
325 330 335
ttg ctt gat tat att aat gga acg ggc ctt act aaa tat act gga acc 1056
Leu Leu Asp Tyr Ile Asn Gly Thr Gly Leu Thr Lys Tyr Thr Gly Thr
340 345 350
ttg ccc caa aga gtt gga ctt gcg att gat ctt tat gca ttg ggc caa 1104
Leu Pro Gln Arg Val Gly Leu Ala Ile Asp Leu Tyr Ala Leu Gly Gln
355 360 365
gca ctc tta gaa gtt atc ctg cta gga cgt ctt ccc gga caa ctg ccc 1152
Ala Leu Leu Glu Val Ile Leu Leu Gly Arg Leu Pro Gly Gln Leu Pro
370 375 380
att tca gta cat cgg acc ccg cat tat cac tac tac ggt cat aag tta 1200
Ile Ser Val His Arg Thr Pro His Tyr His Tyr Tyr Gly His Lys Leu
385 390 395 400
tca cca gat ttg gcg ctt gat acg ctg gca tat cga tgt gtc ctg gcg 1248
Ser Pro Asp Leu Ala Leu Asp Thr Leu Ala Tyr Arg Cys Val Leu Ala
405 410 415
cca tat ata ctc cca tct gac atc ccc ggg gac tta aat tat aat ccc 1296
Pro Tyr Ile Leu Pro Ser Asp Ile Pro Gly Asp Leu Asn Tyr Asn Pro
420 425 430
ttt ata cac gcc gga gag ctg aac acc cgt att tcc cgg aat tct tta 1344
Phe Ile His Ala Gly Glu Leu Asn Thr Arg Ile Ser Arg Asn Ser Leu
435 440 445
cgc cgg ata ttc cag tgt cac gca gtg cgt tac ggc gta acg cac tca 1392
Arg Arg Ile Phe Gln Cys His Ala Val Arg Tyr Gly Val Thr His Ser
450 455 460
aag ctt ttc gaa ggc ata cgc att ccg gcc tca tta tac cca gcc act 1440
Lys Leu Phe Glu Gly Ile Arg Ile Pro Ala Ser Leu Tyr Pro Ala Thr
465 470 475 480
gtt gtt aca tcg ttg ttg tgt cac gat aat tca gaa ata cgc tcg gat 1488
Val Val Thr Ser Leu Leu Cys His Asp Asn Ser Glu Ile Arg Ser Asp
485 490 495
cac cct tta tta tgg cac gat cgg gat tgg ata gga tcg aca taa 1533
His Pro Leu Leu Trp His Asp Arg Asp Trp Ile Gly Ser Thr
500 505 510
<210>46
<211>510
<212>PRT
<213>水痘带状疱疹
<400>46
Met Asp Ala Asp Asp Thr Pro Pro Asn Leu Gln Ile Ser Pro Thr Ala
1 5 10 15
Gly Pro Leu Arg Ser His His Asn Thr Asp Gly His Glu Pro Asn Ala
20 25 30
Thr Ala Ala Asp Gln Gln Glu Arg Glu Ser Thr Asn Pro Thr His Gly
35 40 45
Cys Val Asn His Pro Trp Ala Asn Pro Ser Thr Ala Thr Cys Met Glu
50 55 60
Ser Pro Glu Arg Ser Gln Gln Thr Ser Leu Phe Leu Leu Lys His Gly
65 70 75 80
Leu Thr Arg Asp Pro Ile His Gln Arg Glu Arg Val Asp Val Phe Pro
85 90 95
Gln Phe Asn Lys Pro Pro Trp Val Phe Arg Ile Ser Lys Leu Ser Arg
100 105 110
Leu Ile Val Pro Ile Phe Thr Leu Asn Glu Gln Leu Cys Phe Ser Lys
115 120 125
Leu Gln Ile Arg Asp Arg Pro Arg Phe Ala Gly Arg Gly Thr Tyr Gly
130 135 140
Arg Val His Ile Tyr Pro Ser Ser Lys Ile Ala Val Lys Thr Met Asp
145 150 155 160
Ser Arg Val Phe Asn Arg Glu Leu Ile Asn Ala Ile Leu Ala Ser Glu
165 170 175
Gly Ser Ile Arg Ala Gly Glu Arg Leu Gly Ile Ser Ser Ile Val Cys
180 185 190
Leu Leu Gly Phe Ser Leu Gln Thr Lys Gln Leu Leu Phe Pro Ala Tyr
195 200 205
Asp Met Asp Met Asp Glu Tyr Ile Val Arg Leu Ser Arg Arg Leu Thr
210 215 220
Ile Pro Asp His Ile Asp Arg Lys Ile Ala His Val Phe Leu Asp Leu
225 230 235 240
Ala Gln Ala Leu Thr Phe Leu Asn Arg Thr Cys Gly Leu Thr His Leu
245 250 255
Asp Val Lys Cys Gly Asn Ile Phe Leu Asn Val Asp Asn Phe Ala Ser
260 265 270
Leu Glu Ile Thr Thr Ala Val Ile Gly Asp Tyr Ser Leu Val Thr Leu
275 280 285
Asn Thr Tyr Ser Leu Cys Thr Arg Ala Ile Phe Glu Val Gly Asn Pro
290 295 300
Ser His Pro Glu His Val Leu Arg Val Pro Arg Asp Ala Ser Gln Met
305 310 315 320
Ser Phe Arg Leu Val Leu Ser His Gly Thr Asn Gln Pro Pro Glu Ile
325 330 335
Leu Leu Asp Tyr Ile Asn Gly Thr Gly Leu Thr Lys Tyr Thr Gly Thr
340 345 350
Leu Pro Gln Arg Val Gly Leu Ala Ile Asp Leu Tyr Ala Leu Gly Gln
355 360 365
Ala Leu Leu Glu Val Ile Leu Leu Gly Arg Leu Pro Gly Gln Leu Pro
370 375 380
Ile Ser Val His Arg Thr Pro His Tyr His Tyr Tyr Gly His Lys Leu
385 390 395 400
Ser Pro Asp Leu Ala Leu Asp Thr Leu Ala Tyr Arg Cys Val Leu Ala
405 410 415
Pro Tyr Ile Leu Pro Ser Asp Ile Pro Gly Asp Leu Asn Tyr Asn Pro
420 425 430
Phe Ile His Ala Gly Glu Leu Asn Thr Arg Ile Ser Arg Asn Ser Leu
435 440 445
Arg Arg Ile Phe Gln Cys His Ala Val Arg Tyr Gly Val Thr His Ser
450 455 460
Lys Leu Phe Glu Gly Ile Arg Ile Pro Ala Ser Leu Tyr Pro Ala Thr
465 470 475 480
Val Val Thr Ser Leu Leu Cys His Asp Asn Ser Glu Ile Arg Ser Asp
485 490 495
His Pro Leu Leu Trp His Asp Arg Asp Trp Ile Gly Ser Thr
500 505 510
<210>47
<211>246
<212>DNA
<213>水痘带状疱疹
<220>
<221>CDS
<222>(1)..(246)
<400>47
atg gga caa tct tca tcc agc ggt cga gga gga atc tgt gga ttg tgc 48
Met Gly Gln Ser Ser Ser Ser Gly Arg Gly Gly Ile Cys Gly Leu Cys
1 5 10 15
aaa cgg tac aac gag ctt gtt acg tgc aac gga gaa acc gtt gct ttg 96
Lys Arg Tyr Asn Glu Leu Val Thr Cys Asn Gly Glu Thr Val Ala Leu
20 25 30
aac tca gag ttc ttt gaa gac ttt gac ttt gat gag aat gta aca gag 144
Asn Ser Glu Phe Phe Glu Asp Phe Asp Phe Asp Glu Asn Val Thr Glu
35 40 45
gac gcc gat aaa tcc aca caa cgc cgc cca cga gtg atc gat gta aca 192
Asp Ala Asp Lys Ser Thr Gln Arg Arg Pro Arg Val Ile Asp Val Thr
50 55 60
cca aaa cga aaa cct tcg gga aag agc tcc cat tcc aaa tgc gca aaa 240
Pro Lys Arg Lys Pro Ser Gly Lys Ser Ser His Ser Lys Cys Ala Lys
65 70 75 80
tgt taa 246
Cys
<210>48
<211>81
<212>PRT
<213>水痘带状疱疹
<400>48
Met Gly Gln Ser Ser Ser Ser Gly Arg Gly Gly Ile Cys Gly Leu Cys
1 5 10 15
Lys Arg Tyr Asn Glu Leu Val Thr Cys Asn Gly Glu Thr Val Ala Leu
20 25 30
Asn Ser Glu Phe Phe Glu Asp Phe Asp Phe Asp Glu Asn Val Thr Glu
35 40 45
Asp Ala Asp Lys Ser Thr Gln Arg Arg Pro Arg Val Ile Asp Val Thr
50 55 60
Pro Lys Arg Lys Pro Ser Gly Lys Ser Ser His Ser Lys Cys Ala Lys
65 70 75 80
Cys
<210>49
<211>735
<212>DNA
<213>水痘带状疱疹
<220>
<221>CDS
<222>(1)..(735)
<400>49
atg aaa aat ccg cag aaa tta gcg atc aca ttc ttg ccg ctc tac gtg 48
Met Lys Ash Pro Gln Lys Leu Ala Ile Thr Phe Leu Pro Leu Tyr Val
1 5 10 15
atc cca acg tac acg ttg tgt att aaa gca ttg tat aaa aac acg cat 96
Ile Pro Thr Tyr Thr Leu Cys Ile Lys Ala Leu Tyr Lys Asn Thr His
20 25 30
gcg ggc ttg ctg ttc tca ttt cta ggt ttt gtc tta aat aca ccc gcc 144
Ala Gly Leu Leu Phe Set Phe Leu Gly Phe Val Leu Asn Thr Pro Ala
35 40 45
atg agc atc tct gga ccc cca acg acg ttt att tta tat agg tta cat 192
Met Ser Ile Ser Gly Pro Pro Thr Thr Phe Ile Leu Tyr Arg Leu His
50 55 60
ggg gtt agg cgg gtt ctt cac tgg act tta ccg gat cat gaa caa aca 240
Gly Val Arg Arg Val Leu His Trp Thr Leu Pro Asp His Glu Gln Thr
65 70 75 80
ctc tac gca ttt acg ggt ggg tca aga tca atg gcg gtg aag acg gac 288
Leu Tyr Ala Phe Thr Gly Gly Ser Arg Ser Met Ala Val Lys Thr Asp
85 90 95
gct cga tgt gat aca atg agc ggt ggt atg atc gtc ctt caa cac acc 336
Ala Arg Cys Asp Thr Met Ser Gly Gly Met Ile Val Leu Gln His Thr
100 105 110
cat aca gtg acc ctg cta acc ata gac tgt tct act gac ttt tca tca 384
His Thr Val Thr Leu Leu Thr Ile Asp Cys Ser Thr Asp Phe Ser Ser
115 120 125
tac gca ttt acg cac cgg gat ttc cac tta cag gac aaa ccc cac gca 432
Tyr Ala Phe Thr His Arg Asp Phe His Leu Gln Asp Lys Pro His Ala
130 135 140
aca ttt gcg atg ccg ttt atg tcc tgg gtc ggt tct gac cca aca tct 480
Thr Phe Ala Met Pro Phe Met Ser Trp Val Gly Ser Asp Pro Thr Ser
145 150 155 160
cag ctg tac agt aat gtg ggg ggg gta cta tcc gta ata acg gaa gat 528
Gln Leu Tyr Ser Asn Val Gly Gly Val Leu Ser Val Ile Thr Glu Asp
165 170 175
gac cta tcc atg tgt atc tca att gtt ata tac ggt tta cgg gta aac 576
Asp Leu Ser Met Cys Ile Ser Ile Val Ile Tyr Gly Leu Arg Val Asn
180 185 190
aga cct gac gat cag acc aca cca aca cca acc ccg cac cag tat aca 624
Arg Pro Asp Asp Gln Thr Thr Pro Thr Pro Thr Pro His Gln Tyr Thr
195 200 205
tcg caa agg cgg cag cct gaa acc aac tgt cct tct tca cca caa ccg 672
Ser Gln Arg Arg Gln Pro Glu Thr Asn Cys Pro Ser Ser Pro Gln Pro
210 215 220
gcc ttt ttc aca tca gac gac gac gtt ctt tcg tta ata tta cgg gac 720
Ala Phe Phe Thr Ser Asp Asp Asp Val Leu Ser Leu Ile Leu Arg Asp
225 230 235 240
gcc gca aac gcg taa 735
Ala Ala Asn Ala
<210>50
<211>244
<212>PRT
<213>水痘带状疱疹
<400>50
Met Lys Asn Pro Gln Lys Leu Ala Ile Thr Phe Leu Pro Leu Tyr Val
1 5 10 15
Ile Pro Thr Tyr Thr Leu Cys Ile Lys Ala Leu Tyr Lys Asn Thr His
20 25 30
Ala Gly Leu Leu Phe Ser Phe Leu Gly Phe Val Leu Asn Thr Pro Ala
35 40 45
Met Ser Ile Ser Gly Pro Pro Thr Thr Phe Ile Leu Tyr Arg Leu His
50 55 60
Gly Val Arg Arg Val Leu His Trp Thr Leu Pro Asp His Glu Gln Thr
65 70 75 80
Leu Tyr Ala Phe Thr Gly Gly Ser Arg Ser Met Ala Val Lys Thr Asp
85 90 95
Ala Arg Cys Asp Thr Met Ser Gly Gly Met Ile Val Leu Gln His Thr
100 105 110
His Thr Val Thr Leu Leu Thr Ile Asp Cys Ser Thr Asp Phe Ser Ser
115 120 125
Tyr Ala Phe Thr His Arg Asp Phe His Leu Gln Asp Lys Pro His Ala
130 135 140
Thr Phe Ala Met Pro Phe Met Ser Trp Val Gly Ser Asp Pro Thr Ser
145 150 155 160
Gln Leu Tyr Ser Asn Val Gly Gly Val Leu Ser Val Ile Thr Glu Asp
165 170 175
Asp Leu Ser Met Cys Ile Ser Ile Val Ile Tyr Gly Leu Arg Val Asn
180 185 190
Arg Pro Asp Asp Gln Thr Thr Pro Thr Pro Thr Pro His Gln Tyr Thr
195 200 205
Ser Gln Arg Arg Gln Pro Glu Thr Asn Cys Pro Ser Ser Pro Gln Pro
210 215 220
Ala Phe Phe Thr Ser Asp Asp Asp Val Leu Ser Leu Ile Leu Arg Asp
225 230 235 240
Ala Ala Asn Ala
<210>51
<211>124884
<212>DNA
<213>水痘带状疱疹
<220>
<221>CDS
<222>(5569)..(6405)
<220>
<221>CDS
<222>(6553)..(7095)
<220>
<221>CDS
<222>(12245)..(12553)
<220>
<221>CDS
<222>(15752)..(19684)
<220>
<221>CDS
<222>(20400)..(21803)
<220>
<221>CDS
<222>(23666)..(24583)
<220>
<221>CDS
<222>(25259)..(25474)
<220>
<221>CDS
<222>(31035)..(32030)
<220>
<221>CDS
<222>(54592)..(56217)
<220>
<221>CDS
<222>(60132)..(60908)
<220>
<221>CDS
<222>(60975)..(62714)
<220>
<221>CDS
<222>(62747)..(64564)
<220>
<221>CDS
<222>(74249)..(77833)
<220>
<221>CDS
<222>(80267)..(80737)
<220>
<221>CDS
<222>(80864)..(81673)
<220>
<221>CDS
<222>(81747)..(82454)
<220>
<221>CDS
<222>(94410)..(95861)
<220>
<221>CDS
<222>(96040)..(98367)
<220>
<221>CDS
<222>(98392)..(99312)
<220>
<221>CDS
<222>(101091)..(102317)
<220>
<221>CDS
<222>(102407)..(103627)
<220>
<221>CDS
<222>(103772)..(105454)
<220>
<221>CDS
<222>(114218)..(115408)
<220>
<221>CDS
<222>(116308)..(119559)
<220>
<221>CDS
<222>(119611)..(120633)
<220>
<221>CDS
<222>(120744)..(122102)
<220>
<221>CDS
<222>(122438)..(122977)
<220>
<221>CDS
<222>(123970)..(124296)
<400>51
cctctcccgg ggtccgccgg gcgcccagaa accggggggg ggttattttc gggggggggt 60
ccgaccagcc cgcccgtcgc ccgcccgcac agacagacag acactttttt cataaaaacc 120
gttccgcttt tattaacaac aaacagtccg cgcgccagtg gcgctcacga gaaaaggagg 180
ggactccgtc acccccgact ctgcgggggg ctcctccccc cgcgccctcc ccacacatcg 240
tcctcgtcct cggaggacga ggacgaggac aacagctcca ccttgaccgc cgggcgcaaa 300
cccacccggc ggtctcgcag cacacccggg gccaccgaca cgatgctcac cccaaaggat 360
gaccccggtg cgtccccgtc gtccccgccc ccctcctcgc tgtcccacgc gtcttcacac 420
cccacctccc aatcgtccag ctccaaagcg tgttctctgt cgtctgcggt gcgccgctgt 480
cgccccgcct gggtttctga cggccgttcc gagcccccgt ggtgtccgaa cacgaaccgt 540
gttccgtcgc tcccctccaa caccgtctcc gcggccccaa aaccgggcgg ccacattact 600
ctgggaatcg gggggagggc attccgagcc tcgtccgccg acgcatacag cgccaccgac 660
cgaccggcca cgggtggaag cacgagtggt tctgcggcag ggtcgggttc cagcagggcg 720
tggcggcaaa acaccctcgc ccaggtgggt acgtcgccgg cctccggccc ggcggccccc 780
ggtctccgtc cctcgggaag gaagacgggt cgaagcgcgg cacccaggcc ccatcggttt 840
gctgcgcggt ggctatgtgc cgcctcgtcc acaaagtcgg ctgccccgag ccccagaccc 900
cgagactgtc gcgcgaggtc cttgcaaccg tcaaaacccg gcagcacgta ctgccggtat 960
tcacggggcg acagggggac gcgggtcttg gggcccgcgc gggtacacac ggtgtatgcg 1020
acgttcccac cgcggcacaa acacaggggt tgttcgcccg ggtacaggtt ggcaaacgca 1080
gtctcgatac gagcaaaact cgctggccca aaggtgcgcg acgatgcaaa cacggcccgg 1140
gcgagtcctt ctgtgaccgc cgagtctggc catcggacga cggcctgggc gtccggtcgc 1200
gccggggccc ggacgtacac gtgatactga gacaaagcgg gtccatccct gggccacctc 1260
tcgagggcca ccgcgtccaa caccagcaac cggcgccggg cagaggccaa ccgcgagcct 1320
agatactcga cggccccggc aaaggccagg tctcgggtcg acagtaataa aacgccccgg 1380
gcgttcaaag cggacacgtc cggcgggccg gtccagttcc cggcccaggc atgagtgctc 1440
ggcaggcaca accggttact cagggctgcc aggaccacag acagtccccc tcgggatgga 1500
ctccatgacg gtcccggatc tgtcgcgagg gtgctctcga gggggccgtt gatgtcctct 1560
ccgggcaacg gatcgtagat gatcagaagc ctcacatcct ccgggtctgg gatctgccgc 1620
atccaggcgc acctccgtcg cagcgcctcc actccgctgg gtggaccaaa ccgtcggtct 1680
cctccgcccg gacgccgagc ggcgatttcc gccaaggcgc cgggatcaaa gcttagcgca 1740
gggcgccagg ccgtgggaaa caatgggtcg tcgaccagac gggcgatggt ttcgggggta 1800
cagtacgcct tgcgagcctg gtccgacggg accggggtat gcagggcccc ccggggaata 1860
cgccgaaatc ccccgtttgg ggccggtccg tcaagtggca tcgttattac ggcgggggga 1920
tccaccacag ggcccgaggt gatggtcacg ggctcggata cccgcctctt ggccttggaa 1980
accacatgat cgtctgcaac ccgggcgtcc gcgacgggtg tctccctaat cttgtcgagg 2040
aggcttctgc tctcgactgg ctgggacttg cgcttgcgcg gagttcgtaa acgatcatcc 2100
ggtggacaca cagaaagaga gcgtgcggcg gccgacggct gagggtcggg agcctgtgtg 2160
gccggggttg ttggagaagg gtgaccgcgg gagatccgcg ccgccggact ggagcccgtt 2220
gcctcggggt atgccatgct ggcaaaggct ctgcggagac tctgtaggat aaagtgtttt 2280
tgggcccggt cgtatcgacg gctcatagcc acggccgcgg ccgcgtgggg gagagcccag 2340
agggcctccc ccgtggccat ggcttcgcct acatgcggaa cgggagacgc tacgctcccc 2400
gtaacggcgg tacccgcccg tcccggtggc aacagctttt ggtagaactg gttcagggcc 2460
gagttgacac cggtcagctt ggggttctgg agccatgcta tagggtctct gtctggacag 2520
tagatcaggt taatcagcgc gcggtactgt ctagccggat ctcccaactc cggcacgtaa 2580
agcggcacgg gttccgttga ggcctcgtaa cgagcccgcg ccgctctcac agcctcatcc 2640
tcccagtgac cctctctggt ctccccggac ggtccaaacc gcaccctgtt ggatgggagg 2700
ggtgccgatc cgggccaagg gcttccgtcg ggcatcatga gcggccccga caccggggga 2760
attatcgggg ttctggatcg cggcagggaa aatgatttct gtctctggcg ccccggttcc 2820
cccgcaagac gtttggtctt acgaatcctc ggatcgggac cgctgatgga tcgatatccc 2880
ggttggatat tttgtttcgt cgacccacca tcatttgagt ccgaatcatc cgaatttgac 2940
ggggaagggg cgtgttcgcg tccggacctg ctgcctgtag tttcacttcc caccgaaacg 3000
cgccggggtt catcgtcttc atcctccgat gacgatcccc acgacgagga agaggatgaa 3060
gacgaaacaa actcacgact ctttggcttt ttctccactg ggctgtcatc ctcaatcggg 3120
tctggtgcgt gggatcttcc cggcagggcc aaaaacgctc taggtttgcc ccccgacgaa 3180
cgtccaggga cgcgaggtgt tataccccgg gcatcatgtt tccttgggcg ggtatcatcg 3240
gtctcaaacg gcaggtccgc ctttgccccc ttagcgggaa cgctgtccga aaggacgtgg 3300
tacaattgct caaccgggcc gggtacaggt ccaccgggtt tccgcgccgg gagtgggacc 3360
ttaaccttca aagtcttttt cttcgggctc tttccctgag cgggccgttg agttttctgg 3420
agaactactc cgtcccccga tgcatgcgca tgacccgctt gctcatcgcc cggcttttta 3480
cccgagatgg actgagtttg tctgtctcga tggaccaccg acggcaaacc tggtgaattt 3540
cctctcgtcg tttgtcgggg tatagaccgc tggtcttccc gttgatcgtt cccggcggcg 3600
tctccaacag gagacgcggg ggatacaggg gagaaggcct gcgggaacgg aggggtcgta 3660
cctctgcccg tttccccatc gttcatcggt ggttttggag acctagcaag cttcgttccg 3720
agagagactg tctcaaggga gcgatcggct cctgttggtt ctcgcgcgcc ggcctccgag 3780
aatcgggtgt ggaagacctc ggccagcggg attacaggcg agcccattag atcctgaccg 3840
tcctcgcata cgtagtcgtc ttgtgttagc tcttcgccaa catcttccgt tctgggttct 3900
ggttgaagtc ccgatacgga gggaattgaa acgatctcgt gttcccgtcc caccatgacc 3960
ccgttctctc caaatagtag atcgtcaggc tgactcgagg tgaccacccg ggccctgtgt 4020
tcggcggccg ccgcggccgc gtccaacagg tccattaact ccaaagtatc aggcgacccc 4080
gcgcgttggg gtgtagagcg ctgcatcggc ggcgtatcca tcgcactggg gtgaatttag 4140
acgtacccga gttttccaaa cgctctcgca gccttcaaag gattgcgatt gcggttggtg 4200
agggagttcc aacagtactt aaaacgtgtt gtgccccccc ctcgaccgca tatttcctcc 4260
ccgtgtcgtc accgtgtaaa tattcttaat gataagacga tgtagtgatt ggacgagact 4320
cgaggcggga agttcatgga ccatagtatg cgtttaagga gagaccgctg gttggcgatg 4380
tacgcccggt gtctatttcc gcatacctta caacatcata acaagggata ccagacatgt 4440
gaatttcatt tacatatgtt taaataacaa ccaatcatcg tgtgtctaca gacgatatat 4500
aatatacata aacacaattg gggttgtctc acatgcaaaa catcttatat aacacgggtt 4560
gtttccaccc atccggcatc tagttaatca aatgcacgtc gacggtgtgt ttgggtccct 4620
ctccgtcgtc attacgttcg cgcaatcaac aagcgtatac accaccaccc ctcccaacga 4680
ttatgtcagg cggcacgaag cccgcgataa cccataaaat acacacgggg ttgtggtgtt 4740
cacgtaaccc cccgccgatg gggagggggc gcggtacccc gccgatgggg agggggcgcg 4800
gtaccccgcc gatggggagg gggcgcggta ccccgccgat ggggaggggg cgcggtaccc 4860
cgccgatggg gagggggcgc ggtaccccgc cgatgtttat aaccataatt ctctaaaccg 4920
ttgtagaaaa tcacaaaaaa atttattcaa aaacaagtcg aagaacttca tatctgaggc 4980
atgtaaaccc gttcgcactt cctggggtgg aatggggtgg ggtggggggg tgaaaaaggg 5040
ggggggttaa attgggcgtc cgcatgtctg tggtgtacgc caatcggata cactcttttg 5100
atctgcattc gcacttcccg ttttttcact gtatgggttt tcatgttttg gcatgtgtcc 5160
aaccaccgtt cgcactttct ttctatatat atatatatat atatatatat atatagagaa 5220
agagagagag tttcttgttc gcgcgtgttc ccgcgatgtc gcggttttat ggggtgtggg 5280
cgggcttttc acagaatata tatattccaa atggagcggc aggcttttta aaatcgattt 5340
gacgtgataa aaaaaaacac acggggcccc cccctttttt tggtgttata aaggcaaccc 5400
aatcgaaggt ctcccgcccc ggaatccccc attgccattt tacccaagta gccttattca 5460
tagatgtaaa cgtttgggtg tgtgttttgt tgtgcagggt tcgtccgatt cataacgcga 5520
cagcgtcgag tcggttttaa gggaaaaggt tactacggcc ccaaggac atg ttt tgc 5577
Met Phe Cys
1
acc tca ccg gct acg cgg ggc gac tcg tcc gag tca aaa ccc ggg gca 5625
Thr Ser Pro Ala Thr Arg Gly Asp Ser Ser Glu Ser Lys Pro Gly Ala
5 10 15
tcg gtt gat gtt aac gga aag atg gaa tat gga tct gca cca gga ccc 5673
Ser Val Asp Val Asn Gly Lys Met Glu Tyr Gly Ser Ala Pro Gly Pro
20 25 30 35
ctg aac ggc cgg gat acg tcg cgg ggc ccc ggc gcg ttt tgt act ccg 5721
Leu Asn Gly Arg Asp Thr Ser Arg Gly Pro Gly Ala Phe Cys Thr Pro
40 45 50
ggt tgg gag atc cac ccg gcc agg ctc gtt gag gac atc aac cgt gtt 5769
Gly Trp Glu Ile His Pro Ala Arg Leu Val Glu Asp Ile Asn Arg Val
55 60 65
ttt tta tgt att gca cag tcg tcg gga cgc gtc acg cga gat tca cga 5817
Phe Leu Cys Ile Ala Gln Ser Ser Gly Arg Val Thr Arg Asp Ser Arg
70 75 80
aga ttg cgg cgc ata tgc ctc gac ttt tat cta atg ggt cgc acc aga 5865
Arg Leu Arg Arg Ile Cys Leu Asp Phe Tyr Leu Met Gly Arg Thr Arg
85 90 95
cag cgt ccc acg tta gcg tgc tgg gag gaa ttg tta cag ctt caa ccc 5913
Gln Arg Pro Thr Leu Ala Cys Trp Glu Glu Leu Leu Gln Leu Gln Pro
100 105 110 115
acc cag acg cag tgc tta cgc gct act tta atg gaa gtg tcc cat cga 5961
Thr Gln Thr Gln Cys Leu Arg Ala Thr Leu Met Glu Val Ser His Arg
120 125 130
ccc cct cgg ggg gaa gac ggg ttc att gag gcg ccg aat gtt cct ttg 6009
Pro Pro Arg Gly Glu Asp Gly Phe Ile Glu Ala Pro Asn Val Pro Leu
135 140 145
cat agg agc gca ctg gaa tgt gac gta tct gat gat ggt ggt gaa gac 6057
His Arg Ser Ala Leu Glu Cys Asp Val Ser Asp Asp Gly Gly Glu Asp
150 155 160
gat agc gac gat gat ggg tct acg cca tcg gat gta att gaa ttt cgg 6105
Asp Ser Asp Asp Asp Gly Ser Thr Pro Ser Asp Val Ile Glu Phe Arg
165 170 175
gat tcc gac gcg gaa tca tcg gac ggg gaa gac ttt ata gtg gaa gaa 6153
Asp Ser Asp Ala Glu Ser Ser Asp Gly Glu Asp Phe Ile Val Glu Glu
180 185 190 195
gaa tca gag gag agc acc gat tct tgt gaa cca gac ggg gta ccc ggc 6201
Glu Ser Glu Glu Ser Thr Asp Ser Cys Glu Pro Asp Gly Val Pro Gly
200 205 210
gat tgt tat cga gac ggg gat ggg tgc aac acc ccg tcc cca aag aga 6249
Asp Cys Tyr Arg Asp Gly Asp Gly Cys Asn Thr Pro Ser Pro Lys Arg
215 220 225
ccc cag cgt gcc atc gag cga tac gcg ggt gca gaa acc gcg gaa tat 6297
Pro Gln Arg Ala Ile Glu Arg Tyr Ala Gly Ala Glu Thr Ala Glu Tyr
230 235 240
aca gcc gcg aaa gcg ctc acc gcg ttg ggc gag ggg ggt gta gat tgg 6345
Thr Ala Ala Lys Ala Leu Thr Ala Leu Gly Glu Gly Gly Val Asp Trp
245 250 255
aag cga cgt cga cac gaa gcc ccg cgc cgg cat gat ata ccg ccc ccc 6393
Lys Arg Arg Arg His Glu Ala Pro Arg Arg His Asp Ile Pro Pro Pro
260 265 270 275
cat ggc gtg tag tctttataaa taaatacaat ggtttggctc gtgtcttttt 6445
His Gly Val
ttgatgtctg tctgtggggg agtggggtgt tgtggatatt agagggtaga gggtgctggt 6505
ttgaacgtct ccattaaccc acggggtccc cacacgggcc gtgtggt atg aat ctc 6561
Met Asn Leu
280
tgc gga tcc cgc ggt gag cac ccg ggc ggt gaa tat gcc gga ctt tac 6609
Cys Gly Ser Arg Gly Glu His Pro Gly Gly Glu Tyr Ala Gly Leu Tyr
285 290 295
tgc aca cga cac gat acc ccc gcg cac cag gct ctc atg aac gac gcc 6657
Cys Thr Arg His Asp Thr Pro Ala His Gln Ala Leu Met Asn Asp Ala
300 305 310
gaa cgg tac ttc gcc gcc gcg cta tgc gcc ata tct acc gag gcc tac 6705
Glu Arg Tyr Phe Ala Ala Ala Leu Cys Ala Ile Ser Thr Glu Ala Tyr
315 320 325
gag gct ttt ata cac agc ccc tcc gag aga ccg tgc gcg agt ttg tgg 6753
Glu Ala Phe Ile His Ser Pro Ser Glu Arg Pro Cys Ala Ser Leu Trp
330 335 340 345
ggg agg gca aag gac gcc ttc gga cgg atg tgc ggg gag ctc gca gcg 6801
Gly Arg Ala Lys Asp Ala Phe Gly Arg Met Cys Gly Glu Leu Ala Ala
350 355 360
gat aga caa cgt cca ccc tcg gtt ccg ccg atc cgc aga gcg gtg tta 6849
Asp Arg Gln Arg Pro Pro Ser Val Pro Pro Ile Arg Arg Ala Val Leu
365 370 375
tcg tta tta cgc gag caa tgc atg ccg gat cca caa tcg cat ctg gag 6897
Ser Leu Leu Arg Glu Gln Cys Met Pro Asp Pro Gln Ser His Leu Glu
380 385 390
ctc agc gag cgg ctg ata ttg atg gca tat tgg tgc tgt ttg gga cac 6945
Leu Ser Glu Arg Leu Ile Leu Met Ala Tyr Trp Cys Cys Leu Gly His
395 400 405
gcc gga ctt ccg act att gga ttg tcg ccc gat aat aaa tgc atc cgc 6993
Ala Gly Leu Pro Thr Ile Gly Leu Ser Pro Asp Asn Lys Cys Ile Arg
410 415 420 425
gcc gaa tta tat gac cgc ccc ggg gga att tgt cac agg ctt ttt gac 7041
Ala Glu Leu Tyr Asp Arg Pro Gly Gly Ile Cys His Arg Leu Phe Asp
430 435 440
gcg tac ctg ggc tgc ggg tcc ctt gga gtc cca aga acc tac gag aga 7089
Ala Tyr Leu Gly Cys Gly Ser Leu Gly Val Pro Arg Thr Tyr Glu Arg
445 450 455
tcc tga caccccatcc ctttatatag aaaaaaaaaa taaatttaaa acatacaccg 7145
Ser
gataaaagcg tactgttttt tatttaaatt tacacgctcg gcgttgcccc ggttcggtga 7205
tcaccgggtc ttatctatat acaccgtgta actcgaaccc ccgtgactcc ctccaatcgc 7265
gttaccaaac tcttcttccg tatccgtaga ttccgagtcc tcgaaatcgt ccactggaag 7325
gccagcgtaa tacatgcttt ggttatacgg ggacttgtct accctatagg ctttaaccct 7385
cattcgttta gccgtacaga ttaaaaatat tacgagacat aaaagtacta ctgctgcaag 7445
ccctccggtc catgcggcat atcgtagaag tggtgacgtt ccggggttta cgggggtaat 7505
ttccttgggt ttagtagtcg ccggtggctg accggccgtt ggcggaaatc cacgctcttc 7565
aattgcgttt acaaaatgat ctactgtgga tacaacagtg tatgctacgg cttcaacatg 7625
cccgttaaaa tacaccacaa aaacgtataa tcccgacaaa ctctcgggtg tatctacaaa 7685
ctttaacgtg gtgcccccgt cgtgtaagat tagaccaaag ctaggctcca tatgagatat 7745
tcccagacaa tatgcggtgt agttatctgc atgttcacaa ttttgataca ctgtgcttgc 7805
aacacgctgg gctaaatgtg gcgaggtaaa tgtacaaccg gaattcatat gagagaggca 7865
ttggggtgcg ttgggatgat acaaacacgt agaatataac cgcattggtt gacatgtagg 7925
atcgatgggg acatacaacc actctaacag caaatcaaat ggcgcttcat gtatcttata 7985
ctgaagatgc attgccaagc taaacgtatc accaactgaa aatacatgcg agtggtaatt 8045
ccacatatga aactcagccc ctcttggttg aggagttact gcgggcgtag ggtttcttgt 8105
tttttcatcc cctttccagg tgaccaaaaa cgtggcgtag gtagacgtac catcggagcc 8165
gcgcatgttc caaatgtaca cacccaagta ttgtttttct gtccgaagta ctttcaagac 8225
acccggttca atctcggggg ggtctaattc gagttcatca aacagtgtgc tcgtgtttac 8285
aacaatccac ggttggtccg cttccttctt accttgaaaa cggtaactga tttcggccaa 8345
ctgatcctct ttagtatttt ccgcgcaatc cacatccacc accacgtctt gaaagcatgt 8405
tgtatgtttt aaacatatat gctggatggc gggcgctgcg tctcccgtac aggttaatga 8465
cggcaaaaag ctccaagtct cggtgtaccg gactccataa atccgctgaa tcggtgcgcg 8525
taaagtaaac gggtgatttt cttccactga cacctcaatg agtctttggc cttggggttt 8585
tggattaaga tctcctttaa acacgtcacc gtattgacgt tggtccacat ttacaatttt 8645
atgtctgtca tcgccgttta acgtagggat aacgtggatg cccgtatcgt ccccaagatc 8705
ctcctgtgca gacatttgtg tgggttgcat taaccgttcc ccgctatcga taccacggcc 8765
ctgattatac accccatggt gttcgtgtgc gttctctaaa aatccatcat aatcattacg 8825
tggccatata taaggtgagt tatgatcgta cgcttttcgc gaagactctc cccgatttac 8885
ccatgaagac tccgcatgat ctgaatggta gtaaggctca tatacggagt ttgtatccag 8945
tttgtcttca tcggtgtgaa aatcatcgta tcgcaagacg gatgctctga ccggattcgt 9005
tatacgcaac gttcccgtga taattccgaa ccccatcaat acccccacca caggtttatt 9065
aactgtcccc atattacagg cgaccctcgg aatatagtca cttaaaacca cgcaattcgc 9125
ccgaggtcta aaaccatgga ctgctctcgt gtataccagc tcggccttta aggcttcata 9185
aaacatcccc gcccaaatgt gggcgtgtta acttatagtt aggagattct taatccttcc 9245
cctcatatca caacgcgtaa tcaaaacgct aaataaaacg catatagttt atttaatata 9305
aaatccggga taattagttc tatttaacaa acgggtttac aacggaatgt gggggggatt 9365
cttcgcgaat cgttgctagt tgtgcaatgg cggcctccaa catcacatcg gattctggtg 9425
tcgcattttg tatgcccctt cttgtttttg tatttgggcg ataaattgga tgttttttaa 9485
ttctacgtcg cttaacgctt attacaataa caataaccat ggcggtgagg atcatgacag 9545
acgctactat aggaataatt ataagtagat tttctggagg atcattaagg gacttttcgg 9605
gtaacgtgga catatccgtt ggatatacgt gattttctat cccctcctta acaacggact 9665
tagtttctgt cgtaacaacg tcctcatgta accaagggtt atcctctaac gatttaccgg 9725
cacgaagatc aagcatatgt tgaaacaggg aggtaccgga ccctttgggt gtcgcgggta 9785
aatcacacaa acgagcttgt tgaaaaaggg ctcttgtaga atatccattc tgtagagacg 9845
gagaagtgta gataaccccg tgaatgttgt gatgcgagcc cgctgtatat acatttacac 9905
caagaatgaa accatcggtg gatctgctat ggtctaaccg aacaagaagt acatacacac 9965
cagcatcatt tattcccggt ttggtaattt tcaacattac accagcatcc ggctctgttg 10025
atatccgatc cgttgagtta ccataatgcc acgaatgttt gtacctacac gaaataaaag 10085
cgctcgttct aatccgggga catccgtcgt atcttattac ttgtactgac cggaaacaaa 10145
acgccaccgt atccgcgtat aacagttcca gtgttccgct atagtttgtc ccggtaggta 10205
gttgctctcc aataaagaca agctgtcctt ttatctctgt ataattatca ttttgcatgg 10265
gaataaggat agacgtgaga ctgctgttaa cttgcaagct cacgtggtcg cccttgaaga 10325
tcaaagcgtt ggtcacttgt atgtaaaata taacggccga tatcaaacat tggattaaaa 10385
acatcgcgat taaatgaggc gcaccagtat acccaactac accgaagagc agctctgtga 10445
gccgggggag gggacgcttt tcagcgctac tacaatgtta agaaaatgat cccgtgtttt 10505
atccttgttg taatggggcg tgactctgta ttaagtttct attggttatg ttatttatat 10565
tagtatttca aaacattgcg cgtttgtacc ctatttgcct taacgggtta tacgttataa 10625
acacaattta tacaatattt tattaacagg cttaatgaat tttaatctcc aacttccatt 10685
ggatttggat atggatcggg aagagtttgg aaaacagagt ggttaagcaa cacctctgct 10745
gatggtcgat gacgtgcgtc aaacgataac atcttacata tcaaatactc caaatcaatt 10805
ggcaactcat atagatttgt ccacaatggc ctggatccgg gttttcgaga agaccgtttt 10865
gccaaaccaa tgtattgtcg acgaagattt gatgtagggt taatgggaaa ttcattggga 10925
tgagttccag atcgtcgtat aataagttta atttgacgct cactgtcaca attgccatct 10985
aaaccgtctc gttcaaataa cgagttctgt cctgtagcca tttcaaataa tacaatcccg 11045
gcactccata tgtccacggc aggtccatat ggatctctag ccaataactc aggagagttt 11105
gtggcgattg ttccagccca gccataatac ctgttggcat taatatccac ggggaaacac 11165
gctgctccaa agtctcccac acaaacatca cctgggtggt taataaatat attttcagat 11225
tttatatcac ggtgaataat actgttatta tgaagatact gtaacgcgcg taatacagat 11285
cgctgaattg ctaaaatgtc acatatgggg aggttgcgct ttgcagctag atagcagtat 11345
aaatctgttc ggtaacgtgg taatataaga catgtcattt tgttatacgt aaacgttcct 11405
ttaagctgta caacggatgg gtgggttaac gctcttaaca cggttgcctc ggtggccgtt 11465
ccttgacgtt gacccgcttt aatgaccaca tgttcacatg ttttactgtc catacacgca 11525
aacgcaaaac cttccgcccc tggtgtaaac gtttttaata ttacaaatcc cgctttatta 11585
attccaacgc gtgcttctgc gcacaatgcc acagaggctt cggcttcagt ttcgtgtaca 11645
ccatcaaata tattaaaaga ctgtatatcc atgttaacgt cataattggg atcatcctca 11705
gattctatgc cgggcgacat ttcaactgga aacatctgtt gtataaaccc acatgtttgc 11765
ataatatgtg acggtgatgt tgagatggcg ccacggaact ttccttgtcc aacaaaggtg 11825
tctgttgcat caacgtcgtt cattatactt gtataccccg acttatgttg atagtgttaa 11885
tgaaccaaaa acctgttgtc caacaactgt acaaaatata ctcaccgaca cacttataaa 11945
gtgatccgat tataagaggg cggagataat gacaaaaagg ggaggggtta aacataactt 12005
acaaatatgg atttgattgt tcctaggatt attaataatg tctacatgtt ttatttaata 12065
ataacgtaag gtaactttac ctctttgtga gaatgtcggt atagctgaca acgtttatga 12125
aaacgcgctt taaacgacct attgatgaat aatctagaat cttgcatgta tataaatatc 12185
aaaaggtgtg gtattaaaca cttttaaact agcctgctgg ccttacaatc tggatttct 12244
atg gcc gga caa aac acc atg gag ggt gag gcc gtg gcc tta ctg atg 12292
Met Ala Gly Gln Asn Thr Met Glu Gly Glu Ala Val Ala Leu Leu Met
460 465 470
gaa gcg gtg gta acc ccg cga gcg caa cca aat aat aca acg ata aca 12340
Glu Ala Val Val Thr Pro Arg Ala Gln Pro Asn Asn Thr Thr Ile Thr
475 480 485 490
gcc ata caa ccg agc cgt tcg gct gaa aag tgt tat tat agt gat agc 12388
Ala Ile Gln Pro Ser Arg Ser Ala Glu Lys Cys Tyr Tyr Ser Asp Ser
495 500 505
gaa aat gaa acg gca gat gaa ttt ttg cgt cga att gga aaa tat caa 12436
Glu Asn Glu Thr Ala Asp Glu Phe Leu Arg Arg Ile Gly Lys Tyr Gln
510 515 520
cat aaa ata tat cat cgt aaa aaa ttc tgt tat att acg cta ata att 12484
His Lys Ile Tyr His Arg Lys Lys Phe Cys Tyr Ile Thr Leu Ile Ile
525 530 535
gtc ttt gta ttt gct atg acg gga gcg gcc ttt gcc ttg gga tat ata 12532
Val Phe Val Phe Ala Met Thr Gly Ala Ala Phe Ala Leu Gly Tyr Ile
540 545 550
acg tca caa ttt gtt gga taa gtggacgatt tcgaggactc ggaatctacg 12583
Thr Ser Gln Phe Val Gly
555 560
gatacggaag aagagtttgg taacgcgatt ggagggagtc acgggggttc gagttacacg 12643
gtgtatatag ataagacccg gtgatcaccg aaccggggca acgccgagcg tgtaaattta 12703
aataaaaaac agtacgcttt tatccggtgt atgttttaaa tttatttttt ttttctatat 12763
aaagggatgg ggtgtcagga tctctcgtag gttcttggga ctccaaggga cccgcagccc 12823
aggtacgcgt caaaaagcct gtgacaaatt cccccggggc ggtcatataa ttcggcgcgg 12883
atgcatttat tatcgggcga caatccaata gtcggaagtc cggcgtgtcc caaacagcac 12943
caatatgcca tcaatatcag ccgctcgctg agctccagat gcgattgtgg atccggcatg 13003
cattgctcgc gtaataacga taacaccgct ctgcggatcg gcggaaccga gggtggacgt 13063
tgtctatccg ctgcgagctc cccgcacatc cgtccgaagg cgtcctttgc cctcccccac 13123
aaactcgcgc acggtctctc ggaggggctg tgtataaaag cctcgtaggc ctcggtagat 13183
atggcgcata gcgcggcggc gaagtaccgt tcggcgtcgt tcatgagagc ctggtgcgcg 13243
ggggtatcgt gtcgtgtgca gtaaagtccg gcatattcac cgcccgggtg ctcaccgcgg 13303
gatccgcaga gattcatacc acacggcccg tgtggggacc ccgtgggtta atggagacgt 13363
tcaaaccagc accctctacc ctctaatatc cacaacaccc cactccccca cagacagaca 13423
tcaaaaaaag acacgagcca aaccattgta tttatttata aagactacac gccatggggg 13483
ggcggtatat catgccggcg cggggcttcg tgtcgacgtc gcttccaatc tacacccccc 13543
tcgcccaacg cggtgagcgc tttcgcggct gtatattccg cggtttctgc acccgcgtat 13603
cgctcgatgg cacgctgggg tctctttggg gacggggtgt tgcacccatc cccgtctcga 13663
taacaatcgc cgggtacccc gtctggttca caagaatcgg tgctctcctc tgattcttct 13723
tccactataa agtcttcccc gtccgatgat tccgcgtcgg aatcccgaaa ttcaattaca 13783
tccgatggcg tagacccatc atcgtcgcta tcgtcttcac caccatcatc agatacgtca 13843
cattccagtg cgctcctatg caaaggaaca ttcggcgcct caatgaaccc gtcttccccc 13903
cgagggggtc gatgggacac ttccattaaa gtagcgcgta agcactgcgt ctgggtgggt 13963
tgaagctgta acaattcctc ccagcacgct aacgtgggac gctgtctggt gcgacccatt 14023
agataaaagt cgaggcatat gcgccgcaat cttcgtgaat ctcgcgtgac gcgtcccgac 14083
gactgtgcaa tacataaaaa aacacggttg atgtcctcaa cgagcctggc cgggtggatc 14143
tcccaacccg gagtacaaaa cgcgccgggg ccccgcgacg tatcccggcc gttcaggggt 14203
cctggtgcag atccatattc catctttccg ttaacatcaa ccgatgcccc gggttttgac 14263
tcggacgagt cgccccgcgt agccggtgag gtgcaaaaca tgtccttggg gccgtagtaa 14323
ccttttccct taaaaccgac tcgacgctgt cgcgttatga atcggacgaa ccctgcacaa 14383
caaaacacac acccaaacgt ttacatctat gaataaggct acttgggtaa aatggcaatg 14443
ggggattccg gggcgggaga ccttcgattg ggttgccttt ataacaccaa aaaaaggggg 14503
gggccccgtg tgtttttttt tatcacgtca aatcgatttt aaaaagcctg ccgctccatt 14563
tggaatatat atattctgtg aaaagcccgc ccacacccca taaaaccgcg acatcgcggg 14623
aacacgcgcg aacaagaaac tctctctctt tctctatata tatatatata tatatatata 14683
tatatagaaa gaaagtgcga acggtggttg gacacatgcc aaaacatgaa aacccataca 14743
gtgaaaaaac gggaagtgcg aatgcagatc aaaagagtgt atccgattgg cgtacaccac 14803
agacatgcgg acgcccaatt taaccccccc cctttttcac ccccccaccc caccccattc 14863
caccccagga agtgcgaacg ggtttacatg cctcagatat gaagttcttc gacttgtttt 14923
tgaataaatt tttttgtgat tttctacaac ggtttagaga attatggtta taaacatcgg 14983
cggggtaccg cgccccctcc ccatcggcgg ggtaccgcgc cccctcccca tcggcggggt 15043
accgcgcccc ctccccatcg gcggggtacc gcgccccctc cccatcggcg gggtaccgcg 15103
ccccctcccc atcggcgggg ggttacgtga acaccacaac cccgtgtgta ttttatgggt 15163
tatcgcgggc ttcgtgccgc ctgacataat cgttgggagg ggtggtggtg tatacgcttg 15223
ttgattgcgc gaacgtaatg acgacggaga gggacccaaa cacaccgtcg acgtgcattt 15283
gattaactag atgccggatg ggtggaaaca acccgtgtta tataagatgt tttgcatgtg 15343
agacaacccc aattgtgttt atgtatatta tatatcgtct gtagacacac gatgattggt 15403
tgttatttaa acatatgtaa atgaaattca catgtctggt atcccttgtt atgatgttgt 15463
aaggtatgcg gaaatagaca ccgggcgtac atcgccaacc agcggtctct ccttaaacgc 15523
atactatggt ccatgaactt cccgcctcga gtctcgtcca atcactacat cgtcttatca 15583
ttaagaatat ttacacggtg acgacacggg gaggaaatat gcggtcgagg ggggggcaca 15643
acacgtttta agtactgttg gaactccctc accaaccgca atcgcaatcc tttgaaggct 15703
gcgagagcgt ttggaaaact cgggtacgtc taaattcacc ccagtgcg atg gat acg 15760
Met Asp Thr
ccg ccg atg cag cgc tct aca ccc caa cgc gcg ggg tcg cct gat act 15808
Pro Pro Met Gln Arg Ser Thr Pro Gln Arg Ala Gly Ser Pro Asp Thr
565 570 575
ttg gag tta atg gac ctg ttg gac gcg gcc gcg gcg gcc gcc gaa cac 15856
Leu Glu Leu Met Asp Leu Leu Asp Ala Ala Ala Ala Ala Ala Glu His
580 585 590 595
agg gcc cgg gtg gtc acc tcg agt cag cct gac gat cta cta ttt gga 15904
Arg Ala Arg Val Val Thr Ser Ser Gln Pro Asp Asp Leu Leu Phe Gly
600 605 610
gag aac ggg gtc atg gtg gga cgg gaa cac gag atc gtt tca att ccc 15952
Glu Asn Gly Val Met Val Gly Arg Glu His Glu Ile Val Ser Ile Pro
615 620 625
tcc gta tcg gga ctt caa cca gaa ccc aga acg gaa gat gtt ggc gaa 16000
Ser Val Ser Gly Leu Gln Pro Glu Pro Arg Thr Glu Asp Val Gly Glu
630 635 640
gag cta aca caa gac gac tac gta tgc gag gac ggt cag gat cta atg 16048
Glu Leu Thr Gln Asp Asp Tyr Val Cys Glu Asp Gly Gln Asp Leu Met
645 650 655
ggc tcg cct gta atc ccg ctg gcc gag gtc ttc cac acc cga ttc tcg 16096
Gly Ser Pro Val Ile Pro Leu Ala Glu Val Phe His Thr Arg Phe Ser
660 665 670 675
gag gcc ggc gcg cga gaa cca aca gga gcc gat cgc tcc ctt gag aca 16144
Glu Ala Gly Ala Arg Glu Pro Thr Gly Ala Asp Arg Ser Leu Glu Thr
680 685 690
gtc tct ctc gga acg aag ctt gct agg tct cca aaa cca ccg atg aac 16192
Val Ser Leu Gly Thr Lys Leu Ala Arg Ser Pro Lys Pro Pro Met Asn
695 700 705
gat ggg gaa acg ggc aga ggt acg acc cct ccg ttc ccg cag gcc ttc 16240
Asp Gly Glu Thr Gly Arg Gly Thr Thr Pro Pro Phe Pro Gln Ala Phe
710 715 720
tcc cct gta tcc ccc gcg tct cct gtt gga gac gcc gcc ggg aac gat 16288
Ser Pro Val Ser Pro Ala Ser Pro Val Gly Asp Ala Ala Gly Asn Asp
725 730 735
caa cgg gaa gac cag cgg tct ata ccc cga caa acg acg aga gga aat 16336
Gln Arg Glu Asp Gln Arg Ser Ile Pro Arg Gln Thr Thr Arg Gly Asn
740 745 750 755
tca cca ggt ttg ccg tcg gtg gtc cat cga gac aga caa act cag tcc 16384
Ser Pro Gly Leu Pro Ser Val Val His Arg Asp Arg Gln Thr Gln Ser
760 765 770
atc tcg ggt aaa aag ccg ggc gat gag caa gcg ggt cat gcg cat gca 16432
Ile Ser Gly Lys Lys Pro Gly Asp Glu Gln Ala Gly His Ala His Ala
775 780 785
tcg ggg gac gga gta gtt ctc cag aaa act caa cgg ccc gct cag gga 16480
Ser Gly Asp Gly Val Val Leu Gln Lys Thr Gln Arg Pro Ala Gln Gly
790 795 800
aag agc ccg aag aaa aag act ttg aag gtt aag gtc cca ctc ccg gcg 16528
Lys Ser Pro Lys Lys Lys Thr Leu Lys Val Lys Val Pro Leu Pro Ala
805 810 815
cgg aaa ccc ggt gga cct gta ccc ggc ccg gtt gag caa ttg tac cac 16576
Arg Lys Pro Gly Gly Pro Val Pro Gly Pro Val Glu Gln Leu Tyr His
820 825 830 835
gtc ctt tcg gac agc gtt ccc gct aag ggg gca aag gcg gac ctg ccg 16624
Val Leu Ser Asp Ser Val Pro Ala Lys Gly Ala Lys Ala Asp Leu Pro
840 845 850
ttt gag acc gat gat acc cgc cca agg aaa cat gat gcc cgg ggt ata 16672
Phe Glu Thr Asp Asp Thr Arg Pro Arg Lys His Asp Ala Arg Gly Ile
855 860 865
aca cct cgc gtc cct gga cgt tcg tcg ggg ggc aaa cct aga gcg ttt 16720
Thr Pro Arg Val Pro Gly Arg Ser Ser Gly Gly Lys Pro Arg Ala Phe
870 875 880
ttg gcc ctg ccg gga aga tcc cac gca cca gac ccg att gag gat gac 16768
Leu Ala Leu Pro Gly Arg Ser His Ala Pro Asp Pro Ile Glu Asp Asp
885 890 895
agc cca gtg gag aaa aag cca aag agt cgt gag ttt gtt tcg tct tca 16816
Ser Pro Val Glu Lys Lys Pro Lys Ser Arg Glu Phe Val Ser Ser Ser
900 905 910 915
tcc tct tcc tcg tcg tgg gga tcg tca tcg gag gat gaa gac gat gaa 16864
Ser Ser Ser Ser Ser Trp Gly Ser Ser Ser Glu Asp Glu Asp Asp Glu
920 925 930
ccc cgg cgc gtt tcg gtg gga agt gaa act aca ggc agc agg tcc gga 16912
Pro Arg Arg Val Ser Val Gly Ser Glu Thr Thr Gly Ser Arg Ser Gly
935 940 945
cgc gaa cac gcc cct tcc ccg tca aat tcg gat gat tcg gac tca aat 16960
Arg Glu His Ala Pro Ser Pro Ser Asn Ser Asp Asp Ser Asp Ser Asn
950 955 960
gat ggt ggg tcg acg aaa caa aat atc caa ccg gga tat cga tcc atc 17008
Asp Gly Gly Ser Thr Lys Gln Asn Ile Gln Pro Gly Tyr Arg Ser Ile
965 970 975
agc ggt ccc gat ccg agg att cgt aag acc aaa cgt ctt gcg ggg gaa 17056
Ser Gly Pro Asp Pro Arg Ile Arg Lys Thr Lys Arg Leu Ala Gly Glu
980 985 990 995
ccg ggg cgc cag aga cag aaa tca ttt tcc ctg ccg cga tcc aga 17101
Pro Gly Arg Gln Arg Gln Lys Ser Phe Ser Leu Pro Arg Ser Arg
1000 1005 1010
acc ccg ata att ccc ccg gtg tcg ggg ccg ctc atg atg ccc gac 17146
Thr Pro Ile Ile Pro Pro Val Ser Gly Pro Leu Met Met Pro Asp
1015 1020 1025
gga agc cct tgg ccc gga tcg gca ccc ctc cca tcc aac agg gtg 17191
Gly Ser Pro Trp Pro Gly Ser Ala Pro Leu Pro Ser Asn Arg Val
1030 1035 1040
cgg ttt gga ccg tcc ggg gag acc aga gag ggt cac tgg gag gat 17236
Arg Phe Gly Pro Ser Gly Glu Thr Arg Glu Gly His Trp Glu Asp
1045 1050 1055
gag gct gtg aga gcg gcg cgg gct cgt tac gag gcc tca acg gaa 17281
Glu Ala Val Arg Ala Ala Arg Ala Arg Tyr Glu Ala Ser Thr Glu
1060 1065 1070
ccc gtg ccg ctt tac gtg ccg gag ttg gga gat ccg gct aga cag 17326
Pro Val Pro Leu Tyr Val Pro Glu Leu Gly Asp Pro Ala Arg Gln
1075 1080 1085
tac cgc gcg ctg att aac ctg atc tac tgt cca gac aga gac cct 17371
Tyr Arg Ala Leu Ile Asn Leu Ile Tyr Cys Pro Asp Arg Asp Pro
1090 1095 1100
ata gca tgg ctc cag aac ccc aag ctg acc ggt gtc aac tcg gcc 17416
Ile Ala Trp Leu Gln Asn Pro Lys Leu Thr Gly Val Asn Ser Ala
1105 1110 1115
ctg aac cag ttc tac caa aag ctg ttg cca ccg gga cgg gcg ggt 17461
Leu Asn Gln Phe Tyr Gln Lys Leu Leu Pro Pro Gly Arg Ala Gly
1120 1125 1130
acc gcc gtt acg ggg agc gta gcg tct ccc gtt ccg cat gta ggc 17506
Thr Ala Val Thr Gly Ser Val Ala Ser Pro Val Pro His Val Gly
1135 1140 1145
gaa gcc atg gcc acg ggg gag gcc ctc tgg gct ctc ccc cac gcg 17551
Glu Ala Met Ala Thr Gly Glu Ala Leu Trp Ala Leu Pro His Ala
1150 1155 1160
gcc gcg gcc gtg gct atg agc cgt cga tac gac cgg gcc caa aaa 17596
Ala Ala Ala Val Ala Met Ser Arg Arg Tyr Asp Arg Ala Gln Lys
1165 1170 1175
cac ttt atc cta cag agt ctc cgc aga gcc ttt gcc agc atg gca 17641
His Phe Ile Leu Gln Ser Leu Arg Arg Ala Phe Ala Ser Met Ala
1180 1185 1190
tac ccc gag gca acg ggc tcc agt ccg gcg gcg cgg atc tcc cgc 17686
Tyr Pro Glu Ala Thr Gly Ser Ser Pro Ala Ala Arg Ile Ser Arg
1195 1200 1205
ggt cac cct tct cca aca acc ccg gcc aca cag gct ccc gac cct 17731
Gly His Pro Ser Pro Thr Thr Pro Ala Thr Gln Ala Pro Asp Pro
1210 1215 1220
cag ccg tcg gcc gcc gca cgc tct ctt tct gtg tgt cca ccg gat 17776
Gln Pro Ser Ala Ala Ala Arg Ser Leu Ser Val Cys Pro Pro Asp
1225 1230 1235
gat cgt tta cga act ccg cgc aag cgc aag tcc cag cca gtc gag 17821
Asp Arg Leu Arg Thr Pro Arg Lys Arg Lys Ser Gln Pro Val Glu
1240 1245 1250
agc aga agc ctc ctc gac aag att agg gag aca ccc gtc gcg gac 17866
Ser Arg Ser Leu Leu Asp Lys Ile Arg Glu Thr Pro Val Ala Asp
1255 1260 1265
gcc cgg gtt gca gac gat cat gtg gtt tcc aag gcc aag agg cgg 17911
Ala Arg Val Ala Asp Asp His Val Val Ser Lys Ala Lys Arg Arg
1270 1275 1280
gta tcc gag ccc gtg acc atc acc tcg ggc cct gtg gtg gat ccc 17956
Val Ser Glu Pro Val Thr Ile Thr Ser Gly Pro Val Val Asp Pro
1285 1290 1295
ccc gcc gta ata acg atg cca ctt gac gga ccg gcc cca aac ggg 18001
Pro Ala Val Ile Thr Met Pro Leu Asp Gly Pro Ala Pro Asn Gly
1300 1305 1310
gga ttt cgg cgt att ccc cgg ggg gcc ctg cat acc ccg gtc ccg 18046
Gly Phe Arg Arg Ile Pro Arg Gly Ala Leu His Thr Pro Val Pro
1315 1320 1325
tcg gac cag gct cgc aag gcg tac tgt acc ccc gaa acc atc gcc 18091
Ser Asp Gln Ala Arg Lys Ala Tyr Cys Thr Pro Glu Thr Ile Ala
1330 1335 1340
cgt ctg gtc gac gac cca ttg ttt ccc acg gcc tgg cgc cct gcg 18136
Arg Leu Val Asp Asp Pro Leu Phe Pro Thr Ala Trp Arg Pro Ala
1345 1350 1355
cta agc ttt gat ccc ggc gcc ttg gcg gaa atc gcc gct cgg cgt 18181
Leu Ser Phe Asp Pro Gly Ala Leu Ala Glu Ile Ala Ala Arg Arg
1360 1365 1370
ccg ggc gga gga gac cga cgg ttt ggt cca ccc agc gga gtg gag 18226
Pro Gly Gly Gly Asp Arg Arg Phe Gly Pro Pro Ser Gly Val Glu
1375 1380 1385
gcg ctg cga cgg agg tgc gcc tgg atg cgg cag atc cca gac ccg 18271
Ala Leu Arg Arg Arg Cys Ala Trp Met Arg Gln Ile Pro Asp Pro
1390 1395 1400
gag gat gtg agg ctt ctg atc atc tac gat ccg ttg ccc gga gag 18316
Glu Asp Val Arg Leu Leu Ile Ile Tyr Asp Pro Leu Pro Gly Glu
1405 1410 1415
gac atc aac ggc ccc ctc gag agc acc ctc gcg aca gat ccg gga 18361
Asp Ile Asn Gly Pro Leu Glu Ser Thr Leu Ala Thr Asp Pro Gly
1420 1425 1430
ccg tca tgg agt cca tcc cga ggg gga ctg tct gtg gtc ctg gca 18406
Pro Ser Trp Ser Pro Ser Arg Gly Gly Leu Ser Val Val Leu Ala
1435 1440 1445
gcc ctg agt aac cgg ttg tgc ctg ccg agc act cat gcc tgg gcc 18451
Ala Leu Ser Asn Arg Leu Cys Leu Pro Ser Thr His Ala Trp Ala
1450 1455 1460
ggg aac tgg acc ggc ccg ccg gac gtg tcc gct ttg aac gcc cgg 18496
Gly Asn Trp Thr Gly Pro Pro Asp Val Ser Ala Leu Asn Ala Arg
1465 1470 1475
ggc gtt tta tta ctg tcg acc cga gac ctg gcc ttt gcc ggg gcc 18541
Gly Val Leu Leu Leu Ser Thr Arg Asp Leu Ala Phe Ala Gly Ala
1480 1485 1490
gtc gag tat cta ggc tcg cgg ttg gcc tct gcc cgg cgc cgg ttg 18586
Val Glu Tyr Leu Gly Ser Arg Leu Ala Ser Ala Arg Arg Arg Leu
1495 1500 1505
ctg gtg ttg gac gcg gtg gcc ctc gag agg tgg ccc agg gat gga 18631
Leu Val Leu Asp Ala Val Ala Leu Glu Arg Trp Pro Arg Asp Gly
1510 1515 1520
ccc gct ttg tct cag tat cac gtg tac gtc cgg gcc ccg gcg cga 18676
Pro Ala Leu Ser Gln Tyr His Val Tyr Val Arg Ala Pro Ala Arg
1525 1530 1535
ccg gac gcc cag gcc gtc gtc cga tgg cca gac tcg gcg gtc aca 18721
Pro Asp Ala Gln Ala Val Val Arg Trp Pro Asp Ser Ala Val Thr
1540 1545 1550
gaa gga ctc gcc cgg gcc gtg ttt gca tcg tcg cgc acc ttt ggg 18766
Glu Gly Leu Ala Arg Ala Val Phe Ala Ser Ser Arg Thr Phe Gly
1555 1560 1565
cca gcg agt ttt gct cgt atc gag act gcg ttt gcc aac ctg tac 18811
Pro Ala Ser Phe Ala Arg Ile Glu Thr Ala Phe Ala Asn Leu Tyr
1570 1575 1580
ccg ggc gaa caa ccc ctg tgt ttg tgc cgc ggt ggg aac gtc gca 18856
Pro Gly Glu Gln Pro Leu Cys Leu Cys Arg Gly Gly Asn Val Ala
1585 1590 1595
tac acc gtg tgt acc cgc gcg ggc ccc aag acc cgc gtc ccc ctg 18901
Tyr Thr Val Cys Thr Arg Ala Gly Pro Lys Thr Arg Val Pro Leu
1600 1605 1610
tcg ccc cgt gaa tac cgg cag tac gtg ctg ccg ggt ttt gac ggt 18946
Ser Pro Arg Glu Tyr Arg Gln Tyr Val Leu Pro Gly Phe Asp Gly
1615 1620 1625
tgc aag gac ctc gcg cga cag tct cgg ggt ctg ggg ctc ggg gca 18991
Cys Lys Asp Leu Ala Arg Gln Ser Arg Gly Leu Gly Leu Gly Ala
1630 1635 1640
gcc gac ttt gtg gac gag gcg gca cat agc cac cgc gca gca aac 19036
Ala Asp Phe Val Asp Glu Ala Ala His Ser His Arg Ala Ala Asn
1645 1650 1655
cga tgg ggc ctg ggt gcc gcg ctt cga ccc gtc ttc ctt ccc gag 19081
Arg Trp Gly Leu Gly Ala Ala Leu Arg Pro Val Phe Leu Pro Glu
1660 1665 1670
gga cgg aga ccg ggg gcc gcc ggg ccg gag gcc ggc gac gta ccc 19126
Gly Arg Arg Pro Gly Ala Ala Gly Pro Glu Ala Gly Asp Val Pro
1675 1680 1685
acc tgg gcg agg gtg ttt tgc cgc cac gcc ctg ctg gaa ccc gac 19171
Thr Trp Ala Arg Val Phe Cys Arg His Ala Leu Leu Glu Pro Asp
1690 1695 1700
cct gcc gca gaa cca ctc gtg ctt cca ccc gtg gcc ggt cgg tcg 19216
Pro Ala Ala Glu Pro Leu Val Leu Pro Pro Val Ala Gly Arg Ser
1705 1710 1715
gtg gcg ctg tat gcg tcg gcg gac gag gct cgg aat gcc ctc ccc 19261
Val Ala Leu Tyr Ala Ser Ala Asp Glu Ala Arg Asn Ala Leu Pro
1720 1725 1730
ccg att ccc aga gta atg tgg ccg ccc ggt ttt ggg gcc gcg gag 19306
Pro Ile Pro Arg Val Met Trp Pro Pro Gly Phe Gly Ala Ala Glu
1735 1740 1745
acg gtg ttg gag ggg agc gac gga aca cgg ttc gtg ttc gga cac 19351
Thr Val Leu Glu Gly Ser Asp Gly Thr Arg Phe Val Phe Gly His
1750 1755 1760
cac ggg ggc tcg gaa cgg ccg tca gaa acc cag gcg ggg cga cag 19396
His Gly Gly Ser Glu Arg Pro Ser Glu Thr Gln Ala Gly Arg Gln
1765 1770 1775
cgg cgc acc gca gac gac aga gaa cac gct ttg gag ctg gac gat 19441
Arg Arg Thr Ala Asp Asp Arg Glu His Ala Leu Glu Leu Asp Asp
1780 1785 1790
tgg gag gtg ggg tgt gaa gac gcg tgg gac agc gag gag ggg ggc 19486
Trp Glu Val Gly Cys Glu Asp Ala Trp Asp Ser Glu Glu Gly Gly
1795 1800 1805
ggg gac gac ggg gac gca ccg ggg tca tcc ttt ggg gtg agc atc 19531
Gly Asp Asp Gly Asp Ala Pro Gly Ser Ser Phe Gly Val Ser Ile
1810 1815 1820
gtg tcg gtg gcc ccg ggt gtg ctg cga gac cgc cgg gtg ggt ttg 19576
Val Ser Val Ala Pro Gly Val Leu Arg Asp Arg Arg Val Gly Leu
1825 1830 1835
cgc ccg gcg gtc aag gtg gag ctg ttg tcc tcg tcc tcg tcc tcc 19621
Arg Pro Ala Val Lys Val Glu Leu Leu Ser Ser Ser Ser Ser Ser
1840 1845 1850
gag gac gag gac gat gtg tgg gga ggg cgc ggg ggg agg agc ccc 19666
Glu Asp Glu Asp Asp Val Trp Gly Gly Arg Gly Gly Arg Ser Pro
1855 1860 1865
ccg cag agt cgg ggg tga cggagtcccc tccttttctc gtgagcgcca 19714
Pro Gln Ser Arg Gly
1870
ctggcgcgcg gactgtttgt tgttaataaa agcggaacgg tttttatgaa aaaagtgtct 19774
gtctgtctgt gcgggcgggc gacgggcggg ctggtcggac ccccccccga aaataacccc 19834
cccccggttt ctgggcgccc ggcggacccc gggagaggag gccagccctc tcgcggcccc 19894
ctcgagagag aaaaaaaaaa gcgaccccac ctccccgcgc gtttgcgggg cgaccatcgg 19954
gggggacttg cattacccta tcccagtatt gtttgtacgc ctctaatgga gtaactgtcc 20014
caatacaccc gacttcctat atacagtgct tttgtttgtg gacttacctt tatttacggg 20074
tattacaggg gggtggagaa atgtcttcgt atcgctcttt atctaaataa agacagaatc 20134
taaaatgtca cttagctgta tacccggccc aaggttatac caatcaactt ccctgtttgt 20194
tgggactgtc cgcctacccc aatacacatt ttatacccac gttttagtgg gtgggactta 20254
aaagaaatgg gtggagggat ataggggtgt gtcttcgttg gtaccaatta taaaaatgta 20314
ctcgccacaa ctcacaattt agaacgcatg gcagttctgc tacgtgtttg gatgcccgga 20374
cattagaata cagccagttg ttacc atg gat acc ata tta gcg ggc ggt agc 20426
Met Asp Thr Ile Leu Ala Gly Gly Ser
1875
ggc acc tcc gac gct tcg gat aat acc tgc acc ata tgc atg agc 20471
Gly Thr Ser Asp Ala Ser Asp Asn Thr Cys Thr Ile Cys Met Ser
1880 1885 1890
acc gtt tcc gat ctc gga aaa acc atg ccg tgt ttg cac gac ttc 20516
Thr Val Ser Asp Leu Gly Lys Thr Met Pro Cys Leu His Asp Phe
1895 1900 1905
tgc ttt gtt tgt att cgg gca tgg acc tcc acc agc gtc cag tgt 20561
Cys Phe Val Cys Ile Arg Ala Trp Thr Ser Thr Ser Val Gln Cys
1910 1915 1920
cct ctc tgc cgg tgt cca gtg caa tcc atc ctg cat aag atc gta 20606
Pro Leu Cys Arg Cys Pro Val Gln Ser Ile Leu His Lys Ile Val
1925 1930 1935
agt gat aca agt tac aag gaa tat gaa gtg cac cca tcc gac gac 20651
Ser Asp Thr Ser Tyr Lys Glu Tyr Glu Val His Pro Ser Asp Asp
1940 1945 1950
gat ggt ttt tct gag ccg tca ttt gaa gat tcc atc gac atc cta 20696
Asp Gly Phe Ser Glu Pro Ser Phe Glu Asp Ser Ile Asp Ile Leu
1955 1960 1965
ccg gga gat gtc ata gat ctt ctg cca cca agc cca gga ccg agt 20741
Pro Gly Asp Val Ile Asp Leu Leu Pro Pro Ser Pro Gly Pro Ser
1970 1975 1980
cgg gag tcc atc caa cag cca aca tca aga tcg agt cgg gag ccc 20786
Arg Glu Ser Ile Gln Gln Pro Thr Ser Arg Ser Ser Arg Glu Pro
1985 1990 1995
att caa tca cca aac cct ggg ccc ctt caa tcg tcg gct aga gag 20831
Ile Gln Ser Pro Asn Pro Gly Pro Leu Gln Ser Ser Ala Arg Glu
2000 2005 2010
ccc aca gca gag tca cca agt gac tct caa cag gat tct ata caa 20876
Pro Thr Ala Glu Ser Pro Ser Asp Ser Gln Gln Asp Ser Ile Gln
2015 2020 2025
cca ccg acc cga gac tcg agc cct ggt gta acc aaa aca tgc tct 20921
Pro Pro Thr Arg Asp Ser Ser Pro Gly Val Thr Lys Thr Cys Ser
2030 2035 2040
acc gca tca ttt tta cgg aag gta ttt ttt aaa gac caa cct gct 20966
Thr Ala Ser Phe Leu Arg Lys Val Phe Phe Lys Asp Gln Pro Ala
2045 2050 2055
gtt cga tcg gcg acc ccg gtg gtg tat ggc tcg att gaa tct gca 21011
Val Arg Ser Ala Thr Pro Val Val Tyr Gly Ser Ile Glu Ser Ala
2060 2065 2070
cag caa ccc cgg acc ggg ggg cag gac tac cgt gat cgt cca gta 21056
Gln Gln Pro Arg Thr Gly Gly Gln Asp Tyr Arg Asp Arg Pro Val
2075 2080 2085
tct gtg gga att aat caa gac cca cga acc atg gac aga ctg cct 21101
Ser Val Gly Ile Asn Gln Asp Pro Arg Thr Met Asp Arg Leu Pro
2090 2095 2100
ttt cga gcc acg gat aga gga aca gag gga aac gcg aga ttc ccg 21146
Phe Arg Ala Thr Asp Arg Gly Thr Glu Gly Asn Ala Arg Phe Pro
2105 2110 2115
tgt tac atg caa cct tta ctc gga tgg ctt gat gat caa ctt gcg 21191
Cys Tyr Met Gln Pro Leu Leu Gly Trp Leu Asp Asp Gln Leu Ala
2120 2125 2130
gaa ctg tat caa ccc gaa att gta gag cct aca aaa atg ttg ata 21236
Glu Leu Tyr Gln Pro Glu Ile Val Glu Pro Thr Lys Met Leu Ile
2135 2140 2145
tta aac tat ata ggt att tac ggg cgt gat gag gcg gga tta aaa 21281
Leu Asn Tyr Ile Gly Ile Tyr Gly Arg Asp Glu Ala Gly Leu Lys
2150 2155 2160
aca tcc ctg cgt tgt ctt ttg cat gat tca aca gga ccg ttt gta 21326
Thr Ser Leu Arg Cys Leu Leu His Asp Ser Thr Gly Pro Phe Val
2165 2170 2175
aca aac atg tta ttc ttg ttg gat cga tgt acc gat cca acc cgc 21371
Thr Asn Met Leu Phe Leu Leu Asp Arg Cys Thr Asp Pro Thr Arg
2180 2185 2190
cta acc atg caa acc tgg acc tgg aaa gat aca gcc atc caa cta 21416
Leu Thr Met Gln Thr Trp Thr Trp Lys Asp Thr Ala Ile Gln Leu
2195 2200 2205
att aca ggt cca att gta aga cca gaa acc acc tca acc ggg gag 21461
Ile Thr Gly Pro Ile Val Arg Pro Glu Thr Thr Ser Thr Gly Glu
2210 2215 2220
acc tct cgt ggc gat gaa agg gat acc cga ttg gta aat aca ccc 21506
Thr Ser Arg Gly Asp Glu Arg Asp Thr Arg Leu Val Asn Thr Pro
2225 2230 2235
caa aaa gtc agg ctt ttt tct gtg tta ccg ggg att aaa ccg gga 21551
Gln Lys Val Arg Leu Phe Ser Val Leu Pro Gly Ile Lys Pro Gly
2240 2245 2250
agc gca agg ggt gct aag cgc cgt tta ttt cat acc ggc aga gac 21596
Ser Ala Arg Gly Ala Lys Arg Arg Leu Phe His Thr Gly Arg Asp
2255 2260 2265
gtt aaa cga tgc tta aca ata gac ctg aca tct gag tct gat tcg 21641
Val Lys Arg Cys Leu Thr Ile Asp Leu Thr Ser Glu Ser Asp Ser
2270 2275 2280
gca tgt aag gga agt aaa acc cgc aaa gtt gcc tct cca cag ggg 21686
Ala Cys Lys Gly Ser Lys Thr Arg Lys Val Ala Ser Pro Gln Gly
2285 2290 2295
gag tcc aat acc ccc tcc acc tcc gga tca aca tca ggt tca ctg 21731
Glu Ser Asn Thr Pro Ser Thr Ser Gly Ser Thr Ser Gly Ser Leu
2300 2305 2310
aaa cac ctt acc aaa aaa agc tct gcc ggt aaa gcg ggt aaa ggt 21776
Lys His Leu Thr Lys Lys Ser Ser Ala Gly Lys Ala Gly Lys Gly
2315 2320 2325
att cca aac aag atg aag aag tcc tag tttgttggga gggggaagga 21823
Ile Pro Asn Lys Met Lys Lys Ser
2330 2335
aatgccttaa acatccacag tctgctttat taccaactgt atgtaaatta tgatcattaa 21883
acgtgcattt taaaaatacc tgagtgttgc tattggtcgt aagggttggt atatctggaa 21943
cccaaggtgt aaatactgcc ccctggtaag tccgtacacg attcggaatg cccgcaatcc 22003
ggtctcggag taaccggggc atcctgctct gcaaatacaa taaacataac agtctaacag 22063
tacgcatgtg ctgtcgtaag cgctccgccc taaattacaa aaatttgaga ttacgctcaa 22123
tacttgtcac cgtttgtatt tatacacctc tacgggagca aactaaaggt atacttggag 22183
atgcaaccaa ggccgagttg agacgggaaa attagaagct gtttttccag cggcaaaaat 22243
catccggttt gggtatgtct tatagcaatt aaagttatta aaatggccgg aaggtggcgg 22303
gggtggtgaa ggcgagggta aagcaaactt gcggggggat aaaaggcata tatgttacaa 22363
aatacgcacg caaatataaa ttttagtttt gtacttacag taaaggcaac ggtgttttcg 22423
gtttaggcag ccggacccgt ttaaaatgcc aactggatag tctgcaagga ggaaacgctg 22483
ctgcaatgag tcaggttttt aggtgaatta ggtatgatgc atcatttttg ttaatggact 22543
tttaatggat tgaaacggat agaaggaaac gtgtagcata atggatattg gttgatattt 22603
taagcacgct gccaagtacg cactaagaat gcttagaaat atttaaactg aaatgcgtgg 22663
gtttggcaaa cgttaccgta aatcaatctc agatccaagg gataatggcc gttgggcagg 22723
cctccaaaag gcaatttgag cgttttaagt ttggggtacg tttttaaagc ttgattggca 22783
aacgttaccg caaatcaatc ttagatccaa gggataatgg ccgttgggca ggcctccaaa 22843
aggcaatttg agcgttttca gtccgcttga aaacggtcta agtatttgcc gtaaatgacc 22903
gcgagaggta taatataaac atctgttttc ccgatgacgt gttaagaaaa gtgattatat 22963
cagataacgg gggactgtaa agcttaatac gtgacatgtg attgggtgta taaatactac 23023
atccaaatct tgcatggaaa ttcgctttgt cggcacgccg tgacgtttga catattaaga 23083
tcgccctttt taggaaaatg agatatgaca ccaagagttt aactaaaggc caactgtata 23143
tataaaactc atacttgtcc atgcttccat tagagtggtc aagtttttca aaggatacga 23203
caacgtcgta gtgaagggaa aacacaagcg tcatggcatc acataaatgg ttactgcaga 23263
tagttttttt aaaaactatc acaatcgcgt attgtcttca tctccaagac gacactccgt 23323
tgttttttgg agccaaaccg ctatcggatg tgagtttgat tataacggaa ccgtgcgtgt 23383
catcggtata tgaggcgtgg gactatgcgg cacccccggt atcaaacctc agcgaggcgc 23443
tatcgggaat cgtggttaag acaaaatgtc cagtaccgga agttatactt tggtttaaag 23503
acaaacaaat ggcgtactgg acaaatccat acgtcacctt aaaggggctg gcacaatctg 23563
ttggtgaaga acataaaagc ggggacatac gcgatgcttt gttggatgcc ctttccggtg 23623
tatgggtaga ctctactcca tcttccacaa atatcccgga aa atg gat gtg tct 23677
Met Asp Val Ser
2340
ggg gag ccg acc gtt tgt tcc aac gcg tat gcc aat gaa atg aaa 23722
Gly Glu Pro Thr Val Cys Ser Asn Ala Tyr Ala Asn Glu Met Lys
2345 2350 2355
cta tcg gat tca aag gac att tat gtt ttg gcc cat ccg gtt acc 23767
Leu Ser Asp Ser Lys AspIle Tyr Val Leu Ala His Pro Val Thr
2360 2365 2370
aaa aaa acc cgc aag cga ccc cgc ggg ctg cct ttg ggg gtt aag 23812
Lys Lys Thr Arg Lys Arg Pro Arg Gly Leu Pro Leu Gly Val Lys
2375 2380 2385
cta gac ccc cca acc ttc aag tta aat aac atg tca cat cat tac 23857
Leu Asp Pro Pro Thr Phe Lys Leu Asn Asn Met Ser His His Tyr
2390 2395 2400
gac acg gaa acg ttc acc ccc gtc tct tcg caa ctg gat tcg gtt 23902
Asp Thr Glu Thr Phe Thr Pro Val Ser Ser Gln Leu Asp Ser Val
2405 2410 2415
gaa gtt ttc agc aag ttt aac att tcc cct gag tgg tat gac ctg 23947
Glu Val Phe Ser Lys Phe Asn Ile Ser Pro Glu Trp Tyr Asp Leu
2420 2425 2430
ttg tcg gac gaa ctt aaa gag ccg tac gcg aaa ggt att ttt tta 23992
Leu Ser Asp Glu Leu Lys Glu Pro Tyr Ala Lys Gly Ile Phe Leu
2435 2440 2445
gaa tac aat cgt ctt tta aat tca ggg gaa gaa ata ctt cca tct 24037
Glu Tyr Asn Arg Leu Leu Asn Ser Gly Glu Glu Ile Leu Pro Ser
2450 2455 2460
aca ggc gat att ttt gca tgg acg cga ttt tgc gga ccc cag agc 24082
Thr Gly Asp Ile Phe Ala Trp Thr Arg Phe Cys Gly Pro Gln Ser
2465 2470 2475
att cgc gtt gta att att ggt caa gat cca tac cct acc gcg gga 24127
Ile Arg Val Val Ile Ile Gly Gln Asp Pro Tyr Pro Thr Ala Gly
2480 2485 2490
cat gca cat ggg cta gcg ttt agt gta aaa cgt ggc ata aca cca 24172
His Ala His Gly Leu Ala Phe Ser Val Lys Arg Gly Ile Thr Pro
2495 2500 2505
ccg tct agt ctt aaa aat att ttt gcg gcc ctc atg gaa tca tac 24217
Pro Ser Ser Leu Lys Asn Ile Phe Ala Ala Leu Met Glu Ser Tyr
2510 2515 2520
cca aat atg act ccg ccc act cac gga tgc ctg gag agt tgg gca 24262
Pro Asn Met Thr Pro Pro Thr His Gly Cys Leu Glu Ser Trp Ala
2525 2530 2535
agg cag ggg gtg tta ttg ctg aat acc acg ctt acg gtt cgt cgc 24307
Arg Gln Gly Val Leu Leu Leu Asn Thr Thr Leu Thr Val Arg Arg
2540 2545 2550
ggg act ccg ggg tcg cat gta tac tta ggc tgg ggg cgg ctg gtg 24352
Gly Thr Pro Gly Ser His Val Tyr Leu Gly Trp Gly Arg Leu Val
2555 2560 2565
caa cgc gtg cta cag agg tta tgc gag aac cgt aca ggg tta gtt 24397
Gln Arg Val Leu Gln Arg Leu Cys Glu Asn Arg Thr Gly Leu Val
2570 2575 2580
ttt atg ctg tgg ggt gcg cat gca cag aag aca acc caa ccg aat 24442
Phe Met Leu Trp Gly Ala His Ala Gln Lys Thr Thr Gln Pro Asn
2585 2590 2595
tca aga tgt cat ctg gtg cta aca cac gcg cat ccg tcg cca ttg 24487
Ser Arg Cys His Leu Val Leu Thr His Ala His Pro Ser Pro Leu
2600 2605 2610
tcc cgt gtt cca ttt cgg aat tgt cga cat ttc gtt caa gcc aat 24532
Ser Arg Val Pro Phe Arg Asn Cys Arg His Phe Val Gln Ala Asn
2615 2620 2625
gag tat ttt acg cgt aaa ggc gaa ccc gag atc gat tgg agt gtt 24577
Glu Tyr Phe Thr Arg Lys Gly Glu Pro Glu Ile Asp Trp Ser Val
2630 2635 2640
ata taa cactccaatc gaccctcttg cgtaccataa tgttttcgga gttgcctcct 24633
Ile
tccgtaccga cggcattgct tcaatggggt tggggattgc atcgtggacc gtgttcgatc 24693
ccaaatttta aacaggtagc cagccaacac agtgttcaga acgattttac agaaaatagc 24753
gttgatgcaa atgaaaaatt tccgattggg cacgcgggct gtattgagaa aaccaaagac 24813
gactatgtac catttgatac gttgttcatg gtatcatcta ttgacgaact tgggcggaga 24873
caattaaccg acaccatccg ccgcagcttg gttatgaacg cctgtgaaat aacggtcgcg 24933
tgtacgaaaa ccgcagcctt ttctggtcga ggcgtgtcac gacaaaaaca cgtgacccta 24993
tctaaaaata aattcaatcc atccagtcat aagagcctgc aaatgtttgt gttgtgtcaa 25053
aaaacccatg caccccgtgt cagaaaccta ctgtacgaga gtattcgtgc aagaagacct 25113
cgccgatatt acacccgctc aacggacgga aaatcgcgtc cgttggtacc agtgtttgtg 25173
tatgagttta cggctttaga tcgtgtcctt ttacataagg aaaatacttt gaccgaccaa 25233
ccaattaata ctgaaaatag cggtc atg gac gta cga gaa cgt aat gtg ttt 25285
Met Asp Val Arg Glu Arg Asn Val Phe
2645 2650
gga aat gcc agc gtt gcc acg ccg ggg gaa cat cag aaa ttt gta 25330
Gly Asn Ala Ser Val Ala Thr Pro Gly Glu His Gln Lys Phe Val
2655 2660 2665
cgc gag tta att ttg tcg gga cac aac aac gtc gta tta cag aca 25375
Arg Glu Leu Ile Leu Ser Gly His Asn Asn Val Val Leu Gln Thr
2670 2675 2680
tac act ggt aaa tgg tca gac tgc cgt aaa cac ggt aaa tcg gtt 25420
Tyr Thr Gly Lys Trp Ser Asp Cys Arg Lys His Gly Lys Ser Val
2685 2690 2695
atg tat aat acc ggt gaa gcg cgg cac cca acc tgc aag gct cat 25465
Met Tyr Asn Thr Gly Glu Ala Arg His Pro Thr Cys Lys Ala His
2700 2705 2710
caa cgt taa gaggataggt gtcttcaaat taaaagccgt taaatataaa 25514
Gln Arg
ttttttgtgt cgtttattcg cgtatgaaat gtaatcagtt gggataaatg ttagtcttga 25574
atctgtcttt acgcgtttgc ggcgtcccgt aatattaacg aaagaacgtc gtcgtctgat 25634
gtgaaaaagg ccggttgtgg tgaagaagga cagttggttt caggctgccg cctttgcgat 25694
gtatactggt gcggggttgg tgttggtgtg gtctgatcgt caggtctgtt tacccgtaaa 25754
ccgtatataa caattgagat acacatggat aggtcatctt ccgttattac ggatagtacc 25814
ccccccacat tactgtacag ctgagatgtt gggtcagaac cgacccagga cataaacggc 25874
atcgcaaatg ttgcgtgggg tttgtcctgt aagtggaaat cccggtgcgt aaatgcgtat 25934
gatgaaaagt cagtagaaca gtctatggtt agcagggtca ctgtatgggt gtgttgaagg 25994
acgatcatac caccgctcat tgtatcacat cgagcgtccg tcttcaccgc cattgatctt 26054
gacccacccg taaatgcgta gagtgtttgt tcatgatccg gtaaagtcca gtgaagaacc 26114
cgcctaaccc catgtaacct atataaaata aacgtcgttg ggggtccaga gatgctcatg 26174
gcgggtgtat ttaagacaaa acctagaaat gagaacagca agcccgcatg cgtgttttta 26234
tacaatgctt taatacacaa cgtgtacgtt gggatcacgt agagcggcaa gaatgtgatc 26294
gctaatttct gcggattttt catatcgttc ccgtagaggg tttagattca tttttaagaa 26354
cctagaggag accgtacgcg acatggcaac atacacacta tttaggcgca gtttatccgc 26414
cgtaaaacag atagctacct tctctaaact cagaccctgt gagcgtgcaa ttgtcatggc 26474
tagtttagaa ctaaggccat agtccaccgt ggtggccatt gccaattctg catcctctat 26534
agactcggta aattcacata cgttagcgtt tacgatggac acaaacccat gttgatcctg 26594
taagaccata agaggtacgt gtagagcata taacatttcg gtagtttcgc ggtacagctt 26654
ttttcgggtc agttcctcta caaagactgg cacgggtgca aaagtatacc ccaaaagggt 26714
atacgtatct gtttgtaatg ccaaagataa cagccccccc cgcacattgc cgatcaaaag 26774
ctcgcttccg ttgaaattaa cattatcgac atacgcgcta aagggggcgg ttgtaaagct 26834
gctttcgaaa agctctgcta atatggcata tcgaccatcg taaatagctt tcattaacag 26894
aaactgagca tggacctcat tggatgattt gggataggaa aattcgtact ggcagtatac 26954
caagtcaata acctcgtcgt taagtgccgt aaaaattaca tcatccgggt cttcgggtat 27014
ggaggccgtt actttggact cgggggcttt ggtaaagcag aaggctttat cggcagaagt 27074
agctgcacca tcacagtcaa tcccggcacc cgccccatta accgctgctt cgtgtaaggg 27134
ggctttgttt gttccagaca ataactccag ggttaaggca gctaaccgtt tgtatgctaa 27194
cgatacctta tcgggatgta ggtttttatt taacaaaaaa ttgtaaaagt taatcaagcc 27254
tccaaagatt aggtttgaca gaaaccggta agcatattcg atggatgtct ctcctcgagc 27314
cttcacaaag gagtcgtcac gtaaaacctt tgcaaacgat tgaaatgtac cactgaatcc 27374
aataactaac ttacgtagtc ttgtggttac aactacgaga ctatttagca cgtaagtgat 27434
gtctgtacgg gctacaatta agtcgcgatt tgaatgtgtt tcgtatttaa ctgttcccat 27494
gtcatgatct cggctttggg aataattgtg caaacgaccg gagtttgccc gtatccaatg 27554
ttcaacagaa agtccgggtt gtcccgttaa tttgcggtat tcatcaaatg ccgttagccg 27614
gatgaatgta taagtcggta aggcaaacac agaaaaatgg tcatttttcg atagttttaa 27674
atgcgcgtgt aacttactca tatacgcgct cacctcctta tgcgacgaat acagacgcgt 27734
ccatccggga agattagcag gattgttaat ataggatgca ggtacaacaa atgtatcgac 27794
cagacgcgca tgtgcttcgg taataggtag cccgtactca agcgttttta aaagatttcc 27854
aaaatcgtcc tcttgacatc gtttgttatt aataaagatt gcccagttat gtgagatgtt 27914
agtatattga cgcagagttt gattgcagat tatatacgtg agtatatttt cactaggagt 27974
tacgtgtgaa cgctgcatgt catgttgaaa atgagattct aacgagtcag tttgggtggg 28034
cgaaccgacg catactatga ccggttttcg accgtttatg tactgagggc tttgatatat 28094
agcattcaaa agccaccaac agtaaacaac ggccgtgaga atatgacgcc ctagcaatcc 28154
tgcttcatca ataacaataa cgttgctccg ggtaaaagcc ggaagggaac cgcatgcgat 28214
aaacgcggtt ccggacattg agcctgtagg tttattaagc aaacgttcaa ttgcccacag 28274
ggttttaaat gtcgatgttc ctccgcgacc gtcgtccccc atttgaaaca ctcgttttgt 28334
tatatcaatt aaaacttccc agtagtatac aatatctctt ttttgcaggt cctcaataga 28394
agggggggtc gtagtccagt tatatgcgta acggcccagc tgagcctgaa tgtgatttcc 28454
gcgaaaacca aattcatgaa agattgtgtt tatcggacga ctcgcataag ccgttgataa 28514
cttagcatga acattttggg cagcaaccct ggtggatccg gtaataatgc aatcgatagc 28574
ttcgttaagc gtttggatac acgtgctttt tccggagccg gcatttccgc taattaaata 28634
aacggagaag gggagctcat tcctctgcat ttcaggaggc gcttgcagtt cgagtaactg 28694
tttaaaccaa cacaaccgcg gaacacgttc ttttggaatc gtaattgcgg caagttctcg 28754
aatccgtgag aggataggtt gaatgctatg catagaggtg aagtttaaat atacactgtc 28814
atcaaatcca ttgggcgtct ctggattaaa aacgtttttg ggtgaagaac tgtctacaga 28874
aattgatctt ttcatcgtgg ttttcaatgg ccgaaataac gtctcttttt aataacagtt 28934
ccggtagtga agaaaaaagg atagcaagtt ctgtttctat tgaccagggc ttgaatggaa 28994
gtaacccaaa tgaccaatac aagaacatgt tcgatatata ctggaatgag tacgccccgg 29054
atatagggtt ttgtacattt ccggaggaag atggctggat gttaatacac ccaaccacgc 29114
aaagtatgtt gtttcgaaaa atcctagccg gtgactttgg atataccgat ggacaaggca 29174
tatatagcgc tgtacggtct acggaaactg taattcgcca agttcaggca accgttttga 29234
tgaacgcgtt ggatgcaact cggtatgagg acctagcagc agattgggaa caccacatcc 29294
aacaatgtaa cctgcatgcc ggggctctag cggaacgtta tgggctatgt ggagaatcag 29354
aagccgtacg gcttgcacat caggtttttg aaacctggcg tcaaacatta cagtcatcgt 29414
tacttgagtt tctgcgtgga ataaccggtt gtctctatac cagtggttta aatggaaggg 29474
tcggttttgc caaatacgtg gactggatag cctgtgtagg tattgtgccc gttgtaagaa 29534
aggtacgatc agaacagaat ggaacccctg caccattaaa tacgtatatg ggtcaagcgg 29594
cagaactgtc ccagatgtta aaagttgccg atgcaacgtt ggccagagga gcggcggttg 29654
tcacaagcct agttgagtgt atgcaaaatg ttgctattat ggattatgat aggacgcgtc 29714
tttattataa ttataaccga agattaatta tggcaaagga tgatgtaacg ggcatgaagg 29774
gagagtgttt ggtcgtgtgg ccgcccgttg tatgtgggga gggtgtagta tttgactcac 29834
ccttacagcg gctttctggg gaggtgttgg cctgttatgc attacgtgaa catgctcgcg 29894
tctgccaagt tttaaataca gcccctttgc gcgtgttaat aggtcgccgg aatgaagatg 29954
atagatctca cagcacacgt gcggttgatc gtataatggg cgagaacgat acaacacggg 30014
ctggatcggc cgcgtctaga cttgtaaagc taatagttaa cttaaaaaac atgagacatg 30074
ttggagatat taccgaaacc gtacgttcct atctagaaga aacgggcaat cacattctgg 30134
aaggaagtgg atcggtggac acatcacaac cggggtttgg caaggccaac caatccttta 30194
acgggggggc aatgtccgga acaacaaacg ttcaaagtgc gtttaaaact tcggtggtta 30254
acagtatcaa cggcatgctc gagggttatg tgaataattt attcaaaacc attgagggtc 30314
tcaaggatgt gaacagcgat ctgaccgaaa ggctccagtt caaagaagga gagctgaaac 30374
ggttacggga agagagggta aaaataaagc catctaaagg gtcacatatt acaatggcag 30434
aagaaacacg tattgccgat ttaaatcacg aggttataga tcttaccggc ataatagggg 30494
atgatgcata tattgccaat agttttcaat ctcgttatat ccccccttat ggagatgata 30554
taaaacgttt gtctgagcta tggaaacagg aacttgttcg ctgttttaag cttcaccggg 30614
taaacaataa tcaaggccag gaaatttctg tatcatattc aaatgcgtca atctcattac 30674
tagttgcgcc gtatttttca ttcatattac gggccacccg attaggattc ttggtaactc 30734
aaagcgaggt acataggtca gaggaagagt tatgccaggc tatttttaaa aaggcgagaa 30794
cagagtccta tttatcccaa atccgaatat tatatgaaat gcaggttcgc gcagaggtaa 30854
taaaacgggg cccacggaga acaccaagtc cttcctgggg tttgcctgac cctacagaag 30914
atgacgaaag aatcccggaa cccaataaaa taaataacca atacatgcat gttggatata 30974
aaaacctatc ccattttatg aaaggacacc cccctgagag gttacgggta cacaaggtaa 31034
atg cag cgg att cga cct tac tgg ata aaa ttc gag caa acc gga 31079
Met Gln Arg Ile Arg Pro Tyr Trp Ile Lys Phe Glu Gln Thr Gly
2715 2720 2725
ggc gcg ggg atg gcc gat ggg atg tcc gga ata aat ata ccc agc 31124
Gly Ala Gly Met Ala Asp Gly Met Ser Gly Ile Asn Ile Pro Ser
2730 2735 2740
att tta ggt tgc agc gta acg atc gac aac tta cta aca cga gcc 31169
Ile Leu Gly Cys Ser Val Thr Ile Asp Asn Leu Leu Thr Arg Ala
2745 2750 2755
gaa gag ggg ttg gat gtg agc gac gtg atc gaa gat ctt aga ata 31214
Glu Glu Gly Leu Asp Val Ser Asp Val Ile Glu Asp Leu Arg Ile
2760 2765 2770
caa gca ata cca aga ttc gta tgc gag gcg cgg gag gta acc ggt 31259
Gln Ala Ile Pro Arg Phe Val Cys Glu Ala Arg Glu Val Thr Gly
2775 2780 2785
ttg aag cca cgc ttt ttg gca aac tct gtt gta tca ctg cgc gta 31304
Leu Lys Pro Arg Phe Leu Ala Asn Ser Val Val Ser Leu Arg Val
2790 2795 2800
aaa ccg gaa cac caa gag acc gtt tta gta gtg ttg aat ggt gat 31349
Lys Pro Glu His Gln Glu Thr Val Leu Val Val Leu Asn Gly Asp
2805 2810 2815
tca agt gag gtg tcc tgt gat cgt tac tac atg gag tgt gtt act 31394
Ser Ser Glu Val Ser Cys Asp Arg Tyr Tyr Met Glu Cys Val Thr
2820 2825 2830
caa cca gcg ttc cgc gga ttt att ttt tcc gta tta act gcg gtt 31439
Gln Pro Ala Phe Arg Gly Phe Ile Phe Ser Val Leu Thr Ala Val
2835 2840 2845
gaa gat agg gtg tat acg gtg ggg gtg cct ccg cgc ctg tta atc 31484
Glu Asp Arg Val Tyr Thr Val Gly Val Pro Pro Arg Leu Leu Ile
2850 2855 2860
tat cgg atg act cta ttc cgc ccg gat aat gtc cta gat ttt acc 31529
Tyr Arg Met Thr Leu Phe Arg Pro Asp Asn Val Leu Asp Phe Thr
2865 2870 2875
tta tgt gtt att tta atg tat ctg gaa ggc att ggg ccc tcc ggg 31574
Leu Cys Val Ile Leu Met Tyr Leu Glu Gly Ile Gly Pro Ser Gly
2880 2885 2890
gca tct cca tcg ctg ttt gta caa ttg tct gta tat ctt aga cgc 31619
Ala Ser Pro Ser Leu Phe Val Gln Leu Ser Val Tyr Leu Arg Arg
2895 2900 2905
gtt gag tgt caa ata gga cct ttg gaa aaa atg cgt cgg ttt tta 31664
Val Glu Cys Gln Ile Gly Pro Leu Glu Lys Met Arg Arg Phe Leu
2910 2915 2920
tat gag gga gtt tta tgg ttg tta aac act cta atg tat gtc gtt 31709
Tyr Glu Gly Val Leu Trp Leu Leu Asn Thr Leu Met Tyr Val Val
2925 2930 2935
gat aac aac ccc ttt aca aaa acc cgc gta ttg ccg cat tat atg 31754
Asp Asn Asn Pro Phe Thr Lys Thr Arg Val Leu Pro His Tyr Met
2940 2945 2950
ttt gtt aag tta ctg aac cct cag cct gga acg gcc ccc aat att 31799
Phe Val Lys Leu Leu Asn Pro Gln Pro Gly Thr Ala Pro Asn Ile
2955 2960 2965
ata aag gct ata tat tca tgt ggg gtg ggt cag cgt ttt gac ctg 31844
Ile Lys Ala Ile Tyr Ser Cys Gly Val Gly Gln Arg Phe Asp Leu
2970 2975 2980
ccc cac gga acc ccc ccc tgt cca gat ggt gtg gtg caa gta ccc 31889
Pro His Gly Thr Pro Pro Cys Pro Asp Gly Val Val Gln Val Pro
2985 2990 2995
ccg gga ttg tta aat gga cct tta cga gat tcg gaa tat cag aag 31934
Pro Gly Leu Leu Asn Gly Pro Leu Arg Asp Ser Glu Tyr Gln Lys
3000 3005 3010
agc gta tat ttt tgg tgg tta aat cgc acc atg gta aca ccg aaa 31979
Ser Val Tyr Phe Trp Trp Leu Asn Arg Thr Met Val Thr Pro Lys
3015 3020 3025
aat gtt cag tta ttt gaa acg tat aaa aat tca cca cgg gtt gta 32024
Asn Val Gln Leu Phe Glu Thr Tyr Lys Asn Ser Pro Arg Val Val
3030 3035 3040
aag taa ataaaccttt tattttaagg atgggttgtt gcggcgtgtt tttttgtcat 32080
Lys
aaaaacaaga agttatatga agcaagggga tcgtcagtaa atttcaatga tggacttgaa 32140
aaatcattat gagcatctag aagatggtta ataatagtga caaacgtgtg tataaggggc 32200
ttgagatgag cagcgcagtc tatgggggga atacaggcac acgatggtag caattcccag 32260
taaacaacag taggaggtcc atgtgggttg ggattacgaa ccataattat ctgtcttggt 32320
ttataatcct tatcttgcat gtgtccgtat gtcagataag atgccggtat tgcttttgtt 32380
tccgaatttt gctcctgaag actccagtat accggagtat aacggcgatc aaaaacatgt 32440
tcaaaaaaat cgacgtatgt ttttctggca gcctccacaa cttgtccggc ccatatgtcg 32500
tataggacat ggtcccatat gcaattttcc tgtatccgtt ctcgaatttt tgttgcggta 32560
cataggagtc ctgaaagacg ctcctttaaa tctcgcgccg cctgacttcg ccatggcacg 32620
ccaactaaat ccggaggcgt gtttgtttgt aaattccaca gccagcttcc cgtggtacac 32680
caggtaaatg cgtgagtgta cacaccctcg acccggagta ccggttccat ggcgggtgta 32740
aagttaagac ctagttgaat tcgtaataag cccgtgactg ctctttcgca ggtggccgct 32800
aaatttaagg ccgaacaacg tgcctgatct gaacatacat tgcgcgtatg taaatcggtg 32860
aagattcccc aaaatccatc ttttatataa gtacatatag caaagccctg ttccaaggcc 32920
gtttgttcga cgcaaaggct aatttcattg gctataaaaa taatagcttt gtatatactc 32980
gggagtatgt gccgtaaccc cccaaaaaaa gacaccaacg cgggttttag gcaccccatt 33040
cctcgacgca ttctagccag gacgattttg cctaatctaa aatgcatggg gaacatagcg 33100
cagtatatcg tgggaaaaaa ggcgctgaaa tcaaactttg ccatccaacg atcttgtagt 33160
tgtacaggct gtttgaagac ggtagatatc cggcccgcgt cccaattgac agttgcatct 33220
gcatgtccag tatcttgagg gatttgccca ataagatcct gcacccgcgg aggtaagggg 33280
ggatgctgtc cccagggggc tagataatga aaggcctgaa taccggttac tggccacaac 33340
ccatccgttt ccatgtacat ctccgtagcc gtgcgaatag attcgacggc tggagtttgt 33400
gtccgaagag tggtccatcc gggaaatccg acagtacaaa agtagtctct gtcttcaccc 33460
ttcggattga gttccgggcc taaatacgca aagattggac ggacaacccc gggttttcca 33520
aacgccgccg cgtaccacag tctatatgtt gctaatattg aaagggaaga tgtaacggtg 33580
gaatccgttg ttaaacagtc aaaataacac ggtaagagta cccttaccac aattgcgcga 33640
ttgccactaa gaaccaagcg gcgttcttca aggttaaaca aatcggaggt ttggccttcc 33700
cgctctatag ggggggaaaa cgtagactta tatttccgac gtatgtaggc ctttggtgta 33760
attacctcca cgtctaaagg tgtttgttgc tgggtgtcac cgttagattt aagtgcggtc 33820
aggcgggcgt aggctagcga cgtaccttgc gtgtgggcgt tacatgcgct gactgctgaa 33880
acatcaatgg ccacgggaaa attagccctc aatactccgg tagatggcgt gttgtcatta 33940
ttcaacgcgt gttcatcggg gattgttttt aacagaagac ctgtattcgt atcagtctcg 34000
aatgttactg gcgaataaaa cgatttaaac cctaatacgc gccgtaaagt ggccagggca 34060
gtgccatgga gacaacgcca ggtgtctgcg ttacccagtg gttcaagggg ccaggcgtct 34120
aacgcagccg aggacgccgc cctgcagata gacgcaaacg ttgccgcaga aacactcccc 34180
ccatgcctgg agtagcgaca taaatcctct tgttgtacgg taatttcggc aaacttaggg 34240
acgtatttcc cacatgagtt tttgcataat aagtagtata acacggaaag accgttctgt 34300
gttaattgtc cggactgtgt ccaggatgtg tatatgcttg cggcacaaat gtgtccgctt 34360
tctctaacca aggtaatctg cgttgcgtcc atagcttggt tgtggttcag tgcattgtta 34420
gaggcccttt gtgttgtaca gaggcttagt atttatgtaa ccccccctat ggatgtccat 34480
atatggtatt aacgggttat aaactttcaa aatttaccgc cccgtttatc cgcggccacg 34540
atcccgggga cagttccgat agcgctaatt gtagcatggc ttcggcatgc ggggccaagt 34600
gcatttgatg tgcttgaaaa tacacgtggc gtgcccaagc gggggcagta attttgtaac 34660
gctgttgaga gaacgaatgg cgctggctca taagcaggtt atacaactgg cggtaggtac 34720
gacatgcaga acgatccacg ttaattgaat caagtaaagt ttctagatct ttttttttta 34780
ggtttttcac acgagtactt cccggaaagg tctgggtact tttagttacg cgggcgccaa 34840
ataattgttc ccaaattaaa cgaaatagag caatgttata ttcgtgtttt gtagggattg 34900
gacggaagca ttgattatgg catccatcta agaatccggt tgttcgccaa accgggcggt 34960
ttaatataat cttagcgtca gtaacgttac agcgcataac ctcaaatatt agttgagcca 35020
gcgaatcccc gtgttgattg aaaagtgtat taatgtccgt atccaaaact tgtgttacac 35080
agccttcggt ggacggattc cacttgagtg tttttgcaat ataggcacaa cggcgaaaca 35140
gctcccaatg tgtggttaaa tttggggcgg tagttatagt cgcgatgcta aattccccgg 35200
ttttgtccag gacgggaaac caaccggaag cgtagtgcgt gtatattttt tgaaatacgt 35260
caaatgcttc caacgcttca ggtattcgta tacacgcccc caatattgcc gtattgacta 35320
gtctaggctg ttctattgga catgcaaggg ctttaaatac tttaatttgt tcttcggtat 35380
taattgactc cataagatac ttttccataa acgcggtaat gttatcggcc ataaatccgg 35440
aagggccaaa atcggtcaag cagctgtcat tgtcctgtct aagagaacgc atacaggaag 35500
ctacggcgtt agcgctatgt ctgagatcgt gtataaatgc acaaaattga actggggaaa 35560
cgtccgttat tggacccatg ccatccaata caaccaaaat ttggttagag gccaaaagag 35620
tttgtaagat attaatgcta tcggctaaac tccaaagaga gcatctctcg aaaaggtgtt 35680
tgtatttaaa tcttgagaag agatgggtgt tcgagcgtgt aaatgcattt gcacatcgtt 35740
gcctaaatgc acagcacagt ttgttagtta tttgggtgtg tgtaggaaac cattgaaatt 35800
tatttgcgat ggtaaagttt agtaacattg gcgagaacag gggtccgcat ctggtccttg 35860
agccatcgac gtacatcaaa acttcattaa gtagcaataa acgtacacgc cctaatgact 35920
ggtagaccga taccatatcc ggcccatatg acattggctt tatgtaagca aacatgctat 35980
gaaaatgagc catgtcaaaa ctcaatccaa cggtcacgac ggtagtgtac accaacacgc 36040
gaaaatgttt ccattcgttt acattacata ggggccgagt tgagtttaaa ataagaatag 36100
agtctgtaaa tattgcacaa aactgagcaa ctagctccga aaacgataaa gttgatgaaa 36160
atatacagat gttatgccca cattgtaatc gtagtgctag ttcgtcaaaa aatgttccac 36220
gtagttggtg tatggtacgt acatcctcgt gttcaggaga tcgtttaatg actcgcacaa 36280
gcgtgtcgat gcccatatca cgcaggatcg tgcaagttct tccggagaac ccaactcccg 36340
cgtatgtaca cacaattgtg tgtatgtttt catctccacg caatccggag attaaatcaa 36400
taaactgcga gtttactgta gcatccatcg cgataatttg agaacagcga tttaaaagac 36460
gatataatag gctatcaacc gcggaaagac gtctcattgt gggggagtat aattgtccaa 36520
tcactgacat tacctcatcc agtattaata cgtcgtagct gtcgatagct tcgctggata 36580
cgcggtgtag gctttcaagt tgcacaatca aacgtttaaa acccataata tatgtctcgg 36640
atgtcaaata tgttacgaat ccggagaggc ctgcatcgtt aaaccgttga atcaacgtct 36700
gggtaaagct acggcgacat gagacaacca gtacgctaat atctgccttt aacgcgtgtt 36760
gaagccactc aagcaaggct gttgttttac cagagcccat aggggcacgt acgacggtta 36820
ccgggcgagt ttgtgacata ccaggtttta ttagttttac tggaacatcc aattgcagtt 36880
ccaggctaat tcccgggtgg gtgtgtttaa tccacgaaac cagatcccct ccatataacg 36940
cccgcgcgag ctgtgtactg gacgcataga cggcggcgtt gctctccccg gtgttgggag 37000
acatgggaac tcaaaagaag gggccgcgtt ctgaaaaagt ctcgccgtac gacaccacga 37060
cacccgaggt ggaagcgtta gatcatcaaa tggatacgct taattggcga atttggataa 37120
ttcaggtgat gatgttcact ttgggtgcgg taatgctcct ggctacgtta attgccgcct 37180
cttctgaata taccgggatc ccttgttttt atgctgccgt agttgattat gagttattta 37240
acgccaccct agatgggggg gtatggtccg gaaatagagg tggatacagc gccccggttt 37300
tgtttttgga accacatagc gttgtggcat ttacttacta cacggcttta acggcaatgg 37360
ccatggcggt atatacactg atcacggccg cgattataca ccgagaaacg aaaaatcaac 37420
gtgtccggca aagctccggt gttgcatggt tagttgtaga tcccacaaca cttttttggg 37480
gtcttttgtc attgtggtta ttaaacgccg ttgtgttatt attagcttac aagcaaatcg 37540
gcgtggctgc tacattatat cttggacatt ttgcgacaag tgtaatattt acaacgtatt 37600
tttgtggacg cggaaaattg gacgaaacga acataaaagc ggtcgcaaat ttacgacagc 37660
agagcgtctt tttatatcgc cttgcggggc ctacgcgcgc agtgttcgtg aatttgatgg 37720
ctgcgttgat ggcgatatgt atcctatttg tatcattaat gctggaactt gtggtggcga 37780
atcatctaca tacgggactg tggtcatcgg tgtccgtggc catgtctaca tttagtacat 37840
tgtcagttgt atatcttata gtatcagaat taattttggc gcattatata cacgtgttaa 37900
taggaccgtc cctgggaacg ctcgtggcct gtgctacgtt gggaaccgcc gcgcactcgt 37960
atatggaccg attatatgac cctatatcgg ttcaatctcc acggttaatt cccacaactc 38020
ggggaacctt ggcttgcctg gccgtgtttt ccgttgtcat gttgcttctc agattgatgc 38080
gtgcatatgt gtatcatcga cagaaacgca gtcggttcta cggtgccgta agaagagtac 38140
ccgagcgggt acggggatac atacgaaaag taaaacctgc acatagaaat tctcgccgca 38200
caaattaccc atcacaaggc tacggctacg tctatgaaaa tgactcaaca tatgaaacgg 38260
accgcgagga tgagctgtta tacgagcgat caaacagtgg gtgggagtag cgtgggttgt 38320
acttgtaaac ccccaagcaa taaaacaaac attcacagta accaaccatg atttgatgtt 38380
tttattagaa cgtttatcag ggtttatcag ggtttaacat tttgcgcatt tggaatggga 38440
gctctttccc gaaggttttc gttttggtgt tacatcgatc actcgtgggc ggcgttgtgt 38500
ggatttatcg gcgtcctctg ttacattctc atcaaagtca aagtcttcaa agaactctga 38560
gttcaaagca acggtttctc cgttgcacgt aacaagctcg ttgtaccgtt tgcacaatcc 38620
acagattcct cctcgaccgc tggatgaaga ttgtcccatc gcaatgaccg caatgctgat 38680
gtaatcgcgt tgtttcctgt gtctttaagt atttgtgtca ctctaggcac atcaacttcg 38740
ataggcgtta gaacggctat aaccggaacc gcacagtggg gtgggatagt cggccagtta 38800
tattcgtaaa tggcagccgc ttgtgtatcg acgcgcaatg ggactcccac ctcgtgtggt 38860
cggcgcacac gtcctataaa ggtgacaagg aagggccgta gttttagcgc tggaaagtaa 38920
ccggataata cataggtttg tacggcaatt tgtttaaagt tagcgtgccg cggattggca 38980
aatacaggaa tttgaagagc cgtatctcca gtggcccatg caatggttgt aattgtcccc 39040
ttttgtatgt ccggatcatt aaatacccac actcgggatg agattgtccg attgaggccc 39100
aaatgatggg cgtcgagttc tgtaagggcc ctcccacctt taaatcctaa acgtttccat 39160
tcaacgtgat ccgttattaa ggcctcgcgt gtacttgggt ttgctgaggg tccaaagaag 39220
cttacacatg ggttttttat agtatataaa aagtcccgta gatttgccat tgtaggtcta 39280
tttattaacg tggtgtacgt ccgaccgagc gggttatttt tgtcatctgg atcaaagagg 39340
tatttagcac gacatttaat ttcaaaaaat gaaatgtcgg tttctgcagg gtggggattt 39400
agggtgccag ttaaagggtc cctgtcacat acaagaacgt ccaacgacgc gcctataagt 39460
cccgttcgaa cgtcgattaa aagtccacat tcatacgttc cgattcttgg tgtacccaca 39520
aggagacttg gagatttggt attgagcatg tcaaaattaa aaaacttagt tgtagcttct 39580
ggcgttaaat gatcaggagt acggatatca gccggatcta taaacaatgc ttcaactatc 39640
gctcgtgccg ccggttcgtt ggtttttcca aacgccattg ccgccgcatc accgtatgtg 39700
tctgtagtgt tatggtggta aaaccactga ggtggaatga cgggtccaga tacactccat 39760
ttaagagttg atgcggtgat taaatttcgt ctgagtagtg tccaaatggc attgtcgccc 39820
tgaccacgag tctccgattc tattgttaat agaatacgtt taatgtcgtc ttgggttaat 39880
ttggatgcta actcttccgg acgcggctcc tgaggctgtg tggacgtctt aaagaacagt 39940
gcgttggtag aaccgggtgt tgtgtaggtt acatcatttg gagactccag agttgttaga 40000
attttataca tgtgggataa acgaataaaa atacccgctg tcatcccgcg acagtccatg 40060
ttcggtgatg tcttctgtag cgtgtctata aaatggtgtt caacgttaat cgtgttaata 40120
tccgttttta caaaagggtg ctgtagacct cccacacggg caattttttt ggctggctgg 40180
gggcttatgt cgatcctatc caatcccgat cgtgccataa taaagggtga tccgagcgta 40240
tttctgaatt atcgtgacac aacaacgatg taacaacagt ggctgggtat aatgaggccg 40300
gaatgcgtat gccttcgaaa agctttgagt gcgttacgcc gtaacgcact gcgtgacact 40360
ggaatatccg gcgtaaagaa ttccgggaaa tacgggtgtt cagctctccg gcgtgtataa 40420
agggattata atttaagtcc ccggggatgt cagatgggag tatatatggc gccaggacac 40480
atcgatatgc cagcgtatca agcgccaaat ctggtgataa cttatgaccg tagtagtgat 40540
aatgcggggt ccgatgtact gaaatgggca gttgtccggg aagacgtcct agcaggataa 40600
cttctaagag tgcttggccc aatgcataaa gatcaatcgc aagtccaact ctttggggca 40660
aggttccagt atatttagta aggcccgttc cattaatata atcaagcaag atttcagggg 40720
gttggtttgt tccatgactc aacaccaaac gaaatgacat ctgcgatgca tcccggggta 40780
cgcgtagtac gtgctccggg tgggatggat ttccaacttc aaatatcgct cgagtacaaa 40840
gggaatacgt atttaatgtt actaggctat agtctccgat tactgctgtg gttatttcca 40900
acgaggcaaa gttgtcgacg ttaagaaaaa tattgccaca tttcacatct aggtgggtca 40960
ggccgcacgt tcgatttaaa aacgtcaacg cttgagccaa atctaaaaat acatgggcaa 41020
tttttctgtc tatgtgatca ggtattgtca accgtctgga caggcgaaca atgtattcat 41080
ccatatccat gtcgtatgcc ggaaacagta gctgtttggt ttgtaacgaa aaacctaaaa 41140
ggcaaactat gctagaaata cctagccttt cccctgctcg tatagaaccc tcactcgcta 41200
aaatcgcgtt aattaactct ctattaaaaa cacgactgtc catggttttt acagctattt 41260
ttgacgatgg gtatatatga acacgcccat acgttccccg tcccgcaaac ctgggtctat 41320
ctcgaatctg taatttagaa aaacataact gttcattgag cgtgaagatg ggtacaatta 41380
aacgggataa tttggaaatt ctaaaaaccc atgggggttt gttaaattgt ggaaaaacgt 41440
ccaccctttc gcgttgatgt attggatctc tcgttaagcc gtgctttaat aaaaataagc 41500
ttgtctgttg tgatcgttct ggtgattcca tgcatgttgc agttgacgga ttggcccatg 41560
gatgatttac acatccgtgt gtggggttgg tggattctcg ttcttgctga tcggctgcgg 41620
ttgcatttgg ttcatgtccg tcggtattgt ggtgggaacg caaaggtcct gcagttggag 41680
atatttggag gttggggggt gtgtcgtcag catccattgt gtaagtaacg cttcgtcgtc 41740
ttctccaaac caagtatgat tttcgtgttc gatgttcggt cctaccgcgt cttccagtct 41800
gtctatatta tcggtattat cgtcttctgt tttatccaaa ccgcgaataa aatcgggtgc 41860
tatatagcgt ctgtttgtat ttaaaatctg ttgaatactt gtttgttcct caaccttgga 41920
ttgtatatca cgaatcttct gttcaactga agctatgcga gctgcagagc ggagctggtt 41980
atttaaatcg gcgcaggcct cttttgcagc agtaaaggca tacacaaatg ctggatcttg 42040
tacactcgca cccccacgga ttagatcgag ggtccgttct ttaaataacc ccgcgcggtt 42100
atgcgcctca actagtttga cacggttacg cctatgagaa gcgtaggttg gagtgtggcc 42160
tgacatgacc aaagtgaatg aatgcggccc tgaaaatgtt tatgatatat acgtaacgag 42220
tacgtaagat gacgcggctt cctgttttac tgcttacaag agatgaggcg gattccgtga 42280
attgatgtta aatgtcattg ataatgtttg gtcgtacgct tggtgaagaa tctgtaagat 42340
attttgaacg tctaaagcgt cgtagggatg aacgctttgg gacgttggag tcccctaccc 42400
cgtgttccac gcggcaaggg tctctgggaa acgcaaccca aatcccgttt ctgaattttg 42460
ctatagatgt aacccgacgt catcaggccg ttattcccgg aattggaacg cttcacaact 42520
gttgtgaata tattccactg ttctcggcta ctgctcgacg ggcaatgttt ggcgcgtttc 42580
tatcgtcaac agggtacaac tgtaccccca atgtagtttt gaaaccatgg cgatattcgg 42640
taaatgcaaa cgtaagccct gaattaaaaa aggctgtcag tagtgtacag ttttatgaat 42700
attcaccgga agaagcagca cctcatcgaa atgcgtatag cggtgttatg aacacatttc 42760
gcgcgttttc tctgtcggat agtttctgtc agttgtctac ctttacacaa cggttttcgt 42820
accttgtgga aacatctttt gagagtattg aagagtgtgg aagtcatggc aaacgcgcaa 42880
aggttgacgt tccaatctat ggcagatata aggggacgtt ggaactgttt caaaaaatga 42940
tcctcatgca caccacgcat tttatttcat cggtgctatt gggcgatcat gccgacagag 43000
ttgactgctt tctgcgtaca gtgtttaaca cgccaagtgt ttctgacagt gttttagaac 43060
acttcaaaca aaaatcaact gtgtttttgg taccacgtag acatgggaaa acatggtttc 43120
ttgtaccatt aatagcttta gtaatggcca cgtttagagg aattaaagtg ggttatacgg 43180
ctcatatacg caaagcaacg gaacccgtgt ttgagggtat caagtctcgc ctggaacagt 43240
ggtttggggc aaattacgtg gatcatgtaa aaggcgaatc tattacgttt tcatttaccg 43300
acgggtctta cagcacagcg gtgttcgcgt caagtcacaa cacaaacgtg agtgttttat 43360
aaatttaacc tttaatatat tactgtaaat gttgacatat acctttccac aacggcggtt 43420
gagttaaggt atactaggtg gttgtaggtt ccggttcacc cgataatctt tgtgtctcgg 43480
ggaagcaaat tcgctgaagc agaccacagc cgttaataat agcccggctt aatgtttctc 43540
caaacatata aagctgccac ccagatgaat ttactggtac agagagacca ctggcgttgg 43600
ttcccgctat aacgtcgcca agatttgcgg taatgcgagg atttttagta ctcgtaattc 43660
gaatgcaggt ggtgacatct acaaaaagaa cctgcgtggc gccaatgtct acctccactt 43720
ttaattcccg ctgaccggcc tttctccaca tacacggagc ccaacacaca caaccttccg 43780
catgatttgt gacatggggt aacgcataca gtgcccccac gtgaactcta tgattacatt 43840
catcacatcc gtccgcatgg ctgaggagtc gatttaatac agagccaagt atccgagcat 43900
gccatccggc gggacatagc cctattaaat taggttccat agccagtaca tataaacgcc 43960
ttcgttcgtc tgaccaccac actcccggag aaataacttt acatgcgtat ggatttttcg 44020
gaagccgcgg gggttgtaag tagttgctta agtttggcgt tggtgtaaga tctgcggggg 44080
tgggatctgc tcgaggatcc ggaatagatg ttggaagggg gtacgcgatc gggttcttaa 44140
acgttgctcc aaaaacatgg tctatgtttt caaccggata aattcttaaa gtcgccgtca 44200
ttgcgtacga gacctcgtaa ttaaaattta caattacatg aaaagtcttc ggaggtaagt 44260
tcatctgacg tgggcgcgtg atgtaaattg tggctacaac aacggcaata ttagtagtat 44320
ccgtttgaag ggggataaac ggagcgatcc ttaaagttat aaaagcagtt gatcgcatta 44380
ttttcacccg gggatcggtc aggatggact tccataatcc catatccagc gttaatgcat 44440
cgcagagtct ctgaactgcc tcgggggtta atttgcgcgc tgcacccgta gcggtgtaca 44500
gcggaaatat gcgttgtaat tccatgagca gaaaaacagt ctagcggatt gccgcccggg 44560
tacttgtggg tttaatgcca cccacccggt tattttatat tttaagaggg ggtggaaacg 44620
ggagaaatga cgtaaaatta catatgaaga gattctggtg ttatgttttt atagtgacac 44680
taatttattt atgggggttg ggaatagaga agcagaatct gtctagaata ggtccgatta 44740
acgatgcagg tagtgctgcc tgtagggtat cggtaataca aaaacatgcc gcaaatcccc 44800
ccggtaaaac taaaatggat tgtaattgct ggttaaatcc tagacaaatg tacgcgtaac 44860
attgaccggg taaatactta gaacaaattc caatatcaac aatatccgcg ctgcgtataa 44920
atttacccct cagttgtgtg gaattaccaa taccaacctt ttctaaggct acgggaacgg 44980
ggaccttgga aagcttaagt atttcccctc ctgaattata atagtcaaat aaatatatag 45040
aacgattacc taaccagcat gggaaggaag cgtgaaggta gggtatatac cccccacctt 45100
gtggtcgtgt atatacagat gacagatacg ccaaaaccgc atacatcaag gagctgttat 45160
aaaacgcatc cattgacatt tccgttaaca ccgaaactat agtctgaatc aggtctggtg 45220
tgcgggctac aatttcatca ataaccgttt gggaagaatc tgcaatatca tattccatga 45280
gttgttgtag agtcgggttc gtttgtaact ccgttataag accttgggtt agcgatgtca 45340
cacacgcttg tttaaatacc tggtttaaaa acatttcggc gcctggttta aaggcgggta 45400
agggggtttg attatttagg acgttagcca aaaacggtaa acgcgcgact agctcttggc 45460
gagctgtcac atgtaggctt tggggattgt caacccgggc atttatacac gcagcatcaa 45520
taatagcctg tgcagagtga tataaaattg gacttccggt aatacgtcct ccccaggcag 45580
aggatccgtt gtaagatact acaatcaacg gactggggga ttctgcgtaa tgtcgcggta 45640
caattgatag gggacgccgt ttccagaaat ctgctggagt gtccccgcta actaattggg 45700
cataacagat gtcgaaccat tccataagac tttggggttc tgtcgaagct ggggtaaaca 45760
atagaacgtc ttgtaaaggt gggatgctgg cggacgaatt gtttttcttt cccgtaaatc 45820
gcccttgtcc aggcggctca aggacgccat caaaggaacc gttattgatc ggatctgtgt 45880
tggaagtttg cgctccgtgg ccctttgcac tttgaagcaa cccagatgca acgcgggaac 45940
tagaaggtcg gacggggtgc ctggagttaa caatgtttac ggcccgtttt attagctcaa 46000
ggacgtcccg attattttcc tgtatgcgtg tttcagcagg ggagtcatca atacctccag 46060
aagttaactg tcgatcaaga tcgattatgg atgaaacggg tccaatattg tccccatttg 46120
acgtgtgtga ttcacccatg gctgccacca tatgctctgc gtatattttt atagacgatg 46180
caagacgagg ggtgcatcgg atatacgcaa tcagctgttt gcataataaa agtacccgtt 46240
gtccatcagc aaaataacgc gttccgtttg ggattagttc tgcatacata atacaaatat 46300
cacggtgctt gcggtttcca gtatttattc gtatcgctac aacgttaaat gcatcaaaga 46360
ataaaccggg gctaagataa acaggcaatg ataaaatcaa tccccctgaa ttatgcgtgg 46420
ccgaaaaaac gtgtgaaaca aatggttccg tttttggtat taagagattt gttaaggcgt 46480
tatcgggaat gtacgcggcg aaaacttgac accacggttc gcattgacct gtagcatgat 46540
atcttgtttg tacttcaacc ttgaagcgtt gtccgggttt ctttaaaatc agtaatgcgg 46600
gatctattcc ggccgcaata agccccgcgt taggtatcac aacgtgtagt aatccttttg 46660
tgtgatcatt atgccaaagt gcatgtttgg tttcatttgc caaatgggct tccattatac 46720
accggatatg gttgtactgg aaaaaaaaaa gaaatatgta cgtattcaaa cattttttac 46780
gtacgtggta tttaaggata catttaaact ttggtggggt aactatatat ctttctatcg 46840
ttccagggta tccgaggtca agattttaat cttctgtttg tggatgaagc taattttatt 46900
cgacctgatg ctgtacaaac tatagtcgga tttttaaatc aaaccaattg taaaattatt 46960
tttgtttcat caacaaatac cggaaaagca agtacaagtt ttttgtataa cttacgtgga 47020
tcgtcggatc agttgttaaa cgttgttaca tatgtatgcg acgatcacat gccgcgtgtt 47080
ttagcacata gcgatgtcac agcttgttcg tgttatgtat taaataagcc ggttttcatc 47140
acaatggatg gagccatgcg gcgcactgca gatttattta tggccgactc cttcgtgcag 47200
gaaattgtag gtgggcgtaa acagaattct gggggtgtgg ggtttgaccg gccattattt 47260
acaaaaactg cccgtgagag gtttatttta tatcggccgt caaccgttgc gaattgtgct 47320
atattatcgt cagtgttgta cgtttacgta gaccctgcat ttacctcaaa tacacgagcg 47380
tctggtactg gtgtagcgat tgttggtcgt tataagtcgg attggattat atttggattg 47440
gagcactttt ttcttagagc tttaactggc acgtcttcca gtgagatagg gcgttgcgtt 47500
actcaatgct taggccacat actcgcttta caccccaata catttacaaa cgtacacgtt 47560
tctatagagg gaaacagcag ccaggattct gcagttgcca tatcgttggc tatagcacaa 47620
cagtttgctg tcctcgaaaa gggaaacgtg ctatcttccg ctccagtgtt actgttttat 47680
cattccatac ctcccggatg tagcgtggcg tacccttttt ttttattaca aaaacaaaaa 47740
acgccggccg tagactattt tgttaaacga tttaactccg gaaatataat agcctcacag 47800
gagcttgtat ccctaacagt aaagttaggt gtagaccccg tggagtatct atgtaaacag 47860
ttggataacc tgacagaggt aattaaaggc ggtatgggta atctagacac aaaaacttac 47920
acgggtaaag gtaccacggg aacaatgtca gatgatctga tggttgcatt aattatgtcc 47980
gtgtatattg gcagttcatg tataccggat tccgtgttta tgcctattaa ataaaaacaa 48040
gacgcgtgaa atgtaactag actggttgtg tttttattaa cacctgttta cacttgaatc 48100
acggccgtgc ggtctcccgg atgaacagta gagggtccat caaacacaca gacgcgtaaa 48160
attgcgcgag ttcccaaggt atcccccagt ctagatgatg ttgatatcca agatgaaata 48220
gtttcatata tgggaaatcg gcgatggccg tcttgaatcc gcggcattgg cggtatatta 48280
attaactggc cggtttctct aacgcaattg gccgtttgta tcaataaatt tacataaccg 48340
tcgtgtgctc cttggactaa caaagttgga atcagcgcca ataaaagcag acatccctcg 48400
tttatggaaa acattagatt taaaaccaat gtgcgaatag ccgcatccga tccgtcccga 48460
tgttgtaaat ttgtttcgag ttcataacgc cttccattgt aacaaatgac atccgcgcct 48520
cgtagaactt cgtggggtaa aatttgagca cccgcggccg tgcgctcaac ggcccgtgca 48580
accactttgg cgatgatttc tcgcattaag gtctggggga tggttaaagg gaaaacaatt 48640
tccagtccta cagagtccaa gcgaattgag tctgcggacc cgaaaaccgg aggaagtaag 48700
cagatgtaat cgccattaca aagatcaacg ggggatgtat tttgaataaa caacccgggt 48760
gtaggagtac ccacgccaat ggcatgcgcc aacaactttc ccgggatgac ccgtgttata 48820
actgccgcga atcgcattcg gtatgcctct aacagcgaga gcgtgtctgg tggagcaccg 48880
ttaatgtaat atgacgacag cgctatatcc accagtgaag ctcgatgacg cagggttgag 48940
aaggtaataa tttttccctc acatttctgt aatgcagatg tttccgccgg ggatagttct 49000
cctggtaaca atacctctat ctcaaatggc atagccatgt tgtttaagga agcgtgaaac 49060
agaccaagtc ggccgagcgt atccaaatct caagtaagat gccaatgcgc tctacagggc 49120
ttttattggg agtgggtatg tgggcgtggt gaaattatcg cggaagagga agacatcccc 49180
taaggggcga cgcgtctcga atcagatatt gggctaagtg aacttcatct gccccggttt 49240
ctcccgcgtg agccgttcgt aacatggccg catcgcttga gcatagtggt ggatatgctt 49300
cttgaaaaag gccacacgga tcctgtgtca tttcggtaga accgggaggg cgtttaaatt 49360
gatattcagt gtcggtggag ctttgcgacg ccacagcctt tgcctccatt agaagccgat 49420
ccagtgtatt acaattagtg ttaacctccg ctggtgtaaa aaacttaaag catgggctgt 49480
agataggaga agcgcctgta aggttgtatg ttccgttgta tagcctgtca ccgtatgaat 49540
gtttttgtga tgcccatggg ttaagagttg ctcgatctgt ataagcaaca tccgattgtg 49600
tgtgatcaaa cattatagcc tctatgtcgg cgtcacggtc acccatatat aacattccag 49660
atgctcgacc tctaggattg catgcagttc taaaatattg taggtcagtg gacaccggca 49720
tggccacaaa ctcacacaca gaggcttgcc ctcgtgcaat tcctgccgat gtagcaggac 49780
gtagtccgcc gaagatgggt aggggttcgg tgggatttaa gcgacccccc gatgctgtta 49840
tacgacgcaa cgattcggta acgttatcat gtaacattgg cactcctcct cgtgaaaaaa 49900
aaagattttg ggcagtattg cccatatccg tgagagggca tcgcagggct gctgtggcac 49960
atacagctgt atacccgact cccaggtcca cgtgagctct gggttgggtt agggtaaagt 50020
ttaccccccc aatagcatca tgatggtgta cttggatttg tccgacaaag tacgattcag 50080
aagcacgctc ggcatataaa agttgctctg tggcaaagcg atcctggcgc actacggtaa 50140
aggcaatccc ggggtgaaat cccgttcgta gttgatgtgt aagagatatt ggtgtaaact 50200
taaaataccc tccaagaaga gaatatgtaa gagtattaaa atccgactta ctatgcgtta 50260
catggtaggc aacgggttgg cgtatagttg cgtggtggtt ggctcccaga aaaggaggga 50320
ttggtggtac catttttgcc aaaacccgcc tagcatttgt cattccacgc aatgatgcca 50380
aatgttccgg acatgcaaac agagggttga ccggaacggg ataaaaaaaa gtacccgttg 50440
caatggtttc atcgtacgcc tgatatgcca tcataataag gccatggtaa agcgctccgt 50500
catatattct catatttctg gttgttgccg tggcggctcc cgcatcgggg gcgcttgata 50560
ctaaaatggc cgtcgttcgt tcagccatat ctcccattaa ctcttgtagt gtgagcaatg 50620
catcaccatc aacggttagg tgtgcattat ggaagtaaac gttaagagag ttcggaacga 50680
gttggtgtgc gtgaagaggg tgtcttggat cttctggggt agttggggct tcttcatcag 50740
cgggaatttc cggaacgata actgcctgta acgcagggta taggcgatca taacgcacgc 50800
ccattgtaca acaggaaccg cgggaaaatg caggaatgac aatatagtag taaatcttgg 50860
agagaatacc ccattcacgg tcatggtgtg gagtaatggg aataccctga cccgtgtctc 50920
cccgaacggg tctcccgtgg attaacacat tatcgcgtcg ttggaagtta tgcccggcca 50980
tatccacaaa gtgaagggct tggtatccgt ttcgcccgct tacacgaatt gcggggagtc 51040
ggtctcgggc ggcttcatca cggtatatta acgcatcaca atcccataga ataggtgcaa 51100
taaacgtgtc atccgtaagg atgttattta gcgcttccga ggtctcgccg ttatggccct 51160
ctccttgtat tgtaaaatcg gttatagttt ggcgtaatgc tcttacatgc tgcagtaaat 51220
cccgatatat attaatacag acttcgggaa gctcaccgtt tccgagatat gtagttatgt 51280
acattaacat gtgaaagtta tttacaaatg ccaccctgtg ggattgctcc caatacccgc 51340
gaatacactg tgttaacagt cgcagtaacg cacagaagtt tctttcgttt ccatgaataa 51400
cagcctctag catgtagaaa atagttgggt atgcgcggtc ttcaaatgta tcctttacgg 51460
ctttaatggt tgccggggtc attgtatgtc ttcccaaacc gagttgggtt ccgcgacagt 51520
ctcgaaatga aatgggacat agaggcacgg gaatgtttcc gttgataatc cgtaatgttg 51580
cgttaacggt tggcatggct tccgggggac gctgcggtcc gggaaggtct acgtcccccg 51640
gtgcaacaaa aaaatcaaag gcggggttta attcaaacct tagacgcggg ttgctcggag 51700
ctataaactg tagaattgtt aaatgttcgt ttacccaatg agggtgatga ggcatcatat 51760
ctgcccacgt ttccataaac ctccccattt gaacatccaa tgtgtcctct gtaccctcgg 51820
ccacgtatac accaaaataa cactgacgat ctaaatgttg ttggcggagg gcaacaagtg 51880
tggcctcgac atcaagcaat gaactgtggc agatggtacc cattgcatca cgaagagtca 51940
actgtgttaa tatcccatct ttattataaa aaaaaatccc ttggggtgga aattgacgtg 52000
gatcctgttc ggatacagtt gaaaaatcgc cggcgtgtcg tgtatatctg tccatggagt 52060
ttgcctgaaa cactcccatt ggcatgataa acgtaatatc tatatttcca ataagagggt 52120
aggcaacgcg cgtagcttgg tagacccgcc gttctaatgc ttctaagaat acaagtttat 52180
ccccaacaat cactaaatcc gcaggaacac gggcgttaac cggaagccca gctgtcgttg 52240
aatcggagtt ggattgtggg ggtatagacg gtatgtttaa cgtgttaggg tcggttatat 52300
caaggagatg gcgggctaca tcatccattc cgcggacggc ctttcccatc acaagggcgg 52360
ttaccaagtt tgtcccctgt agaaccattt cgccatatgt tactggtacg tcagcggcgg 52420
tgtcgtcaat ttgtaaaatt ccctgtaata gttgccgttt taaggttgct gttgttacca 52480
acacaccgtc aacctgccgt ccgcgagtgt ttgtatgagt tattcgtgat accatcaccg 52540
atggttgggt gcacgaaacc atatccgaca gatacgcaga gatcagccta ggttccctgg 52600
cgtgtcgggt cataaaaaac atatccgcac agactctacg tttgaggtcc gaaagtaggg 52660
ccgcgcgtgc aacacgattt agatgtccct cgggttggaa tttattaatt ggtgaaagca 52720
gcgatagcgg tggggctttc tccaatagaa caccaagaag ttgatcggca gtgcctcgtt 52780
caaatgagtc caaaacggtg cgtaaattac gagccatctg ttggatagct cttatacgta 52840
acgatgagtc aatctctgtc ccatcgacat atgtgttact taacaaactc aacgcttccg 52900
atgcaattgc aaacgcagcg cttaacgaac gcttgtgtat ccgcttaacc atataattat 52960
gaacaggctg atcgacggga tgtgggccat cacgtgctat catgggttgt tgtacctcaa 53020
attgaataac gccatctcgc acgtaagcca gctccggaaa tttagtacag atgcaagcga 53080
cagacagtcc aagttctaga aaacgcacaa agtttaatgt attgcagtat gtccccaaaa 53140
gaatatcgaa ttgagccgag taaaggctgt tatcatctga tcgtatttgt ttaaaaaaat 53200
caaaaatgca acggtgtgca cacacctcta tggttgacag cacatttcca gttggaatga 53260
tccccgctgg gatgttaaat aacccagcaa tacgatcaga ttccgttgta gtaatcacgt 53320
tagcgggaca tgaaaccgtt gtcattttgt cagcagcgtt gcgacttgaa tccggacgtg 53380
taaacgtgtg cgggaacaag aaaactgcga gaagtttgat gtagagtgga acatatgtat 53440
gggtgtttcc aagtaacacg ggggcggtaa ttataaatac ccaacattat ttttagactg 53500
aacaattata gtacatttat tgaaatggac taaaacgaaa tagatgtttt taacataaca 53560
cggacccaca cccttggccc aataaaggat accgtgtcca ccaaacagtc catagcgtaa 53620
ataactgccg ccattaccag taacacggct ggacttggac cccacgacac aggaagtatt 53680
tttgaaacgc tatctggaaa aataacctca ctttgcaagt gtccggtaat caaaaagaac 53740
acccgggtgt ctataaaaag agtataatac gccgtaatac aaaatagcat tgttcgagtc 53800
acaacaataa ttccaagcag cagttgctgg gatgtagata agggtagcag gtctgatcgt 53860
atatttctgt agacacacaa taacataata ataagtcgaa agtgatgtag aatcgtcaga 53920
agtaacccgg ctgtgaataa gagcatacag cgtgtcgctt tatagtaaaa aaaaaagaag 53980
atacaacata ttggttgaat caacaccgtg gatataatat aaaacacgga gtgttgcgta 54040
aacgccgacc tgtttacggt agatgtgcgg gtacatgcag agcttaatag taagtccgta 54100
tcacccccat acgataacat aaccatggtg ttataaagcg gtgaattatc actggattta 54160
cacacttttg cgtcctcttc ggggtgctgg ttaaccataa cgctaagcaa gtcctttgtc 54220
tgttccgaga cgcgggcttg gggtgggttc ataacgttca gtcgcagttt atagtccaaa 54280
gtaaaaatgt tgacaaagtt aaagccgatc cttgtattat gtagaatata ccccgcctga 54340
cagactgtta taacaccctc tctaaatgtt tacgtatgtt tggagcatgt cacatagctg 54400
acgtgctaaa cggtaggagg tgtggttgat tggattgatt ggatcttttc atatgacgtt 54460
aatgatacca cggtgttttg aatgtttgcg tgtagttata tatttctcat attcggtgtg 54520
gacctaaggt tggcgatcac acgatcacgg gacggcaagt tcgaggttaa tgttccgata 54580
aacgctttaa t atg gaa ttt cca tat cat tca acc gta tct tat aac 54627
Met Glu Phe Pro Tyr His Ser Thr Val Ser Tyr Asn
3045 3050 3055
ggc gta acg ttt tat ttt aac gag cgt gct acc agg gcg tat ttt 54672
Gly Val Thr Phe Tyr Phe Asn Glu Arg Ala Thr Arg Ala Tyr Phe
3060 3065 3070
ata tgc ggg gga tgc tta att tcc ata ccc cgc aaa cat gga ggc 54717
Ile Cys Gly Gly Cys Leu Ile Ser Ile Pro Arg Lys His Gly Gly
3075 3080 3085
gag atc gca aaa ttt gga cat gtt gtt cgc ggg gtc gga ccc ggt 54762
Glu Ile Ala Lys Phe Gly His Val Val Arg Gly Val Gly Pro Gly
3090 3095 3100
gac aga tct gtt gcc agt tac gtt cga agt gag ctc aat cgc acc 54807
Asp Arg Ser Val Ala Ser Tyr Val Arg Ser Glu Leu Asn Arg Thr
3105 3110 3115
ggg aag aca tgg gcg gta tca tcc aat aat aac tgc gtg ttt ttg 54852
Gly Lys Thr Trp Ala Val Ser Ser Asn Asn Asn Cys Val Phe Leu
3120 3125 3130
gat cga gtg gcc tta ctt gca gcg gga tcg ggg gcg gtg gat cgc 54897
Asp Arg Val Ala Leu Leu Ala Ala Gly Ser Gly Ala Val Asp Arg
3135 3140 3145
gac ctt tgc gga aca ttt gat gtt gaa gtg gag gac cct acg ctc 54942
Asp Leu Cys Gly Thr Phe Asp Val Glu Val Glu Asp Pro Thr Leu
3150 3155 3160
gca gat tat ctc gtg tcc ctc ccc gtg acg cat tta aca ttg gtg 54987
Ala Asp Tyr Leu Val Ser Leu Pro Val Thr His Leu Thr Leu Val
3165 3170 3175
gcg ggg gta gat gtt acg cgc gag aat aag cta aaa ttg ttt cca 55032
Ala Gly Val Asp Val Thr Arg Glu Asn Lys Leu Lys Leu Phe Pro
3180 3185 3190
aca ccc acc gcc att aat acc aca aat ggc ttt atg tac gta cca 55077
Thr Pro Thr Ala Ile Asn Thr Thr Asn Gly Phe Met Tyr Val Pro
3195 3200 3205
aac gaa gcc agt ttt tca ttg gtg tat atg cgt atg ttg gag tta 55122
Asn Glu Ala Ser Phe Ser Leu Val Tyr Met Arg Met Leu Glu Leu
3210 3215 3220
cca gaa agt ttg cag gag cta gtg agt gga tta ttc gac ggg acg 55167
Pro Glu Ser Leu Gln Glu Leu Val Ser Gly Leu Phe Asp Gly Thr
3225 3230 3235
ccc gag ata cga gac gcg ctt aac gga agt aac gac gat gaa aaa 55212
Pro Glu Ile Arg Asp Ala Leu Asn Gly Ser Asn Asp Asp Glu Lys
3240 3245 3250
aca agt ata att gtt agt cgg cgc gct gct gat gtg gtt acg gaa 55257
Thr Ser Ile Ile Val Ser Arg Arg Ala Ala Asp Val Val Thr Glu
3255 3260 3265
gac gta aaa gca gat gat gtg ccg att tcg ggt gaa ccg tat tct 55302
Asp Val Lys Ala Asp Asp Val Pro Ile Ser Gly Glu Pro Tyr Ser
3270 3275 3280
gag aaa cag cct aga cgg cgt aag aag tcc gat cat att aca cta 55347
Glu Lys Gln Pro Arg Arg Arg Lys Lys Ser Asp His Ile Thr Leu
3285 3290 3295
agt aac ttt gta cag att agg acc atc ccc cgg gta atg gac att 55392
Ser Asn Phe Val Gln Ile Arg Thr Ile Pro Arg Val Met Asp Ile
3300 3305 3310
tgg gat cct cgc cat aaa gcc act act cat tgt atc cgc gcg tta 55437
Trp Asp Pro Arg His Lys Ala Thr Thr His Cys Ile Arg Ala Leu
3315 3320 3325
tca tgt gcg gtt ttt ttt gcg gac gag gtt ata ttt aaa gcc aga 55482
Ser Cys Ala Val Phe Phe Ala Asp Glu Val Ile Phe Lys Ala Arg
3330 3335 3340
aaa tgg cct gga ctt gaa gat gaa ctt aat gaa gcc cgt gag acg 55527
Lys Trp Pro Gly Leu Glu Asp Glu Leu Asn Glu Ala Arg Glu Thr
3345 3350 3355
ata tat act gca gtt gtt gcg gta tat ggc gaa cgg ggg gaa ctt 55572
Ile Tyr Thr Ala Val Val Ala Val Tyr Gly Glu Arg Gly Glu Leu
3360 3365 3370
cca ttt ttc ggg cat gct tac gga cgt gat ctg acg tcc tgt caa 55617
Pro Phe Phe Gly His Ala Tyr Gly Arg Asp Leu Thr Ser Cys Gln
3375 3380 3385
cgg ttc gtg att gtt caa tat ata ctg tct cgg tgg gaa gcg ttt 55662
Arg Phe Val Ile Val Gln Tyr Ile Leu Ser Arg Trp Glu Ala Phe
3390 3395 3400
aat tgt tat gcc gtt att gaa gat tta acg cgt agt tat gtt aac 55707
Asn Cys Tyr Ala Val Ile Glu Asp Leu Thr Arg Ser Tyr Val Asn
3405 3410 3415
gcg tta ccc agt gat gat gac acg gat caa gtt gct caa gat tta 55752
Ala Leu Pro Ser Asp Asp Asp Thr Asp Gln Val Ala Gln Asp Leu
3420 3425 3430
ata cgg acc att gtg gat aca gca aac agc ctc ttg agg gaa gtg 55797
Ile Arg Thr Ile Val Asp Thr Ala Asn Ser Leu Leu Arg Glu Val
3435 3440 3445
ggc ttt att ggc acg ttg gct gaa act ttg ttg ttc tta cca ctc 55842
Gly Phe Ile Gly Thr Leu Ala Glu Thr Leu Leu Phe Leu Pro Leu
3450 3455 3460
ccc cag ctt ccc tgt tac aag gaa acg tca cat ctt gca aaa aag 55887
Pro Gln Leu Pro Cys Tyr Lys Glu Thr Ser His Leu Ala Lys Lys
3465 3470 3475
gaa ggt gtg cga att tta cgc ctt gca aaa aca gga gtt ggt tta 55932
Glu Gly Val Arg Ile Leu Arg Leu Ala Lys Thr Gly Val Gly Leu
3480 3485 3490
tcg gat act gtt ccg gtt gat gtt tct gtc acg gaa agg cat gag 55977
Ser Asp Thr Val Pro Val Asp Val Ser Val Thr Glu Arg His Glu
3495 3500 3505
tat gag ata tcc cgg tac tta gat acc ctg tac tct gga gac ccc 56022
Tyr Glu Ile Ser Arg Tyr Leu Asp Thr Leu Tyr Ser Gly Asp Pro
3510 3515 3520
tgc tat aac ggc gct gtg cgt cta tgc cgt tta ttg gga tca tca 56067
Cys Tyr Asn Gly Ala Val Arg Leu Cys Arg Leu Leu Gly Ser Ser
3525 3530 3535
att ccc att gcc ctg tac tac aat aca ata tcg ggt aat gcc ttt 56112
Ile Pro Ile Ala Leu Tyr Tyr Asn Thr Ile Ser Gly Asn Ala Phe
3540 3545 3550
gaa ccg tat ttt gct ggg agg cgt tat ata gca tat tta ggc gct 56157
Glu Pro Tyr Phe Ala Gly Arg Arg Tyr Ile Ala Tyr Leu Gly Ala
3555 3560 3565
cta ttt ttt ggt aga gtg cac caa aca cca ttt ggg gac ggg aaa 56202
Leu Phe Phe Gly Arg Val His Gln Thr Pro Phe Gly Asp Gly Lys
3570 3575 3580
aaa acc caa agg tag tgtgtattat tcgcgaataa agtattgtag agaatacgtt 56257
Lys Thr Gln Arg
3585
tatagtgact ttttattata catgtttttt atgtcagagg tattttatta tattctcgaa 56317
ggcgggaatt tccacataac atccatccga taataccaaa ccccactacc gccagaaacg 56377
cccctcctaa agaggcccca aggtattgtc ccgacattcg tatggcttgt cgtcgttcga 56437
atcctagaag cgttaccaca gttccgtttg gaaacaacaa cacagcctta gagtcatccc 56497
cgtgcatgtc tggattgaag ggtggaattg tggagtttcc catagcggct agttgtcgtt 56557
ctgtatcttt gctgtcaata ataattatat ccataatcgc ccccgtggtt agatacctaa 56617
gaaaaacact tccgcaatac aaacattctt ttaaattgtc cggatggggc agtgcgaccg 56677
tctctatgac accatgttca gacacgcaag tatccctgct taaataaaca acggatatgg 56737
ggttatatac atccacatct gccagggaat ataccaaacc cctttgaggt ttgtttcgtg 56797
taatcacata actgtgtccc tggacagctg gaagaatgag cacatattct ccgttacgcg 56857
cggctgcatc ttgacacgta aatacttctg gtacatgggt gtggagtata tgcaaatctt 56917
tggcagtcca tgtggtaaaa atcatcatgg ttttaaatgc cgcgtcgaat atttcagatt 56977
cgtctagttg ggtatgcaat acctcgttaa gtcctgggcg tgttggtatt gccgacagga 57037
gattcataac atgaatctct tccgtgaggt ctgtacgaag ggaacccata caaggactgt 57097
atacgtttgg tattgtaaac atgtgttttg aaggatttaa gtatgccagg ccttcttgta 57157
tgttaagtgc tgcttgcgtg gcgtgagctg ccgtacacat ggatgtcatt aaaagcaaag 57217
ttgtgcgacc gtctaatacc cgagatgaat tttctagtcc ttcgcggaaa tttaataaaa 57277
tcattgaggc aaaaaataaa gcctgcctct ctgtagcatt taaattttgc ttcacaagca 57337
gccccttgta tatgctcatt ataacacgcc tggcgagaga aagtgcattc acatggtctt 57397
gaccacggga aagcgcgtat gcggtcttta gttgatcacg ggcttcatca tatgaaataa 57457
aataatcgac atttgcgggt tttacctgac caaaaagaag tgaaagttcg tcatgaagct 57517
gcgatggatc ggcaaagatc agatccgagg ttccggatat agtgttggga tgagctcctg 57577
ctcctatccg cattaaaatg taattaagga attttaattg atactgaaca tctacgatct 57637
catctaacag aaaatattct gtggtacggc ccatttccga caacgcaaaa attgatgtgg 57697
ctattcttgc ggcgatatgg taatagcttt cctcgtttat gtcatgttga tccgccttcg 57757
ttgtcatccg cagagcctcc gtgtgggcca tggataaatg atagcggtaa tccaaactct 57817
cttcgggata tgcgcatata tcgaccgtag catgctttga aaaattcata tccaacccac 57877
gtccaagtaa ataaactcga taagatggcc ccggatccgg acccggtgga tttagctgga 57937
accatgtggt gtcgcttgta accgcgttca gttttactgt atgtggtaca acaattaatt 57997
ctatcggtgg tccagaggat aaagaaatga gcgagctcat aaatcctaca ccaatattta 58057
cttccacacg acccgagtcg cttgtgagaa gcaccgaacc ggtggcccag tatcgaattg 58117
gccataccga cccaaaaagg ggaacgtgta ttctcaacgt tgagttggtg ataattgctt 58177
cggcagaaaa aaaagtatgt cgggcaaggg catcccatgt agcaaaatgc gctttatgtt 58237
ccagaatagt attcttgggg actgttgggc gagcgggtaa aagacttacc ccaaacggcc 58297
tttctgcagt tgctgcggtc tcagctcttt ccaaatgcca tacaaggggg tttggtggga 58357
acgtagtaaa cgcaacaaga tgttgcgtag ttaaatacgg cgcaaacgaa ctgtcaaaga 58417
agctcacgtc gggtgtcaca gggtgactaa gaaatccaaa acgtccagca ggaaacggtg 58477
ctcgatgttc tgctttgaaa tggtatggcg ataacaagta ttttggaaat ataaccagcc 58537
cgacgtctcc ttcgtgtgtt gtagggttaa ccttaacaat aaccaagaaa acgtgttttc 58597
tatcatttcc ccagtgaagt gatttaatga gttcttcatc gaaaccagta ggataaaagg 58657
cttctaattt cagagacata ttacggtcgg aatattctcg tagaagagca gacatatgtc 58717
cgatagagcg agtcgcaggg gttggtgtta cgtaagattt attagccgtg gtccaaagag 58777
gaagaattac caccgctaaa actagcgcaa acatagtcgc tgttacttgc gctacaatat 58837
caccgcaacg ttatgcacac agcgtgcctt tacccctcct cacccaccaa caggcttggc 58897
ggcttttaat attaccactc cgttttaggg agataaaggc gggattaaag gatgtggtaa 58957
taaatgacgc atatataaaa aagaaaaaca cgtacacgcg agtatgacaa tgtgtatcat 59017
ctttttactg gtacatacgt aaatactagg tatatttagg aagtgttgtc ctgaacggca 59077
ttaacaagct ctttcaatat attccatgca ccggaggaca tgtttgccgg ggtcatctgg 59137
ggtagcagag tttttagttc ttgtgccgca tgctggggtg tctgttctaa cgataatacg 59197
aacggagaca tgcttcgtga gcagtttgaa agggtctcca ttccccatgc ccataacggc 59257
agaatatttc caaactctcc gcaaagctcc ggaagtttaa gcacggcgaa taacgtgtct 59317
tcaatcccag gtacttcgcg tagttttata cactcggatt tttgtaattt ctgtttaaat 59377
acatcattac aaaatgacag tgtgttccag cccgcgtgcc agttgttagt tttaagaaat 59437
ataattgtat taataagcat tatatataca tttctcagaa ccataacaaa cggcagatta 59497
accgtttctc ccggtctggc ccgtttgctt actctggata aatgactggg gagtgaaacg 59557
gtacaaacta ccaagttggt cccggggggt tcagcgggaa gcgtaaacaa taacccagga 59617
agcgccgctg gggacatatc tcccactaag tatctggaca agggaaaaca tatagttgag 59677
gcgattgggt gtcggtcgga taacataatt ttatacggct ccttatttac ttgtacgaga 59737
tccgatgtac ttgtgtccat caatgccgag attttcgcat gcataattgc atgcggagaa 59797
cagaacaggc tctgaaaatg agccgtgagg cgttgtgcgt cttcaggcga aacgtctcca 59857
ttaagacggc gagtttgtgt tccgtaaatt ccgcaaatgg cgtcctcccc tgcaaggtta 59917
cgccaatacg acaggggctc cccaatgagt aagatccggt ttggtgttat tgcaaagtgg 59977
tgtaaaaatt cttcggcggc ggttgttttt ccaattccat acgccccgtc caaataaata 60037
cgcaaaacgc ccatttttac atcggtttta tccgttgaca tgtttattgt agacaaacgc 60097
gtcttaggtt atcttctggg acggaacttc aaat atg tcc gct agt cga att 60149
Met Ser Ala Ser Arg Ile
3590
cgg gcc aag tgt ttt cgt ttg gga caa cgt tgc cac act cgt ttt 60194
Arg Ala Lys Cys Phe Arg Leu Gly Gln Arg Cys His Thr Arg Phe
3595 3600 3605
tac gat gta ctc aaa aag gat att gat aac gta cgt cga ggt ttc 60239
Tyr Asp Val Leu Lys Lys Asp Ile Asp Asn Val Arg Arg Gly Phe
3610 3615 3620
gcg gac gcg ttc aac ccg agg ctg gca aaa ctc ctg tcg ccg tta 60284
Ala Asp Ala Phe Asn Pro Arg Leu Ala Lys Leu Leu Ser Pro Leu
3625 3630 3635
tcc cac gtg gat gtt caa agg gct gta cgc ata tca atg tcg ttt 60329
Ser His Val Asp Val Gln Arg Ala Val Arg Ile Ser Met Ser Phe
3640 3645 3650
gaa gta aat ttg gga cgc cga cgc ccc gat tgt gtt tgt att ata 60374
Glu Val Asn Leu Gly Arg Arg Arg Pro Asp Cys Val Cys Ile Ile
3655 3660 3665
caa acg gaa tcc agt ggt gcc gga aag acc gtt tgt ttt ata gtg 60419
Gln Thr Glu Ser Ser Gly Ala Gly Lys Thr Val Cys Phe Ile Val
3670 3675 3680
gaa tta aaa tct tgc cgt ttt agc gct aat ata cat acc cct act 60464
Glu Leu Lys Ser Cys Arg Phe Ser Ala Asn Ile His Thr Pro Thr
3685 3690 3695
aag tat cac cag ttt tgc gag ggt atg cgc cag ctg agg gat acc 60509
Lys Tyr His Gln Phe Cys Glu Gly Met Arg Gln Leu Arg Asp Thr
3700 3705 3710
atg gct tta ata aag gaa acc aca ccc acg gga tct gat gaa ata 60554
Met Ala Leu Ile Lys Glu Thr Thr Pro Thr Gly Ser Asp Glu Ile
3715 3720 3725
atg gtg acc ccc ctc ctt gtg ttt gta tct caa cgg ggt ctg aac 60599
Met Val Thr Pro Leu Leu Val Phe Val Ser Gln Arg Gly Leu Asn
3730 3735 3740
ctg tta cag gta act cgg tta ccc cca aag gtg att cat gga aac 60644
Leu Leu Gln Val Thr Arg Leu Pro Pro Lys Val Ile His Gly Asn
3745 3750 3755
ctt gtt atg cta gcg tcg cat ttg gag aat gta gcg gaa tat acc 60689
Leu Val Met Leu Ala Ser His Leu Glu Asn Val Ala Glu Tyr Thr
3760 3765 3770
ccc ccg ata agg tcc gtt aga gag cga aga cgt cta tgc aaa aag 60734
Pro Pro Ile Arg Ser Val Arg Glu Arg Arg Arg Leu Cys Lys Lys
3775 3780 3785
aaa att cac gta tgt tct ctt gcg aaa aag cgt gcg aaa tca tgc 60779
Lys Ile His Val Cys Ser Leu Ala Lys Lys Arg Ala Lys Ser Cys
3790 3795 3800
cat cgt tcc gct tta aca aag ttt gaa gaa aat gca gct tgt ggg 60824
His Arg Ser Ala Leu Thr Lys Phe Glu Glu Asn Ala Ala Cys Gly
3805 3810 3815
gtg gat tta ccc ctt aga agg cct tct tta ggt gct tgt ggt gga 60869
Val Asp Leu Pro Leu Arg Arg Pro Ser Leu Gly Ala Cys Gly Gly
3820 3825 3830
att tta caa agt ata acc ggg atg ttt tcc cat ggg taa gaaaacagct 60918
Ile Leu Gln Ser Ile Thr Gly Met Phe Ser His Gly
3835 3840
tttaaagcag taccggttta tattcacgcc agttgacttt gtttgctgca gacacc atg 60977
Met
acg gcg aga tat ggg ttc gga tct atc tcg ttt ccg aat aaa tgt 61022
Thr Ala Arg Tyr Gly Phe Gly Ser Ile Ser Phe Pro Asn Lys Cys
3845 3850 3855
ggg ata ttt ttg tct acc act aag aac ttt ata gcc ccc aac ttc 61067
Gly Ile Phe Leu Ser Thr Thr Lys Asn Phe Ile Ala Pro Asn Phe
3860 3865 3870
ccc ata cac tac tgg acg gct ccc gcg ttt gag tta aga ggg cgt 61112
Pro Ile His Tyr Trp Thr Ala Pro Ala Phe Glu Leu Arg Gly Arg
3875 3880 3885
atg aat ccc gat ttg gaa aaa aat acg tta acg tta aaa aat gcg 61157
Met Asn Pro Asp Leu Glu Lys Asn Thr Leu Thr Leu Lys Asn Ala
3890 3895 3900
gcg gcc gtt gcc gca tta gac aac ctt cgc ggg gaa acg att acg 61202
Ala Ala Val Ala Ala Leu Asp Asn Leu Arg Gly Glu Thr Ile Thr
3905 3910 3915
tta cca acg gaa ata gat cgt cgt tta aag ccc ctc gaa gaa caa 61247
Leu Pro Thr Glu Ile Asp Arg Arg Leu Lys Pro Leu Glu Glu Gln
3920 3925 3930
cta acg cgc atg gcc aag gtt ttg gat tcc ctg gag acg gct gct 61292
Leu Thr Arg Met Ala Lys Val Leu Asp Ser Leu Glu Thr Ala Ala
3935 3940 3945
gcc gag gcg gaa gaa gca gat gca caa tct gag gaa tgt aca cgt 61337
Ala Glu Ala Glu Glu Ala Asp Ala Gln Ser Glu Glu Cys Thr Arg
3950 3955 3960
aca gaa ata ata cgc aat gag tct ata cac ccc gag gta cag att 61382
Thr Glu Ile Ile Arg Asn Glu Ser Ile His Pro Glu Val Gln Ile
3965 3970 3975
gcc aaa aat gat gca ccg ttg cag tac gat aca aac ttt caa gtg 61427
Ala Lys Asn Asp Ala Pro Leu Gln Tyr Asp Thr Asn Phe Gln Val
3980 3985 3990
gat ttt atc acc ctg gtg tac ttg gga agg gca agg ggc aat aac 61472
Asp Phe Ile Thr Leu Val Tyr Leu Gly Arg Ala Arg Gly Asn Asn
3995 4000 4005
tct cca ggg att gtt ttc ggg cca tgg tat cgt act ctg cag gaa 61517
Ser Pro Gly Ile Val Phe Gly Pro Trp Tyr Arg Thr Leu Gln Glu
4010 4015 4020
cgg ctt gtg tta gat agg ccc gta gct gca cgc gga gtt gat tgt 61562
Arg Leu Val Leu Asp Arg Pro Val Ala Ala Arg Gly Val Asp Cys
4025 4030 4035
aaa gac ggg cgc att tcc cgt acg ttt atg aac aca acg gta aca 61607
Lys Asp Gly Arg Ile Ser Arg Thr Phe Met Asn Thr Thr Val Thr
4040 4045 4050
tgt cta cag tcc gcc ggg aga atg tat gtt gga gat aga gcg tac 61652
Cys Leu Gln Ser Ala Gly Arg Met Tyr Val Gly Asp Arg Ala Tyr
4055 4060 4065
tcc gcg ttc gag tgt gcg gta tta tgt tta tat tta atg tat agg 61697
Ser Ala Phe Glu Cys Ala Val Leu Cys Leu Tyr Leu Met Tyr Arg
4070 4075 4080
aca tct aat agt gtc cac gaa cct caa gtt tcg tcc ttc ggg aac 61742
Thr Ser Asn Ser Val His Glu Pro Gln Val Ser Ser Phe Gly Asn
4085 4090 4095
ctt ata gag cac cta ccg gaa tat act gag aca ttt gtg aat tat 61787
Leu Ile Glu His Leu Pro Glu Tyr Thr Glu Thr Phe Val Asn Tyr
4100 4105 4110
atg aca aca cac gaa aat aaa aac agt tat caa ttt tgc tat gat 61832
Met Thr Thr His Glu Asn Lys Asn Ser Tyr Gln Phe Cys Tyr Asp
4115 4120 4125
cgt cta cca cgc gac cag ttt cat gct cgt ggg ggg cgg tat gat 61877
Arg Leu Pro Arg Asp Gln Phe His Ala Arg Gly Gly Arg Tyr Asp
4130 4135 4140
caa ggc gcc tta acg tca cat tct gtt atg gat gcg ctt ata cgg 61922
Gln Gly Ala Leu Thr Ser His Ser Val Met Asp Ala Leu Ile Arg
4145 4150 4155
ttg cag gtc cta ccg cct gca cct gga cag ttt aat cct ggg gtt 61967
Leu Gln Val Leu Pro Pro Ala Pro Gly Gln Phe Asn Pro Gly Val
4160 4165 4170
aat gac att att gat cgc aat cat acc gca tat gtg gac aag att 62012
Asn Asp Ile Ile Asp Arg Asn His Thr Ala Tyr Val Asp Lys Ile
4175 4180 4185
caa cag gcg gcc gcg gcg tat tta gaa cgg gcc caa aac gtg ttt 62057
Gln Gln Ala Ala Ala Ala Tyr Leu Glu Arg Ala Gln Asn Val Phe
4190 4195 4200
ctt atg gaa gac caa act cta tta agg tta aca att gac acg att 62102
Leu Met Glu Asp Gln Thr Leu Leu Arg Leu Thr Ile Asp Thr Ile
4205 4210 4215
acg gcg tta tta tta tta agg cgc tta tta tgg aac ggg aac gta 62147
Thr Ala Leu Leu Leu Leu Arg Arg Leu Leu Trp Asn Gly Asn Val
4220 4225 4230
tac gga gat aaa cta aaa aat aat ttt caa ctg ggt ttg att gtg 62192
Tyr Gly Asp Lys Leu Lys Asn Asn Phe Gln Leu Gly Leu Ile Val
4235 4240 4245
tca gaa gca aca gga acc cct acc aac aat gta atc ttg cgc gga 62237
Ser Glu Ala Thr Gly Thr Pro Thr Asn Asn Val Ile Leu Arg Gly
4250 4255 4260
gcg acg ggg ttt gat ggg aag ttt aaa agc ggt aat aat aac ttt 62282
Ala Thr Gly Phe Asp Gly Lys Phe Lys Ser Gly Asn Asn Asn Phe
4265 4270 4275
caa ttt tta tgt gaa cga tat ata gca cca ctg tat acg tta aat 62327
Gln Phe Leu Cys Glu Arg Tyr Ile Ala Pro Leu Tyr Thr Leu Asn
4280 4285 4290
cgg acc aca gag ctg act gaa atg ttt cct gga tta gtt gct ctt 62372
Arg Thr Thr Glu Leu Thr Glu Met Phe Pro Gly Leu Val Ala Leu
4295 4300 4305
tgt tta gac gcg cat acc cag ctt agt cgt gga agt tta gga aga 62417
Cys Leu Asp Ala His Thr Gln Leu Ser Arg Gly Ser Leu Gly Arg
4310 4315 4320
acc gta ata gat att tct tct gga cag tac caa gat cgg ctc ata 62462
Thr Val Ile Asp Ile Ser Ser Gly Gln Tyr Gln Asp Arg Leu Ile
4325 4330 4335
agc tta att gca ttg gaa tta gaa cac cgc cga caa aat gtt aca 62507
Ser Leu Ile Ala Leu Glu Leu Glu His Arg Arg Gln Asn Val Thr
4340 4345 4350
tcc cta ccc ata gcc gcc gtg gta tca ata cat gac agt gtt atg 62552
Ser Leu Pro Ile Ala Ala Val Val Ser Ile His Asp Ser Val Met
4355 4360 4365
ttg caa tat gaa cgg gga ctt gga atg tta atg cac caa ccg cgt 62597
Leu Gln Tyr Glu Arg Gly Leu Gly Met Leu Met His Gln Pro Arg
4370 4375 4380
gta agg gcg gca ttg gaa gaa agt cgc cgc ctt gcg cag ttc aac 62642
Val Arg Ala Ala Leu Glu Glu Ser Arg Arg Leu Ala Gln Phe Asn
4385 4390 4395
gtt aac agt gac tat gat ctt cta tat ttt gtc tgt ttg ggt gtc 62687
Val Asn Ser Asp Tyr Asp Leu Leu Tyr Phe Val Cys Leu Gly Val
4400 4405 4410
att cct cag ttt gcc tcc aca ccg tga gtattcacta tccggtccgt 62734
Ile Pro Gln Phe Ala Ser Thr Pro
4415 4420
ggggtgttta ta atg gct gct gaa gct gac gaa gag aac tgt gaa gcg 62782
Met Ala Ala Glu Ala Asp Glu Glu Asn Cys Glu Ala
4425 4430
tta tac gtg gct ggg tat tta gcc tta tat tca aag gac gaa ggg 62827
Leu Tyr Val Ala Gly Tyr Leu Ala Leu Tyr Ser Lys Asp Glu Gly
4435 4440 4445
gaa tta aat att acc cca gag att gtg cgg tcc gct ttg ccg cct 62872
Glu Leu Asn Ile Thr Pro Glu Ile Val Arg Ser Ala Leu Pro Pro
4450 4455 4460
act agt aaa ata cca ata aac atc gat cat cga aaa gac tgt gtc 62917
Thr Ser Lys Ile Pro Ile Asn Ile Asp His Arg Lys Asp Cys Val
4465 4470 4475
gtg ggt gaa gta atc gca atc att gag gac ata cgc gga cct ttt 62962
Val Gly Glu Val Ile Ala Ile Ile Glu Asp Ile Arg Gly Pro Phe
4480 4485 4490
ttt ttg ggt atc gtt aga tgc cct caa cta cat gcg gtg ctg ttt 63007
Phe Leu Gly Ile Val Arg Cys Pro Gln Leu His Ala Val Leu Phe
4495 4500 4505
gaa gcg gcc cat tcg aat ttt ttt gga aat aga gat tct gta tta 63052
Glu Ala Ala His Ser Asn Phe Phe Gly Asn Arg Asp Ser Val Leu
4510 4515 4520
tct ccg cta gaa cgt gcg tta tac ttg gtc aca aat tac tta ccc 63097
Ser Pro Leu Glu Arg Ala Leu Tyr Leu Val Thr Asn Tyr Leu Pro
4525 4530 4535
tcc gta tcc ctg tct tca aaa cga ttg tcc ccg aat gag ata cca 63142
Ser Val Ser Leu Ser Ser Lys Arg Leu Ser Pro Asn Glu Ile Pro
4540 4545 4550
gac ggt aat ttt ttt acc cat gtt gcg tta tgt gtt gtt gga aga 63187
Asp Gly Asn Phe Phe Thr His Val Ala Leu Cys Val Val Gly Arg
4555 4560 4565
cgc gtt gga aca gtg gtc aat tat gac tgt acc ccg gaa tct tca 63232
Arg Val Gly Thr Val Val Asn Tyr Asp Cys Thr Pro Glu Ser Ser
4570 4575 4580
att gaa cca ttc cgg gtt tta tcg atg gaa agt aaa gcg cgg tta 63277
Ile Glu Pro Phe Arg Val Leu Ser Met Glu Ser Lys Ala Arg Leu
4585 4590 4595
ttg tcg ctg gtt aaa gac tac gcg ggt tta aat aaa gta tgg aag 63322
Leu Ser Leu Val Lys Asp Tyr Ala Gly Leu Asn Lys Val Trp Lys
4600 4605 4610
gtt agc gaa gat aaa ctc gcc aag gtg tta tta tcc aca gcc gtg 63367
Val Ser Glu Asp Lys Leu Ala Lys Val Leu Leu Ser Thr Ala Val
4615 4620 4625
aac aat atg ctt tta aga gat aga tgg gac gtg gtt gca aaa cgt 63412
Asn Asn Met Leu Leu Arg Asp Arg Trp Asp Val Val Ala Lys Arg
4630 4635 4640
aga cgc gaa gcc gga att atg ggt cac gtt tat ctt cag gct agc 63457
Arg Arg Glu Ala Gly Ile Met Gly His Val Tyr Leu Gln Ala Ser
4645 4650 4655
acc gga tat gga ctt gct cgg ata acc aac gtt aat ggg gtg gag 63502
Thr Gly Tyr Gly Leu Ala Arg Ile Thr Asn Val Asn Gly Val Glu
4660 4665 4670
tct aaa tta ccc aac gcg ggt gtt ata aac gcc aca ttc cac ccc 63547
Ser Lys Leu Pro Asn Ala Gly Val Ile Asn Ala Thr Phe His Pro
4675 4680 4685
ggc ggg ccc ata tac gat ctc gcg ttg ggt gtt ggg gaa tca aat 63592
Gly Gly Pro Ile Tyr Asp Leu Ala Leu Gly Val Gly Glu Ser Asn
4690 4695 4700
gaa gat tgt gaa aag act gtt ccg cat tta aag gtt acg cag ttg 63637
Glu Asp Cys Glu Lys Thr Val Pro His Leu Lys Val Thr Gln Leu
4705 4710 4715
tgt agg aac gac agc gat atg gct tct gta gca ggt aac gct agt 63682
Cys Arg Asn Asp Ser Asp Met Ala Ser Val Ala Gly Asn Ala Ser
4720 4725 4730
aat atc tca cca cag ccc ccg tcg ggc gtt cca acc gga ggg gaa 63727
Asn Ile Ser Pro Gln Pro Pro Ser Gly Val Pro Thr Gly Gly Glu
4735 4740 4745
ttt gta ctg ata cct acc gcg tat tat tca cag ctg tta acc ggg 63772
Phe Val Leu Ile Pro Thr Ala Tyr Tyr Ser Gln Leu Leu Thr Gly
4750 4755 4760
cag act aaa aat ccg cag gta tca att gga gct cca aat aac gga 63817
Gln Thr Lys Asn Pro Gln Val Ser Ile Gly Ala Pro Asn Asn Gly
4765 4770 4775
cag tat atc gtc ggg cca tat gga tct cca cac ccg cct gcc ttc 63862
Gln Tyr Ile Val Gly Pro Tyr Gly Ser Pro His Pro Pro Ala Phe
4780 4785 4790
cca cct aat aca ggg ggt tat ggt tgc cct ccg gga cac ttc ggg 63907
Pro Pro Asn Thr Gly Gly Tyr Gly Cys Pro Pro Gly His Phe Gly
4795 4800 4805
gga ccg tac ggg ttt ccg gga tat cca cca ccc aat cgt ttg gaa 63952
Gly Pro Tyr Gly Phe Pro Gly Tyr Pro Pro Pro Asn Arg Leu Glu
4810 4815 4820
atg caa atg tcc gca ttt atg aac gca ttg gcc gcc gaa cgg ggt 63997
Met Gln Met Ser Ala Phe Met Asn Ala Leu Ala Ala Glu Arg Gly
4825 4830 4835
att gac ttg cag acc ccg tgt gta aat ttt cca gac aaa acc gat 64042
Ile Asp Leu Gln Thr Pro Cys Val Asn Phe Pro Asp Lys Thr Asp
4840 4845 4850
gtc cgt cgt cca gga aaa cgg gat ttc aag agc atg gat caa agg 64087
Val Arg Arg Pro Gly Lys Arg Asp Phe Lys Ser Met Asp Gln Arg
4855 4860 4865
gaa ttg gat tct ttt tat agt ggg gag tct caa atg gac gga gag 64132
Glu Leu Asp Ser Phe Tyr Ser Gly Glu Ser Gln Met Asp Gly Glu
4870 4875 4880
ttt ccc tca aat ata tat ttt ccc ggt gaa cca acg tat ata acg 64177
Phe Pro Ser Asn Ile Tyr Phe Pro Gly Glu Pro Thr Tyr Ile Thr
4885 4890 4895
cat cgg aga cgt cga gtt tct cca tca tat tgg cag agg aga cac 64222
His Arg Arg Arg Arg Val Ser Pro Ser Tyr Trp Gln Arg Arg His
4900 4905 4910
aga gtt tct aat ggt cag cac gaa gag ctt gct ggg gtt gtg gca 64267
Arg Val Ser Asn Gly Gln His Glu Glu Leu Ala Gly Val Val Ala
4915 4920 4925
aaa ctg caa cag gag gtt aca gag cta aaa tca caa aat ggg aca 64312
Lys Leu Gln Gln Glu Val Thr Glu Leu Lys Ser Gln Asn Gly Thr
4930 4935 4940
caa atg cct ttg tcg cac cat aca aat ata cca gag ggg aca cgg 64357
Gln Met Pro Leu Ser His His Thr Asn Ile Pro Glu Gly Thr Arg
4945 4950 4955
gat cct cga ata tcg att tta tta aaa cag ctt caa agc gtt tcg 64402
Asp Pro Arg Ile Ser Ile Leu Leu Lys Gln Leu Gln Ser Val Ser
4960 4965 4970
ggt cta tgc tca tcc caa aat aca aca agc acc cca cat aca gat 64447
Gly Leu Cys Ser Ser Gln Asn Thr Thr Ser Thr Pro His Thr Asp
4975 4980 4985
aca gtt gga caa gat gta aat gca gtg gag gcg agt tcc aag gcc 64492
Thr Val Gly Gln Asp Val Asn Ala Val Glu Ala Ser Ser Lys Ala
4990 4995 5000
cct tta ata cag ggg tcc acg gca gac gac gcc gat atg ttt gca 64537
Pro Leu Ile Gln Gly Ser Thr Ala Asp Asp Ala Asp Met Phe Ala
5005 5010 5015
aat cag atg atg gtg ggg cgg tgt taa ccaacaaata aaagtattac 64584
Asn Gln Met Met Val Gly Arg Cys
5020 5025
attcataaaa ggcgtgtttg gttttttttt ttttgttaag cggtgtcgtg ttaaacaaac 64644
agaaggcgtt tttatgggtg gtaaggtttt atttaagtta aatttaatcg gtgtcagaat 64704
cttcatcccc tgatgatggt tgttctcggt ctggaataat atatggcgcc atccattttt 64764
cctctggggt atccggaaac acacgcggga tgcaaaccca tcgacccgtt ataatctcca 64824
gtgccagtga aatatgacgg ccccctggac atgtgtttaa gtagtcgtat tggctgtcgc 64884
ggttaaggcg aaaaggtttc gttgcacatg ctcgacaggt tagctgtccc acagaacata 64944
cgtctcgggg tgtaccgtcg cggggtaatt gtaccggcgt gtgttctaac ctatcttcat 65004
aaatcgcggg gggagaatct tttactgttt cattgagact tgaagcgcac tgtttggacg 65064
gatgatgtgc gatagacgac ggttgttgta gcgcgttaat gttagacgat tccataataa 65124
ccccgttaag atatcgggtg cggtagaggt acaaaacgcg ttttgtatat tatcgggtat 65184
aaataccccc tgtgatgcgt aatggagaca catgagtaac gtaatacaca tttttattaa 65244
taaattaaaa caaaccccct ggctatttac acccccgtta cattctcggt gcgaacacgg 65304
gagtatcctc ggcgatttcg taaagcaagg ccggtaagac gtgaagttaa aagggcgcta 65364
gtcttatttt ttttgcgggc tttagattct tggcgctcag ccgcagatac taacgtcata 65424
tatttaatca tttcctgggc ttctcgaaat ttatcgggat caaacccgct atttaccgac 65484
ggttcagtgt tttgtgagtc gccaatttct tctattgggg tatcagtagc gttgggtttc 65544
tcggcaaagg gatccattcc ttccggtaac tgttttaacc ccttggttgt gagtggatat 65604
aatgccttca tcgggcttgt tttaagttta agcacgtacc ggtacgcaaa aaaggccgct 65664
accagtcccg ccaaaaccaa taatcccacg gccaatgccc caaatgggtt agataaaaac 65724
gtggtaaatc cgtgtacggt ggaaagcagc gctcccgtgg ccccaagaac cacatgtcca 65784
acggcctggc ccgcggtccc aagtccctgg aaaaactgag ccatgccctg cataatggcc 65844
gttccgctat catattgcac aaccttgtct atgtcataaa aacgcagcga atgcatttga 65904
tttcggcgtt gaatttcact gtagtctagt aatcctgtat cccgcagctc gtctcttgta 65964
tatacttgca gcggcataaa ctctctatct ttaagaagtg ttaagtttaa atctacgtaa 66024
gtgctaatca ttcccacatc atggactgcg atttcacgga cgtaacgata atcctcataa 66084
tatacgtagt gatgcccaaa tagaaaatat cgcttgtgat tagccacgca tggttctaac 66144
agatctctgg acataattaa ctcgttatct gttccaagct ggccctccac cgtcccggac 66204
ccatttaaac taactattga aattaaagga cggctataac aacgcgtagt actaccagat 66264
accctcatag agttttgaag tataatgcgt gtatctgatc ccagttctgg acaattagaa 66324
acggagataa cgtcgccgag aatacgagct ttaacacgtt gatccaaaat ggtgctcgct 66384
aaagcacttg ggttaattgg aaatagtccg ctccaaaggg cgcgttcgcg attttgtagc 66444
tggcaccacg acgaggagat acgtgccaac atttcattaa catgctcttg aatgtggtca 66504
tatgtaaact ggagcatagc aaattccacc gatgaggtgg ttgttattgt tctattggca 66564
cgcaactcaa ctggcacgct tcgtcgggat ctggtatttc gagtcgggtg tttttgtggt 66624
gaatgattag tgttttcacg gaccaattct tggagataga gacgggcgag ggaattgctc 66684
agcaggggtt gaaacaccac aacaaacccc cctctggcaa ggtaggtctg gatatccccg 66744
gttctaacat gagatgagtt gtatctggtt gtatagatcc ggttaataat agcccgggct 66804
tcctccttta cacattgact gagatggatt tggttaagat taaactcgtt tgtttcactt 66864
ataaacgtgg tagaaagtgt tttcattgta aagcgaaaat tgtgtgcata ctcatcgcga 66924
actacgtctt caacctcacg ccacttgaca agcgaacaaa cttccgttcg ttttggcttc 66984
cagttccaac caaccgttaa atgaggcgtg actaaaaagt tccgcgctgc aggttccagt 67044
aatgctctag tgtcaagatc cctttgtcta taaccctcaa actggtgaaa acgatccatt 67104
gcataattgg aatgttctct gtatgcacca tcccgtaggc caaaaaacgg ggacatgtat 67164
attatatctc ccgtggaaag tccaaaacta tcataaggga atattgatct ggcttcaact 67224
tcctcaatga tgcaattcac cgacgtgccc gtcctatatg ttccgggggt tccggcaacc 67284
atgtacgtgt cattggtagt atgccatgct ttggatccca cagaattata ttttgatgcg 67344
attagaggca tatcctgtgg atttttatcc tcattaaagg cttcaacttt gtggttattt 67404
cgtacgtacg ttgctttaga agaacacttg ccaaacttat caatggtgtc cgtgatctct 67464
gaaacgggaa ttggtaccct atccgcatat ctattagtaa tttgcgtata agaacttccg 67524
gcccacgccg tgctaacgat aacatctttg taatataccg tcgccttaaa cttgtacgct 67584
gcaatgtttt ctttataaac aacagcaata ccctctgtaa agtttttacc aaggtgataa 67644
tccggacatg tccgagttgg ttctaatcgt acgattgtgg agcctgttgg cggtgggcag 67704
acgtaaaacg tgggttttgt ttcggcgtcc tgggacttgt gtatagcttc tctgatttca 67764
tcaccatcgc ccagatgagc agaccgggtt atatcttctg attgtgtggg ctctacttgt 67824
aaactctcat aaaacgagct tggagagacc gacacaaccg ccgtaacaaa catagaaaat 67884
atgcataaaa agcataacca cacccccgta acggaggtta tgaaaacgcc gggtccgttg 67944
aatccggagc cagccgctgc attagggtgt atagaagaga aaaaacgtct gaatcgtaga 68004
ttacgacggt attctggtcg atccctgttt ctccactttg aataatagcc acaaggggac 68064
atgtttcttc gtacgttaaa taaatgccgt ctaagggtcc gtgggaactg cctatacctt 68124
taggttgaga cgtgcacccg cgtggatcct tacctagacg gtcaacgcga cataaccgca 68184
cctccccaca atggaaaaca gaggtgaata gtgtggttgc aaacacaagc tccctaatat 68244
atttccaggc aagtctctgt gcaaaagtaa cgccttctac cccagaaaag caataaaagc 68304
ccctaaatgg actgagaagc cagttggaac caacgacacc gcccataaac tttgccaatt 68364
cctcttttaa gtgcggaaat aggccaacgt tttccacggt aaaaaataag gcggtgttag 68424
ggggttgtgc aaaacggtgt tcatcgtgat taaacaatgg gccgttgacc aactcaaaaa 68484
atttatgtgt aagtgcaggt aacatagccg catctactgt atgtcgaact aaggtattcc 68544
ttataaatct atgtgcatca aaggccgctt cctgaatgcg attgtcgata atacaccctg 68604
cctgggaaac cttagctaaa aacacgtttc gtgccccaaa cccattttgc atcgaaacaa 68664
atgtctgcaa taaggcctct ccataaacgt ttaccctgag tgttttttct agctcttggc 68724
gctgttcgcg gatacatcta ccgaggctgg ttagagagcg cttagacaga tgttcaagat 68784
atactcgttt tctttccgca gcgtcaattt ttgctttgga taacaggttt tcccatccat 68844
gaacagcaaa agttatgcta ctatcatcgt tcagatcaca ttcggatgct gtagcgctgt 68904
cactcgcctc tttggttaaa tcatcctcac acccaactct ttctaaaagt tggcgtaagg 68964
cggcttcgtt atcatgactc gggttagaaa tgtgacgcat gagaggcgac gatagatgat 69024
gggcataaca ggctttaatt aatgcttcta tctgactgtc tggtgaggat gagcgattgc 69084
caaccaaaat actatcaatt gcatccagag atccgagttc ttcagaaaac gccctatcaa 69144
agtgtatggg agtttttcca aataacgcca actcaaccgt tacggctgta agttctgcgt 69204
gtttttcatg ttcgtttaat acatccaaat tatcaacaaa cacgtctaaa gtacgttggg 69264
gtccttcagt ggaattagac gttaaccaga atttaagctc gctgatggcg tacatacacc 69324
gaggggcagg tttaaaaaca ttataagaat caagcatggt gctcgcaatt tcgtatgttt 69384
ggtcaaacgt atcgttcttt gctcccatgg gattgactac cgtcttggtt tgaagtgttc 69444
gtaacgcttc aacggcggat tgacgtttga tatccggtac atcagggata tatggtaaac 69504
accgaataat atcgtcaaca tccacgttaa cccgaacttg cttagtaacg tgatcgcaga 69564
tacatcctaa taatctacga tgtaaggtct caccttggtt agctgttata cataattctt 69624
caaaacatat agcacatgga tgcgccggat caaacaactc cacgggtggt attagaccgc 69684
ttcctatagt atctgtcata aaccgcgtca cggtttccaa tgtgtttagc gcctcacgag 69744
aagacgtgat aatatagcag taattaagct gttttaaaaa attctcaatg tcgtgtaaaa 69804
actgaatttc actcgatacg taacctccgt atgttgttaa cgacagctca tggtgatatt 69864
ggcaggtttt taagttaaac gctgagacaa agtaatcttt gatacagttt gagtctttag 69924
tctcaccata aacaaaacgc gttttaaaat gttgtagcaa ctcaaacaaa cgctcacctt 69984
ctccctttac ataacataat gcccaatgca gtgcacaaac cgtgggagtg tttccaaatt 70044
ttagctgggt attaaatccc cggagaaaca gctttaataa ataacgaact gtcaaacagt 70104
tagccgctaa ccgatataaa aagcgacacg ccacccttgg atcacatcga cgtagcagtt 70164
caacctgaaa gatgtacgta taaacttgac ccagtagaac caacaatgtt cgattaatat 70224
ccaattccat gttatcgggc tgttactgtc cactggactg taagctgaat ggttggagac 70284
acactatatt acaggtcata tgggtaagga ctacacacaa aaaaagaaaa cacaacagaa 70344
tatgtaatat tgacttgttt attacgtaca ctcacccgcg tgtgggcttt aattggataa 70404
agagggaggt taaatcattt ccattgtaat gttcccatgt tttatgggaa taccactaag 70464
atcaaagagt tcgtcatctt cggggggtcg tttaagacca gggacagttg accctgaaat 70524
gtttgaccct gaaatgttta atacgttaga tgtggtgtca tggcttggtt cacaagaatc 70584
aaaattaaat gctaggtttg tgggggtttc cccatctccc gctgtcgttt tttcatctag 70644
aatctttact gcttctagag cgccttctac ggtccagggc gtttccaggg tttggataat 70704
ctggtcgtgt aactcctcca agtctctggc taaaaactca tcgtctgtaa gacttaacca 70764
gtcgtcaaat gccatatgtt gtgctcgagc gcccactgca cgcacaaccg tggcgtatat 70824
ggctaattga accatggccc cgccactgac aattataccg cgaacttggt cggatagggt 70884
ggtttctcga tttcccgacg agggacctgt tacggggcag ataaaacctc ctctaggaca 70944
tgctattata aaacggcgag tgcgatcaaa tgtaaatagc gggcatacat ttttaccccc 71004
gtttaaaccg ctccaattcc ctgcctgaaa gacgcggttg tttcctgccg ctccgtgata 71064
tttactaatg cttattccta aaacgaccat gggacgttga ttcatggccg cgcggactaa 71124
atgagttgaa gtaaaagccg tagtccataa ttccggtaag ttttccgttt tttcaagaag 71184
cgcctttgct tgggtttcta tgtccgcggc ggacgtgaca tctttacgaa tccaatgcaa 71244
aacggatgat gggtcacgcg ggcgcctggc acccgtaatt atagaagtta aggtatttat 71304
aaggtactgt gaatgatcgc agtatttaag aataagattt gccatataaa actgggctaa 71364
ttctcctatg caggttgggg gtagattaat aaagtttatt gctgcatatt cctcggtaaa 71424
ccgtttaaca gctgcaatag tggtaatctc ttcgtgtgta agtttatctg ccggcatctg 71484
gttgcgttgt aacagggtcc aaaaccactg cgggttgggg gatttactgt ttggtggcat 71544
accccgagga aataacaggc cgtgaaactg tttaagcaaa aaccctaggg ccccgtgtaa 71604
catatccact cttttttctt ggcgttggta cgcacttgca aggcctacaa gccttgcccg 71664
ggctgcctca gagagatttg tacagttacc tgaaaaaacg accctatttt taactcgtat 71724
atcccgaata acttccacgc ttacgcgcgc taaatcccca tcaaaggtac gccccgcggg 71784
cgcgtcttgt cccaatgtcg gatttggggc ggatacagga ccttcagata atgttacggt 71844
tatagagcgt gttgatataa accccccatt aaacaggtca acaaaacgcc gccgcaaaac 71904
aggttggaat tggttacgaa agttccgccc ctcaacttgc tgtccgtaaa atacacaatg 71964
acattggctt aatgctaagt cttgtaccac ggctaggtgt gttcgtttaa ctaaaaaatt 72024
ggtaatggga caaaatgctc ctgagtatgg atcaaacgtt aacgccattg aatgggtggc 72084
ttcggataat ccttcacgga tcttataatc gcgggtctca accaacactt tcataaattg 72144
tgttgttgtc tgttctatac gcgcacgcag tgtgtctaat atgcgacgaa acgttggatg 72204
atccacaatg acagacgata gtccctcgga agaacatggg gcaccgcgat ccagtagtcg 72264
ctcttgttct agatcgataa acaagcgttc tagtgtagcc ctataagtgt cctgcatggt 72324
tgcctttgct gcttccgttt gatccccggg ttttcgaagg attaaatatg gagcatagtt 72384
tcctagagga tcgcagtcgc tatattggct gttcattgtt ccaaacaccc caataggttg 72444
acgggtggct tgtccaaatc gcggcatgcg ttgtctaagt cggtgtactg ttgtgtgagc 72504
gcataccggc cgcgtgtgtt tttcacataa actacatgga atttcagagt caaaggtccc 72564
cgtaacatat tttaacgcat ccccgtgacc ccctgtaaac gcaccagcgt cacagcgttc 72624
tagataaaaa agcagtcgcg ccaacagggg tgctccaaat ccacaaatga gtgccaaata 72684
atccacgcta aactctgtgt ttgatgaacc cgtagactga ctggatagaa cgtggccatc 72744
tcgatctgtt tggggattcg cagctaaatg aggtccggca aactggtaaa atcgattaaa 72804
tgatggaccc ggtcccccat ccttggcttc ggtcatcccg ctatcctcca cctcagttag 72864
atacaacgca gaatttgggc tgaaaaccat cgcaccaatg accccggcca cacgagcggt 72924
atatgacccc agagcgttta gccttggcaa agtgccctcc atgcctataa acattggcca 72984
ttctttgata tctgttggag tttcttcgta aattccagtg ttgaatataa cttctgcgtg 73044
caaggctgtg tcagcggcca taatagacgc caaccgtctt tcaaaccccc ccgatgggct 73104
aggcttagac gtggagttga catcgtttcg acgcgctcca cgggcggtag tggttccact 73164
tgaagaggac tgaaaatacg tgtacgtaat gtcagggggg agtactgccc cctcgtgatt 73224
ttcatcaaaa gcaaggtggg ccgctcctcg ggcgacggca gttacatttc tgacgcgcaa 73284
ggcaacggcc atgggagcaa taacacagtc atgtattaaa tggcacaacc cggtgttata 73344
aaagggtgtt gggtatacaa aaccctctcc gatagacctg tgatgagtgt tgaatgggtc 73404
gggtaccaga cggttaacat cgggcatgaa aagttgtacc ggaaataacg gtatacgtat 73464
aacatcccca tggttaatat gaacaatatc gagtcctcca taatgcagaa acacgttgca 73524
cataaatacg gcttccttaa acaatgccgt gaccaccaag tataatattg tattttctgg 73584
ctctaatcca aggcgggtgc atatctcagc gccggtcgtc tcaacagcac cgtcaacagg 73644
aggcccttgg cagcgtgaaa acccaaaccg ttctcgagcc gcgttacacg cgcgtgtgag 73704
atttggggcc gcggagctgg gtaaaacgtg tttgcctccg tgaaagacaa agacagatgg 73764
atagaaatga ctggtagtga gttttaaggt aataccagct ccggcaagac ccgtagtgcg 73824
tgctccagaa accaccgcca ggctggatgt aaaagttttt tccacggtca aattacgcat 73884
caaaggtaat aaagccaaat cagagtccgt gctacgagcg gccaaaaatg aaatttcctc 73944
cagatccaaa tcttcaaccc ggcacgcata aacgtaaccc aggggccccg tgggcactgt 74004
cacagtcttc tgagtatttt ccattttggt gaagtactcg taataatggg gggttgttgg 74064
cagggtcaaa tgaccaccca aaacccagtc tgtcgataca ttaagaagag aggcttttaa 74124
acgggtatta catatgcgga aaccacaaca aatcacgtga ttacacttta tgtattagaa 74184
gggcgtgggg ttgtgttact cagtaacact ggctttttac aagattatca atcgttaaca 74244
taaa atg gcg atc aga acg ggg ttt tgt aat ccc ttt tta acc caa 74290
Met Ala Ile Arg Thr Gly Phe Cys Asn Pro Phe Leu Thr Gln
5030 5035 5040
gca tca ggg att aaa tat aac cca aga acc ggg cgc ggt agt aac 74335
Ala Ser Gly Ile Lys Tyr Asn Pro Arg Thr Gly Arg Gly Ser Asn
5045 5050 5055
aga gaa ttt ctt cat agt tac aaa act acc atg tca tcg ttt caa 74380
Arg Glu Phe Leu His Ser Tyr Lys Thr Thr Met Ser Ser Phe Gln
5060 5065 5070
ttt ttg gcc cct aaa tgt tta gat gaa gat gtg ccc atg gaa gaa 74425
Phe Leu Ala Pro Lys Cys Leu Asp Glu Asp Val Pro Met Glu Glu
5075 5080 5085
cga aag ggg gtt cac gtc ggt aca ctt agt cga ccg cct aaa gtt 74470
Arg Lys Gly Val His Val Gly Thr Leu Ser Arg Pro Pro Lys Val
5090 5095 5100
tac tgt aat gga aaa gaa gtt ccg att ctg gat ttt cgt tgt tcc 74515
Tyr Cys Asn Gly Lys Glu Val Pro Ile Leu Asp Phe Arg Cys Ser
5105 5110 5115
agc ccc tgg cct aga cgc gtg aat att tgg ggg gaa atc gac ttt 74560
Ser Pro Trp Pro Arg Arg Val Asn Ile Trp Gly Glu Ile Asp Phe
5120 5125 5130
cgt ggg gat aag ttt gac ccc cgc ttt aac aca ttc cat gta tat 74605
Arg Gly Asp Lys Phe Asp Pro Arg Phe Asn Thr Phe His Val Tyr
5135 5140 5145
gat att gtc gaa aca aca gaa gcc gcg tct aat gga gat gta tcc 74650
Asp Ile Val Glu Thr Thr Glu Ala Ala Ser Asn Gly Asp Val Ser
5150 5155 5160
cgg ttt gca act gca aca cga ccg ctt ggt acc gtt att act tta 74695
Arg Phe Ala Thr Ala Thr Arg Pro Leu Gly Thr Val Ile Thr Leu
5165 5170 5175
ctt ggc atg tcc cga tgt gga aaa agg gtg gca gtt cat gta tac 74740
Leu Gly Met Ser Arg Cys Gly Lys Arg Val Ala Val His Val Tyr
5180 5185 5190
ggc atc tgt caa tat ttt tat ata aac aaa gcc gag gtg gat acc 74785
Gly Ile Cys Gln Tyr Phe Tyr Ile Asn Lys Ala Glu Val Asp Thr
5195 5200 5205
gct tgt ggc ata cgt tcc ggt agc gag tta tct gta tta ctt gcc 74830
Ala Cys Gly Ile Arg Ser Gly Ser Glu Leu Ser Val Leu Leu Ala
5210 5215 5220
gag tgt tta cgc agt tct atg ata aca caa aat gat gca acg tta 74875
Glu Cys Leu Arg Ser Ser Met Ile Thr Gln Asn Asp Ala Thr Leu
5225 5230 5235
aat gga gac aag aac gct ttt cat ggt acc tcg ttt aaa agc gca 74920
Asn Gly Asp Lys Asn Ala Phe His Gly Thr Ser Phe Lys Ser Ala
5240 5245 5250
tct cca gaa agc ttt cgc gtt gag gtt att gag cgc aca gat gtt 74965
Ser Pro Glu Ser Phe Arg Val Glu Val Ile Glu Arg Thr Asp Val
5255 5260 5265
tat tac tac gat aca cag cca tgt gcg ttt tac agg gtg tat tct 75010
Tyr Tyr Tyr Asp Thr Gln Pro Cys Ala Phe Tyr Arg Val Tyr Ser
5270 5275 5280
ccc tca tct aaa ttt aca aat tat ctt tgt gat aac ttt cac ccg 75055
Pro Ser Ser Lys Phe Thr Asn Tyr Leu Cys Asp Asn Phe His Pro
5285 5290 5295
gag ttg aaa aag tat gaa ggt cgg gta gac gct acc act cgt ttt 75100
Glu Leu Lys Lys Tyr Glu Gly Arg Val Asp Ala Thr Thr Arg Phe
5300 5305 5310
cta atg gat aat ccc ggc ttt gtt agt ttt ggt tgg tat caa cta 75145
Leu Met Asp Asn Pro Gly Phe Val Ser Phe Gly Trp Tyr Gln Leu
5315 5320 5325
aaa cct gga gtt gat ggg gaa cgt gtt cga gtt cga ccg gca agt 75190
Lys Pro Gly Val Asp Gly Glu Arg Val Arg Val Arg Pro Ala Ser
5330 5335 5340
cgc caa tta acg tta agc gac gtt gaa att gac tgc atg tcg gat 75235
Arg Gln Leu Thr Leu Ser Asp Val Glu Ile Asp Cys Met Ser Asp
5345 5350 5355
aat ctg cag gct ata cca aac gat gac tca tgg cct gac tac aag 75280
Asn Leu Gln Ala Ile Pro Asn Asp Asp Ser Trp Pro Asp Tyr Lys
5360 5365 5370
ttg tta tgt ttc gat att gaa tgt aaa tca gga gga tct aat gag 75325
Leu Leu Cys Phe Asp Ile Glu Cys Lys Ser Gly Gly Ser Asn Glu
5375 5380 5385
ctg gcg ttt ccc gat gca aca cat ctg gag gat ctt gta atc caa 75370
Leu Ala Phe Pro Asp Ala Thr His Leu Glu Asp Leu Val Ile Gln
5390 5395 5400
att tct tgt cta tta tat tca atc cct cga cag tct tta gaa cac 75415
Ile Ser Cys Leu Leu Tyr Ser Ile Pro Arg Gln Ser Leu Glu His
5405 5410 5415
att tta ctg ttt tcc ctt ggc tct tgt gac tta cca caa agg tat 75460
Ile Leu Leu Phe Ser Leu Gly Ser Cys Asp Leu Pro Gln Arg Tyr
5420 5425 5430
gta caa gaa atg aag gac gcg ggg tta ccg gag ccg act gtg ctg 75505
Val Gln Glu Met Lys Asp Ala Gly Leu Pro Glu Pro Thr Val Leu
5435 5440 5445
gag ttt gat agt gaa ttc gag cta tta att gca ttt atg acc ctc 75550
Glu Phe Asp Ser Glu Phe Glu Leu Leu Ile Ala Phe Met Thr Leu
5450 5455 5460
gta aaa cag tac gct ccc gag ttt gcc aca ggt tat aac att gtt 75595
Val Lys Gln Tyr Ala Pro Glu Phe Ala Thr Gly Tyr Asn Ile Val
5465 5470 5475
aat ttt gat tgg gcg ttt att atg gag aaa ctt aat tct ata tac 75640
Asn Phe Asp Trp Ala Phe Ile Met Glu Lys Leu Asn Ser Ile Tyr
5480 5485 5490
agt ctc aag ctt gat ggt tat ggc agt ata aac cgt ggg ggt ctg 75685
Ser Leu Lys Leu Asp Gly Tyr Gly Ser Ile Asn Arg Gly Gly Leu
5495 5500 5505
ttt aag ata tgg gat gtt ggc aaa tcc gga ttt cag cga cga agc 75730
Phe Lys Ile Trp Asp Val Gly Lys Ser Gly Phe Gln Arg Arg Ser
5510 5515 5520
aag gta aag atc aac ggt ctc ata tct ctg gat atg tat gca att 75775
Lys Val Lys Ile Asn Gly Leu Ile Ser Leu Asp Met Tyr Ala Ile
5525 5530 5535
gca act gaa aaa tta aaa ctc tcg agt tat aaa tta gat tcg gtt 75820
Ala Thr Glu Lys Leu Lys Leu Ser Ser Tyr Lys Leu Asp Ser Val
5540 5545 5550
gca cgt gaa gct cta aat gag tcc aag aga gat ttg ccc tac aaa 75865
Ala Arg Glu Ala Leu Asn Glu Ser Lys Arg Asp Leu Pro Tyr Lys
5555 5560 5565
gac att ccg gga tat tac gct agt gga ccg aat aca cga gga att 75910
Asp Ile Pro Gly Tyr Tyr Ala Ser Gly Pro Asn Thr Arg Gly Ile
5570 5575 5580
att ggt gaa tat tgt ata caa gac tcg gct ctt gtg ggg aaa ctg 75955
Ile Gly Glu Tyr Cys Ile Gln Asp Ser Ala Leu Val Gly Lys Leu
5585 5590 5595
ttt ttt aaa tat tta cca cac ctt gag tta tcc gcg gtt gca agg 76000
Phe Phe Lys Tyr Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg
5600 5605 5610
cta gct aga att act tta acc aag gct att tac gac gga cag cag 76045
Leu Ala Arg Ile Thr Leu Thr Lys Ala Ile Tyr Asp Gly Gln Gln
5615 5620 5625
gtt agg att tac acc tgt tta tta gga ctg gct tcg tct cga gga 76090
Val Arg Ile Tyr Thr Cys Leu Leu Gly Leu Ala Ser Ser Arg Gly
5630 5635 5640
ttt att tta ccc gat ggg gga tac cca gct act ttt gaa tat aag 76135
Phe Ile Leu Pro Asp Gly Gly Tyr Pro Ala Thr Phe Glu Tyr Lys
5645 5650 5655
gat gtt att ccc gat gtc ggg gat gtt gag gaa gag atg gat gaa 76180
Asp Val Ile Pro Asp Val Gly Asp Val Glu Glu Glu Met Asp Glu
5660 5665 5670
gac gag agc gtt tct ccc act ggt acg tca agt ggg cga aat gta 76225
Asp Glu Ser Val Ser Pro Thr Gly Thr Ser Ser Gly Arg Asn Val
5675 5680 5685
gga tat aaa gga gcc agg gtt ttt gac cct gat acg gga ttt tat 76270
Gly Tyr Lys Gly Ala Arg Val Phe Asp Pro Asp Thr Gly Phe Tyr
5690 5695 5700
atc gat ccg gtg gtc gta ttg gat ttt gca agt tta tat cca agt 76315
Ile Asp Pro Val Val Val Leu Asp Phe Ala Ser Leu Tyr Pro Ser
5705 5710 5715
ata att cag gcc cat aac tta tgt ttt acc acg cta acg tta aat 76360
Ile Ile Gln Ala His Asn Leu Cys Phe Thr Thr Leu Thr Leu Asn
5720 5725 5730
ttt gag acg gtt aaa cgt ttg aat cca tcc gat tat gcc acc ttt 76405
Phe Glu Thr Val Lys Arg Leu Asn Pro Ser Asp Tyr Ala Thr Phe
5735 5740 5745
aca gtt gga gga aaa cgt ctt ttt ttt gtg cgc tct aac gtt cga 76450
Thr Val Gly Gly Lys Arg Leu Phe Phe Val Arg Ser Asn Val Arg
5750 5755 5760
gaa agt ctg ctg ggt gtt ctt tta aaa gac tgg ttg gct atg cgc 76495
Glu Ser Leu Leu Gly Val Leu Leu Lys Asp Trp Leu Ala Met Arg
5765 5770 5775
aag gct att aga gcg cgc ata ccc gga agt tct tca gat gaa gca 76540
Lys Ala Ile Arg Ala Arg Ile Pro Gly Ser Ser Ser Asp Glu Ala
5780 5785 5790
gtg tta tta gac aaa caa caa gcc gcg ata aaa gta gtt tgt aat 76585
Val Leu Leu Asp Lys Gln Gln Ala Ala Ile Lys Val Val Cys Asn
5795 5800 5805
tcc gtg tac ggt ttt act gga gtt gcg cag gga ttt ctg cca tgt 76630
Ser Val Tyr Gly Phe Thr Gly Val Ala Gln Gly Phe Leu Pro Cys
5810 5815 5820
tta tac gta gcg gcc act gtc act aca att ggc cgt caa atg tta 76675
Leu Tyr Val Ala Ala Thr Val Thr Thr Ile Gly Arg Gln Met Leu
5825 5830 5835
tta agt acc aga gat tat att cat aat aac tgg gcc gca ttt gaa 76720
Leu Ser Thr Arg Asp Tyr Ile His Asn Asn Trp Ala Ala Phe Glu
5840 5845 5850
cgt ttt att aca gcg ttt cca gac att gaa agt agc gtt ctc tcc 76765
Arg Phe Ile Thr Ala Phe Pro Asp Ile Glu Ser Ser Val Leu Ser
5855 5860 5865
caa aaa gcg tac gag gta aag gtt ata tat gga gat acg gat tct 76810
Gln Lys Ala Tyr Glu Val Lys Val Ile Tyr Gly Asp Thr Asp Ser
5870 5875 5880
gtg ttt atc cga ttc aag ggt gtt agt gtt gag ggg ata gct aaa 76855
Val Phe Ile Arg Phe Lys Gly Val Ser Val Glu Gly Ile Ala Lys
5885 5890 5895
atc ggc gag aaa atg gca cat ata att tca acg gct ctg ttt tgt 76900
Ile Gly Glu Lys Met Ala His Ile Ile Ser Thr Ala Leu Phe Cys
5900 5905 5910
cct cct ata aag ttg gag tgt gaa aaa act ttt ata aaa ctt ttg 76945
Pro Pro Ile Lys Leu Glu Cys Glu Lys Thr Phe Ile Lys Leu Leu
5915 5920 5925
ctt ata aca aag aaa aag tac att ggg gta att tac ggc gga aag 76990
Leu Ile Thr Lys Lys Lys Tyr Ile Gly Val Ile Tyr Gly Gly Lys
5930 5935 5940
gtt tta atg aag gga gtc gac ttg gtt aga aaa aac aac tgt caa 77035
Val Leu Met Lys Gly Val Asp Leu Val Arg Lys Asn Asn Cys Gln
5945 5950 5955
ttt att aac gat tat gcc cgc aaa ctt gta gaa ctg ttg tta tat 77080
Phe Ile Asn Asp Tyr Ala Arg Lys Leu Val Glu Leu Leu Leu Tyr
5960 5965 5970
gac gac acc gtc tcg cgt gct gcg gcg gag gcg tcg tgt gtt tcc 77125
Asp Asp Thr Val Ser Arg Ala Ala Ala Glu Ala Ser Cys Val Ser
5975 5980 5985
att gct gaa tgg aat aga cgg gcc atg ccg tct ggg atg gcc ggg 77170
Ile Ala Glu Trp Asn Arg Arg Ala Met Pro Ser Gly Met Ala Gly
5990 5995 6000
ttt gga cgc ata att gca gat gca cat cgc cag att aca tca ccc 77215
Phe Gly Arg Ile Ile Ala Asp Ala His Arg Gln Ile Thr Ser Pro
6005 6010 6015
aaa ttg gat att aat aag ttt gtt atg acg gcc gag ctt agt cgt 77260
Lys Leu Asp Ile Asn Lys Phe Val Met Thr Ala Glu Leu Ser Arg
6020 6025 6030
cca cca tcc gcc tac ata aac cgt cgc ttg gct cac tta aca gta 77305
Pro Pro Ser Ala Tyr Ile Asn Arg Arg Leu Ala His Leu Thr Val
6035 6040 6045
tat tat aaa tta gta atg aga cag ggt caa atc cca aac gtt cga 77350
Tyr Tyr Lys Leu Val Met Arg Gln Gly Gln Ile Pro Asn Val Arg
6050 6055 6060
gaa cgc atc cct tat gtt att gtg gcc ccc aca gac gaa gtg gag 77395
Glu Arg Ile Pro Tyr Val Ile Val Ala Pro Thr Asp Glu Val Glu
6065 6070 6075
gct gat gca aaa agt gta gct ttg cta cgt gga gat cct tta cag 77440
Ala Asp Ala Lys Ser Val Ala Leu Leu Arg Gly Asp Pro Leu Gln
6080 6085 6090
aat acc gca ggt aaa cgg tgt ggg gaa gca aag cgt aag tta ata 77485
Asn Thr Ala Gly Lys Arg Cys Gly Glu Ala Lys Arg Lys Leu Ile
6095 6100 6105
ata tct gac tta gcg gaa gat ccc att cac gta aca tca cac ggg 77530
Ile Ser Asp Leu Ala Glu Asp Pro Ile His Val Thr Ser His Gly
6110 6115 6120
ctg tct tta aac att gac tat tat ttt tct cat ctc att ggg acg 77575
Leu Ser Leu Asn Ile Asp Tyr Tyr Phe Ser His Leu Ile Gly Thr
6125 6130 6135
gcg agt gta act ttt aag gcg tta ttt gga aac gac act aaa ctc 77620
Ala Ser Val Thr Phe Lys Ala Leu Phe Gly Asn Asp Thr Lys Leu
6140 6145 6150
aca gaa cgg ctt tta aaa cgt ttt att cca gag aca cga gtt gtt 77665
Thr Glu Arg Leu Leu Lys Arg Phe Ile Pro Glu Thr Arg Val Val
6155 6160 6165
aac gtt aaa atg cta aac cgc ttg cag gcg gca ggc ttt gtt tgt 77710
Asn Val Lys Met Leu Asn Arg Leu Gln Ala Ala Gly Phe Val Cys
6170 6175 6180
ata cac gcc ccg tgc tgg gat aat aaa atg aac act gaa gct gaa 77755
Ile His Ala Pro Cys Trp Asp Asn Lys Met Asn Thr Glu Ala Glu
6185 6190 6195
atc acc gag gag gaa caa agt cat caa ata atg cgt aga gtc ttt 77800
Ile Thr Glu Glu Glu Gln Ser His Gln Ile Met Arg Arg Val Phe
6200 6205 6210
tgt att cca aaa gca att ctc cat caa agt taa ggtcacacat 77843
Cys Ile Pro Lys Ala Ile Leu His Gln Ser
6215 6220
tttacagtaa acgtccgatg ttccaatgga tggcaccaca gtctctgttt gttgttctgg 77903
gttgcgacat accgacagta aaaatgttgt ctgccaaacg tgtgcgacta ttttataccc 77963
ccgacacgcg gcctgtatat gatcgattag tctgtaatgg aggtgaacag atttccccgg 78023
aaatataaca tacatcataa accctccggc atctccatca cgataaaaaa gtacgcgtat 78083
atctcgcatg cccccctccc tcataatatg atacataaaa aataattccg gttgtgccaa 78143
tacgctcgac aataggggtt cagcggacgc tgcttgcatg ttggctcggt ctgatagcgc 78203
caaaatggat gcaagaaaca cacgatattc gtatatgttg ttaagctgct gaacatatgc 78263
tagtatcaga gccgcccgat cggttcgaca tagacgcggt tctccagatg cagtgcatgt 78323
cggacaataa cctccgagac ctaaatgata acccattccg gataacgaca ggcagttatc 78383
cgccactgtc tgacccaagt taaacggaag ggtgacaggg gtcgtcttaa taattggcac 78443
gattaagcct ctaacggtag ctagttcttc tgggggggac cttgcgatgt aattaaaata 78503
atggcggtac acagaccgct cctttgacat aagttttcct cgaatggttt gacggcggca 78563
tggcgttctg gagcgtacac gcatcgaagg cccgctccga gatacggatt tatatcgcct 78623
agacatactt cgatagggtg tgttcgaggg agaccagcga tattgcatgt cgtcgaagca 78683
taagtcctcc atctcgtatg aatgcggcat tggcggttgg tttgcgtgga aaaatctggt 78743
aggctttaaa tgcatacgcc aatgtccaca ggccagcaca tacacgacct tcaaatctgt 78803
tttgtgcagc cagttgggct ttatataact ccaacacttc cttatccgcg gttgtaggat 78863
gggcaaataa aattttaggg tttgtctgta attccgagag agcgcatagg agatcgcaaa 78923
ataaatgttt atatacggcc tcggaacctg ctgcttttac aatggttatg aatcgaccct 78983
catcattaaa tggaaattta agtttaatgg ggttatctag gctccatgca ttgattacag 79043
gttcgataca gtcaaataaa cttgtgttat ttgctgagta tgctaatata tcccttttaa 79103
actgccgcaa agcgatccaa tagtgcccta tggttaataa acaacataac atgcattcgg 79163
tattcgtgcg accacggcac tttgcgtttt ttaaggtagt taacaaaata ggcccaaatg 79223
ctacttcggg acaggcatcc tcgctcgtaa atttggaaaa ttgttttgga acgtccctgg 79283
agttttcaat ccggttcgcc tgccagtatg atttaacaca tccaacgcga gtttctatag 79343
cacaatttgt acgctcgtct acattccaag tagcaacccc actcagtaat aataaagcca 79403
agtctgcgta accccaagct gcttctacgt aagacgattt tttaagtttg tctattaata 79463
tatcttcaaa ctctatatcc tcttttaatg cacgttccgc gcaggtaatc atcccacttg 79523
tgcgatgttc cgatgatgaa cagtcaacca tccgcgcgca atctttccaa ttagccaagg 79583
cggccgctaa aggttgaggg cctccagtca gtccgtcctg attaaagtaa aatgacttac 79643
gcgtttcggt ttcgtctaaa cgagacgtgc tgatcgatag taatgcagcc ctaactaccg 79703
cctgtacaaa aaaggagtaa ttggaatatc ttactttgtt ggccatttcc ttaccggggg 79763
gcggtcgtat acctggtaac cctccatcgg tacgaaaaca tacatgcaaa aaaaaatgtc 79823
tttgtatatc taacagggaa agcgtgcgcc tggcaaagga cccaaccaaa gtaacgttat 79883
aacggcacag gtagtaacga tccataagat acacaaattc aaaggcagcg ctaaaagtga 79943
caactgcaca gggcggggat gccagacttt tcatacataa catggcgtag tccgcaaccc 80003
attccggtgt cagtccaaat ttacgtttat acaaatcaag cgttctgcag accaaacagg 80063
ctctgtctaa cgccaaaacg tctccaattt cttctataaa ttcagtggta cttgaggtca 80123
tttcattatc cacacctacc gccgaagagg cctcataagt gacgttgttt tctattgcac 80183
ggggaacatt tattaagtgg gaggcaaata acaattctga aaataacaaa tcgtttttta 80243
gcagaagggc tggattataa gcg atg tac gaa tcg gaa aat gcg tcg gaa 80293
Met Tyr Glu Ser Glu Asn Ala Ser Glu
6225 6230
cac cat ccg gaa tta gaa gat gta ttt tcg gag aat acg ggc gat 80338
His His Pro Glu Leu Glu Asp Val Phe Ser Glu Asn Thr Gly Asp
6235 6240 6245
tcg aat cca tcc atg ggt tct tct gat tct acc cga tcc atc tct 80383
Ser Asn Pro Ser Met Gly Ser Ser Asp Ser Thr Arg Ser Ile Ser
6250 6255 6260
ggg atg agg gcc cgc gat tta att aca gac acc gat gtt aat cta 80428
Gly Met Arg Ala Arg Asp Leu Ile Thr Asp Thr Asp Val Asn Leu
6265 6270 6275
tta aat atc gat gca ctg gag tca aag tat ttt cct gct gat agc 80473
Leu Asn Ile Asp Ala Leu Glu Ser Lys Tyr Phe Pro Ala Asp Ser
6280 6285 6290
acc ttt act ctt tcc gtt tgg ttt gaa aat tta att ccc ccg gaa 80518
Thr Phe Thr Leu Ser Val Trp Phe Glu Asn Leu Ile Pro Pro Glu
6295 6300 6305
ata gaa gca att cta cct aca act gac gct caa tta aat tat ata 80563
Ile Glu Ala Ile Leu Pro Thr Thr Asp Ala Gln Leu Asn Tyr Ile
6310 6315 6320
tca ttt acc agt cgc ctg gcg tcc gtt tta aaa cat aaa gaa agt 80608
Ser Phe Thr Ser Arg Leu Ala Ser Val Leu Lys His Lys Glu Ser
6325 6330 6335
aac gat tca gaa aaa tct gct tat gtt gtt cca tgt gaa cat agt 80653
Asn Asp Ser Glu Lys Ser Ala Tyr Val Val Pro Cys Glu His Ser
6340 6345 6350
gcc agc gtg acc cgt cgc cgt gaa cgc ttt gcg gga gtc atg gcc 80698
Ala Ser Val Thr Arg Arg Arg Glu Arg Phe Ala Gly Val Met Ala
6355 6360 6365
aaa ttt cta gat ttg cat gaa ata ttg aag gat gct taa tacatgggga 80747
Lys Phe Leu Asp Leu His Glu Ile Leu Lys Asp Ala
6370 6375
aaataaacgc attaataagt cgtcaggggg tggatcactc caaactcctc ctatatccca 80807
cgataaatgg tctaaatact tcacgcgtgt gccttggttt gcaaccttag atcaat atg 80866
Met
tca cgg aga acg tat gta cgg agt gaa cgc agg agg ggt tgc gga 80911
Ser Arg Arg Thr Tyr Val Arg Ser Glu Arg Arg Arg Gly Cys Gly
6380 6385 6390
gat aat ctt tta caa cgt att cgg ttg gtg gta cca agc gct ctt 80956
Asp Asn Leu Leu Gln Arg Ile Arg Leu Val Val Pro Ser Ala Leu
6395 6400 6405
caa tgt tgc gat ggg gat ctt cca ata ttt gat cca caa cgc ccc 81001
Gln Cys Cys Asp Gly Asp Leu Pro Ile Phe Asp Pro Gln Arg Pro
6410 6415 6420
ccc gcc cgt tgt gtt ttt cag ttt aac ggc gaa gac aac gta tcc 81046
Pro Ala Arg Cys Val Phe Gln Phe Asn Gly Glu Asp Asn Val Ser
6425 6430 6435
gaa gcc ttt ccg gta gag tat att atg cgt tta atg gcg aat tgg 81091
Glu Ala Phe Pro Val Glu Tyr Ile Met Arg Leu Met Ala Asn Trp
6440 6445 6450
gcg caa gta gat tgt gac cct tac ata aaa att caa aat acg ggg 81136
Ala Gln Val Asp Cys Asp Pro Tyr Ile Lys Ile Gln Asn Thr Gly
6455 6460 6465
gtg tct gtg cta ttt caa ggt ttt ttt ttt cgt ccg act aac gca 81181
Val Ser Val Leu Phe Gln Gly Phe Phe Phe Arg Pro Thr Asn Ala
6470 6475 6480
cca gtg gct gaa gtg tcc att gac agt aat aac gtg att ctt agt 81226
Pro Val Ala Glu Val Ser Ile Asp Ser Asn Asn Val Ile Leu Ser
6485 6490 6495
tca acg tta agt acc ggt atc aac cta tct gct ttg gaa tca att 81271
Ser Thr Leu Ser Thr Gly Ile Asn Leu Ser Ala Leu Glu Ser Ile
6500 6505 6510
aaa cga ggt ggg ggt att gac cgc cga cct ctc cag gct tta atg 81316
Lys Arg Gly Gly Gly Ile Asp Arg Arg Pro Leu Gln Ala Leu Met
6515 6520 6525
tgg gtg aac tgc ttt gtg cga atg cca tat gtt cag tta tcc ttt 81361
Trp Val Asn Cys Phe Val Arg Met Pro Tyr Val Gln Leu Ser Phe
6530 6535 6540
cgt ttt atg gga ccg gaa gat cca tct cgc acc att aaa ctt atg 81406
Arg Phe Met Gly Pro Glu Asp Pro Ser Arg Thr Ile Lys Leu Met
6545 6550 6555
gcc cgc gcc acg gat gca tac atg tat aag gaa act ggc aat aat 81451
Ala Arg Ala Thr Asp Ala Tyr Met Tyr Lys Glu Thr Gly Asn Asn
6560 6565 6570
ttg gat gaa tat ata cgc tgg cgg cct tca ttc aga tcc cca ccc 81496
Leu Asp Glu Tyr Ile Arg Trp Arg Pro Ser Phe Arg Ser Pro Pro
6575 6580 6585
gag aac gga agt cca aac acg tct gtt caa atg caa agt gac att 81541
Glu Asn Gly Ser Pro Asn Thr Ser Val Gln Met Gln Ser Asp Ile
6590 6595 6600
aaa cct gcg tta ccc gat acc caa act acg cgt gtc tgg aaa ctt 81586
Lys Pro Ala Leu Pro Asp Thr Gln Thr Thr Arg Val Trp Lys Leu
6605 6610 6615
gct cta ccc gta gct aac gtg aca tat gcc ctg ttc att gta att 81631
Ala Leu Pro Val Ala Asn Val Thr Tyr Ala Leu Phe Ile Val Ile
6620 6625 6630
gta ctg gta gtt gta tta ggg gcg gtg ctt ttc tgg aaa taa 81673
Val Leu Val Val Val Leu Gly Ala Val Leu Phe Trp Lys
6635 6640 6645
attgcctttc cgtacatatc ctgcgcagat gtacgtgtat gctgttatcg attgtcccgt 81733
aaactaataa acg atg aca caa ccc gca tcg tct cgt gta gtc ttt gat 81782
Met Thr Gln Pro Ala Ser Ser Arg Val Val Phe Asp
6650 6655
ccc agc aac ccc acc aca ttt tcg gtg gaa gca att gcg gct tac 81827
Pro Ser Asn Pro Thr Thr Phe Ser Val Glu Ala Ile Ala Ala Tyr
6660 6665 6670
acc ccc gtt gct tta ata cga ctt tta aac gcc agt gga cct ttg 81872
Thr Pro Val Ala Leu Ile Arg Leu Leu Asn Ala Ser Gly Pro Leu
6675 6680 6685
caa cct ggt cac cgt gtg gac atc gct gat gcc aga agc att tac 81917
Gln Pro Gly His Arg Val Asp Ile Ala Asp Ala Arg Ser Ile Tyr
6690 6695 6700
acc gtg gga gcc gcg gcc agt gcc gcg cgt gca cgc gct aac cat 81962
Thr Val Gly Ala Ala Ala Ser Ala Ala Arg Ala Arg Ala Asn His
6705 6710 6715
aat gca aat acg ata cgc cga acg gcc atg ttt gcc gag act gac 82007
Asn Ala Asn Thr Ile Arg Arg Thr Ala Met Phe Ala Glu Thr Asp
6720 6725 6730
cct atg aca tgg tta aga cca acg gtt ggc tta aaa cgt acg ttt 82052
Pro Met Thr Trp Leu Arg Pro Thr Val Gly Leu Lys Arg Thr Phe
6735 6740 6745
aac ccg cgt att ata cga cca caa ccc cca aat cca tcc atg agt 82097
Asn Pro Arg Ile Ile Arg Pro Gln Pro Pro Asn Pro Ser Met Ser
6750 6755 6760
ttg gga atc tcg ggg cct act ata ttg ccg caa aaa aca cag agc 82142
Leu Gly Ile Ser Gly Pro Thr Ile Leu Pro Gln Lys Thr Gln Ser
6765 6770 6775
gcc gat cag tct gct tta caa cag ccc gcc gcg ttg gcg ttt tcg 82187
Ala Asp Gln Ser Ala Leu Gln Gln Pro Ala Ala Leu Ala Phe Ser
6780 6785 6790
gga tca tcc ccg caa cac ccc cca cct caa acg acg tcg gca tcc 82232
Gly Ser Ser Pro Gln His Pro Pro Pro Gln Thr Thr Ser Ala Ser
6795 6800 6805
gtt gga caa cag caa cac gtg gtg tcg ggg tct tct gga caa caa 82277
Val Gly Gln Gln Gln His Val Val Ser Gly Ser Ser Gly Gln Gln
6810 6815 6820
ccg caa cag gga gca cag tca agc act gtc cag cca aca acc gga 82322
Pro Gln Gln Gly Ala Gln Ser Ser Thr Val Gln Pro Thr Thr Gly
6825 6830 6835
tca ccg ccc gcg gcc caa ggc gtg cca cag tct acc ccg ccc cca 82367
Ser Pro Pro Ala Ala Gln Gly Val Pro Gln Ser Thr Pro Pro Pro
6840 6845 6850
acc caa aat acc ccc cag ggg ggt aag gga cag acc ttg tca cac 82412
Thr Gln Asn Thr Pro Gln Gly Gly Lys Gly Gln Thr Leu Ser His
6855 6860 6865
acg gga caa tct gga aac gct tca aga agt cgt agg gtg taa 82454
Thr Gly Gln Ser Gly Asn Ala Ser Arg Ser Arg Arg Val
6870 6875 6880
ataaaaatac acagaaaata atcgttgttt tttttttttc tttaataggc gctactttat 82514
atatatgttc catctaataa caatttaaca taccccaggc cttgaaatat atcttccgtg 82574
tggtccatta aaagtttacg tgtatatcta acgttgtcga tcatacgccg acatgcctgg 82634
cttaacagtg ccagtgctcc caaggcggtc cctcggacat aacgtcgacc ggtgactaca 82694
tcggcattaa caacgcgtac gtgcctagga acgcgacatg accgatgttc gtcgagactt 82754
agtacgatag aattttcact ctcactgcta tagccgtcag aaggttcact aaaatcactt 82814
tgatataggt ctctgttcgg tgagttaaac gggttagaag atacgcccca tgtgtgtacg 82874
tcccgcggaa agtttggcga cggctgtctt gttttatagt ttgtcaatag gtccgtttgg 82934
ggggcatcgg ctgccgtgtc aacccaagaa ggtaattggt tatttacagg gtggaatact 82994
gttttggagt tatacattcg agggtcattt tgatgccgtt tttgtgtttg atttgatttt 83054
tcaatccaag tccggtaaga tatcgagtca tctgtcggaa tggaggtaat ctcaagtatt 83114
ggtttactga acgcttggtt taacgtttgg ttttgctgtg gattattttt ccaatccgac 83174
aattggttat ttggtggggt aatactttta caaacggttg ttttaatgga ggtggttaat 83234
tgcttaggta taagcacagc cgtttctgat tcggtttcca catacaaggg aatatctttt 83294
tggctaaggg ttgtttttac gctttggggt ttcttaaaat tttgtaccac tatttgtgtg 83354
cccggttcta tgggctgcgc gggctggacg gtctgcgcgg gctgcgcggg ctgcgcgggc 83414
tgcgcgggct gcgcgggccg gacgggcatt tttttacatt tggaacttct aacaacttga 83474
acgggagact taaatagtgg ctccgttggg taaaaatcag ttgtgtcgag gtttgggggg 83534
ccttggggcg cataaccggc aataatttgc gagtaaaatg gatcatcggg acacgagggg 83594
aaagagtcct catcggagtc gatgtacggc aaatcaataa gagattctgg cccccaaaag 83654
accacataat ccggagctgg aatcgagtct tttcgttccc ataaaacata agagggattc 83714
atatgttcgt ccaatagcag taaacacggg ggattgtttg taagaggttt gtttgcaatg 83774
agcgcggaca gtttttctag ttggacaatt aaacatgcat tttcaattgg gtgaggatcc 83834
atatgcagta aacgtaaccc ccaagaaaca atatcgggag atacggctga ggatataccc 83894
gaatctgtag ccatatgccg ggaatcgaat aattttaacc ccaaagtaag tccggaattt 83954
tgtgaaaaaa cggatgtaag ttccatggca ataaccaaag gagccccaaa taaaatagcc 84014
gtagttgcta tgtcaaacgc tgttacatta gaaacgtgag ttggtgccgc ccccgttact 84074
gtaaaaactg tcgttgcccc ggtttgaata taagcatctg taattacttt acggtctgta 84134
ttccactcta acgtggtatt gctgggagcg gtatcaaatc ttgcattaag cgtggatgac 84194
aaatccagtc tcgcatcaac tataatgtcc acatcatctt gtgttctgta tgcgaattcc 84254
tccggttgca gcgtattcca cattgacact aaggaaattg gcgggagaca catgcgccca 84314
agtacagtaa tggtcgcaag acgatctgtt gccattatac gggtaactgc atcgacattg 84374
gcctttaccc cggggggggg ggaaagttgc catggcgcaa aaggatccgt tattttctgt 84434
ggcattttaa caagtttcca atctgccatc cgaaatgcaa atattctccc gccggtagga 84494
tcttggtttg atccaagagc taacgggagt ttatgtttgg atggggcggc attaaaaaag 84554
gcccgggcat tgcctttcgg gtctccatcc aatgagcgaa cgagtaaacg acctctaaat 84614
gcggcctcga ttacgggtct gagggtacgg gccatatact catgctgttt aattaaatcc 84674
aatctagcaa aattgagaag atgtaccggg gttgtggcaa ccaaaatagt tacaatatca 84734
gatagggtta actctaatac gcgtacatca ccccttctct ggattggttt agttagtcca 84794
ttgcccatta cagaagttag gtggtgtttt tcaaaagaat aaataagctt gtcccaatga 84854
accccgtatt gtcgtgtcaa tgttgttgtt ataaggccgg cgtatatctc tgcaatggct 84914
ctgcgtgcgt tggttttacc ccattcggcg tggcgatgcc taacaaaggt ggaaaaggca 84974
gaaaacccgg attttatagc ggattcgtat tcaatgcgta gctcattgac ctcattggcc 85034
acgcgtaatg ccgatagtgt agtttcttcc acaatacctt catcgggaag tgccggtcgt 85094
aaagataatc caccctgact gcatagcatg gttccaagtt taaccccaac tgatttataa 85154
cacactctat atcgaagtgg aaccccatca gtatccaagt aaaaggttaa attttttcca 85214
aacaaagcgt ctatagaaga tctggcttcc ctaaacgttt gcaccacaga cgcctccccc 85274
ataaacgcgg ccacccgcgc gcgaatgcat cgggattggt cttctagaca tacacccgac 85334
atggcgggaa taggagcaaa tactccgttt ggagacgcgg gtggaaaaat ctcagcataa 85394
cttgcataca agcctaaccg taccatgagt aaatttcgaa atgtcatata ctgtgttttt 85454
atccatatag gcaaatgtgt aaattgatcg aaatcacgta ataacatacg aagaacgtca 85514
tggtcactct ccgttggaat acgttcaatt aaagctcgta agcgagacgc ggtctcttca 85574
aactgtcttt gtttttgcaa cacaacatcc aacgccgttt tacgttcctg ttgctttttt 85634
tggagacggt ttttcgtttc cacaggcaat aaaaaaaatt cgggttcgga acaaagcatg 85694
tcaaccacag acgccgatgc ctggagtgcc tttatactat ctacggaagt agcgtacgtg 85754
tcccccgatg ccaacatatc ccccgtttct cttttaaatg ttgtccatgc atcgtcccac 85814
gtaacctcac cggtttctat tcgtcgtcgt aggagatcta atttagttcg taacgcataa 85874
agtttatcca ccctatcttt ataagtatgc agggggccgg taccgtctat acgcacactt 85934
agtggatggg agtcaataat attgcgagct tggtacatcc agtcaaccgc ctgaatgtcc 85994
aacgtaccag tttcttcttc cagacgccct aacagcaatg caaactgcgt gacaatgtcc 86054
tgaacggaag tagatctatc aatggccaat gcaattgttt tgggaggggg ggttagtttt 86114
aataacgtct tcaggttagt gaaatccacg tcttcttgtt gctcagtggc attggccgta 86174
tcaacaatgt tttgtaaaat ttgtattgca cggttctttg ttgcctgaca caccatacga 86234
aactgtgcaa gctctgcgtt ggcctccttt agatcgcggt ctaccattgt ttgtaaaact 86294
ctttcatatg ccgttcctat aaatcgttta tgatcgaaag ttgtaaaata ctctatagcc 86354
cgaatcaata aaccttcgct tggtaatgta atgaagacac cttctttaag tgcaagttta 86414
gccgtttcgt gtgcccgaac atgagttact tcctccaaag ccttatttgc ttctgtaagt 86474
ttagccccgg cagcctgtct gatgtctgct ctaagttcac ttaacgttac actttcggtt 86534
ataaactgat cataagaccg tgtgtaaaaa tcaacatacc cagccaatgc gggtatttgt 86594
aatagatccc ccgaggttcg taatatcata tctcggtagt taggtattcc attaatagac 86654
gtatcggcgg ataataacac aaccgcgcga atacgcataa gcagaaggag tccctcaata 86714
ctaacacctg gaaacataac ggtgaatacg ggagccgtaa tcgaaaaatc atcaaaccat 86774
gttaaccctc ttaacaaagc caaaggtggt ggaatatttg tattttcggg ggtatatgga 86834
ttgtggcgaa acacagtatc taatgccgta acggctgtag tttcacatgc cgctacaacc 86894
tgacgtgctt gtgggtaaac tgcgtgtata tcaaaccctc ccgcagcggc catatctcgc 86954
aaccgtgcca gttctgtccc gttaaatata tggcgcgttt ccattgcctc cagacatgct 87014
ataatatcac gtgtccaacg ctcatacaaa acccgagtcg tttgttgttt atgtgcacgt 87074
tcacaatcgg taattaacac ttttgcctgt tccttcttct catgtgccat tgtgacgagt 87134
ttatgaagat cctttactac tcgagataag gtagatgtcc cagcaataca atgggggggt 87194
agcttttgca gttcaacttc aacaagttta accgtatcca ttagggcgtg cacatttgtt 87254
tgtattgata atatcgcatc gtttatttca taatcatcta catttgaaga tgatataaac 87314
ttcaagtgtc cagatgagac agctagggtg gagaagtcat ctccaagttg tttttgtaat 87374
tcagtcaact tatgggtatt cgtaatttct tgttgtaaag cctgatattt tttatataac 87434
gacaacacaa aggataaggc tggcactgta gatgaaacta acgtcgcttt actaagagtt 87494
gcaacaacag cgtttatggc ggactcactt tccgggttag tagatacggt cgcgatgtcg 87554
gatgcaagca catctgccac gtcaccaaaa aacgcaagca atgatggtct atactccgta 87614
ggtatcggca caatttcgat aatagctgta aaacaagctc ccagtcctcc taagacagat 87674
gtgcgtgcgt ttccgggatg ttctaaaatg tctcgatatt ggttaaataa cgcccagaaa 87734
tcacccgtaa gttgagaacg gatctctacc ggcgcctcta agactttggt cctgagattt 87794
accagggctc cttggacatc taactcagaa gcaagttccg gtaataaatc tactgcagcc 87854
ctaagtccca caaaaccggt taccgggcga agtaaccccc cgagtttatt atatgccgcc 87914
tgttttcgac tctttattgt gtcctgtgtg gttcgggcct cgtttgcata cgtctgaacc 87974
tctttaagcc ttgtttccaa tcgtctttta atcgaaggtt gtaaaccgga ttggatcatt 88034
ggatcactta aaatttttat ctgtcgcacc aaatcgtctg tagcacgtac aattgttgct 88094
aataaatcat cccgtgttat ttcaggatac ttgacctcac cattgttata tgtttcatag 88154
tcctttacgg cttggtctac ggccgcgctt agcgtattaa agcaagacaa ctctgcttcc 88214
gcggaagccc gttgcgttgc gcgtgtatta atggttttaa ttgtacgtga taattcatcg 88274
aactctcgtc tggataagta acctgcggtt tgagcttccc cgactacaga taaccacgat 88334
acgagagccg tgtctgtgtt taaatcttcc ccttctttaa taagctcttt taacagagat 88394
acgacgggga gattttgtat agtctcagga ggagggtgta tagacaggga cgatatgatt 88454
gccgttaatt tagcgtcgaa ttccggaaga gaatctaaaa gccgttgaat gggtgccgta 88514
cttgcggtcg cgacccgaaa acgcctaggg gattgttcag cgatcaaggc ccgtgtgcta 88574
tataacactc cgcgggtgaa gtattctgta acagccgagg tcaatagcat tgtaacttcg 88634
gatcgtatct catatacacg ccgaattaac tcttcgggtg gaatgtcggt ttcttccggg 88694
ggttcctggg gaacttctgg tatttgaatg gcatcaattt tattgcgggt agaccgtaat 88754
gcattctcta aagcctttac actagaaaca agcgcggtgt gttcacgcat atagcgtgtc 88814
tcccgggccg agactaattc acaaagaaga cgtattcgtt cagacaggga ccctgtacgg 88874
gttggaataa gctcggcaaa aagtccggga tgagaaataa taggcgaatt tagtatactt 88934
ataagacctg tcgacagtct agaatagagg tctatgggcg acgaacctcc tagatccgct 88994
tctaacacat ctagttcggc atgtaaagcg ccggttgtat cctgaacccg catggcaact 89054
agaccaaact tagcgataat gagcatgcca attgtacatt ttttccggag aacagtttta 89114
atagcatatt ccgcccccgg catgtttgtt aatatcaaac ttgtaacgcg tataagttcc 89174
gcggcggagg gaaatcctgt taaaatttcc ccaaggggac ctataaagtc tgatacaacg 89234
cttgtagcac catgatgtgt acgtgttcca ttttccacca caaatcccgt taaacgcaaa 89294
atggtgaggt taataaaaag ctccaatgga tccttttcgt ctggcataac caagggcctt 89354
gttacccaaa tgggaacatt tgtatgaccg gtaataacag tatccaacgt acccaccgca 89414
ttttgtacgt atccattgct aatttcatgc ccatgttgta gtagtttttc gacaaccatg 89474
ttgacatcac cactctgttc aaatagaaat ggcttccgat ctttcaaact gcttcctaca 89534
ttcatgttcc gtgaaaaagt gggactggat tcaggcgtat ctacattgga tacatttaac 89594
acaacgtctg gggtgttagc aacttctcgt aggcggggta atacatgtgc cacttgtgtt 89654
ttatccggca cgtctacagt tggtttaaat ttgagaacct ttttagcggg tttatgtaaa 89714
ttatccgtgg aatgtgtatt atttaccaag ccatcgtcga tacaagctaa attttcctcg 89774
cttgaaggag gagtccatga aggacggcgt ctacgacgac aatccaaaac gccagaccca 89834
gtcgaggttg tttctagaac cgcatgccaa ggatcaacct cgctgtaact acatctaggg 89894
tccagtgcaa ccgtgttggt taacgttgaa tgtagggttg cttccaagga tacactttca 89954
ccaacgtttt ggtgaagagg tactatagac gatttattaa gcacaattgt cgtggaatta 90014
ggaggtgaaa gaagcaacgg atgttgagct gtaaccagtt tttcgcaata ttgttcatct 90074
gtaaaatatg tatcacagct tccgtatata agcgttactg ccgaagatag cgcttcttcg 90134
cttacgggac cgtcgtttgc ggtgacaaaa aaaactaatg cagcggccca ttgggattct 90194
acgttatcgg ggcggtgggt atactccgat gcgatataag agtacacatc cgtggtattc 90254
acccgaacaa ctacagcctg tcctatagtc ccatggccat gtggatcaaa aataaaagat 90314
atgtttccct gacgatagat ggccattccc gttgagtcta cagttataat tgtatacgac 90374
tcttcacgtc ccatccacac atcatcgaaa aacgctctag cgggaatttg tgtgcacata 90434
aaaccctcgt ctggaacggt aaaaaaatgg ctatcgccgt atattctaga caaaacgcag 90494
cacgtactgg tagtgtcccc cagtttaata tatgaaattc gattggggag atcaacaaga 90554
gcacacatat ctgggacacc cctccccata ttagacagat ccgtagtcca tgctgcaccc 90614
tcttgtaaac atccatcaat catgtctgat gacagggttg cattaattcc atgcgcatat 90674
gtcatatgca aaaacgataa agatgttctt aaacatgaca gacctgacgc cggtcccagg 90734
gcaccgtcaa attgattacg gcttcccact cccgcaacag tgacagctat aggcggaatt 90794
atatccattt tattgcttac gaaaattacg cgctcaactg gtagggacgc tacgtttacg 90854
atgattcaaa tcccaaagag cgttgaataa gcacgtgtag ctccaaaaac ctaggaggtt 90914
atgaaaatac tagcaacaac agcgttatac aaacgaaacg cccagtcaat aaaaaccaca 90974
aatactttaa tgtacataaa cacgcctgtt ttatattaag ggtcactccc acttgtattc 91034
ccgggtacgc caagacgcaa atctacatta gataaaatct catctggacg aagccatctt 91094
tgtcctatta tcagcgtcgg agcctctgaa attacgtctc tgaccgttgt cgttatttta 91154
acattatcat gtaaaaacgt tgcatttaca ttctcccatg ttcctatatt acattgtgaa 91214
gttaaaaccg cgcccatctg aggtccagaa tccacaataa actgttgcct ggagtccata 91274
acgaatgaaa ctgctgcctt gtgatcttcc agatttcggg cgtgactcat atatgaggtt 91334
ggggcattta cacgatcctc ttgtatattt tttaaaccgc gcctcaattc attacacacg 91394
gttgcaatag attcggccga cgggactgcc gtggcatgcc tacaagctgt tgctattcgt 91454
tcccacctat ttaaaaaaat ctctacattt gcaagaccgg gtatattttc tcctgttgtt 91514
agtttgtcaa cacgcattaa aatgtccatt tgatcctgtg taaaccgtgc ataccgggaa 91574
gccatcgttg aaaaaagagg cccaatagca gaagtggagg ttttagtcat taaattttca 91634
cattccctaa gggctgagga agcatcggcc ataattgtgc gcaggctgga tattcccact 91694
ctcatttgcg ttcggattcc ttgtattgtc ccaacaatat gaccaaccgc atcagcaaga 91754
tcgtttaaac gccatacccc aaattgttca tcgcgttcaa cctttgggtt acagatacca 91814
acagcctgaa gatctggcgt tggataaaaa cgacacgcaa aaaatttttg taacattaac 91874
atgtaccgag aaaccactaa cgcgttggat gtaaaatttt caaccaacgc ctgcgatgtt 91934
ttatccagtc gggtccaacg accgtaaagt tcatcataat agagagttgc taaaaaaacg 91994
ctatttgtaa tagaaggacg gtcaggaacc aaaggacgga cgtgctgggg aagcaggtca 92054
acggcaatgc caacagtccg atgaagttcg catataaggg aatctatagt agaaacgtga 92114
gatggtaaaa gtactggggg gtgtgtacgg gtaaaaggtt ctaatattgt tataagagca 92174
gatagttttg tgcgatatcc atctgcattg ggcacggttc cggatccata atcgcgagcc 92234
cactgaagaa tagacaaggc ttgctgtatt ccccgtgtat caaaagacaa agctattgtt 92294
gccgaaaggg catcaagcgt gcttgttgca ataatataag gcacaaacga cggttccaca 92354
aacacgctag agcgattata ttcggtagga tatgcaatct ttagtgcttc tgttatcatg 92414
acagcttcca accctgtagc cggtggatca acacaaatag gaggtgtcat acccacatcc 92474
accgggggag taatacccca tcttgaatcc atacgtaggt caaaccaggt tagtacggct 92534
tcgcgtacgg cactttgtcc tgcttgtggt aataacgcac ctgaggattg attatgtgtt 92594
tgcttaaaat atgttgtctc tctagcttct gcatgtgaaa tttcattaag tagtattacg 92654
cgtgtcgctt gtatactttc aataaggggt cgtggagcaa aacagcgccg cataatttgt 92714
cccaggcctc ccggtggtaa tcctaaggcg ggtaatagcg tttcataccg tttttcggtc 92774
tcgtttactg tccgtgtata tgttaacaca tactcggtag taggagcgta cgtttccact 92834
aattggtatg ccagtccacc cactaaagcg agacatccta gtgtttgttg gaaatctgta 92894
atatgctcgt ctgctacaac gcttccgtga ataaaaccgg cgagtgtgtg gtgagcaata 92954
aaccgacgta gcaatttgca tgaaaatctt gtaaacccgg attcggcaag ggtgcttatt 93014
aacatactag gcgatgtact agcctgaacc ttttcccaaa tgccacgggt gtgttctgct 93074
acattacttc cagatgtggt ttccgcttcc gggtcggagt ttagtaaatg taacagcgcc 93134
tcaatatgaa ttccgggggt ggacaacgat aagaggttgc tttcccatat accattgtct 93194
aatctaagtc tcctagcaag cattcccggg gatgtctcaa aaacataagg gtctcgataa 93254
gccagagctg aggaaattcg tgactcgtgc accgttcgtg gattcgttac cgcttcttcg 93314
atggatacga acgcctcatc tttaattaaa tttacagtat ctgcaaataa ttctgatatg 93374
ctatgtactg gcgtttccgt gagatattca tcaatcatac agattataaa aagggctcgt 93434
gttagaggag tgagtgagct attttgaatc tgattccaca taacattaaa cggaagaagc 93494
catcctgcat attgtgttaa aacacgaatc ccctcttgaa caaatggcat atcgtaaatg 93554
atcgcaaatt tatgttgtgt taaagctgca atggcgcttt ccaacattcc aatacgagcg 93614
tgagtgaatg gtgattcaat ggctgttgtt gcatccttta aggcgacctc gtacatattt 93674
atccagtccg ttataatgaa gtgtatcgaa acgggtccaa atttagacga ggttcgtaat 93734
ttattggccg ataaacgatg taacgtctcg gaatcttgta ctacacttga tagctgggtt 93794
aaaattttta agctattacc tggctcctct atcgtccagt ctagatctag tctccaaaac 93854
aacagtccgg gtaaattttc ggccgtgaaa gcagcctccg ccaaaaatgt attggaatat 93914
acaaaacaga tattagaatg caggggttcc catatctgaa taaaatgttt agctggaatt 93974
ttctgggtag atcgtaaaaa cttaactacc ttaactctgg cgatttctgt tttctctggt 94034
gaccatggtc tttcgttgtc cgcttcttgt actttcaagt tacttaaatc atccaaaagt 94094
ttttgtgtat cataacaaat tggttcttcc atggtatatt ctacgctgac ttaactttat 94154
aacgtgatat gaaaaagcta catccgtccg gaatgaaaag acataaatgt aacatcaata 94214
aaaacgcatg ttaataaaaa ggacgtcacg gtaagttgaa gaacctttta acgccaatca 94274
aaatattttt ataggctcct cctacctaac attatataaa cggtactacg actgtataat 94334
gtgtacacat accccaggca acatttcaga tagtaccacg tcacgattgc attgtgtgaa 94394
tttaacccct cagct atg ggg agt caa cca acc aac tcg cat ttt act 94442
Met Gly Ser Gln Pro Thr Asn Ser His Phe Thr
6885 6890
tta aac gaa caa acg cta tgt gga act aat atc agt ctt tta gga 94487
Leu Asn Glu Gln Thr Leu Cys Gly Thr Asn Ile Ser Leu Leu Gly
6895 6900 6905
aat aac cgt ttt att caa ata ggg aac ggg ctt cat atg act tat 94532
Asn Asn Arg Phe Ile Gln Ile Gly Asn Gly Leu His Met Thr Tyr
6910 6915 6920
gct ccg ggc ttc ttc gga aat tgg agt cgc gat tta aca att ggc 94577
Ala Pro Gly Phe Phe Gly Asn Trp Ser Arg Asp Leu Thr Ile Gly
6925 6930 6935
cct cgc ttt gga ggt ctg aac aaa caa ccg ata cat gta cca cca 94622
Pro Arg Phe Gly Gly Leu Asn Lys Gln Pro Ile His Val Pro Pro
6940 6945 6950
aaa cgt aca gaa acc gcg tct att caa gta acc ccc cgt tca att 94667
Lys Arg Thr Glu Thr Ala Ser Ile Gln Val Thr Pro Arg Ser Ile
6955 6960 6965
gtt att aat cgt atg aac aac att caa ata aat cca act tca att 94712
Val Ile Asn Arg Met Asn Asn Ile Gln Ile Asn Pro Thr Ser Ile
6970 6975 6980
ggt aac ccg caa gtt acc att aga ctc ccg tta aat aat ttt aaa 94757
Gly Asn Pro Gln Val Thr Ile Arg Leu Pro Leu Asn Asn Phe Lys
6985 6990 6995
tca acg aca cag cta atc caa caa gtg tca tta acc gat ttt ttt 94802
Ser Thr Thr Gln Leu Ile Gln Gln Val Ser Leu Thr Asp Phe Phe
7000 7005 7010
cgt ccg gac att gag cat gct ggg tca atc gtc tta atc ctt cgt 94847
Arg Pro Asp Ile Glu His Ala Gly Ser Ile Val Leu Ile Leu Arg
7015 7020 7025
cat cca tct gac atg att gga gaa gct aat aca ctt aca cag gct 94892
His Pro Ser Asp Met Ile Gly Glu Ala Asn Thr Leu Thr Gln Ala
7030 7035 7040
gga cgt gac ccc gat gta cta cta gag ggt tta cga aac cta ttc 94937
Gly Arg Asp Pro Asp Val Leu Leu Glu Gly Leu Arg Asn Leu Phe
7045 7050 7055
aat gcc tgc acg gct cct tgg acc gtt gga gaa ggt ggg ggg ctt 94982
Asn Ala Cys Thr Ala Pro Trp Thr Val Gly Glu Gly Gly Gly Leu
7060 7065 7070
aga gca tat gta acg tca tta agt ttc atc gcc gca tgc cgg gca 95027
Arg Ala Tyr Val Thr Ser Leu Ser Phe Ile Ala Ala Cys Arg Ala
7075 7080 7085
gaa gaa tat acg gat aaa cag gca gcg gat gcc aac aga aca gca 95072
Glu Glu Tyr Thr Asp Lys Gln Ala Ala Asp Ala Asn Arg Thr Ala
7090 7095 7100
att gtt tct gcc tat gga tgc agt cgt atg gaa acg cgg ctc ata 95117
Ile Val Ser Ala Tyr Gly Cys Ser Arg Met Glu Thr Arg Leu Ile
7105 7110 7115
agg ttt tcg gag tgt tta cgt gcg atg gta caa tgt cat gta ttt 95162
Arg Phe Ser Glu Cys Leu Arg Ala Met Val Gln Cys His Val Phe
7120 7125 7130
cca cat cga ttt ata agt ttt ttt ggg tcc ctg ctg gaa tat acc 95207
Pro His Arg Phe Ile Ser Phe Phe Gly Ser Leu Leu Glu Tyr Thr
7135 7140 7145
att cag gat aat tta tgc aat ata acc gcc gtg gcc aaa ggt ccc 95252
Ile Gln Asp Asn Leu Cys Asn Ile Thr Ala Val Ala Lys Gly Pro
7150 7155 7160
caa gaa gct gca cgt aca gac aaa act tca act cgc agg gtc aca 95297
Gln Glu Ala Ala Arg Thr Asp Lys Thr Ser Thr Arg Arg Val Thr
7165 7170 7175
gcc aac atc ccg gcc tgc gta ttt tgg gac gtt gac aaa gat tta 95342
Ala Asn Ile Pro Ala Cys Val Phe Trp Asp Val Asp Lys Asp Leu
7180 7185 7190
cat ctt tcc gcg gac gga ctg aag cat gtg ttc ttg gtt ttt gta 95387
His Leu Ser Ala Asp Gly Leu Lys His Val Phe Leu Val Phe Val
7195 7200 7205
tat aca cag cga cgc caa cga gaa ggt gta aga ctg cat ctt gca 95432
Tyr Thr Gln Arg Arg Gln Arg Glu Gly Val Arg Leu His Leu Ala
7210 7215 7220
tta agc caa cta aac gaa caa tgt ttt ggt cgt ggt att ggc ttc 95477
Leu Ser Gln Leu Asn Glu Gln Cys Phe Gly Arg Gly Ile Gly Phe
7225 7230 7235
ctg tta gga cgc ata cga gct gaa aat gcc gcc tgg ggg act gaa 95522
Leu Leu Gly Arg Ile Arg Ala Glu Asn Ala Ala Trp Gly Thr Glu
7240 7245 7250
ggg gtt gca aat acc cac cag cca tat aac aca agg gcg ttg ccg 95567
Gly Val Ala Asn Thr His Gln Pro Tyr Asn Thr Arg Ala Leu Pro
7255 7260 7265
ctt gtg cag tta tcc aat gac ccg aca agc cct cga tgt agt att 95612
Leu Val Gln Leu Ser Asn Asp Pro Thr Ser Pro Arg Cys Ser Ile
7270 7275 7280
ggc gaa att aca gga gta aat tgg aac ttg gct aga cag cga ttg 95657
Gly Glu Ile Thr Gly Val Asn Trp Asn Leu Ala Arg Gln Arg Leu
7285 7290 7295
tat caa tgg acc ggc gat ttt cgg gga ctt ccc aca caa tta tcc 95702
Tyr Gln Trp Thr Gly Asp Phe Arg Gly Leu Pro Thr Gln Leu Ser
7300 7305 7310
tgc atg tat gcg gca tat acg tta att gga aca att cca tca gag 95747
Cys Met Tyr Ala Ala Tyr Thr Leu Ile Gly Thr Ile Pro Ser Glu
7315 7320 7325
tct gtg cgt tat aca aga cgc atg gaa cgg ttc gga ggt tat aac 95792
Ser Val Arg Tyr Thr Arg Arg Met Glu Arg Phe Gly Gly Tyr Asn
7330 7335 7340
gtg cca act att tgg tta gag ggg gtt gtg tgg ggg ggt aca aat 95837
Val Pro Thr Ile Trp Leu Glu Gly Val Val Trp Gly Gly Thr Asn
7345 7350 7355
aca tgg aac gaa tgt tat tat taa agcatgtatg taaaataaac tgaatttaac 95891
Thr Trp Asn Glu Cys Tyr Tyr
7360
atagcgtggg ttttgcgtga tattatatac tggggagggg caggctgtac gtaaccatat 95951
ataagggagt ctacaatatt gtagaactaa ctcagctgtg agtttagggt ttaaaggttt 96011
attccggagc ctaaatacgt tatccgtt atg gag ttc aaa aga att ttt aat 96063
Met Glu Phe Lys Arg Ile Phe Asn
7365 7370
acg gtt cat gac att ata aac cga tta tgt caa cat ggc tac aag 96108
Thr Val His Asp Ile Ile Asn Arg Leu Cys Gln His Gly Tyr Lys
7375 7380 7385
gaa tac atc att ccg ccc gaa tca acc aca ccg gtg gaa tta atg 96153
Glu Tyr Ile Ile Pro Pro Glu Ser Thr Thr Pro Val Glu Leu Met
7390 7395 7400
gag tat att agc act atc gtc tca aaa ctt aag gcg gtg acg cga 96198
Glu Tyr Ile Ser Thr Ile Val Ser Lys Leu Lys Ala Val Thr Arg
7405 7410 7415
caa gat gag cga gtg tac cga tgt tgt gga gaa ctt atc cat tgc 96243
Gln Asp Glu Arg Val Tyr Arg Cys Cys Gly Glu Leu Ile His Cys
7420 7425 7430
cgt att aac cta cga tcc gtt tcc atg gaa acg tgg ttg act tcc 96288
Arg Ile Asn Leu Arg Ser Val Ser Met Glu Thr Trp Leu Thr Ser
7435 7440 7445
cca att ctc tgt tta act ccc cga gtc cgc caa gca att gaa ggg 96333
Pro Ile Leu Cys Leu Thr Pro Arg Val Arg Gln Ala Ile Glu Gly
7450 7455 7460
cgg agg gac gaa att cgt cgg gct ata tta gaa ccg ttt ttg aaa 96378
Arg Arg Asp Glu Ile Arg Arg Ala Ile Leu Glu Pro Phe Leu Lys
7465 7470 7475
gat caa tac ccc gct tta gct acc ctt gga cta cag tct gct tta 96423
Asp Gln Tyr Pro Ala Leu Ala Thr Leu Gly Leu Gln Ser Ala Leu
7480 7485 7490
aag tac gaa gac ttt tat tta act aag tta gag gaa ggt aaa tta 96468
Lys Tyr Glu Asp Phe Tyr Leu Thr Lys Leu Glu Glu Gly Lys Leu
7495 7500 7505
gag tcg ctt tgc caa ttc ttt tta aga ctg gcg gcc acc gtg aca 96513
Glu Ser Leu Cys Gln Phe Phe Leu Arg Leu Ala Ala Thr Val Thr
7510 7515 7520
aca gaa atc gta aac ctg cct aaa atc gca act ctt att ccc gga 96558
Thr Glu Ile Val Asn Leu Pro Lys Ile Ala Thr Leu Ile Pro Gly
7525 7530 7535
ata aat gat ggt tat aca tgg act gat gtc tgt cgg gta ttt ttc 96603
Ile Asn Asp Gly Tyr Thr Trp Thr Asp Val Cys Arg Val Phe Phe
7540 7545 7550
aca gcg ttg gca tgt cag aaa att gtc ccg gct aca ccg gtt atg 96648
Thr Ala Leu Ala Cys Gln Lys Ile Val Pro Ala Thr Pro Val Met
7555 7560 7565
atg ttt tta ggt cga gag acc ggg gca acg gcc agt tgt tat tta 96693
Met Phe Leu Gly Arg Glu Thr Gly Ala Thr Ala Ser Cys Tyr Leu
7570 7575 7580
atg gac ccg gaa tcc atc act gtt ggg aga gct gtt cga gct atc 96738
Met Asp Pro Glu Ser Ile Thr Val Gly Arg Ala Val Arg Ala Ile
7585 7590 7595
aca ggc gat gtg gga acg gta tta caa agt cga ggt gga gtg gga 96783
Thr Gly Asp Val Gly Thr Val Leu Gln Ser Arg Gly Gly Val Gly
7600 7605 7610
att tct cta cag agt ctg aat tta ata cct acg gaa aat caa acg 96828
Ile Ser Leu Gln Ser Leu Asn Leu Ile Pro Thr Glu Asn Gln Thr
7615 7620 7625
aaa ggt ctt ctt gca gtt tta aaa ctt tta gat tgc atg gtt atg 96873
Lys Gly Leu Leu Ala Val Leu Lys Leu Leu Asp Cys Met Val Met
7630 7635 7640
gca att aac agt gat tgt gaa cga cca act gga gtt tgt gtt tac 96918
Ala Ile Asn Ser Asp Cys Glu Arg Pro Thr Gly Val Cys Val Tyr
7645 7650 7655
ata gaa cca tgg cac gtc gat cta caa act gtt ttg gcc aca cgt 96963
Ile Glu Pro Trp His Val Asp Leu Gln Thr Val Leu Ala Thr Arg
7660 7665 7670
gga atg ttg gtt cgt gat gaa ata ttt cga tgt gat aac ata ttt 97008
Gly Met Leu Val Arg Asp Glu Ile Phe Arg Cys Asp Asn Ile Phe
7675 7680 7685
tgt tgt tta tgg acc cca gat tta ttt ttt gaa aga tac cta agc 97053
Cys Cys Leu Trp Thr Pro Asp Leu Phe Phe Glu Arg Tyr Leu Ser
7690 7695 7700
tat cta aaa ggg gct agt aat gtt cag tgg act ctt ttt gat aac 97098
Tyr Leu Lys Gly Ala Ser Asn Val Gln Trp Thr Leu Phe Asp Asn
7705 7710 7715
aga gcc gat atc ctt cga aca tta cac ggg gag gca ttc act tca 97143
Arg Ala Asp Ile Leu Arg Thr Leu His Gly Glu Ala Phe Thr Ser
7720 7725 7730
acc tat tta cgt tta gag aga gaa gga tta ggc gtt tct tct gtt 97188
Thr Tyr Leu Arg Leu Glu Arg Glu Gly Leu Gly Val Ser Ser Val
7735 7740 7745
ccc att caa gat atc gca ttc aca atc ata cgc agt gct gct gta 97233
Pro Ile Gln Asp Ile Ala Phe Thr Ile Ile Arg Ser Ala Ala Val
7750 7755 7760
aca gga agc ccc ttt tta atg ttc aaa gat gcc tgt aat cgt aat 97278
Thr Gly Ser Pro Phe Leu Met Phe Lys Asp Ala Cys Asn Arg Asn
7765 7770 7775
tat cat atg aat acc caa gga aat gct atc acg ggg tca aat ttg 97323
Tyr His Met Asn Thr Gln Gly Asn Ala Ile Thr Gly Ser Asn Leu
7780 7785 7790
tgt acg gaa att gtt caa aag gca gac gct cat caa cat ggc gta 97368
Cys Thr Glu Ile Val Gln Lys Ala Asp Ala His Gln His Gly Val
7795 7800 7805
tgt aat ctt gcc agc ata aat ctt aca acg tgc tta tcc aaa ggc 97413
Cys Asn Leu Ala Ser Ile Asn Leu Thr Thr Cys Leu Ser Lys Gly
7810 7815 7820
cca gtg tca ttt aat tta aac gac ctt caa ttg aca gca aga acg 97458
Pro Val Ser Phe Asn Leu Asn Asp Leu Gln Leu Thr Ala Arg Thr
7825 7830 7835
act gtt att ttt tta aac ggg gtc ctg gcg gct ggg aac ttt cca 97503
Thr Val Ile Phe Leu Asn Gly Val Leu Ala Ala Gly Asn Phe Pro
7840 7845 7850
tgt aaa aaa tca tgt aaa ggt gta aaa aac aac cga tca ctt ggc 97548
Cys Lys Lys Ser Cys Lys Gly Val Lys Asn Asn Arg Ser Leu Gly
7855 7860 7865
att ggc ata caa ggg tta cat aca act tgt ctc cgc tta gga ttt 97593
Ile Gly Ile Gln Gly Leu His Thr Thr Cys Leu Arg Leu Gly Phe
7870 7875 7880
gat tta act tcc caa cca gct aga cgg tta aat gta caa ata gcg 97638
Asp Leu Thr Ser Gln Pro Ala Arg Arg Leu Asn Val Gln Ile Ala
7885 7890 7895
gag tta atg ttg tat gag aca atg aaa aca agc atg gaa atg tgt 97683
Glu Leu Met Leu Tyr Glu Thr Met Lys Thr Ser Met Glu Met Cys
7900 7905 7910
aag att ggc ggc tta gcc ccg ttt aag ggt ttt acc gaa agt aaa 97728
Lys Ile Gly Gly Leu Ala Pro Phe Lys Gly Phe Thr Glu Ser Lys
7915 7920 7925
tat gct aag gga tgg tta cac caa gat ggg ttt tct acg ata agt 97773
Tyr Ala Lys Gly Trp Leu His Gln Asp Gly Phe Ser Thr Ile Ser
7930 7935 7940
tat tta gat tta cca tgg tgt acc ctg cga gat gat att tgc gct 97818
Tyr Leu Asp Leu Pro Trp Cys Thr Leu Arg Asp Asp Ile Cys Ala
7945 7950 7955
tat ggg tta tac aac tcg cag ttc tta gcg tta atg ccc aca gtt 97863
Tyr Gly Leu Tyr Asn Ser Gln Phe Leu Ala Leu Met Pro Thr Val
7960 7965 7970
tca tct gca cag gta acg gag tgc agt gag ggt ttc tct cca att 97908
Ser Ser Ala Gln Val Thr Glu Cys Ser Glu Gly Phe Ser Pro Ile
7975 7980 7985
tat aat aat atg ttt agt aag gtc acc acc tcg ggt gag tta ctt 97953
Tyr Asn Asn Met Phe Ser Lys Val Thr Thr Ser Gly Glu Leu Leu
7990 7995 8000
aga ccc aac tta gac ctt atg gac gaa cta aga gat atg tat tca 97998
Arg Pro Asn Leu Asp Leu Met Asp Glu Leu Arg Asp Met Tyr Ser
8005 8010 8015
tgt gaa gaa aaa cga ctg gaa gtt ata aac ata ctt gag aaa aac 98043
Cys Glu Glu Lys Arg Leu Glu Val Ile Asn Ile Leu Glu Lys Asn
8020 8025 8030
caa tgg tca gta ata cgt tcg ttt ggc tgt tta tct aat agt cac 98088
Gln Trp Ser Val Ile Arg Ser Phe Gly Cys Leu Ser Asn Ser His
8035 8040 8045
cca ctc tta aaa tat aaa aca gcg ttt gaa tat gag caa gag gat 98133
Pro Leu Leu Lys Tyr Lys Thr Ala Phe Glu Tyr Glu Gln Glu Asp
8050 8055 8060
ctc gtt gat atg tgt gca gaa agg gcg cca ttt att gac caa agt 98178
Leu Val Asp Met Cys Ala Glu Arg Ala Pro Phe Ile Asp Gln Ser
8065 8070 8075
caa tca atg act tta ttt att gag gaa cgc cca gac ggg aca att 98223
Gln Ser Met Thr Leu Phe Ile Glu Glu Arg Pro Asp Gly Thr Ile
8080 8085 8090
ccc gcc tcc aaa ata atg aat ttg ctt ata cgt gcc tat aaa gcc 98268
Pro Ala Ser Lys Ile Met Asn Leu Leu Ile Arg Ala Tyr Lys Ala
8095 8100 8105
ggc ctt aaa acg ggt atg tac tac tgt aaa att cgt aaa gct acg 98313
Gly Leu Lys Thr Gly Met Tyr Tyr Cys Lys Ile Arg Lys Ala Thr
8110 8115 8120
aac agc gga ctg ttt gcg gga ggc gaa tta acc tgt acc agt tgt 98358
Asn Ser Gly Leu Phe Ala Gly Gly Glu Leu Thr Cys Thr Ser Cys
8125 8130 8135
gct tta taa atttacacgg gaaactattc caaa atg gat cag aaa gat tgc 98409
Ala Leu Met Asp Gln Lys Asp Cys
8140 8145
agt cat ttt ttt tac agg ccg gag tgt cca gat ata aac aat tta 98454
Ser His Phe Phe Tyr Arg Pro Glu Cys Pro Asp Ile Asn Asn Leu
8150 8155 8160
cgt gcc ctg agc att tcg aat cgt tgg tta gaa agc gat ttt atc 98499
Arg Ala Leu Ser Ile Ser Asn Arg Trp Leu Glu Ser Asp Phe Ile
8165 8170 8175
att gaa gat gat tat caa tac ttg gac tgt tta acg gaa gat gaa 98544
Ile Glu Asp Asp Tyr Gln Tyr Leu Asp Cys Leu Thr Glu Asp Glu
8180 8185 8190
cta ata ttc tac aga ttt att ttt aca ttt tta tcg gcg gca gat 98589
Leu Ile Phe Tyr Arg Phe Ile Phe Thr Phe Leu Ser Ala Ala Asp
8195 8200 8205
gat ctg gta aat gtt aat ttg ggc tct cta acc caa ctc ttt tcc 98634
Asp Leu Val Asn Val Asn Leu Gly Ser Leu Thr Gln Leu Phe Ser
8210 8215 8220
caa aag gat att cac cat tac tac att gaa caa gag tgc atc gag 98679
Gln Lys Asp Ile His His Tyr Tyr Ile Glu Gln Glu Cys Ile Glu
8225 8230 8235
gtt gtc cac gcg cgt gtc tat agt caa att caa cta atg ttg ttc 98724
Val Val His Ala Arg Val Tyr Ser Gln Ile Gln Leu Met Leu Phe
8240 8245 8250
aga ggg gat gaa tcg ttg cgg gta caa tac gta aat gtc act att 98769
Arg Gly Asp Glu Ser Leu Arg Val Gln Tyr Val Asn Val Thr Ile
8255 8260 8265
aat aat ccg tcg atc caa caa aaa gta caa tgg ttg gaa gaa aag 98814
Asn Asn Pro Ser Ile Gln Gln Lys Val Gln Trp Leu Glu Glu Lys
8270 8275 8280
gta cgg gac aac cca tcc gtt gca gaa aaa tat ata cta atg att 98859
Val Arg Asp Asn Pro Ser Val Ala Glu Lys Tyr Ile Leu Met Ile
8285 8290 8295
ctt ata gag ggc att ttt ttt gta tca tcg ttc gcg gct att gca 98904
Leu Ile Glu Gly Ile Phe Phe Val Ser Ser Phe Ala Ala Ile Ala
8300 8305 8310
tat tta cgc aat aac gga cta ttt gtt gta act tgt caa ttt aac 98949
Tyr Leu Arg Asn Asn Gly Leu Phe Val Val Thr Cys Gln Phe Asn
8315 8320 8325
gac ctt ata agc cga gat gaa gcc ata cat acc agc gca tcg tgt 98994
Asp Leu Ile Ser Arg Asp Glu AlaIle His Thr Ser Ala Ser Cys
8330 8335 8340
tgt ata tac aat aac tat gta ccc gaa aaa ccc gct atc acc aga 99039
CysIle Tyr Asn Asn Tyr Val Pro Glu Lys Pro Ala Ile Thr Arg
8345 8350 8355
ata cat caa ctg ttt tcg gaa gcc gtt gaa atc gag tgt gcg ttt 99084
Ile His Gln Leu Phe Ser Glu Ala Val Glu Ile Glu Cys Ala Phe
8360 8365 8370
tta aaa tcc cat gca ccc aaa acc cgt ttg gtg aac gtc gat gca 99129
Leu Lys Ser His Ala Pro Lys Thr Arg Leu Val Asn Val Asp Ala
8375 8380 8385
att aca caa tac gtg aaa ttc agc gcg gac agg ctt tta tca gcg 99174
Ile Thr Gln Tyr Val Lys Phe Ser Ala Asp Arg Leu Leu Ser Ala
8390 8395 8400
att aat gta cca aaa cta ttt aac acc cca cct ccc gat tcg gac 99219
Ile Asn Val Pro Lys Leu Phe Asn Thr Pro Pro Pro Asp Ser Asp
8405 8410 8415
ttt cca ctt gca ttt atg att gca gat aaa aac aca aat ttt ttt 99264
Phe Pro Leu Ala Phe MetIle Ala Asp Lys Asn Thr Asn Phe Phe
8420 8425 8430
gag aga cac agt aca tct tat gcg ggc aca gtg ata aac gat tta 99309
Glu Arg His Ser Thr Ser Tyr Ala Gly Thr Val Ile Asn Asp Leu
8435 8440 8445
taa catgtatata cgagcaaaat aaaacaatga accattaagt cgctcttatg 99362
tgtgttttaa ttccaatatt ttgttaatac agtgtttagt gggagtggag taggaataaa 99422
ctgtttaaaa atacgtgcgt attggcgagc cattttttcc ccctttaagt tatgatataa 99482
ggtgttaata accatatatg ggtctgccgt attttgcata atatttacac gtttcattaa 99542
ctttaggcga ccgcgggtca atggagttac cacggcgatt atatgttcaa caaatgcgcg 99602
ttccaaatta tccaatgggg aggatgaggg tggcagaagg tttaaagccc aactggcgtc 99662
tactgtcacc tccgatcggt tagatgcata tttggaaacg gacactccgt tttctgtttt 99722
acccgacgtg gctacggtaa gacttttgaa aacctcgcca tgtctccacg agtcgtaagt 99782
gggggagcgc gttgaagtgt ccatttgatg tggaactttc aggccggtat cctgaataac 99842
ttgctgaaca gattttaggt tgtcacttgt atgcaaatcg gtctgacagc gaacaaaggc 99902
aaccaaaaat tcagggtatg taattcctaa atattgaagc aaatcgcgac atcgtactac 99962
tggagcaaac ataggaattg catctaacaa aatatcacag cccatgaaga gtaaatcagt 100022
atctgtcgta taaaccaaag ccactgtacg cgtatgaaat aagtttgcac atgcctcgtc 100082
cgcctcaatg tcaaccgcct cgacgtatgc ataccccatc catctaataa gacttgcaca 100142
caatttgtga tataatcgat ggttaggatg tcctcgggat ggcattgtag aggtaagact 100202
gcatacgttt tggtcgcgca ggcagatcgt cttgtcagtt tgtcgtggac aaccgtccaa 100262
aaacatatcc tgggatttgt tttcttcgct aggaatacaa ctcgtggaag agtttcgaaa 100322
cacactgtca tatgtttccg atgttttgtt atacttgata gaacatgatg tgttggatga 100382
tagtagaaca cgggttaagt ccgaggttcc cctttcatcg tggttcatgg cccgtgacat 100442
aattgccttg gctccacgtg ataatgggcc gtctgtacaa cgttccaata caaatatcgg 100502
atagtatgac cgttgtgtta atagccgcaa taagactcca agacaatgta tcgttacaga 100562
tggtccgtgt aaattctcgc gtttacccac agggtataaa cgttccaaca atgtgtacat 100622
gacgttccat acgtctaccg ccacgggggt taaaactccg ggtggcgttg aaatgatgct 100682
gggtttaacc agtttatgtt catggataaa gcgtgtcagt ccaaagagcc ccatcaatac 100742
acttaaccaa gtttatatcg tcttgtgaaa ggtttcacag cggcttcagt aattcttcga 100802
tgtgatcttg tgttaattac agacgtatat aatgatgtac atatttttta tgacaaacat 100862
tacatcaaat ctggtaactt cttacgtatt ttattgcaca gtccctatta aggaacaccc 100922
cctgtattgg aacacgtgga aatgttccct cccatgggcc gtactattca aatatcatgc 100982
ctgtttatgt acttaagtca ccggatcggg ttaaacaaac atattaacga aactcgtgtg 101042
tttacatatg attacttttt ctatagtaaa cattttaagt agtaaatt atg gat ttg 101099
Met Asp Leu
agg tcg cgt aca gac gat gct ttg gac atg gaa ttg cat gcg ggt 101144
Arg Ser Arg Thr Asp Asp Ala Leu Asp Met Glu Leu His Ala Gly
8450 8455 8460
ttt gac gcc cca gaa atc gcc aga gct gtt tta acg gaa aaa acg 101189
Phe Asp Ala Pro Glu Ile Ala Arg Ala Val Leu Thr Glu Lys Thr
8465 8470 8475
ctt act ggt tta att tcg tct ata tca cct ctg gtt aat aga cta 101234
Leu Thr Gly LeuIle Ser Ser Ile Ser Pro Leu Val Asn Arg Leu
8480 8485 8490
agg gat tct att tta ata ttc agc gac gaa gga tta att att cac 101279
Arg Asp Ser Ile Leu Ile Phe Ser Asp Glu Gly Leu IleIle His
8495 8500 8505
tgt agt ttg gaa aca gaa caa ctg tat att cct ata ccg gca aat 101324
Cys Ser Leu Glu Thr Glu Gln Leu TyrIle Pro Ile Pro Ala Asn
8510 8515 8520
atg ttt gac cag tat aat tgg act ggg ccg aga atg gtt gta ctc 101369
Met Phe Asp Gln Tyr Asn Trp Thr Gly Pro Arg Met Val Val Leu
8525 8530 8535
gcg gca acg gag gga cgg tcc tcg ctt att gac gcg ttt cgc cat 101414
Ala Ala Thr Glu Gly Arg Ser Ser Leu Ile Asp Ala Phe Arg His
8540 8545 8550
aca aaa gat ccg tcg acc cca aca cgg tta tat ttt aaa ttt acc 101459
Thr Lys Asp Pro Ser Thr Pro Thr Arg Leu Tyr Phe Lys Phe Thr
8555 8560 8565
gga caa ccc ccc gag cgg agt att atc caa acg atg gta tgg caa 101504
Gly Gln Pro Pro Glu Arg Ser Ile Ile Gln Thr Met Val Trp Gln
8570 8575 8580
cgc ccg ggt gat tgt ggt cca gat gat caa gta caa tgt tac aaa 101549
Arg Pro Gly Asp Cys Gly Pro Asp Asp Gln Val Gln Cys Tyr Lys
8585 8590 8595
caa gtt gta aaa cgt gaa ctc gct tgt tat aca atg atg ttt cca 101594
Gln Val Val Lys Arg Glu Leu Ala Cys Tyr Thr Met Met Phe Pro
8600 8605 8610
aat cta act cca gat ata agc att tgc tta aaa cgc gat caa ttc 101639
Asn Leu Thr Pro Asp Ile Ser Ile Cys Leu Lys Arg Asp Gln Phe
8615 8620 8625
acc cgt tta cag cga cta ctt aaa act ttt ggg ttt aca aca tgc 101684
Thr Arg Leu Gln Arg Leu Leu Lys Thr Phe Gly Phe Thr Thr Cys
8630 8635 8640
ttc att cta aca gcc acg gat atg tac atc cag acc gcc ggg ggt 101729
Phe Ile Leu Thr Ala Thr Asp Met Tyr Ile Gln Thr Ala Gly Gly
8645 8650 8655
ggt ttt atc tca ttt aat gtt tcc ttg gat ata aac gga agc aag 101774
Gly Phe Ile Ser Phe Asn Val Ser Leu Asp Ile Asn Gly Ser Lys
8660 8665 8670
cct aca cca tat aat tta ata cgc tca atc aca aat tca aaa agg 101819
Pro Thr Pro Tyr Asn Leu Ile Arg Ser Ile Thr Asn Ser Lys Arg
8675 8680 8685
atc ctt aat aat gtt gtt tat ggc agc ggg agt atg cgt gaa ttt 101864
Ile Leu Asn Asn Val Val Tyr Gly Ser Gly Ser Met Arg Glu Phe
8690 8695 8700
gga gta tta ttg gaa aca cac agt gga ttc cgt tct gcc gta caa 101909
Gly Val Leu Leu Glu Thr His Ser Gly Phe Arg Ser Ala Val Gln
8705 8710 8715
aat ctt aag tta aca cgg gat gag acg tgt tat att aat ttt tat 101954
Asn Leu Lys Leu Thr Arg Asp Glu Thr Cys Tyr Ile Asn Phe Tyr
8720 8725 8730
ctc gcc tta act aac tcc ccc atg gtt gga ttg tat atc caa cgt 101999
Leu Ala Leu Thr Asn Ser Pro Met Val Gly Leu Tyr Ile Gln Arg
8735 8740 8745
tcc gca ccc gtg cat tct ttt ttt tat gca acg ttc tta agt ccc 102044
Ser Ala Pro Val His Ser Phe Phe Tyr Ala Thr Phe Leu Ser Pro
8750 8755 8760
aaa gac ctt aaa gaa aaa tta acc tcg atg caa tta ttt gcg aac 102089
Lys Asp Leu Lys Glu Lys Leu Thr Ser Met Gln Leu Phe Ala Asn
8765 8770 8775
atg gaa tct gtg aag gat gaa cca cca tta aaa aaa aga cgc aat 102134
Met Glu Ser Val Lys Asp Glu Pro Pro Leu Lys Lys Arg Arg Asn
8780 8785 8790
tta tta aca aaa aga aac gaa aaa aat acc gga aat aaa atg ggg 102179
Leu Leu Thr Lys Arg Asn Glu Lys Asn Thr Gly Asn Lys Met Gly
8795 8800 8805
ggg aaa ctc ccc gaa acc aca tgg cag gag gga atc gga att cgc 102224
Gly Lys Leu Pro Glu Thr Thr Trp Gln Glu Gly Ile GlyIle Arg
8810 8815 8820
gaa tat tgt gtg gct cct cca gtg gac cct gca gga acc ctg gat 102269
Glu Tyr Cys Val Ala Pro Pro Val Asp Pro Ala Gly Thr Leu Asp
8825 8830 8835
tat tct gaa tta tca cgt gaa tct gac gta ata tgt aca gtt aaa 102314
Tyr Ser Glu Leu Ser Arg Glu Ser Asp Val Ile Cys Thr Val Lys
8840 8845 8850
taa gtgcaacttt tgcttatatt ttacatacaa acttgtgtgt accatagatg 102367
aacacatttt tatttgtttt gaattattaa acttaagac atg gcc gtg aat ggt 102421
Met Ala Val Asn Gly
8855
gaa aga gct gtc cat gat gaa aac ctg ggt gtg tta gac aga gaa 102466
Glu Arg Ala Val His Asp Glu Asn Leu Gly Val Leu Asp Arg Glu
8860 8865 8870
tta atc cgc gct caa tca atc caa gga tgt gtc gga aac cct caa 102511
Leu Ile Arg Ala Gln Ser Ile Gln Gly Cys Val Gly Asn Pro Gln
8875 8880 8885
gaa tgt aat tcg tgt gca ata acc tca gca tcg cgg ttg ttt ctc 102556
Glu Cys Asn Ser Cys Ala Ile Thr Ser Ala Ser Arg Leu Phe Leu
8890 8895 8900
gtg gga cta caa gca agc gtt atc acg tcc ggg tta att tta caa 102601
Val Gly Leu Gln Ala Ser Val Ile Thr Ser Gly Leu Ile Leu Gln
8905 8910 8915
tat cac gtc tgc gaa gct gcc gtc aat gca act att atg ggg ttg 102646
Tyr His Val Cys Glu Ala Ala Val Asn Ala Thr Ile Met Gly Leu
8920 8925 8930
atc gtc gtt tcg ggg tta tgg cca aca tcc gtg aaa ttt cta cgc 102691
Ile Val Val Ser Gly Leu Trp Pro Thr Ser Val Lys Phe Leu Arg
8935 8940 8945
aca tta gca aaa ttg gga cga tgt ttg cag acg gtg gtc gtg ttg 102736
Thr Leu Ala Lys Leu Gly Arg Cys Leu Gln Thr Val Val Val Leu
8950 8955 8960
ggt ttt gct gtg tta tgg gcg gtt ggt tgc cca ata tcc cgg gat 102781
Gly Phe Ala Val Leu Trp Ala Val Gly Cys Pro Ile Ser Arg Asp
8965 8970 8975
ctt cca ttt gta gaa tta ctg gga att tcc ata tcc gcg att acc 102826
Leu Pro Phe Val Glu Leu Leu Gly Ile Ser Ile Ser Ala Ile Thr
8980 8985 8990
gga aca gtg gct gct gtg cat atc cat tac tac aac ttt gtt acg 102871
Gly Thr Val Ala Ala Val His Ile His Tyr Tyr Asn Phe Val Thr
8995 9000 9005
aca ttc aat gga ccg cat att tat ttt tat gtt atg atg ttg gga 102916
Thr Phe Asn Gly Pro His Ile Tyr Phe Tyr Val Met Met Leu Gly
9010 9015 9020
act ggg ttg gga ggt tta cta acc gtt att tta tat atg tat gtc 102961
Thr Gly Leu Gly Gly Leu Leu Thr Val Ile Leu Tyr Met Tyr Val
9025 9030 9035
agt aaa tat gag gtt ctt att gga ttg tgt ata tct att gtc aca 103006
Ser Lys Tyr Glu Val Leu Ile Gly Leu Cys Ile Ser Ile Val Thr
9040 9045 9050
cta gtt tca att gtc gat gcc gcc acc gat ttg caa gat acg tgt 103051
Leu Val Ser Ile Val Asp Ala Ala Thr Asp Leu Gln Asp Thr Cys
9055 9060 9065
ata tat cgt aaa aat cgc cat aag caa tta aac act tat aca gat 103096
Ile Tyr Arg Lys Asn Arg His Lys Gln Leu Asn Thr Tyr Thr Asp
9070 9075 9080
tta ggt ttt gcc gtt gta tat aca caa aat gac cgc ggg aga gta 103141
Leu Gly Phe Ala Val Val Tyr Thr Gln Asn Asp Arg Gly Arg Val
9085 9090 9095
tgt gac cat cga gaa agt tcc cgg acc ctt aaa cgc gtg ttt aaa 103186
Cys Asp His Arg Glu Ser Ser Arg Thr Leu Lys Arg Val Phe Lys
9100 9105 9110
gga att cgt ata atg tct gtt ata ccc ccg gtg tta tat ata gtt 103231
Gly Ile Arg Ile Met Ser Val Ile Pro Pro Val Leu Tyr Ile Val
9115 9120 9125
acc cca tta atg tgg gca atc tca cat ata att aaa tta aat cat 103276
Thr Pro Leu Met Trp Ala Ile Ser His Ile Ile Lys Leu Asn His
9130 9135 9140
ttt atc aaa ctt aca caa gta acg tta gca gtt tca ata gga ggt 103321
Phe Ile Lys Leu Thr Gln Val Thr Leu Ala Val Ser Ile Gly Gly
9145 9150 9155
cat att ata gca ttt ggg tta cag ggt ttt gcc gtt tta tat caa 103366
His Ile Ile Ala Phe Gly Leu Gln Gly Phe Ala Val Leu Tyr Gln
9160 9165 9170
gaa aaa aaa aac cta tgg gta att gta tta tat aca acg acc tcg 103411
Glu Lys Lys Asn Leu Trp Val Ile Val Leu Tyr Thr Thr Thr Ser
9175 9180 9185
gtg acg ggt ata gct gta aca ttt gcc ggc att tca tgg gga gct 103456
Val Thr Gly Ile Ala Val Thr Phe Ala Gly Ile Ser Trp Gly Ala
9190 9195 9200
att ata att cta aca tca aca gtt gcg gcg ggt ttg acg tgt att 103501
Ile Ile Ile Leu Thr Ser Thr Val Ala Ala Gly Leu Thr Cys Ile
9205 9210 9215
cag atg atg aga cta agc gtt aaa cct att gac tgt ttt atg gca 103546
Gln Met Met Arg Leu Ser Val Lys Pro Ile Asp Cys Phe Met Ala
9220 9225 9230
tct cat atc act aaa gta tat cac gtg tgt gtt tat att ata ata 103591
Ser His Ile Thr Lys Val Tyr His Val Cys Val Tyr Ile Ile Ile
9235 9240 9245
aat cta tgc tat cta tgt ggt aca tat gta tcg taa tcgagataaa 103637
Asn Leu Cys Tyr Leu Cys Gly Thr Tyr Val Ser
9250 9255
taaagttttt aaagttgcaa aagccgtttt tattattccc aatgtcgaaa aaaacgtttc 103697
catcatttaa attccgcggt gggtgtttta atcttttatt taaggggagc gtggatgtgt 103757
caataaaaac cagg atg aag cgg ata caa ata aat tta att tta acg 103804
Met Lys Arg Ile Gln Ile Asn Leu Ile Leu Thr
9260 9265 9270
atc gcg tgt ata caa tta tcg act gaa tct caa ccc aca ccc gta 103849
Ile Ala Cys Ile Gln Leu Ser Thr Glu Ser Gln Pro Thr Pro Val
9275 9280 9285
agt ata act gaa tta tat acc tcg gcc gct acc cga aag ccc gat 103894
Ser Ile Thr Glu Leu Tyr Thr Ser Ala Ala Thr Arg Lys Pro Asp
9290 9295 9300
ccc gcc gtc gcg ccc acc tcg gcc gct tcc cga aag ccc gat ccc 103939
Pro Ala Val Ala Pro Thr Ser Ala Ala Ser Arg Lys Pro Asp Pro
9305 9310 9315
gcc gtc gcg ccc acc tcg gcc gct tcc cga aag ccc gat ccc gcc 103984
Ala Val Ala Pro Thr Ser Ala Ala Ser Arg Lys Pro Asp Pro Ala
9320 9325 9330
gtc gcg ccc acc tcg gcc gct tcc cga aag ccc gat ccc gcc gtc 104029
Val Ala Pro Thr Ser Ala Ala Ser Arg Lys Pro Asp Pro Ala Val
9335 9340 9345
gcg ccc acc tcg gcc gct acc cga aag ccc gat ccc gcc gtc gcg 104074
Ala Pro Thr Ser Ala Ala Thr Arg Lys Pro Asp Pro Ala Val Ala
9350 9355 9360
ccc acc tcg gcc gct tcc cga aag ccc gat ccc gcc gtc gcg ccc 104119
Pro Thr Ser Ala Ala Ser Arg Lys Pro Asp Pro Ala Val Ala Pro
9365 9370 9375
acc tcg gcc gct acc cga aag ccc gat ccc gcc gtc gcg ccc acc 104164
Thr Ser Ala Ala Thr Arg Lys Pro Asp Pro Ala Val Ala Pro Thr
9380 9385 9390
tcg gcc gct tcc cga aag ccc gat ccc gca gcc aac acc caa cat 104209
Ser Ala Ala Ser Arg Lys Pro Asp Pro Ala Ala Asn Thr Gln His
9395 9400 9405
tca caa cca cct ttt cta tat gaa aat ata caa tgc gtt cac ggc 104254
Ser Gln Pro Pro Phe Leu Tyr Glu Asn Ile Gln Cys Val His Gly
9410 9415 9420
gga ata caa tcc ata ccc tat ttt cac aca ttt atc atg cct tgt 104299
Gly Ile Gln Ser Ile Pro Tyr Phe His Thr Phe Ile Met Pro Cys
9425 9430 9435
tac atg cgt cta acg acc gga caa cag gcg gcc ttt aag cag caa 104344
Tyr Met Arg Leu Thr Thr Gly Gln Gln Ala Ala Phe Lys Gln Gln
9440 9445 9450
caa aaa aca tat gaa caa tat tct tta gat ccg gaa ggt tca aat 104389
Gln Lys Thr Tyr Glu Gln Tyr Ser Leu Asp Pro Glu Gly Ser Asn
9455 9460 9465
ata aca agg tgg aag tcg ctt ata cgc ccc gat ctt cat att gaa 104434
Ile Thr Arg Trp Lys Ser Leu Ile Arg Pro Asp Leu His Ile Glu
9470 9475 9480
gtt tgg ttt acg cgt cac ctt ata gat ccg cac cgt caa ctg ggc 104479
Val Trp Phe Thr Arg His Leu Ile Asp Pro His Arg Gln Leu Gly
9485 9490 9495
aat gcg tta ata cgc atg cca gat tta ccg gtt atg tta tat agc 104524
Asn Ala Leu Ile Arg Met Pro Asp Leu Pro Val Met Leu Tyr Ser
9500 9505 9510
aac agt gcc gat tta aac tta ata aac aac cct gag ata ttt aca 104569
Asn Ser Ala Asp Leu Asn Leu Ile Asn Asn Pro Glu Ile Phe Thr
9515 9520 9525
cac gct aag gaa aat tat gta ata cca gat gtt aaa aca acg tct 104614
His Ala Lys Glu Asn Tyr Val Ile Pro Asp Val Lys Thr Thr Ser
9530 9535 9540
gat ttt tct gta aca att tta tct atg gat gct acc acg gag gga 104659
Asp Phe Ser Val Thr Ile Leu Ser Met Asp Ala Thr Thr Glu Gly
9545 9550 9555
acg tat att tgg cga gtc gtt aat aca aaa act aag aac gtc ata 104704
Thr Tyr Ile Trp Arg Val Val Asn Thr Lys Thr Lys Asn Val Ile
9560 9565 9570
tcg gaa cac agt att aca gtt aca acg tat tat cgt cca aat att 104749
Ser Glu His Ser Ile Thr Val Thr Thr Tyr Tyr Arg Pro Asn Ile
9575 9580 9585
acc gtt gtc ggc gat cca gtc tta acc gga cag aca tac gca gcc 104794
Thr Val Val Gly Asp Pro Val Leu Thr Gly Gln Thr Tyr Ala Ala
9590 9595 9600
tac tgt aac gta tca aag tat tat cca ccg cac tcg gta cgt gtt 104839
Tyr Cys Asn Val Ser Lys Tyr Tyr Pro Pro His Ser Val Arg Val
9605 9610 9615
cgg tgg act tca agg ttt ggt aac atc gga aaa aat ttt ata acc 104884
Arg Trp Thr Ser Arg Phe Gly Asn Ile Gly Lys Asn Phe Ile Thr
9620 9625 9630
gat gca ata caa gaa tat gcc aat gga tta ttt agt tat gtt tcg 104929
Asp Ala Ile Gln Glu Tyr Ala Asn Gly Leu Phe Ser Tyr Val Ser
9635 9640 9645
gcg gta cga att cca cag caa aaa caa atg gat tac cca ccc cca 104974
Ala Val Arg Ile Pro Gln Gln Lys Gln Met Asp Tyr Pro Pro Pro
9650 9655 9660
gcc atc caa tgt aat gtt tta tgg att cgg gat ggc gtc tct aat 105019
Ala Ile Gln Cys Asn Val Leu Trp Ile Arg Asp Gly Val Ser Asn
9665 9670 9675
atg aaa tat tct gct gtc gtt acc cct gac gtc tat cca ttt ccc 105064
Met Lys Tyr Ser Ala Val Val Thr Pro Asp Val Tyr Pro Phe Pro
9680 9685 9690
aac gtg tct ata ggt att att gat gga cac ata gta tgt acg gca 105109
Asn Val Ser Ile Gly Ile Ile Asp Gly His Ile Val Cys Thr Ala
9695 9700 9705
aaa tgt gtg cca cgt ggc gtt gta cat ttc gta tgg tgg gtt aac 105154
Lys Cys Val Pro Arg Gly Val Val His Phe Val Trp Trp Val Asn
9710 9715 9720
gat tct ccc att aac cac gaa aac agt gag att act ggg gtg tgt 105199
Asp Ser Pro Ile Asn His Glu Asn Ser Glu Ile Thr Gly Val Cys
9725 9730 9735
gat caa aac aaa cgg ttt gta aac atg caa agt tct tgt cca aca 105244
Asp Gln Asn Lys Arg Phe Val Asn Met Gln Ser Ser Cys Pro Thr
9740 9745 9750
tcg gaa ctc gac gga cct atc acc tat tcg tgt cat cta gat ggt 105289
Ser Glu Leu Asp Gly Pro Ile Thr Tyr Ser Cys His Leu Asp Gly
9755 9760 9765
tac cct aaa aaa ttc cct ccg ttt tcg gcc gtt tat acc tac gat 105334
Tyr Pro Lys Lys Phe Pro Pro Phe Ser Ala Val Tyr Thr Tyr Asp
9770 9775 9780
gca tct acc tac gcc act aca ttt tcc gtt gta gca gtt ata att 105379
Ala Ser Thr Tyr Ala Thr Thr Phe Ser Val Val Ala Val Ile Ile
9785 9790 9795
ggt gtg ata tct atc ctt ggg aca ttg ggt ctt atc gca gtt atc 105424
Gly Val Ile Ser Ile Leu Gly Thr Leu Gly Leu Ile Ala Val Ile
9800 9805 9810
gca acc cta tgc atc cgt tgc tgt tca taa acagaaacca accaaacgcg 105474
Ala Thr Leu Cys Ile Arg Cys Cys Ser
9815
tctgtgtata tcattttatt acattcgcaa cacatctact gtcttgacaa catttaaaaa 105534
tccattaaag agccatttcc atttttaggg gggggtgtgg attatatcca tcaagctgaa 105594
aatcgtccca tttaaagtcg tttatatctg ttacatttcg aataatttta aggcaaggaa 105654
aaggttttgg ggatcgagct agctgcactt ttaaagcatc tatatgattc aagtaaatat 105714
gtgcatcccc cattgtatga attaaatctc cggttttaag tcctgtaaca tgcgctacta 105774
tgtaggtaag aagtgcatat ccagcaatgt tgaacggtac cccaaggccc atatcccccg 105834
atctctggta tacttggcag gataattcac cgtttgcaac gtaaaactga cataacgtgt 105894
gacatggagg tagtaccatt aaggggatat cctttggatt ccaagacgat ataatcattc 105954
gtcggctttc tgggtttgtt ttaattgtat ctataacagt ttgcagctga tcgattcctt 106014
gctgtaaata gtttgattga cagtctttat attccgctcc aaaatgtctc cactggaagc 106074
cgtaaatggg gccaaggtcc cccgtgtgtc ttttatggaa gccattccta tttagaaatt 106134
tgctcgatcc gtatatatcc catatgtgta tatctttagc ggcgagttct ttggaatcgg 106194
ttgacccgcg gataaaccat aacaactctt ccacgacggc cctccaaaaa acacgctttg 106254
tagttaaaag aggaaattca tttcgcaaat tgtatcgagc ttgcattcca aataaagata 106314
acgttccgat tcctgttcga tcgcgtttcc gaactccata ccttaaaata tcatccactt 106374
gttttaagta ctgaagttcg ccggttaacg taaaacccgg cacctttgtc caacatgaca 106434
agtctcccat ggtaccggta ataatcgtta aatacaaacg accacttgat attgtgggta 106494
cattaagtaa aatttaaagg agtaattcct tttataaccc aatcaaccaa tcagaccttt 106554
aaataacgca gtccaattat tgaacagaaa atacgcaata gactattttc tcccaaatcc 106614
cccaatttta acctggttgt ggaaatgacc gccaaagtgc ctagtccata actatctatt 106674
aaccgctgta cttaatgatg actcttaggc gtatttttcc taaacgtaac cgtggtttta 106734
catctaacgg gaacgtcgtt gggtataaaa tctggtgata agtcatcgtc tgtatgcatg 106794
tctccggagg tgtacccgtt aaggctatcc acatccaggt ctgaagatga gttaaaaccc 106854
acttccttgg gtggtatatt ttcatatacg tgatcattta aagatgcttg ggtttctatg 106914
tgacgtaaat ccgatctttg gtcaaacatt gtgtttgatt catcccgaac cggaccttca 106974
tattccgcct ttatccgggt gtaatgatca ggatacgttg ggtcaactgg actaccgtcc 107034
gtatggccta tgtccagaca gtggtttttt cggacagtta catccaaacc cgtgtctgga 107094
cgaactaaaa cgcgagaagt ccgctgtcta tttgtatcgg gtgctatact gtttaacggt 107154
gcttctgtat taattacatg cgaacttgca tttacgtcgt cagataaata atttcccgtt 107214
aaacagacac attccaaatt ccctatacct gtccctattg gagatcccag tcgattaaaa 107274
cgaatatacg gctgatcaga cacactcgat ggttcggata tacgtcttaa atgtaacgga 107334
gtataccgtt tactggcggc aagaacctga gcataatact ttgtgggttt acccccatgt 107394
gctaaaatac tctgcgccac ggcccgggaa aaccacatat gggtgctggt ctctaacaca 107454
acccaactgt gtgttgacat tgttctgacc aactgacccg caaaatataa aaaccgtccc 107514
tgggcacgta gcgcctttaa tatataagga tcgtttaatc ccccatcccc ccaacatgca 107574
tatccaataa gcatcatatt aattaagtat tgaaaatggt ggtgcaaaag cgtaataagt 107634
tcaaccgccg caacaattgc agctgcggtt ccatttaaac catcccgcca ctcttccttt 107694
cgccataata cactaatacg caaaagcgca gtaagagcgc tggcagttgt tgctgctaaa 107754
ataaacgcaa ctcctgtgcc cgctgagacc gcaggacccg tttttaaatg tcgtctaagc 107814
aaatcatctg gaataacaga cgcgcgcggc cctcgaatcc gtcgaaattg aagtttgaga 107874
tggcgacata ccctggcgtc cacaacgtcc aagaaccata acatccaccc gtagccagta 107934
gccgtgtaag ccgcaatttc ttgtacacgt attcttgtat ccgggctggg gtttgtagga 107994
tcatctggac gcataaaata catatacttt tgcacagctc ctaacgcctc acgtagctcc 108054
tgacatattc cccggtaatc cgaatttata actttagatt taaaaggatg tctggatatt 108114
ggtttattgg caactacgga tgaaaacaac aaaaccgtgg atttagactt tttatttacc 108174
ttctctgaag tcaccggtac ctcggctcct gtggtgtatt tgagatactg ccaatatgta 108234
ttttcataaa cccaccaaag tccatcccca gaaagtttca ggtcatccaa acatctttga 108294
attgtttcca tataacgctg cataaacccg tcatgaacat cgttagacat acacgagacc 108354
accaacgatt caggaacatt attatggtgt aatccaagca gaacactgtg tctttcggta 108414
ctttgtaaaa ccggcgatgt taggctgtcg gtcttttcag ataaagcctt taccgccgtc 108474
tccaaagcct cgccgggcgt tggaataaga catccatccc tcacgcgctc gctggttatg 108534
cgttgggaaa tagaacacca gtcccgggta ggtgtgggca aatcacgttc gccggcatta 108594
aaactttggt aactaccatc ataagattta cgcgttctat catcgctgga aaaggaacgc 108654
gcaaaccgag aaaacattgc gatatggaca aattatatct tgtaaaggtc gatcctcacg 108714
gaccccccac acaacacaca caacaaattt taaggtctaa acagagattt tattttacaa 108774
actcctttgt gggtgtggct aggaaacgtt cttttcatcc taatgaaaaa aatcacaacc 108834
cttaatattt tcgtagtaaa tgcatggcta cgcttttcca atccaaaccc agaatttcat 108894
tcctgtattg cataagattt tcggctaggt ccacggaggg aatgggctgt tctcggggat 108954
agatggtctc caatcccaca aaacggaagt tcatctctat aggtgatgcc tgaactatcc 109014
tgtcttgctc tttgggtaaa acgtccgttg tcggtcgaat ccccaaagtt tgcattgcat 109074
catcgcggtc tttccaaaag gatgttaaag tttggtgagc gtaaagagga cttgctgcca 109134
acattaaagt attatatgca tctaaaatac ctcgggggat gtaaattgaa catcctccgt 109194
acaacgcagc accggagaga agcaacaaaa gaagatttgc atgacccaac acccgttgta 109254
acaataaaac ggcccccagg tacgcgctta ccattaaatc ggtctcactt tccttataac 109314
aatcggcaaa ctctatcata aaatttgtgg ctgctgtgcg aacagcctca tatccggtta 109374
attcgtgtgc aatagttgcg taggattcac aaacagttgg gaggtcgacc tcgcctaaca 109434
gtaatacctc taaatcacat aacaggtcac tattaaccgg gatatagtaa tagggcagag 109494
attcacagac cgcgatgtaa gccgaacggg aatggtaagg agcagctaaa tacattgcag 109554
ttgctctaca gatccagttc aacattcctc caccaagagt aacatacaac gtaaaaaatg 109614
acgctaacat ttcccgttcc gttggggtaa attgtggctt taaactatgt gtacgtgaaa 109674
ataaaaacca ttcggcaagt tccatgtgac cggtagcata tcgcaccaaa gatgtagatg 109734
gttcacttat agcaattctt agttcgggcc aatatgccaa cgccccaaaa aaaccacgca 109794
taatggcaac cgttggaccc cgatttggca aaaccaactg tgtcacttgg agtacgtcgg 109854
gtacggcttc tcgtggtaat ccggcaagat ggtcttctaa ccaacatgga tcccccacgc 109914
caaaattatc gttacgtccc aaaaaaatac agtttgcgct aatacgaatt gccgcgtcta 109974
acaaaaatcc taatccatct ccatgtgaaa tgcgattaga aatagcgcat gctgctgtgg 110034
ataatatcat gtgatgccaa atggccgttc cttggcccac agcacgtaag gacacgtcat 110094
aaaacccagg aatgtgttgt acatacattt tacccgcatt atatggaagt gcgtagacgg 110154
aattccaacg cgggccgtat ttatgtgttt tactggaagc ccgcctatct aaaaatacat 110214
cttcaactaa aatacgttct atagaaattg gctgggccat aaattcaata ggaaaaatca 110274
ataaaagttc accaagtgtc atgttaggaa gggctggtat agttctcaaa atctgtgggg 110334
gtgtcatccg ggaagtatca aactgatacc gtaaatgaat tgggtcgtat aatccttgga 110394
ccgtaatata ttcgcgacgg gtacattcgc gcatagcaaa ccaggactca tcaaacccat 110454
tatgcgctaa catagagcct gttagggtaa caggttcaac acaacgtttt gatacattaa 110514
gttcctttga taccgcaggc tccataacct ttgcaatgcg aagatcggta ctgtaactta 110574
tttccgggtc taacgtaaaa taaacctcat ccgcgtctcg actgcaaact tgacttacag 110634
aaaaataatc ctcctccgcc tctccctcgt cgtcgatcgc gtcctcctcc gccgcgtcct 110694
cctccgcctc tccctcgtcg tcgatcgcgt cctcctccgc ctctccctcc tccgcctctc 110754
cctcctccgc ctctccctcc tccgcctctc cctcgtcgtc aatcgcgtcc tcctccgcct 110814
ctccctcctc cgcctctccc tcgtcgtcaa tcgcgtcctc ctccgcctct ccctcctccg 110874
cctctccctc gtcgtcgatc gcgtcctcct ccgcctctcc ctcctccgcc tctccctcgt 110934
cgtcgatcgc gtccgtattg atgttgattt cctcattagc ttccctctcc gttaatttaa 110994
atatgcgatt ttcatgtctg gaagttgatg tattttgttc ggatccatta tatgtagaat 111054
gatgtaaggc gtatcccgga aaatcatccg atgcgtcaga actgctgtcc tgatccatct 111114
cggagttcag tagttcttga accgcaacta atgtttctga attggacaat atctggggtg 111174
ggtgtgtata ccaattggtt gaccgataac gtgttccgtg tgtgtgacgg ggggagtctg 111234
tggtattaga cgatatccgc tgtcggcggg attgcctccg gttataatga cccgactgca 111294
tacttataac cgagacaaac aaacgcgcct gtaaaacatc ccagggtcgc ggtacgcaat 111354
acgcatacac tcgttggggc ttcttttata tatgggactt tagagcacat gacagacata 111414
ccatatacgg cgcattgtaa aaataaaaaa cgcatgcacg ttttcgtaat ttatttacac 111474
cctctacccc aatgacgttt aacgcgttaa aaacccacac gtgggtgggc gtggtgggtc 111534
ccccggcagg atctcggatg gggacggagg tgctaaaatc atagcttcga tgctactacc 111594
gtagtttcta ttttcaacaa cttttgcgta aggatgatcc gctcgtacat gtctcggttc 111654
ttgaggatgt ttgaccgcat atgcgtctaa cttacgctta atgtggtggg taagaaaacc 111714
caccgaccgt ggtaaatgaa ccgaaaatga gggttgttga accaacggag atttgttttc 111774
ttcaacaaga ccacatctaa ctagaggcag tcccagttct cggcggcggt aatttatttc 111834
cctcaaggca gacgctgtta gtggtttccc ttctaataac acaatgccgt ggttgcataa 111894
tacaggatga aacgcacacg tgaactgtcg acgttcggtc caggtgaatt ttaaagccgc 111954
aaacacgtcc gggtgtgcag attgactggc gtacaaacgc caggaaaatt cacgcgttac 112014
ggttaaatat aaatgtaggt acagaagacg cgccaaactt gccacctcac gataatatct 112074
aagtagaata ctttgcctga gttgcgtgta agctttctgg tcagggtttt gtatattaag 112134
acctattgtc gtccttttcg ccgttccttg gagataacgt ataatcgatt tacaataagt 112194
aactagtagt tttgtgtatg cttcttcccg ggcgcgtagt tctacggtaa acgaatcctg 112254
gacctcctga acgtaagctg gaagccccgt cggttgcttt ggtggactag gcaatcgaac 112314
ggaaccccgg gttgttaaat taaacatttc aacatggtct ttggttgaaa cggttgagat 112374
aacgtcatct ggatccgggg ataaaaccat catatcggaa tataggtcct cattaatagg 112434
aaaacatgag aataaatcct cgttccaggt ttcaagacat gatagtaaac gcggcccttc 112494
tgaaaaatca agatcccgta ttaactgttg atataaaatt ttagggctag ctacccaagg 112554
cggagacgga gcagttttaa ctgcatgtga atataacgac gtttcaaatc caatatccga 112614
tgctacatca ccaaacaacg attcatcaaa tgcgtccaca accgcttgtt ccgttttact 112674
acgattccac gtatctgtac taggatgttc ggttcctaaa ttacactcca tacttataga 112734
gtaaaatctt tagtttaaat aagcgattcc ctttatcaaa acccgccgtc taatggggtt 112794
tgtttggtag caactgatta taaactgttc ataataccac gtggtactat ttaaacagtt 112854
tataatatgt gacatataat acacatttat aataaacaca aaccacgact gtcttttata 112914
cgtttattta ttatacataa taccgggtaa accgttactg cgtaattata tccctatttt 112974
cgcgtatcag ttcttgatgc agaacgggcc ctacttcgat tagttatacg cccagtcgtt 113034
tgtgtatccg ttcgactttt tggcttacga acttgtgcgt acataggttc attacccatg 113094
cccccctgta aatctccagt ttttcgatta tgtccacgtt tggataccga tgctccttca 113154
cctaggtctg cttcattagc ggcttgtatt aaatttaaac cctcatgcac cgtaatacga 113214
ataacggctc cggttaacaa acggtctaat tcggcgttat tccttggggg attactattc 113274
caagccgctt cggcagcctt ttgtgcctgc atggcggcta cgcgccggac cgcttcacaa 113334
aagacgcgtt tgttatatga tggcgtagga ccgcaccacg agcttgttgc ggtttttggt 113394
gcagtgctga aggaaattgg tctcccgctg gcaattgcgc ctgctcccgg gggagctcgc 113454
ttcggtgaat cctctaattt aggtttaacc gccgcatttt tagtaaaaga tctagatggt 113514
ttttctacac ttagcgggtt ttcatataca gcatcttcaa ccagttcatg tctcaaacgg 113574
gcctcccgaa aggcggccac tacttcatcg atgtcttcaa agtcatcttc cgatccgctg 113634
gagtcgtttg gttgatgtac gcgaggggtc gtatttttat gttcaaaata aagatccgcg 113694
tacactggag aaggagaatc ggccccaact gtggtaatgt aacccaacga gtcgtctgaa 113754
tcatcggggg gtcctacgac cacacttcgc cgcgcggttc gatattgtcc ggaataacta 113814
ggcgttgttt tacgacgcac tgcattagag cgacaaagtc tgtcaccgtc ggaagatgcc 113874
attacgtaaa taaacgatag ggtcgtgaaa tatccaaaca cggcagaccg cgtattaaac 113934
aggggccctc ttatacacgc ctgccccttt tataggcaaa cgggtttacc acgtgctgcg 113994
taatacagaa cgagtaaata accggaaaca cgcatgataa gctaacgaaa taagggctac 114054
acacaccccc aaaagggatg cgtagaagaa aagggtggtg atcattgatc cgtcgatata 114114
aactccacga gccgaacagc tggcatgcca aaaattccgt tctgcaaagt ttggctcccc 114174
actgctgtct tcacaaaaaa ataaaatttg catcgttatt aat atg aac gaa gcg 114229
Met Asn Glu Ala
9820
gta att gat ccc atc ttg gaa acg gca gta aat aca ggt gat atg 114274
Val Ile Asp Pro Ile Leu Glu Thr Ala Val Asn Thr Gly Asp Met
9825 9830 9835
ttt tgt agc caa act att ccg aat cgg tgt tta aaa gat aca att 114319
Phe Cys Ser Gln Thr Ile Pro Asn Arg Cys Leu Lys Asp Thr Ile
9840 9845 9850
tta ata gaa gtt caa cct gaa tgt gca gat acg ctg caa tgc gtg 114364
Leu Ile Glu Val Gln Pro Glu Cys Ala Asp Thr Leu Gln Cys Val
9855 9860 9865
tta gac gat aaa gta agt cga cat caa ccg ttg tta ctc cgg aac 114409
Leu Asp Asp Lys Val Ser Arg His Gln Pro Leu Leu Leu Arg Asn
9870 9875 9880
cac aag aaa ctc gaa ctg cca tct gaa aaa tct gta aca cgg ggc 114454
His Lys Lys Leu Glu Leu Pro Ser Glu Lys Ser Val Thr Arg Gly
9885 9890 9895
ggt ttt tat atg cag cag ttg gag ctg ttg gtt aag tcg gcg cct 114499
Gly Phe Tyr Met Gln Gln Leu Glu Leu Leu Val Lys Ser Ala Pro
9900 9905 9910
ccc aat gaa tac gca ctg ttg tta att caa tgc aaa gat act gcc 114544
Pro Asn Glu Tyr Ala Leu Leu Leu Ile Gln Cys Lys Asp Thr Ala
9915 9920 9925
ctt gct gat gaa gac aat ttt ttt gtc gcc aac gga gtt att gat 114589
Leu Ala Asp Glu Asp Asn Phe Phe Val Ala Asn Gly Val Ile Asp
9930 9935 9940
gcg ggt tac aga gga gta att tca gcc ctt ttg tat tac cgg cca 114634
Ala Gly Tyr Arg Gly Val Ile Ser Ala Leu Leu Tyr Tyr Arg Pro
9945 9950 9955
gga gta acc gtt att tta ccc gga cat tta aca atc tac ttg ttc 114679
Gly Val Thr Val Ile Leu Pro Gly His Leu Thr Ile Tyr Leu Phe
9960 9965 9970
ccg gta aaa tta aga caa agt cgc ctt ctc cca aaa aac gtt ctt 114724
Pro Val Lys Leu Arg Gln Ser Arg Leu Leu Pro Lys Asn Val Leu
9975 9980 9985
aaa cat ctg gat cca att ttt aaa tcg ata caa gtt caa ccc tta 114769
Lys His Leu Asp Pro Ile Phe Lys Ser Ile Gln Val Gln Pro Leu
9990 9995 10000
tca aac tcg ccg tca aat tat gaa aaa ccc gtt ata cct gaa ttt 114814
Ser Asn Ser Pro Ser Asn Tyr Glu Lys Pro Val Ile Pro Glu Phe
10005 10010 10015
gct gat att tcc acg gta cag cag ggg caa cct tta cat agg gat 114859
Ala Asp Ile Ser Thr Val Gln Gln Gly Gln Pro Leu His Arg Asp
10020 10025 10030
tct gca gaa tac cat atc gat gtt ccc tta acc tac aaa cat atc 114904
Ser Ala Glu Tyr His Ile Asp Val Pro Leu Thr Tyr Lys His Ile
10035 10040 10045
atc aat cca aaa cgc caa gaa gac gcg gga tat gat att tgt gta 114949
Ile Asn Pro Lys Arg Gln Glu Asp Ala Gly Tyr Asp Ile Cys Val
10050 10055 10060
cca tat aac cta tat tta aaa agg aat gaa ttt ata aaa att gtc 114994
Pro Tyr Asn Leu Tyr Leu Lys Arg Asn Glu Phe Ile Lys Ile Val
10065 10070 10075
tta ccg att ata aga gac tgg gac tta caa cat ccg agt ata aac 115039
Leu Pro Ile Ile Arg Asp Trp Asp Leu Gln His Pro Ser Ile Asn
10080 10085 10090
gct tat att ttt gga aga tca tcg aaa agc cga tca ggc att atc 115084
Ala Tyr Ile Phe Gly Arg Ser Ser Lys Ser Arg Ser Gly Ile Ile
10095 10100 10105
gtg tgt cca acg gca tgg cct gca gga gaa cac tgt aaa ttc tac 115129
Val Cys Pro Thr Ala Trp Pro Ala Gly Glu His Cys Lys Phe Tyr
10110 10115 10120
gta tat aat ctc acg ggt gat gac ata cgt ata aaa acg gga gat 115174
Val Tyr Asn Leu Thr Gly Asp Asp Ile Arg Ile Lys Thr Gly Asp
10125 10130 10135
cgt ctt gca cag gtc ctg tta ata gat cac aac acc caa ata cac 115219
Arg Leu Ala Gln Val Leu Leu Ile Asp His Asn Thr Gln Ile His
10140 10145 10150
tta aaa cac aac gtt tta agt aat att gca ttt cct tat gct atc 115264
Leu Lys His Asn Val Leu Ser Asn Ile Ala Phe Pro Tyr Ala Ile
10155 10160 10165
cgc ggt aaa tgt ggc ata ccg ggt gta caa tgg tat ttt act aaa 115309
Arg Gly Lys Cys Gly Ile Pro Gly Val Gln Trp Tyr Phe Thr Lys
10170 10175 10180
acg tta gat cta ata gcc aca ccc agc gaa cgg gga acg cgt gga 115354
Thr Leu Asp Leu Ile Ala Thr Pro Ser Glu Arg Gly Thr Arg Gly
10185 10190 10195
ttt ggt tca act gat aaa gaa aca aac gat gtc gat ttt cta cta 115399
Phe Gly Ser Thr Asp Lys Glu Thr Asn Asp Val Asp Phe Leu Leu
10200 10205 10210
aaa cat taa atgtaataac cacgccagcc agcaatgttt taattttata 115448
Lys His
10215
tacaaaataa aaacatacac cagaaacgtt tttagttttt atttcaatat ttatacaagc 115508
ataacatggg atttcttgat cgcgggggtt gtgcgttgta catcttgcgt ctgttttggg 115568
gtcaacacgg gctgaagagt ttctgtcgga tacgtttttt ttgttaggtt agatgtgtta 115628
ttatccgata cttctataag tgggggttta atttcagata attgtgtcgc ctccgattta 115688
ataggtgatg tttttaaacc cacattttcc cctttagcta tagataattc atggttgtgg 115748
gaaacatcaa acgatgcctg aggtttagca acgaccccaa gagttttctc caaaagaaca 115808
acatcagaca tgaccacttc actttcagcg gtcattctca gggtttgatc gacaatatca 115868
tccgtagtaa catccaccgc gccaaccgat aaatacaaat gggtaattgc agctaaacac 115928
atatcagcaa gccgttttga gttttccatg tgtgaacgga ccacggcgtt aagaccgggg 115988
ttatcttcag taaaatggtg ctgttttaaa cattcaatgt tacgagaaca tgcagcgtaa 116048
gttcgcgcca aagcctgggc gcggaccaaa cgacgggtat tatctgcaga agcgactacg 116108
tcttctaacg ttagaggtgc aggcaataat ccattcacag cggttaaagc ctcttggagg 116168
cggagcaggg cggctccttg ggggtgcgtg tttacacgca cctcttcata agatggctct 116228
tcagttggta ttcgagcata tccacataag ctggcacaca ccgtctgcat gattgactgg 116288
ctttccaacg tattgaact atg gat aaa tcc tcc aaa ccg acg att cgg 116337
Met Asp Lys Ser Ser Lys Pro Thr Ile Arg
10220 10225
tta tta ttt gcc aca aag gga tgt gca atc tcc cac tcg ctg ttg 116382
Leu Leu Phe Ala Thr Lys Gly Cys Ala Ile Ser His Ser Leu Leu
10230 10235 10240
ttg ctt acc ggg cag ata agc aca gaa cct ctg tat gtg gtg agt 116427
Leu Leu Thr Gly Gln Ile Ser Thr Glu Pro Leu Tyr Val Val Ser
10245 10250 10255
tat act tgg act ccc gac tta gat gac gtc ttt gtc aaa aat ggg 116472
Tyr Thr Trp Thr Pro Asp Leu Asp Asp Val Phe Val Lys Asn Gly
10260 10265 10270
agg gaa gag atc acg caa gta atc cca act aaa cgc cca cgt gaa 116517
Arg Glu Glu Ile Thr Gln Val Ile Pro Thr Lys Arg Pro Arg Glu
10275 10280 10285
gta act gaa aac gat gaa gaa aac caa ata atg cat tta ttt tgt 116562
Val Thr Glu Asn Asp Glu Glu Asn Gln Ile Met His Leu Phe Cys
10290 10295 10300
agt agg gac gtc aac gtt att ttt tat tta att ggt gga ttt tca 116607
Ser Arg Asp Val Asn Val Ile Phe Tyr Leu Ile Gly Gly Phe Ser
10305 10310 10315
act gga gat gta cga tcc cgg gtc tgg cct ata ttt ttt tgt tgt 116652
Thr Gly Asp Val Arg Ser Arg Val Trp Pro Ile Phe Phe Cys Cys
10320 10325 10330
ttt aaa acc caa act gat ttt aaa gct tta tat aag gcg tta tgg 116697
Phe Lys Thr Gln Thr Asp Phe Lys Ala Leu Tyr Lys Ala Leu Trp
10335 10340 10345
tat gga gca ccc cta aat ccg cat ata ata tct gat acc cta tgt 116742
Tyr Gly Ala Pro Leu Asn Pro His Ile Ile Ser Asp Thr Leu Cys
10350 10355 10360
ata tcg gag acg ttt gac att cac tcg gaa gtt ata caa act ctg 116787
Ile Ser Glu Thr Phe Asp Ile His Ser Glu Val Ile Gln Thr Leu
10365 10370 10375
atg gta aca aca cac cat tta aac cga aag gga tta tcg gac aac 116832
Met Val Thr Thr His His Leu Asn Arg Lys Gly Leu Ser Asp Asn
10380 10385 10390
ggc cta tgc atc aca gag gca aca ctc tgc aag tta gtt aaa aaa 116877
Gly Leu Cys Ile Thr Glu Ala Thr Leu Cys Lys Leu Val Lys Lys
10395 10400 10405
tcc gtt ggt cgt cag gag cta aca tca tta tat gcc cat tac gaa 116922
Ser Val Gly Arg Gln Glu Leu Thr Ser Leu Tyr Ala His Tyr Glu
10410 10415 10420
cgt caa gta ttg gct gca tat cga cga ctc tac tgg ggg tat gga 116967
Arg Gln Val Leu Ala Ala Tyr Arg Arg Leu Tyr Trp Gly Tyr Gly
10425 10430 10435
tgc tcg ccg ttt tgg tat att gtt cga ttt gga ccc tct gaa aaa 117012
Cys Ser Pro Phe Trp Tyr Ile Val Arg Phe Gly Pro Ser Glu Lys
10440 10445 10450
acg cta gtg ttg gct aca cgc tat tac ttg tta caa acg gac aca 117057
Thr Leu Val Leu Ala Thr Arg Tyr Tyr Leu Leu Gln Thr Asp Thr
10455 10460 10465
agt tac aat acg ttg gaa acc ccc tta tat gac tta cag gca att 117102
Ser Tyr Asn Thr Leu Glu Thr Pro Leu Tyr Asp Leu Gln Ala Ile
10470 10475 10480
aaa gat ttg ttt tta act tac caa gtc ccg gca tta cct aat tgt 117147
Lys Asp Leu Phe Leu Thr Tyr Gln Val Pro Ala Leu Pro Asn Cys
10485 10490 10495
agt ggg tac aat att tcg gac ttg ttg tct ttt gat aaa ctt tcc 117192
Ser Gly Tyr Asn Ile Ser Asp Leu Leu Ser Phe Asp Lys Leu Ser
10500 10505 10510
atg ttt tgt tgt tcc tca aca tat aca cga ggt ttg aca gcc aaa 117237
Met Phe Cys Cys Ser Ser Thr Tyr Thr Arg Gly Leu Thr Ala Lys
10515 10520 10525
aat gct cta tcg tac att tta cag cga ata cat aca gac aca acg 117282
Asn Ala Leu Ser Tyr Ile Leu Gln Arg Ile His Thr Asp Thr Thr
10530 10535 10540
gaa ata cac gca gta tcg gag tat att acc aac gat aga aaa ggc 117327
Glu Ile His Ala Val Ser Glu Tyr Ile Thr Asn Asp Arg Lys Gly
10545 10550 10555
ctt aaa gtt cca gac cgt gaa ttt gtt gat tat att tat ctg gca 117372
Leu Lys Val Pro Asp Arg Glu Phe Val Asp Tyr Ile Tyr Leu Ala
10560 10565 10570
cat ttt gaa tgt ttc aat cgg aaa cag atc gca gac cac cta caa 117417
His Phe Glu Cys Phe Asn Arg Lys Gln Ile Ala Asp His Leu Gln
10575 10580 10585
gcg gtt aca tac tca gat ttt gtg aat aaa ccg gtc ctc tta aaa 117462
Ala Val Thr Tyr Ser Asp Phe Val Asn Lys Pro Val Leu Leu Lys
10590 10595 10600
tca tcc aac ctg gga aaa aga gct act gct aat ttt ttt aat cat 117507
Ser Ser Asn Leu Gly Lys Arg Ala Thr Ala Asn Phe Phe Asn His
10605 10610 10615
gta cgt tct cgt ctc aac atg cgt gac tat ata aaa aag aac gta 117552
Val Arg Ser Arg Leu Asn Met Arg Asp Tyr Ile Lys Lys Asn Val
10620 10625 10630
att tgt gat gtc act gaa ctt gga cct gag att gga cat aaa tat 117597
Ile Cys Asp Val Thr Glu Leu Gly Pro Glu Ile Gly His Lys Tyr
10635 10640 10645
aca att act aaa aca tat act tta agt ctt acg tat gcc gca aaa 117642
Thr Ile Thr Lys Thr Tyr Thr Leu Ser Leu Thr Tyr Ala Ala Lys
10650 10655 10660
cct agc aag ttt ata ggc gta tgt gac cta gct aca acg cta act 117687
Pro Ser Lys Phe Ile Gly Val Cys Asp Leu Ala Thr Thr Leu Thr
10665 10670 10675
cgt cgt gtg gaa aac att gaa aaa caa ttt agt cca tat gga tgg 117732
Arg Arg Val Glu Asn Ile Glu Lys Gln Phe Ser Pro Tyr Gly Trp
10680 10685 10690
tcc tcc act att ccc tca aat cca ccc ggt ttt gac gaa ttg tct 117777
Ser Ser Thr Ile Pro Ser Asn Pro Pro Gly Phe Asp Glu Leu Ser
10695 10700 10705
aat ttt gag gat tcg ggt gtt tcc gcg gag gcg tta cga gca gcc 117822
Asn Phe Glu Asp Ser Gly Val Ser Ala Glu Ala Leu Arg Ala Ala
10710 10715 10720
aac ttt gca aac gat aca cct aac caa agt ggt cgt act ggt ttt 117867
Asn Phe Ala Asn Asp Thr Pro Asn Gln Ser Gly Arg Thr Gly Phe
10725 10730 10735
gat acg agc ccg ggg att aca aaa cta tta ctg ttt ttc tct gct 117912
Asp Thr Ser Pro Gly Ile Thr Lys Leu Leu Leu Phe Phe Ser Ala
10740 10745 10750
gcc act ggg ata gcc aca cat gat gta tcc atc ctg agt tat aaa 117957
Ala Thr Gly Ile Ala Thr His Asp Val Ser Ile Leu Ser Tyr Lys
10755 10760 10765
act cca tta gaa gcc ctc atc ggc cat tct gag gta act gga cca 118002
Thr Pro Leu Glu Ala Leu Ile Gly His Ser Glu Val Thr Gly Pro
10770 10775 10780
atg cct gta tat cgg gta gcc ttg cct cac ggc gcc caa gca ttt 118047
Met Pro Val Tyr Arg Val Ala Leu Pro His Gly Ala Gln Ala Phe
10785 10790 10795
gct gtt att gct aat gat acg tgg tca tca ata aca aac cgt tac 118092
Ala Val Ile Ala Asn Asp Thr Trp Ser Ser Ile Thr Asn Arg Tyr
10800 10805 10810
act tta ccg cac gag gct cga tta att gcg gag gac ctt aaa caa 118137
Thr Leu Pro His Glu Ala Arg Leu Ile Ala Glu Asp Leu Lys Gln
10815 10820 10825
att aat cca tgt aat ttt gtt gcc gct tca cta cga gat atg cag 118182
Ile Asn Pro Cys Asn Phe Val Ala Ala Ser Leu Arg Asp Met Gln
10830 10835 10840
ttg act tta cta tta tct acg tct gtt aaa aac gtt tct aaa att 118227
Leu Thr Leu Leu Leu Ser Thr Ser Val Lys Asn Val Ser Lys Ile
10845 10850 10855
tca tca aac ata ccc aaa gat cag ctt tat ata aac agg aat gag 118272
Ser Ser Asn Ile Pro Lys Asp Gln Leu Tyr Ile Asn Arg Asn Glu
10860 10865 10870
cta ttt aat aca aat ctt ata atc aca aac ctc ata ctt gat gta 118317
Leu Phe Asn Thr Asn Leu Ile Ile Thr Asn Leu Ile Leu Asp Val
10875 10880 10885
gac ttt cat ata aga aaa ccc atc cca ttg ggt att tta cat gcc 118362
Asp Phe His Ile Arg Lys Pro Ile Pro Leu Gly Ile Leu His Ala
10890 10895 10900
ggc atg cga gca ttt cgt cat ggt att tta acg gcc atg caa tta 118407
Gly Met Arg Ala Phe Arg His Gly Ile Leu Thr Ala Met Gln Leu
10905 10910 10915
ctt ttt cca aag gcc gtg gta aac cct aac aaa gac cca tgt tat 118452
Leu Phe Pro Lys Ala Val Val Asn Pro Asn Lys Asp Pro Cys Tyr
10920 10925 10930
ttt tat aaa act gca tgt cct gaa cct acc gtt gag gtg ttg gat 118497
Phe Tyr Lys Thr Ala Cys Pro Glu Pro Thr Val Glu Val Leu Asp
10935 10940 10945
gat gat aat tta ttg gat ata acc agc cat tct gac atc gat ttt 118542
Asp Asp Asn Leu Leu Asp Ile Thr Ser His Ser Asp Ile Asp Phe
10950 10955 10960
tac ata gaa aat ggc gaa tta tac acg tgt gta gaa gag aat tat 118587
Tyr Ile Glu Asn Gly Glu Leu Tyr Thr Cys Val Glu Glu Asn Tyr
10965 10970 10975
aca gag gat gta tgg ttt ttt gat aca cag aca acg tct gaa gtc 118632
Thr Glu Asp Val Trp Phe Phe Asp Thr Gln Thr Thr Ser Glu Val
10980 10985 10990
cat aca cac gcc gat gta tca aac aat gaa aac ttg cat gaa act 118677
His Thr His Ala Asp Val Ser Asn Asn Glu Asn Leu His Glu Thr
10995 11000 11005
cta ccc tgt aac tgt aaa gag aaa ata ggt ttc agg gta tgc gta 118722
Leu Pro Cys Asn Cys Lys Glu Lys Ile Gly Phe Arg Val Cys Val
11010 11015 11020
cca atc cca aat ccc tat gcg tta gtg ggg tct tcc act tta aag 118767
Pro Ile Pro Asn Pro Tyr Ala Leu Val Gly Ser Ser Thr Leu Lys
11025 11030 11035
ggg ttt gca caa ata tta cag caa gcg gtg ttg ctg gaa cgg gaa 118812
Gly Phe Ala Gln Ile Leu Gln Gln Ala Val Leu Leu Glu Arg Glu
11040 11045 11050
ttt gtt gaa tat att ggt ccg tat tta cgg gac ttt tcg ttt ata 118857
Phe Val Glu Tyr Ile Gly Pro Tyr Leu Arg Asp Phe Ser Phe Ile
11055 11060 11065
gat act ggt gtt tat agc cac gga cat agt tta aga ctg cct ttt 118902
Asp Thr Gly Val Tyr Ser His Gly His Ser Leu Arg Leu Pro Phe
11070 11075 11080
ttc tcc aaa gta aca acc aca ggg acg gcg gtt gga caa cta ctc 118947
Phe Ser Lys Val Thr Thr Thr Gly Thr Ala Val Gly Gln Leu Leu
11085 11090 11095
cca ttt tat gtt gta cct gag cag tgt att gat ata tta gcg ttt 118992
Pro Phe Tyr Val Val Pro Glu Gln Cys Ile Asp Ile Leu Ala Phe
11100 11105 11110
gtg aca tca cat aga aac ccg gca aac ttt cat ttt cat tca aga 119037
Val Thr Ser His Arg Asn Pro Ala Asn Phe His Phe His Ser Arg
11115 11120 11125
ccg cag tcg aat gtt cca gtg caa ttt att tta cat aac ctt ggg 119082
Pro Gln Ser Asn Val Pro Val Gln Phe Ile Leu His Asn Leu Gly
11130 11135 11140
ggg gaa tac gca gag ttt ttt gaa cgt aag gtt gcg cgt aat aaa 119127
Gly Glu Tyr Ala Glu Phe Phe Glu Arg Lys Val Ala Arg Asn Lys
11145 11150 11155
Vcaa ata ttt agc tcc ccg caa ata tct tta aca aag gct cta aaa 119172
Gln Ile Phe Ser Ser Pro Gln Ile Ser Leu Thr Lys Ala Leu Lys
11160 11165 11170
gag cgc ggg gta act tgt ctg gac gca ttt aca ctg gag gcc ttt 119217
Glu Arg Gly Val Thr Cys Leu Asp Ala Phe Thr Leu Glu Ala Phe
11175 11180 11185
gtc gac agc aca ata tta gaa tct att gtg gag cat att gct gtt 119262
Val Asp Ser Thr Ile Leu Glu Ser Ile Val Glu His Ile Ala Val
11190 11195 11200
cat ttc ccc ggg cgt gat cgc gaa tat acc tta aca tca tca aag 119307
His Phe Pro Gly Arg Asp Arg Glu Tyr Thr Leu Thr Ser Ser Lys
11205 11210 11215
tgt atc gcc atc aaa agg gac tgg gtg tta ttt cag ctc ata tgc 119352
Cys Ile Ala Ile Lys Arg Asp Trp Val Leu Phe Gln Leu Ile Cys
11220 11225 11230
gga aca aaa ggg ttc act tgt ctt cga tat ccc cat cgc gga gga 119397
Gly Thr Lys Gly Phe Thr Cys Leu Arg Tyr Pro His Arg Gly Gly
11235 11240 11245
aga acg gct ccc cgg aca ttt gtg tct ctg cga gtg gat cat cac 119442
Arg Thr Ala Pro Arg Thr Phe Val Ser Leu Arg Val Asp His His
11250 11255 11260
aac cgt ttg tgt att tcg ctt gca caa caa tgt ttt gct aca aag 119487
Asn Arg Leu Cys Ile Ser Leu Ala Gln Gln Cys Phe Ala Thr Lys
11265 11270 11275
tgc gat agc aat cgc atg cat aca atc ttt act cta gaa gta cct 119532
Cys Asp Ser Asn Arg Met His Thr Ile Phe Thr Leu Glu Val Pro
11280 11285 11290
aat tat cca aat tta act tcg agt taa caccaaccgt gtgatactac 119579
Asn Tyr Pro Asn Leu Thr Ser Ser
11295
atcgtgcttg aattgccatc ttccacgggt c atg cag gct tta gga atc 119628
Met Gln Ala Leu Gly Ile
11300
aag aca gaa cat ttt ata att atg tgt cta ctt agc gga cat gct 119673
Lys Thr Glu His Phe Ile Ile Met Cys Leu Leu Ser Gly His Ala
11305 11310 11315
gtt ttt acc cta tgg tat acc gct cgt gta aag ttt gaa cat gag 119718
Val Phe Thr Leu Trp Tyr Thr Ala Arg Val Lys Phe Glu His Glu
11320 11325 11330
tgt gtg tat gca acc acg gtg att aat ggt gga ccg gtt gta tgg 119763
Cys Val Tyr Ala Thr Thr Val Ile Asn Gly Gly Pro Val Val Trp
11335 11340 11345
ggg tct tat aac aac tct ctt ata tat gta acg ttt gta aac cac 119808
Gly Ser Tyr Asn Asn Ser Leu Ile Tyr Val Thr Phe Val Asn His
11350 11355 11360
tca acg ttt ttg gat ggc cta tct gga tac gat tac agc tgc cgg 119853
Ser Thr Phe Leu Asp Gly Leu Ser Gly Tyr Asp Tyr Ser Cys Arg
11365 11370 11375
gaa aat cta tta tca gga gat act atg gta aaa acc gct att tct 119898
Glu Asn Leu Leu Ser Gly Asp Thr Met Val Lys Thr Ala Ile Ser
11380 11385 11390
aca cct ttg cat gac aaa att cga att gtt ctg gga aca cgt aat 119943
Thr Pro Leu His Asp Lys Ile Arg Ile Val Leu Gly Thr Arg Asn
11395 11400 11405
tgt cac gct tat ttt tgg tgc gtg cag cta aaa atg att ttt ttt 119988
Cys His Ala Tyr Phe Trp Cys Val Gln Leu Lys Met Ile Phe Phe
11410 11415 11420
gca tgg ttt gta tat ggt atg tat tta caa ttt cga cga ata cgt 120033
Ala Trp Phe Val Tyr Gly Met Tyr Leu Gln Phe Arg Arg Ile Arg
11425 11430 11435
cgt atg ttt ggg cca ttc cga tca tcc tgt gag tta ata tcc ccc 120078
Arg Met Phe Gly Pro Phe Arg Ser Ser Cys Glu Leu Ile Ser Pro
11440 11445 11450
aca tca tat tca ctg aat tac gta aca cgg gtt att tcg aac att 120123
Thr Ser Tyr Ser Leu Asn Tyr Val Thr Arg Val Ile Ser Asn Ile
11455 11460 11465
ctt ctt ggt tac cca tat aca aag ttg gca agg ttg tta tgt gat 120168
Leu Leu Gly Tyr Pro Tyr Thr Lys Leu Ala Arg Leu Leu Cys Asp
11470 11475 11480
gtt tcc atg cga cgg gat ggt atg agt aaa gta ttt aat gct gac 120213
Val Ser Met Arg Arg Asp Gly Met Ser Lys Val Phe Asn Ala Asp
11485 11490 11495
cct ata agt ttt tta tat atg cat aaa ggt gtt acg tta ttg atg 120258
Pro Ile Ser Phe Leu Tyr Met His Lys Gly Val Thr Leu Leu Met
11500 11505 11510
ctt ttg gag gtt atc gct cat ata tca tct gga tgt att gtg ctt 120303
Leu Leu Glu Val Ile Ala His Ile Ser Ser Gly Cys Ile Val Leu
11515 11520 11525
tta acg ctt ggc gtt gca tat aca cca tgc gcg tta tta tac ccc l20348
Leu Thr Leu Gly Val Ala Tyr Thr Pro Cys Ala Leu Leu Tyr Pro
11530 11535 11540
aca tac att cgg att ctg gcc tgg gtt gtt gta tgc acg ctc gct 120393
Thr Tyr Ile Arg Ile Leu Ala Trp Val Val Val Cys Thr Leu Ala
11545 11550 11555
ata gta gag ctt ata tct tat gtt aga cca aaa cca acc aag gat 120438
Ile Val Glu Leu Ile Ser Tyr Val Arg Pro Lys Pro Thr Lys Asp
11560 11565 11570
aat cat tta aat cat atc aat acg ggg gga ata cgt ggt ata tgc 120483
Asn His Leu Asn His Ile Asn Thr Gly Gly Ile Arg Gly Ile Cys
11575 11580 11585
aca aca tgt tgc gct aca gta atg tcc ggc ctt gct ata aaa tgt 120528
Thr Thr Cys Cys Ala Thr Val Met Ser Gly Leu Ala Ile Lys Cys
11590 11595 11600
ttt tat atc gtc ata ttt gct ata gca gtg gtt att ttt atg cat 120573
Phe Tyr Ile Val Ile Phe Ala Ile Ala Val Val Ile Phe Met His
11605 11610 11615
tac gaa caa agg gtg cag gta agc ttg ttt ggg gaa agt gaa aac 120618
Tyr Glu Gln Arg Val Gln Val Ser Leu Phe Gly Glu Ser Glu Asn
11620 11625 11630
tcc cag aag cat taa tcatgtgact aaacacgccc attgcggggt tgggtgagcc 120673
Ser Gln Lys His
11635
tataaattct acaacattgg cggaagatac aggcaactgc aaacacgcaa ttgtcagata 120733
ttttgcagcc atg gcc tct gct tca att cca acc gac cca gac gtg 120779
Met Ala Ser Ala Ser Ile Pro Thr Asp Pro Asp Val
11640 11645 11650
tct act att tgt gaa gac ttt atg aat ttg cta cca gac gaa cct 120824
Ser Thr Ile Cys Glu Asp Phe Met Asn Leu Leu Pro Asp Glu Pro
11655 11660 11665
tcg gat gac ttt gca ttg gaa gtc acc gat tgg gca aat gat gaa 120869
Ser Asp Asp Phe Ala Leu Glu Val Thr Asp Trp Ala Asn Asp Glu
11670 11675 11680
gct att ggc tcc act cca ggc gag gac tcc aca acg tct aga act 120914
Ala Ile Gly Ser Thr Pro Gly Glu Asp Ser Thr Thr Ser Arg Thr
11685 11690 11695
gtg tat gtg gag cgt act gca gat aca gca tat aat cca cgg tat 120959
Val Tyr Val Glu Arg Thr Ala Asp Thr Ala Tyr Asn Pro Arg Tyr
11700 11705 11710
tcc aaa cga agg cac gga agg cgt gaa agc tac cac cac aat cgc 121004
Ser Lys Arg Arg His Gly Arg Arg Glu Ser Tyr His His Asn Arg
11715 11720 11725
ccg aaa act ttg gtt gtt gta tta ccc gat tca aac cat cat gga 121049
Pro Lys Thr Leu Val Val Val Leu Pro Asp Ser Asn His His Gly
11730 11735 11740
gga aga gac gtg gag act gga tat gca cgc atc gaa cgg gga cat 121094
Gly Arg Asp Val Glu Thr Gly Tyr Ala Arg Ile Glu Arg Gly His
11745 11750 11755
cga cga tca tcc aga tct tat aac act caa agt tca aga aaa cac 121139
Arg Arg Ser Ser Arg Ser Tyr Asn Thr Gln Ser Ser Arg Lys His
11760 11765 11770
cgt gat cga tcc ctg tca aat cga aga cgg cgt cct aca acg cct 121184
Arg Asp Arg Ser Leu Ser Asn Arg Arg Arg Arg Pro Thr Thr Pro
11775 11780 11785
cct gca atg acc acg gga gaa aga aat gat cag aca cat gac gaa 121229
Pro Ala Met Thr Thr Gly Glu Arg Asn Asp Gln Thr His Asp Glu
11790 11795 11800
tcg tac agg ttg cga ttt tcc aag aga gac gcc cgc cga gag cgt 121274
Ser Tyr Arg Leu Arg Phe Ser Lys Arg Asp Ala Arg Arg Glu Arg
11805 11810 11815
att cga aaa gag tat gat atc ccg gtc gat cga att acg ggc cgt 121319
Ile Arg Lys Glu Tyr Asp Ile Pro Val Asp Arg Ile Thr Gly Arg
11820 11825 11830
gct att gaa gtc gtc tcc acc gcg gga gcc agc gtg acc att gac 121364
Ala Ile Glu Val Val Ser Thr Ala Gly Ala Ser Val Thr Ile Asp
11835 11840 11845
tcg gta cgc cat tta gat gaa aca att gaa aaa ctg gta gtc cga 121409
Ser Val Arg His Leu Asp Glu Thr Ile Glu Lys Leu Val Val Arg
11850 11855 11860
tat gcc aca ata caa gag ggt gat tca tgg gct tcc ggt gga tgt 121454
Tyr Ala Thr Ile Gln Glu Gly Asp Ser Trp Ala Ser Gly Gly Cys
11865 11870 11875
ttt ccg ggg ata aaa caa aac aca tct tgg ccg gag ttg atg ttg 121499
Phe Pro Gly Ile Lys Gln Asn Thr Ser Trp Pro Glu Leu Met Leu
11880 11885 11890
tac gga cat gaa ctt tat cgt acc ttt gag tca tat aaa atg gac 121544
Tyr Gly His Glu Leu Tyr Arg Thr Phe Glu Ser Tyr Lys Met Asp
11895 11900 11905
tca cgt att gcc cgc gcg ttg cgt gag aga gtc ata cgt gga gaa 121589
Ser Arg Ile Ala Arg Ala Leu Arg Glu Arg Val Ile Arg Gly Glu
11910 11915 11920
tct ttg att gaa gcg ttg gag tct gcg gat gaa ctg tta acg tgg 121634
Ser Leu Ile Glu Ala Leu Glu Ser Ala Asp Glu Leu Leu Thr Trp
11925 11930 11935
att aaa atg tta gcg gca aaa aac ttg ccc atc tac aca aat aat 121679
Ile Lys Met Leu Ala Ala Lys Asn Leu Pro Ile Tyr Thr Asn Asn
11940 11945 11950
ccc att gtt gca acc tcg aag tca ctt ttg gag aat tta aag tta 121724
Pro Ile Val Ala Thr Ser Lys Ser Leu Leu Glu Asn Leu Lys Leu
11955 11960 11965
aag ctg ggg cct ttt gta aga tgt ctt ctt cta aac agg gac aac 121769
Lys Leu Gly Pro Phe Val Arg Cys Leu Leu Leu Asn Arg Asp Asn
11970 11975 11980
gat ttg ggg tct cgt act ctc ccc gaa ctg ttg cgc cag caa cgt 121814
Asp Leu Gly Ser Arg Thr Leu Pro Glu Leu Leu Arg Gln Gln Arg
11985 11990 11995
ttt agt gat atc acg tgt att act act tat atg ttt gtt atg att 121859
Phe Ser Asp Ile Thr Cys Ile Thr Thr Tyr Met Phe Val Met Ile
12000 12005 12010
gcc cgc att gct aat ata gtt gtc cgt ggc tct aaa ttt gtg gaa 121904
Ala Arg Ile Ala Asn Ile Val Val Arg Gly Ser Lys Phe Val Glu
12015 12020 12025
tat gat gat atc agt tgt aac gtt caa gtg tta caa gaa tat aca 121949
Tyr Asp Asp Ile Ser Cys Asn Val Gln Val Leu Gln Glu Tyr Thr
12030 12035 12040
ccc ggg tca tgt ctg gcc ggt gtt tta gag gcc cta atc acc cac 121994
Pro Gly Ser Cys Leu Ala Gly Val Leu Glu Ala Leu Ile Thr His
12045 12050 12055
caa cgc gag tgt ggt cgt gtt gaa tgt acc ctc tca act tgg gcc 122039
Gln Arg Glu Cys Gly Arg Val Glu Cys Thr Leu Ser Thr Trp Ala
12060 12065 12070
ggg cat ctt tct gac gcc cgt cca tac ggt aaa tat ttt aag tgt 122084
Gly His Leu Ser Asp Ala Arg Pro Tyr Gly Lys Tyr Phe Lys Cys
12075 12080 12085
agt acc ttt aac tgc taa aataaaaaat acctttttca tgcttgtcaa 122132
Ser Thr Phe Asn Cys
12090
aacatactaa tttgtatttt taatcattaa gcatatttac ctgttagaat agtaaatacc 122192
tatttcatgc ttgtaaaaca tatcaattta tatttttaat cattatacac caccccttga 122252
ttttatcaat tgtttatgtg attctatatt aataaaagac taataaaaat gtaatattgt 122312
gggagttttt aaggcgacgt tggggatata tgggcgggat tacattgctt tcaaaccaat 122372
atctttgcaa ttccgtctct gattcgggta aaacacacac cagacgtgta ccgaacgttt 122432
aatta atg gat aca acg gga gct tcc gaa agc agt caa ccc atc cga 122479
Met Asp Thr Thr Gly Ala Ser Glu Ser Ser Gln Pro Ile Arg
12095 12100
gtg aat ctt aaa cct gac ccg ttg gcg tcg ttt aca caa gtt ata 122524
Val Asn Leu Lys Pro Asp Pro Leu Ala Ser Phe Thr Gln Val Ile
12105 12110 12115
ccg cca ctg gcg ttg gaa aca acg tgg aca tgc cct gcc aac tca 122569
Pro Pro Leu Ala Leu Glu Thr Thr Trp Thr Cys Pro Ala Asn Ser
12120 12125 12130
cat gca ccg acg cca tcc cct ctg tac ggt gtt aag agg tta tgt 122614
His Ala Pro Thr Pro Ser Pro Leu Tyr Gly Val Lys Arg Leu Cys
12135 12140 12145
gct ctt cga gca aca tgc ggc cgg gct gat gat tta cac gct ttt 122659
Ala Leu Arg Ala Thr Cys Gly Arg Ala Asp Asp Leu His Ala Phe
12150 12155 12160
ttg att gga ctt gga cgt cga gat aaa cca tct gaa tcc cca atg 122704
Leu Ile Gly Leu Gly Arg Arg Asp Lys Pro Ser Glu Ser Pro Met
12165 12170 12175
tat gtt gac cta cag ccg ttt tgc agc ctc cta aat tcc caa cga 122749
Tyr Val Asp Leu Gln Pro Phe Cys Ser Leu Leu Asn Ser Gln Arg
12180 12185 12190
ctg tta ccg gaa atg gct aat tat aac acc cta tgc gat gca ccc 122794
Leu Leu Pro Glu Met Ala Asn Tyr Asn Thr Leu Cys Asp Ala Pro
12195 12200 12205
ttc agc gcc gca act cag cag atg atg ctg gag tcc gga cag ctg 122839
Phe Ser Ala Ala Thr Gln Gln Met Met Leu Glu Ser Gly Gln Leu
12210 12215 12220
ggt gta cat ctg gcg gct att ggg tat cac tgt cat tgt aaa tcc 122884
Gly Val His Leu Ala Ala Ile Gly Tyr His Cys His Cys Lys Ser
12225 12230 12235
ccc ttc tcg gcg gag tgt tgg acc ggt gca tcc gag gca tac gat 122929
Pro Phe Ser Ala Glu Cys Trp Thr Gly Ala Ser Glu Ala Tyr Asp
12240 12245 12250
cat gtt gta tgt ggg gga aaa gcc cga gcg gct gtc ggc gga cta 122974
His Val Val Cys Gly Gly Lys Ala Arg Ala Ala Val Gly Gly Leu
12255 12260 12265
tga actacacatt taaataaaaa tacgtacaat cgaaaaaagg tgtattttat 123027
ttagtgatta catcaatacg ccctccgtag gttcggcaaa tctaacccgg tgacagaccg 123087
gctgttgacg gggggatcct ttcaatattc cacgaatgtt ggaaactgtt ggtcttcctt 123147
gggctgttag ggttgggaga gtgggtcggt ctgacgtggt aagtgcggct tcaccagagt 123207
cttctgtcaa gttgcatgtt aacgtcgacg ttatacaagg tgtttcagtg gtattttgcg 123267
cgatattctc ccaaaacata agacgttgca tatgcatatc ataaatatta ccccgccata 123327
aacagagatc tctgttactt aaaccgtgtg tgcggagaaa tgtacccata gatggagcca 123387
gtaaatgcaa gcccgtacca cattcatacc caagcgttgt ggatcgcttt tccttcatcg 123447
ccaaaataat aaatgtcctt gcacctccag gcgtcattgc acgccctatt agacgggcaa 123507
ggctaagacg ttcgccgggt ttgctatatt tgccaataat tacataggtt ttggtacagg 123567
taatgtgtaa cgttcctccg cctggcaggt ctacagttct ccccgcaagc actcgtaata 123627
aacatcgcgt ttctgacgcg ggattttcct cggcggtggc gtttaaactg ggacgcacca 123687
gagtggagcc cataataaat gcgggaacat gcccatatgc aagtgtctca gaaattacat 123747
gcattttatg attttacggg ggtgggttac gataatagct attaaacaaa cacccaataa 123807
agcatttttt gtagaacctt tattgggtaa cacagtcttt tcacgtgaca ggcaatgtat 123867
aaataacgta cggatgcact taagatgtat cgcacataaa tttatataag ctgtagcaaa 123927
gtataagcaa atcctgttaa tattatattt ttgggatccg ca atg tcc agg gta 123981
Met Ser Arg Val
12270
tcg gag tat ggg gta ccg gaa ggt gtt cgg gaa tct gat agc gat 124026
Set Glu Tyr Gly Val Pro Glu Gly Val Arg Glu Ser Asp Ser Asp
12275 12280 12285
aca gac tct gtg ttt atg tat cag cat aca gag ctt atg cag aac 124071
Thr Asp Ser Val Phe Met Tyr Gln His Thr Glu Leu Met Gln Asn
12290 12295 12300
aac gcg tcg cca ctc gtc gtt caa aca aga cca ccg gcg gtt ctt 124116
Asn Ala Ser Pro Leu Val Val Gln Thr Arg Pro Pro Ala Val Leu
12305 12310 12315
att cca ctg gtt gat gtc cca agg cca cga tcc cgg aga aag gcg 124161
Ile Pro Leu Val Asp Val Pro Arg Pro Arg Ser Arg Arg Lys Ala
12320 12325 12330
tcc gcg caa ctg aaa atg caa atg gac agg tta tgc aac gta ctg 124206
Set Ala Gln Leu Lys Met Gln Met Asp Arg Leu Cys Asn Val Leu
12335 12340 12345
ggt gta gta ctc cag atg gcg acg ttg gct ttg gtg aca tat ata 124251
Gly Val Val Leu Gln Met Ala Thr Leu Ala Leu Val Thr Tyr Ile
12350 12355 12360
gct ttt gtt gtg cat aca cgc gcg aca agc tgc aag cga gaa taa 124296
Ala Phe Val Val His Thr Arg Ala Thr Ser Cys Lys Arg Glu
12365 12370 12375
ataccttccc cttccggaca gtagtttcat gtagttgagt tgggaggttc ctcgggaaaa 124356
acggcaacaa tggcgaccac gacagcagta agggtgagaa taccaaaaag aattaaaaaa 124416
gccggtacac agcatttgcg aattcttaga aaagccagct gaagtctgtg gattccatca 124476
taagcgcagt ctaggcaatc gtaagactgt tgggtggtgc cattctccac gtgaggaaaa 124536
agaggcgggg agaacaccag actctcgcgg cttctgtaag gggggggcgc gtctgccacg 124596
gcctcggcgt atgtgggtag gtaggggatg ggggtcgcaa cgtcatccat gctgggggac 124656
gacgtgaggg tgaccggcgg ggtcccaggt cggcgggagt agtgcacggt cgccatccga 124716
gcagtaaacg aggggtggac gcaaaaggcg cgggttttgt taaaggctgg cggggggggg 124776
tttcccggca aaaaatccca tcccccccga tggtcgcccc gcaaacgcgc ggggaggtgg 124836
ggtcgctttt ttttttctct ctcgaggggg ccgcgagagg gctggcct 124884
<210>52
<211>278
<212>PRT
<213>水痘带状疱疹
<400>52
Met Phe Cys Thr Ser Pro Ala Thr Arg Gly Asp Ser Ser Glu Ser Lys
1 5 10 15
Pro Gly Ala Ser Val Asp Val Asn Gly Lys Met Glu Tyr Gly Ser Ala
20 25 30
Pro Gly Pro Leu Asn Gly Arg Asp Thr Ser Arg Gly Pro Gly Ala Phe
35 40 45
Cys Thr Pro Gly Trp Glu Ile His Pro Ala Arg Leu Val Glu Asp Ile
50 55 60
Asn Arg Val Phe Leu Cys Ile Ala Gln Ser Ser Gly Arg Val Thr Arg
65 70 75 80
Asp Ser Arg Arg Leu Arg Arg Ile Cys Leu Asp Phe Tyr Leu Met Gly
85 90 95
Arg Thr Arg Gln Arg Pro Thr Leu Ala Cys Trp Glu Glu Leu Leu Gln
100 105 110
Leu Gln Pro Thr Gln Thr Gln Cys Leu Arg Ala Thr Leu Met Glu Val
115 120 125
Ser His Arg Pro Pro Arg Gly Glu Asp Gly Phe Ile Glu Ala Pro Asn
130 135 140
Val Pro Leu His Arg Ser Ala Leu Glu Cys Asp Val Ser Asp Asp Gly
145 150 155 160
Gly Glu Asp Asp Ser Asp Asp Asp Gly Ser Thr Pro Ser Asp Val Ile
165 170 175
Glu Phe Arg Asp Ser Asp Ala Glu Ser Ser Asp Gly Glu Asp Phe Ile
180 185 190
Val Glu Glu Glu Ser Glu Glu Ser Thr Asp Ser Cys Glu Pro Asp Gly
195 200 205
Val Pro Gly Asp Cys Tyr Arg Asp Gly Asp Gly Cys Asn Thr Pro Ser
210 215 220
Pro Lys Arg Pro Gln Arg Ala Ile Glu Arg Tyr Ala Gly Ala Glu Thr
225 230 235 240
Ala Glu Tyr Thr Ala Ala Lys Ala Leu Thr Ala Leu Gly Glu Gly Gly
245 250 255
Val Asp Trp Lys Arg Arg Arg His Glu Ala Pro Arg Arg His Asp Ile
260 265 270
Pro Pro Pro His Gly Val
275
<210>53
<211>180
<212>PRT
<213>水痘带状疱疹
<400>53
Met Asn Leu Cys Gly Ser Arg Gly Glu His Pro Gly Gly Glu Tyr Ala
1 5 10 15
Gly Leu Tyr Cys Thr Arg His Asp Thr Pro Ala His Gln Ala Leu Met
20 25 30
Asn Asp Ala Glu Arg Tyr Phe Ala Ala Ala Leu Cys Ala Ile Ser Thr
35 40 45
Glu Ala Tyr Glu Ala Phe Ile His Ser Pro Ser Glu Arg Pro Cys Ala
50 55 60
Set Leu Trp Gly Arg Ala Lys Asp Ala Phe Gly Arg Met Cys Gly Glu
65 70 75 80
Leu Ala Ala Asp Arg Gln Arg Pro Pro Ser Val Pro Pro Ile Arg Arg
85 90 95
Ala Val Leu Ser Leu Leu Arg Glu Gln Cys Met Pro Asp Pro Gln Ser
100 105 110
His Leu Glu Leu Ser Glu Arg Leu Ile Leu Met Ala Tyr Trp Cys Cys
115 120 125
Leu Gly His Ala Gly Leu Pro Thr Ile Gly Leu Ser Pro Asp Asn Lys
130 135 140
Cys Ile Arg Ala Glu Leu Tyr Asp Arg Pro Gly Gly Ile Cys His Arg
145 150 155 160
Leu Phe Asp Ala Tyr Leu Gly Cys Gly Ser Leu Gly Val Pro Arg Thr
165 170 175
Tyr Glu Arg Ser
180
<210>54
<211>102
<212>PRT
<213>水痘带状疱疹
<400>54
Met Ala Gly Gln Asn Thr Met Glu Gly Glu Ala Val Ala Leu Leu Met
1 5 10 15
Glu Ala Val Val Thr Pro Arg Ala Gln Pro Asn Asn Thr Thr Ile Thr
20 25 30
Ala Ile Gln Pro Ser Arg Ser Ala Glu Lys Cys Tyr Tyr Ser Asp Ser
35 40 45
Glu Asn Glu Thr Ala Asp Glu Phe Leu Arg Arg Ile Gly Lys Tyr Gln
50 55 60
His Lys Ile Tyr His Arg Lys Lys Phe Cys Tyr Ile Thr Leu Ile Ile
65 70 75 80
Val Phe Val Phe Ala Met Thr Gly Ala Ala Phe Ala Leu Gly Tyr Ile
85 90 95
Thr Ser Gln Phe Val Gly
100
<210>55
<211>1310
<212>PRT
<213>水痘带状疱疹
<400>55
Met Asp Thr Pro Pro Met Gln Arg Ser Thr Pro Gln Arg Ala Gly Ser
1 5 10 15
Pro Asp Thr Leu Glu Leu Met Asp Leu Leu Asp Ala Ala Ala Ala Ala
20 25 30
Ala Glu His Arg Ala Arg Val Val Thr Ser Ser Gln Pro Asp Asp Leu
35 40 45
Leu Phe Gly Glu Asn Gly Val Met Val Gly Arg Glu His Glu Ile Val
50 55 60
Ser Ile Pro Ser Val Ser Gly Leu Gln Pro Glu Pro Arg Thr Glu Asp
65 70 75 80
Val Gly Glu Glu Leu Thr Gln Asp Asp Tyr Val Cys Glu Asp Gly Gln
85 90 95
Asp Leu Met Gly Ser Pro Val Ile Pro Leu Ala Glu Val Phe His Thr
100 105 110
Arg Phe Ser Glu Ala Gly Ala Arg Glu Pro Thr Gly Ala Asp Arg Ser
115 120 125
Leu Glu Thr Val Ser Leu Gly Thr Lys Leu Ala Arg Ser Pro Lys Pro
130 135 140
Pro Met Asn Asp Gly Glu Thr Gly Arg Gly Thr Thr Pro Pro Phe Pro
145 150 155 160
Gln Ala Phe Ser Pro Val Ser Pro Ala Ser Pro Val Gly Asp Ala Ala
165 170 175
Gly Asn Asp Gln Arg Glu Asp Gln Arg Ser Ile Pro Arg Gln Thr Thr
180 185 190
Arg Gly Asn Ser Pro Gly Leu Pro Ser Val Val His Arg Asp Arg Gln
195 200 205
Thr Gln Ser Ile Ser Gly Lys Lys Pro Gly Asp Glu Gln Ala Gly His
210 215 220
Ala His Ala Ser Gly Asp Gly Val Val Leu Gln Lys Thr Gln Arg Pro
225 230 235 240
Ala Gln Gly Lys Ser Pro Lys Lys Lys Thr Leu Lys Val Lys Val Pro
245 250 255
Leu Pro Ala Arg Lys Pro Gly Gly Pro Val Pro Gly Pro Val Glu Gln
260 265 270
Leu Tyr His Val Leu Ser Asp Ser Val Pro Ala Lys Gly Ala Lys Ala
275 280 285
Asp Leu Pro Phe Glu Thr Asp Asp Thr Arg Pro Arg Lys His Asp Ala
290 295 300
Arg Gly Ile Thr Pro Arg Val Pro Gly Arg Ser Ser Gly Gly Lys Pro
305 310 315 320
Arg Ala Phe Leu Ala Leu Pro Gly Arg Ser His Ala Pro Asp Pro Ile
325 330 335
Glu Asp Asp Ser Pro Val Glu Lys Lys Pro Lys Ser Arg Glu Phe Val
340 345 350
Ser Ser Ser Ser Ser Ser Ser Ser Trp Gly Ser Ser Ser Glu Asp Glu
355 360 365
Asp Asp Glu Pro Arg Arg Val Ser Val Gly Ser Glu Thr Thr Gly Ser
370 375 380
Arg Ser Gly Arg Glu His Ala Pro Ser Pro Ser Asn Ser Asp Asp Ser
385 390 395 400
Asp Ser Asn Asp Gly Gly Ser Thr Lys Gln Asn Ile Gln Pro Gly Tyr
405 410 415
Arg Ser Ile Ser Gly Pro Asp Pro Arg Ile Arg Lys Thr Lys Arg Leu
420 425 430
Ala Gly Glu Pro Gly Arg Gln Arg Gln Lys Ser Phe Ser Leu Pro Arg
435 440 445
Ser Arg Thr Pro Ile Ile Pro Pro Val Ser Gly Pro Leu Met Met Pro
450 455 460
Asp Gly Ser Pro Trp Pro Gly Ser Ala Pro Leu Pro Ser Asn Arg Val
465 470 475 480
Arg Phe Gly Pro Ser Gly Glu Thr Arg Glu Gly His Trp Glu Asp Glu
485 490 495
Ala Val Arg Ala Ala Arg Ala Arg Tyr Glu Ala Ser Thr Glu Pro Val
500 505 510
Pro Leu Tyr Val Pro Glu Leu Gly Asp Pro Ala Arg Gln Tyr Arg Ala
515 520 525
Leu Ile Asn Leu Ile Tyr Cys Pro Asp Arg Asp Pro Ile Ala Trp Leu
530 535 540
Gln Asn Pro Lys Leu Thr Gly Val Asn Ser Ala Leu Asn Gln Phe Tyr
545 550 555 560
Gln Lys Leu Leu Pro Pro Gly Arg Ala Gly Thr Ala Val Thr Gly Ser
565 570 575
Val Ala Ser Pro Val Pro His Val Gly Glu Ala Met Ala Thr Gly Glu
580 585 590
Ala Leu Trp Ala Leu Pro His Ala Ala Ala Ala Val Ala Met Ser Arg
595 600 605
Arg Tyr Asp Arg Ala Gln Lys His Phe Ile Leu Gln Ser Leu Arg Arg
610 615 620
Ala Phe Ala Ser Met Ala Tyr Pro Glu Ala Thr Gly Ser Ser Pro Ala
625 630 635 640
Ala Arg Ile Ser Arg Gly His Pro Ser Pro Thr Thr Pro Ala Thr Gln
645 650 655
Ala Pro Asp Pro Gln Pro Ser Ala Ala Ala Arg Ser Leu Ser Val Cys
660 665 670
Pro Pro Asp Asp Arg Leu Arg Thr Pro Arg Lys Arg Lys Ser Gln Pro
675 680 685
Val Glu Ser Arg Ser Leu Leu Asp Lys Ile Arg Glu Thr Pro Val Ala
690 695 700
Asp Ala Arg Val Ala Asp Asp His Val Val Ser Lys Ala Lys Arg Arg
705 710 715 720
Val Ser Glu Pro Val Thr Ile Thr Ser Gly Pro Val Val Asp Pro Pro
725 730 735
Ala Val Ile Thr Met Pro Leu Asp Gly Pro Ala Pro Asn Gly Gly Phe
740 745 750
Arg Arg Ile Pro Arg Gly Ala Leu His Thr Pro Val Pro Ser Asp Gln
755 760 765
Ala Arg Lys Ala Tyr Cys Thr Pro Glu Thr Ile Ala Arg Leu Val Asp
770 775 780
Asp Pro Leu Phe Pro Thr Ala Trp Arg Pro Ala Leu Ser Phe Asp Pro
785 790 795 800
Gly Ala Leu Ala Glu Ile Ala Ala Arg Arg Pro Gly Gly Gly Asp Arg
805 810 815
Arg Phe Gly Pro Pro Ser Gly Val Glu Ala Leu Arg Arg Arg Cys Ala
820 825 830
Trp Met Arg Gln Ile Pro Asp Pro Glu Asp Val Arg Leu Leu Ile Ile
835 840 845
Tyr Asp Pro Leu Pro Gly Glu Asp Ile Asn Gly Pro Leu Glu Ser Thr
850 855 860
Leu Ala Thr Asp Pro Gly Pro Ser Trp Ser Pro Ser Arg Gly Gly Leu
865 870 875 880
Ser Val Val Leu Ala Ala Leu Ser Asn Arg Leu Cys Leu Pro Ser Thr
885 890 895
His Ala Trp Ala Gly Asn Trp Thr Gly Pro Pro Asp Val Ser Ala Leu
900 905 910
Asn Ala Arg Gly Val Leu Leu Leu Ser Thr Arg Asp Leu Ala Phe Ala
915 920 925
Gly Ala Val Glu Tyr Leu Gly Ser Arg Leu Ala Ser Ala Arg Arg Arg
930 935 940
Leu Leu Val Leu Asp Ala Val Ala Leu Glu Arg Trp Pro Arg Asp Gly
945 950 955 960
Pro Ala Leu Ser Gln Tyr His Val Tyr Val Arg Ala Pro Ala Arg Pro
965 970 975
Asp Ala Gln Ala Val Val Arg Trp Pro Asp Ser Ala Val Thr Glu Gly
980 985 990
Leu Ala Arg Ala Val Phe Ala Ser Ser Arg Thr Phe Gly Pro Ala Ser
995 1000 1005
Phe Ala Arg Ile Glu Thr Ala Phe Ala Asn Leu Tyr Pro Gly Glu
1010 1015 1020
Gln Pro Leu Cys Leu Cys Arg Gly Gly Asn Val Ala Tyr Thr Val
1025 1030 1035
Cys Thr Arg Ala Gly Pro Lys Thr Arg Val Pro Leu Ser Pro Arg
1040 1045 1050
Glu Tyr Arg Gln Tyr Val Leu Pro Gly Phe Asp Gly Cys Lys Asp
1055 1060 1065
Leu Ala Arg Gln Ser Arg Gly Leu Gly Leu Gly Ala Ala Asp Phe
1070 1075 1080
Val Asp Glu Ala Ala His Ser His Arg Ala Ala Asn Arg Trp Gly
1085 1090 1095
Leu Gly Ala Ala Leu Arg Pro Val Phe Leu Pro Glu Gly Arg Arg
1100 1105 1110
Pro Gly Ala Ala Gly Pro Glu Ala Gly Asp Val Pro Thr Trp Ala
1115 1120 1125
Arg Val Phe Cys Arg His Ala Leu Leu Glu Pro Asp Pro Ala Ala
1130 1135 1140
Glu Pro Leu Val Leu Pro Pro Val Ala Gly Arg Ser Val Ala Leu
1145 1150 1155
Tyr Ala Ser Ala Asp Glu Ala Arg Asn Ala Leu Pro Pro Ile Pro
1160 1165 1170
Arg Val Met Trp Pro Pro Gly Phe Gly Ala Ala Glu Thr Val Leu
1175 1180 1185
Glu Gly Ser Asp Gly Thr Arg Phe Val Phe Gly His His Gly Gly
1190 1195 1200
Ser Glu Arg Pro Ser Glu Thr Gln Ala Gly Arg Gln Arg Arg Thr
1205 1210 1215
Ala Asp Asp Arg Glu His Ala Leu Glu Leu Asp Asp Trp Glu Val
1220 1225 1230
Gly Cys Glu Asp Ala Trp Asp Ser Glu Glu Gly Gly Gly Asp Asp
1235 1240 1245
Gly Asp Ala Pro Gly Ser Ser Phe Gly Val Ser Ile Val Ser Val
1250 1255 1260
Ala Pro Gly Val Leu Arg Asp Arg Arg Val Gly Leu Arg Pro Ala
1265 1270 1275
Val Lys Val Glu Leu Leu Ser Ser Ser Ser Ser Ser Glu Asp Glu
1280 1285 1290
Asp Asp Val Trp Gly Gly Arg Gly Gly Arg Ser Pro Pro Gln Ser
1295 1300 1305
Arg Gly
1310
<210>56
<211>467
<212>PRT
<213>水痘带状疱疹
<400>56
Met Asp Thr Ile Leu Ala Gly Gly Ser Gly Thr Ser Asp Ala Ser Asp
1 5 10 15
Asn Thr Cys Thr Ile Cys Met Ser Thr Val Ser Asp Leu Gly Lys Thr
20 25 30
Met Pro Cys Leu His Asp Phe Cys Phe Val Cys Ile Arg Ala Trp Thr
35 40 45
Ser Thr Ser Val Gln Cys Pro Leu Cys Arg Cys Pro Val Gln Ser Ile
50 55 60
Leu His Lys Ile Val Ser Asp Thr Ser Tyr Lys Glu Tyr Glu Val His
65 70 75 80
Pro Ser Asp Asp Asp Gly Phe Ser Glu Pro Ser Phe Glu Asp Ser Ile
85 90 95
Asp Ile Leu Pro Gly Asp Val Ile Asp Leu Leu Pro Pro Ser Pro Gly
100 105 110
Pro Ser Arg Glu Ser Ile Gln Gln Pro Thr Ser Arg Ser Ser Arg Glu
115 120 125
Pro Ile Gln Ser Pro Asn Pro Gly Pro Leu Gln Ser Ser Ala Arg Glu
130 135 140
Pro Thr Ala Glu Ser Pro Ser Asp Ser Gln Gln Asp Ser Ile Gln Pro
145 150 155 160
Pro Thr Arg Asp Ser Ser Pro Gly Val Thr Lys Thr Cys Ser Thr Ala
165 170 175
Ser Phe Leu Arg Lys Val Phe Phe Lys Asp Gln Pro Ala Val Arg Ser
180 185 190
Ala Thr Pro Val Val Tyr Gly Ser Ile Glu Ser Ala Gln Gln Pro Arg
195 200 205
Thr Gly Gly Gln Asp Tyr Arg Asp Arg Pro Val Ser Val Gly Ile Asn
210 215 220
Gln Asp Pro Arg Thr Met Asp Arg Leu Pro Phe Arg Ala Thr Asp Arg
225 230 235 240
Gly Thr Glu Gly Asn Ala Arg Phe Pro Cys Tyr Met Gln Pro Leu Leu
245 250 255
Gly Trp Leu Asp Asp Gln Leu Ala Glu Leu Tyr Gln Pro Glu Ile Val
260 265 270
Glu Pro Thr Lys Met Leu Ile Leu Asn Tyr Ile Gly Ile Tyr Gly Arg
275 280 285
Asp Glu Ala Gly Leu Lys Thr Ser Leu Arg Cys Leu Leu His Asp Ser
290 295 300
Thr Gly Pro Phe Val Thr Asn Met Leu Phe Leu Leu Asp Arg Cys Thr
305 310 315 320
Asp Pro Thr Arg Leu Thr Met Gln Thr Trp Thr Trp Lys Asp Thr Ala
325 330 335
Ile Gln Leu Ile Thr Gly Pro Ile Val Arg Pro Glu Thr Thr Ser Thr
340 345 350
Gly Glu Thr Ser Arg Gly Asp Glu Arg Asp Thr Arg Leu Val Asn Thr
355 360 365
Pro Gln Lys Val Arg Leu Phe Ser Val Leu Pro Gly Ile Lys Pro Gly
370 375 380
Ser Ala Arg Gly Ala Lys Arg Arg Leu Phe His Thr Gly Arg Asp Val
385 390 395 400
Lys Arg Cys Leu Thr Ile Asp Leu Thr Ser Glu Ser Asp Ser Ala Cys
405 410 415
Lys Gly Ser Lys Thr Arg Lys Val Ala Ser Pro Gln Gly Glu Ser Asn
420 425 430
Thr Pro Ser Thr Ser Gly Ser Thr Ser Gly Ser Leu Lys His Leu Thr
435 440 445
Lys Lys Ser Ser Ala Gly Lys Ala Gly Lys Gly Ile Pro Asn Lys Met
450 455 460
Lys Lys Ser
465
<210>57
<211>305
<212>PRT
<213>水痘带状疱疹
<400>57
Met Asp Val Ser Gly Glu Pro Thr Val Cys Ser Asn Ala Tyr Ala Asn
1 5 10 15
Glu Met Lys Leu Ser Asp Ser Lys Asp Ile Tyr Val Leu Ala His Pro
20 25 30
Val Thr Lys Lys Thr Arg Lys Arg Pro Arg Gly Leu Pro Leu Gly Val
35 40 45
Lys Leu Asp Pro Pro Thr Phe Lys Leu Asn Asn Met Ser His His Tyr
50 55 60
Asp Thr Glu Thr Phe Thr Pro Val Ser Ser Gln Leu Asp Ser Val Glu
65 70 75 80
Val Phe Ser Lys Phe Asn Ile Ser Pro Glu Trp Tyr Asp Leu Leu Ser
85 90 95
Asp Glu Leu Lys Glu Pro Tyr Ala Lys Gly Ile Phe Leu Glu Tyr Asn
100 105 110
Arg Leu Leu Asn Ser Gly Glu Glu Ile Leu Pro Ser Thr Gly Asp Ile
115 120 125
Phe Ala Trp Thr Arg Phe Cys Gly Pro Gln Ser Ile Arg Val Val Ile
130 135 140
Ile Gly Gln Asp Pro Tyr Pro Thr Ala Gly His Ala His Gly Leu Ala
145 150 155 160
Phe Ser Val Lys Arg Gly Ile Thr Pro Pro Ser Ser Leu Lys Asn Ile
165 170 175
Phe Ala Ala Leu Met Glu Ser Tyr Pro Asn Met Thr Pro Pro Thr His
180 185 190
Gly Cys Leu Glu Ser Trp Ala Arg Gln Gly Val Leu Leu Leu Asn Thr
195 200 205
Thr Leu Thr Val Arg Arg Gly Thr Pro Gly Ser His Val Tyr Leu Gly
210 215 220
Trp Gly Arg Leu Val Gln Arg Val Leu Gln Arg Leu Cys Glu Asn Arg
225 230 235 240
Thr Gly Leu Val Phe Met Leu Trp Gly Ala His Ala Gln Lys Thr Thr
245 250 255
Gln Pro Asn Ser Arg Cys His Leu Val Leu Thr His Ala His Pro Ser
260 265 270
Pro Leu Ser Arg Val Pro Phe Arg Asn Cys Arg His Phe Val Gln Ala
275 280 285
Asn Glu Tyr Phe Thr Arg Lys Gly Glu Pro Glu Ile Asp Trp Ser Val
290 295 300
Ile
305
<210>58
<211>71
<212>PRT
<213>水痘带状疱疹
<400>58
Met Asp Val Arg Glu Arg Asn Val Phe Gly Asn Ala Ser Val Ala Thr
1 5 10 15
Pro Gly Glu His Gln Lys Phe Val Arg Glu Leu Ile Leu Ser Gly His
20 25 30
Asn Asn Val Val Leu Gln Thr Tyr Thr Gly Lys Trp Ser Asp Cys Arg
35 40 45
Lys His Gly Lys Ser Val Met Tyr Asn Thr Gly Glu Ala Arg His Pro
50 55 60
Thr Cys Lys Ala His Gln Arg
65 70
<210>59
<211>331
<212>PRT
<213>水痘带状疱疹
<400>59
Met Gln Arg Ile Arg Pro Tyr Trp Ile Lys Phe Glu Gln Thr Gly Gly
1 5 10 15
Ala Gly Met Ala Asp Gly Met Ser Gly Ile Asn Ile Pro Ser Ile Leu
20 25 30
Gly Cys Ser Val Thr Ile Asp Asn Leu Leu Thr Arg Ala Glu Glu Gly
35 40 45
Leu Asp Val Ser Asp Val Ile Glu Asp Leu Arg Ile Gln Ala Ile Pro
50 55 60
Arg Phe Val Cys Glu Ala Arg Glu Val Thr Gly Leu Lys Pro Arg Phe
65 70 75 80
Leu Ala Asn Ser Val Val Ser Leu Arg Val Lys Pro Glu His Gln Glu
85 90 95
Thr Val Leu Val Val Leu Asn Gly Asp Ser Ser Glu Val Ser Cys Asp
100 105 110
Arg Tyr Tyr Met Glu Cys Val Thr Gln Pro Ala Phe Arg Gly Phe Ile
115 120 125
Phe Ser Val Leu Thr Ala Val Glu Asp Arg Val Tyr Thr Val Gly Val
130 135 140
Pro Pro Arg Leu Leu Ile Tyr Arg Met Thr Leu Phe Arg Pro Asp Asn
145 150 155 160
Val Leu Asp Phe Thr Leu Cys Val Ile Leu Met Tyr Leu Glu Gly Ile
165 170 175
Gly Pro Ser Gly Ala Ser Pro Ser Leu Phe Val Gln Leu Ser Val Tyr
180 185 190
Leu Arg Arg Val Glu Cys Gln Ile Gly Pro Leu Glu Lys Met Arg Arg
195 200 205
Phe Leu Tyr Glu Gly Val Leu Trp Leu Leu Asn Thr Leu Met Tyr Val
210 215 220
Val Asp Asn Asn Pro Phe Thr Lys Thr Arg Val Leu Pro His Tyr Met
225 230 235 240
Phe Val Lys Leu Leu Asn Pro Gln Pro Gly Thr Ala Pro Asn Ile Ile
245 250 255
Lys Ala Ile Tyr Ser Cys Gly Val Gly Gln Arg Phe Asp Leu Pro His
260 265 270
Gly Thr Pro Pro Cys Pro Asp Gly Val Val Gln Val Pro Pro Gly Leu
275 280 285
Leu Asn Gly Pro Leu Arg Asp Ser Glu Tyr Gln Lys Ser Val Tyr Phe
290 295 300
Trp Trp Leu Asn Arg Thr Met Val Thr Pro Lys Asn Val Gln Leu Phe
305 310 315 320
Glu Thr Tyr Lys Asn Ser Pro Arg Val Val Lys
325 330
<210>60
<211>541
<212>PRT
<213>水痘带状疱疹
<400>60
Met Glu Phe Pro Tyr His Ser Thr Val Ser Tyr Asn Gly Val Thr Phe
1 5 10 15
Tyr Phe Asn Glu Arg Ala Thr Arg Ala Tyr Phe Ile Cys Gly Gly Cys
20 25 30
Leu Ile Ser Ile Pro Arg Lys His Gly Gly Glu Ile Ala Lys Phe Gly
35 40 45
His Val Val Arg Gly Val Gly Pro Gly Asp Arg Ser Val Ala Ser Tyr
50 55 60
Val Arg Ser Glu Leu Asn Arg Thr Gly Lys Thr Trp Ala Val Ser Ser
65 70 75 80
Asn Asn Asn Cys Val Phe Leu Asp Arg Val Ala Leu Leu Ala Ala Gly
85 90 95
Ser Gly Ala Val Asp Arg Asp Leu Cys Gly Thr Phe Asp Val Glu Val
100 105 110
Glu Asp Pro Thr Leu Ala Asp Tyr Leu Val Ser Leu Pro Val Thr His
115 120 125
Leu Thr Leu Val Ala Gly Val Asp Val Thr Arg Glu Asn Lys Leu Lys
130 135 140
Leu Phe Pro Thr Pro Thr Ala Ile Asn Thr Thr Asn Gly Phe Met Tyr
145 150 155 160
Val Pro Asn Glu Ala Ser Phe Ser Leu Val Tyr Met Arg Met Leu Glu
165 170 175
Leu Pro Glu Ser Leu Gln Glu Leu Val Ser Gly Leu Phe Asp Gly Thr
180 185 190
Pro Glu Ile Arg Asp Ala Leu Asn Gly Ser Asn Asp Asp Glu Lys Thr
195 200 205
Ser Ile Ile Val Ser Arg Arg Ala Ala Asp Val Val Thr Glu Asp Val
210 215 220
Lys Ala Asp Asp Val Pro Ile Ser Gly Glu Pro Tyr Ser Glu Lys Gln
225 230 235 240
Pro Arg Arg Arg Lys Lys Ser Asp His Ile Thr Leu Ser Asn Phe Val
245 250 255
Gln Ile Arg Thr Ile Pro Arg Val Met Asp Ile Trp Asp Pro Arg His
260 265 270
Lys Ala Thr Thr His Cys Ile Arg Ala Leu Ser Cys Ala Val Phe Phe
275 280 285
Ala Asp Glu Val Ile Phe Lys Ala Arg Lys Trp Pro Gly Leu Glu Asp
290 295 300
Glu Leu Asn Glu Ala Arg Glu Thr Ile Tyr Thr Ala Val Val Ala Val
305 310 315 320
Tyr Gly Glu Arg Gly Glu Leu Pro Phe Phe Gly His Ala Tyr Gly Arg
325 330 335
Asp Leu Thr Ser Cys Gln Arg Phe Val Ile Val Gln Tyr Ile Leu Ser
340 345 350
Arg Trp Glu Ala Phe Asn Cys Tyr Ala Val Ile Glu Asp Leu Thr Arg
355 360 365
Ser Tyr Val Asn Ala Leu Pro Ser Asp Asp Asp Thr Asp Gln Val Ala
370 375 380
Gln Asp Leu Ile Arg Thr Ile Val Asp Thr Ala Asn Ser Leu Leu Arg
385 390 395 400
Glu Val Gly Phe Ile Gly Thr Leu Ala Glu Thr Leu Leu Phe Leu Pro
405 410 415
Leu Pro Gln Leu Pro Cys Tyr Lys Glu Thr Ser His Leu Ala Lys Lys
420 425 430
Glu Gly Val Arg Ile Leu Arg Leu Ala Lys Thr Gly Val Gly Leu Ser
435 440 445
Asp Thr Val Pro Val Asp Val Ser Val Thr Glu Arg His Glu Tyr Glu
450 455 460
Ile Ser Arg Tyr Leu Asp Thr Leu Tyr Ser Gly Asp Pro Cys Tyr Asn
465 470 475 480
Gly Ala Val Arg Leu Cys Arg Leu Leu Gly Ser Ser Ile Pro Ile Ala
485 490 495
Leu Tyr Tyr Asn Thr Ile Ser Gly Asn Ala Phe Glu Pro Tyr Phe Ala
500 505 510
Gly Arg Arg Tyr Ile Ala Tyr Leu Gly Ala Leu Phe Phe Gly Arg Val
515 520 525
His Gln Thr Pro Phe Gly Asp Gly Lys Lys Thr Gln Arg
530 535 540
<210>61
<211>258
<212>PRT
<213>水痘带状疱疹
<400>61
Met Ser Ala Ser Arg Ile Arg Ala Lys Cys Phe Arg Leu Gly Gln Arg
1 5 10 15
Cys His Thr Arg Phe Tyr Asp Val Leu Lys Lys Asp Ile Asp Asn Val
20 25 30
Arg Arg Gly Phe Ala Asp Ala Phe Asn Pro Arg Leu Ala Lys Leu Leu
35 40 45
Ser Pro Leu Ser His Val Asp Val Gln Arg Ala Val Arg Ile Ser Met
50 55 60
Ser Phe Glu Val Asn Leu Gly Arg Arg Arg Pro Asp Cys Val Cys Ile
65 70 75 80
Ile Gln Thr Glu Ser Ser Gly Ala Gly Lys Thr Val Cys Phe Ile Val
85 90 95
Glu Leu Lys Ser Cys Arg Phe Ser Ala Asn Ile His Thr Pro Thr Lys
100 105 110
Tyr His Gln Phe Cys Glu Gly Met Arg Gln Leu Arg Asp Thr Met Ala
115 120 125
Leu Ile Lys Glu Thr Thr Pro Thr Gly Ser Asp Glu Ile Met Val Thr
130 135 140
Pro Leu Leu Val Phe Val Ser Gln Arg Gly Leu Asn Leu Leu Gln Val
145 150 155 160
Thr Arg Leu Pro Pro Lys Val Ile His Gly Asn Leu Val Met Leu Ala
165 170 175
Ser His Leu Glu Asn Val Ala Glu Tyr Thr Pro Pro Ile Arg Ser Val
180 185 190
Arg Glu Arg Arg Arg Leu Cys Lys Lys Lys Ile His Val Cys Ser Leu
195 200 205
Ala Lys Lys Arg Ala Lys Ser Cys His Arg Ser Ala Leu Thr Lys Phe
210 215 220
Glu Glu Asn Ala Ala Cys Gly Val Asp Leu Pro Leu Arg Arg Pro Ser
225 230 235 240
Leu Gly Ala Cys Gly Gly Ile Leu Gln Ser Ile Thr Gly Met Phe Ser
245 250 255
His Gly
<210>62
<211>579
<212>PRT
<213>水痘带状疱疹
<400>62
Met Thr Ala Arg Tyr Gly Phe Gly Ser Ile Ser Phe Pro Asn Lys Cys
1 5 10 15
Gly Ile Phe Leu Ser Thr Thr Lys Asn Phe Ile Ala Pro Asn Phe Pro
20 25 30
Ile His Tyr Trp Thr Ala Pro Ala Phe Glu Leu Arg Gly Arg Met Asn
35 40 45
Pro Asp Leu Glu Lys Asn Thr Leu Thr Leu Lys Asn Ala Ala Ala Val
50 55 60
Ala Ala Leu Asp Asn Leu Arg Gly Glu Thr Ile Thr Leu Pro Thr Glu
65 70 75 80
Ile Asp Arg Arg Leu Lys Pro Leu Glu Glu Gln Leu Thr Arg Met Ala
85 90 95
Lys Val Leu Asp Ser Leu Glu Thr Ala Ala Ala Glu Ala Glu Glu Ala
100 105 110
Asp Ala Gln Ser Glu Glu Cys Thr Arg Thr Glu Ile Ile Arg Asn Glu
115 120 125
Ser Ile His Pro Glu Val Gln Ile Ala Lys Asn Asp Ala Pro Leu Gln
130 135 140
Tyr Asp Thr Asn Phe Gln Val Asp Phe Ile Thr Leu Val Tyr Leu Gly
145 150 155 160
Arg Ala Arg Gly Asn Asn Ser Pro Gly Ile Val Phe Gly Pro Trp Tyr
165 170 175
Arg Thr Leu Gln Glu Arg Leu Val Leu Asp Arg Pro Val Ala Ala Arg
180 185 190
Gly Val Asp Cys Lys Asp Gly Arg Ile Ser Arg Thr Phe Met Asn Thr
195 200 205
Thr Val Thr Cys Leu Gln Ser Ala Gly Arg Met Tyr Val Gly Asp Arg
210 215 220
Ala Tyr Ser Ala Phe Glu Cys Ala Val Leu Cys Leu Tyr Leu Met Tyr
225 230 235 240
Arg Thr Ser Asn Ser Val His Glu Pro Gln Val Ser Ser Phe Gly Asn
245 250 255
Leu Ile Glu His Leu Pro Glu Tyr Thr Glu Thr Phe Val Asn Tyr Met
260 265 270
Thr Thr His Glu Asn Lys Asn Ser Tyr Gln Phe Cys Tyr Asp Arg Leu
275 280 285
Pro Arg Asp Gln Phe His Ala Arg Gly Gly Arg Tyr Asp Gln Gly Ala
290 295 300
Leu Thr Ser His Ser Val Met Asp Ala Leu Ile Arg Leu Gln Val Leu
305 310 315 320
Pro Pro Ala Pro Gly Gln Phe Asn Pro Gly Val Asn Asp Ile Ile Asp
325 330 335
Arg Asn His Thr Ala Tyr Val Asp Lys Ile Gln Gln Ala Ala Ala Ala
340 345 350
Tyr Leu Glu Arg Ala Gln Asn Val Phe Leu Met Glu Asp Gln Thr Leu
355 360 365
Leu Arg Leu Thr Ile Asp Thr Ile Thr Ala Leu Leu Leu Leu Arg Arg
370 375 380
Leu Leu Trp Asn Gly Asn Val Tyr Gly Asp Lys Leu Lys Asn Asn Phe
385 390 395 400
Gln Leu Gly Leu Ile Val Ser Glu Ala Thr Gly Thr Pro Thr Asn Asn
405 410 415
Val Ile Leu Arg Gly Ala Thr Gly Phe Asp Gly Lys Phe Lys Ser Gly
420 425 430
Asn Asn Asn Phe Gln Phe Leu Cys Glu Arg Tyr Ile Ala Pro Leu Tyr
435 440 445
Thr Leu Asn Arg Thr Thr Glu Leu Thr Glu Met Phe Pro Gly Leu Val
450 455 460
Ala Leu Cys Leu Asp Ala His Thr Gln Leu Ser Arg Gly Ser Leu Gly
465 470 475 480
Arg Thr Val Ile Asp Ile Ser Ser Gly Gln Tyr Gln Asp Arg Leu Ile
485 490 495
Set Leu Ile Ala Leu Glu Leu Glu His Arg Arg Gln Asn Val Thr Ser
500 505 510
Leu Pro Ile Ala Ala Val Val Ser Ile His Asp Ser Val Met Leu Gln
515 520 525
Tyr Glu Arg Gly Leu Gly Met Leu Met His Gln Pro Arg Val Arg Ala
530 535 540
Ala Leu Glu Glu Ser Arg Arg Leu Ala Gln Phe Asn Val Asn Ser Asp
545 550 555 560
Tyr Asp Leu Leu Tyr Phe Val Cys Leu Gly Val Ile Pro Gln Phe Ala
565 570 575
Set Thr Pro
<210>63
<211>605
<212>PRT
<213>水痘带状疱疹
<400>63
Met Ala Ala Glu Ala Asp Glu Glu Asn Cys Glu Ala Leu Tyr Val Ala
1 5 10 15
Gly Tyr Leu Ala Leu Tyr Ser Lys Asp Glu Gly Glu Leu Asn Ile Thr
20 25 30
Pro Glu Ile Val Arg Ser Ala Leu Pro Pro Thr Ser Lys Ile Pro Ile
35 40 45
Asn Ile Asp His Arg Lys Asp Cys Val Val Gly Glu Val Ile Ala Ile
50 55 60
Ile Glu Asp Ile Arg Gly Pro Phe Phe Leu Gly Ile Val Arg Cys Pro
65 70 75 80
Gln Leu His Ala Val Leu Phe Glu Ala Ala His Ser Asn Phe Phe Gly
85 90 95
Asn Arg Asp Ser Val Leu Ser Pro Leu Glu Arg Ala Leu Tyr Leu Val
100 105 110
Thr Asn Tyr Leu Pro Ser Val Ser Leu Ser Ser Lys Arg Leu Ser Pro
115 120 125
Asn Glu Ile Pro Asp Gly Asn Phe Phe Thr His Val Ala Leu Cys Val
130 135 140
Val Gly Arg Arg Val Gly Thr Val Val Asn Tyr Asp Cys Thr Pro Glu
145 150 155 160
Ser Ser Ile Glu Pro Phe Arg Val Leu Ser Met Glu Ser Lys Ala Arg
165 170 175
Leu Leu Ser Leu Val Lys Asp Tyr Ala Gly Leu Asn Lys Val Trp Lys
180 185 190
Val Ser Glu Asp Lys Leu Ala Lys Val Leu Leu Ser Thr Ala Val Asn
195 200 205
Asn Met Leu Leu Arg Asp Arg Trp Asp Val Val Ala Lys Arg Arg Arg
210 215 220
Glu Ala Gly Ile Met Gly His Val Tyr Leu Gln Ala Ser Thr Gly Tyr
225 230 235 240
Gly Leu Ala Arg Ile Thr Asn Val Asn Gly Val Glu Ser Lys Leu Pro
245 250 255
Asn Ala Gly Val Ile Asn Ala Thr Phe His Pro Gly Gly Pro Ile Tyr
260 265 270
Asp Leu Ala Leu Gly Val Gly Glu Ser Asn Glu Asp Cys Glu Lys Thr
275 280 285
Val Pro His Leu Lys Val Thr Gln Leu Cys Arg Asn Asp Ser Asp Met
290 295 300
Ala Ser Val Ala Gly Asn Ala Ser Asn Ile Ser Pro Gln Pro Pro Ser
305 310 315 320
Gly Val Pro Thr Gly Gly Glu Phe Val Leu Ile Pro Thr Ala Tyr Tyr
325 330 335
Ser Gln Leu Leu Thr Gly Gln Thr Lys Asn Pro Gln Val Ser Ile Gly
340 345 350
Ala Pro Asn Asn Gly Gln Tyr Ile Val Gly Pro Tyr Gly Ser Pro His
355 360 365
Pro Pro Ala Phe Pro Pro Asn Thr Gly Gly Tyr Gly Cys Pro Pro Gly
370 375 380
His Phe Gly Gly Pro Tyr Gly Phe Pro Gly Tyr Pro Pro Pro Asn Arg
385 390 395 400
Leu Glu Met Gln Met Ser Ala Phe Met Asn Ala Leu Ala Ala Glu Arg
405 410 415
Gly Ile Asp Leu Gln Thr Pro Cys Val Asn Phe Pro Asp Lys Thr Asp
420 425 430
Val Arg Arg Pro Gly Lys Arg Asp Phe Lys Ser Met Asp Gln Arg Glu
435 440 445
Leu Asp Ser Phe Tyr Ser Gly Glu Ser Gln Met Asp Gly Glu Phe Pro
450 455 460
Set Asn Ile Tyr Phe Pro Gly Glu Pro Thr Tyr Ile Thr His Arg Arg
465 470 475 480
Arg Arg Val Ser Pro Ser Tyr Trp Gln Arg Arg His Arg Val Ser Asn
485 490 495
Gly Gln His Glu Glu Leu Ala Gly Val Val Ala Lys Leu Gln Gln Glu
500 505 510
Val Thr Glu Leu Lys Ser Gln Asn Gly Thr Gln Met Pro Leu Ser His
515 520 525
His Thr Asn Ile Pro Glu Gly Thr Arg Asp Pro Arg Ile Ser Ile Leu
530 535 540
Leu Lys Gln Leu Gln Ser Val Ser Gly Leu Cys Ser Ser Gln Asn Thr
545 550 555 560
Thr Ser Thr Pro His Thr Asp Thr Val Gly Gln Asp Val Asn Ala Val
565 570 575
Glu Ala Ser Ser Lys Ala Pro Leu Ile Gln Gly Ser Thr Ala Asp Asp
580 585 590
Ala Asp Met Phe Ala Asn Gln Met Met Val Gly Arg Cys
595 600 605
<210>64
<211>1194
<212>PRT
<213>水痘带状疱疹
<400>64
Met Ala Ile Arg Thr Gly Phe Cys Asn Pro Phe Leu Thr Gln Ala Ser
1 5 10 15
Gly Ile Lys Tyr Asn Pro Arg Thr Gly Arg Gly Ser Asn Arg Glu Phe
20 25 30
Leu His Ser Tyr Lys Thr Thr Met Ser Ser Phe Gln Phe Leu Ala Pro
35 40 45
Lys Cys Leu Asp Glu Asp Val Pro Met Glu Glu Arg Lys Gly Val His
50 55 60
Val Gly Thr Leu Ser Arg Pro Pro Lys Val Tyr Cys Asn Gly Lys Glu
65 70 75 80
Val Pro Ile Leu Asp Phe Arg Cys Ser Ser Pro Trp Pro Arg Arg Val
85 90 95
Asn Ile Trp Gly Glu Ile Asp Phe Arg Gly Asp Lys Phe Asp Pro Arg
100 105 110
Phe Asn Thr Phe His Val Tyr Asp Ile Val Glu Thr Thr Glu Ala Ala
115 120 125
Ser Asn Gly Asp Val Ser Arg Phe Ala Thr Ala Thr Arg Pro Leu Gly
130 135 140
Thr Val Ile Thr Leu Leu Gly Met Ser Arg Cys Gly Lys Arg Val Ala
145 150 155 160
Val His Val Tyr Gly Ile Cys Gln Tyr Phe Tyr Ile Asn Lys Ala Glu
165 170 175
Val Asp Thr Ala Cys Gly Ile Arg Ser Gly Ser Glu Leu Ser Val Leu
180 185 190
Leu Ala Glu Cys Leu Arg Ser Ser Met Ile Thr Gln Asn Asp Ala Thr
195 200 205
Leu Asn Gly Asp Lys Asn Ala Phe His Gly Thr Ser Phe Lys Ser Ala
210 215 220
Ser Pro Glu Ser Phe Arg Val Glu Val Ile Glu Arg Thr Asp Val Tyr
225 230 235 240
Tyr Tyr Asp Thr Gln Pro Cys Ala Phe Tyr Arg Val Tyr Ser Pro Ser
245 250 255
Ser Lys Phe Thr Asn Tyr Leu Cys Asp Asn Phe His Pro Glu Leu Lys
260 265 270
Lys Tyr Glu Gly Arg Val Asp Ala Thr Thr Arg Phe Leu Met Asp Asn
275 280 285
Pro Gly Phe Val Ser Phe Gly Trp Tyr Gln Leu Lys Pro Gly Val Asp
290 295 300
Gly Glu Arg Val Arg Val Arg Pro Ala Ser Arg Gln Leu Thr Leu Ser
305 310 315 320
Asp Val Glu Ile Asp Cys Met Ser Asp Asn Leu Gln Ala Ile Pro Asn
325 330 335
Asp Asp Ser Trp Pro Asp Tyr Lys Leu Leu Cys Phe Asp Ile Glu Cys
340 345 350
Lys Ser Gly Gly Ser Asn Glu Leu Ala Phe Pro Asp Ala Thr His Leu
355 360 365
Glu Asp Leu Val Ile Gln Ile Ser Cys Leu Leu Tyr Ser Ile Pro Arg
370 375 380
Gln Ser Leu Glu His Ile Leu Leu Phe Ser Leu Gly Ser Cys Asp Leu
385 390 395 400
Pro Gln Arg Tyr Val Gln Glu Met Lys Asp Ala Gly Leu Pro Glu Pro
405 410 415
Thr Val Leu Glu Phe Asp Ser Glu Phe Glu Leu Leu Ile Ala Phe Met
420 425 430
Thr Leu Val Lys Gln Tyr Ala Pro Glu Phe Ala Thr Gly Tyr Asn Ile
435 440 445
Val Asn Phe Asp Trp Ala Phe Ile Met Glu Lys Leu Asn Ser Ile Tyr
450 455 460
Ser Leu Lys Leu Asp Gly Tyr Gly Ser Ile Asn Arg Gly Gly Leu Phe
465 470 475 480
Lys Ile Trp Asp Val Gly Lys Ser Gly Phe Gln Arg Arg Ser Lys Val
485 490 495
Lys Ile Asn Gly Leu Ile Ser Leu Asp Met Tyr Ala Ile Ala Thr Glu
500 505 510
Lys Leu Lys Leu Ser Ser Tyr Lys Leu Asp Ser Val Ala Arg Glu Ala
515 520 525
Leu Asn Glu Ser Lys Arg Asp Leu Pro Tyr Lys Asp Ile Pro Gly Tyr
530 535 540
Tyr Ala Ser Gly Pro Asn Thr Arg Gly Ile Ile Gly Glu Tyr Cys Ile
545 550 555 560
Gln Asp Ser Ala Leu Val Gly Lys Leu Phe Phe Lys Tyr Leu Pro His
565 570 575
Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Arg Ile Thr Leu Thr Lys
580 585 590
Ala Ile Tyr Asp Gly Gln Gln Val Arg Ile Tyr Thr Cys Leu Leu Gly
595 600 605
Leu Ala Ser Ser Arg Gly Phe Ile Leu Pro Asp Gly Gly Tyr Pro Ala
610 615 620
Thr Phe Glu Tyr Lys Asp Val Ile Pro Asp Val Gly Asp Val Glu Glu
625 630 635 640
Glu Met Asp Glu Asp Glu Ser Val Ser Pro Thr Gly Thr Ser Ser Gly
645 650 655
Arg Asn Val Gly Tyr Lys Gly Ala Arg Val Phe Asp Pro Asp Thr Gly
660 665 670
Phe Tyr Ile Asp Pro Val Val Val Leu Asp Phe Ala Ser Leu Tyr Pro
675 680 685
Ser Ile Ile Gln Ala His Asn Leu Cys Phe Thr Thr Leu Thr Leu Asn
690 695 700
Phe Glu Thr Val Lys Arg Leu Asn Pro Ser Asp Tyr Ala Thr Phe Thr
705 710 715 720
Val Gly Gly Lys Arg Leu Phe Phe Val Arg Ser Asn Val Arg Glu Ser
725 730 735
Leu Leu Gly Val Leu Leu Lys Asp Trp Leu Ala Met Arg Lys Ala Ile
740 745 750
Arg Ala Arg Ile Pro Gly Ser Ser Ser Asp Glu Ala Val Leu Leu Asp
755 760 765
Lys Gln Gln Ala Ala Ile Lys Val Val Cys Asn Ser Val Tyr Gly Phe
770 775 780
Thr Gly Val Ala Gln Gly Phe Leu Pro Cys Leu Tyr Val Ala Ala Thr
785 790 795 800
Val Thr Thr Ile Gly Arg Gln Met Leu Leu Ser Thr Arg Asp Tyr Ile
805 810 815
His Asn Asn Trp Ala Ala Phe Glu Arg Phe Ile Thr Ala Phe Pro Asp
820 825 830
Ile Glu Ser Ser Val Leu Ser Gln Lys Ala Tyr Glu Val Lys Val Ile
835 840 845
Tyr Gly Asp Thr Asp Ser Val Phe Ile Arg Phe Lys Gly Val Ser Val
850 855 860
Glu Gly Ile Ala Lys Ile Gly Glu Lys Met Ala His Ile Ile Ser Thr
865 870 875 880
Ala Leu Phe Cys Pro Pro Ile Lys Leu Glu Cys Glu Lys Thr Phe Ile
885 890 895
Lys Leu Leu Leu Ile Thr Lys Lys Lys Tyr Ile Gly Val Ile Tyr Gly
900 905 910
Gly Lys Val Leu Met Lys Gly Val Asp Leu Val Arg Lys Asn Asn Cys
915 920 925
Gln Phe Ile Asn Asp Tyr Ala Arg Lys Leu Val Glu Leu Leu Leu Tyr
930 935 940
Asp Asp Thr Val Ser Arg Ala Ala Ala Glu Ala Ser Cys Val Ser Ile
945 950 955 960
Ala Glu Trp Asn Arg Arg Ala Met Pro Ser Gly Met Ala Gly Phe Gly
965 970 975
Arg Ile Ile Ala Asp Ala His Arg Gln Ile Thr Ser Pro Lys Leu Asp
980 985 990
Ile Asn Lys Phe Val Met Thr Ala Glu Leu Ser Arg Pro Pro Ser Ala
995 1000 1005
Tyr Ile Asn Arg Arg Leu Ala His Leu Thr Val Tyr Tyr Lys Leu
1010 1015 1020
Val Met Arg Gln Gly Gln Ile Pro Asn Val Arg Glu Arg Ile Pro
1025 1030 1035
Tyr Val Ile Val Ala Pro Thr Asp Glu Val Glu Ala Asp Ala Lys
1040 1045 1050
Ser Val Ala Leu Leu Arg Gly Asp Pro Leu Gln Asn Thr Ala Gly
1055 1060 1065
Lys Arg Cys Gly Glu Ala Lys Arg Lys Leu Ile Ile Ser Asp Leu
1070 1075 1080
Ala Glu Asp Pro Ile His Val Thr Ser His Gly Leu Ser Leu Asn
1085 1090 1095
Ile Asp Tyr Tyr Phe Ser His Leu Ile Gly Thr Ala Ser Val Thr
1100 1105 1110
Phe Lys Ala Leu Phe Gly Asn Asp Thr Lys Leu Thr Glu Arg Leu
1115 1120 1125
Leu Lys Arg Phe Ile Pro Glu Thr Arg Val Val Asn Val Lys Met
1130 1135 1140
Leu Asn Arg Leu Gln Ala Ala Gly Phe Val Cys Ile His Ala Pro
1145 1150 1155
Cys Trp Asp Asn Lys Met Asn Thr Glu Ala Glu Ile Thr Glu Glu
1160 1165 1170
Glu Gln Ser His Gln Ile Met Arg Arg Val Phe Cys Ile Pro Lys
1175 1180 1185
Ala Ile Leu His Gln Ser
1190
<210>65
<211>156
<212>PRT
<213>水痘带状疱疹
<400>65
Met Tyr Glu Ser Glu Asn Ala Ser Glu His His Pro Glu Leu Glu Asp
1 5 10 15
Val Phe Ser Glu Asn Thr Gly Asp Ser Asn Pro Ser Met Gly Ser Ser
20 25 30
Asp Ser Thr Arg Ser Ile Ser Gly Met Arg Ala Arg Asp Leu Ile Thr
35 40 45
Asp Thr Asp Val Asn Leu Leu Asn Ile Asp Ala Leu Glu Ser Lys Tyr
50 55 60
Phe Pro Ala Asp Ser Thr Phe Thr Leu Ser Val Trp Phe Glu Asn Leu
65 70 75 80
Ile Pro Pro Glu Ile Glu Ala Ile Leu Pro Thr Thr Asp Ala Gln Leu
85 90 95
Asn Tyr Ile Ser Phe Thr Ser Arg Leu Ala Ser Val Leu Lys His Lys
100 105 ll0
Glu Ser Asn Asp Ser Glu Lys Ser Ala Tyr Val Val Pro Cys Glu His
115 120 125
Ser Ala Ser Val Thr Arg Arg Arg Glu Arg Phe Ala Gly Val Met Ala
130 135 140
Lys Phe Leu Asp Leu His Glu Ile Leu Lys Asp Ala
145 150 155
<210>66
<211>269
<212>PRT
<213>水痘带状疱疹
<400>66
Met Ser Arg Arg Thr Tyr Val Arg Ser Glu Arg Arg Arg Gly Cys Gly
1 5 10 15
Asp Asn Leu Leu Gln Arg Ile Arg Leu Val Val Pro Ser Ala Leu Gln
20 25 30
Cys Cys Asp Gly Asp Leu Pro Ile Phe Asp Pro Gln Arg Pro Pro Ala
35 40 45
Arg Cys Val Phe Gln Phe Asn Gly Glu Asp Asn Val Ser Glu Ala Phe
50 55 60
Pro Val Glu Tyr Ile Met Arg Leu Met Ala Asn Trp Ala Gln Val Asp
65 70 75 80
Cys Asp Pro Tyr Ile Lys Ile Gln Asn Thr Gly Val Ser Val Leu Phe
85 90 95
Gln Gly Phe Phe Phe Arg Pro Thr Asn Ala Pro Val Ala Glu Val Ser
100 105 110
Ile Asp Ser Asn Asn Val Ile Leu Ser Ser Thr Leu Ser Thr Gly Ile
115 120 125
Asn Leu Ser Ala Leu Glu Ser Ile Lys Arg Gly Gly Gly Ile Asp Arg
130 135 140
Arg Pro Leu Gln Ala Leu Met Trp Val Asn Cys Phe Val Arg Met Pro
145 150 155 160
Tyr Val Gln Leu Ser Phe Arg Phe Met Gly Pro Glu Asp Pro Ser Arg
165 170 175
Thr Ile Lys Leu Met Ala Arg Ala Thr Asp Ala Tyr Met Tyr Lys Glu
180 185 190
Thr Gly Asn Asn Leu Asp Glu Tyr Ile Arg Trp Arg Pro Ser Phe Arg
195 200 205
Ser Pro Pro Glu Asn Gly Ser Pro Asn Thr Ser Val Gln Met Gln Ser
210 215 220
Asp Ile Lys Pro Ala Leu Pro Asp Thr Gln Thr Thr Arg Val Trp Lys
225 230 235 240
Leu Ala Leu Pro Val Ala Asn Val Thr Tyr Ala Leu Phe Ile Val Ile
245 250 255
Val Leu Val Val Val Leu Gly Ala Val Leu Phe Trp Lys
260 265
<210>67
<211>235
<212>PRT
<213>水痘带状疱疹
<400>67
Met Thr Gln Pro Ala Ser Ser Arg Val Val Phe Asp Pro Ser Asn Pro
1 5 10 15
Thr Thr Phe Ser Val Glu Ala Ile Ala Ala Tyr Thr Pro Val Ala Leu
20 25 30
Ile Arg Leu Leu Asn Ala Ser Gly Pro Leu Gln Pro Gly His Arg Val
35 40 45
Asp Ile Ala Asp Ala Arg Ser Ile Tyr Thr Val Gly Ala Ala Ala Ser
50 55 60
Ala Ala Arg Ala Arg Ala Asn His Asn Ala Asn Thr Ile Arg Arg Thr
65 70 75 80
Ala Met Phe Ala Glu Thr Asp Pro Met Thr Trp Leu Arg Pro Thr Val
85 90 95
Gly Leu Lys Arg Thr Phe Asn Pro Arg Ile Ile Arg Pro Gln Pro Pro
100 105 ll0
Asn Pro Ser Met Ser Leu Gly Ile Ser Gly Pro Thr Ile Leu Pro Gln
115 120 125
Lys Thr Gln Ser Ala Asp Gln Ser Ala Leu Gln Gln Pro Ala Ala Leu
130 135 140
Ala Phe Ser Gly Ser Ser Pro Gln His Pro Pro Pro Gln Thr Thr Ser
145 150 155 160
Ala Ser Val Gly Gln Gln Gln His Val Val Ser Gly Ser Ser Gly Gln
165 170 175
Gln Pro Gln Gln Gly Ala Gln Ser Ser Thr Val Gln Pro Thr Thr Gly
180 185 190
Ser Pro Pro Ala Ala Gln Gly Val Pro Gln Ser Thr Pro Pro Pro Thr
195 200 205
Gln Asn Thr Pro Gln Gly Gly Lys Gly Gln Thr Leu Ser His Thr Gly
210 215 220
Gln Ser Gly Asn Ala Ser Arg Ser Arg Arg Val
225 230 235
<210>68
<211>483
<212>PRT
<213>水痘带状疱疹
<400>68
Met Gly Ser Gln Pro Thr Asn Ser His Phe Thr Leu Asn Glu Gln Thr
1 5 10 15
Leu Cys Gly Thr Asn Ile Ser Leu Leu Gly Asn Asn Arg Phe Ile Gln
20 25 30
Ile Gly Asn Gly Leu His Met Thr Tyr Ala Pro Gly Phe Phe Gly Asn
35 40 45
Trp Ser Arg Asp Leu Thr Ile Gly Pro Arg Phe Gly Gly Leu Asn Lys
50 55 60
Gln Pro Ile His Val Pro Pro Lys Arg Thr Glu Thr Ala Ser Ile Gln
65 70 75 80
Val Thr Pro Arg Ser Ile Val Ile Asn Arg Met Asn Asn Ile Gln Ile
85 90 95
Asn Pro Thr Ser Ile Gly Asn Pro Gln Val Thr Ile Arg Leu Pro Leu
100 105 110
Asn Asn Phe Lys Ser Thr Thr Gln Leu Ile Gln Gln Val Ser Leu Thr
115 120 125
Asp Phe Phe Arg Pro Asp Ile Glu His Ala Gly Ser Ile Val Leu Ile
130 135 140
Leu Arg His Pro Ser Asp Met Ile Gly Glu Ala Asn Thr Leu Thr Gln
145 150 155 160
Ala Gly Arg Asp Pro Asp Val Leu Leu Glu Gly Leu Arg Asn Leu Phe
165 170 175
Asn Ala Cys Thr Ala Pro Trp Thr Val Gly Glu Gly Gly Gly Leu Arg
180 185 190
Ala Tyr Val Thr Ser Leu Ser Phe Ile Ala Ala Cys Arg Ala Glu Glu
195 200 205
Tyr Thr Asp Lys Gln Ala Ala Asp Ala Asn Arg Thr Ala Ile Val Ser
210 215 220
Ala Tyr Gly Cys Ser Arg Met Glu Thr Arg Leu Ile Arg Phe Ser Glu
225 230 235 240
Cys Leu Arg Ala Met Val Gln Cys His Val Phe Pro His Arg Phe Ile
245 250 255
Ser Phe Phe Gly Ser Leu Leu Glu Tyr Thr Ile Gln Asp Asn Leu Cys
260 265 270
Asn Ile Thr Ala Val Ala Lys Gly Pro Gln Glu Ala Ala Arg Thr Asp
275 280 285
Lys Thr Ser Thr Arg Arg Val Thr Ala Asn Ile Pro Ala Cys Val Phe
290 295 300
Trp Asp Val Asp Lys Asp Leu His Leu Ser Ala Asp Gly Leu Lys His
305 310 315 320
Val Phe Leu Val Phe Val Tyr Thr Gln Arg Arg Gln Arg Glu Gly Val
325 330 335
Arg Leu His Leu Ala Leu Ser Gln Leu Asn Glu Gln Cys Phe Gly Arg
340 345 350
Gly Ile Gly Phe Leu Leu Gly Arg Ile Arg Ala Glu Asn Ala Ala Trp
355 360 365
Gly Thr Glu Gly Val Ala Asn Thr His Gln Pro Tyr Asn Thr Arg Ala
370 375 380
Leu Pro Leu Val Gln Leu Ser Asn Asp Pro Thr Ser Pro Arg Cys Ser
385 390 395 400
Ile Gly Glu Ile Thr Gly Val Asn Trp Asn Leu Ala Arg Gln Arg Leu
405 410 415
Tyr Gln Trp Thr Gly Asp Phe Arg Gly Leu Pro Thr Gln Leu Ser Cys
420 425 430
Met Tyr Ala Ala Tyr Thr Leu Ile Gly Thr Ile Pro Ser Glu Ser Val
435 440 445
Arg Tyr Thr Arg Arg Met Glu Arg Phe Gly Gly Tyr Asn Val Pro Thr
450 455 460
Ile Trp Leu Glu Gly Val Val Trp Gly Gly Thr Asn Thr Trp Asn Glu
465 470 475 480
Cys Tyr Tyr
<210>69
<211>775
<212>PRT
<213>水痘带状疱疹
<400>69
Met Glu Phe Lys Arg Ile Phe Asn Thr Val His Asp Ile Ile Asn Arg
1 5 10 15
Leu Cys Gln His Gly Tyr Lys Glu Tyr Ile Ile Pro Pro Glu Ser Thr
20 25 30
Thr Pro Val Glu Leu Met Glu Tyr Ile Ser Thr Ile Val Ser Lys Leu
35 40 45
Lys Ala Val Thr Arg Gln Asp Glu Arg Val Tyr Arg Cys Cys Gly Glu
50 55 60
Leu Ile His Cys Arg Ile Asn Leu Arg Ser Val Ser Met Glu Thr Trp
65 70 75 80
Leu Thr Ser Pro Ile Leu Cys Leu Thr Pro Arg Val Arg Gln Ala Ile
85 90 95
Glu Gly Arg Arg Asp Glu Ile Arg Arg Ala Ile Leu Glu Pro Phe Leu
100 105 110
Lys Asp Gln Tyr Pro Ala Leu Ala Thr Leu Gly Leu Gln Ser Ala Leu
115 120 125
Lys Tyr Glu Asp Phe Tyr Leu Thr Lys Leu Glu Glu Gly Lys Leu Glu
130 135 140
Ser Leu Cys Gln Phe Phe Leu Arg Leu Ala Ala Thr Val Thr Thr Glu
145 150 155 160
Ile Val Asn Leu Pro Lys Ile Ala Thr Leu Ile Pro Gly Ile Asn Asp
165 170 175
Gly Tyr Thr Trp Thr Asp Val Cys Arg Val Phe Phe Thr Ala Leu Ala
180 185 190
Cys Gln Lys Ile Val Pro Ala Thr Pro Val Met Met Phe Leu Gly Arg
195 200 205
Glu Thr Gly Ala Thr Ala Ser Cys Tyr Leu Met Asp Pro Glu Ser Ile
210 215 220
Thr Val Gly Arg Ala Val Arg Ala Ile Thr Gly Asp Val Gly Thr Val
225 230 235 240
Leu Gln Ser Arg Gly Gly Val Gly Ile Ser Leu Gln Ser Leu Asn Leu
245 250 255
Ile Pro Thr Glu Asn Gln Thr Lys Gly Leu Leu Ala Val Leu Lys Leu
260 265 270
Leu Asp Cys Met Val Met Ala Ile Asn Ser Asp Cys Glu Arg Pro Thr
275 280 285
Gly Val Cys Val Tyr Ile Glu Pro Trp His Val Asp Leu Gln Thr Val
290 295 300
Leu Ala Thr Arg Gly Met Leu Val Arg Asp Glu Ile Phe Arg Cys Asp
305 310 315 320
Asn Ile Phe Cys Cys Leu Trp Thr Pro Asp Leu Phe Phe Glu Arg Tyr
325 330 335
Leu Ser Tyr Leu Lys Gly Ala Ser Asn Val Gln Trp Thr Leu Phe Asp
340 345 350
Asn Arg Ala Asp Ile Leu Arg Thr Leu His Gly Glu Ala Phe Thr Ser
355 360 365
Thr Tyr Leu Arg Leu Glu Arg Glu Gly Leu Gly Val Ser Ser Val Pro
370 375 380
Ile Gln Asp Ile Ala Phe Thr Ile Ile Arg Ser Ala Ala Val Thr Gly
385 390 395 400
Ser Pro Phe Leu Met Phe Lys Asp Ala Cys Asn Arg Asn Tyr His Met
405 410 415
Asn Thr Gln Gly Asn Ala Ile Thr Gly Ser Asn Leu Cys Thr Glu Ile
420 425 430
Val Gln Lys Ala Asp Ala His Gln His Gly Val Cys Asn Leu Ala Ser
435 440 445
Ile Asn Leu Thr Thr Cys Leu Ser Lys Gly Pro Val Ser Phe Asn Leu
450 455 460
Asn Asp Leu Gln Leu Thr Ala Arg Thr Thr Val Ile Phe Leu Asn Gly
465 470 475 480
Val Leu Ala Ala Gly Asn Phe Pro Cys Lys Lys Ser Cys Lys Gly Val
485 490 495
Lys Asn Asn Arg Ser Leu Gly Ile Gly Ile Gln Gly Leu His Thr Thr
500 505 510
Cys Leu Arg Leu Gly Phe Asp Leu Thr Ser Gln Pro Ala Arg Arg Leu
515 520 525
Asn Val Gln Ile Ala Glu Leu Met Leu Tyr Glu Thr Met Lys Thr Ser
530 535 540
Met Glu Met Cys Lys Ile Gly Gly Leu Ala Pro Phe Lys Gly Phe Thr
545 550 555 560
Glu Ser Lys Tyr Ala Lys Gly Trp Leu His Gln Asp Gly Phe Ser Thr
565 570 575
Ile Ser Tyr Leu Asp Leu Pro Trp Cys Thr Leu Arg Asp Asp Ile Cys
580 585 590
Ala Tyr Gly Leu Tyr Asn Ser Gln Phe Leu Ala Leu Met Pro Thr Val
595 600 605
Ser Ser Ala Gln Val Thr Glu Cys Ser Glu Gly Phe Ser Pro Ile Tyr
610 615 620
Asn Asn Met Phe Ser Lys Val Thr Thr Ser Gly Glu Leu Leu Arg Pro
625 630 635 640
Asn Leu Asp Leu Met Asp Glu Leu Arg Asp Met Tyr Ser Cys Glu Glu
645 650 655
Lys Arg Leu Glu Val Ile Asn Ile Leu Glu Lys Asn Gln Trp Ser Val
660 665 670
Ile Arg Ser Phe Gly Cys Leu Ser Asn Ser His Pro Leu Leu Lys Tyr
675 680 685
Lys Thr Ala Phe Glu Tyr Glu Gln Glu Asp Leu Val Asp Met Cys Ala
690 695 700
Glu Arg Ala Pro Phe Ile Asp Gln Ser Gln Ser Met Thr Leu Phe Ile
705 710 715 720
Glu Glu Arg Pro Asp Gly Thr Ile Pro Ala Ser Lys Ile Met Asn Leu
725 730 735
Leu Ile Arg Ala Tyr Lys Ala Gly Leu Lys Thr Gly Met Tyr Tyr Cys
740 745 750
Lys Ile Arg Lys Ala Thr Asn Ser Gly Leu Phe Ala Gly Gly Glu Leu
755 760 765
Thr Cys Thr Ser Cys Ala Leu
770 775
<210>70
<211>306
<212>PRT
<213>水痘带状疱疹
<400>70
Met Asp Gln Lys Asp Cys Ser His Phe Phe Tyr Arg Pro Glu Cys Pro
1 5 10 15
Asp Ile Asn Asn Leu Arg Ala Leu Ser Ile Ser Asn Arg Trp Leu Glu
20 25 30
Set Asp Phe Ile Ile Glu Asp Asp Tyr Gln Tyr Leu Asp Cys Leu Thr
35 40 45
Glu Asp Glu Leu Ile Phe Tyr Arg Phe Ile Phe Thr Phe Leu Ser Ala
50 55 60
Ala Asp Asp Leu Val Asn Val Asn Leu Gly Ser Leu Thr Gln Leu Phe
65 70 75 80
Set Gln Lys Asp Ile His His Tyr Tyr Ile Glu Gln Glu Cys Ile Glu
85 90 95
Val Val His Ala Arg Val Tyr Ser Gln Ile Gln Leu Met Leu Phe Arg
100 105 110
Gly Asp Glu Ser Leu Arg Val Gln Tyr Val Asn Val Thr Ile Asn Asn
115 120 125
Pro Ser Ile Gln 6ln Lys Val Gln Trp Leu Glu Glu Lys Val Arg Asp
130 135 l40
Asn Pro Ser Val Ala Glu Lys Tyr Ile Leu Met Ile Leu Ile Glu Gly
145 150 155 160
Ile Phe Phe Val Ser Ser Phe Ala Ala Ile Ala Tyr Leu Arg Asn Asn
165 170 175
Gly Leu Phe Val Val Thr Cys Gln Phe Asn Asp Leu Ile Ser Arg Asp
180 185 190
Glu Ala Ile His Thr Ser Ala Ser Cys Cys Ile Tyr Asn Asn Tyr Val
195 200 205
Pro Glu Lys Pro Ala Ile Thr Arg Ile His Gln Leu Phe Ser Glu Ala
210 215 220
Val Glu Ile Glu Cys Ala Phe Leu Lys Ser His Ala Pro Lys Thr Arg
225 230 235 240
Leu Val Asn Val Asp Ala Ile Thr Gln Tyr Val Lys Phe Ser Ala Asp
245 250 255
Arg Leu Leu Ser Ala Ile Asn Val Pro Lys Leu Phe Asn Thr Pro Pro
260 265 270
Pro Asp Ser Asp Phe Pro Leu Ala Phe Met Ile Ala Asp Lys Asn Thr
275 280 285
Asn Phe Phe Glu Arg His Ser Thr Ser Tyr Ala Gly Thr Val Ile Asn
290 295 300
Asp Leu
305
<210>71
<211>408
<212>PRT
<213>水痘带状疱疹
<400>71
Met Asp Leu Arg Ser Arg Thr Asp Asp Ala Leu Asp Met Glu Leu His
1 5 10 15
Ala Gly Phe Asp Ala Pro Glu Ile Ala Arg Ala Val Leu Thr Glu Lys
20 25 30
Thr Leu Thr Gly Leu Ile Ser Ser Ile Ser Pro Leu Val Asn Arg Leu
35 40 45
Arg Asp Ser Ile Leu Ile Phe Ser Asp Glu Gly Leu Ile Ile His Cys
50 55 60
Ser Leu Glu Thr Glu Gln Leu Tyr Ile Pro Ile Pro Ala Asn Met Phe
65 70 75 80
Asp Gln Tyr Asn Trp Thr Gly Pro Arg Met Val Val Leu Ala Ala Thr
85 90 95
Glu Gly Arg Ser Ser Leu Ile Asp Ala Phe Arg His Thr Lys Asp Pro
100 105 110
Ser Thr Pro Thr Arg Leu Tyr Phe Lys Phe Thr Gly Gln Pro Pro Glu
115 120 125
Arg Ser Ile Ile Gln Thr Met Val Trp Gln Arg Pro Gly Asp Cys Gly
130 135 140
Pro Asp Asp Gln Val Gln Cys Tyr Lys Gln Val Val Lys Arg Glu Leu
145 150 155 160
Ala Cys Tyr Thr Met Met Phe Pro Asn Leu Thr Pro Asp Ile Ser Ile
165 170 175
Cys Leu Lys Arg Asp Gln Phe Thr Arg Leu Gln Arg Leu Leu Lys Thr
180 185 190
Phe Gly Phe Thr Thr Cys Phe Ile Leu Thr Ala Thr Asp Met Tyr Ile
195 200 205
Gln Thr Ala Gly Gly Gly Phe Ile Ser Phe Asn Val Ser Leu Asp Ile
210 215 220
Asn Gly Ser Lys Pro Thr Pro Tyr Asn Leu Ile Arg Ser Ile Thr Asn
225 230 235 240
Ser Lys Arg Ile Leu Asn Asn Val Val Tyr Gly Ser Gly Ser Met Arg
245 250 255
Glu Phe Gly Val Leu Leu Glu Thr His Ser Gly Phe Arg Ser Ala Val
260 265 270
Gln Asn Leu Lys Leu Thr Arg Asp Glu Thr Cys Tyr Ile Asn Phe Tyr
275 280 285
Leu Ala Leu Thr Asn Ser Pro Met Val Gly Leu Tyr Ile Gln Arg Ser
290 295 300
Ala Pro Val His Ser Phe Phe Tyr Ala Thr Phe Leu Ser Pro Lys Asp
305 310 315 320
Leu Lys Glu Lys Leu Thr Ser Met Gln Leu Phe Ala Asn Met Glu Ser
325 330 335
Val Lys Asp Glu Pro Pro Leu Lys Lys Arg Arg Asn Leu Leu Thr Lys
340 345 350
Arg Asn Glu Lys Asn Thr Gly Asn Lys Met Gly Gly Lys Leu Pro Glu
355 360 365
Thr Thr Trp Gln Glu Gly Ile Gly Ile Arg Glu Tyr Cys Val Ala Pro
370 375 380
Pro Val Asp Pro Ala Gly Thr Leu Asp Tyr Ser Glu Leu Ser Arg Glu
385 390 395 400
Set Asp Val Ile Cys Thr Val Lys
405
<210>72
<211>406
<212>PRT
<213>水痘带状疱疹
<400>72
Met Ala Val Asn Gly Glu Arg Ala Val His Asp Glu Asn Leu Gly Val
1 5 10 15
Leu Asp Arg Glu Leu Ile Arg Ala Gln Ser Ile Gln Gly Cys Val Gly
20 25 30
Asn Pro Gln Glu Cys Asn Ser Cys Ala Ile Thr Ser Ala Ser Arg Leu
35 40 45
Phe Leu Val Gly Leu Gln Ala Ser Val Ile Thr Ser Gly Leu Ile Leu
50 55 60
Gln Tyr His Val Cys Glu Ala Ala Val Asn Ala Thr Ile Met Gly Leu
65 70 75 80
Ile Val Val Ser Gly Leu Trp Pro Thr Ser Val Lys Phe Leu Arg Thr
85 90 95
Leu Ala Lys Leu Gly Arg Cys Leu Gln Thr Val Val Val Leu Gly Phe
100 105 110
Ala Val Leu Trp Ala Val Gly Cys Pro Ile Ser Arg Asp Leu Pro Phe
115 120 125
Val Glu Leu Leu Gly Ile Ser Ile Ser Ala Ile Thr Gly Thr Val Ala
130 135 140
Ala Val His Ile His Tyr Tyr Asn Phe Val Thr Thr Phe Asn Gly Pro
145 150 155 160
His Ile Tyr Phe Tyr Val Met Met Leu Gly Thr Gly Leu Gly Gly Leu
165 170 175
Leu Thr Val Ile Leu Tyr Met Tyr Val Ser Lys Tyr Glu Val Leu Ile
180 185 190
Gly Leu Cys Ile Ser Ile Val Thr Leu Val Ser Ile Val Asp Ala Ala
195 200 205
Thr Asp Leu Gln Asp Thr Cys Ile Tyr Arg Lys Asn Arg His Lys Gln
210 215 220
Leu Asn Thr Tyr Thr Asp Leu Gly Phe Ala Val Val Tyr Thr Gln Asn
225 230 235 240
Asp Arg Gly Arg Val Cys Asp His Arg Glu Ser Ser Arg Thr Leu Lys
245 250 255
Arg Val Phe Lys Gly Ile Arg Ile Met Ser Val Ile Pro Pro Val Leu
260 265 270
Tyr Ile Val Thr Pro Leu Met Trp Ala Ile Ser His Ile Ile Lys Leu
275 280 285
Asn His Phe Ile Lys Leu Thr Gln Val Thr Leu Ala Val Ser Ile Gly
290 295 300
Gly His Ile Ile Ala Phe Gly Leu Gln Gly Phe Ala Val Leu Tyr Gln
305 310 315 320
Glu Lys Lys Asn Leu Trp Val Ile Val Leu Tyr Thr Thr Thr Ser Val
325 330 335
Thr Gly Ile Ala Val Thr Phe Ala Gly Ile Ser Trp Gly Ala Ile Ile
340 345 350
Ile Leu Thr Ser Thr Val Ala Ala Gly Leu Thr Cys Ile Gln Met Met
355 360 365
Arg Leu Ser Val Lys Pro Ile Asp Cys Phe Met Ala Ser His Ile Thr
370 375 380
Lys Val Tyr His Val Cys Val Tyr Ile Ile Ile Asn Leu Cys Tyr Leu
385 390 395 400
Cys Gly Thr Tyr Val Ser
405
<210>73
<211>560
<212>PRT
<213>水痘带状疱疹
<400>73
Met Lys Arg Ile Gln Ile Asn Leu Ile Leu Thr Ile Ala Cys Ile Gln
1 5 10 15
Leu Ser Thr Glu Ser Gln Pro Thr Pro Val Ser Ile Thr Glu Leu Tyr
20 25 30
Thr Ser Ala Ala Thr Arg Lys Pro Asp Pro Ala Val Ala Pro Thr Ser
35 40 45
Ala Ala Ser Arg Lys Pro Asp Pro Ala Val Ala Pro Thr Ser Ala Ala
50 55 60
Ser Arg Lys Pro Asp Pro Ala Val Ala Pro Thr Ser Ala Ala Ser Arg
65 70 75 80
Lys Pro Asp Pro Ala Val Ala Pro Thr Ser Ala Ala Thr Arg Lys Pro
85 90 95
Asp Pro Ala Val Ala Pro Thr Ser Ala Ala Ser Arg Lys Pro Asp Pro
100 105 ll0
Ala Val Ala Pro Thr Ser Ala Ala Thr Arg Lys Pro Asp Pro Ala Val
115 120 125
Ala Pro Thr Ser Ala Ala Ser Arg Lys Pro Asp Pro Ala Ala Asn Thr
130 135 140
Gln His Ser Gln Pro Pro Phe Leu Tyr Glu Asn Ile Gln Cys Val His
145 150 155 160
Gly Gly Ile Gln Ser Ile Pro Tyr Phe His Thr Phe Ile Met Pro Cys
165 170 175
Tyr Met Arg Leu Thr Thr Gly Gln Gln Ala Ala Phe Lys Gln Gln Gln
180 185 190
Lys Thr Tyr Glu Gln Tyr Ser Leu Asp Pro Glu Gly Ser Asn Ile Thr
195 200 205
Arg Trp Lys Ser Leu Ile Arg Pro Asp Leu His Ile Glu Val Trp Phe
210 215 220
Thr Arg His Leu Ile Asp Pro His Arg Gln Leu Gly Asn Ala Leu Ile
225 230 235 240
Arg Met Pro Asp Leu Pro Val Met Leu Tyr Ser Asn Ser Ala Asp Leu
245 250 255
Asn Leu Ile Asn Asn Pro Glu Ile Phe Thr His Ala Lys Glu Asn Tyr
260 265 270
Val Ile Pro Asp Val Lys Thr Thr Ser Asp Phe Ser Val Thr Ile Leu
275 280 285
Ser Met Asp Ala Thr Thr Glu Gly Thr Tyr Ile Trp Arg Val Val Asn
290 295 300
Thr Lys Thr Lys Asn Val Ile Ser Glu His Ser Ile Thr Val Thr Thr
305 310 315 320
Tyr Tyr Arg Pro Asn Ile Thr Val Val Gly Asp Pro Val Leu Thr Gly
325 330 335
Gln Thr Tyr Ala Ala Tyr Cys Asn Val Ser Lys Tyr Tyr Pro Pro His
340 345 350
Ser Val Arg Val Arg Trp Thr Ser Arg Phe Gly Asn Ile Gly Lys Asn
355 360 365
Phe Ile Thr Asp Ala Ile Gln Glu Tyr Ala Asn Gly Leu Phe Ser Tyr
370 375 380
Val Ser Ala Val Arg Ile Pro Gln Gln Lys Gln Met Asp Tyr Pro Pro
385 390 395 400
Pro Ala Ile Gln Cys Asn Val Leu Trp Ile Arg Asp Gly Val Ser Asn
405 410 415
Met Lys Tyr Ser Ala Val Val Thr Pro Asp Val Tyr Pro Phe Pro Asn
420 425 430
Val Ser Ile Gly Ile Ile Asp Gly His Ile Val Cys Thr Ala Lys Cys
435 440 445
Val Pro Arg Gly Val Val His Phe Val Trp Trp Val Asn Asp Ser Pro
450 455 460
Ile Asn His Glu Asn Ser Glu Ile Thr Gly Val Cys Asp Gln Asn Lys
465 470 475 480
Arg Phe Val Asn Met Gln Ser Ser Cys Pro Thr Ser Glu Leu Asp Gly
485 490 495
Pro Ile Thr Tyr Ser Cys His Leu Asp Gly Tyr Pro Lys Lys Phe Pro
500 505 510
Pro Phe Ser Ala Val Tyr Thr Tyr Asp Ala Ser Thr Tyr Ala Thr Thr
515 520 525
Phe Ser Val Val Ala Val Ile Ile Gly Val Ile Ser Ile Leu Gly Thr
530 535 540
Leu Gly Leu Ile Ala Val Ile Ala Thr Leu Cys Ile Arg Cys Cys Ser
545 550 555 560
<210>74
<211>396
<212>PRT
<213>水痘带状疱疹
<400>74
Met Asn Glu Ala Val Ile Asp Pro Ile Leu Glu Thr Ala Val Asn Thr
1 5 10 15
Gly Asp Met Phe Cys Ser Gln Thr Ile Pro Asn Arg Cys Leu Lys Asp
20 25 30
Thr Ile Leu Ile Glu Val Gln Pro Glu Cys Ala Asp Thr Leu Gln Cys
35 40 45
Val Leu Asp Asp Lys Val Ser Arg His Gln Pro Leu Leu Leu Arg Asn
50 55 60
His Lys Lys Leu Glu Leu Pro Ser Glu Lys Ser Val Thr Arg Gly Gly
65 70 75 80
Phe Tyr Met Gln Gln Leu Glu Leu Leu Val Lys Ser Ala Pro Pro Asn
85 90 95
Glu Tyr Ala Leu Leu Leu Ile Gln Cys Lys Asp Thr Ala Leu Ala Asp
100 105 110
Glu Asp Asn Phe Phe Val Ala Asn Gly Val Ile Asp Ala Gly Tyr Arg
115 120 125
Gly Val Ile Ser Ala Leu Leu Tyr Tyr Arg Pro Gly Val Thr Val Ile
130 135 140
Leu Pro Gly His Leu Thr Ile Tyr Leu Phe Pro Val Lys Leu Arg Gln
145 150 155 160
Ser Arg Leu Leu Pro Lys Asn Val Leu Lys His Leu Asp Pro Ile Phe
165 170 175
Lys Ser Ile Gln Val Gln Pro Leu Ser Asn Ser Pro Ser Asn Tyr Glu
180 185 190
Lys Pro Val Ile Pro Glu Phe Ala Asp Ile Ser Thr Val Gln Gln Gly
195 200 205
Gln Pro Leu His Arg Asp Ser Ala Glu Tyr His Ile Asp Val Pro Leu
210 215 220
Thr Tyr Lys His Ile Ile Asn Pro Lys Arg Gln Glu Asp Ala Gly Tyr
225 230 235 240
Asp Ile Cys Val Pro Tyr Asn Leu Tyr Leu Lys Arg Asn Glu Phe Ile
245 250 255
Lys Ile Val Leu Pro Ile Ile Arg Asp Trp Asp Leu Gln His Pro Ser
260 265 270
Ile Asn Ala Tyr Ile Phe Gly Arg Ser Ser Lys Ser Arg Ser Gly Ile
275 280 285
Ile Val Cys Pro Thr Ala Trp Pro Ala Gly Glu His Cys Lys Phe Tyr
290 295 300
Val Tyr Asn Leu Thr Gly Asp Asp Ile Arg Ile Lys Thr Gly Asp Arg
305 310 315 320
Leu Ala Gln Val Leu Leu Ile Asp His Asn Thr Gln Ile His Leu Lys
325 330 335
His Asn Val Leu Ser Asn Ile Ala Phe Pro Tyr Ala Ile Arg Gly Lys
340 345 350
Cys Gly Ile Pro Gly Val Gln Trp Tyr Phe Thr Lys Thr Leu Asp Leu
355 360 365
Ile Ala Thr Pro Ser Glu Arg Gly Thr Arg Gly Phe Gly Ser Thr Asp
370 375 380
Lys Glu Thr Asn Asp Val Asp Phe Leu Leu Lys His
385 390 395
<210>75
<211>1083
<212>PRT
<213>水痘带状疱疹
<400>75
Met Asp Lys Ser Ser Lys Pro Thr Ile Arg Leu Leu Phe Ala Thr Lys
1 5 10 15
Gly Cys Ala Ile Ser His Ser Leu Leu Leu Leu Thr Gly Gln Ile Ser
20 25 30
Thr Glu Pro Leu Tyr Val Val Ser Tyr Thr Trp Thr Pro Asp Leu Asp
35 40 45
Asp Val Phe Val Lys Asn Gly Arg Glu Glu Ile Thr Gln Val Ile Pro
50 55 60
Thr Lys Arg Pro Arg Glu Val Thr Glu Asn Asp Glu Glu Asn Gln Ile
65 70 75 80
Met His Leu Phe Cys Ser Arg Asp Val Asn Val Ile Phe Tyr Leu Ile
85 90 95
Gly Gly Phe Ser Thr Gly Asp Val Arg Ser Arg Val Trp Pro Ile Phe
100 105 110
Phe Cys Cys Phe Lys Thr Gln Thr Asp Phe Lys Ala Leu Tyr Lys Ala
115 120 125
Leu Trp Tyr Gly Ala Pro Leu Asn Pro His Ile Ile Ser Asp Thr Leu
130 135 140
Cys Ile Ser Glu Thr Phe Asp Ile His Ser Glu Val Ile Gln Thr Leu
145 150 155 160
Met Val Thr Thr His His Leu Asn Arg Lys Gly Leu Ser Asp Asn Gly
165 170 175
Leu Cys Ile Thr Glu Ala Thr Leu Cys Lys Leu Val Lys Lys Ser Val
180 185 190
Gly Arg Gln Glu Leu Thr Ser Leu Tyr Ala His Tyr Glu Arg Gln Val
195 200 205
Leu Ala Ala Tyr Arg Arg Leu Tyr Trp Gly Tyr Gly Cys Ser Pro Phe
210 215 220
Trp Tyr Ile Val Arg Phe Gly Pro Ser Glu Lys Thr Leu Val Leu Ala
225 230 235 240
Thr Arg Tyr Tyr Leu Leu Gln Thr Asp Thr Ser Tyr Asn Thr Leu Glu
245 250 255
Thr Pro Leu Tyr Asp Leu Gln Ala Ile Lys Asp Leu Phe Leu Thr Tyr
260 265 270
Gln Val Pro Ala Leu Pro Asn Cys Ser Gly Tyr Asn Ile Ser Asp Leu
275 280 285
Leu Ser Phe Asp Lys Leu Ser Met Phe Cys Cys Ser Ser Thr Tyr Thr
290 295 300
Arg Gly Leu Thr Ala Lys Asn Ala Leu Ser Tyr Ile Leu Gln Arg Ile
305 310 315 320
His Thr Asp Thr Thr Glu Ile His Ala Val Ser Glu Tyr Ile Thr Asn
325 330 335
Asp Arg Lys Gly Leu Lys Val Pro Asp Arg Glu Phe Val Asp Tyr Ile
340 345 350
Tyr Leu Ala His Phe Glu Cys Phe Asn Arg Lys Gln Ile Ala Asp His
355 360 365
Leu Gln Ala Val Thr Tyr Ser Asp Phe Val Asn Lys Pro Val Leu Leu
370 375 380
Lys Ser Ser Asn Leu Gly Lys Arg Ala Thr Ala Asn Phe Phe Asn His
385 390 395 400
Val Arg Ser Arg Leu Asn Met Arg Asp Tyr Ile Lys Lys Asn Val Ile
405 410 415
Cys Asp Val Thr Glu Leu Gly Pro Glu Ile Gly His Lys Tyr Thr Ile
420 425 430
Thr Lys Thr Tyr Thr Leu Ser Leu Thr Tyr Ala Ala Lys Pro Ser Lys
435 440 445
Phe Ile Gly Val Cys Asp Leu Ala Thr Thr Leu Thr Arg Arg Val Glu
450 455 460
Asn Ile Glu Lys Gln Phe Ser Pro Tyr Gly Trp Ser Ser Thr Ile Pro
465 470 475 480
Ser Asn Pro Pro Gly Phe Asp Glu Leu Ser Asn Phe Glu Asp Ser Gly
485 490 495
Val Ser Ala Glu Ala Leu Arg Ala Ala Asn Phe Ala Asn Asp Thr Pro
500 505 510
Asn Gln Ser Gly Arg Thr Gly Phe Asp Thr Ser Pro Gly Ile Thr Lys
515 520 525
Leu Leu Leu Phe Phe Ser Ala Ala Thr Gly Ile Ala Thr His Asp Val
530 535 540
Ser Ile Leu Ser Tyr Lys Thr Pro Leu Glu Ala Leu Ile Gly His Ser
545 550 555 560
Glu Val Thr Gly Pro Met Pro Val Tyr Arg Val Ala Leu Pro His Gly
565 570 575
Ala Gln Ala Phe Ala Val Ile Ala Asn Asp Thr Trp Ser Ser Ile Thr
580 585 590
Asn Arg Tyr Thr Leu Pro His Glu Ala Arg Leu Ile Ala Glu Asp Leu
595 600 605
Lys Gln Ile Asn Pro Cys Asn Phe Val Ala Ala Ser Leu Arg Asp Met
610 615 620
Gln Leu Thr Leu Leu Leu Ser Thr Ser Val Lys Asn Val Ser Lys Ile
625 630 635 640
Ser Ser Asn Ile Pro Lys Asp Gln Leu Tyr Ile Asn Arg Asn Glu Leu
645 650 655
Phe Asn Thr Asn Leu Ile Ile Thr Asn Leu Ile Leu Asp Val Asp Phe
660 665 670
His Ile Arg Lys Pro Ile Pro Leu Gly Ile Leu His Ala Gly Met Arg
675 680 685
Ala Phe Arg His Gly Ile Leu Thr Ala Met Gln Leu Leu Phe Pro Lys
690 695 700
Ala Val Val Asn Pro Asn Lys Asp Pro Cys Tyr Phe Tyr Lys Thr Ala
705 710 715 720
Cys Pro Glu Pro Thr Val Glu Val Leu Asp Asp Asp Asn Leu Leu Asp
725 730 735
Ile Thr Ser His Ser Asp Ile Asp Phe Tyr Ile Glu Asn Gly Glu Leu
740 745 750
Tyr Thr Cys Val Glu Glu Asn Tyr Thr Glu Asp Val Trp Phe Phe Asp
755 760 765
Thr Gln Thr Thr Ser Glu Val His Thr His Ala Asp Val Ser Asn Asn
770 775 780
Glu Asn Leu His Glu Thr Leu Pro Cys Asn Cys Lys Glu Lys Ile Gly
785 790 795 800
Phe Arg Val Cys Val Pro Ile Pro Asn Pro Tyr Ala Leu Val Gly Ser
805 810 815
Ser Thr Leu Lys Gly Phe Ala Gln Ile Leu Gln Gln Ala Val Leu Leu
820 825 830
Glu Arg Glu Phe Val Glu Tyr Ile Gly Pro Tyr Leu Arg Asp Phe Ser
835 840 845
Phe Ile Asp Thr Gly Val Tyr Ser His Gly His Ser Leu Arg Leu Pro
850 855 860
Phe Phe Ser Lys Val Thr Thr Thr Gly Thr Ala Val Gly Gln Leu Leu
865 870 875 880
Pro Phe Tyr Val Val Pro Glu Gln Cys Ile Asp Ile Leu Ala Phe Val
885 890 895
Thr Ser His Arg Asn Pro Ala Asn Phe His Phe His Ser Arg Pro Gln
900 905 910
Ser Asn Val Pro Val Gln Phe Ile Leu His Asn Leu Gly Gly Glu Tyr
915 920 925
Ala Glu Phe Phe Glu Arg Lys Val Ala Arg Asn Lys Gln Ile Phe Ser
930 935 940
Ser Pro Gln Ile Ser Leu Thr Lys Ala Leu Lys Glu Arg Gly Val Thr
945 950 955 960
Cys Leu Asp Ala Phe Thr Leu Glu Ala Phe Val Asp Ser Thr Ile Leu
965 970 975
Glu Ser Ile Val Glu His Ile Ala Val His Phe Pro Gly Arg Asp Arg
980 985 990
Glu Tyr Thr Leu Thr Ser Ser Lys Cys Ile Ala Ile Lys Arg Asp Trp
995 1000 1005
Val Leu Phe Gln Leu Ile Cys Gly Thr Lys Gly Phe Thr Cys Leu
1010 1015 1020
Arg Tyr Pro His Arg Gly Gly Arg Thr Ala Pro Arg Thr Phe Val
1025 1030 1035
Set Leu Arg Val Asp His His Asn Arg Leu Cys Ile Ser Leu Ala
1040 1045 1050
Gln Gln Cys Phe Ala Thr Lys Cys Asp Set Asn Arg Met His Thr
1055 1060 1065
Ile Phe Thr Leu Glu Val Pro Asn Tyr Pro Asn Leu Thr Ser Ser
1070 1075 1080
<210>76
<211>340
<212>PRT
<213>水痘带状疱疹
<400>76
Met Gln Ala Leu Gly Ile Lys Thr Glu His Phe Ile Ile Met Cys Leu
1 5 10 15
Leu Ser Gly His Ala Val Phe Thr Leu Trp Tyr Thr Ala Arg Val Lys
20 25 30
Phe Glu His Glu Cys Val Tyr Ala Thr Thr Val Ile Asn Gly Gly Pro
35 40 45
Val Val Trp Gly Ser Tyr Asn Asn Ser Leu Ile Tyr Val Thr Phe Val
50 55 60
Asn His Ser Thr Phe Leu Asp Gly Leu Ser Gly Tyr Asp Tyr Ser Cys
65 70 75 80
Arg Glu Asn Leu Leu Ser Gly Asp Thr Met Val Lys Thr Ala Ile Ser
85 90 95
Thr Pro Leu His Asp Lys Ile Arg Ile Val Leu Gly Thr Arg Asn Cys
100 105 110
His Ala Tyr Phe Trp Cys Val Gln Leu Lys Met Ile Phe Phe Ala Trp
115 120 125
Phe Val Tyr Gly Met Tyr Leu Gln Phe Arg Arg Ile Arg Arg Met Phe
130 135 140
Gly Pro Phe Arg Ser Ser Cys Glu Leu Ile Ser Pro Thr Ser Tyr Ser
145 150 155 160
Leu Asn Tyr Val Thr Arg Val Ile Ser Asn Ile Leu Leu Gly Tyr Pro
165 170 175
Tyr Thr Lys Leu Ala Arg Leu Leu Cys Asp Val Ser Met Arg Arg Asp
180 185 190
Gly Met Ser Lys Val Phe Asn Ala Asp Pro Ile Ser Phe Leu Tyr Met
195 200 205
His Lys Gly Val Thr Leu Leu Met Leu Leu Glu Val Ile Ala His Ile
210 215 220
Ser Ser Gly Cys Ile Val Leu Leu Thr Leu Gly Val Ala Tyr Thr Pro
225 230 235 240
Cys Ala Leu Leu Tyr Pro Thr Tyr Ile Arg Ile Leu Ala Trp Val Val
245 250 255
Val Cys Thr Leu Ala Ile Val Glu Leu Ile Ser Tyr Val Arg Pro Lys
260 265 270
Pro Thr Lys Asp Asn His Leu Asn His Ile Asn Thr Gly Gly Ile Arg
275 280 285
Gly Ile Cys Thr Thr Cys Cys Ala Thr Val Met Ser Gly Leu Ala Ile
290 295 300
Lys Cys Phe Tyr Ile Val Ile Phe Ala Ile Ala Val Val Ile Phe Met
305 310 315 320
His Tyr Glu Gln Arg Val Gln Val Ser Leu Phe Gly Glu Ser Glu Asn
325 330 335
Ser Gln Lys His
340
<210>77
<211>452
<212>PRT
<213>水痘带状疱疹
<400>77
Met Ala Ser Ala Ser Ile Pro Thr Asp Pro Asp Val Ser Thr Ile Cys
1 5 10 15
Glu Asp Phe Met Asn Leu Leu Pro Asp Glu Pro Ser Asp Asp Phe Ala
20 25 30
Leu Glu Val Thr Asp Trp Ala Asn Asp Glu Ala Ile Gly Ser Thr Pro
35 40 45
Gly Glu Asp Ser Thr Thr Ser Arg Thr Val Tyr Val Glu Arg Thr Ala
50 55 60
Asp Thr Ala Tyr Asn Pro Arg Tyr Ser Lys Arg Arg His Gly Arg Arg
65 70 75 80
Glu Ser Tyr His His Asn Arg Pro Lys Thr Leu Val Val Val Leu Pro
85 90 95
Asp Ser Asn His His Gly Gly Arg Asp Val Glu Thr Gly Tyr Ala Arg
100 105 110
Ile Glu Arg Gly His Arg Arg Ser Ser Arg Ser Tyr Asn Thr Gln Ser
115 120 125
Set Arg Lys His Arg Asp Arg Ser Leu Ser Asn Arg Arg Arg Arg Pro
130 135 140
Thr Thr Pro Pro Ala Met Thr Thr Gly Glu Arg Asn Asp Gln Thr His
145 150 155 160
Asp Glu Ser Tyr Arg Leu Arg Phe Ser Lys Arg Asp Ala Arg Arg Glu
165 170 175
Arg Ile Arg Lys Glu Tyr Asp Ile Pro Val Asp Arg Ile Thr Gly Arg
180 185 190
Ala Ile Glu Val Val Ser Thr Ala Gly Ala Ser Val Thr Ile Asp Ser
195 200 205
Val Arg His Leu Asp Glu Thr Ile Glu Lys Leu Val Val Arg Tyr Ala
210 215 220
Thr Ile Gln Glu Gly Asp Ser Trp Ala Ser Gly Gly Cys Phe Pro Gly
225 230 235 240
Ile Lys Gln Asn Thr Ser Trp Pro Glu Leu Met Leu Tyr Gly His Glu
245 250 255
Leu Tyr Arg Thr Phe Glu Ser Tyr Lys Met Asp Ser Arg Ile Ala Arg
260 265 270
Ala Leu Arg Glu Arg Val Ile Arg Gly Glu Ser Leu Ile Glu Ala Leu
275 280 285
Glu Ser Ala Asp Glu Leu Leu Thr Trp Ile Lys Met Leu Ala Ala Lys
290 295 300
Asn Leu Pro Ile Tyr Thr Asn Asn Pro Ile Val Ala Thr Ser Lys Ser
305 310 315 320
Leu Leu Glu Asn Leu Lys Leu Lys Leu Gly Pro Phe Val Arg Cys Leu
325 330 335
Leu Leu Asn Arg Asp Asn Asp Leu Gly Ser Arg Thr Leu Pro Glu Leu
340 345 350
Leu Arg Gln Gln Arg Phe Ser Asp Ile Thr Cys Ile Thr Thr Tyr Met
355 360 365
Phe Val Met Ile Ala Arg Ile Ala Asn Ile Val Val Arg Gly Ser Lys
370 375 380
Phe Val Glu Tyr Asp Asp Ile Ser Cys Asn Val Gln Val Leu Gln Glu
385 390 395 400
Tyr Thr Pro Gly Ser Cys Leu Ala Gly Val Leu Glu Ala Leu Ile Thr
405 410 415
His Gln Arg Glu Cys Gly Arg Val Glu Cys Thr Leu Ser Thr Trp Ala
420 425 430
Gly His Leu Ser Asp Ala Arg Pro Tyr Gly Lys Tyr Phe Lys Cys Ser
435 440 445
Thr Phe Asn Cys
450
<210>78
<211>179
<212>PRT
<213>水痘带状疱疹
<400>78
Met Asp Thr Thr Gly Ala Ser Glu Ser Ser Gln Pro Ile Arg Val Asn
1 5 10 15
Leu Lys Pro Asp Pro Leu Ala Ser Phe Thr Gln Val Ile Pro Pro Leu
20 25 30
Ala Leu Glu Thr Thr Trp Thr Cys Pro Ala Asn Ser His Ala Pro Thr
35 40 45
Pro Ser Pro Leu Tyr Gly Val Lys Arg Leu Cys Ala Leu Arg Ala Thr
50 55 60
Cys Gly Arg Ala Asp Asp Leu His Ala Phe Leu Ile Gly Leu Gly Arg
65 70 75 80
Arg Asp Lys Pro Ser Glu Ser Pro Met Tyr Val Asp Leu Gln Pro Phe
85 90 95
Cys Ser Leu Leu Asn Ser Gln Arg Leu Leu Pro Glu Met Ala Asn Tyr
100 105 ll0
Asn Thr Leu Cys Asp Ala Pro Phe Ser Ala Ala Thr Gln Gln Met Met
115 120 125
Leu Glu Ser Gly Gln Leu Gly Val His Leu Ala Ala Ile Gly Tyr His
130 135 140
Cys His Cys Lys Ser Pro Phe Ser Ala Glu Cys Trp Thr Gly Ala Ser
145 150 155 160
Glu Ala Tyr Asp His Val Val Cys Gly Gly Lys Ala Arg Ala Ala Val
165 170 175
Gly Gly Leu
<210>79
<211>108
<212>PRT
<213>水痘带状疱疹
<400>79
Met Ser Arg Val Ser Glu Tyr Gly Val Pro Glu Gly Val Arg Glu Ser
1 5 10 15
Asp Ser Asp Thr Asp Ser Val Phe Met Tyr Gln His Thr Glu Leu Met
20 25 30
Gln Asn Asn Ala Ser Pro Leu Val Val Gln Thr Arg Pro Pro Ala Val
35 40 45
Leu Ile Pro Leu Val Asp Val Pro Arg Pro Arg Ser Arg Arg Lys Ala
50 55 60
Ser Ala Gln Leu Lys Met Gln Met Asp Arg Leu Cys Asn Val Leu Gly
65 70 75 80
Val Val Leu Gln Met Ala Thr Leu Ala Leu Val Thr Tyr Ile Ala Phe
85 90 95
Val Val His Thr Arg Ala Thr Ser Cys Lys Arg Glu
100 105
<210>80
<211>5743
<212>DNA
<213>水痘带状疱疹
<220>
<221>CDS
<222>(1)..(1056)
<220>
<221>CDS
<222>(4556)..(5740)
<400>80
atg tca ttg ata atg ttt ggt cgt acg ctt ggt gaa gaa tct gta aga 48
Met Ser Leu Ile Met Phe Gly Arg Thr Leu Gly Glu Glu Ser Val Arg
1 5 10 15
tat ttt gaa cgt cta aag cgt cgt agg gat gaa cgc ttt ggg acg ttg 96
Tyr Phe Glu Arg Leu Lys Arg Arg Arg Asp Glu Arg Phe Gly Thr Leu
20 25 30
gag tcc cct acc ccg tgt tcc acg cgg caa ggg tct ctg gga aac gca 144
Glu Ser Pro Thr Pro Cys Ser Thr Arg Gln Gly Ser Leu Gly Asn Ala
35 40 45
acc caa atc ccg ttt ctg aat ttt gct ata gat gta acc cga cgt cat 192
Thr Gln Ile Pro Phe Leu Asn Phe Ala Ile Asp Val Thr Arg Arg His
50 55 60
cag gcc gtt att ccc gga att gga acg ctt cac aac tgt tgt gaa tat 240
Gln Ala Val Ile Pro Gly Ile Gly Thr Leu His Asn Cys Cys Glu Tyr
65 70 75 80
att cca ctg ttc tcg gct act gct cga cgg gca atg ttt ggc gcg ttt 288
Ile Pro Leu Phe Ser Ala Thr Ala Arg Arg Ala Met Phe Gly Ala Phe
85 90 95
cta tcg tca aca ggg tac aac tgt acc ccc aat gta gtt ttg aaa cca 336
Leu Ser Ser Thr Gly Tyr Asn Cys Thr Pro Asn Val Val Leu Lys Pro
100 105 110
tgg cga tat tcg gta aat gca aac gta agc cct gaa tta aaa aag gct 384
Trp Arg Tyr Ser Val Asn Ala Asn Val Ser Pro Glu Leu Lys Lys Ala
115 120 125
gtc agt agt gta cag ttt tat gaa tat tca ccg gaa gaa gca gca cct 432
Val Ser Ser Val Gln Phe Tyr Glu Tyr Ser Pro Glu Glu Ala Ala Pro
130 135 140
cat cga aat gcg tat agc ggt gtt atg aac aca ttt cgc gcg ttt tct 480
His Arg Asn Ala Tyr Ser Gly Val Met Asn Thr Phe Arg Ala Phe Ser
145 150 155 160
ctg tcg gat agt ttc tgt cag ttg tct acc ttt aca caa cgg ttt tcg 528
Leu Ser Asp Ser Phe Cys Gln Leu Ser Thr Phe Thr Gln Arg Phe Ser
165 170 175
tac ctt gtg gaa aca tct ttt gag agt att gaa gag tgt gga agt cat 576
Tyr Leu Val Glu Thr Ser Phe Glu Ser Ile Glu Glu Cys Gly Ser His
180 185 190
ggc aaa cgc gca aag gtt gac gtt cca atc tat ggc aga tat aag ggg 624
Gly Lys Arg Ala Lys Val Asp Val Pro Ile Tyr Gly Arg Tyr Lys Gly
195 200 205
acg ttg gaa ctg ttt caa aaa atg atc ctc atg cac acc acg cat ttt 672
Thr Leu Glu Leu Phe Gln Lys Met Ile Leu Met His Thr Thr His Phe
210 215 220
att tca tcg gtg cta ttg ggc gat cat gcc gac aga gtt gac tgc ttt 720
Ile Ser Ser Val Leu Leu Gly Asp His Ala Asp Arg Val Asp Cys Phe
225 230 235 240
ctg cgt aca gtg ttt aac acg cca agt gtt tct gac agt gtt tta gaa 768
Leu Arg Thr Val Phe Asn Thr Pro Ser Val Ser Asp Ser Val Leu Glu
245 250 255
cac ttc aaa caa aaa tca act gtg ttt ttg gta cca cgt aga cat ggg 816
His Phe Lys Gln Lys Ser Thr Val Phe Leu Val Pro Arg Arg His Gly
260 265 270
aaa aca tgg ttt ctt gta cca tta ata gct tta gta atg gcc acg ttt 864
Lys Thr Trp Phe Leu Val Pro Leu Ile Ala Leu Val Met Ala Thr Phe
275 280 285
aga gga att aaa gtg ggt tat acg gct cat ata cgc aaa gca acg gaa 912
Arg Gly Ile Lys Val Gly Tyr Thr Ala His Ile Arg Lys Ala Thr Glu
290 295 300
ccc gtg ttt gag ggt atc aag tct cgc ctg gaa cag tgg ttt ggg gca 960
Pro Val Phe Glu Gly Ile Lys Ser Arg Leu Glu Gln Trp Phe Gly Ala
305 310 315 320
aat tac gtg gat cat gta aaa ggc gaa tct att acg ttt tca ttt acc 1008
Asn Tyr Val Asp His Val Lys Gly Glu Ser Ile Thr Phe Ser Phe Thr
325 330 335
gac ggg tct tac agc aca gcg gtg ttc gcg tca agt cac aac aca aac 1056
Asp Gly Ser Tyr Ser Thr Ala Val Phe Ala Ser Ser His Asn Thr Asn
340 345 350
gtgagtgttt tataaattta acctttaata tattactgta aatgttgaca tatacctttc 1116
cacaacggcg gttgagttaa ggtatactag gtggttgtag gttccggttc acccgataat 1176
ctttgtgtct cggggaagca aattcgctga agcagaccac agccgttaat aatagcccgg 1236
cttaatgttt ctccaaacat ataaagctgc cacccagatg aatttactgg tacagagaga 1296
ccactggcgt tggttcccgc tataacgtcg ccaagatttg cggtaatgcg aggattttta 1356
gtactcgtaa ttcgaatgca ggtggtgaca tctacaaaaa gaacctgcgt ggcgccaatg 1416
tctacctcca cttttaattc ccgctgaccg gcctttctcc acatacacgg agcccaacac 1476
acacaacctt ccgcatgatt tgtgacatgg ggtaacgcat acagtgcccc cacgtgaact 1536
ctatgattac attcatcaca tccgtccgca tggctgagga gtcgatttaa tacagagcca 1596
agtatccgag catgccatcc ggcgggacat agccctatta aattaggttc catagccagt 1656
acatataaac gccttcgttc gtctgaccac cacactcccg gagaaataac tttacatgcg 1716
tatggatttt tcggaagccg cgggggttgt aagtagttgc ttaagtttgg cgttggtgta 1776
agatctgcgg gggtgggatc tgctcgagga tccggaatag atgttggaag ggggtacgcg 1836
atcgggttct taaacgttgc tccaaaaaca tggtctatgt tttcaaccgg ataaattctt 1896
aaagtcgccg tcattgcgta cgagacctcg taattaaaat ttacaattac atgaaaagtc 1956
ttcggaggta agttcatctg acgtgggcgc gtgatgtaaa ttgtggctac aacaacggca 2016
atattagtag tatccgtttg aagggggata aacggagcga tccttaaagt tataaaagca 2076
gttgatcgca ttattttcac ccggggatcg gtcaggatgg acttccataa tcccatatcc 2136
agcgttaatg catcgcagag tctctgaact gcctcggggg ttaatttgcg cgctgcaccc 2196
gtagcggtgt acagcggaaa tatgcgttgt aattccatga gcagaaaaac agtctagcgg 2256
attgccgccc gggtacttgt gggtttaatg ccacccaccc ggttatttta tattttaaga 2316
gggggtggaa acgggagaaa tgacgtaaaa ttacatatga agagattctg gtgttatgtt 2376
tttatagtga cactaattta tttatggggg ttgggaatag agaagcagaa tctgtctaga 2436
ataggtccga ttaacgatgc aggtagtgct gcctgtaggg tatcggtaat acaaaaacat 2496
gccgcaaatc cccccggtaa aactaaaatg gattgtaatt gctggttaaa tcctagacaa 2556
atgtacgcgt aacattgacc gggtaaatac ttagaacaaa ttccaatatc aacaatatcc 2616
gcgctgcgta taaatttacc cctcagttgt gtggaattac caataccaac cttttctaag 2676
gctacgggaa cggggacctt ggaaagctta agtatttccc ctcctgaatt ataatagtca 2736
aataaatata tagaacgatt acctaaccag catgggaagg aagcgtgaag gtagggtata 2796
taccccccac cttgtggtcg tgtatataca gatgacagat acgccaaaac cgcatacatc 2856
aaggagctgt tataaaacgc atccattgac atttccgtta acaccgaaac tatagtctga 2916
atcaggtctg gtgtgcgggc tacaatttca tcaataaccg tttgggaaga atctgcaata 2976
tcatattcca tgagttgttg tagagtcggg ttcgtttgta actccgttat aagaccttgg 3036
gttagcgatg tcacacacgc ttgtttaaat acctggttta aaaacatttc ggcgcctggt 3096
ttaaaggcgg gtaagggggt ttgattattt aggacgttag ccaaaaacgg taaacgcgcg 3156
actagctctt ggcgagctgt cacatgtagg ctttggggat tgtcaacccg ggcatttata 3216
cacgcagcat caataatagc ctgtgcagag tgatataaaa ttggacttcc ggtaatacgt 3276
cctccccagg cagaggatcc gttgtaagat actacaatca acggactggg ggattctgcg 3336
taatgtcgcg gtacaattga taggggacgc cgtttccaga aatctgctgg agtgtccccg 3396
ctaactaatt gggcataaca gatgtcgaac cattccataa gactttgggg ttctgtcgaa 3456
gctggggtaa acaatagaac gtcttgtaaa ggtgggatgc tggcggacga attgtttttc 3516
tttcccgtaa atcgcccttg tccaggcggc tcaaggacgc catcaaagga accgttattg 3576
atcggatctg tgttggaagt ttgcgctccg tggccctttg cactttgaag caacccagat 3636
gcaacgcggg aactagaagg tcggacgggg tgcctggagt taacaatgtt tacggcccgt 3696
tttattagct caaggacgtc ccgattattt tcctgtatgc gtgtttcagc aggggagtca 3756
tcaatacctc cagaagttaa ctgtcgatca agatcgatta tggatgaaac gggtccaata 3816
ttgtccccat ttgacgtgtg tgattcaccc atggctgcca ccatatgctc tgcgtatatt 3876
tttatagacg atgcaagacg aggggtgcat cggatatacg caatcagctg tttgcataat 3936
aaaagtaccc gttgtccatc agcaaaataa cgcgttccgt ttgggattag ttctgcatac 3996
ataatacaaa tatcacggtg cttgcggttt ccagtattta ttcgtatcgc tacaacgtta 4056
aatgcatcaa agaataaacc ggggctaaga taaacaggca atgataaaat caatccccct 4116
gaattatgcg tggccgaaaa aacgtgtgaa acaaatggtt ccgtttttgg tattaagaga 4176
tttgttaagg cgttatcggg aatgtacgcg gcgaaaactt gacaccacgg ttcgcattga 4236
cctgtagcat gatatcttgt ttgtacttca accttgaagc gttgtccggg tttctttaaa 4296
atcagtaatg cgggatctat tccggccgca ataagccccg cgttaggtat cacaacgtgt 4356
agtaatcctt ttgtgtgatc attatgccaa agtgcatgtt tggtttcatt tgccaaatgg 4416
gcttccatta tacaccggat atggttgtac tggaaaaaaa aaagaaatat gtacgtattc 4476
aaacattttt tacgtacgtg gtatttaagg atacatttaa actttggtgg ggtaactata 4536
tatctttcta tcgttccag ggt atc cga ggt caa gat ttt aat ctt ctg ttt 4588
Gly Ile Arg Gly Gln Asp Phe Asn Leu Leu Phe
355 360
gtg gat gaa gct aat ttt att cga cct gat gct gta caa act ata gtc 4636
Val Asp Glu Ala Asn Phe Ile Arg Pro Asp Ala Val Gln Thr Ile Val
365 370 375
gga ttt tta aat caa acc aat tgt aaa att att ttt gtt tca tca aca 4684
Gly Phe Leu Asn Gln Thr Asn Cys Lys Ile Ile Phe Val Ser Ser Thr
380 385 390 395
aat acc gga aaa gca agt aca agt ttt ttg tat aac tta cgt gga tcg 4732
Asn Thr Gly Lys Ala Ser Thr Ser Phe Leu Tyr Asn Leu Arg Gly Ser
400 405 410
tcg gat cag ttg tta aac gtt gtt aca tat gta tgc gac gat cac atg 4780
Ser Asp Gln Leu Leu Asn Val Val Thr Tyr Val Cys Asp Asp His Met
415 420 425
ccg cgt gtt tta gca cat agc gat gtc aca gct tgt tcg tgt tat gta 4828
Pro Arg Val Leu Ala His Ser Asp Val Thr Ala Cys Ser Cys Tyr Val
430 435 440
tta aat aag ccg gtt ttc atc aca atg gat gga gcc atg cgg cgc act 4876
Leu Asn Lys Pro Val Phe Ile Thr Met Asp Gly Ala Met Arg Arg Thr
445 450 455
gca gat tta ttt atg gcc gac tcc ttc gtg cag gaa att gta ggt ggg 4924
Ala Asp Leu Phe Met Ala Asp Ser Phe Val Gln Glu Ile Val Gly Gly
460 465 470 475
cgt aaa cag aat tct ggg ggt gtg ggg ttt gac cgg cca tta ttt aca 4972
Arg Lys Gln Asn Ser Gly Gly Val Gly Phe Asp Arg Pro Leu Phe Thr
480 485 490
aaa act gcc cgt gag agg ttt att tta tat cgg ccg tca acc gtt gcg 5020
Lys Thr Ala Arg Glu Arg Phe Ile Leu Tyr Arg Pro Ser Thr Val Ala
495 500 505
aat tgt gct ata tta tcg tca gtg ttg tac gtt tac gta gac cct gca 5068
Asn Cys Ala Ile Leu Ser Ser Val Leu Tyr Val Tyr Val Asp Pro Ala
510 515 520
ttt acc tca aat aca cga gcg tct ggt act ggt gta gcg att gtt ggt 5116
Phe Thr Ser Asn Thr Arg Ala Ser Gly Thr Gly Val Ala Ile Val Gly
525 530 535
cgt tat aag tcg gat tgg att ata ttt gga ttg gag cac ttt ttt ctt 5164
Arg Tyr Lys Ser Asp Trp Ile Ile Phe Gly Leu Glu His Phe Phe Leu
540 545 550 555
aga gct tta act ggc acg tct tcc agt gag ata ggg cgt tgc gtt act 5212
Arg Ala Leu Thr Gly Thr Ser Ser Ser Glu Ile Gly Arg Cys Val Thr
560 565 570
caa tgc tta ggc cac ata ctc gct tta cac ccc aat aca ttt aca aac 5260
Gln Cys Leu Gly His Ile Leu Ala Leu His Pro Asn Thr Phe Thr Asn
575 580 585
gta cac gtt tct ata gag gga aac agc agc cag gat tct gca gtt gcc 5308
Val His Val Ser Ile Glu Gly Asn Ser Ser Gln Asp Ser Ala Val Ala
590 595 600
ata tcg ttg gct ata gca caa cag ttt gct gtc ctc gaa aag gga aac 5356
Ile Ser Leu Ala Ile Ala Gln Gln Phe Ala Val Leu Glu Lys Gly Asn
605 610 615
gtg cta tct tcc gct cca gtg tta ctg ttt tat cat tcc ata cct ccc 5404
Val Leu Ser Ser Ala Pro Val Leu Leu Phe Tyr His Ser Ile Pro Pro
620 625 630 635
gga tgt agc gtg gcg tac cct ttt ttt tta tta caa aaa caa aaa acg 5452
Gly Cys Ser Val Ala Tyr Pro Phe Phe Leu Leu Gln Lys Gln Lys Thr
640 645 650
ccg gcc gta gac tat ttt gtt aaa cga ttt aac tcc gga aat ata ata 5500
Pro Ala Val Asp Tyr Phe Val Lys Arg Phe Asn Ser Gly Asn Ile Ile
655 660 665
gcc tca cag gag ctt gta tcc cta aca gta aag tta ggt gta gac ccc 5548
Ala Ser Gln Glu Leu Val Ser Leu Thr Val Lys Leu Gly Val Asp Pro
670 675 680
gtg gag tat cta tgt aaa cag ttg gat aac ctg aca gag gta att aaa 5596
Val Glu Tyr Leu Cys Lys Gln Leu Asp Asn Leu Thr Glu Val Ile Lys
685 690 695
ggc ggt atg ggt aat cta gac aca aaa act tac acg ggt aaa ggt acc 5644
Gly Gly Met Gly Asn Leu Asp Thr Lys Thr Tyr Thr Gly Lys Gly Thr
700 705 710 715
acg gga aca atg tca gat gat ctg atg gtt gca tta att atg tcc gtg 5692
Thr Gly Thr Met Ser Asp Asp Leu Met Val Ala Leu Ile Met Ser Val
720 725 730
tat att ggc agt tca tgt ata ccg gat tcc gtg ttt atg cct att aaa 5740
Tyr Ile Gly Ser Ser Cys Ile Pro Asp Ser Val Phe Met Pro Ile Lys
735 740 745
taa 5743
<210>81
<211>747
<212>PRT
<213>水痘带状疱疹
<400>81
Met Ser Leu Ile Met Phe Gly Arg Thr Leu Gly Glu Glu Ser Val Arg
1 5 10 15
Tyr Phe Glu Arg Leu Lys Arg Arg Arg Asp Glu Arg Phe Gly Thr Leu
20 25 30
Glu Ser Pro Thr Pro Cys Ser Thr Arg Gln Gly Ser Leu Gly Asn Ala
35 40 45
Thr Gln Ile Pro Phe Leu Asn Phe Ala Ile Asp Val Thr Arg Arg His
50 55 60
Gln Ala Val Ile Pro Gly Ile Gly Thr Leu His Asn Cys Cys Glu Tyr
65 70 75 80
Ile Pro Leu Phe Ser Ala Thr Ala Arg Arg Ala Met Phe Gly Ala Phe
85 90 95
Leu Ser Ser Thr Gly Tyr Asn Cys Thr Pro Asn Val Val Leu Lys Pro
100 105 110
Trp Arg Tyr Ser Val Asn Ala Asn Val Ser Pro Glu Leu Lys Lys Ala
115 120 125
Val Ser Ser Val Gln Phe Tyr Glu Tyr Ser Pro Glu Glu Ala Ala Pro
130 135 140
His Arg Asn Ala Tyr Ser Gly Val Met Asn Thr Phe Arg Ala Phe Ser
145 150 155 160
Leu Ser Asp Ser Phe Cys Gln Leu Ser Thr Phe Thr Gln Arg Phe Ser
165 170 175
Tyr Leu Val Glu Thr Ser Phe Glu Ser Ile Glu Glu Cys Gly Ser His
180 185 190
Gly Lys Arg Ala Lys Val Asp Val Pro Ile Tyr Gly Arg Tyr Lys Gly
195 200 205
Thr Leu Glu Leu Phe Gln Lys Met Ile Leu Met His Thr Thr His Phe
210 215 220
Ile Ser Ser Val Leu Leu Gly Asp His Ala Asp Arg Val Asp Cys Phe
225 230 235 240
Leu Arg Thr Val Phe Asn Thr Pro Ser Val Ser Asp Ser Val Leu Glu
245 250 255
His Phe Lys Gln Lys Ser Thr Val Phe Leu Val Pro Arg Arg His Gly
260 265 270
Lys Thr Trp Phe Leu Val Pro Leu Ile Ala Leu Val Met Ala Thr Phe
275 280 285
Arg Gly Ile Lys Val Gly Tyr Thr Ala His Ile Arg Lys Ala Thr Glu
290 295 300
Pro Val Phe Glu Gly Ile Lys Ser Arg Leu Glu Gln Trp Phe Gly Ala
305 310 315 320
Asn Tyr Val Asp His Val Lys Gly Glu Ser Ile Thr Phe Ser Phe Thr
325 330 335
Asp Gly Ser Tyr Ser Thr Ala Val Phe Ala Ser Ser His Asn Thr Asn
340 345 350
Gly Ile Arg Gly Gln Asp Phe Asn Leu Leu Phe Val Asp Glu Ala Asn
355 360 365
Phe Ile Arg Pro Asp Ala Val Gln Thr Ile Val Gly Phe Leu Asn Gln
370 375 380
Thr Asn Cys Lys Ile Ile Phe Val Ser Ser Thr Asn Thr Gly Lys Ala
385 390 395 400
Ser Thr Ser Phe Leu Tyr Asn Leu Arg Gly Ser Ser Asp Gln Leu Leu
405 410 415
Asn Val Val Thr Tyr Val Cys Asp Asp His Met Pro Arg Val Leu Ala
420 425 430
His Ser Asp Val Thr Ala Cys Ser Cys Tyr Val Leu Asn Lys Pro Val
435 440 445
Phe Ile Thr Met Asp Gly Ala Met Arg Arg Thr Ala Asp Leu Phe Met
450 455 460
Ala Asp Ser Phe Val Gln Glu Ile Val Gly Gly Arg Lys Gln Asn Ser
465 470 475 480
Gly Gly Val Gly Phe Asp Arg Pro Leu Phe Thr Lys Thr Ala Arg Glu
485 490 495
Arg Phe Ile Leu Tyr Arg Pro Ser Thr Val Ala Asn Cys Ala Ile Leu
500 505 510
Ser Ser Val Leu Tyr Val Tyr Val Asp Pro Ala Phe Thr Ser Asn Thr
515 520 525
Arg Ala Ser Gly Thr Gly Val Ala Ile Val Gly Arg Tyr Lys Ser Asp
530 535 540
Trp Ile Ile Phe Gly Leu Glu His Phe Phe Leu Arg Ala Leu Thr Gly
545 550 555 560
Thr Ser Ser Ser Glu Ile Gly Arg Cys Val Thr Gln Cys Leu Gly His
565 570 575
Ile Leu Ala Leu His Pro Asn Thr Phe Thr Asn Val His Val Ser Ile
580 585 590
Glu Gly Asn Ser Ser Gln Asp Ser Ala Val Ala Ile Ser Leu Ala Ile
595 600 605
Ala Gln Gln Phe Ala Val Leu Glu Lys Gly Asn Val Leu Ser Ser Ala
610 615 620
Pro Val Leu Leu Phe Tyr His Ser Ile Pro Pro Gly Cys Ser Val Ala
625 630 635 640
Tyr Pro Phe Phe Leu Leu Gln Lys Gln Lys Thr Pro Ala Val Asp Tyr
645 650 655
Phe Val Lys Arg Phe Asn Ser Gly Asn Ile Ile Ala Ser Gln Glu Leu
660 665 670
Val Ser Leu Thr Val Lys Leu Gly Val Asp Pro Val Glu Tyr Leu Cys
675 680 685
Lys Gln Leu Asp Asn Leu Thr Glu Val Ile Lys Gly Gly Met Gly Asn
690 695 700
Leu Asp Thr Lys Thr Tyr Thr Gly Lys Gly Thr Thr Gly Thr Met Ser
705 710 715 720
Asp Asp Leu Met Val Ala Leu Ile Met Ser Val Tyr Ile Gly Ser Ser
725 730 735
Cys Ile Pro Asp Ser Val Phe Met Pro Ile Lys
740 745
<210>82
<211>1308
<212>DNA
<213>水痘带状疱疹
<220>
<221>CDS
<222>(1)..(1308)
<400>82
atg gga act caa aag aag ggg ccg cgt tct gaa aaa gtc tcg ccg tac 48
Met Gly Thr Gln Lys Lys Gly Pro Arg Ser Glu Lys Val Ser Pro Tyr
1 5 10 15
gac acc acg aca ccc gag gtg gaa gcg tta gat cat caa atg gat acg 96
Asp Thr Thr Thr Pro Glu Val Glu Ala Leu Asp His Gln Met Asp Thr
20 25 30
ctt aat tgg cga att tgg ata att cag gtg atg atg ttc act ttg ggt 144
Leu Asn Trp Arg Ile Trp Ile Ile Gln Val Met Met Phe Thr Leu Gly
35 40 45
gcg gta atg ctc ctg gct acg tta att gcc gcc tct tct gaa tat acc 192
Ala Val Met Leu Leu Ala Thr Leu Ile Ala Ala Ser Ser Glu Tyr Thr
50 55 60
ggg atc cct tgt ttt tat gct gcc gta gtt gat tat gag tta ttt aac 240
Gly Ile Pro Cys Phe Tyr Ala Ala Val Val Asp Tyr Glu Leu Phe Asn
65 70 75 80
gcc acc cta gat ggg ggg gta tgg tcc gga aat aga ggt gga tac agc 288
Ala Thr Leu Asp Gly Gly Val Trp Ser Gly Asn Arg Gly Gly Tyr Ser
85 90 95
gcc ccg gtt ttg ttt ttg gaa cca cat agc gtt gtg gca ttt act tac 336
Ala Pro Val Leu Phe Leu Glu Pro His Ser Val Val Ala Phe Thr Tyr
100 105c110
tac acg gct tta acg gca atg gcc atg gcg gta tat aca ctg atc acg 384
Tyr Thr Ala Leu Thr Ala Met Ala Met Ala Val Tyr Thr Leu Ile Thr
115 120 125
gcc gcg att ata cac cga gaa acg aaa aat caa cgt gtc cgg caa agc 432
Ala Ala Ile Ile His Arg Glu Thr Lys Asn Gln Arg Val Arg Gln Ser
130 135 140
tcc ggt gtt gca tgg tta gtt gta gat ccc aca aca ctt ttt tgg ggt 480
Ser Gly Val Ala Trp Leu Val Val Asp Pro Thr Thr Leu Phe Trp Gly
145 150 155 160
ctt ttg tca ttg tgg tta tta aac gcc gtt gtg tta tta tta gct tac 528
Leu Leu Ser Leu Trp Leu Leu Asn Ala Val Val Leu Leu Leu Ala Tyr
165 170 175
aag caa atc ggc gtg gct gct aca tta tat ctt gga cat ttt gcg aca 576
Lys Gln Ile Gly Val Ala Ala Thr Leu Tyr Leu Gly His Phe Ala Thr
180 185 190
agt gta ata ttt aca acg tat ttt tgt gga cgc gga aaa ttg gac gaa 624
Ser Val Ile Phe Thr Thr Tyr Phe Cys Gly Arg Gly Lys Leu Asp Glu
195 200 205
acg aac ata aaa gcg gtc gca aat tta cga cag cag agc gtc ttt tta 672
Thr Asn Ile Lys Ala Val Ala Asn Leu Arg Gln Gln Ser Val Phe Leu
210 215 220
tat cgc ctt gcg ggg cct acg cgc gca gtg ttc gtg aat ttg atg gct 720
Tyr Arg Leu Ala Gly Pro Thr Arg Ala Val Phe Val Asn Leu Met Ala
225 230 235 240
gcg ttg atg gcg ata tgt atc cta ttt gta tca tta atg ctg gaa ctt 768
Ala Leu Met Ala Ile Cys Ile Leu Phe Val Ser Leu Met Leu Glu Leu
245 250 255
gtg gtg gcg aat cat cta cat acg gga ctg tgg tca tcg gtg tcc gtg 816
Val Val Ala Asn His Leu His Thr Gly Leu Trp Ser Ser Val Ser Val
260 265 270
gcc atg tct aca ttt agt aca ttg tca gtt gta tat ctt ata gta tca 864
Ala Met Ser Thr Phe Ser Thr Leu Ser Val Val Tyr Leu Ile Val Ser
275 280 285
gaa tta att ttg gcg cat tat ata cac gtg tta ata gga ccg tcc ctg 912
Glu Leu Ile Leu Ala His Tyr Ile His Val Leu Ile Gly Pro Ser Leu
290 295 300
gga acg ctc gtg gcc tgt gct acg ttg gga acc gcc gcg cac tcg tat 960
Gly Thr Leu Val Ala Cys Ala Thr Leu Gly Thr Ala Ala His Ser Tyr
305 310 315 320
atg gac cga tta tat gac cct ata tcg gtt caa tct cca cgg tta att 1008
Met Asp Arg Leu Tyr Asp Pro Ile Ser Val Gln Ser Pro Arg Leu Ile
325 330 335
ccc aca act cgg gga acc ttg gct tgc ctg gcc gtg ttt tcc gtt gtc 1056
Pro Thr Thr Arg Gly Thr Leu Ala Cys Leu Ala Val Phe Ser Val Val
340 345 350
atg ttg ctt ctc aga ttg atg cgt gca tat gtg tat cat cga cag aaa 1104
Met Leu Leu Leu Arg Leu Met Arg Ala Tyr Val Tyr His Arg Gln Lys
355 360 365
cgc agt cgg ttc tac ggt gcc gta aga aga gta ccc gag cgg gta cgg 1152
Arg Ser Arg Phe Tyr Gly Ala Val Arg Arg Val Pro Glu Arg Val Arg
370 375 380
gga tac ata cga aaa gta aaa cct gca cat aga aat tct cgc cgc aca 1200
Gly Tyr Ile Arg Lys Val Lys Pro Ala His Arg Asn Ser Arg Arg Thr
385 390 395 400
aat tac cca tca caa ggc tac ggc tac gtc tat gaa aat gac tca aca 1248
Asn Tyr Pro Ser Gln Gly Tyr Gly Tyr Val Tyr Glu Asn Asp Ser Thr
405 410 415
tat gaa acg gac cgc gag gat gag ctg tta tac gag cga tca aac agt 1296
Tyr Glu Thr Asp Arg Glu Asp Glu Leu Leu Tyr Glu Arg Ser Asn Ser
420 425 430
ggg tgg gag tag 1308
Gly Trp Glu
435
<210>83
<211>435
<212>PRT
<213>水痘带状疱疹
<400>83
Met Gly Thr Gln Lys Lys Gly Pro Arg Ser Glu Lys Val Ser Pro Tyr
1 5 10 15
Asp Thr Thr Thr Pro Glu Val Glu Ala Leu Asp His Gln Met Asp Thr
20 25 30
Leu Asn Trp Arg Ile Trp Ile Ile Gln Val Met Met Phe Thr Leu Gly
35 40 45
Ala Val Met Leu Leu Ala Thr Leu Ile Ala Ala Ser Ser Glu Tyr Thr
50 55 60
Gly Ile Pro Cys Phe Tyr Ala Ala Val Val Asp Tyr Glu Leu Phe Asn
65 70 75 80
Ala Thr Leu Asp Gly Gly Val Trp Ser Gly Asn Arg Gly Gly Tyr Ser
85 90 95
Ala Pro Val Leu Phe Leu Glu Pro His Ser Val Val Ala Phe Thr Tyr
100 105 110
Tyr Thr Ala Leu Thr Ala Met Ala Met Ala Val Tyr Thr Leu Ile Thr
115 120 125
Ala Ala Ile Ile His Arg Glu Thr Lys Asn Gln Arg Val Arg Gln Ser
130 135 140
Ser Gly Val Ala Trp Leu Val Val Asp Pro Thr Thr Leu Phe Trp Gly
145 150 155 160
Leu Leu Ser Leu Trp Leu Leu Asn Ala Val Val Leu Leu Leu Ala Tyr
165 170 175
Lys Gln Ile Gly Val Ala Ala Thr Leu Tyr Leu Gly His Phe Ala Thr
180 185 190
Ser Val Ile Phe Thr Thr Tyr Phe Cys Gly Arg Gly Lys Leu Asp Glu
195 200 205
Thr Asn Ile Lys Ala Val Ala Asn Leu Arg Gln Gln Ser Val Phe Leu
210 215 220
Tyr Arg Leu Ala Gly Pro Thr Arg Ala Val Phe Val Asn Leu Met Ala
225 230 235 240
Ala Leu Met Ala Ile Cys Ile Leu Phe Val Ser Leu Met Leu Glu Leu
245 250 255
Val Val Ala Asn His Leu His Thr Gly Leu Trp Ser Ser Val Ser Val
260 265 270
Ala Met Ser Thr Phe Ser Thr Leu Ser Val Val Tyr Leu Ile Val Ser
275 280 285
Glu Leu Ile Leu Ala His Tyr Ile His Val Leu Ile Gly Pro Ser Leu
290 295 300
Gly Thr Leu Val Ala Cys Ala Thr Leu Gly Thr Ala Ala His Ser Tyr
305 310 315 320
Met Asp Arg Leu Tyr Asp Pro Ile Ser Val Gln Ser Pro Arg Leu Ile
325 330 335
Pro Thr Thr Arg Gly Thr Leu Ala Cys Leu Ala Val Phe Ser Val Val
340 345 350
Met Leu Leu Leu Arg Leu Met Arg Ala Tyr Val Tyr His Arg Gln Lys
355 360 365
Arg Ser Arg Phe Tyr Gly Ala Val Arg Arg Val Pro Glu Arg Val Arg
370 375 380
Gly Tyr Ile Arg Lys Val Lys Pro Ala His Arg Asn Ser Arg Arg Thr
385 390 395 400
Asn Tyr Pro Ser Gln Gly Tyr Gly Tyr Val Tyr Glu Asn Asp Ser Thr
405 410 415
Tyr Glu Thr Asp Arg Glu Asp Glu Leu Leu Tyr Glu Arg Ser Asn Ser
420 425 430
Gly Trp Glu
435
<210>84
<211>2310
<212>DNA
<213>水痘带状疱疹
<220>
<221>CDS
<222>(1)..(2310)
<400>84
atg gcc gaa ata acg tct ctt ttt aat aac agt tcc ggt agt gaa gaa 48
Met Ala Glu Ile Thr Ser Leu Phe Asn Asn Ser Ser Gly Ser Glu Glu
1 5 10 15
aaa agg ata gca agt tct gtt tct att gac cag ggc ttg aat gga agt 96
Lys Arg Ile Ala Ser Ser Val Ser Ile Asp Gln Gly Leu Asn Gly Ser
20 25 30
aac cca aat gac caa tac aag aac atg ttc gat ata tac tgg aat gag 144
Asn Pro Asn Asp Gln Tyr Lys Asn Met Phe Asp Ile Tyr Trp Asn Glu
35 40 45
tac gcc ccg gat ata ggg ttt tgt aca ttt ccg gag gaa gat ggc tgg 192
Tyr Ala Pro Asp Ile Gly Phe Cys Thr Phe Pro Glu Glu Asp Gly Trp
50 55 60
atg tta ata cac cca acc acg caa agt atg ttg ttt cga aaa atc cta 240
Met Leu Ile His Pro Thr Thr Gln Ser Met Leu Phe Arg Lys Ile Leu
65 70 75 80
gcc ggt gac ttt gga tat acc gat gga caa ggc ata tat agc gct gta 288
Ala Gly Asp Phe Gly Tyr Thr Asp Gly Gln Gly Ile Tyr Ser Ala Val
85 90 95
cgg tct acg gaa act gta att cgc caa gtt cag gca acc gtt ttg atg 336
Arg Ser Thr Glu Thr Val Ile Arg Gln Val Gln Ala Thr Val Leu Met
100 105 110
aac gcg ttg gat gca act cgg tat gag gac cta gca gca gat tgg gaa 384
Asn Ala Leu Asp Ala Thr Arg Tyr Glu Asp Leu Ala Ala Asp Trp Glu
115 120 125
cac cac atc caa caa tgt aac ctg cat gcc ggg gct cta gcg gaa cgt 432
His His Ile Gln Gln Cys Asn Leu His Ala Gly Ala Leu Ala Glu Arg
130 135 140
tat ggg cta tgt gga gaa tca gaa gcc gta cgg ctt gca cat cag gtt 480
Tyr Gly Leu Cys Gly Glu Ser Glu Ala Val Arg Leu Ala His Gln Val
145 150 155 160
ttt gaa acc tgg cgt caa aca tta cag tca tcg tta ctt gag ttt ctg 528
Phe Glu Thr Trp Arg Gln Thr Leu Gln Ser Ser Leu Leu Glu Phe Leu
165 170 175
cgt gga ata acc ggt tgt ctc tat acc agt ggt tta aat gga agg gtc 576
Arg Gly Ile Thr Gly Cys Leu Tyr Thr Ser Gly Leu Asn Gly Arg Val
180 185 190
ggt ttt gcc aaa tac gtg gac tgg ata gcc tgt gta ggt att gtg ccc 624
Gly Phe Ala Lys Tyr Val Asp Trp Ile Ala Cys Val Gly Ile Val Pro
195 200 205
gtt gta aga aag gta cga tca gaa cag aat gga acc cct gca cca tta 672
Val Val Arg Lys Val Arg Ser Glu Gln Asn Gly Thr Pro Ala Pro Leu
210 215 220
aat acg tat atg ggt caa gcg gca gaa ctg tcc cag atg tta aaa gtt 720
Asn Thr Tyr Met Gly Gln Ala Ala Glu Leu Ser Gln Met Leu Lys Val
225 230 235 240
gcc gat gca acg ttg gcc aga gga gcg gcg gtt gtc aca agc cta gtt 768
Ala Asp Ala Thr Leu Ala Arg Gly Ala Ala Val Val Thr Ser Leu Val
245 250 255
gag tgt atg caa aat gtt gct att atg gat tat gat agg acg cgt ctt 816
Glu Cys Met Gln Asn Val Ala Ile Met Asp Tyr Asp Arg Thr Arg Leu
260 265 270
tat tat aat tat aac cga aga tta att atg gca aag gat gat gta acg 864
Tyr Tyr Asn Tyr Asn Arg Arg Leu Ile Met Ala Lys Asp Asp Val Thr
275 280 285
ggc atg aag gga gag tgt ttg gtc gtg tgg ccg ccc gtt gta tgt ggg 912
Gly Met Lys Gly Glu Cys Leu Val Val Trp Pro Pro Val Val Cys Gly
290 295 300
gag ggt gta gta ttt gac tca ccc tta cag cgg ctt tct ggg gag gtg 960
Glu Gly Val Val Phe Asp Ser Pro Leu Gln Arg Leu Ser Gly Glu Val
305 310 315 320
ttg gcc tgt tat gca tta cgt gaa cat gct cgc gtc tgc caa gtt tta 1008
Leu Ala Cys Tyr Ala Leu Arg Glu His Ala Arg Val Cys Gln Val Leu
325 330 335
aat aca gcc cct ttg cgc gtg tta ata ggt cgc cgg aat gaa gat gat 1056
Asn Thr Ala Pro Leu Arg Val Leu Ile Gly Arg Arg Asn Glu Asp Asp
340 345 350
aga tct cac agc aca cgt gcg gtt gat cgt ata atg ggc gag aac gat 1104
Arg Ser His Ser Thr Arg Ala Val Asp Arg Ile Met Gly Glu Asn Asp
355 360 365
aca aca cgg gct gga tcg gcc gcg tct aga ctt gta aag cta ata gtt 1152
Thr Thr Arg Ala Gly Ser Ala Ala Ser Arg Leu Val Lys Leu Ile Val
370 375 380
aac tta aaa aac atg aga cat gtt gga gat att acc gaa acc gta cgt 1200
Asn Leu Lys Asn Met Arg His Val Gly Asp Ile Thr Glu Thr Val Arg
385 390 395 400
tcc tat cta gaa gaa acg ggc aat cac att ctg gaa gga agt gga tcg 1248
Ser Tyr Leu Glu Glu Thr Gly Asn His Ile Leu Glu Gly Ser Gly Ser
405 410 415
gtg gac aca tca caa ccg ggg ttt ggc aag gcc aac caa tcc ttt aac 1296
Val Asp Thr Ser Gln Pro Gly Phe Gly Lys Ala Asn Gln Ser Phe Asn
420 425 430
ggg ggg gca atg tcc gga aca aca aac gtt caa agt gcg ttt aaa act 1344
Gly Gly Ala Met Ser Gly Thr Thr Asn Val Gln Ser Ala Phe Lys Thr
435 440 445
tcg gtg gtt aac agt atc aac ggc atg ctc gag ggt tat gtg aat aat 1392
Ser Val Val Asn Ser Ile Asn Gly Met Leu Glu Gly Tyr Val Asn Asn
450 455 460
tta ttc aaa acc att gag ggt ctc aag gat gtg aac agc gat ctg acc 1440
Leu Phe Lys Thr Ile Glu Gly Leu Lys Asp Val Asn Ser Asp Leu Thr
465 470 475 480
gaa agg ctc cag ttc aaa gaa gga gag ctg aaa cgg tta cgg gaa gag 1488
Glu Arg Leu Gln Phe Lys Glu Gly Glu Leu Lys Arg Leu Arg Glu Glu
485 490 495
agg gta aaa ata aag cca tct aaa ggg tca cat att aca atg gca gaa 1536
Arg Val Lys Ile Lys Pro Ser Lys Gly Ser His Ile Thr Met Ala Glu
500 505 510
gaa aca cgt att gcc gat tta aat cac gag gtt ata gat ctt acc ggc 1584
Glu Thr Arg Ile Ala Asp Leu Asn His Glu Val Ile Asp Leu Thr Gly
515 520 525
ata ata ggg gat gat gca tat att gcc aat agt ttt caa tct cgt tat 1632
Ile Ile Gly Asp Asp Ala Tyr Ile Ala Asn Ser Phe Gln Ser Arg Tyr
530 535 540
atc ccc cct tat gga gat gat ata aaa cgt ttg tct gag cta tgg aaa 1680
Ile Pro Pro Tyr Gly Asp Asp Ile Lys Arg Leu Ser Glu Leu Trp Lys
545 550 555 560
cag gaa ctt gtt cgc tgt ttt aag ctt cac cgg gta aac aat aat caa 1728
Gln Glu Leu Val Arg Cys Phe Lys Leu His Arg Val Asn Asn Asn Gln
565 570 575
ggc cag gaa att tct gta tca tat tca aat gcg tca atc tca tta cta 1776
Gly Gln Glu Ile Ser Val Ser Tyr Ser Asn Ala Ser Ile Ser Leu Leu
580 585 590
gtt gcg ccg tat ttt tca ttc ata tta cgg gcc acc cga tta gga ttc 1824
Val Ala Pro Tyr Phe Ser Phe Ile Leu Arg Ala Thr Arg Leu Gly Phe
595 600 605
ttg gta act caa agc gag gta cat agg tca gag gaa gag tta tgc cag 1872
Leu Val Thr Gln Ser Glu Val His Arg Ser Glu Glu Glu Leu Cys Gln
610 615 620
gct att ttt aaa aag gcg aga aca gag tcc tat tta tcc caa atc cga 1920
Ala Ile Phe Lys Lys Ala Arg Thr Glu Ser Tyr Leu Ser Gln Ile Arg
625 630 635 640
ata tta tat gaa atg cag gtt cgc gca gag gta ata aaa cgg ggc cca 1968
Ile Leu Tyr Glu Met Gln Val Arg Ala Glu Val Ile Lys Arg Gly Pro
645 650 655
cgg aga aca cca agt cct tcc tgg ggt ttg cct gac cct aca gaa gat 2016
Arg Arg Thr Pro Ser Pro Ser Trp Gly Leu Pro Asp Pro Thr Glu Asp
660 665 670
gac gaa aga atc ccg gaa ccc aat aaa ata aat aac caa tac atg cat 2064
Asp Glu Arg Ile Pro Glu Pro Asn Lys Ile Asn Asn Gln Tyr Met His
675 680 685
gtt gga tat aaa aac cta tcc cat ttt atg aaa gga cac ccc cct gag 2112
Val Gly Tyr Lys Asn Leu Ser His Phe Met Lys Gly His Pro Pro Glu
690 695 700
agg tta cgg gta cac aag gta aat gca gcg gat tcg acc tta ctg gat 2160
Arg Leu Arg Val His Lys Val Asn Ala Ala Asp Ser Thr Leu Leu Asp
705 710 715 720
aaa att cga gca aac cgg agg cgc ggg gat ggc cga tgg gat gtc cgg 2208
Lys Ile Arg Ala Asn Arg Arg Arg Gly Asp Gly Arg Trp Asp Val Arg
725 730 735
aat aaa tat acc cag cat ttt agg ttg cag cgt aac gat cga caa ctt 2256
Asn Lys Tyr Thr Gln His Phe Arg Leu Gln Arg Asn Asp Arg Gln Leu
740 745 750
act aac acg agc cga aga ggg gtt gga tgt gag cga cgt gat cga aga 2304
Thr Asn Thr Ser Arg Arg Gly Val Gly Cys Glu Arg Arg Asp Arg Arg
755 760 765
tct tag 2310
Ser
<210>85
<211>769
<212>PRT
<213>水痘带状疱疹
<400>85
Met Ala Glu Ile Thr Ser Leu Phe Asn Asn Ser Ser Gly Ser Glu Glu
1 5 10 15
Lys Arg Ile Ala Ser Ser Val Ser Ile Asp Gln Gly Leu Asn Gly Ser
20 25 30
Asn Pro Asn Asp Gln Tyr Lys Asn Met Phe Asp Ile Tyr Trp Asn Glu
35 40 45
Tyr Ala Pro Asp Ile Gly Phe Cys Thr Phe Pro Glu Glu Asp Gly Trp
50 55 60
Met Leu Ile His Pro Thr Thr Gln Ser Met Leu Phe Arg Lys Ile Leu
65 70 75 80
Ala Gly Asp Phe Gly Tyr Thr Asp Gly Gln Gly Ile Tyr Ser Ala Val
85 90 95
Arg Ser Thr Glu Thr Val Ile Arg Gln Val Gln Ala Thr Val Leu Met
100 105 110
Asn Ala Leu Asp Ala Thr Arg Tyr Glu Asp Leu Ala Ala Asp Trp Glu
115 120 125
His His Ile Gln Gln Cys Asn Leu His Ala Gly Ala Leu Ala Glu Arg
130 135 140
Tyr Gly Leu Cys Gly Glu Ser Glu Ala Val Arg Leu Ala His Gln Val
145 150 155 160
Phe Glu Thr Trp Arg Gln Thr Leu Gln Ser Ser Leu Leu Glu Phe Leu
165 170 175
Arg Gly Ile Thr Gly Cys Leu Tyr Thr Ser Gly Leu Asn Gly Arg Val
180 185 190
Gly Phe Ala Lys Tyr Val Asp Trp Ile Ala Cys Val Gly Ile Val Pro
195 200 205
Val Val Arg Lys Val Arg Ser Glu Gln Asn Gly Thr Pro Ala Pro Leu
210 215 220
Asn Thr Tyr Met Gly Gln Ala Ala Glu Leu Ser Gln Met Leu Lys Val
225 230 235 240
Ala Asp Ala Thr Leu Ala Arg Gly Ala Ala Val Val Thr Ser Leu Val
245 250 255
Glu Cys Met Gln Asn Val Ala Ile Met Asp Tyr Asp Arg Thr Arg Leu
260 265 270
Tyr Tyr Asn Tyr Asn Arg Arg Leu Ile Met Ala Lys Asp Asp Val Thr
275 280 285
Gly Met Lys Gly Glu Cys Leu Val Val Trp Pro Pro Val Val Cys Gly
290 295 300
Glu Gly Val Val Phe Asp Ser Pro Leu Gln Arg Leu Ser Gly Glu Val
305 310 315 320
Leu Ala Cys Tyr Ala Leu Arg Glu His Ala Arg Val Cys Gln Val Leu
325 330 335
Asn Thr Ala Pro Leu Arg Val Leu Ile Gly Arg Arg Asn Glu Asp Asp
340 345 350
Arg Ser His Ser Thr Arg Ala Val Asp Arg Ile Met Gly Glu Asn Asp
355 360 365
Thr Thr Arg Ala Gly Ser Ala Ala Ser Arg Leu Val Lys Leu Ile Val
370 375 380
Asn Leu Lys Asn Met Arg His Val Gly Asp Ile Thr Glu Thr Val Arg
385 390 395 400
Ser Tyr Leu Glu Glu Thr Gly Asn His Ile Leu Glu Gly Ser Gly Ser
405 410 415
Val Asp Thr Ser Gln Pro Gly Phe Gly Lys Ala Asn Gln Ser Phe Asn
420 425 430
Gly Gly Ala Met Ser Gly Thr Thr Asn Val Gln Ser Ala Phe Lys Thr
435 440 445
Ser Val Val Asn Ser Ile Asn Gly Met Leu Glu Gly Tyr Val Asn Asn
450 455 460
Leu Phe Lys Thr Ile Glu Gly Leu Lys Asp Val Asn Ser Asp Leu Thr
465 470 475 480
Glu Arg Leu Gln Phe Lys Glu Gly Glu Leu Lys Arg Leu Arg Glu Glu
485 490 495
Arg Val Lys Ile Lys Pro Ser Lys Gly Ser His Ile Thr Met Ala Glu
500 505 510
Glu Thr Arg Ile Ala Asp Leu Asn His Glu Val Ile Asp Leu Thr Gly
515 520 525
Ile Ile Gly Asp Asp Ala Tyr Ile Ala Asn Ser Phe Gln Ser Arg Tyr
530 535 540
Ile Pro Pro Tyr Gly Asp Asp Ile Lys Arg Leu Ser Glu Leu Trp Lys
545 550 555 560
Gln Glu Leu Val Arg Cys Phe Lys Leu His Arg Val Asn Asn Asn Gln
565 570 575
Gly Gln Glu Ile Ser Val Ser Tyr Ser Asn Ala Ser Ile Ser Leu Leu
580 585 590
Val Ala Pro Tyr Phe Ser Phe Ile Leu Arg Ala Thr Arg Leu Gly Phe
595 600 605
Leu Val Thr Gln Ser Glu Val His Arg Ser Glu Glu Glu Leu Cys Gln
610 615 620
Ala Ile Phe Lys Lys Ala Arg Thr Glu Ser Tyr Leu Ser Gln Ile Arg
625 630 635 640
Ile Leu Tyr Glu Met Gln Val Arg Ala Glu Val Ile Lys Arg Gly Pro
645 650 655
Arg Arg Thr Pro Ser Pro Ser Trp Gly Leu Pro Asp Pro Thr Glu Asp
660 665 670
Asp Glu Arg Ile Pro Glu Pro Asn Lys Ile Asn Asn Gln Tyr Met His
675 680 685
Val Gly Tyr Lys Asn Leu Ser His Phe Met Lys Gly His Pro Pro Glu
690 695 700
Arg Leu Arg Val His Lys Val Asn Ala Ala Asp Ser Thr Leu Leu Asp
705 710 715 720
Lys Ile Arg Ala Asn Arg Arg Arg Gly Asp Gly Arg Trp Asp Val Arg
725 730 735
Asn Lys Tyr Thr Gln His Phe Arg Leu Gln Arg Asn Asp Arg Gln Leu
740 745 750
Thr Asn Thr Ser Arg Arg Gly Val Gly Cys Glu Arg Arg Asp Arg Arg
755 760 765
Ser
<210>86
<211>666
<212>DNA
<213>水痘带状疱疹
<220>
<221>CDS
<222>(1)..(666)
<400>86
atg ttt tcg gag ttg cct cct tcc gta ecg acg gca ttg crt caa tgg 48
Met Phe Ser Glu Leu Pro Pro Ser Val Pro Thr Ala Leu Leu Gln Trp
1 5 10 15
ggt tgg gga ttg cat cgt gga ccg tgt tcg arc cca aat ttt aaa cag 96
Gly Trp Gly Leu His Arg Gly Pro Cys Ser Ile Pro Asn Phe Lys Gln
20 25 30
gta gcc agc caa cac agt gtt cag aac gat ttt aca gaa aat agc gtt 144
Val Ala Ser Gln His Ser Val Gln Asn Asp Phe Thr Glu Asn Ser Val
35 40 45
gat gca aat gaa aaa ttt ccg att ggg cac gcg ggc tgt att gag aaa 192
Asp Ala Asn Glu Lys Phe Pro Ile Gly His Ala Gly Cys Ile Glu Lys
50 55 60
acc aaa gac gac tat gta cca ttt gat acg ttg ttc atg gta rca tct 240
Thr Lys Asp Asp Tyr Val Pro Phe Asp Thr Leu Phe Met Val Ser Ser
65 70 75 80
att gac gaa crt ggg cgg aga caa tta acc gac acc arc cgc cgc agc 288
Ile Asp Glu Leu Gly Arg Arg Gln Leu Thr Asp Thr Ile Arg Arg Ser
85 90 95
ttg gtt atg aac gcc tgt gaa ara acg gtc gcg tgt acg aaa acc gca 336
Leu Val Met Asn Ala Cys Glu Ile Thr Val Ala Cys Thr Lys Thr Ala
100 105 110
gcc ttt tct ggt cga ggc gtg rca cga caa aaa cac gtg acc cra tct 384
Ala Phe Ser Gly Arg Gly Val Ser Arg Gln Lys His Val Thr Leu Ser
115 120 125
aaa aat aaa ttc aat cca tcc agt cat aag agc ctg caa atg ttt gtg 432
Lys Asn Lys Phe Asn Pro Ser Ser His Lys Ser Leu Gln Met Phe Val
130 135 140
ttg tgt caa aaa acc cat gca ccc cgt gtc aga aac cra ctg tac gag 480
Leu Cys Gln Lys Thr His Ala Pro Arg Val Arg Asn Leu Leu Tyr Glu
145 150 155 160
agt att cgt gca aga aga cct cgc cga tat tac acc cgc rca acg gac 528
Ser Ile Arg Ala Arg Arg Pro Arg Arg Tyr Tyr Thr Arg Ser Thr Asp
165 170 175
gga aaa tcg cgt ccg ttg gta cca gtg ttt gtg tat gag ttt acg gct 576
Gly Lys Ser Arg Pro Leu Val Pro Val Phe Val Tyr Glu Phe Thr Ala
180 185 190
tta gat cgt gtc ctt tta cat aag gaa aat act ttg acc gac caa cca 624
Leu Asp Arg Val Leu Leu His Lys Glu Asn Thr Leu Thr Asp Gln Pro
195 200 205
att aat act gaa aat agc ggt cat gga cgt acg aga acg taa 666
Ile Asn Thr Glu Asn Ser Gly His Gly Arg Thr Arg Thr
210 215 220
<210>87
<211>221
<212>PRT
<213>水痘带状疱疹
<400>87
Met Phe Ser Glu Leu Pro Pro Ser Val Pro Thr Ala Leu Leu Gln Trp
1 5 10 15
Gly Trp Gly Leu His Arg Gly Pro Cys Ser Ile Pro Asn Phe Lys Gln
20 25 30
Val Ala Ser Gln His Ser Val Gln Asn Asp Phe Thr Glu Asn Ser Val
35 40 45
Asp Ala Asn Glu Lys Phe Pro Ile Gly His Ala Gly Cys Ile Glu Lys
50 55 60
Thr Lys Asp Asp Tyr Val Pro Phe Asp Thr Leu Phe Met Val Ser Ser
65 70 75 80
Ile Asp Glu Leu Gly Arg Arg Gln Leu Thr Asp Thr Ile Arg Arg Ser
85 90 95
Leu Val Met Asn Ala Cys Glu Ile Thr Val Ala Cys Thr Lys Thr Ala
100 105 110
Ala Phe Ser Gly Arg Gly Val Ser Arg Gln Lys His Val Thr Leu Ser
115 120 125
Lys Asn Lys Phe Asn Pro Ser Ser His Lys Ser Leu Gln Met Phe Val
130 135 140
Leu Cys Gln Lys Thr His Ala Pro Arg Val Arg Asn Leu Leu Tyr Glu
145 150 155 160
Ser Ile Arg Ala Arg Arg Pro Arg Arg Tyr Tyr Thr Arg Ser Thr Asp
165 170 175
Gly Lys Ser Arg Pro Leu Val Pro Val Phe Val Tyr Glu Phe Thr Ala
180 185 190
Leu Asp Arg Val Leu Leu His Lys Glu Asn Thr Leu Thr Asp Gln Pro
195 200 205
Ile Asn Thr Glu Asn Ser Gly His Gly Arg Thr Arg Thr
210 215 220
<210>88
<211>480
<212>DNA
<213>水痘带状疱疹
<220>
<221>CDS
<222>(1)..(480)
<400>88
atg gca rca cat aaa tgg tta ctg cag ara gtt ttt tta aaa act arc 48
Met Ala Ser His Lys Trp Leu Leu Gln Ile Val Phe Leu Lys Thr Ile
1 5 10 15
aca arc gcg tat tgt ctt cat crc caa gac gac act ccg ttg ttt ttt 96
Thr Ile Ala Tyr Cys Leu His Leu Gln Asp Asp Thr Pro Leu Phe Phe
20 25 30
gga gcc aaa ccg cra tcg gat gtg agt ttg att ara acg gaa ccg tgc 144
Gly Ala Lys Pro Leu Ser Asp Val Ser Leu Ile Ile Thr Glu Pro Cys
35 40 45
gtg rca tcg gta tat gag gcg tgg gac tat gcg gca ccc ccg gta rca 192
Val Ser Ser Val Tyr Glu Ala Trp Asp Tyr Ala Ala Pro Pro Val Ser
50 55 60
aac crc agc gag gcg cra tcg gga arc gtg gtt aag aca aaa tgt cca 240
Asn Leu Ser Glu Ala Leu Ser Gly Ile Val Val Lys Thr Lys Cys Pro
65 70 75 80
gta ccg gaa gtt ara ctt tgg ttt aaa gac aaa caa atg gcg tac tgg 288
Val Pro Glu Val Ile Leu Trp Phe Lys Asp Lys Gln Met Ala Tyr Trp
85 90 95
aca aat cca tac gtc acc tta aag ggg ctg gca caa tct gtt ggt gaa 336
Thr Asn Pro Tyr Val Thr Leu Lys Gly Leu Ala Gln Ser Val Gly Glu
100 105 110
gaa cat aaa agc ggg gac ara cgc gat gct ttg ttg gat gcc crt tcc 384
Glu His Lys Set Gly Asp Ile Arg Asp Ala Leu Leu Asp Ala Leu Ser
115 120 125
ggt gta tgg gta gac tct act cca tct tcc aca aat atc ccg gaa aat 432
Gly Val Trp Val Asp Ser Thr Pro Ser Ser Thr Asn Ile Pro Glu Asn
130 135 140
gga tgt gtc tgg gga gcc gac cgt ttg ttc caa cgc gta tgc caa tga 480
Gly Cys Val Trp Gly Ala Asp Arg Leu Phe Gln Arg Val Cys Gln
145 150 155
<210>89
<211>159
<212>PRT
<213>水痘带状疱疹
<400>89
Met Ala Ser His Lys Trp Leu Leu Gln Ile Val Phe Leu Lys Thr Ile
1 5 10 15
Thr Ile Ala Tyr Cys Leu His Leu Gln Asp Asp Thr Pro Leu Phe Phe
20 25 30
Gly Ala Lys Pro Leu Ser Asp Val Ser Leu Ile Ile Thr Glu Pro Cys
35 40 45
Val Ser Ser Val Tyr Glu Ala Trp Asp Tyr Ala Ala Pro Pro Val Ser
50 55 60
Asn Leu Ser Glu Ala Leu Ser Gly Ile Val Val Lys Thr Lys Cys Pro
65 70 75 80
Val Pro Glu Val Ile Leu Trp Phe Lys Asp Lys Gln Met Ala Tyr Trp
85 90 95
Thr Asn Pro Tyr Val Thr Leu Lys Gly Leu Ala Gln Ser Val Gly Glu
100 105 110
Glu His Lys Ser Gly Asp Ile Arg Asp Ala Leu Leu Asp Ala Leu Ser
115 120 125
Gly Val Trp Val Asp Ser Thr Pro Ser Ser Thr Asn Ile Pro Glu Asn
130 135 140
Gly Cys Val Trp Gly Ala Asp Arg Leu Phe Gln Arg Val Cys Gln
145 150 155
<210>90
<211>909
<212>DNA
<213>水痘带状疱疹
<220>
<221>CDS
<222>(1)..(909)
<400>90
atg gct tct gta gca ggt aac gct agt aat atc tca cca cag ccc ccg 48
Met Ala Ser Val Ala Gly Asn Ala Ser Asn Ile Ser Pro Gln Pro Pro
1 5 10 15
tcg ggc gtt cca acc gga ggg gaa ttt gta ctg ata cct acc gcg tat 96
Ser Gly Val Pro Thr Gly Gly Glu Phe Val Leu Ile Pro Thr Ala Tyr
20 25 30
tat tca cag ctg tta acc ggg cag act aaa aat ccg cag gta tca att 144
Tyr Ser Gln Leu Leu Thr Gly Gln Thr Lys Asn Pro Gln Val Ser Ile
35 40 45
gga gct cca aat aac gga cag tat atc gtc ggg cca tat gga tct cca 192
Gly Ala Pro Asn Asn Gly Gln Tyr Ile Val Gly Pro Tyr Gly Set Pro
50 55 60
cac ccg cct gcc ttc cca cct aat aca ggg ggt tat ggt tgc cct ccg 240
His Pro Pro Ala Phe Pro Pro Asn Thr Gly Gly Tyr Gly Cys Pro Pro
65 70 75 80
gga cac ttc ggg gga ccg tac ggg ttt ccg gga tat cca cca ccc aat 288
Gly His Phe Gly Gly Pro Tyr Gly Phe Pro Gly Tyr Pro Pro Pro Asn
85 90 95
cgt ttg gaa atg caa atg tcc gca ttt atg aac gca ttg gcc gcc gaa 336
Arg Leu Glu Met Gln Met Ser Ala Phe Met Asn Ala Leu Ala Ala Glu
100 105 110
cgg ggt att gac ttg cag acc ccg tgt gta aat ttt cca gac aaa acc 384
Arg Gly Ile Asp Leu Gln Thr Pro Cys Val Asn Phe Pro Asp Lys Thr
115 120 125
gat gtc cgt cgt cca gga aaa cgg gat ttc aag agc atg gat caa agg 432
Asp Val Arg Arg Pro Gly Lys Arg Asp Phe Lys Ser Met Asp Gln Arg
130 135 140
gaa ttg gat tct ttt tat agt ggg gag tct caa atg gac gga gag ttt 480
Glu Leu Asn Ser Phe Tyr Ser Gly Glu Ser Gln Met Asp Gly Glu Phe
145 150 155 160
ccc rca aat ara tat ttt ccc ggt gaa cca acg tat ata acg cat cgg 528
Pro Ser Asn Ile Tyr Phe Pro Gly Glu Pro Thr Tyr Ile Thr His Arg
165 170 175
aga cgt cga gtt tct cca tca tat tgg cag agg aga cac aga gtt tct 576
Arg Arg Arg Val Ser Pro Ser Tyr Trp Gln Arg Arg His Arg Val Ser
180 185 190
aat ggt cag cac gaa gag ctt gct ggg gtt gtg gca aaa ctg caa cag 624
Asn Gly Gln His Glu Glu Leu Ala Gly Val Val Ala Lys Leu Gln Gln
195 200 205
gag gtt aca gag cta aaa tca caa aat ggg aca caa atg cct ttg tcg 672
Glu Val Thr Glu Leu Lys Ser Gln Asn Gly Thr Gln Met Pro Leu Ser
210 215 220
cac cat aca aat ata cca gag ggg aca cgg gat cct cga ata tcg att 720
His His Thr Asn Ile Pro Glu Gly Thr Arg Asp Pro Arg Ile Ser Ile
225 230 235 240
tta tta aaa cag ctt caa agc gtt tcg ggt cta tgc tca tcc caa aat 768
Leu Leu Lys Gln Leu Gln Ser Val Ser Gly Leu Cys Ser Ser Gln Asn
245 250 255
aca aca agc acc cca cat aca gat aca gtt gga caa gat gta aat gca 816
Thr Thr Ser Thr Pro His Thr Asp Thr Val Gly Gln Asp Val Asn Ala
260 265 270
gtg gag gcg agt tcc aag gcc cct tta ata cag ggg tcc acg gca gac 864
Val Glu Ala Ser Ser Lys Ala Pro Leu Ile Gln Gly Ser Thr Ala Asp
275 280 285
gac gcc gat atg ttt gca aat cag atg atg gtg ggg cgg tgt taa 909
Asp Ala Asp Met Phe Ala Asn Gln Met Met Val Gly Arg Cys
290 295 300
<210>91
<211>302
<212>PRT
<213>水痘带状疱疹
<400>91
Met Ala Ser Val Ala Gly Asn Ala Ser Asn Ile Ser Pro Gln Pro Pro
1 5 10 15
Ser Gly Val Pro Thr Gly Gly Glu Phe Val Leu Ile Pro Thr Ala Tyr
20 25 30
Tyr Ser Gln Leu Leu Thr Gly Gln Thr Lys Asn Pro Gln Val Ser Ile
35 40 45
Gly Ala Pro Asn Asn Gly Gln Tyr Ile Val Gly Pro Tyr Gly Ser Pro
50 55 60
His Pro Pro Ala Phe Pro Pro Asn Thr Gly Gly Tyr Gly Cys Pro Pro
65 70 75 80
Gly His Phe Gly Gly Pro Tyr Gly Phe Pro Gly Tyr Pro Pro Pro Asn
85 90 95
Arg Leu Glu Met Gln Met Ser Ala Phe Met Asn Ala Leu Ala Ala Glu
100 105 110
Arg Gly Ile Asp Leu Gln Thr Pro Cys Val Asn Phe Pro Asp Lys Thr
115 120 125
Asp Val Arg Arg Pro Gly Lys Arg Asp Phe Lys Ser Met Asp Gln Arg
130 135 140
Glu Leu Asp Ser Phe Tyr Ser Gly Glu Ser Gln Met Asp Gly Glu Phe
145 150 155 160
Pro Ser Asn Ile Tyr Phe Pro Gly Glu Pro Thr Tyr Ile Thr His Arg
165 170 175
Arg Arg Arg Val Ser Pro Ser Tyr Trp Gln Arg Arg His Arg Val Ser
180 185 190
Asn Gly Gln His Glu Glu Leu Ala Gly Val Val Ala Lys Leu Gln Gln
195 200 205
Glu Val Thr Glu Leu Lys Ser Gln Asn Gly Thr Gln Met Pro Leu Ser
210 215 220
His His Thr Asn Ile Pro Glu Gly Thr Arg Asp Pro Arg Ile Ser Ile
225 230 235 240
Leu Leu Lys Gln Leu Gln Ser Val Ser Gly Leu Cys Ser Ser Gln Asn
245 250 255
Thr Thr Ser Thr Pro His Thr Asp Thr Val Gly Gln Asp Val Asn Ala
260 265 270
Val Glu Ala Ser Ser Lys Ala Pro Leu Ile Gln Gly Ser Thr Ala Asp
275 280 285
Asp Ala Asp Met Phe Ala Asn Gln Met Met Val Gly Arg Cys
290 295 300
Claims (91)
1.重组水痘-带状疱疹病毒。
2.如权利要求1所述的重组水痘-带状疱疹病毒,其中包含BAC载体序列。
3.如权利要求2所述的重组水痘-带状疱疹病毒,其中,至少部分前述BAC载体序列插入到水痘-带状疱疹病毒基因组的非必需区域之中。
4.如权利要求3所述的重组水痘-带状疱疹病毒,其中,前述非必需区域选自下述区域:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
5.如权利要求4所述的重组水痘-带状疱疹病毒,其中,所述的非必需区域是基因11的ORF的侧翼区域或基因12的ORF的侧翼区域。
6.如权利要求2所述的重组水痘-带状疱疹病毒,其中,至少部分BAC载体序列插入到水痘-带状疱疹病毒基因组的基因62的ORF中的区域。
7.如权利要求2所述的重组水痘-带状疱疹病毒,其中,所述的BAC载体序列包含重组蛋白依赖的重组序列。
8.如权利要求2所述的重组水痘-带状疱疹病毒,其中,所述的BAC载体序列包含选择标记。
9.如权利要求8所述的重组水痘-带状疱疹病毒,其中,所述的选择标记是药物选择标记。
10.如权利要求2所述的重组水痘-带状疱疹病毒,其中,所述的选择标记是编码绿荧光蛋白的基因。
11.如权利要求2所述的重组水痘-带状疱疹病毒,其中,所述的水痘-带状疱疹病毒基因组来自野生株。
12.如权利要求2所述的重组水痘-带状疱疹病毒,其中,所述的水痘-带状疱疹病毒基因组来自突变型病毒株。
13.如权利要求2所述的重组水痘-带状疱疹病毒,其中,所述的水痘-带状疱疹病毒基因组来自Oka疫苗株。
14.如权利要求2所述的重组水痘-带状疱疹病毒,其中,所述的水痘-带状疱疹病毒基因组在基因62和基因6中带有突变。
15.如权利要求14所述的重组水痘-带状疱疹病毒,其中,所述的基因62在SEQ ID NO.5的碱基中至少具有下述(a)-(d)中的碱基取代:
(a)第2110位的G取代;
(b)第3100位的G取代;
(c)第3818位的C取代;和
(d)第4006位的G取代,
以及前述基因6在SEQ ID NO.8的碱基序列中至少包含第5745位碱基的G碱基取代。
16.如权利要求2所述的重组水痘-带状疱疹病毒,其中,所述的BAC载体序列包含SEQ ID NO.:7所示的序列。
17.包含权利要求1所述病毒的药物组合物。
18.如权利要求17所述的药物组合物,其中,所述的组合物是疫苗的形式。
19.载体,其包含除基因62之外的水痘-带状疱疹病毒基因组必需基因和BAC载体序列。
20.如权利要求19所述的载体,其进一步包含基因62。
21.如权利要求19所述的载体,其中,当所述载体插入哺乳动物细胞时,哺乳动物细胞产生水痘-带状疱疹病毒。
22.如权利要求19所述的载体,其中,来自水痘-带状疱疹病毒基因组的序列与BAC载体序列相连的部位位于该水痘-带状疱疹病毒基因组的非必需区域内。
23.如权利要求22所述的载体,其中,所述的非必需区域选自下述区域:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
24.如权利要求23所述的载体,其中,所述的连接部分是基因11的ORF侧翼区域或基因12的ORF侧翼区域。
25.如权利要求19所述的载体,其中,所述的来自水痘-带状疱疹病毒基因组的序列与前述BAC载体序列相连的部分位于水痘-带状疱疹病毒基因组的基因62的ORF中。
26.如权利要求19所述的载体,其中,所述的BAC载体序列包含重组蛋白依赖的重组序列。
27.如权利要求19所述的载体,其中,所述的BAC载体序列包含选择标记。
28.如权利要求27所述的载体,其中,所述的选择标记是药物选择标记。
29.如权利要求27所述的载体,其中,所述的选择标记是编码绿荧光蛋白的基因。
30.如权利要求19所述的载体,其中,所述的水痘-带状疱疹病毒基因组来自野生株。
31.如权利要求19所述的载体,其中,所述的水痘-带状疱疹病毒基因组来自突变型病毒株。
32.如权利要求19所述的载体,其中,所述的水痘-带状疱疹病毒基因组来自Oka疫苗株。
33.如权利要求19所述的载体,其中,所述的水痘-带状疱疹病毒基因组带有在基因62和基因6中的突变。
34.如权利要求33所述的载体,其中,所述的基因62在SEQ ID NO.5中至少包含下述(a)-(d)中的碱基取代:
(a)第2110位的G取代;
(b)第3100位的G取代;
(c)第3818位的C取代;和
(d)第4006位的G取代,
此外,基因6在SEQ ID NO.8的碱基序列中至少包含第5745位的G取代。
35.如权利要求19所述的载体,其中,所述的BAC载体序列包含SEQ ID NO.:7所示的序列。
36.包含权利要求19所述载体的细胞。
37.如权利要求36所述的细胞,其中,所述的细胞是细菌细胞。
38.如权利要求37所述的细菌细胞,其中,所述的细菌是大肠杆菌。
39.如权利要求36所述的细胞,其中,所述的细胞是哺乳动物细胞。
40.如权利要求39所述的哺乳动物细胞,其中,所述的哺乳动物细胞来源于人。
41.如权利要求39所述的哺乳动物细胞产生的病毒。
42.包含如权利要求41所述的病毒的药物组合物。
43.如权利要求42所述的药物组合物,其中所述的组合物是疫苗的形式。
44.生产重组水痘-带状疱疹病毒的方法,该方法包含以下步骤:
将含有除基因62之外的水痘-带状疱疹病毒基因组必需基因与BAC载体序列的载体导入到哺乳动物宿主细胞中的步骤;和
培养所述的哺乳动物宿主细胞,生产重组水痘-带状疱疹病毒的步骤。
45.如权利要求44所述的方法,其中所述的载体进一步包含基因62。
46.如权利要求44所述的方法,其中所述的哺乳动物宿主细胞来源于人。
47.如权利要求44所述的方法,其中,所述的BAC载体序列至少包含两种重组蛋白依赖的重组序列。
48.如权利要求47所述的方法,其进一步包含在上述两种重组蛋白依赖的重组序列之间的重组步骤。
49.如权利要求44所述的方法,其中,来自前述水痘-带状疱疹病毒基因组的序列与BAC载体序列相连的部位位于该水痘-带状疱疹病毒基因组的非必需区域内。
50.如权利要求49所述的方法,其中,所述的非必需区域选自下述区域:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
51.如权利要求50所述的载体,其中,所述的非必需区域是基因11的ORF侧翼区域或基因12的ORF侧翼区域。
52.如权利要求44所述的方法,其中,所述的来自水痘-带状疱疹病毒基因组的序列与BAC载体序列相连接的部分位于水痘-带状疱疹病毒基因组的基因62的ORF区域内。
53.如权利要求44所述的方法,其中,所述的BAC载体序列包含重组蛋白依赖的重组序列。
54.如权利要求44所述的方法,其中,所述的BAC载体序列包含选择标记。
55.如权利要求54所述的方法,其中,所述的选择标记是药物选择标记。
56.如权利要求54所述的方法,其中,所述的选择标记是编码绿荧光蛋白的基因。
57.如权利要求44所述的方法,其中,所述的水痘-带状疱疹病毒基因组来自野生株。
58.如权利要求44所述的方法,其中,所述的水痘-带状疱疹病毒基因组来自突变型病毒株。
59.如权利要求44所述的方法,其中,所述的水痘-带状疱疹病毒基因组来自Oka疫苗株。
60.如权利要求44所述的方法,其中,所述的水痘-带状疱疹病毒基因组具有基因62和基因6中的突变。
61.如权利要求60所述的方法,其中,所述的基因62 SEQ ID NO.5中至少包含下述(a)-(d)中的碱基取代:
(a)第2110位的G取代;
(b)第3100位的G取代;
(c)第3818位的C取代;和
(d)第4006位的G取代,
以及基因6在SEQ ID NO.8的碱基序列中至少包含第5745位的G取代。
62.如权利要求44所述的方法,其中,所述的BAC载体序列包含SEQ ID NO.:7所示的序列。
63.如权利要求44所述的方法制备的病毒。
64.包含权利要求63所述病毒的药物组合物.
65.如权利要求64所述的药物组合物,其中,所述的组合物是疫苗的形式。
66.将突变导入如权利要求19所述的载体中的方法,该方法包括以下步骤:
将所述载体导入到细菌宿主细胞中的步骤;
将包含由水痘-带状疱疹病毒基因组的一部分组成的片段的质粒载体导入该细菌宿主细胞的步骤,其中,所述片段至少具有一个突变;
培养所述的细菌宿主细胞的步骤;
由经培养的细菌宿主细胞分离具有BAC载体序列的载体的步骤。
67.向如权利要求19所述的载体中导入突变的方法,该方法包括以下步骤,将所述载体导入细菌宿主细胞的步骤;
将包含由水痘-带状疱疹病毒基因组的一部分组成的第一片段的第一质粒载体导入所述细菌宿主细胞的步骤,其中,所述的第一片段至少具有一个变异;
将包含由水痘-带状疱疹病毒基因组的一部分组成的第二片段的第二质粒载体导入所述细菌宿主细胞的步骤,其中,所述的第二片段至少具有一个变异,并且所述的第二片段与第一片段不同;
培养所述细菌宿主细胞的步骤;
从所培养的细菌宿主细胞分离具有BAC载体序列的载体的步骤。
68.核酸盒,其包含能够在细菌细胞内与水痘-带状疱疹病毒基因组基因组同源重组的第一片段,BAC载体序列,能够在细菌细胞内与水痘-带状疱疹病毒基因组基因组同源重组的第二片段,其中,所述的BAC序列两端分别与第一片段和第二片段相连。
69.如权利要求68所述的核酸盒,其中,所述的第一片段和第二片段至少为1kb。
70.如权利要求68所述的核酸盒,其中,所述的第一片段和第二片段至少为1.5kb。
71.如权利要求68所述的核酸盒,其中,所述的第一片段和第二片段至少为2kb。
72.如权利要求68所述的核酸盒,其中,所述的第一片段和第二片段与水痘-带状疱疹病毒基因组序列至少80%同一。
73.如权利要求68所述的核酸盒,其中,所述的第一片段和第二片段与水痘-带状疱疹病毒基因组序列至少85%同一。
74.如权利要求68所述的核酸盒,其中,所述的第一片段和第二片段与水痘-带状疱疹病毒基因组序列至少90%同一。
75.如权利要求68所述的核酸盒,其中,所述的第一片段和第二片段与水痘-带状疱疹病毒基因组序列至少95%同一。
76.如权利要求68所述的核酸盒,其中,所述的第一片段和第二片段分别独立地来自选自下述的水痘-带状疱疹病毒基因组的区域:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因62的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因62的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域,基因70的ORF的侧翼区域。
77.如第68项所述的核酸盒,其中,所述的第一片段和第二片段分别独立地与选自下述的水痘-带状疱疹病毒基因组的区域至少80%同一,所述的区域为:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因62的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因62的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
78.如权利要求68所述的核酸盒,其中,所述的第一片段和第二片段分别独立地与选自下述的水痘-带状疱疹病毒基因组的区域至少85%同一,所述的区域为:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因62的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因62的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
79.如权利要求68所述的核酸盒,其中,所述的第一片段和第二片段分别独立地与选自下述的水痘-带状疱疹病毒基因组的区域至少90%同一,所述的区域为:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因62的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因62的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
80.如权利要求68所述的核酸盒,其中,所述的第一片段和第二片段分别独立地与选自下述的水痘-带状疱疹病毒基因组的区域至少95%同一,所述的区域为:
基因7的ORF中的区域,基因8的ORF中的区域,基因9的ORF中的区域,基因10的ORF中的区域,基因11的ORF中的区域,基因12的ORF中的区域,基因13的ORF中的区域,基因14的ORF中的区域,基因15的ORF中的区域,基因17的ORF中的区域,基因18的ORF中的区域,基因19的ORF中的区域,基因38的ORF中的区域,基因39的ORF中的区域,基因46的ORF中的区域,基因47的ORF中的区域,基因48的ORF中的区域,基因49的ORF中的区域,基因50的ORF中的区域,基因56的ORF中的区域,基因57的ORF中的区域,基因58的ORF中的区域,基因59的ORF中的区域,基因61的ORF中的区域,基因62的ORF中的区域,基因63的ORF中的区域,基因64的ORF中的区域,基因65的ORF中的区域,基因66的ORF中的区域,基因67的ORF中的区域,基因68的ORF中的区域,基因69的ORF中的区域,基因70的ORF中的区域,基因7的ORF的侧翼区域,基因8的ORF的侧翼区域,基因9的ORF的侧翼区域,基因10的ORF的侧翼区域,基因11的ORF的侧翼区域,基因12的ORF的侧翼区域,基因13的ORF的侧翼区域,基因14的ORF的侧翼区域,基因15的ORF的侧翼区域,基因17的ORF的侧翼区域,基因18的ORF的侧翼区域,基因19的ORF的侧翼区域,基因38的ORF的侧翼区域,基因39的ORF的侧翼区域,基因46的ORF的侧翼区域,基因47的ORF的侧翼区域,基因48的ORF的侧翼区域,基因49的ORF的侧翼区域,基因50的ORF的侧翼区域,基因56的ORF的侧翼区域,基因57的ORF的侧翼区域,基因58的ORF的侧翼区域,基因59的ORF的侧翼区域,基因61的ORF的侧翼区域,基因62的ORF的侧翼区域,基因63的ORF的侧翼区域,基因64的ORF的侧翼区域,基因65的ORF的侧翼区域,基因66的ORF的侧翼区域,基因67的ORF的侧翼区域,基因68的ORF的侧翼区域,基因69的ORF的侧翼区域和基因70的ORF的侧翼区域。
81.如权利要求68所述的核酸盒,其中,所述的第一片段和第二片段来自不同的区域。
82.如权利要求72所述的核酸盒,其中,所述的第一片段和第二片段分别独立地来自于基因11的ORF的侧翼序列和基因12的ORF的侧翼序列。
83.如权利要求68所述的核酸盒,其中,所述的BAC载体包含重组蛋白依赖的重组序列。
84.如权利要求68所述的核酸盒,其中,所述的BAC载体序列包含选择标记。
85.如权利要求84所述的核酸盒,其中,所述的选择标记是药物选择标记。
86.如权利要求68所述的核酸盒,其中,所述的选择标记是编码绿荧光蛋白的基因。
87.如权利要求68所述的核酸盒,其中,所述的水痘-带状疱疹病毒基因组来自野生株。
88.如权利要求68所述的核酸盒,其中,所述的水痘-带状疱疹病毒基因组来自突变型病毒株。
89.如权利要求68所述的核酸盒,其中,所述的水痘-带状疱疹病毒基因组来自Oka疫苗株。
90.如权利要求68所述的核酸盒,其中,所述的BAC载体序列包含SEQ ID NO.:7所示的序列。
91.如权利要求68所述的核酸盒,其具有SEQ ID NO.:2所示的核酸序列。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210581606.XA CN103074305B (zh) | 2004-03-05 | 2005-03-03 | 重组水痘-带状疱疹病毒 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP063277/2004 | 2004-03-05 | ||
JP2004063277 | 2004-03-05 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210581606.XA Division CN103074305B (zh) | 2004-03-05 | 2005-03-03 | 重组水痘-带状疱疹病毒 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1950507A true CN1950507A (zh) | 2007-04-18 |
Family
ID=34918153
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2005800145230A Pending CN1950507A (zh) | 2004-03-05 | 2005-03-03 | 重组水痘-带状疱疹病毒 |
CN201210581606.XA Expired - Fee Related CN103074305B (zh) | 2004-03-05 | 2005-03-03 | 重组水痘-带状疱疹病毒 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210581606.XA Expired - Fee Related CN103074305B (zh) | 2004-03-05 | 2005-03-03 | 重组水痘-带状疱疹病毒 |
Country Status (8)
Country | Link |
---|---|
US (2) | US20110189233A1 (zh) |
EP (2) | EP1721981A4 (zh) |
JP (3) | JPWO2005085445A1 (zh) |
KR (1) | KR101246740B1 (zh) |
CN (2) | CN1950507A (zh) |
AU (1) | AU2005219731B2 (zh) |
CA (1) | CA2558586A1 (zh) |
WO (1) | WO2005085445A1 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102666842A (zh) * | 2009-07-28 | 2012-09-12 | 北京万泰生物药业股份有限公司 | Orf7缺陷型水痘病毒、含有该病毒的疫苗及应用 |
CN115894707A (zh) * | 2021-07-28 | 2023-04-04 | 江苏瑞科生物技术股份有限公司 | 一种基因重组水痘-带状疱疹病毒融合蛋白及其制备方法和应用 |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080226677A1 (en) | 2004-05-06 | 2008-09-18 | Yasuko Mori | Recombinant virus vector for gene transfer into lymphoid cells |
EP2471938A3 (en) * | 2005-11-24 | 2013-04-24 | The Research Foundation for Microbial Diseases of Osaka University | Recombinant polyvalent vaccine |
US10166285B2 (en) * | 2006-11-09 | 2019-01-01 | The United States Of America, As Represented By The Secretary, Department Of Health & Human Services | Recombinant virus with diminished latency and methods of using same |
UA112970C2 (uk) * | 2010-08-05 | 2016-11-25 | Мерк Шарп Енд Доме Корп. | Інактивований вірус вітряної віспи, спосіб його одержання і застосування |
US20140147458A1 (en) * | 2011-02-24 | 2014-05-29 | Mogam Biotechnology Research Institute | Novel varicella-zoster virus strains, and chicken pox and herpes zoster virus vaccine using same |
WO2014043189A1 (en) * | 2012-09-14 | 2014-03-20 | The Regents Of The University Of Colorado, A Body Corporate | Conditionally replication deficient herpes viruses and use thereof in vaccines |
JP2014236748A (ja) * | 2014-08-26 | 2014-12-18 | 一般財団法人阪大微生物病研究会 | 組換え多価ワクチン |
GB201818084D0 (en) * | 2018-11-06 | 2018-12-19 | Univ Oxford Innovation Ltd | Compositions and methods |
CN110845583A (zh) * | 2019-11-18 | 2020-02-28 | 维塔恩(广州)医药有限公司 | 水痘带状疱疹病毒相关抗原短肽及其应用 |
CN112870344B (zh) * | 2019-11-29 | 2022-07-19 | 北京绿竹生物技术股份有限公司 | 一种重组水痘带状疱疹病毒疫苗 |
CN115819616A (zh) * | 2021-07-28 | 2023-03-21 | 江苏瑞科生物技术股份有限公司 | 一种基因重组vzv融合蛋白及其制备方法和应用 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5341202A (en) | 1976-09-28 | 1978-04-14 | Fuji Photo Film Co Ltd | Novel magnetic recording medium |
US4554101A (en) | 1981-01-09 | 1985-11-19 | New York Blood Center, Inc. | Identification and preparation of epitopes on antigens and allergens on the basis of hydrophilicity |
ZA872705B (en) | 1986-04-22 | 1987-10-05 | Immunex Corporation | Human g-csf protein expression |
US6093535A (en) | 1996-05-15 | 2000-07-25 | The Research Foundation For Microbial Diseases Of Osaka University | Method for identifying attenuated chickenpox virus Oka strain or strain originating therein and acceptable as attenuated chickenpox vaccine virus |
US6277621B1 (en) * | 1998-02-26 | 2001-08-21 | Medigene, Inc. | Artificial chromosome constructs containing foreign nucleic acid sequences |
CN1163604C (zh) * | 1999-02-25 | 2004-08-25 | 财团法人阪大微生物病研究会 | 减毒水痘病毒冈株的基因62和利用基因62的减毒水痘活疫苗用病毒株的鉴定方法 |
KR100441459B1 (ko) * | 2000-01-31 | 2004-07-23 | 사이단호진한다이비세이부쯔뵤우겐큐우카이 | 약독 생수두백신의 품질 관리 방법 |
EP1178111A1 (en) * | 2000-08-03 | 2002-02-06 | Lohmann Animal Health GmbH & Co. KG | Vaccination against host cell-associated herpesviruses |
US20080226677A1 (en) * | 2004-05-06 | 2008-09-18 | Yasuko Mori | Recombinant virus vector for gene transfer into lymphoid cells |
-
2005
- 2005-03-03 CN CNA2005800145230A patent/CN1950507A/zh active Pending
- 2005-03-03 CN CN201210581606.XA patent/CN103074305B/zh not_active Expired - Fee Related
- 2005-03-03 EP EP05719956A patent/EP1721981A4/en not_active Withdrawn
- 2005-03-03 JP JP2006510728A patent/JPWO2005085445A1/ja not_active Withdrawn
- 2005-03-03 AU AU2005219731A patent/AU2005219731B2/en not_active Ceased
- 2005-03-03 US US10/591,787 patent/US20110189233A1/en not_active Abandoned
- 2005-03-03 KR KR1020067018048A patent/KR101246740B1/ko not_active IP Right Cessation
- 2005-03-03 CA CA002558586A patent/CA2558586A1/en not_active Abandoned
- 2005-03-03 WO PCT/JP2005/003652 patent/WO2005085445A1/ja active Application Filing
- 2005-03-03 EP EP11153574A patent/EP2383343A3/en not_active Withdrawn
-
2011
- 2011-12-12 JP JP2011271666A patent/JP2012061005A/ja not_active Withdrawn
-
2012
- 2012-12-14 US US13/715,665 patent/US20130101620A1/en not_active Abandoned
-
2014
- 2014-03-20 JP JP2014057800A patent/JP2014166182A/ja active Pending
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102666842A (zh) * | 2009-07-28 | 2012-09-12 | 北京万泰生物药业股份有限公司 | Orf7缺陷型水痘病毒、含有该病毒的疫苗及应用 |
CN102666842B (zh) * | 2009-07-28 | 2016-03-02 | 北京万泰生物药业股份有限公司 | Orf7缺陷型水痘病毒、含有该病毒的疫苗及应用 |
CN105770886A (zh) * | 2009-07-28 | 2016-07-20 | 北京万泰生物药业股份有限公司 | Orf7缺陷型水痘病毒、含有该病毒的疫苗及应用 |
CN105770886B (zh) * | 2009-07-28 | 2017-11-28 | 北京万泰生物药业股份有限公司 | Orf7缺陷型水痘病毒、含有该病毒的疫苗及应用 |
US9885020B2 (en) | 2009-07-28 | 2018-02-06 | Rutgers, The State University Of New Jersey | ORF7 deficient varicella virus, vaccine comprising the virus and use thereof |
US10752885B2 (en) | 2009-07-28 | 2020-08-25 | Rutgers, The State University Of New Jersey | ORF7 deficient varicella virus, vaccine comprising the virus and use thereof |
US11220673B2 (en) | 2009-07-28 | 2022-01-11 | Beijing Wantai Biological Pharmacy Enterprise Co., Ltd. | ORF7 deficient varicella virus, vaccine comprising the virus and use thereof |
CN115894707A (zh) * | 2021-07-28 | 2023-04-04 | 江苏瑞科生物技术股份有限公司 | 一种基因重组水痘-带状疱疹病毒融合蛋白及其制备方法和应用 |
Also Published As
Publication number | Publication date |
---|---|
JP2012061005A (ja) | 2012-03-29 |
EP2383343A3 (en) | 2012-01-25 |
KR20070009580A (ko) | 2007-01-18 |
EP1721981A4 (en) | 2007-03-14 |
CN103074305B (zh) | 2015-07-22 |
AU2005219731B2 (en) | 2012-02-23 |
CA2558586A1 (en) | 2005-09-15 |
KR101246740B1 (ko) | 2013-03-26 |
JPWO2005085445A1 (ja) | 2007-12-13 |
CN103074305A (zh) | 2013-05-01 |
US20110189233A1 (en) | 2011-08-04 |
EP2383343A2 (en) | 2011-11-02 |
WO2005085445A1 (ja) | 2005-09-15 |
AU2005219731A1 (en) | 2005-09-15 |
US20130101620A1 (en) | 2013-04-25 |
JP2014166182A (ja) | 2014-09-11 |
EP1721981A1 (en) | 2006-11-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1950507A (zh) | 重组水痘-带状疱疹病毒 | |
CN1056878C (zh) | 重组火鸡疱疹病毒及其衍生的活载体疫苗 | |
CN1289674C (zh) | Dna疫苗-pcv | |
CN100335131C (zh) | 重组火鸡疱疹病毒及其应用 | |
CN113215109A (zh) | 非洲猪瘟多基因联合缺失减毒株的构建及作为疫苗的应用 | |
CN113025629A (zh) | 一种基因缺失的减毒非洲猪瘟病毒株及应用 | |
CN112899290B (zh) | 一种天然免疫抑制基因缺失的减毒非洲猪瘟病毒株及应用 | |
US20140348875A1 (en) | Koi herpesvirus vaccine | |
US20100119550A1 (en) | Recombinant multivalent vaccine | |
CN1244692C (zh) | 一种伪狂犬病TK-/gE-/gI-基因缺失标志活疫苗及制备方法 | |
CN114222579A (zh) | 猪圆环病毒3型(pcv3)疫苗及其生产和用途 | |
JP7387623B2 (ja) | 標的タンパク質を安定して発現できる組換えウイルス | |
CN1533242A (zh) | 口蹄疫病毒疫苗 | |
Kucuktas et al. | Molecular biology of channel catfish virus | |
CN112790125B (zh) | 一种玉兔百褶裙泰狮金鱼的创制方法 | |
CN1654667A (zh) | 减毒hsv-1基因治疗载体 | |
US20220154149A1 (en) | Syncytial oncolytic herpes simplex mutants as potent cancer therapeutics | |
KR20080070012A (ko) | 재조합 다가 백신 | |
Shiau et al. | A simple selection system for construction of recombinant gD-negative pseudorabies virus as a vaccine vector | |
CN1283788C (zh) | 表达猪繁殖与呼吸综合征病毒的重组伪狂犬病毒及应用 | |
CN108864300A (zh) | 鸡白蛋白-干扰素α-白介素2融合蛋白、制备方法及其编码基因、一种鸡长效干扰素 | |
CN105343877A (zh) | 一种5基因缺失伪狂犬病重组病毒活疫苗及其制备方法 | |
KR101623498B1 (ko) | 약독화 백시니아 바이러스주 kvac103 | |
CN1287861C (zh) | 人源基因引导序列、基因载体及基因表达方法 | |
US7589071B1 (en) | Large capacity viral amplicon using a minimal orilyt from human cytomegalovirus |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1104315 Country of ref document: HK |
|
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20070418 |
|
REG | Reference to a national code |
Ref country code: HK Ref legal event code: WD Ref document number: 1104315 Country of ref document: HK |