CN112094822A - Infectious cDNA clone based on EV71 strain and application thereof - Google Patents
- Publication number
- CN112094822A CN201910474088.3A
- Authority
- CN
- China
- Prior art keywords
- virus
- leu
- ala
- ser
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
- 208000015181 infectious disease Diseases 0.000 title claims abstract description 116
- 241001529459 Enterovirus A71 Species 0.000 title claims abstract description 84
- 230000002458 infectious effect Effects 0.000 title claims abstract description 79
- 239000002299 complementary DNA Substances 0.000 title claims abstract description 54
- 241000700605 Viruses Species 0.000 claims abstract description 146
- 108700008625 Reporter Genes Proteins 0.000 claims abstract description 46
- 239000002245 particle Substances 0.000 claims abstract description 33
- 230000003612 virological effect Effects 0.000 claims abstract description 24
- 241001465754 Metazoa Species 0.000 claims abstract description 16
- 229960005486 vaccine Drugs 0.000 claims abstract description 13
- 238000010171 animal model Methods 0.000 claims abstract description 11
- 239000003814 drug Substances 0.000 claims abstract description 7
- 239000013612 plasmid Substances 0.000 claims description 50
- 150000007523 nucleic acids Chemical group 0.000 claims description 47
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 46
- 108020004414 DNA Proteins 0.000 claims description 33
- 108700026244 Open Reading Frames Proteins 0.000 claims description 17
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 11
- 108010067390 Viral Proteins Proteins 0.000 claims description 9
- 238000010276 construction Methods 0.000 claims description 8
- 102000053602 DNA Human genes 0.000 claims description 7
- 238000000034 method Methods 0.000 claims description 7
- 108091026890 Coding region Proteins 0.000 claims description 6
- 108091027544 Subgenomic mRNA Proteins 0.000 claims description 5
- 108060001084 Luciferase Proteins 0.000 claims description 4
- 239000005089 Luciferase Substances 0.000 claims description 4
- 238000004519 manufacturing process Methods 0.000 claims description 3
- 238000012216 screening Methods 0.000 claims description 3
- 239000013603 viral vector Substances 0.000 claims description 3
- 108091006047 fluorescent proteins Proteins 0.000 claims description 2
- 102000034287 fluorescent proteins Human genes 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 210000002845 virion Anatomy 0.000 claims 1
- 238000011161 development Methods 0.000 abstract description 11
- 239000013598 vector Substances 0.000 abstract description 7
- 238000001415 gene therapy Methods 0.000 abstract description 5
- 238000001514 detection method Methods 0.000 abstract description 4
- 230000009385 viral infection Effects 0.000 abstract description 4
- 239000003153 chemical reaction reagent Substances 0.000 abstract description 3
- 239000013604 expression vector Substances 0.000 abstract description 3
- 230000003053 immunization Effects 0.000 abstract description 3
- 238000002649 immunization Methods 0.000 abstract description 2
- 230000002265 prevention Effects 0.000 abstract description 2
- 108020004635 Complementary DNA Proteins 0.000 description 38
- 238000010804 cDNA synthesis Methods 0.000 description 37
- 210000004027 cell Anatomy 0.000 description 35
- 241000699670 Mus sp. Species 0.000 description 34
- 108020000999 Viral RNA Proteins 0.000 description 23
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 23
- 210000003501 vero cell Anatomy 0.000 description 18
- 238000000338 in vitro Methods 0.000 description 16
- 108090000623 proteins and genes Proteins 0.000 description 14
- 239000006228 supernatant Substances 0.000 description 14
- 239000012634 fragment Substances 0.000 description 11
- 101710172711 Structural protein Proteins 0.000 description 9
- 238000010367 cloning Methods 0.000 description 9
- 241000709661 Enterovirus Species 0.000 description 8
- 230000035772 mutation Effects 0.000 description 8
- 102000004169 proteins and genes Human genes 0.000 description 8
- 230000010076 replication Effects 0.000 description 8
- 101710132601 Capsid protein Proteins 0.000 description 7
- 101710197658 Capsid protein VP1 Proteins 0.000 description 7
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 7
- 101710118046 RNA-directed RNA polymerase Proteins 0.000 description 7
- 101710108545 Viral protein 1 Proteins 0.000 description 7
- 238000003556 assay Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 238000004520 electroporation Methods 0.000 description 7
- 230000004927 fusion Effects 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 230000029812 viral genome replication Effects 0.000 description 7
- 238000012408 PCR amplification Methods 0.000 description 6
- 239000003443 antiviral agent Substances 0.000 description 6
- 230000002238 attenuated effect Effects 0.000 description 5
- 230000002950 deficient Effects 0.000 description 5
- 229940079593 drug Drugs 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- 108010057821 leucylproline Proteins 0.000 description 5
- 241000699666 Mus <mouse, genus> Species 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 108091005804 Peptidases Proteins 0.000 description 4
- 241000709664 Picornaviridae Species 0.000 description 4
- 239000004365 Protease Substances 0.000 description 4
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 230000003834 intracellular effect Effects 0.000 description 4
- 108010003700 lysyl aspartic acid Proteins 0.000 description 4
- 238000010172 mouse model Methods 0.000 description 4
- 239000002773 nucleotide Substances 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 230000014616 translation Effects 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 241000991587 Enterovirus C Species 0.000 description 3
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 3
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 3
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 3
- 150000001413 amino acids Chemical group 0.000 description 3
- 230000000840 anti-viral effect Effects 0.000 description 3
- 108010047857 aspartylglycine Proteins 0.000 description 3
- 210000003169 central nervous system Anatomy 0.000 description 3
- 230000000120 cytopathologic effect Effects 0.000 description 3
- 241001493065 dsRNA viruses Species 0.000 description 3
- 108010092114 histidylphenylalanine Proteins 0.000 description 3
- 231100000225 lethality Toxicity 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 108010068488 methionylphenylalanine Proteins 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 230000003362 replicative effect Effects 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 230000004083 survival effect Effects 0.000 description 3
- 208000024891 symptom Diseases 0.000 description 3
- 210000001519 tissue Anatomy 0.000 description 3
- 230000017613 viral reproduction Effects 0.000 description 3
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 2
- 102100025230 2-amino-3-ketobutyrate coenzyme A ligase, mitochondrial Human genes 0.000 description 2
- 108010087522 Aeromonas hydrophilia lipase-acyltransferase Proteins 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- 101000651036 Arabidopsis thaliana Galactolipid galactosyltransferase SFR2, chloroplastic Proteins 0.000 description 2
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 2
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 2
- 241000282693 Cercopithecidae Species 0.000 description 2
- 241000709687 Coxsackievirus Species 0.000 description 2
- 102100031780 Endonuclease Human genes 0.000 description 2
- 241000988559 Enterovirus A Species 0.000 description 2
- 208000007212 Foot-and-Mouth Disease Diseases 0.000 description 2
- 241000710198 Foot-and-mouth disease virus Species 0.000 description 2
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 2
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 2
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 2
- 208000020061 Hand, Foot and Mouth Disease Diseases 0.000 description 2
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- QKIBIXAQKAFZGL-GUBZILKMSA-N Leu-Cys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QKIBIXAQKAFZGL-GUBZILKMSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 2
- 241000282567 Macaca fascicularis Species 0.000 description 2
- 241000282560 Macaca mulatta Species 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 208000028389 Nerve injury Diseases 0.000 description 2
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 2
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- BARPGRUZBKFJMA-SRVKXCTJSA-N Pro-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BARPGRUZBKFJMA-SRVKXCTJSA-N 0.000 description 2
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 2
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 2
- 208000035415 Reinfection Diseases 0.000 description 2
- 238000011579 SCID mouse model Methods 0.000 description 2
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- 108091023045 Untranslated Region Proteins 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- 230000003044 adaptive effect Effects 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 230000002550 fecal effect Effects 0.000 description 2
- 230000001605 fetal effect Effects 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 108700043045 nanoluc Proteins 0.000 description 2
- 230000008764 nerve damage Effects 0.000 description 2
- 230000000926 neurological effect Effects 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 244000052769 pathogen Species 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 108090000765 processed proteins & peptides Proteins 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 1
- BAAVRTJSLCSMNM-CMOCDZPBSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-4-carboxybutanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]pentanedioic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 BAAVRTJSLCSMNM-CMOCDZPBSA-N 0.000 description 1
- AWXGSYPUMWKTBR-UHFFFAOYSA-N 4-carbazol-9-yl-n,n-bis(4-carbazol-9-ylphenyl)aniline Chemical compound C12=CC=CC=C2C2=CC=CC=C2N1C1=CC=C(N(C=2C=CC(=CC=2)N2C3=CC=CC=C3C3=CC=CC=C32)C=2C=CC(=CC=2)N2C3=CC=CC=C3C3=CC=CC=C32)C=C1 AWXGSYPUMWKTBR-UHFFFAOYSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- WRDANSJTFOHBPI-FXQIFTODSA-N Ala-Arg-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N WRDANSJTFOHBPI-FXQIFTODSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 1
- OLDOLPWZEMHNIA-PJODQICGSA-N Arg-Ala-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OLDOLPWZEMHNIA-PJODQICGSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 1
- OCOZPTHLDVSFCZ-BPUTZDHNSA-N Arg-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N OCOZPTHLDVSFCZ-BPUTZDHNSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- QIWYWCYNUMJBTC-CIUDSAMLSA-N Arg-Cys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIWYWCYNUMJBTC-CIUDSAMLSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- LXTGAOAXPSJWOU-DCAQKATOSA-N Asn-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N LXTGAOAXPSJWOU-DCAQKATOSA-N 0.000 description 1
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- ZNYKKCADEQAZKA-FXQIFTODSA-N Asn-Ser-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O ZNYKKCADEQAZKA-FXQIFTODSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- YHXNKGKUDJCAHB-PBCZWWQYSA-N Asn-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YHXNKGKUDJCAHB-PBCZWWQYSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- APYNREQHZOGYHV-ACZMJKKPSA-N Asp-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N APYNREQHZOGYHV-ACZMJKKPSA-N 0.000 description 1
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- XAPPCWUWHNWCPQ-PBCZWWQYSA-N Asp-Thr-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XAPPCWUWHNWCPQ-PBCZWWQYSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 1
- SFUUYRSAJPWTGO-SRVKXCTJSA-N Cys-Asn-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SFUUYRSAJPWTGO-SRVKXCTJSA-N 0.000 description 1
- SBMGKDLRJLYZCU-BIIVOSGPSA-N Cys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N)C(=O)O SBMGKDLRJLYZCU-BIIVOSGPSA-N 0.000 description 1
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 1
- PRHGYQOSEHLDRW-VGDYDELISA-N Cys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N PRHGYQOSEHLDRW-VGDYDELISA-N 0.000 description 1
- UBHPUQAWSSNQLQ-DCAQKATOSA-N Cys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O UBHPUQAWSSNQLQ-DCAQKATOSA-N 0.000 description 1
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 1
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 1
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 241001466953 Echovirus Species 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000963438 Gaussia <copepod> Species 0.000 description 1
- 108700023863 Gene Components Proteins 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- OIIIRRTWYLCQNW-ACZMJKKPSA-N Gln-Cys-Asn Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O OIIIRRTWYLCQNW-ACZMJKKPSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 1
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 1
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 1
- YMCPEHDGTRUOHO-SXNHZJKMSA-N Gln-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N YMCPEHDGTRUOHO-SXNHZJKMSA-N 0.000 description 1
- CVRUVYDNRPSKBM-QEJZJMRPSA-N Gln-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N CVRUVYDNRPSKBM-QEJZJMRPSA-N 0.000 description 1
- NVHJGTGTUGEWCG-ZVZYQTTQSA-N Gln-Trp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O NVHJGTGTUGEWCG-ZVZYQTTQSA-N 0.000 description 1
- UGEZSPWLJABDAR-KKUMJFAQSA-N Gln-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N UGEZSPWLJABDAR-KKUMJFAQSA-N 0.000 description 1
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- QLPYYTDOUQNJGQ-AVGNSLFASA-N Glu-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N QLPYYTDOUQNJGQ-AVGNSLFASA-N 0.000 description 1
- YDJOULGWHQRPEV-SRVKXCTJSA-N Glu-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N YDJOULGWHQRPEV-SRVKXCTJSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- OLTHVCNYJAALPL-BHYGNILZSA-N Glu-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OLTHVCNYJAALPL-BHYGNILZSA-N 0.000 description 1
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- VUUOMYFPWDYETE-WDSKDSINSA-N Gly-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VUUOMYFPWDYETE-WDSKDSINSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 1
- MJNWEIMBXKKCSF-XVYDVKMFSA-N His-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MJNWEIMBXKKCSF-XVYDVKMFSA-N 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- JBJNKUOMNZGQIM-PYJNHQTQSA-N His-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JBJNKUOMNZGQIM-PYJNHQTQSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 1
- OMNVOTCFQQLEQU-CIUDSAMLSA-N His-Asn-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMNVOTCFQQLEQU-CIUDSAMLSA-N 0.000 description 1
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 1
- OSZUPUINVNPCOE-SDDRHHMPSA-N His-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OSZUPUINVNPCOE-SDDRHHMPSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- ORZGPQXISSXQGW-IHRRRGAJSA-N His-His-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O ORZGPQXISSXQGW-IHRRRGAJSA-N 0.000 description 1
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 1
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 1
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 1
- CHIAUHSHDARFBD-ULQDDVLXSA-N His-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 CHIAUHSHDARFBD-ULQDDVLXSA-N 0.000 description 1
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- 101001128634 Homo sapiens NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 2, mitochondrial Proteins 0.000 description 1
- 101000837344 Homo sapiens T-cell leukemia translocation-altered gene protein Proteins 0.000 description 1
- 108700039609 IRW peptide Proteins 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- IXEFKXAGHRQFAF-HVTMNAMFSA-N Ile-Glu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IXEFKXAGHRQFAF-HVTMNAMFSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- MITYXXNZSZLHGG-OBAATPRFSA-N Ile-Trp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N MITYXXNZSZLHGG-OBAATPRFSA-N 0.000 description 1
- 102000001617 Interferon Receptors Human genes 0.000 description 1
- 108010054267 Interferon Receptors Proteins 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 1
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- CKSXSQUVEYCDIW-AVGNSLFASA-N Lys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N CKSXSQUVEYCDIW-AVGNSLFASA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- ZASPELYMPSACER-HOCLYGCPSA-N Lys-Gly-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZASPELYMPSACER-HOCLYGCPSA-N 0.000 description 1
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 1
- GPVLSVCBKUCEBI-KKUMJFAQSA-N Met-Gln-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GPVLSVCBKUCEBI-KKUMJFAQSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 1
- JOYFULUKJRJCSX-IUCAKERBSA-N Met-Met-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O JOYFULUKJRJCSX-IUCAKERBSA-N 0.000 description 1
- QTMIXEQWGNIPBL-JYJNAYRXSA-N Met-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N QTMIXEQWGNIPBL-JYJNAYRXSA-N 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 102100032194 NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 2, mitochondrial Human genes 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 206010021888 Nervous system infections Diseases 0.000 description 1
- 206010029350 Neurotoxicity Diseases 0.000 description 1
- 206010033799 Paralysis Diseases 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- PDUVELWDJZOUEI-IHRRRGAJSA-N Phe-Cys-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PDUVELWDJZOUEI-IHRRRGAJSA-N 0.000 description 1
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- FXYXBEZMRACDDR-KKUMJFAQSA-N Phe-His-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FXYXBEZMRACDDR-KKUMJFAQSA-N 0.000 description 1
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 1
- DZVXMMSUWWUIQE-ACRUOGEOSA-N Phe-His-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N DZVXMMSUWWUIQE-ACRUOGEOSA-N 0.000 description 1
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 1
- OHIYMVFLQXTZAW-UFYCRDLUSA-N Phe-Met-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OHIYMVFLQXTZAW-UFYCRDLUSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 1
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 1
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 1
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 1
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- ZBAGOWGNNAXMOY-IHRRRGAJSA-N Pro-Cys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZBAGOWGNNAXMOY-IHRRRGAJSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 1
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 1
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- BXHRXLMCYSZSIY-STECZYCISA-N Pro-Tyr-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O BXHRXLMCYSZSIY-STECZYCISA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- 206010037394 Pulmonary haemorrhage Diseases 0.000 description 1
- 206010037423 Pulmonary oedema Diseases 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 241001068295 Replication defective viruses Species 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- RQXDSYQXBCRXBT-GUBZILKMSA-N Ser-Met-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RQXDSYQXBCRXBT-GUBZILKMSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- PZHJLTWGMYERRJ-SRVKXCTJSA-N Ser-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O PZHJLTWGMYERRJ-SRVKXCTJSA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- 102100028692 T-cell leukemia translocation-altered gene protein Human genes 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- JMQUAZXYFAEOIH-XGEHTFHBSA-N Thr-Arg-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O JMQUAZXYFAEOIH-XGEHTFHBSA-N 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- XUGYQLFEJYZOKQ-NGTWOADLSA-N Thr-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XUGYQLFEJYZOKQ-NGTWOADLSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- 206010044221 Toxic encephalopathy Diseases 0.000 description 1
- BDWDMRSGCXEDMR-WFBYXXMGSA-N Trp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BDWDMRSGCXEDMR-WFBYXXMGSA-N 0.000 description 1
- HJWVPKJHHLZCNH-DVXDUOKCSA-N Trp-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=3C4=CC=CC=C4NC=3)C)C(O)=O)=CNC2=C1 HJWVPKJHHLZCNH-DVXDUOKCSA-N 0.000 description 1
- FNOQJVHFVLVMOS-AAEUAGOBSA-N Trp-Gly-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N FNOQJVHFVLVMOS-AAEUAGOBSA-N 0.000 description 1
- LDMUNXDDIDAPJH-VMBFOHBNSA-N Trp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LDMUNXDDIDAPJH-VMBFOHBNSA-N 0.000 description 1
- SAKLWFSRZTZQAJ-GQGQLFGLSA-N Trp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SAKLWFSRZTZQAJ-GQGQLFGLSA-N 0.000 description 1
- WSMVEHPVOYXPAQ-XIRDDKMYSA-N Trp-Ser-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N WSMVEHPVOYXPAQ-XIRDDKMYSA-N 0.000 description 1
- DYIXEGROAOVQPK-VFAJRCTISA-N Trp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DYIXEGROAOVQPK-VFAJRCTISA-N 0.000 description 1
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 1
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 1
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- YRBHLWWGSSQICE-IHRRRGAJSA-N Tyr-Asp-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O YRBHLWWGSSQICE-IHRRRGAJSA-N 0.000 description 1
- KLGFILUOTCBNLJ-IHRRRGAJSA-N Tyr-Cys-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O KLGFILUOTCBNLJ-IHRRRGAJSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- JHORGUYURUBVOM-KKUMJFAQSA-N Tyr-His-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O JHORGUYURUBVOM-KKUMJFAQSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- BBSPTGPYIPGTKH-JYJNAYRXSA-N Tyr-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BBSPTGPYIPGTKH-JYJNAYRXSA-N 0.000 description 1
- YSGAPESOXHFTQY-IHRRRGAJSA-N Tyr-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N YSGAPESOXHFTQY-IHRRRGAJSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- LVILBTSHPTWDGE-PMVMPFDFSA-N Tyr-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CC=C(O)C=C1 LVILBTSHPTWDGE-PMVMPFDFSA-N 0.000 description 1
- ANHVRCNNGJMJNG-BZSNNMDCSA-N Tyr-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CS)C(=O)O)N)O ANHVRCNNGJMJNG-BZSNNMDCSA-N 0.000 description 1
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 1
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- OFQGGTGZTOTLGH-NHCYSSNCSA-N Val-Met-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N OFQGGTGZTOTLGH-NHCYSSNCSA-N 0.000 description 1
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- 108700005077 Viral Genes Proteins 0.000 description 1
- 108010067674 Viral Nonstructural Proteins Proteins 0.000 description 1
- 108010087302 Viral Structural Proteins Proteins 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 241000907316 Zika virus Species 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 229940031567 attenuated vaccine Drugs 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 108010020595 beta-casomorphin 4 Proteins 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 244000309464 bull Species 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 108091092328 cellular RNA Proteins 0.000 description 1
- 208000015114 central nervous system disease Diseases 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 238000009509 drug development Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000684 flow cytometry Methods 0.000 description 1
- 238000000799 fluorescence microscopy Methods 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-L glutamate group Chemical group N[C@@H](CCC(=O)[O-])C(=O)[O-] WHUUTDBJXJRKMK-VKHMYHEASA-L 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 231100000516 lung damage Toxicity 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 210000004165 myocardium Anatomy 0.000 description 1
- 210000000822 natural killer cell Anatomy 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 230000007135 neurotoxicity Effects 0.000 description 1
- 231100000228 neurotoxicity Toxicity 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000011809 primate model Methods 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 208000005333 pulmonary edema Diseases 0.000 description 1
- 230000003016 quadriplegic effect Effects 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 210000002345 respiratory system Anatomy 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 210000002027 skeletal muscle Anatomy 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 238000004448 titration Methods 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010032276 tyrosyl-glutamyl-tyrosyl-glutamic acid Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000006656 viral protein synthesis Effects 0.000 description 1
- 210000000605 viral structure Anatomy 0.000 description 1
- 230000006394 virus-host interaction Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/08—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses
- C07K16/10—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses from RNA viruses
- C07K16/1009—Picornaviridae, e.g. hepatitis A virus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/569—Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
- G01N33/56983—Viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/32011—Picornaviridae
- C12N2770/32311—Enterovirus
- C12N2770/32321—Viruses as such, e.g. new isolates, mutants or their genomic sequences
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/32011—Picornaviridae
- C12N2770/32311—Enterovirus
- C12N2770/32334—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/32011—Picornaviridae
- C12N2770/32311—Enterovirus
- C12N2770/32351—Methods of production or purification of viral material
- C12N2770/32352—Methods of production or purification of viral material relating to complementing cells and packaging systems for producing virus or viral particles
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/106—Plasmid DNA for vertebrates
- C12N2800/107—Plasmid DNA for vertebrates for mammalian
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2333/00—Assays involving biological materials from specific organisms or of a specific nature
- G01N2333/005—Assays involving biological materials from specific organisms or of a specific nature from viruses
- G01N2333/01—DNA viruses
- G01N2333/015—Parvoviridae, e.g. feline panleukopenia virus, human Parvovirus
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Virology (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Immunology (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- Wood Science & Technology (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Hematology (AREA)
- Urology & Nephrology (AREA)
- Communicable Diseases (AREA)
- Pharmacology & Pharmacy (AREA)
- Biophysics (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- General Physics & Mathematics (AREA)
- Epidemiology (AREA)
- Mycology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Tropical Medicine & Parasitology (AREA)
- Pathology (AREA)
- Plant Pathology (AREA)
- Cell Biology (AREA)
- Analytical Chemistry (AREA)
- Food Science & Technology (AREA)
- Oncology (AREA)
- Chemical Kinetics & Catalysis (AREA)
Abstract
Description
Technical Field
The invention belongs to the field of biomedicine. It relates to the construction of an infectious cDNA clone based on a clinically isolated EV71 strain (js1), and to the use of the viruses produced from this cDNA clone and its derivative clones, the viruses carrying reporter genes, and the animal models established with them in antiviral drug development, vaccine development and virus diagnosis.
Background Art
The prior art discloses that enterovirus is a general term for a group of viruses comprising 3 types of poliovirus, 23 types of Coxsackie virus A, 6 types of Coxsackie virus B, 31 types of ECHO virus and enterovirus types 68-71, 67 types in total. Enteroviruses discovered after the traditional typing are named in order of discovery; the new enteroviruses found so far are types 68, 69, 70, 71 and 72. The new enterovirus type 71, abbreviated EV71, belongs to the genus Enterovirus of the family Picornaviridae. It was isolated in Australia and the United States in 1969 and in Japan in 1973, and is regarded as a major pathogen of hand, foot and mouth disease outbreaks in children (Schmidt et al. J Infect Dis 1974, 129:304-309; Hagiwara et al. Intervirology 1978, 9:60-63). Before 1988, EV71 mainly caused outbreaks of hand, foot and mouth disease in infants and young children in the United States, Japan, Europe and Australia (Weng et al. Microbes Infect 2010;12:505-10; Tagaya et al. Jpn J Med Sci Biol 1975;28:231-4; Blomberg et al. Lancet 1974;2:112; Nagy et al. Arch Virol 1982;71:217-27; Kennett et al. Bull World Health Organ 1974;51:609-15; Gilbert et al. Pediatr Infect Dis J 1988;7:484-8). Since 1990, EV71 has caused a series of outbreaks in the Asia-Pacific region (Chan et al. Clin Infect Dis 2000;31:678-83; Tu et al. Emerg Infect Dis 2007;13:1733-41; Jeong et al. Arch Virol 2010;155:1707-12). By 2014, EV71 infection had reached every continent and many countries worldwide. Studies show that EV71 infection is concentrated in the Asia-Pacific region, with cases also found in North America, South America, Europe and Australia.
Studies have reported that EV71 is a single-stranded, positive-sense RNA virus whose genome encodes a single long open reading frame (ORF) flanked at the two ends of the genome by two long non-coding regions, the 5' UTR and the 3' UTR. The 5' UTR contains an internal ribosome entry site (IRES) that initiates translation of the viral genome (Hellen et al. Genes Dev. 2001;15:1593-1612). After translation, the product of the viral open reading frame is cleaved by virus-encoded proteases into individual viral proteins: the structural proteins VP4, VP2, VP3 and VP1, which make up the virus particle, and the non-structural proteins 2A, 2B, 2C, 3A, 3B, 3C and 3D, which are responsible for viral replication (Racaniello et al. Fields Virology. 2007, Fifth edition).
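The genome organization described above can be written down as a small data structure. The following is only an illustrative sketch: the function name `genome_layout` and the role labels are hypothetical, and the 5' to 3' order of the proteins is assumed to follow the order in which the text lists them, which matches the usual picornavirus polyprotein layout.

```python
# Illustrative sketch only: names and labels are hypothetical helpers,
# not part of the patent. Protein order follows the listing in the text.
STRUCTURAL = ["VP4", "VP2", "VP3", "VP1"]                     # capsid proteins
NON_STRUCTURAL = ["2A", "2B", "2C", "3A", "3B", "3C", "3D"]   # replication proteins


def genome_layout():
    """Return the genome segments in 5'->3' order with a coarse role label."""
    layout = [("5' UTR", "non-coding, contains IRES")]
    layout += [(name, "structural") for name in STRUCTURAL]
    layout += [(name, "non-structural") for name in NON_STRUCTURAL]
    layout.append(("3' UTR", "non-coding"))
    return layout


if __name__ == "__main__":
    for name, role in genome_layout():
        print(f"{name:7s} {role}")
```

Everything between the two UTR entries belongs to the single open reading frame; the individual proteins only exist after proteolytic processing of its translation product.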
Studies have reported that primates can serve as infection models for EV71. As early as 1978, Hashimoto et al. reported that cynomolgus monkeys weighing 1.8-3.8 kg, after 9 weeks of quarantine, could be infected with an EV71 strain isolated from the stool specimen of a 3-year-old child. EV71 was neurovirulent in these monkeys: on the fourth day of infection the animals showed clinical signs of neurological damage, and the degree of damage correlated positively with virus titer. EV71 also induced serum neutralizing antibodies in the monkeys (Hashimoto et al. Arch Virol. 1978;56:257-61). Zhang et al. used rhesus monkeys aged 3-3.5 years to establish animal infection models showing symptoms such as intracerebral infection, pulmonary edema, and hemorrhage accompanied by neurological damage, while intravenous and respiratory infection could lead directly to infection of the nervous system. Models suited to different research purposes can therefore be obtained through different infection routes (Zhang et al. Lab Invest. 2011;91:1337-50). A rhesus monkey model that develops central nervous system disease has also been described (Liu et al. Virology. 2011;412:91-100).
Non-primate models of EV71 have also been reported. For example, the mouse-adapted EV71 strain EV71/MP4 can infect ICR mice and cause neurological and lung damage (Chen et al. J Virol. 2007, 81:8996-9003; Wang et al. J Virol. 2004, 78:7916-24). Arita et al. passaged the virus in immunodeficient non-obese diabetic/severe combined immunodeficiency (NOD/SCID) mice and obtained a mouse-adapted EV71 strain able to infect 3-week-old NOD/SCID mice; in this mouse model natural killer cell function is suppressed and functional T and B cells are absent. The mouse-adapted strain mainly infects the central nervous system, heart and skeletal muscle of the animals (Arita et al. J Virol. 2008, 82(4):1787-97). With AG129 mice, which are immunodeficient through loss of the interferon α/β and γ receptors, animals 2 weeks old or younger can be infected with a natural EV71 strain and show limb paralysis before death (Khong et al. J Virol. 2012, 86(4):2121-31). Three-week-old transgenic mice expressing the EV71 receptor hSCARB2 can be successfully infected with the EV71 Isehara/Japan/99 (Isehara) strain. These studies indicate that building a mouse model of EV71 requires either a specially mouse-adapted strain or mice carrying gene deletions or modifications.
Studies have also reported that, once released into the cytoplasm of the host cell, the genomic RNA of a single-stranded positive-sense RNA virus can be translated directly as an mRNA template. The non-structural proteins produced by translation recruit the viral genome into a replication complex, which starts genome replication and the viral life cycle. The genomic RNA of such a virus is therefore infectious: after introduction into a host cell it can initiate the entire viral life cycle (Racaniello et al. Science. 1981, 214(4523):916). Infectious clones are usually constructed by taking total RNA from virus-infected cells as the template, reverse-transcribing it into complementary DNA (cDNA), and cloning the viral fragments into a cloning vector. Complete viral RNA is then produced from the infectious clone by in vitro transcription and transfected into host cells to start the viral life cycle and produce progeny virus. Alternatively, if the infectious clone carries a eukaryotic promoter, the plasmid can be transfected directly; the RNA polymerase of the host cell transcribes the full-length viral RNA, which in turn starts the viral life cycle and produces progeny virus.
Mouse model studies have confirmed that glutamic acid at position 145 of EV71 VP1 is the main determinant of virus-induced death in mice, and that methylation of lysine at position 149 of VP2 acts synergistically to enhance the lethality conferred by VP1 145E (Huang et al. Virology. 2012, 422(1):132-43). This site readily mutates to 145G during in vitro passage of the virus, which reduces the ability of the virus to infect animals (Yi et al. Unpublished data).
Against this background and state of the prior art, the inventors of the present application set out to provide an infectious cDNA clone based on an EV71 strain and applications thereof.
Summary of the Invention
The object of the present invention is to provide, against the background and current state of the prior art, an infectious cDNA clone based on an EV71 strain and applications thereof. Specifically, it relates to a stable infectious cDNA clone of an EV71 strain; the viral RNA produced from this clone and its derivatives can replicate autonomously in cells, produce progeny virus particles and express reporter genes.
A further technical problem addressed by the present invention is to provide recombinant viruses, subunit virus particles, plasmids and the like constructed on the basis of the above clone, in support of animal model construction, vaccine development and antiviral drug development.
The inventors isolated an EV71 strain (named js1) from a clinical sample. It requires no mouse-adaptive mutations and can infect mice whose genetic background has not been altered. By constructing its infectious clone, virus particles with a stable gene sequence can be produced and used to infect ordinary mice, establishing a simple and efficient EV71 animal infection model.
More specifically,
The present invention provides a cDNA comprising the nucleic acid sequence of an EV71 strain and the nucleic acid sequence of a low-copy plasmid backbone, wherein the nucleic acid sequence of the EV71 strain covers the 5' to 3' positive-sense sequence of the EV71 virus, including the viral 5' and 3' non-coding regions and an open reading frame encoding the viral proteins.
Preferably, it further comprises the sequence of a reporter gene, luciferase or a fluorescent protein, inserted into the nucleic acid sequence of the EV71 strain.
The amino acid sequence of the viral protein open reading frame is shown in SEQ ID NO 4.
The coding sequence of the low-copy plasmid backbone is shown in SEQ ID NO 3.
The nucleic acid sequence of the EV71 strain is shown in SEQ ID NO 2.
In a preferred embodiment of the present invention, the infectious cDNA clone of the EV71 strain has the sequence shown in SEQ ID NO 1.
In one embodiment of the present invention, a stable infectious cDNA clone of a clinically isolated EV71 strain is constructed (nucleic acid sequence 1), together with its derivative clones containing various reporter genes (nucleic acid sequence 5, nucleic acid sequence 6) and various mutant clones built from it as the parent. The viral RNA produced from these clones can replicate autonomously in cells, produce progeny virus particles and express the reporter genes.
The present invention also includes recombinant virus clones, and their sequences, that contain heterologous reporter sequences or genes of interest and are constructed by replacing Nluc or EGFP in the parent sequences given as nucleic acid sequence 6 or nucleic acid sequence 7.
The present invention also includes the various chimeric viruses, and the various virus particles containing reporter genes or foreign genes, that are produced from the chimeric virus infectious clones and from the recombinant virus clones containing heterologous reporter sequences or genes of interest.
The present invention also includes recombinant virus clones, and their sequences, constructed from the full-length infectious clone sequence, in which a heterologous resistance sequence is inserted into a viral protein in the same open reading frame.
Specifically, the present invention provides an infectious cDNA clone (nucleic acid sequence 1) of the clinically isolated EV71 strain js1. This infectious clone comprises the full-length nucleic acid sequence of the EV71 strain js1 (nucleic acid sequence 2) and a low-copy plasmid backbone (nucleic acid sequence 3). Nucleic acid sequence 2 covers the 5' to 3' positive-sense sequence of the EV71 virus, including the viral 5' and 3' non-coding regions and an open reading frame encoding the viral proteins (protein sequence 4). Inserting the reporter genes NanoLuc luciferase (Nluc) and the fluorescent protein EGFP into this infectious clone yields, respectively, the Nluc-carrying infectious clone (nucleic acid sequence 5) and the EGFP-carrying infectious clone (nucleic acid sequence 6). The invention further covers derivatives obtained from these clones by altering the nucleic acid, such as mutant (adapted) virus clones, live-attenuated virus clones, replication-defective virus clones and replication-competent non-infectious clones, for example subgenomic replicons lacking the structural proteins.
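Because the viral insert is described as two non-coding regions flanking one open reading frame, a simple forward-strand ORF scan is one way a reader might locate the coding region in a sequence such as nucleic acid sequence 2. The sketch below is an assumption-laden illustration, not the method used in the patent: the function name `longest_forward_orf` is hypothetical, only ATG start codons and the standard stop codons are considered, and the demonstration input is a short toy sequence rather than the real clone.

```python
from typing import Tuple

# Standard stop codons (DNA alphabet); assumption: standard genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_forward_orf(seq: str) -> Tuple[int, int]:
    """Return (start, end) of the longest ATG...stop ORF on the forward strand.

    Coordinates are 0-based and end-exclusive, and the stop codon is included.
    Returns (-1, -1) if no complete ORF is found.
    """
    seq = seq.upper().replace(" ", "")  # tolerate lower case and stray spaces
    best = (-1, -1)
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i                      # first ATG opens a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i + 3) - start > best[1] - best[0]:
                    best = (start, i + 3)      # keep the longest ORF seen so far
                start = None                   # look for the next ATG in this frame
    return best


if __name__ == "__main__":
    toy = "ccATGGGTTCGCAAGTGTAAtt"  # toy input: short ORF with flanking bases
    start, end = longest_forward_orf(toy)
    print(start, end, toy.upper()[start:end])
```

Run on a full-length clone, the region upstream of the reported start would correspond to the 5' non-coding region and the region downstream of the reported end to the 3' non-coding region, provided the longest ORF really is the viral one; plasmid backbone genes on the same strand could in principle compete, so the viral insert should be scanned on its own.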
Sequences 1-6 above are as follows:
Nucleic acid sequence 1, SEQ ID NO 1:
GCTAGCGGAGTGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTGTCGACGCGGCCGCTAATACGACTCACTAT
AGGTTAAAACAGCCTGTGGGTTGCACCCACTCACAGGGCCTACTGGGCGCAAGCACTCTGGTACCTCGGTACCTTTGTGCGCCTGTTTTACACCCCCCCCCCAATGAAACTTAGAAGCAATAAACCACGATCAATAGCAGGCATAACGCTCCAGTTATGTCTTGATCAAGCACTTCTGTTTCCCCGGACTGAGTATCAATAGACTGCTCGCGCGGTTGAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGGAAAACCTAGTAACACCATGAAAGTTGCGGAGAGCTTCGTTCAGCACTCCCCCAGTGTAGATCAGGTCGATGAGTCACCGCGTTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCCATGGGGTAACCCATGGGGCGCTCTAATACGGACATGGTGTGAAGAGTCTACTGAGCTAGTTGGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCCAACTGCGGAGCACACGCCCACAAGCCAGCGGGTAGTGTGTCGTAACGGGTAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCCTTTTATCTTTATATTGGCTGCTTATGGTGACAATTAAAGAATTGTTACCATATAGCTATTGGATTAGCCATCCGGTGTGCAACAGAGCAATTATTTACCTATTTATTGGTTTTGTACCATTAACCTCGAATTCTGTGACCACCCTTAATTATATCTTGACCCTTAACACAGCTAAACATGGGTTCGCAAGTGTCTACACAGCGCTCCGGTTCTTACGAAAACTCAAACTCAGCCACTGAGGGTTCTACCATAAACTACACCACCATTAATTACTACAAAGACTCCTATGCTGCCACAGCAGGCAAaCAGAGTCTCAAGCAGGATCCAGACAAGTTTGCAAATCCTGTTAAAGACATATTCACcGAAATGGCAGCGCCACTGAAGTCCCCATCCGCTGAGGCATGTGGATACAGTGATCGAGTGGCGCAATTAACTATTGGCAACTCCACCATCACGACGCAAGAAGCGGCTAACATCATAGTCGGCTATGGTGAGTGGCCTTCCTACTGCTCAGATTCTGACGCTACAGCAGTGGATAAACCAACGCGCCCGGATGTTTCAGTGAACAGGTTTTACACATTGGACACTAAATTGTGGGAGAAATCGTCCAAGGGATGGTACTGGAAGTTCCCGGATGTGTTAACTGAAACTGGGGTTTTTGGGCAAAATGCACAATTCCACTACCTCTACCGATCAGGGTTCTGCATCCACGTGCAGTGCAATGCCAGTAAATTCCACCAAGGAgCACTcCtAgTCGCTGTCCTACCAGAGTATGTCATTGGGACAGTGGCAGGCGGTACAGGGACGGAAGACACCCACCCCCCCTACAAGCAGACCCAACCCGGCGCCGATGGTTTCGAGTTGCAACACCCGTACGTGCTTGATGCTGGCATCCCAATATCACAGTTAACAGTGTGCCCACACCAGTGGATTAATTTGAGGACCAACAATTGTGCTACAATAATAGTGCCATACATTAACGCACTGCCTTTTGATTCTGCCTTGAACCATTGCAACTTTGGCCTGTTAGTTGTGCCTATTAGCCCACTAGACTACGACCAAGGAGCAACGCCAGTAATCCCTATAACTATCACATTGGCCCCAATGTGCTCTGAATTCGCAGGTCTTAGGCAGGCAGTCACGCAAGGGTTCCCCACCGAGCTAAAACCTGGCACAAATCAATTTTTAACCACCGATGATGGCGTCTCAGCACCTATTCTACCAAACTTCCACCCCACCCCGTGTATCCACATACCTGGTGAAGTTAGGAACTTGCTAGAGTTATGCCAGGTGGAGACCATTCTGGAGGTTAACAATGTGCCCACGAATGCCACTAGCTTAATGGAGAGACTGCGCTTCCCGGTCTCAGCACAAGCAGGGAAAGGTGAACTGTGTGCGGTGTTTAGAGCCGATCCTGGGCGAAATGGACCAT
GGCAATCCACCTTACTGGGCCAGTTGTGCGGGTACTACACCCAATGGTCAGGGTCATTGGAAGTCACCTTCATGTTTACTGGATCCTTCATGGCTACCGGCAAGATGCTCATAGCCTATACACCGCCAGGGGGTCCTCTGCCCAAGGACCGGGCGACCGCCATGTTGGGCACGCACGTCATCTGGGATTTTGGGCTGCAATCGTCTGTTACCCTTGTAATACCATGGATCAGTAACACTCATTATAGAGCACATGCCCGAGATGGAGTGTTTGACTATTACACTACAGGGTTAGTCAGTATATGGTACCAGACAAATTACGTGGTTCCAATCGGTGCGCCCAACACAGCCTATATAATAGCACTAGCGGCAGCCCAAAAGAACTTCACTATGAAATTGTGCAAGGATGCTAGTGATATCCTGCAGACGGGCACCATCCAGGGAGATAGGGTGGCAGATGTAATTGAAAGTTCCATAGGAGATAGCGTGAGCAGAGCCCTCACTCACGCTCTACCAGCACCCACAGGCCAAAACACACAGGTGAGCAGTCATCGACTGGATACAGGCAAGGTTCCAGCACTCCAAGCTGCTGAAATTGGGGCATCATCAAATGCTAGTGACGAGAGCATGATTGAAACACGTTGTGTTCTTAACTCGCATAGTACAGCTGAGACCACTCTTGATAGTTTCTTCAGTAGGGCAGGATTAGTTGGAGAGATAGATCTCCCTCTTGAGGGCACAACTAACCCAAATGGTTATGCCAACTGGGACATAGATATAACAGGTTACGCGCAAATGCGTAGAAAGGTAGAGCTATTCACCTACATGCGTTTTGATGCAGAGTTCACTTTTGTTGCGTGCACACCCACCGGGGAGGTTGTCCCACAATTGCTCCAATATATGTTTGTGCCACCTGGAGCCCCTAAGCCAGATTCTAGGGAATCCCTTGCATGGCAAACCGCCACCAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCCGGCGCAGGTTTCAGTGCCATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGAGAACACAAACAGGAGAAAGACCTTGAATACGGGGCATGTCCTAATAACATGATGGGTACATTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGTTAGGATTTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGCAACCAGAACTACCTGTTCAAAGCCAACCCAAATTATGCTGGCAACTCTATTAAGCCAACTGGTGCCAGTCGCACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGGGCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACCCATAATGATTGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATCCACCACTGCCCAAGGTTGTGACACGATTGCCCGTTGCGATTGCCAGACAGGGGTGTACTACTGTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATGTAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAGGGTCACTCGGAACCTGGTGATTGCGGTGGTATCCTTAGGTGCCAACATGGCGTCATCGGCATAGTGTCTACTGGTGGCAATGGGCTCGTTGGCTTTGCAGACGTCAGAGACCTCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATTAAGGGTCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAAGCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATTTTGAAAAATCTTATTAAACTAATCTCTGCACTGGTGATTGTGATCAGAAGTGATTACGACATGGTTACCCTCACTGCAACCTTAGCGCTGATA
GGTTGTCATGGCAGTCCTTGGGCTTGGATTAAAGCCAAAACAGCCTCCATCTTAGGTATCCCTATCGCCCAAAAGCAGAGCGCTTCCTGGCTCAAGAAGTTCAATGACATGGCCAACGCCGCTAAGGGGTTAGAGTGGGTTTCCAACAAGATCAGCAAATTTATTGATTGGCTTAAGGAGAAAATAGTACCAGCAGCCAGGGAGAAGGTTGAATTCCTAAATAACTTGAAACAGCTGCCACTGCTAGAGAATCAGATCTCGAACTTGGAACAATCTGCTGCTTCACAAGAGGACCTTGAAGTCATGTTTGGGAATGTGTCGTACCTAGCTCACTTCTGTCGCAAGTTTCAACCGCTATACGCCACGGAAGCTAAAAGAGTCTATGCCCTGGAGAAGAGAATGAATAACTATATGCAGTTCAAGAGCAAACACCGAATTGAACCTGTATGTCTCATTATTAGGGGCTCACCAGGCACCGGGAAGTCTCTAGCCACTGGTATTATTGCTCGAGCAATCGCTGATAAGTACCACTCCAGCGTGTACTCGCTCCCACCAGACCCGGATCATTTTGACGGTTACAAGCAACAGGTGGTTACAGTGATGGATGATTTGTGTCAAAACCCCGATGGTAAGGATATGTCCTTATTCTGTCAAATGGTATCCACCGTAGATTTCATTCCACCAATGGCTTCTCTCGAGGAGAAGGGAGTTTCCTTCACCTCTAAGTTTGTCATCGCATCCACTAATGCCAGTAATATCATAGTACCAACAGTGTCTGATTCTGACGCTATTCGCCGCAGGTTCTACATGGACTGTGACATTGAAGTGACAGACTCGTACAAAACAGATCTAGGTAGACTGGATGCAGGGCGAGCCGCTAAACTGTGTTCTGAAAATAACACTGCAAATTTCAAACGTTGCAGCCCATTAGTGTGTGGGAAAGCCATCCAACTTAGAGATAGAAAGTCTAAAGTCAGATACAGTGTGGATACGGTGGTTTCAGAACTTATTAGGGAATACAGCAATAGGTCCGCCATTGGTAACACAATCGAGGCTCTTTTCCAAGGTCCACCCAAGTTCAGGCCAATTAGGATTAGCCTTGAAGAAAAACCAGCCCCAGACGCTATTAGCGATCTCCTTGCTAGTGTAGATAGTGAAGAAGTGCGCCAGTACTGCAGGGATCAAGGCTGGATTATTCCTGAAGCTCCCACCAATGTGGAGCGGCACCTTAATAGAGCGGTGCTCGTCATGCAATCCATCACCACAGTAGTGGCGGTTGTTTCGTTGGTGTACGTCATCTACAAGCTCTTTGCAGGGTTTCAGGGTGCATATTCTGGTGCTCCTAAGCAAGTGCTTAAGAAACCTGCTCTTCGCACAGCAACAGTGCAGGGTCCGAGCCTTGACTTTGCTCTCTCCCTACTGAGAAGGAACATCAGGCAGGTCCAAACAGACCAAGGGCATTTCACCATGTTGGGTGTTAGGGATCGCTTAGCAGTCCTCCCACGCCACTCACAACCTGGCAAAACCATTTGGATTGAGCACAAACTCGTGAACGTCCTTGATGCAGTTGAACTGGTGGATGAGCAAGGAGTCAACCTGGAATTAACCCTCATCACTCTTGACACCAACGAGAAGTTTAGGGATATCACCAAATTCATCCCAGAAAATATCAGCACTGCTAGCGATGCCACCCTAGTGATCAACACGGAGCACATGCCGTCAATGTTTGTCCCGGTGGGTGACGTTGTGCAGTATGGCTTTTTGAATCTCAGTGGCAAGCCTACCCATCGCACCATGATGTACAATTTTCCTACTAAAGCAGGACAGTGTGGAGGAGTGGTGACATCTGTTGGGAAGGTTGTCGGTATTCACATTGGTGGCAATGGCAGACAAGGTTTTTGCGCAGGCCTCAAAAGGAGTTACTTTGCTAGTGAACAAGGAGAGATCCAGTGGGTTAAGCCCAATAAAGAAAcTggAAGACTCAACATCAATGGACCAAC
CCGCACCAAGTTAGAACCTAGTGTATTCCATGACATCTTCGAGGGAAATAAGGAACCAGCTGTCTTGCACAGTAAAGACCCCCGACTTGAGGTAGATTTTGAACAGGCCCTGTTCTCTAAGTATGTGGGAAACACACTACATGAGCCTGACGAGTACATCAAAGAGGCAGCTCTACATTATGCAAACCAATTAAAGCAACTAGAAATCAATACCTCTCAAATGAGCATGGAGGAGGCCTGCTATGGTACTGAGAATCTTGAGGCTATTGATCTTCACACTAGTGCAGGTTACCCCTATAGTGCCCTAGGGATAAAGAAAAGAGACATCTTAGACCCTACCACCAGGGACGTGAGTAGAATGAAGTTCTACATGGACAAGTATGGTCTTGATCTTCCCTACTCCACTTATGTCAAGGACGAGCTACGCTCGATTGATAAAATCAAGAAAGGGAAGTCCCGCCTGATCGAGGCCAGTAGTCTAAATGATTCAGTGTACCTCAGAATGGCTTTCGGGCATTTGTATGAGGCTTTCCACGCAAATCCTGGGACGATAACTGGATCGGCCGTGGGGTGTAACCCTGACACATTCTGGAGCAAGCTGCCAATTTTGCTCCCTGGTTCACTCTTTGCCTTTGACTACTCAGGCTATGATGCCAGCCTTAGCCCTGTCTGGTTCAGAGCATTAGAATTGGTTCTTAGGGAGATAGGGTATAGTGAAGAGGCAATCTCACTCATTGAGGGAATCAACCACACACATCATGTGTATCGTAATAAGACCTATTGCGTGCTTGGTGGGATGCCCTCAGGCTGTTCAGGAACATCCATCTTCAACTCAATGATCAACAACATTATTATCAGAGCACTGCTCATAAAAACATTTAAGGGCATTGATTTGGATGAACTCAACATGGTCGCTTATGGAGACGATGTGCTCGCTAGCTATCCCTTCCCAATTGATTGCTTGGAACTAGCAAAGACTGGTAAGGAGTATGGTCTGACCATGACCCCTGCTGATAAATCTCCTTGCTTTAATGAGGTCAATTGGGGTAATGCGACCTTCCTCAAAAGGGGCTTTTTGCCCGATGAACAGTTTCCATTTTTGATTCACCCTACTATGCCAATGAGGGAGATCCATGAGTCCATTCGATGGACCAAGGACGCACGGAACACTCAAGATCATGTGCGGTCCTTGTGCCTCCTAGCATGGCATAATGGTAAGCAAGAATACGAGAAGTTTGTGAGCACAATTAGGTCTGTCCCAGTAGGGAGAGCGTTGGCTATTCCAAATTATGAAAATCTTAGACGAAATTGGCTCGAGTTATTTTAGAGGTTATACACACCTCAACCCCACCAGAAATCTGGTCGTGAATGTGACTGGTGGGGGTAAATTTGTTATAACCAGAATAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAaagcttatGCTAGCGGAGTGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGT
CTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCAC CTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTGTCGACGCGGCCGCTAATACGACTCACTAT 
AGGTTAAAACAGCCTGTGGGTTGCACCCACTCACAGGGCCTACTGGGCGCAAGCACTCTGGTACCTCGGTACCTTTGTGCGCCTGTTTTACACCCCCCCCCCAATGAAACTTAGAAGCAATAAACCACGATCAATAGCAGGCATAACGCTCCAGTTATGTCTTGATCAAGCACTTCTGTTTCCCCGGACTGAGTATCAATAGACTGCTCGCGCGGTTGAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGGAAAACCTAGTAACACCATGAAAGTTGCGGAGAGCTTCGTTCAGCACTCCCCCAGTGTAGATCAGGTCGATGAGTCACCGCGTTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCCATGGGGTAACCCATGGGGCGCTCTAATACGGACATGGTGTGAAGAGTCTACTGAGCTAGTTGGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCCAACTGCGGAGCACACGCCCACAAGCCAGCGGGTAGTGTGTCGTAACGGGTAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCCTTTTATCTTTATATTGGCTGCTTATGGTGACAATTAAAGAATTGTTACCATATAGCTATTGGATTAGCCATCCGGTGTGCAACAGAGCAATTATTTACCTATTTATTGGTTTTGTACCATTAACCTCGAATTCTGTGACCACCCTTAATTATATCTTGACCCTTAACACAGCTAAACATGGGTTCGCAAGTGTCTACACAGCGCTCCGGTTCTTACGAAAACTCAAACTCAGCCACTGAGGGTTCTACCATAAACTACACCACCATTAATTACTACAAAGACTCCTATGCTGCCACAGCAGGCAAaCAGAGTCTCAAGCAGGATCCAGACAAGTTTGCAAATCCTGTTAAAGACATATTCACcGAAATGGCAGCGCCACTGAAGTCCCCATCCGCTGAGGCATGTGGATACAGTGATCGAGTGGCGCAATTA ACTATTGGCAACTCCACCATCACGACGCAAGAAGCGGCTAACATCATAGTCGGCTATGGTGAGTGGCCTTCCTACTGCTCAGATTCTGACGCTACAGCAGTGGATAAACCAACGCGCCCGGATGTTTCAGTGAACAGGTTTTACACATTGGACACTAAATTGTGGGAGAAATCGTCCAAGGGATGGTACTGGAAGTTCCCGGATGTGTTAACTGAAACTGGGGTTTTTGGGCAAAATGCACAATTCCACTACCTCTACCGATCAGGGTTCTGCATCCACGTGCAGTGCAATGCCAGTAAATTCCACCAAGGAgCACTcCtAgTCGCTGTCCTACCAGAGTATGTCATTGGGACAGTGGCAGGCGGTACAGGGACGGAAGACACCCACCCCCCCTACAAGCAGACCCAACCCGGCGCCGATGGTTTCGAGTTGCAACACCCGTACGTGCTTGATGCTGGCATCCCAATATCACAGTTAACAGTGTGCCCACACCAGTGGATTAATTTGAGGACCAACAATTGTGCTACAATAATAGTGCCATACATTAACGCACTGCCTTTTGATTCTGCCTTGAACCATTGCAACTTTGGCCTGTTAGTTGTGCCTATTAGCCCACTAGACTACGACCAAGGAGCAACGCCAGTAATCCCTATAACTATCACATTGGCCCCAATGTGCTCTGAATTCGCAGGTCTTAGGCAGGCAGTCACGCAAGGGTTCCCCACCGAGCTAAAACCTGGCACAAATCAATTTTTAACCACCGATGATGGCGTCTCAGCACCTATTCTACCAAACTTCCACCCCACCCCGTGTATCCACATACCTGGTGAAGTTAGGAACTTGCTAGAGTTATGCCAGGTGGAGACCATTCTGGAGGTTAACAATGTGCCCACGAATGCCACTAGCTTAATGGAGAGACTGCGCTTCCCGGTCTCAGCACAAGCAGGGAAAGGTGAACTGTGTGCGGTGTTTAGAGCCGATCCTGGGCGAAATGGACCA
T GGCAATCCACCTTACTGGGCCAGTTGTGCGGGTACTACACCCAATGGTCAGGGTCATTGGAAGTCACCTTCATGTTTACTGGATCCTTCATGGCTACCGGCAAGATGCTCATAGCCTATACACCGCCAGGGGGTCCTCTGCCCAAGGACCGGGCGACCGCCATGTTGGGCACGCACGTCATCTGGGATTTTGGGCTGCAATCGTCTGTTACCCTTGTAATACCATGGATCAGTAACACTCATTATAGAGCACATGCCCGAGATGGAGTGTTTGACTATTACACTACAGGGTTAGTCAGTATATGGTACCAGACAAATTACGTGGTTCCAATCGGTGCGCCCAACACAGCCTATATAATAGCACTAGCGGCAGCCCAAAAGAACTTCACTATGAAATTGTGCAAGGATGCTAGTGATATCCTGCAGACGGGCACCATCCAGGGAGATAGGGTGGCAGATGTAATTGAAAGTTCCATAGGAGATAGCGTGAGCAGAGCCCTCACTCACGCTCTACCAGCACCCACAGGCCAAAACACACAGGTGAGCAGTCATCGACTGGATACAGGCAAGGTTCCAGCACTCCAAGCTGCTGAAATTGGGGCATCATCAAATGCTAGTGACGAGAGCATGATTGAAACACGTTGTGTTCTTAACTCGCATAGTACAGCTGAGACCACTCTTGATAGTTTCTTCAGTAGGGCAGGATTAGTTGGAGAGATAGATCTCCCTCTTGAGGGCACAACTAACCCAAATGGTTATGCCAACTGGGACATAGATATAACAGGTTACGCGCAAATGCGTAGAAAGGTAGAGCTATTCACCTACATGCGTTTTGATGCAGAGTTCACTTTTGTTGCGTGCACACCCACCGGGGAGGTTGTCCCACAATTGCTCCAATATATGTTTGTGCCACCTGGAGCCCCTAAGCCAGATTCTAGGGAATCCCTTGCATGGCAAACCGCCACCAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCC 
GGCGCAGGTTTCAGTGCCATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGAGAACACAAACAGGAGAAAGACCTTGAATACGGGGCATGTCCTAATAACATGATGGGTACATTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGTTAGGATTTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGCAACCAGAACTACCTGTTCAAAGCCAACCCAAATTATGCTGGCAACTCTATTAAGCCAACTGGTGCCAGTCGCACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGGGCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACCCATAATGATTGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATCCACCACTGCCCAAGGTTGTGACACGATTGCCCGTTGCGATTGCCAGACAGGGGTGTACTACTGTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATGTAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAGGGTCACTCGGAACCTGGTGATTGCGGTGGTATCCTTAGGTGCCAACATGGCGTCATCGGCATAGTGTCTACTGGTGGCAATGGGCTCGTTGGCTTTGCAGACGTCAGAGACCTCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATTAAGGGTCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAAGCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATTTTGAAAAATCTTATTAAACTAATCTCTGCACTGGTGATTGTGATCAGAAGTGATTACGACATGGTTACCCTCACTGCAACCTTAGCGCTGATA GGTTGTCATGGCAGTCCTTGGGCTTGGATTAAAGCCAAAACAGCCTCCATCTTAGGTATCCCTATCGCCCAAAAGCAGAGCGCTTCCTGGCTCAAGAAGTTCAATGACATGGCCAACGCCGCTAAGGGGTTAGAGTGGGTTTCCAACAAGATCAGCAAATTTATTGATTGGCTTAAGGAGAAAATAGTACCAGCAGCCAGGGAGAAGGTTGAATTCCTAAATAACTTGAAACAGCTGCCACTGCTAGAGAATCAGATCTCGAACTTGGAACAATCTGCTGCTTCACAAGAGGACCTTGAAGTCATGTTTGGGAATGTGTCGTACCTAGCTCACTTCTGTCGCAAGTTTCAACCGCTATACGCCACGGAAGCTAAAAGAGTCTATGCCCTGGAGAAGAGAATGAATAACTATATGCAGTTCAAGAGCAAACACCGAATTGAACCTGTATGTCTCATTATTAGGGGCTCACCAGGCACCGGGAAGTCTCTAGCCACTGGTATTATTGCTCGAGCAATCGCTGATAAGTACCACTCCAGCGTGTACTCGCTCCCACCAGACCCGGATCATTTTGACGGTTACAAGCAACAGGTGGTTACAGTGATGGATGATTTGTGTCAAAACCCCGATGGTAAGGATATGTCCTTATTCTGTCAAATGGTATCCACCGTAGATTTCATTCCACCAATGGCTTCTCTCGAGGAGAAGGGAGTTTCCTTCACCTCTAAGTTTGTCATCGCATCCACTAATGCCAGTAATATCATAGTACCAACAGTGTCTGATTCTGACGCTATTCGCCGCAGGTTCTACATGGACTGTGACATTGAAGTGACAGACTCGTACAAAACAGATCTAGGTAGACTGGATGCAGGGCGAGCCGCTAAACTGTGTTCTGAAAATAACACTGCAAATTTCAAACGTTGCAGCCCATTAGTGTGTGGGAAAGCCATCCAACTTAGAGATAGAAAGTCTAAAGTCAGATACAGTGTGGATACGGTGGTT
T CAGAACTTATTAGGGAATACAGCAATAGGTCCGCCATTGGTAACACAATCGAGGCTCTTTTCCAAGGTCCACCCAAGTTCAGGCCAATTAGGATTAGCCTTGAAGAAAAACCAGCCCCAGACGCTATTAGCGATCTCCTTGCTAGTGTAGATAGTGAAGAAGTGCGCCAGTACTGCAGGGATCAAGGCTGGATTATTCCTGAAGCTCCCACCAATGTGGAGCGGCACCTTAATAGAGCGGTGCTCGTCATGCAATCCATCACCACAGTAGTGGCGGTTGTTTCGTTGGTGTACGTCATCTACAAGCTCTTTGCAGGGTTTCAGGGTGCATATTCTGGTGCTCCTAAGCAAGTGCTTAAGAAACCTGCTCTTCGCACAGCAACAGTGCAGGGTCCGAGCCTTGACTTTGCTCTCTCCCTACTGAGAAGGAACATCAGGCAGGTCCAAACAGACCAAGGGCATTTCACCATGTTGGGTGTTAGGGATCGCTTAGCAGTCCTCCCACGCCACTCACAACCTGGCAAAACCATTTGGATTGAGCACAAACTCGTGAACGTCCTTGATGCAGTTGAACTGGTGGATGAGCAAGGAGTCAACCTGGAATTAACCCTCATCACTCTTGACACCAACGAGAAGTTTAGGGATATCACCAAATTCATCCCAGAAAATATCAGCACTGCTAGCGATGCCACCCTAGTGATCAACACGGAGCACATGCCGTCAATGTTTGTCCCGGTGGGTGACGTTGTGCAGTATGGCTTTTTGAATCTCAGTGGCAAGCCTACCCATCGCACCATGATGTACAATTTTCCTACTAAAGCAGGACAGTGTGGAGGAGTGGTGACATCTGTTGGGAAGGTTGTCGGTATTCACATTGGTGGCAATGGCAGACAAGGTTTTTGCGCAGGCCTCAAAAGGAGTTACTTTGCTAGTGAACAAGGAGAGATCCAGTGGGTTAAGCCCAATAAAGAAAcTggAAGACTCAACATCAATGGACCAAC 
CCGCACCAAGTTAGAACCTAGTGTATTCCATGACATCTTCGAGGGAAATAAGGAACCAGCTGTCTTGCACAGTAAAGACCCCCGACTTGAGGTAGATTTTGAACAGGCCCTGTTCTCTAAGTATGTGGGAAACACACTACATGAGCCTGACGAGTACATCAAAGAGGCAGCTCTACATTATGCAAACCAATTAAAGCAACTAGAAATCAATACCTCTCAAATGAGCATGGAGGAGGCCTGCTATGGTACTGAGAATCTTGAGGCTATTGATCTTCACACTAGTGCAGGTTACCCCTATAGTGCCCTAGGGATAAAGAAAAGAGACATCTTAGACCCTACCACCAGGGACGTGAGTAGAATGAAGTTCTACATGGACAAGTATGGTCTTGATCTTCCCTACTCCACTTATGTCAAGGACGAGCTACGCTCGATTGATAAAATCAAGAAAGGGAAGTCCCGCCTGATCGAGGCCAGTAGTCTAAATGATTCAGTGTACCTCAGAATGGCTTTCGGGCATTTGTATGAGGCTTTCCACGCAAATCCTGGGACGATAACTGGATCGGCCGTGGGGTGTAACCCTGACACATTCTGGAGCAAGCTGCCAATTTTGCTCCCTGGTTCACTCTTTGCCTTTGACTACTCAGGCTATGATGCCAGCCTTAGCCCTGTCTGGTTCAGAGCATTAGAATTGGTTCTTAGGGAGATAGGGTATAGTGAAGAGGCAATCTCACTCATTGAGGGAATCAACCACACACATCATGTGTATCGTAATAAGACCTATTGCGTGCTTGGTGGGATGCCCTCAGGCTGTTCAGGAACATCCATCTTCAACTCAATGATCAACAACATTATTATCAGAGCACTGCTCATAAAAACATTTAAGGGCATTGATTTGGATGAACTCAACATGGTCGCTTATGGAGACGATGTGCTCGCTAGCTATCCCTTCCCAATTGATTGCTTGGAACTAGCAAAGACTGGTAAGGAGTATGGTCTGACC ATGACCCCTGCTGATAAATCTCCTTGCTTTAATGAGGTCAATTGGGGTAATGCGACCTTCCTCAAAAGGGGCTTTTTGCCCGATGAACAGTTTCCATTTTTGATTCACCCTACTATGCCAATGAGGGAGATCCATGAGTCCATTCGATGGACCAAGGACGCACGGAACACTCAAGATCATGTGCGGTCCTTGTGCCTCCTAGCATGGCATAATGGTAAGCAAGAATACGAGAAGTTTGTGAGCACAATTAGGTCTGTCCCAGTAGGGAGAGCGTTGGCTATTCCAAATTATGAAAATCTTAGACGAAATTGGCTCGAGTTATTTTAGAGGTTATACACACCTCAACCCCACCAGAAATCTGGTCGTGAATGTGACTGGTGGGGGTAAATTTGTTATAACCAGAATAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAaagcttat
Nucleic acid sequence 2, SEQ ID NO 2:
TTAAAACAGCCTGTGGGTTGCACCCACTCACAGGGCCTACTGGGCGCAAGCACTCTGGTACCTCGGTACCTTTGTGCGCCTGTTTTACACCCCCCCCCCAATGAAACTTAGAAGCAATAAACCACGATCAATAGCAGGCATAACGCTCCAGTTATGTCTTGATCAAGCACTTCTGTTTCCCCGGACTGAGTATCAATAGACTGCTCGCGCGGTTGAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGGAAAACCTAGTAACACCATGAAAGTTGCGGAGAGCTTCGTTCAGCACTCCCCCAGTGTAGATCAGGTCGATGAGTCACCGCGTTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCCATGGGGTAACCCATGGGGCGCTCTAATACGGACATGGTGTGAAGAGTCTACTGAGCTAGTTGGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCCAACTGCGGAGCACACGCCCACAAGCCAGCGGGTAGTGTGTCGTAACGGGTAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCCTTTTATCTTTATATTGGCTGCTTATGGTGACAATTAAAGAATTGTTACCATATAGCTATTGGATTAGCCATCCGGTGTGCAACAGAGCAATTATTTACCTATTTATTGGTTTTGTACCATTAACCTCGAATTCTGTGACCACCCTTAATTATATCTTGACCCTTAACACAGCTAAACATGGGTTCGCAAGTGTCTACACAGCGCTCCGGTTCTTACGAAAACTCAAACTCAGCCACTGAGGGTTCTACCATAAACTACACCACCATTAATTACTACAAAGACTCCTATGCTGCCACAGCAGGCAAaCAGAGTCTCAAGCAGGATCCAGACAAGTTTGCAAATCCTGTTAAAGACATATTCACcGAAATGGCAGCGCCACTGAAGTCCCCATCCGCTGAGGCATGTGGATACAGTGATCGAGTGGCGCAATTAACTATTGGCAACTCCACCATCACGACGCAAGAAGCGGCTAACATCATAGTCGGCTATGGTGAGTGGCCTTCCTACTGCTCAGATTCTGACGCTACAGCAGTGGATAAACCAACGCGCCCGGATGTTTCAGTGAACAGGTTTTACACATTGGACACTAAATTGTGGGAGAAATCGTCCAAGGGATGGTACTGGAAGTTCCCGGATGTGTTAACTGAAACTGGGGTTTTTGGGCAAAATGCACAATTCCACTACCTCTACCGATCAGGGTTCTGCATCCACGTGCAGTGCAATGCCAGTAAATTCCACCAAGGAgCACTcCtAgTCGCTGTCCTACCAGAGTATGTCATTGGGACAGTGGCAGGCGGTACAGGGACGGAAGACACCCACCCCCCCTACAAGCAGACCCAACCCGGCGCCGATGGTTTCGAGTTGCAACACCCGTACGTGCTTGATGCTGGCATCCCAATATCACAGTTAACAGTGTGCCCACACCAGTGGATTAATTTGAGGACCAACAATTGTGCTACAATAATAGTGCCATACATTAACGCACTGCCTTTTGATTCTGCCTTGAACCATTGCAACTTTGGCCTGTTAGTTGTGCCTATTAGCCCACTAGACTACGACCAAGGAGCAACGCCAGTAATCCCTATAACTATCACATTGGCCCCAATGTGCTCTGAATTCGCAGGTCTTAGGCAGGCAGTCACGCAAGGGTTCCCCACCGAGCTAAAACCTGGCACAAATCAATTTTTAACCACCGATGATGGCGTCTCAGCACCTATTCTACCAAACTTCCACCCCACCCCGTGTATCCACATACCTGGTGAAGTTAGGAACTTGCTAGAGTTATGCCAGGTGGAGACCATTCTGGAGGTTAACAATGTGCCCACGAATGCCACTAGCTTAATGGAGAGACTGCGCTTCCCGGTCTCAGCACAAGCAGGGAAAGGTGAACTGTGTGCGGTGTTTAGAGCCGATCCTGGGCGAAATGGACCATGGC
AATCCACCTTACTGGGCCAGTTGTGCGGGTACTACACCCAATGGTCAGGGTCATTGGAAGTCACCTTCATGTTTACTGGATCCTTCATGGCTACCGGCAAGATGCTCATAGCCTATACACCGCCAGGGGGTCCTCTGCCCAAGGACCGGGCGACCGCCATGTTGGGCACGCACGTCATCTGGGATTTTGGGCTGCAATCGTCTGTTACCCTTGTAATACCATGGATCAGTAACACTCATTATAGAGCACATGCCCGAGATGGAGTGTTTGACTATTACACTACAGGGTTAGTCAGTATATGGTACCAGACAAATTACGTGGTTCCAATCGGTGCGCCCAACACAGCCTATATAATAGCACTAGCGGCAGCCCAAAAGAACTTCACTATGAAATTGTGCAAGGATGCTAGTGATATCCTGCAGACGGGCACCATCCAGGGAGATAGGGTGGCAGATGTAATTGAAAGTTCCATAGGAGATAGCGTGAGCAGAGCCCTCACTCACGCTCTACCAGCACCCACAGGCCAAAACACACAGGTGAGCAGTCATCGACTGGATACAGGCAAGGTTCCAGCACTCCAAGCTGCTGAAATTGGGGCATCATCAAATGCTAGTGACGAGAGCATGATTGAAACACGTTGTGTTCTTAACTCGCATAGTACAGCTGAGACCACTCTTGATAGTTTCTTCAGTAGGGCAGGATTAGTTGGAGAGATAGATCTCCCTCTTGAGGGCACAACTAACCCAAATGGTTATGCCAACTGGGACATAGATATAACAGGTTACGCGCAAATGCGTAGAAAGGTAGAGCTATTCACCTACATGCGTTTTGATGCAGAGTTCACTTTTGTTGCGTGCACACCCACCGGGGAGGTTGTCCCACAATTGCTCCAATATATGTTTGTGCCACCTGGAGCCCCTAAGCCAGATTCTAGGGAATCCCTTGCATGGCAAACCGCCACCAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCCGGCGCAGGTTTCAGTGCCATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGAGAACACAAACAGGAGAAAGACCTTGAATACGGGGCATGTCCTAATAACATGATGGGTACATTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGTTAGGATTTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGCAACCAGAACTACCTGTTCAAAGCCAACCCAAATTATGCTGGCAACTCTATTAAGCCAACTGGTGCCAGTCGCACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGGGCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACCCATAATGATTGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATCCACCACTGCCCAAGGTTGTGACACGATTGCCCGTTGCGATTGCCAGACAGGGGTGTACTACTGTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATGTAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAGGGTCACTCGGAACCTGGTGATTGCGGTGGTATCCTTAGGTGCCAACATGGCGTCATCGGCATAGTGTCTACTGGTGGCAATGGGCTCGTTGGCTTTGCAGACGTCAGAGACCTCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATTAAGGGTCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAAGCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATTTTGAAAAATCTTATTAAACTAATCTCTGCACTGGTGATTGTGATCAGAAGTGATTACGACATGGTTACCCTCACTGCAACCTTAGCGCTGATAGGT
TGTCATGGCAGTCCTTGGGCTTGGATTAAAGCCAAAACAGCCTCCATCTTAGGTATCCCTATCGCCCAAAAGCAGAGCGCTTCCTGGCTCAAGAAGTTCAATGACATGGCCAACGCCGCTAAGGGGTTAGAGTGGGTTTCCAACAAGATCAGCAAATTTATTGATTGGCTTAAGGAGAAAATAGTACCAGCAGCCAGGGAGAAGGTTGAATTCCTAAATAACTTGAAACAGCTGCCACTGCTAGAGAATCAGATCTCGAACTTGGAACAATCTGCTGCTTCACAAGAGGACCTTGAAGTCATGTTTGGGAATGTGTCGTACCTAGCTCACTTCTGTCGCAAGTTTCAACCGCTATACGCCACGGAAGCTAAAAGAGTCTATGCCCTGGAGAAGAGAATGAATAACTATATGCAGTTCAAGAGCAAACACCGAATTGAACCTGTATGTCTCATTATTAGGGGCTCACCAGGCACCGGGAAGTCTCTAGCCACTGGTATTATTGCTCGAGCAATCGCTGATAAGTACCACTCCAGCGTGTACTCGCTCCCACCAGACCCGGATCATTTTGACGGTTACAAGCAACAGGTGGTTACAGTGATGGATGATTTGTGTCAAAACCCCGATGGTAAGGATATGTCCTTATTCTGTCAAATGGTATCCACCGTAGATTTCATTCCACCAATGGCTTCTCTCGAGGAGAAGGGAGTTTCCTTCACCTCTAAGTTTGTCATCGCATCCACTAATGCCAGTAATATCATAGTACCAACAGTGTCTGATTCTGACGCTATTCGCCGCAGGTTCTACATGGACTGTGACATTGAAGTGACAGACTCGTACAAAACAGATCTAGGTAGACTGGATGCAGGGCGAGCCGCTAAACTGTGTTCTGAAAATAACACTGCAAATTTCAAACGTTGCAGCCCATTAGTGTGTGGGAAAGCCATCCAACTTAGAGATAGAAAGTCTAAAGTCAGATACAGTGTGGATACGGTGGTTTCAGAACTTATTAGGGAATACAGCAATAGGTCCGCCATTGGTAACACAATCGAGGCTCTTTTCCAAGGTCCACCCAAGTTCAGGCCAATTAGGATTAGCCTTGAAGAAAAACCAGCCCCAGACGCTATTAGCGATCTCCTTGCTAGTGTAGATAGTGAAGAAGTGCGCCAGTACTGCAGGGATCAAGGCTGGATTATTCCTGAAGCTCCCACCAATGTGGAGCGGCACCTTAATAGAGCGGTGCTCGTCATGCAATCCATCACCACAGTAGTGGCGGTTGTTTCGTTGGTGTACGTCATCTACAAGCTCTTTGCAGGGTTTCAGGGTGCATATTCTGGTGCTCCTAAGCAAGTGCTTAAGAAACCTGCTCTTCGCACAGCAACAGTGCAGGGTCCGAGCCTTGACTTTGCTCTCTCCCTACTGAGAAGGAACATCAGGCAGGTCCAAACAGACCAAGGGCATTTCACCATGTTGGGTGTTAGGGATCGCTTAGCAGTCCTCCCACGCCACTCACAACCTGGCAAAACCATTTGGATTGAGCACAAACTCGTGAACGTCCTTGATGCAGTTGAACTGGTGGATGAGCAAGGAGTCAACCTGGAATTAACCCTCATCACTCTTGACACCAACGAGAAGTTTAGGGATATCACCAAATTCATCCCAGAAAATATCAGCACTGCTAGCGATGCCACCCTAGTGATCAACACGGAGCACATGCCGTCAATGTTTGTCCCGGTGGGTGACGTTGTGCAGTATGGCTTTTTGAATCTCAGTGGCAAGCCTACCCATCGCACCATGATGTACAATTTTCCTACTAAAGCAGGACAGTGTGGAGGAGTGGTGACATCTGTTGGGAAGGTTGTCGGTATTCACATTGGTGGCAATGGCAGACAAGGTTTTTGCGCAGGCCTCAAAAGGAGTTACTTTGCTAGTGAACAAGGAGAGATCCAGTGGGTTAAGCCCAATAAAGAAAcTggAAGACTCAACATCAATGGACCAACCCG
CACCAAGTTAGAACCTAGTGTATTCCATGACATCTTCGAGGGAAATAAGGAACCAGCTGTCTTGCACAGTAAAGACCCCCGACTTGAGGTAGATTTTGAACAGGCCCTGTTCTCTAAGTATGTGGGAAACACACTACATGAGCCTGACGAGTACATCAAAGAGGCAGCTCTACATTATGCAAACCAATTAAAGCAACTAGAAATCAATACCTCTCAAATGAGCATGGAGGAGGCCTGCTATGGTACTGAGAATCTTGAGGCTATTGATCTTCACACTAGTGCAGGTTACCCCTATAGTGCCCTAGGGATAAAGAAAAGAGACATCTTAGACCCTACCACCAGGGACGTGAGTAGAATGAAGTTCTACATGGACAAGTATGGTCTTGATCTTCCCTACTCCACTTATGTCAAGGACGAGCTACGCTCGATTGATAAAATCAAGAAAGGGAAGTCCCGCCTGATCGAGGCCAGTAGTCTAAATGATTCAGTGTACCTCAGAATGGCTTTCGGGCATTTGTATGAGGCTTTCCACGCAAATCCTGGGACGATAACTGGATCGGCCGTGGGGTGTAACCCTGACACATTCTGGAGCAAGCTGCCAATTTTGCTCCCTGGTTCACTCTTTGCCTTTGACTACTCAGGCTATGATGCCAGCCTTAGCCCTGTCTGGTTCAGAGCATTAGAATTGGTTCTTAGGGAGATAGGGTATAGTGAAGAGGCAATCTCACTCATTGAGGGAATCAACCACACACATCATGTGTATCGTAATAAGACCTATTGCGTGCTTGGTGGGATGCCCTCAGGCTGTTCAGGAACATCCATCTTCAACTCAATGATCAACAACATTATTATCAGAGCACTGCTCATAAAAACATTTAAGGGCATTGATTTGGATGAACTCAACATGGTCGCTTATGGAGACGATGTGCTCGCTAGCTATCCCTTCCCAATTGATTGCTTGGAACTAGCAAAGACTGGTAAGGAGTATGGTCTGACCATGACCCCTGCTGATAAATCTCCTTGCTTTAATGAGGTCAATTGGGGTAATGCGACCTTCCTCAAAAGGGGCTTTTTGCCCGATGAACAGTTTCCATTTTTGATTCACCCTACTATGCCAATGAGGGAGATCCATGAGTCCATTCGATGGACCAAGGACGCACGGAACACTCAAGATCATGTGCGGTCCTTGTGCCTCCTAGCATGGCATAATGGTAAGCAAGAATACGAGAAGTTTGTGAGCACAATTAGGTCTGTCCCAGTAGGGAGAGCGTTGGCTATTCCAAATTATGAAAATCTTAGACGAAATTGGCTCGAGTTATTTTAGAGGTTATACACACCTCAACCCCACCAGAAATCTGGTCGTGAATGTGACTGGTGGGGGTAAATTTGTTATAACCAGAATAGCTTAAAACAGCCTGTGGGTTGCACCCACTCACAGGGCCTACTGGGCGCAAGCACTCTGGTACCTCGGTACCTTTGTGCGCCTGTTTTACACCCCCCCCCCAATGAAACTTAGAAGCAATAAACCACGATCAATAGCAGGCATAACGCTCCAGTTATGTCTTGATCAAGCACTTCTGTTTCCCCGGACTGAGTATCAATAGACTGCTCGCGCGGTTGAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGGAAAACCTAGTAACACCATGAAAGTTGCGGAGAGCTTCGTTCAGCACTCCCCCAGTGTAGATCAGGTCGATGAGTCACCGCGTTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCCATGGGGTAACCCATGGGGCGCTCTAATACGGACATGGTGTGAAGAGTCTACTGAGCTAGTTGGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCCAACTGCGGAGCACACGCCCACAAGCCAGCGGGTAGTGTGTCGTAACGGGTAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCCTTTTATCTTTATATTGGCTGCTTATGGTGA
CAATTAAAGAATTGTTACCATATAGCTATTGGATTAGCCATCCGGTGTGCAACAGAGCAATTATTTACCTATTTATTGGTTTTGTACCATTAACCTCGAATTCTGTGACCACCCTTAATTATATCTTGACCCTTAACACAGCTAAACATGGGTTCGCAAGTGTCTACACAGCGCTCCGGTTCTTACGAAAACTCAAACTCAGCCACTGAGGGTTCTACCATAAACTACACCACCATTAATTACTACAAAGACTCCTATGCTGCCACAGCAGGCAAaCAGAGTCTCAAGCAGGATCCAGACAAGTTTGCAAATCCTGTTAAAGACATATTCACcGAAATGGCAGCGCCACTGAAGTCCCCATCCGCTGAGGCATGTGGATACAGTGATCGAGTGGCGCAATTAACT ATTGGCAACTCCACCATCACGACGCAAGAAGCGGCTAACATCATAGTCGGCTATGGTGAGTGGCCTTCCTACTGCTCAGATTCTGACGCTACAGCAGTGGATAAACCAACGCGCCCGGATGTTTCAGTGAACAGGTTTTACACATTGGACACTAAATTGTGGGAGAAATCGTCCAAGGGATGGTACTGGAAGTTCCCGGATGTGTTAACTGAAACTGGGGTTTTTGGGCAAAATGCACAATTCCACTACCTCTACCGATCAGGGTTCTGCATCCACGTGCAGTGCAATGCCAGTAAATTCCACCAAGGAgCACTcCtAgTCGCTGTCCTACCAGAGTATGTCATTGGGACAGTGGCAGGCGGTACAGGGACGGAAGACACCCACCCCCCCTACAAGCAGACCCAACCCGGCGCCGATGGTTTCGAGTTGCAACACCCGTACGTGCTTGATGCTGGCATCCCAATATCACAGTTAACAGTGTGCCCACACCAGTGGATTAATTTGAGGACCAACAATTGTGCTACAATAATAGTGCCATACATTAACGCACTGCCTTTTGATTCTGCCTTGAACCATTGCAACTTTGGCCTGTTAGTTGTGCCTATTAGCCCACTAGACTACGACCAAGGAGCAACGCCAGTAATCCCTATAACTATCACATTGGCCCCAATGTGCTCTGAATTCGCAGGTCTTAGGCAGGCAGTCACGCAAGGGTTCCCCACCGAGCTAAAACCTGGCACAAATCAATTTTTAACCACCGATGATGGCGTCTCAGCACCTATTCTACCAAACTTCCACCCCACCCCGTGTATCCACATACCTGGTGAAGTTAGGAACTTGCTAGAGTTATGCCAGGTGGAGACCATTCTGGAGGTTAACAATGTGCCCACGAATGCCACTAGCTTAATGGAGAGACTGCGCTTCCCGGTCTCAGCACAAGCAGGGAAAGGTGAACTGTGTGCGGTGTTTAGAGCCGATCCTGGGCGAAATGGACCATGGC 
AATCCACCTTACTGGGCCAGTTGTGCGGGTACTACACCCAATGGTCAGGGTCATTGGAAGTCACCTTCATGTTTACTGGATCCTTCATGGCTACCGGCAAGATGCTCATAGCCTATACACCGCCAGGGGGTCCTCTGCCCAAGGACCGGGCGACCGCCATGTTGGGCACGCACGTCATCTGGGATTTTGGGCTGCAATCGTCTGTTACCCTTGTAATACCATGGATCAGTAACACTCATTATAGAGCACATGCCCGAGATGGAGTGTTTGACTATTACACTACAGGGTTAGTCAGTATATGGTACCAGACAAATTACGTGGTTCCAATCGGTGCGCCCAACACAGCCTATATAATAGCACTAGCGGCAGCCCAAAAGAACTTCACTATGAAATTGTGCAAGGATGCTAGTGATATCCTGCAGACGGGCACCATCCAGGGAGATAGGGTGGCAGATGTAATTGAAAGTTCCATAGGAGATAGCGTGAGCAGAGCCCTCACTCACGCTCTACCAGCACCCACAGGCCAAAACACACAGGTGAGCAGTCATCGACTGGATACAGGCAAGGTTCCAGCACTCCAAGCTGCTGAAATTGGGGCATCATCAAATGCTAGTGACGAGAGCATGATTGAAACACGTTGTGTTCTTAACTCGCATAGTACAGCTGAGACCACTCTTGATAGTTTCTTCAGTAGGGCAGGATTAGTTGGAGAGATAGATCTCCCTCTTGAGGGCACAACTAACCCAAATGGTTATGCCAACTGGGACATAGATATAACAGGTTACGCGCAAATGCGTAGAAAGGTAGAGCTATTCACCTACATGCGTTTTGATGCAGAGTTCACTTTTGTTGCGTGCACACCCACCGGGGAGGTTGTCCCACAATTGCTCCAATATATGTTTGTGCCACCTGGAGCCCCTAAGCCAGATTCTAGGGAATCCCTTGCATGGCAAACCGCCACCAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCCGGC GCAGGTTTCAGTGCCATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGAGAACACAAACAGGAGAAAGACCTTGAATACGGGGCATGTCCTAATAACATGATGGGTACATTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGTTAGGATTTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGCAACCAGAACTACCTGTTCAAAGCCAACCCAAATTATGCTGGCAACTCTATTAAGCCAACTGGTGCCAGTCGCACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGGGCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACCCATAATGATTGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATCCACCACTGCCCAAGGTTGTGACACGATTGCCCGTTGCGATTGCCAGACAGGGGTGTACTACTGTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATGTAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAGGGTCACTCGGAACCTGGTGATTGCGGTGGTATCCTTAGGTGCCAACATGGCGTCATCGGCATAGTGTCTACTGGTGGCAATGGGCTCGTTGGCTTTGCAGACGTCAGAGACCTCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATTAAGGGTCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAAGCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATTTTGAAAAATCTTATTAAACTAATCTCTGCACTGGTGATTGTGATCAGAAGTGATTACGACATGGTTACCCTCACTGCAACCTTAGCGCTGATAGG
T TGTCATGGCAGTCCTTGGGCTTGGATTAAAGCCAAAACAGCCTCCATCTTAGGTATCCCTATCGCCCAAAAGCAGAGCGCTTCCTGGCTCAAGAAGTTCAATGACATGGCCAACGCCGCTAAGGGGTTAGAGTGGGTTTCCAACAAGATCAGCAAATTTATTGATTGGCTTAAGGAGAAAATAGTACCAGCAGCCAGGGAGAAGGTTGAATTCCTAAATAACTTGAAACAGCTGCCACTGCTAGAGAATCAGATCTCGAACTTGGAACAATCTGCTGCTTCACAAGAGGACCTTGAAGTCATGTTTGGGAATGTGTCGTACCTAGCTCACTTCTGTCGCAAGTTTCAACCGCTATACGCCACGGAAGCTAAAAGAGTCTATGCCCTGGAGAAGAGAATGAATAACTATATGCAGTTCAAGAGCAAACACCGAATTGAACCTGTATGTCTCATTATTAGGGGCTCACCAGGCACCGGGAAGTCTCTAGCCACTGGTATTATTGCTCGAGCAATCGCTGATAAGTACCACTCCAGCGTGTACTCGCTCCCACCAGACCCGGATCATTTTGACGGTTACAAGCAACAGGTGGTTACAGTGATGGATGATTTGTGTCAAAACCCCGATGGTAAGGATATGTCCTTATTCTGTCAAATGGTATCCACCGTAGATTTCATTCCACCAATGGCTTCTCTCGAGGAGAAGGGAGTTTCCTTCACCTCTAAGTTTGTCATCGCATCCACTAATGCCAGTAATATCATAGTACCAACAGTGTCTGATTCTGACGCTATTCGCCGCAGGTTCTACATGGACTGTGACATTGAAGTGACAGACTCGTACAAAACAGATCTAGGTAGACTGGATGCAGGGCGAGCCGCTAAACTGTGTTCTGAAAATAACACTGCAAATTTCAAACGTTGCAGCCCATTAGTGTGTGGGAAAGCCATCCAACTTAGAGATAGAAAGTCTAAAGTCAGATACAGTGTGGATACGGTGGTTTCAG 
AACTTATTAGGGAATACAGCAATAGGTCCGCCATTGGTAACACAATCGAGGCTCTTTTCCAAGGTCCACCCAAGTTCAGGCCAATTAGGATTAGCCTTGAAGAAAAACCAGCCCCAGACGCTATTAGCGATCTCCTTGCTAGTGTAGATAGTGAAGAAGTGCGCCAGTACTGCAGGGATCAAGGCTGGATTATTCCTGAAGCTCCCACCAATGTGGAGCGGCACCTTAATAGAGCGGTGCTCGTCATGCAATCCATCACCACAGTAGTGGCGGTTGTTTCGTTGGTGTACGTCATCTACAAGCTCTTTGCAGGGTTTCAGGGTGCATATTCTGGTGCTCCTAAGCAAGTGCTTAAGAAACCTGCTCTTCGCACAGCAACAGTGCAGGGTCCGAGCCTTGACTTTGCTCTCTCCCTACTGAGAAGGAACATCAGGCAGGTCCAAACAGACCAAGGGCATTTCACCATGTTGGGTGTTAGGGATCGCTTAGCAGTCCTCCCACGCCACTCACAACCTGGCAAAACCATTTGGATTGAGCACAAACTCGTGAACGTCCTTGATGCAGTTGAACTGGTGGATGAGCAAGGAGTCAACCTGGAATTAACCCTCATCACTCTTGACACCAACGAGAAGTTTAGGGATATCACCAAATTCATCCCAGAAAATATCAGCACTGCTAGCGATGCCACCCTAGTGATCAACACGGAGCACATGCCGTCAATGTTTGTCCCGGTGGGTGACGTTGTGCAGTATGGCTTTTTGAATCTCAGTGGCAAGCCTACCCATCGCACCATGATGTACAATTTTCCTACTAAAGCAGGACAGTGTGGAGGAGTGGTGACATCTGTTGGGAAGGTTGTCGGTATTCACATTGGTGGCAATGGCAGACAAGGTTTTTGCGCAGGCCTCAAAAGGAGTTACTTTGCTAGTGAACAAGGAGAGATCCAGTGGGTTAAGCCCAATAAAGAAAcTggAAGACTCAACATCAATGGACCAACCCG CACCAAGTTAGAACCTAGTGTATTCCATGACATCTTCGAGGGAAATAAGGAACCAGCTGTCTTGCACAGTAAAGACCCCCGACTTGAGGTAGATTTTGAACAGGCCCTGTTCTCTAAGTATGTGGGAAACACACTACATGAGCCTGACGAGTACATCAAAGAGGCAGCTCTACATTATGCAAACCAATTAAAGCAACTAGAAATCAATACCTCTCAAATGAGCATGGAGGAGGCCTGCTATGGTACTGAGAATCTTGAGGCTATTGATCTTCACACTAGTGCAGGTTACCCCTATAGTGCCCTAGGGATAAAGAAAAGAGACATCTTAGACCCTACCACCAGGGACGTGAGTAGAATGAAGTTCTACATGGACAAGTATGGTCTTGATCTTCCCTACTCCACTTATGTCAAGGACGAGCTACGCTCGATTGATAAAATCAAGAAAGGGAAGTCCCGCCTGATCGAGGCCAGTAGTCTAAATGATTCAGTGTACCTCAGAATGGCTTTCGGGCATTTGTATGAGGCTTTCCACGCAAATCCTGGGACGATAACTGGATCGGCCGTGGGGTGTAACCCTGACACATTCTGGAGCAAGCTGCCAATTTTGCTCCCTGGTTCACTCTTTGCCTTTGACTACTCAGGCTATGATGCCAGCCTTAGCCCTGTCTGGTTCAGAGCATTAGAATTGGTTCTTAGGGAGATAGGGTATAGTGAAGAGGCAATCTCACTCATTGAGGGAATCAACCACACACATCATGTGTATCGTAATAAGACCTATTGCGTGCTTGGTGGGATGCCCTCAGGCTGTTCAGGAACATCCATCTTCAACTCAATGATCAACAACATTATTATCAGAGCACTGCTCATAAAAACATTTAAGGGCATTGATTTGGATGAACTCAACATGGTCGCTTATGGAGACGATGTGCTCGCTAGCTATCCCTTCCCAATTGATTGCTTGGAACTAGCAAAGACTGGTAAGGAGTATGGTCTGACCAT
G ACCCCTGCTGATAAATCTCCTTGCTTTAATGAGGTCAATTGGGGTAATGCGACCTTCCTCAAAAGGGGCTTTTTGCCCGATGAACAGTTTCCATTTTTGATTCACCCTACTATGCCAATGAGGGAGATCCATGAGTCCATTCGATGGACCAAGGACGCACGGAACACTCAAGATCATGTGCGGTCCTTGTGCCTCCTAGCATGGCATAATGGTAAGCAAGAATACGAGAAGTTTGTGAGCACAATTAGGTCTGTCCCAGTAGGGAGAGCGTTGGCTATTCCAAATTATGAAAATCTTAGACGAAATTGGCTCGAGTTATTTTAGAGGTTATACACACCTCAACCCCACCAGAAATCTGGTCGTGAATGTGACTGGTGGGGGTAAATTTGTTATAACCAGAATAGC
Nucleic acid sequence 3, SEQ ID NO 3:
AGCGCTAGCGGAGTGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTGTCGACGCGGCCGCAGCGCTAGCGGAG
TGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGG CACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTGTCGACGCGGCCGC
Protein sequence 4, SEQ ID NO 4:
MGSQVSTQRSGSYENSNSATEGSTINYTTINYYKDSYAATAGKQSLKQDPDKFANPVKDIFTEMAAPLKSPSAEACGYSDRVAQLTIGNSTITTQEAANIIVGYGEWPSYCSDSDATAVDKPTRPDVSVNRFYTLDTKLWEKSSKGWYWKFPDVLTETGVFGQNAQFHYLYRSGFCIHVQCNASKFHQGALLVAVLPEYVIGTVAGGTGTEDTHPPYKQTQPGADGFELQHPYVLDAGIPISQLTVCPHQWINLRTNNCATIIVPYINALPFDSALNHCNFGLLVVPISPLDYDQGATPVIPITITLAPMCSEFAGLRQAVTQGFPTELKPGTNQFLTTDDGVSAPILPNFHPTPCIHIPGEVRNLLELCQVETILEVNNVPTNATSLMERLRFPVSAQAGKGELCAVFRADPGRNGPWQSTLLGQLCGYYTQWSGSLEVTFMFTGSFMATGKMLIAYTPPGGPLPKDRATAMLGTHVIWDFGLQSSVTLVIPWISNTHYRAHARDGVFDYYTTGLVSIWYQTNYVVPIGAPNTAYIIALAAAQKNFTMKLCKDASDILQTGTIQGDRVADVIESSIGDSVSRALTHALPAPTGQNTQVSSHRLDTGKVPALQAAEIGASSNASDESMIETRCVLNSHSTAETTLDSFFSRAGLVGEIDLPLEGTTNPNGYANWDIDITGYAQMRRKVELFTYMRFDAEFTFVACTPTGEVVPQLLQYMFVPPGAPKPDSRESLAWQTATNPSVFVKLSDPPAQVSVPFMSPASAYQWFYDGYPTFGEHKQEKDLEYGACPNNMMGTFSVRTVGTSKSKYPLVVRIYMRMKHVRAWIPRPMRNQNYLFKANPNYAGNSIKPTGASRTAITTLGKFGQQSGAIYVGNFRVVNRHLATHNDWANLVWEDSSRDLLVSSTTAQGCDTIARCDCQTGVYYCNSMRKHYPVSFSKPSLIYVEASEYYPARYQSHLMLAQGHSEPGDCGGILRCQHGVIGIVSTGGNGLVGFADVRDLLWLDEEAMEQGVSDYIKGLGDAFGTGFTDAVSREVEALKNYLIGSEGAVEKILKNLIKLISALVIVIRSDYDMVTLTATLALIGCHGSPWAWIKAKTASILGIPIAQKQSASWLKKFNDMANAAKGLEWVSNKISKFIDWLKEKIVPAAREKVEFLNNLKQLPLLENQISNLEQSAASQEDLEVMFGNVSYLAHFCRKFQPLYATEAKRVYALEKRMNNYMQFKSKHRIEPVCLIIRGSPGTGKSLATGIIARAIADKYHSSVYSLPPDPDHFDGYKQQVVTVMDDLCQNPDGKDMSLFCQMVSTVDFIPPMASLEEKGVSFTSKFVIASTNASNIIVPTVSDSDAIRRRFYMDCDIEVTDSYKTDLGRLDAGRAAKLCSENNTANFKRCSPLVCGKAIQLRDRKSKVRYSVDTVVSELIREYSNRSAIGNTIEALFQGPPKFRPIRISLEEKPAPDAISDLLASVDSEEVRQYCRDQGWIIPEAPTNVERHLNRAVLVMQSITTVVAVVSLVYVIYKLFAGFQGAYSGAPKQVLKKPALRTATVQGPSLDFALSLLRRNIRQVQTDQGHFTMLGVRDRLAVLPRHSQPGKTIWIEHKLVNVLDAVELVDEQGVNLELTLITLDTNEKFRDITKFIPENISTASDATLVINTEHMPSMFVPVGDVVQYGFLNLSGKPTHRTMMYNFPTKAGQCGGVVTSVGKVVGIHIGGNGRQGFCAGLKRSYFASEQGEIQWVKPNKETGRLNINGPTRTKLEPSVFHDIFEGNKEPAVLHSKDPRLEVDFEQALFSKYVGNTLHEPDEYIKEAALHYANQLKQLEINTSQMSMEEACYGTENLEAIDLHTSAGYPYSALGIKKRDILDPTTRDVSRMKFYMDKYGLDLPYSTYVKDELRSIDKIKKGKSRLIEASSLNDSVYLRMAFGHLYEAFHANPGTITGSAVGCNPDTFWSKLPILLPGSLFAFDYSGYDASLSPVWFRALELVLREIGYSEEAISLIEGI
NHTHHVYRNKTYCVLGGMPSGCSGTSIFNSMINNIIIRALLIKTFKGIDLDELNMVAYGDDVLASYPFPIDCLELAKTGKEYGLTMTPADKSPCFNEVNWGNATFLKRGFLPDEQFPFLIHPTMPMREIHESIRWTKDARNTQDHVRSLCLLAWHNGKQEYEKFVSTIRSVPVGRALAIPNYENLRRNWLELFMGSQVSTQRSGSYENSNSATEGSTINYTTINYYKDSYAATAGKQSLKQDPDKFANPVKDIFTEMAAPLKSPSAEACGYSDRVAQLTIGNSTITTQEAANIIVGYGEWPSYCSDSDATAVDKPTRPDVSVNRFYTLDTKLWEKSSKGWYWKFPDVLTETGVFGQNAQFHYLYRSGFCIHVQCNASKFHQGALLVAVLPEYVIGTVAGGTGTEDTHPPYKQTQPGADGFELQHPYVLDAGIPISQLTVCPHQWINLRTNNCATIIVPYINALPFDSALNHCNFGLLVVPISPLDYDQGATPVIPITITLAPMCSEFAGLRQAVTQGFPTELKPGTNQFLTTDDGVSAPILPNFHPTPCIHIPGEVRNLLELCQVETILEVNNVPTNATSLMERLRFPVSAQAGKGELCAVFRADPGRNGPWQSTLLGQLCGYYTQWSGSLEVTFMFTGSFMATGKMLIAYTPPGGPLPKDRATAMLGTHVIWDFGLQSSVTLVIPWISNTHYRAHARDGVFDYYTTGLVSIWYQTNYVVPIGAPNTAYIIALAAAQKNFTMKLCKDASDILQTGTIQGDRVADVIESSIGDSVSRALTHALPAPTGQNTQVSSHRLDTGKVPALQAAEIGASSNASDESMIETRCVLNSHSTAETTLDSFFSRAGLVGEIDLPLEGTTNPNGYANWDIDITGYAQMRRKVELFTYMRFDAEFTFVACTPTGEVVPQLLQYMFVPPGAPKPDSRESLAWQTATNPSVFVKLSDPPAQVSVPFMSPASAYQWFYDGYPTFGEHKQEKDLEYGACPNNMMGTFSVRTVGTSKSKYPLVVRIYMRMKHVRAWIPRPMRNQNYLFKANPNYAGNSIKPTGASRTAITTLGKFGQQSGAIYVGNFRVVNRHLATHNDWANLVWEDSSRDLLVSSTTAQGCDTIARCDCQTGVYYCNSMRKHYPVSFSKPSLIYVEASEYYPARYQSHLMLAQGHSEPGDCGGILRCQHGVIGIVSTGGNGLVGFADVR 
DLLWLDEEAMEQGVSDYIKGLGDAFGTGFTDAVSREVEALKNYLIGSEGAVEKILKNLIKLISALVIVIRSDYDMVTLTATLALIGCHGSPWAWIKAKTASILGIPIAQKQSASWLKKFNDMANAAKGLEWVSNKISKFIDWLKEKIVPAAREKVEFLNNLKQLPLLENQISNLEQSAASQEDLEVMFGNVSYLAHFCRKFQPLYATEAKRVYALEKRMNNYMQFKSKHRIEPVCLIIRGSPGTGKSLATGIIARAIADKYHSSVYSLPPDPDHFDGYKQQVVTVMDDLCQNPDGKDMSLFCQMVSTVDFIPPMASLEEKGVSFTSKFVIASTNASNIIVPTVSDSDAIRRRFYMDCDIEVTDSYKTDLGRLDAGRAAKLCSENNTANFKRCSPLVCGKAIQLRDRKSKVRYSVDTVVSELIREYSNRSAIGNTIEALFQGPPKFRPIRISLEEKPAPDAISDLLASVDSEEVRQYCRDQGWIIPEAPTNVERHLNRAVLVMQSITTVVAVVSLVYVIYKLFAGFQGAYSGAPKQVLKKPALRTATVQGPSLDFALSLLRRNIRQVQTDQGHFTMLGVRDRLAVLPRHSQPGKTIWIEHKLVNVLDAVELVDEQGVNLELTLITLDTNEKFRDITKFIPENISTASDATLVINTEHMPSMFVPVGDVVQYGFLNLSGKPTHRTMMYNFPTKAGQCGGVVTSVGKVVGIHIGGNGRQGFCAGLKRSYFASEQGEIQWVKPNKETGRLNINGPTRTKLEPSVFHDIFEGNKEPAVLHSKDPRLEVDFEQALFSKYVGNTLHEPDEYIKEAALHYANQLKQLEINTSQMSMEEACYGTENLEAIDLHTSAGYPYSALGIKKRDILDPTTRDVSRMKFYMDKYGLDLPYSTYVKDELRSIDKIKKGKSRLIEASSLNDSVYLRMAFGHLYEAFHANPGTITGSAVGCNPDTFWSKLPILLPGSLFAFDYSGYDASLSPVWFRALELVLREIGYSEEAISLIEGI NHTHHVYRNKTYCVLGGMPSGCSGTSIFNSMINNIIIRALLIKTFKGIDLDELNMVAYGDDVLASYPFPIDCLELAKTGKEYGLTMTPADKSPCFNEVNWGNATFLKRGFLPDEQFPFLIHPTMPMREIHESIRWTKDARNTQDHVRSLCLLAWHNGKQEYEKFVSTIRSVPVGRALAIPNYENLRRNWLELF
Nucleic acid sequence 5, SEQ ID NO 5:
GCTAGCGGAGTGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTGTCGACGCGGCCGCTAATACGACTCACTAT
AGGTTAAAACAGCCTGTGGGTTGCACCCACTCACAGGGCCTACTGGGCGCAAGCACTCTGGTACCTCGGTACCTTTGTGCGCCTGTTTTACACCCCCCCCCCAATGAAACTTAGAAGCAATAAACCACGATCAATAGCAGGCATAACGCTCCAGTTATGTCTTGATCAAGCACTTCTGTTTCCCCGGACTGAGTATCAATAGACTGCTCGCGCGGTTGAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGGAAAACCTAGTAACACCATGAAAGTTGCGGAGAGCTTCGTTCAGCACTCCCCCAGTGTAGATCAGGTCGATGAGTCACCGCGTTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCCATGGGGTAACCCATGGGGCGCTCTAATACGGACATGGTGTGAAGAGTCTACTGAGCTAGTTGGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCCAACTGCGGAGCACACGCCCACAAGCCAGCGGGTAGTGTGTCGTAACGGGTAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCCTTTTATCTTTATATTGGCTGCTTATGGTGACAATTAAAGAATTGTTACCATATAGCTATTGGATTAGCCATCCGGTGTGCAACAGAGCAATTATTTACCTATTTATTGGTTTTGTACCATTAACCTCGAATTCTGTGACCACCCTTAATTATATCTTGACCCTTAACACAGCTAAACtctagaatggtcttcacactcgaagatttcgttggggactggcgacagacagccggctacaacctggaccaagtccttgaacagggaggtgtgtccagtttgtttcagaatctcggggtgtccgtaactccgatccaaaggattgtcctgagcggtgaaaatgggctgaagatcgacatccatgtcatcatcccgtatgaaggtctgagcggcgaccaaatgggccagatcgaaaaaatttttaaggtggtgtaccctgtggatgatcatcactttaaggtgatcctgcactatggcacactggtaatcgacggggttacgccgaacatgatcgactatttcggacggccgtatgaaggcatcgccgtgttcgacggcaaaaagatcactgtaacagggaccctgtggaacggcaacaaaattatcgacgagcgcctgatcaaccccgacggctccctgctgttccgagtaaccatcaacggagtgaccggctggcggctgtgcgaacgcattctggcgatgcatGCGATCACCACTCTTGGTTCGCAAGTGTCTACACAGCGCTCCGGTTCTTACGAAAACTCAAACTCAGCCACTGAGGGTTCTACCATAAACTACACCACCATTAATTACTACAAAGACTCCTATGCTGCCACAGCAGGCAAaCAGAGTCTCAAGCAGGATCCAGACAAGTTTGCAAATCCTGTTAAAGACATATTCACcGAAATGGCAGCGCCACTGAAGTCCCCATCCGCTGAGGCATGTGGATACAGTGATCGAGTGGCGCAATTAACTATTGGCAACTCCACCATCACGACGCAAGAAGCGGCTAACATCATAGTCGGCTATGGTGAGTGGCCTTCCTACTGCTCAGATTCTGACGCTACAGCAGTGGATAAACCAACGCGCCCGGATGTTTCAGTGAACAGGTTTTACACATTGGACACTAAATTGTGGGAGAAATCGTCCAAGGGATGGTACTGGAAGTTCCCGGATGTGTTAACTGAAACTGGGGTTTTTGGGCAAAATGCACAATTCCACTACCTCTACCGATCAGGGTTCTGCATCCACGTGCAGTGCAATGCCAGTAAATTCCACCAAGGAgCACTcCtAgTCGCTGTCCTACCAGAGTATGTCATTGGGACAGTGGCAGGCGGTACAGGGACGGAAGACACCCACCCCCCCTACAAGCAGACCCAACCCGGCGCCGATGGTTTCGAGTTGCAACACCCGTACGTGCTTGATGCTGGCATCC
CAATATCACAGTTAACAGTGTGCCCACACCAGTGGATTAATTTGAGGACCAACAATTGTGCTACAATAATAGTGCCATACATTAACGCACTGCCTTTTGATTCTGCCTTGAACCATTGCAACTTTGGCCTGTTAGTTGTGCCTATTAGCCCACTAGACTACGACCAAGGAGCAACGCCAGTAATCCCTATAACTATCACATTGGCCCCAATGTGCTCTGAATTCGCAGGTCTTAGGCAGGCAGTCACGCAAGGGTTCCCCACCGAGCTAAAACCTGGCACAAATCAATTTTTAACCACCGATGATGGCGTCTCAGCACCTATTCTACCAAACTTCCACCCCACCCCGTGTATCCACATACCTGGTGAAGTTAGGAACTTGCTAGAGTTATGCCAGGTGGAGACCATTCTGGAGGTTAACAATGTGCCCACGAATGCCACTAGCTTAATGGAGAGACTGCGCTTCCCGGTCTCAGCACAAGCAGGGAAAGGTGAACTGTGTGCGGTGTTTAGAGCCGATCCTGGGCGAAATGGACCATGGCAATCCACCTTACTGGGCCAGTTGTGCGGGTACTACACCCAATGGTCAGGGTCATTGGAAGTCACCTTCATGTTTACTGGATCCTTCATGGCTACCGGCAAGATGCTCATAGCCTATACACCGCCAGGGGGTCCTCTGCCCAAGGACCGGGCGACCGCCATGTTGGGCACGCACGTCATCTGGGATTTTGGGCTGCAATCGTCTGTTACCCTTGTAATACCATGGATCAGTAACACTCATTATAGAGCACATGCCCGAGATGGAGTGTTTGACTATTACACTACAGGGTTAGTCAGTATATGGTACCAGACAAATTACGTGGTTCCAATCGGTGCGCCCAACACAGCCTATATAATAGCACTAGCGGCAGCCCAAAAGAACTTCACTATGAAATTGTGCAAGGATGCTAGTGATATCCTGCAGACGGGCACCATCCAGGGAGATAGGGTGGCAGATGTAATTGAAAGTTCCATAGGAGATAGCGTGAGCAGAGCCCTCACTCACGCTCTACCAGCACCCACAGGCCAAAACACACAGGTGAGCAGTCATCGACTGGATACAGGCAAGGTTCCAGCACTCCAAGCTGCTGAAATTGGGGCATCATCAAATGCTAGTGACGAGAGCATGATTGAAACACGTTGTGTTCTTAACTCGCATAGTACAGCTGAGACCACTCTTGATAGTTTCTTCAGTAGGGCAGGATTAGTTGGAGAGATAGATCTCCCTCTTGAGGGCACAACTAACCCAAATGGTTATGCCAACTGGGACATAGATATAACAGGTTACGCGCAAATGCGTAGAAAGGTAGAGCTATTCACCTACATGCGTTTTGATGCAGAGTTCACTTTTGTTGCGTGCACACCCACCGGGGAGGTTGTCCCACAATTGCTCCAATATATGTTTGTGCCACCTGGAGCCCCTAAGCCAGATTCTAGGGAATCCCTTGCATGGCAAACCGCCACCAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCCGGCGCAGGTTTCAGTGCCATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGAGAACACAAACAGGAGAAAGACCTTGAATACGGGGCATGTCCTAATAACATGATGGGTACATTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGTTAGGATTTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGCAACCAGAACTACCTGTTCAAAGCCAACCCAAATTATGCTGGCAACTCTATTAAGCCAACTGGTGCCAGTCGCACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGGGCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACCCATAATGATTGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATCC
ACCACTGCCCAAGGTTGTGACACGATTGCCCGTTGCGATTGCCAGACAGGGGTGTACTACTGTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATGTAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAGGGTCACTCGGAACCTGGTGATTGCGGTGGTATCCTTAGGTGCCAACATGGCGTCATCGGCATAGTGTCTACTGGTGGCAATGGGCTCGTTGGCTTTGCAGACGTCAGAGACCTCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATTAAGGGTCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAAGCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATTTTGAAAAATCTTATTAAACTAATCTCTGCACTGGTGATTGTGATCAGAAGTGATTACGACATGGTTACCCTCACTGCAACCTTAGCGCTGATAGGTTGTCATGGCAGTCCTTGGGCTTGGATTAAAGCCAAAACAGCCTCCATCTTAGGTATCCCTATCGCCCAAAAGCAGAGCGCTTCCTGGCTCAAGAAGTTCAATGACATGGCCAACGCCGCTAAGGGGTTAGAGTGGGTTTCCAACAAGATCAGCAAATTTATTGATTGGCTTAAGGAGAAAATAGTACCAGCAGCCAGGGAGAAGGTTGAATTCCTAAATAACTTGAAACAGCTGCCACTGCTAGAGAATCAGATCTCGAACTTGGAACAATCTGCTGCTTCACAAGAGGACCTTGAAGTCATGTTTGGGAATGTGTCGTACCTAGCTCACTTCTGTCGCAAGTTTCAACCGCTATACGCCACGGAAGCTAAAAGAGTCTATGCCCTGGAGAAGAGAATGAATAACTATATGCAGTTCAAGAGCAAACACCGAATTGAACCTGTATGTCTCATTATTAGGGGCTCACCAGGCACCGGGAAGTCTCTAGCCACTGGTATTATTGCTCGAGCAATCGCTGATAAGTACCACTCCAGCGTGTACTCGCTCCCACCAGACCCGGATCATTTTGACGGTTACAAGCAACAGGTGGTTACAGTGATGGATGATTTGTGTCAAAACCCCGATGGTAAGGATATGTCCTTATTCTGTCAAATGGTATCCACCGTAGATTTCATTCCACCAATGGCTTCTCTCGAGGAGAAGGGAGTTTCCTTCACCTCTAAGTTTGTCATCGCATCCACTAATGCCAGTAATATCATAGTACCAACAGTGTCTGATTCTGACGCTATTCGCCGCAGGTTCTACATGGACTGTGACATTGAAGTGACAGACTCGTACAAAACAGATCTAGGTAGACTGGATGCAGGGCGAGCCGCTAAACTGTGTTCTGAAAATAACACTGCAAATTTCAAACGTTGCAGCCCATTAGTGTGTGGGAAAGCCATCCAACTTAGAGATAGAAAGTCTAAAGTCAGATACAGTGTGGATACGGTGGTTTCAGAACTTATTAGGGAATACAGCAATAGGTCCGCCATTGGTAACACAATCGAGGCTCTTTTCCAAGGTCCACCCAAGTTCAGGCCAATTAGGATTAGCCTTGAAGAAAAACCAGCCCCAGACGCTATTAGCGATCTCCTTGCTAGTGTAGATAGTGAAGAAGTGCGCCAGTACTGCAGGGATCAAGGCTGGATTATTCCTGAAGCTCCCACCAATGTGGAGCGGCACCTTAATAGAGCGGTGCTCGTCATGCAATCCATCACCACAGTAGTGGCGGTTGTTTCGTTGGTGTACGTCATCTACAAGCTCTTTGCAGGGTTTCAGGGTGCATATTCTGGTGCTCCTAAGCAAGTGCTTAAGAAACCTGCTCTTCGCACAGCAACAGTGCAGGGTCCGAGCCTTGACTTTGCTCTCTCCCTACTGAGAAGGAACATCAGGCAGGTCCAAACAGACCAAGGGCATTT
CACCATGTTGGGTGTTAGGGATCGCTTAGCAGTCCTCCCACGCCACTCACAACCTGGCAAAACCATTTGGATTGAGCACAAACTCGTGAACGTCCTTGATGCAGTTGAACTGGTGGATGAGCAAGGAGTCAACCTGGAATTAACCCTCATCACTCTTGACACCAACGAGAAGTTTAGGGATATCACCAAATTCATCCCAGAAAATATCAGCACTGCTAGCGATGCCACCCTAGTGATCAACACGGAGCACATGCCGTCAATGTTTGTCCCGGTGGGTGACGTTGTGCAGTATGGCTTTTTGAATCTCAGTGGCAAGCCTACCCATCGCACCATGATGTACAATTTTCCTACTAAAGCAGGACAGTGTGGAGGAGTGGTGACATCTGTTGGGAAGGTTGTCGGTATTCACATTGGTGGCAATGGCAGACAAGGTTTTTGCGCAGGCCTCAAAAGGAGTTACTTTGCTAGTGAACAAGGAGAGATCCAGTGGGTTAAGCCCAATAAAGAAAcTggAAGACTCAACATCAATGGACCAACCCGCACCAAGTTAGAACCTAGTGTATTCCATGACATCTTCGAGGGAAATAAGGAACCAGCTGTCTTGCACAGTAAAGACCCCCGACTTGAGGTAGATTTTGAACAGGCCCTGTTCTCTAAGTATGTGGGAAACACACTACATGAGCCTGACGAGTACATCAAAGAGGCAGCTCTACATTATGCAAACCAATTAAAGCAACTAGAAATCAATACCTCTCAAATGAGCATGGAGGAGGCCTGCTATGGTACTGAGAATCTTGAGGCTATTGATCTTCACACTAGTGCAGGTTACCCCTATAGTGCCCTAGGGATAAAGAAAAGAGACATCTTAGACCCTACCACCAGGGACGTGAGTAGAATGAAGTTCTACATGGACAAGTATGGTCTTGATCTTCCCTACTCCACTTATGTCAAGGACGAGCTACGCTCGATTGATAAAATCAAGAAAGGGAAGTCCCGCCTGATCGAGGCCAGTAGTCTAAATGATTCAGTGTACCTCAGAATGGCTTTCGGGCATTTGTATGAGGCTTTCCACGCAAATCCTGGGACGATAACTGGATCGGCCGTGGGGTGTAACCCTGACACATTCTGGAGCAAGCTGCCAATTTTGCTCCCTGGTTCACTCTTTGCCTTTGACTACTCAGGCTATGATGCCAGCCTTAGCCCTGTCTGGTTCAGAGCATTAGAATTGGTTCTTAGGGAGATAGGGTATAGTGAAGAGGCAATCTCACTCATTGAGGGAATCAACCACACACATCATGTGTATCGTAATAAGACCTATTGCGTGCTTGGTGGGATGCCCTCAGGCTGTTCAGGAACATCCATCTTCAACTCAATGATCAACAACATTATTATCAGAGCACTGCTCATAAAAACATTTAAGGGCATTGATTTGGATGAACTCAACATGGTCGCTTATGGAGACGATGTGCTCGCTAGCTATCCCTTCCCAATTGATTGCTTGGAACTAGCAAAGACTGGTAAGGAGTATGGTCTGACCATGACCCCTGCTGATAAATCTCCTTGCTTTAATGAGGTCAATTGGGGTAATGCGACCTTCCTCAAAAGGGGCTTTTTGCCCGATGAACAGTTTCCATTTTTGATTCACCCTACTATGCCAATGAGGGAGATCCATGAGTCCATTCGATGGACCAAGGACGCACGGAACACTCAAGATCATGTGCGGTCCTTGTGCCTCCTAGCATGGCATAATGGTAAGCAAGAATACGAGAAGTTTGTGAGCACAATTAGGTCTGTCCCAGTAGGGAGAGCGTTGGCTATTCCAAATTATGAAAATCTTAGACGAAATTGGCTCGAGTTATTTTAGAGGTTATACACACCTCAACCCCACCAGAAATCTGGTCGTGAATGTGACTGGTGGGGGTAAATTTGTTATAACCAGAATAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAaagcttatGCTAGCGGAGTGTATAC
TGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCAC CTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTGTCGACGCGGCCGCTAATACGACTCACTAT 
AGGTTAAAACAGCCTGTGGGTTGCACCCACTCACAGGGCCTACTGGGCGCAAGCACTCTGGTACCTCGGTACCTTTGTGCGCCTGTTTTACACCCCCCCCCCAATGAAACTTAGAAGCAATAAACCACGATCAATAGCAGGCATAACGCTCCAGTTATGTCTTGATCAAGCACTTCTGTTTCCCCGGACTGAGTATCAATAGACTGCTCGCGCGGTTGAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGGAAAACCTAGTAACACCATGAAAGTTGCGGAGAGCTTCGTTCAGCACTCCCCCAGTGTAGATCAGGTCGATGAGTCACCGCGTTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCCATGGGGTAACCCATGGGGCGCTCTAATACGGACATGGTGTGAAGAGTCTACTGAGCTAGTTGGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCCAACTGCGGAGCACACGCCCACAAGCCAGCGGGTAGTGTGTCGTAACGGGTAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCCTTTTATCTTTATATTGGCTGCTTATGGTGACAATTAAAGAATTGTTACCATATAGCTATTGGATTAGCCATCCGGTGTGCAACAGAGCAATTATTTACCTATTTATTGGTTTTGTACCATTAACCTCGAATTCTGTGACCACCCTTAATTATATCTTGACCCTTAACACAGCTAAACtctagaatggtcttcacactcgaagatttcgttggggactggcgacagacagccggctacaacctggaccaagtccttgaacagggaggtgtgtccagtttgtttcagaatctcggggtgtccgtaactccgatccaaaggattgtcctgagcggtgaaaatgggctgaagatcgacatccatgtcatcatcccgtatgaaggtctgagcggcgaccaaatgggccagatcgaaaaaatttttaaggtggtgtac cctgtggatgatcatcactttaaggtgatcctgcactatggcacactggtaatcgacggggttacgccgaacatgatcgactatttcggacggccgtatgaaggcatcgccgtgttcgacggcaaaaagatcactgtaacagggaccctgtggaacggcaacaaaattatcgacgagcgcctgatcaaccccgacggctccctgctgttccgagtaaccatcaacggagtgaccggctggcggctgtgcgaacgcattctggcgatgcatGCGATCACCACTCTTGGTTCGCAAGTGTCTACACAGCGCTCCGGTTCTTACGAAAACTCAAACTCAGCCACTGAGGGTTCTACCATAAACTACACCACCATTAATTACTACAAAGACTCCTATGCTGCCACAGCAGGCAAaCAGAGTCTCAAGCAGGATCCAGACAAGTTTGCAAATCCTGTTAAAGACATATTCACcGAAATGGCAGCGCCACTGAAGTCCCCATCCGCTGAGGCATGTGGATACAGTGATCGAGTGGCGCAATTAACTATTGGCAACTCCACCATCACGACGCAAGAAGCGGCTAACATCATAGTCGGCTATGGTGAGTGGCCTTCCTACTGCTCAGATTCTGACGCTACAGCAGTGGATAAACCAACGCGCCCGGATGTTTCAGTGAACAGGTTTTACACATTGGACACTAAATTGTGGGAGAAATCGTCCAAGGGATGGTACTGGAAGTTCCCGGATGTGTTAACTGAAACTGGGGTTTTTGGGCAAAATGCACAATTCCACTACCTCTACCGATCAGGGTTCTGCATCCACGTGCAGTGCAATGCCAGTAAATTCCACCAAGGAgCACTcCtAgTCGCTGTCCTACCAGAGTATGTCATTGGGACAGTGGCAGGCGGTACAGGGACGGAAGACACCCACCCCCCCTACAAGCAGACCCAACCCGGCGCCGATGGTTTCGAGTTGCAACACCCGTACGTGCTTGATGCTGGCATC
C CAATATCACAGTTAACAGTGTGCCCACACCAGTGGATTAATTTGAGGACCAACAATTGTGCTACAATAATAGTGCCATACATTAACGCACTGCCTTTTGATTCTGCCTTGAACCATTGCAACTTTGGCCTGTTAGTTGTGCCTATTAGCCCACTAGACTACGACCAAGGAGCAACGCCAGTAATCCCTATAACTATCACATTGGCCCCAATGTGCTCTGAATTCGCAGGTCTTAGGCAGGCAGTCACGCAAGGGTTCCCCACCGAGCTAAAACCTGGCACAAATCAATTTTTAACCACCGATGATGGCGTCTCAGCACCTATTCTACCAAACTTCCACCCCACCCCGTGTATCCACATACCTGGTGAAGTTAGGAACTTGCTAGAGTTATGCCAGGTGGAGACCATTCTGGAGGTTAACAATGTGCCCACGAATGCCACTAGCTTAATGGAGAGACTGCGCTTCCCGGTCTCAGCACAAGCAGGGAAAGGTGAACTGTGTGCGGTGTTTAGAGCCGATCCTGGGCGAAATGGACCATGGCAATCCACCTTACTGGGCCAGTTGTGCGGGTACTACACCCAATGGTCAGGGTCATTGGAAGTCACCTTCATGTTTACTGGATCCTTCATGGCTACCGGCAAGATGCTCATAGCCTATACACCGCCAGGGGGTCCTCTGCCCAAGGACCGGGCGACCGCCATGTTGGGCACGCACGTCATCTGGGATTTTGGGCTGCAATCGTCTGTTACCCTTGTAATACCATGGATCAGTAACACTCATTATAGAGCACATGCCCGAGATGGAGTGTTTGACTATTACACTACAGGGTTAGTCAGTATATGGTACCAGACAAATTACGTGGTTCCAATCGGTGCGCCCAACACAGCCTATATAATAGCACTAGCGGCAGCCCAAAAGAACTTCACTATGAAATTGTGCAAGGATGCTAGTGATATCCTGCAGACGGGCACCATCCAGGGAGATAGGGTGGCAGATGTAAT 
TGAAAGTTCCATAGGAGATAGCGTGAGCAGAGCCCTCACTCACGCTCTACCAGCACCCACAGGCCAAAACACACAGGTGAGCAGTCATCGACTGGATACAGGCAAGGTTCCAGCACTCCAAGCTGCTGAAATTGGGGCATCATCAAATGCTAGTGACGAGAGCATGATTGAAACACGTTGTGTTCTTAACTCGCATAGTACAGCTGAGACCACTCTTGATAGTTTCTTCAGTAGGGCAGGATTAGTTGGAGAGATAGATCTCCCTCTTGAGGGCACAACTAACCCAAATGGTTATGCCAACTGGGACATAGATATAACAGGTTACGCGCAAATGCGTAGAAAGGTAGAGCTATTCACCTACATGCGTTTTGATGCAGAGTTCACTTTTGTTGCGTGCACACCCACCGGGGAGGTTGTCCCACAATTGCTCCAATATATGTTTGTGCCACCTGGAGCCCCTAAGCCAGATTCTAGGGAATCCCTTGCATGGCAAACCGCCACCAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCCGGCGCAGGTTTCAGTGCCATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGAGAACACAAACAGGAGAAAGACCTTGAATACGGGGCATGTCCTAATAACATGATGGGTACATTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGTTAGGATTTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGCAACCAGAACTACCTGTTCAAAGCCAACCCAAATTATGCTGGCAACTCTATTAAGCCAACTGGTGCCAGTCGCACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGGGCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACCCATAATGATTGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATCC ACCACTGCCCAAGGTTGTGACACGATTGCCCGTTGCGATTGCCAGACAGGGGTGTACTACTGTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATGTAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAGGGTCACTCGGAACCTGGTGATTGCGGTGGTATCCTTAGGTGCCAACATGGCGTCATCGGCATAGTGTCTACTGGTGGCAATGGGCTCGTTGGCTTTGCAGACGTCAGAGACCTCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATTAAGGGTCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAAGCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATTTTGAAAAATCTTATTAAACTAATCTCTGCACTGGTGATTGTGATCAGAAGTGATTACGACATGGTTACCCTCACTGCAACCTTAGCGCTGATAGGTTGTCATGGCAGTCCTTGGGCTTGGATTAAAGCCAAAACAGCCTCCATCTTAGGTATCCCTATCGCCCAAAAGCAGAGCGCTTCCTGGCTCAAGAAGTTCAATGACATGGCCAACGCCGCTAAGGGGTTAGAGTGGGTTTCCAACAAGATCAGCAAATTTATTGATTGGCTTAAGGAGAAAATAGTACCAGCAGCCAGGGAGAAGGTTGAATTCCTAAATAACTTGAAACAGCTGCCACTGCTAGAGAATCAGATCTCGAACTTGGAACAATCTGCTGCTTCACAAGAGGACCTTGAAGTCATGTTTGGGAATGTGTCGTACCTAGCTCACTTCTGTCGCAAGTTTCAACCGCTATACGCCACGGAAGCTAAAAGAGTCTATGCCCTGGAGAAGAGAATGAATAACTATATGCAGTTCAAGAGCAAACACCGAATTGAACCTGTATGTCTCATTATTAGG
G GCTCACCAGGCACCGGGAAGTCTCTAGCCACTGGTATTATTGCTCGAGCAATCGCTGATAAGTACCACTCCAGCGTGTACTCGCTCCCACCAGACCCGGATCATTTTGACGGTTACAAGCAACAGGTGGTTACAGTGATGGATGATTTGTGTCAAAACCCCGATGGTAAGGATATGTCCTTATTCTGTCAAATGGTATCCACCGTAGATTTCATTCCACCAATGGCTTCTCTCGAGGAGAAGGGAGTTTCCTTCACCTCTAAGTTTGTCATCGCATCCACTAATGCCAGTAATATCATAGTACCAACAGTGTCTGATTCTGACGCTATTCGCCGCAGGTTCTACATGGACTGTGACATTGAAGTGACAGACTCGTACAAAACAGATCTAGGTAGACTGGATGCAGGGCGAGCCGCTAAACTGTGTTCTGAAAATAACACTGCAAATTTCAAACGTTGCAGCCCATTAGTGTGTGGGAAAGCCATCCAACTTAGAGATAGAAAGTCTAAAGTCAGATACAGTGTGGATACGGTGGTTTCAGAACTTATTAGGGAATACAGCAATAGGTCCGCCATTGGTAACACAATCGAGGCTCTTTTCCAAGGTCCACCCAAGTTCAGGCCAATTAGGATTAGCCTTGAAGAAAAACCAGCCCCAGACGCTATTAGCGATCTCCTTGCTAGTGTAGATAGTGAAGAAGTGCGCCAGTACTGCAGGGATCAAGGCTGGATTATTCCTGAAGCTCCCACCAATGTGGAGCGGCACCTTAATAGAGCGGTGCTCGTCATGCAATCCATCACCACAGTAGTGGCGGTTGTTTCGTTGGTGTACGTCATCTACAAGCTCTTTGCAGGGTTTCAGGGTGCATATTCTGGTGCTCCTAAGCAAGTGCTTAAGAAACCTGCTCTTCGCACAGCAACAGTGCAGGGTCCGAGCCTTGACTTTGCTCTCTCCCTACTGAGAAGGAACATCAGGCAGGTCCAAACAGACCAAGGGCATTT 
CACCATGTTGGGTGTTAGGGATCGCTTAGCAGTCCTCCCACGCCACTCACAACCTGGCAAAACCATTTGGATTGAGCACAAACTCGTGAACGTCCTTGATGCAGTTGAACTGGTGGATGAGCAAGGAGTCAACCTGGAATTAACCCTCATCACTCTTGACACCAACGAGAAGTTTAGGGATATCACCAAATTCATCCCAGAAAATATCAGCACTGCTAGCGATGCCACCCTAGTGATCAACACGGAGCACATGCCGTCAATGTTTGTCCCGGTGGGTGACGTTGTGCAGTATGGCTTTTTGAATCTCAGTGGCAAGCCTACCCATCGCACCATGATGTACAATTTTCCTACTAAAGCAGGACAGTGTGGAGGAGTGGTGACATCTGTTGGGAAGGTTGTCGGTATTCACATTGGTGGCAATGGCAGACAAGGTTTTTGCGCAGGCCTCAAAAGGAGTTACTTTGCTAGTGAACAAGGAGAGATCCAGTGGGTTAAGCCCAATAAAGAAAcTggAAGACTCAACATCAATGGACCAACCCGCACCAAGTTAGAACCTAGTGTATTCCATGACATCTTCGAGGGAAATAAGGAACCAGCTGTCTTGCACAGTAAAGACCCCCGACTTGAGGTAGATTTTGAACAGGCCCTGTTCTCTAAGTATGTGGGAAACACACTACATGAGCCTGACGAGTACATCAAAGAGGCAGCTCTACATTATGCAAACCAATTAAAGCAACTAGAAATCAATACCTCTCAAATGAGCATGGAGGAGGCCTGCTATGGTACTGAGAATCTTGAGGCTATTGATCTTCACACTAGTGCAGGTTACCCCTATAGTGCCCTAGGGATAAAGAAAAGAGACATCTTAGACCCTACCACCAGGGACGTGAGTAGAATGAAGTTCTACATGGACAAGTATGGTCTTGATCTTCCCTACTCCACTTATGTCAAGGACGAGCTACGCTCGATTGATAAAATCAAGAAAGGGAAGTCCCGCCTG ATCGAGGCCAGTAGTCTAAATGATTCAGTGTACCTCAGAATGGCTTTCGGGCATTTGTATGAGGCTTTCCACGCAAATCCTGGGACGATAACTGGATCGGCCGTGGGGTGTAACCCTGACACATTCTGGAGCAAGCTGCCAATTTTGCTCCCTGGTTCACTCTTTGCCTTTGACTACTCAGGCTATGATGCCAGCCTTAGCCCTGTCTGGTTCAGAGCATTAGAATTGGTTCTTAGGGAGATAGGGTATAGTGAAGAGGCAATCTCACTCATTGAGGGAATCAACCACACACATCATGTGTATCGTAATAAGACCTATTGCGTGCTTGGTGGGATGCCCTCAGGCTGTTCAGGAACATCCATCTTCAACTCAATGATCAACAACATTATTATCAGAGCACTGCTCATAAAAACATTTAAGGGCATTGATTTGGATGAACTCAACATGGTCGCTTATGGAGACGATGTGCTCGCTAGCTATCCCTTCCCAATTGATTGCTTGGAACTAGCAAAGACTGGTAAGGAGTATGGTCTGACCATGACCCCTGCTGATAAATCTCCTTGCTTTAATGAGGTCAATTGGGGTAATGCGACCTTCCTCAAAAGGGGCTTTTTGCCCGATGAACAGTTTCCATTTTTGATTCACCCTACTATGCCAATGAGGGAGATCCATGAGTCCATTCGATGGACCAAGGACGCACGGAACACTCAAGATCATGTGCGGTCCTTGTGCCTCCTAGCATGGCATAATGGTAAGCAAGAATACGAGAAGTTTGTGAGCACAATTAGGTCTGTCCCAGTAGGGAGAGCGTTGGCTATTCCAAATTATGAAAATCTTAGACGAAATTGGCTCGAGTTATTTTAGAGGTTATACACACCTCAACCCCACCAGAAATCTGGTCGTGAATGTGACTGGTGGGGGTAAATTTGTTATAACCAGAATAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAaagcttat
Nucleic acid sequence 6, SEQ ID NO 6:
GCTAGCGGAGTGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTGTCGACGCGGCCGCTAATACGACTCACTAT
AGGTTAAAACAGCCTGTGGGTTGCACCCACTCACAGGGCCTACTGGGCGCAAGCACTCTGGTACCTCGGTACCTTTGTGCGCCTGTTTTACACCCCCCCCCCAATGAAACTTAGAAGCAATAAACCACGATCAATAGCAGGCATAACGCTCCAGTTATGTCTTGATCAAGCACTTCTGTTTCCCCGGACTGAGTATCAATAGACTGCTCGCGCGGTTGAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGGAAAACCTAGTAACACCATGAAAGTTGCGGAGAGCTTCGTTCAGCACTCCCCCAGTGTAGATCAGGTCGATGAGTCACCGCGTTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCCATGGGGTAACCCATGGGGCGCTCTAATACGGACATGGTGTGAAGAGTCTACTGAGCTAGTTGGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCCAACTGCGGAGCACACGCCCACAAGCCAGCGGGTAGTGTGTCGTAACGGGTAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCCTTTTATCTTTATATTGGCTGCTTATGGTGACAATTAAAGAATTGTTACCATATAGCTATTGGATTAGCCATCCGGTGTGCAACAGAGCAATTATTTACCTATTTATTGGTTTTGTACCATTAACCTCGAATTCTGTGACCACCCTTAATTATATCTTGACCCTTAACACAGCTAAACcatatgATGgtgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggacggcgacgtaaacggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctacggcaagctgaccctgaagttcatctgcaccaccggcaagctgcccgtgccctggcccaccctcgtgaccaccctgacctacggcgtgcagtgcttcagccgctaccccgaccacatgaagcagcacgacttcttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttcttcaaggacgacggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcatcgagctgaagggcatcgacttcaaggaggacggcaacatcctggggcacaagctggagtacaactacaacagccacaacgtctatatcatggccgacaagcagaagaacggcatcaaggtgaacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgccgaccactaccagcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccactacctgagcacccagtccgccctgagcaaagaccccaacgagaagcgcgatcacatggtcctgctggagttcgtgaccgccgccgggatcactctcggcatggacgagctgtacaagatgcatGCGATCACCACTCTTGGTTCGCAAGTGTCTACACAGCGCTCCGGTTCTTACGAAAACTCAAACTCAGCCACTGAGGGTTCTACCATAAACTACACCACCATTAATTACTACAAAGACTCCTATGCTGCCACAGCAGGCAAaCAGAGTCTCAAGCAGGATCCAGACAAGTTTGCAAATCCTGTTAAAGACATATTCACcGAAATGGCAGCGCCACTGAAGTCCCCATCCGCTGAGGCATGTGGATACAGTGATCGAGTGGCGCAATTAACTATTGGCAACTCCACCATCACGACGCAAGAAGCGGCTAACATCATAGTCGGCTATGGTGAGTGGCCTTCCTACTGCTCAGATTCTGACGCTACAGCAGTGGATAAACCAACGCGCCCGGATGTTTCAGTGAACAGGTTTTACACATTGGACACTAAATTGTGGGAGAAATCGTCCAAGGGATGGTACTGGAAGTTCCCGGATGTGTTAACTGAAACTGGGGTTTTTGGGCAAAATGCACAATTCCACTACCTCTACC
GATCAGGGTTCTGCATCCACGTGCAGTGCAATGCCAGTAAATTCCACCAAGGAgCACTcCtAgTCGCTGTCCTACCAGAGTATGTCATTGGGACAGTGGCAGGCGGTACAGGGACGGAAGACACCCACCCCCCCTACAAGCAGACCCAACCCGGCGCCGATGGTTTCGAGTTGCAACACCCGTACGTGCTTGATGCTGGCATCCCAATATCACAGTTAACAGTGTGCCCACACCAGTGGATTAATTTGAGGACCAACAATTGTGCTACAATAATAGTGCCATACATTAACGCACTGCCTTTTGATTCTGCCTTGAACCATTGCAACTTTGGCCTGTTAGTTGTGCCTATTAGCCCACTAGACTACGACCAAGGAGCAACGCCAGTAATCCCTATAACTATCACATTGGCCCCAATGTGCTCTGAATTCGCAGGTCTTAGGCAGGCAGTCACGCAAGGGTTCCCCACCGAGCTAAAACCTGGCACAAATCAATTTTTAACCACCGATGATGGCGTCTCAGCACCTATTCTACCAAACTTCCACCCCACCCCGTGTATCCACATACCTGGTGAAGTTAGGAACTTGCTAGAGTTATGCCAGGTGGAGACCATTCTGGAGGTTAACAATGTGCCCACGAATGCCACTAGCTTAATGGAGAGACTGCGCTTCCCGGTCTCAGCACAAGCAGGGAAAGGTGAACTGTGTGCGGTGTTTAGAGCCGATCCTGGGCGAAATGGACCATGGCAATCCACCTTACTGGGCCAGTTGTGCGGGTACTACACCCAATGGTCAGGGTCATTGGAAGTCACCTTCATGTTTACTGGATCCTTCATGGCTACCGGCAAGATGCTCATAGCCTATACACCGCCAGGGGGTCCTCTGCCCAAGGACCGGGCGACCGCCATGTTGGGCACGCACGTCATCTGGGATTTTGGGCTGCAATCGTCTGTTACCCTTGTAATACCATGGATCAGTAACACTCATTATAGAGCACATGCCCGAGATGGAGTGTTTGACTATTACACTACAGGGTTAGTCAGTATATGGTACCAGACAAATTACGTGGTTCCAATCGGTGCGCCCAACACAGCCTATATAATAGCACTAGCGGCAGCCCAAAAGAACTTCACTATGAAATTGTGCAAGGATGCTAGTGATATCCTGCAGACGGGCACCATCCAGGGAGATAGGGTGGCAGATGTAATTGAAAGTTCCATAGGAGATAGCGTGAGCAGAGCCCTCACTCACGCTCTACCAGCACCCACAGGCCAAAACACACAGGTGAGCAGTCATCGACTGGATACAGGCAAGGTTCCAGCACTCCAAGCTGCTGAAATTGGGGCATCATCAAATGCTAGTGACGAGAGCATGATTGAAACACGTTGTGTTCTTAACTCGCATAGTACAGCTGAGACCACTCTTGATAGTTTCTTCAGTAGGGCAGGATTAGTTGGAGAGATAGATCTCCCTCTTGAGGGCACAACTAACCCAAATGGTTATGCCAACTGGGACATAGATATAACAGGTTACGCGCAAATGCGTAGAAAGGTAGAGCTATTCACCTACATGCGTTTTGATGCAGAGTTCACTTTTGTTGCGTGCACACCCACCGGGGAGGTTGTCCCACAATTGCTCCAATATATGTTTGTGCCACCTGGAGCCCCTAAGCCAGATTCTAGGGAATCCCTTGCATGGCAAACCGCCACCAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCCGGCGCAGGTTTCAGTGCCATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGAGAACACAAACAGGAGAAAGACCTTGAATACGGGGCATGTCCTAATAACATGATGGGTACATTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGTTAGGATTTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGCAACCAGAACTACCTGTTC
AAAGCCAACCCAAATTATGCTGGCAACTCTATTAAGCCAACTGGTGCCAGTCGCACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGGGCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACCCATAATGATTGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATCCACCACTGCCCAAGGTTGTGACACGATTGCCCGTTGCGATTGCCAGACAGGGGTGTACTACTGTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATGTAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAGGGTCACTCGGAACCTGGTGATTGCGGTGGTATCCTTAGGTGCCAACATGGCGTCATCGGCATAGTGTCTACTGGTGGCAATGGGCTCGTTGGCTTTGCAGACGTCAGAGACCTCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATTAAGGGTCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAAGCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATTTTGAAAAATCTTATTAAACTAATCTCTGCACTGGTGATTGTGATCAGAAGTGATTACGACATGGTTACCCTCACTGCAACCTTAGCGCTGATAGGTTGTCATGGCAGTCCTTGGGCTTGGATTAAAGCCAAAACAGCCTCCATCTTAGGTATCCCTATCGCCCAAAAGCAGAGCGCTTCCTGGCTCAAGAAGTTCAATGACATGGCCAACGCCGCTAAGGGGTTAGAGTGGGTTTCCAACAAGATCAGCAAATTTATTGATTGGCTTAAGGAGAAAATAGTACCAGCAGCCAGGGAGAAGGTTGAATTCCTAAATAACTTGAAACAGCTGCCACTGCTAGAGAATCAGATCTCGAACTTGGAACAATCTGCTGCTTCACAAGAGGACCTTGAAGTCATGTTTGGGAATGTGTCGTACCTAGCTCACTTCTGTCGCAAGTTTCAACCGCTATACGCCACGGAAGCTAAAAGAGTCTATGCCCTGGAGAAGAGAATGAATAACTATATGCAGTTCAAGAGCAAACACCGAATTGAACCTGTATGTCTCATTATTAGGGGCTCACCAGGCACCGGGAAGTCTCTAGCCACTGGTATTATTGCTCGAGCAATCGCTGATAAGTACCACTCCAGCGTGTACTCGCTCCCACCAGACCCGGATCATTTTGACGGTTACAAGCAACAGGTGGTTACAGTGATGGATGATTTGTGTCAAAACCCCGATGGTAAGGATATGTCCTTATTCTGTCAAATGGTATCCACCGTAGATTTCATTCCACCAATGGCTTCTCTCGAGGAGAAGGGAGTTTCCTTCACCTCTAAGTTTGTCATCGCATCCACTAATGCCAGTAATATCATAGTACCAACAGTGTCTGATTCTGACGCTATTCGCCGCAGGTTCTACATGGACTGTGACATTGAAGTGACAGACTCGTACAAAACAGATCTAGGTAGACTGGATGCAGGGCGAGCCGCTAAACTGTGTTCTGAAAATAACACTGCAAATTTCAAACGTTGCAGCCCATTAGTGTGTGGGAAAGCCATCCAACTTAGAGATAGAAAGTCTAAAGTCAGATACAGTGTGGATACGGTGGTTTCAGAACTTATTAGGGAATACAGCAATAGGTCCGCCATTGGTAACACAATCGAGGCTCTTTTCCAAGGTCCACCCAAGTTCAGGCCAATTAGGATTAGCCTTGAAGAAAAACCAGCCCCAGACGCTATTAGCGATCTCCTTGCTAGTGTAGATAGTGAAGAAGTGCGCCAGTACTGCAGGGATCAAGGCTGGATTATTCCTGAAGCTCCCACCAATGTGGAGCGGCACCTTAATAGAGCGGTGCTCGTCATGCAATCCAT
CACCACAGTAGTGGCGGTTGTTTCGTTGGTGTACGTCATCTACAAGCTCTTTGCAGGGTTTCAGGGTGCATATTCTGGTGCTCCTAAGCAAGTGCTTAAGAAACCTGCTCTTCGCACAGCAACAGTGCAGGGTCCGAGCCTTGACTTTGCTCTCTCCCTACTGAGAAGGAACATCAGGCAGGTCCAAACAGACCAAGGGCATTTCACCATGTTGGGTGTTAGGGATCGCTTAGCAGTCCTCCCACGCCACTCACAACCTGGCAAAACCATTTGGATTGAGCACAAACTCGTGAACGTCCTTGATGCAGTTGAACTGGTGGATGAGCAAGGAGTCAACCTGGAATTAACCCTCATCACTCTTGACACCAACGAGAAGTTTAGGGATATCACCAAATTCATCCCAGAAAATATCAGCACTGCTAGCGATGCCACCCTAGTGATCAACACGGAGCACATGCCGTCAATGTTTGTCCCGGTGGGTGACGTTGTGCAGTATGGCTTTTTGAATCTCAGTGGCAAGCCTACCCATCGCACCATGATGTACAATTTTCCTACTAAAGCAGGACAGTGTGGAGGAGTGGTGACATCTGTTGGGAAGGTTGTCGGTATTCACATTGGTGGCAATGGCAGACAAGGTTTTTGCGCAGGCCTCAAAAGGAGTTACTTTGCTAGTGAACAAGGAGAGATCCAGTGGGTTAAGCCCAATAAAGAAAcTggAAGACTCAACATCAATGGACCAACCCGCACCAAGTTAGAACCTAGTGTATTCCATGACATCTTCGAGGGAAATAAGGAACCAGCTGTCTTGCACAGTAAAGACCCCCGACTTGAGGTAGATTTTGAACAGGCCCTGTTCTCTAAGTATGTGGGAAACACACTACATGAGCCTGACGAGTACATCAAAGAGGCAGCTCTACATTATGCAAACCAATTAAAGCAACTAGAAATCAATACCTCTCAAATGAGCATGGAGGAGGCCTGCTATGGTACTGAGAATCTTGAGGCTATTGATCTTCACACTAGTGCAGGTTACCCCTATAGTGCCCTAGGGATAAAGAAAAGAGACATCTTAGACCCTACCACCAGGGACGTGAGTAGAATGAAGTTCTACATGGACAAGTATGGTCTTGATCTTCCCTACTCCACTTATGTCAAGGACGAGCTACGCTCGATTGATAAAATCAAGAAAGGGAAGTCCCGCCTGATCGAGGCCAGTAGTCTAAATGATTCAGTGTACCTCAGAATGGCTTTCGGGCATTTGTATGAGGCTTTCCACGCAAATCCTGGGACGATAACTGGATCGGCCGTGGGGTGTAACCCTGACACATTCTGGAGCAAGCTGCCAATTTTGCTCCCTGGTTCACTCTTTGCCTTTGACTACTCAGGCTATGATGCCAGCCTTAGCCCTGTCTGGTTCAGAGCATTAGAATTGGTTCTTAGGGAGATAGGGTATAGTGAAGAGGCAATCTCACTCATTGAGGGAATCAACCACACACATCATGTGTATCGTAATAAGACCTATTGCGTGCTTGGTGGGATGCCCTCAGGCTGTTCAGGAACATCCATCTTCAACTCAATGATCAACAACATTATTATCAGAGCACTGCTCATAAAAACATTTAAGGGCATTGATTTGGATGAACTCAACATGGTCGCTTATGGAGACGATGTGCTCGCTAGCTATCCCTTCCCAATTGATTGCTTGGAACTAGCAAAGACTGGTAAGGAGTATGGTCTGACCATGACCCCTGCTGATAAATCTCCTTGCTTTAATGAGGTCAATTGGGGTAATGCGACCTTCCTCAAAAGGGGCTTTTTGCCCGATGAACAGTTTCCATTTTTGATTCACCCTACTATGCCAATGAGGGAGATCCATGAGTCCATTCGATGGACCAAGGACGCACGGAACACTCAAGATCATGTGCGGTCCTTGTGCCTCCTAGCATGGCATAATGGTAAGCAAGAATACGAGAAGTTTGTGAGCACAATTAGGTCTGTCC
CAGTAGGGAGAGCGTTGGCTATTCCAAATTATGAAAATCTTAGACGAAATTGGCTCGAGTTATTTTAGAGGTTATACACACCTCAACCCCACCAGAAATCTGGTCGTGAATGTGACTGGTGGGGGTAAATTTGTTATAACCAGAATAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAaagcttat。GCTAGCGGAGTGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCAC 
CTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTGTCGACGCGGCCGCTAATACGACTCACTAT AGGTTAAAACAGCCTGTGGGTTGCACCCACTCACAGGGCCTACTGGGCGCAAGCACTCTGGTACCTCGGTACCTTTGTGCGCCTGTTTTACACCCCCCCCCCAATGAAACTTAGAAGCAATAAACCACGATCAATAGCAGGCATAACGCTCCAGTTATGTCTTGATCAAGCACTTCTGTTTCCCCGGACTGAGTATCAATAGACTGCTCGCGCGGTTGAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGGAAAACCTAGTAACACCATGAAAGTTGCGGAGAGCTTCGTTCAGCACTCCCCCAGTGTAGATCAGGTCGATGAGTCACCGCGTTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCCATGGGGTAACCCATGGGGCGCTCTAATACGGACATGGTGTGAAGAGTCTACTGAGCTAGTTGGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCCAACTGCGGAGCACACGCCCACAAGCCAGCGGGTAGTGTGTCGTAACGGGTAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCCTTTTATCTTTATATTGGCTGCTTATGGTGACAATTAAAGAATTGTTACCATATAGCTATTGGATTAGCCATCCGGTGTGCAACAGAGCAATTATTTACCTATTTATTGGTTTTGTACCATTAACCTCGAATTCTGTGACCACCCTTAATTATATCTTGACCCTTAACACAGCTAAACcatatgATGgtgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggacggcgacgtaaacggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctacggcaagctgaccctgaagttcatctgcaccaccggcaagctgcccgtgccctggcccaccctcgtgaccaccctgacctacggcgtgcagtgcttcagccgctaccccgaccacatgaagcagcacga
c ttcttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttcttcaaggacgacggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcatcgagctgaagggcatcgacttcaaggaggacggcaacatcctggggcacaagctggagtacaactacaacagccacaacgtctatatcatggccgacaagcagaagaacggcatcaaggtgaacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgccgaccactaccagcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccactacctgagcacccagtccgccctgagcaaagaccccaacgagaagcgcgatcacatggtcctgctggagttcgtgaccgccgccgggatcactctcggcatggacgagctgtacaagatgcatGCGATCACCACTCTTGGTTCGCAAGTGTCTACACAGCGCTCCGGTTCTTACGAAAACTCAAACTCAGCCACTGAGGGTTCTACCATAAACTACACCACCATTAATTACTACAAAGACTCCTATGCTGCCACAGCAGGCAAaCAGAGTCTCAAGCAGGATCCAGACAAGTTTGCAAATCCTGTTAAAGACATATTCACcGAAATGGCAGCGCCACTGAAGTCCCCATCCGCTGAGGCATGTGGATACAGTGATCGAGTGGCGCAATTAACTATTGGCAACTCCACCATCACGACGCAAGAAGCGGCTAACATCATAGTCGGCTATGGTGAGTGGCCTTCCTACTGCTCAGATTCTGACGCTACAGCAGTGGATAAACCAACGCGCCCGGATGTTTCAGTGAACAGGTTTTACACATTGGACACTAAATTGTGGGAGAAATCGTCCAAGGGATGGTACTGGAAGTTCCCGGATGTGTTAACTGAAACTGGGGTTTTTGGGCAAAATGCACAATTCCACTACCTCTACC 
GATCAGGGTTCTGCATCCACGTGCAGTGCAATGCCAGTAAATTCCACCAAGGAgCACTcCtAgTCGCTGTCCTACCAGAGTATGTCATTGGGACAGTGGCAGGCGGTACAGGGACGGAAGACACCCACCCCCCCTACAAGCAGACCCAACCCGGCGCCGATGGTTTCGAGTTGCAACACCCGTACGTGCTTGATGCTGGCATCCCAATATCACAGTTAACAGTGTGCCCACACCAGTGGATTAATTTGAGGACCAACAATTGTGCTACAATAATAGTGCCATACATTAACGCACTGCCTTTTGATTCTGCCTTGAACCATTGCAACTTTGGCCTGTTAGTTGTGCCTATTAGCCCACTAGACTACGACCAAGGAGCAACGCCAGTAATCCCTATAACTATCACATTGGCCCCAATGTGCTCTGAATTCGCAGGTCTTAGGCAGGCAGTCACGCAAGGGTTCCCCACCGAGCTAAAACCTGGCACAAATCAATTTTTAACCACCGATGATGGCGTCTCAGCACCTATTCTACCAAACTTCCACCCCACCCCGTGTATCCACATACCTGGTGAAGTTAGGAACTTGCTAGAGTTATGCCAGGTGGAGACCATTCTGGAGGTTAACAATGTGCCCACGAATGCCACTAGCTTAATGGAGAGACTGCGCTTCCCGGTCTCAGCACAAGCAGGGAAAGGTGAACTGTGTGCGGTGTTTAGAGCCGATCCTGGGCGAAATGGACCATGGCAATCCACCTTACTGGGCCAGTTGTGCGGGTACTACACCCAATGGTCAGGGTCATTGGAAGTCACCTTCATGTTTACTGGATCCTTCATGGCTACCGGCAAGATGCTCATAGCCTATACACCGCCAGGGGGTCCTCTGCCCAAGGACCGGGCGACCGCCATGTTGGGCACGCACGTCATCTGGGATTTTGGGCTGCAATCGTCTGTTACCCTTGTAATACCATGGATCAGTAACACTCATTATAGAGCACATGCCCG AGATGGAGTGTTTGACTATTACACTACAGGGTTAGTCAGTATATGGTACCAGACAAATTACGTGGTTCCAATCGGTGCGCCCAACACAGCCTATATAATAGCACTAGCGGCAGCCCAAAAGAACTTCACTATGAAATTGTGCAAGGATGCTAGTGATATCCTGCAGACGGGCACCATCCAGGGAGATAGGGTGGCAGATGTAATTGAAAGTTCCATAGGAGATAGCGTGAGCAGAGCCCTCACTCACGCTCTACCAGCACCCACAGGCCAAAACACACAGGTGAGCAGTCATCGACTGGATACAGGCAAGGTTCCAGCACTCCAAGCTGCTGAAATTGGGGCATCATCAAATGCTAGTGACGAGAGCATGATTGAAACACGTTGTGTTCTTAACTCGCATAGTACAGCTGAGACCACTCTTGATAGTTTCTTCAGTAGGGCAGGATTAGTTGGAGAGATAGATCTCCCTCTTGAGGGCACAACTAACCCAAATGGTTATGCCAACTGGGACATAGATATAACAGGTTACGCGCAAATGCGTAGAAAGGTAGAGCTATTCACCTACATGCGTTTTGATGCAGAGTTCACTTTTGTTGCGTGCACACCCACCGGGGAGGTTGTCCCACAATTGCTCCAATATATGTTTGTGCCACCTGGAGCCCCTAAGCCAGATTCTAGGGAATCCCTTGCATGGCAAACCGCCACCAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCCGGCGCAGGTTTCAGTGCCATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGAGAACACAAACAGGAGAAAGACCTTGAATACGGGGCATGTCCTAATAACATGATGGGTACATTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGTTAGGATTTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGCAACCAGAACTACCTGTT
C AAAGCCAACCCAAATTATGCTGGCAACTCTATTAAGCCAACTGGTGCCAGTCGCACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGGGCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACCCATAATGATTGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATCCACCACTGCCCAAGGTTGTGACACGATTGCCCGTTGCGATTGCCAGACAGGGGTGTACTACTGTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATGTAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAGGGTCACTCGGAACCTGGTGATTGCGGTGGTATCCTTAGGTGCCAACATGGCGTCATCGGCATAGTGTCTACTGGTGGCAATGGGCTCGTTGGCTTTGCAGACGTCAGAGACCTCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATTAAGGGTCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAAGCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATTTTGAAAAATCTTATTAAACTAATCTCTGCACTGGTGATTGTGATCAGAAGTGATTACGACATGGTTACCCTCACTGCAACCTTAGCGCTGATAGGTTGTCATGGCAGTCCTTGGGCTTGGATTAAAGCCAAAACAGCCTCCATCTTAGGTATCCCTATCGCCCAAAAGCAGAGCGCTTCCTGGCTCAAGAAGTTCAATGACATGGCCAACGCCGCTAAGGGGTTAGAGTGGGTTTCCAACAAGATCAGCAAATTTATTGATTGGCTTAAGGAGAAAATAGTACCAGCAGCCAGGGAGAAGGTTGAATTCCTAAATAACTTGAAACAGCTGCCACTGCTAGAGAATCAGATCT 
CGAACTTGGAACAATCTGCTGCTTCACAAGAGGACCTTGAAGTCATGTTTGGGAATGTGTCGTACCTAGCTCACTTCTGTCGCAAGTTTCAACCGCTATACGCCACGGAAGCTAAAAGAGTCTATGCCCTGGAGAAGAGAATGAATAACTATATGCAGTTCAAGAGCAAACACCGAATTGAACCTGTATGTCTCATTATTAGGGGCTCACCAGGCACCGGGAAGTCTCTAGCCACTGGTATTATTGCTCGAGCAATCGCTGATAAGTACCACTCCAGCGTGTACTCGCTCCCACCAGACCCGGATCATTTTGACGGTTACAAGCAACAGGTGGTTACAGTGATGGATGATTTGTGTCAAAACCCCGATGGTAAGGATATGTCCTTATTCTGTCAAATGGTATCCACCGTAGATTTCATTCCACCAATGGCTTCTCTCGAGGAGAAGGGAGTTTCCTTCACCTCTAAGTTTGTCATCGCATCCACTAATGCCAGTAATATCATAGTACCAACAGTGTCTGATTCTGACGCTATTCGCCGCAGGTTCTACATGGACTGTGACATTGAAGTGACAGACTCGTACAAAACAGATCTAGGTAGACTGGATGCAGGGCGAGCCGCTAAACTGTGTTCTGAAAATAACACTGCAAATTTCAAACGTTGCAGCCCATTAGTGTGTGGGAAAGCCATCCAACTTAGAGATAGAAAGTCTAAAGTCAGATACAGTGTGGATACGGTGGTTTCAGAACTTATTAGGGAATACAGCAATAGGTCCGCCATTGGTAACACAATCGAGGCTCTTTTCCAAGGTCCACCCAAGTTCAGGCCAATTAGGATTAGCCTTGAAGAAAAACCAGCCCCAGACGCTATTAGCGATCTCCTTGCTAGTGTAGATAGTGAAGAAGTGCGCCAGTACTGCAGGGATCAAGGCTGGATTATTCCTGAAGCTCCCACCAATGTGGAGCGGCACCTTAATAGAGCGGTGCTCGTCATGCAATCCAT CACCACAGTAGTGGCGGTTGTTTCGTTGGTGTACGTCATCTACAAGCTCTTTGCAGGGTTTCAGGGTGCATATTCTGGTGCTCCTAAGCAAGTGCTTAAGAAACCTGCTCTTCGCACAGCAACAGTGCAGGGTCCGAGCCTTGACTTTGCTCTCTCCCTACTGAGAAGGAACATCAGGCAGGTCCAAACAGACCAAGGGCATTTCACCATGTTGGGTGTTAGGGATCGCTTAGCAGTCCTCCCACGCCACTCACAACCTGGCAAAACCATTTGGATTGAGCACAAACTCGTGAACGTCCTTGATGCAGTTGAACTGGTGGATGAGCAAGGAGTCAACCTGGAATTAACCCTCATCACTCTTGACACCAACGAGAAGTTTAGGGATATCACCAAATTCATCCCAGAAAATATCAGCACTGCTAGCGATGCCACCCTAGTGATCAACACGGAGCACATGCCGTCAATGTTTGTCCCGGTGGGTGACGTTGTGCAGTATGGCTTTTTGAATCTCAGTGGCAAGCCTACCCATCGCACCATGATGTACAATTTTCCTACTAAAGCAGGACAGTGTGGAGGAGTGGTGACATCTGTTGGGAAGGTTGTCGGTATTCACATTGGTGGCAATGGCAGACAAGGTTTTTGCGCAGGCCTCAAAAGGAGTTACTTTGCTAGTGAACAAGGAGAGATCCAGTGGGTTAAGCCCAATAAAGAAAcTggAAGACTCAACATCAATGGACCAACCCGCACCAAGTTAGAACCTAGTGTATTCCATGACATCTTCGAGGGAAATAAGGAACCAGCTGTCTTGCACAGTAAAGACCCCCGACTTGAGGTAGATTTTGAACAGGCCCTGTTCTCTAAGTATGTGGGAAACACACTACATGAGCCTGACGAGTACATCAAAGAGGCAGCTCTACATTATGCAAACCAATTAAAGCAACTAGAAATCAATACCTCTCAAATGAGCATGGAGGAGGCCTGCTATGGTACTGAGAATCT
T GAGGCTATTGATCTTCACACTAGTGCAGGTTACCCCTATAGTGCCCTAGGGATAAAGAAAAGAGACATCTTAGACCCTACCACCAGGGACGTGAGTAGAATGAAGTTCTACATGGACAAGTATGGTCTTGATCTTCCCTACTCCACTTATGTCAAGGACGAGCTACGCTCGATTGATAAAATCAAGAAAGGGAAGTCCCGCCTGATCGAGGCCAGTAGTCTAAATGATTCAGTGTACCTCAGAATGGCTTTCGGGCATTTGTATGAGGCTTTCCACGCAAATCCTGGGACGATAACTGGATCGGCCGTGGGGTGTAACCCTGACACATTCTGGAGCAAGCTGCCAATTTTGCTCCCTGGTTCACTCTTTGCCTTTGACTACTCAGGCTATGATGCCAGCCTTAGCCCTGTCTGGTTCAGAGCATTAGAATTGGTTCTTAGGGAGATAGGGTATAGTGAAGAGGCAATCTCACTCATTGAGGGAATCAACCACACACATCATGTGTATCGTAATAAGACCTATTGCGTGCTTGGTGGGATGCCCTCAGGCTGTTCAGGAACATCCATCTTCAACTCAATGATCAACAACATTATTATCAGAGCACTGCTCATAAAAACATTTAAGGGCATTGATTTGGATGAACTCAACATGGTCGCTTATGGAGACGATGTGCTCGCTAGCTATCCCTTCCCAATTGATTGCTTGGAACTAGCAAAGACTGGTAAGGAGTATGGTCTGACCATGACCCCTGCTGATAAATCTCCTTGCTTTAATGAGGTCAATTGGGGTAATGCGACCTTCCTCAAAAGGGGCTTTTTGCCCGATGAACAGTTTCCATTTTTGATTCACCCTACTATGCCAATGAGGGAGATCCATGAGTCCATTCGATGGACCAAGGACGCACGGAACACTCAAGATCATGTGCGGTCCTTGTGCCTCCTAGCATGGCATAATGGTAAGCAAGAATACGAGAAGTTTGTGAGCACAATTAGGTCTGTCC CAGTAGGGAGAGCGTTGGCTATTCCAAATTATGAAAATCTTAGACGAAATTGGCTCGAGTTATTTTAGAGGTTATACACACCTCAACCCCACCAGAAATCTGGTCGTGAATGTGACTGGTGGGGGTAAATTTGTTATAACCAGAATAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAaagcttat.
The present invention also includes the expression products of these cDNA clones.
The present invention also includes double-stranded DNA containing the above cDNA, double-stranded DNA capable of producing the full-length infectious clone sequence, and positive-sense or negative-sense cDNA.
The present invention also includes plasmids containing the above cDNA or double-stranded DNA.
Preferably, the plasmid can be transcribed to produce the full-length infectious RNA of the EV71 strain or a mutant thereof. This includes plasmids, and plasmids derived from them, that yield the full-length infectious RNA of EV71 strain js1 by in vitro transcription.
The derivative plasmids include:
A. recombinant virus clones obtained by replacing part of the sequence of the full-length infectious clone of EV71 strain js1 of claim 1 with partial sequences of other isolates;
B. mutant virus clones obtained by mutating the sequence of the full-length infectious clone of EV71 strain js1 of claim 1 or 2;
C. derivative clones such as live-attenuated viruses, replication-competent non-infectious viruses, and non-replicating viruses (defective variants) that arise by adaptive mutation of the virus produced from the full-length infectious clone of EV71 strain js1.
The present invention provides a plasmid containing the above double-stranded DNA or a derivative thereof.
Preferably, it can be transcribed to produce the full-length infectious RNA of the EV71 strain or a mutant thereof.
The present invention also provides a vaccine or viral vector prepared from the above plasmid.
The present invention provides a virus particle prepared from the above cDNA clone or plasmid;
for example, live-attenuated virus particles, replication-competent non-infectious virus particles, and non-replicating virus particles (defective variants).
The virus can be used to immunize animals so that anti-EV71 antibodies can be isolated and purified, and it can also be used to screen human antibody libraries; alternatively, it can be used to prepare kits for detecting EV71 and to establish cell line, tissue, and animal infection models.
These cell line, tissue, and animal infection models can be used to screen anti-EV71 drugs.
The present invention also includes methods for detecting and preparing the above viral vectors and virus particles;
for example, immunizing animals with the virus particles and isolating antibodies, or screening human antibody libraries.
In another aspect, the present invention provides a kit for detecting EV71, which contains the above cDNA or virus particles.
The present invention also provides a method for preparing an anti-EV71 drug, in which the above cDNA or virus particles are used to construct a cell or animal model for screening anti-EV71 drugs.
The infectious clone of the present invention (nucleic acid sequence 1) is a complete plasmid composed of DNA. It contains the full-length nucleic acid sequence of EV71 strain js1 (nucleic acid sequence 2) and a low-copy plasmid backbone sequence (nucleic acid sequence 3). A plasmid is a covalently closed double-stranded DNA, consisting of a sense (positive-sense) strand whose sequence matches the mRNA and a complementary antisense (negative-sense) strand.
The full-length nucleic acid sequence of EV71 strain js1 (nucleic acid sequence 2) contained in the infectious clone (nucleic acid sequence 1) comprises the 5' non-translated region (5'-NTR) of the viral positive-sense sequence, one open reading frame (ORF), and the 3' non-translated region (3'-NTR). In this infectious clone, a T7 promoter (TAA TAC GAC TCA CTA TAGG, SEQ ID NO 7) is located at the 5' end of the full-length viral sequence (Fig. 1A), so that full-length viral RNA can be transcribed in vitro with a commercial T7 transcription kit; a 30-nucleotide poly(A) tail (AAAAA AAAAA AAAAA AAAAA AAAAAAAAAA, SEQ ID NO 8) is located at the 3' end of the full-length viral sequence (Fig. 1A).
RD cells were infected with the clinically isolated strain. When cytopathic effect appeared, total cellular RNA was extracted and reverse transcribed with SuperScript II reverse transcriptase using the EV71-specific primer (GCTAG CGCTtt tttttttt tttttttt ttttttt ttttt, SEQ ID NO 9), and the resulting cDNA was used as the template for PCR amplification. The full-length EV71 genome was amplified in four segments (Fig. 1A) with the following primers: F1 (S: GACGC GGCCG CTAA TAC GACTC ACTATAG GTTAAAACAGC CTGT GGGT TGCAC CC, SEQ ID NO 10; As: GCACTG CACGT GGATGC AGAAC, SEQ ID NO 11), F2 (S: GACGCG GCCGCG TTCT GCAT CCAC GTGCA GTGC, SEQ ID NO 12; As: AAGTC GCGAGAGCT GTCTTC CC, SEQ ID NO 13), F3 (S: GACGCG GCCGCG GGAA GACAG CTCTCG CGACTT, SEQ ID NO 14; As: AATTG TACAT CATG GTGC GATGG GTAGG, SEQ ID NO 15), and F4 (S: GACGC GGCCGCCCTAC CCATCG CACCATG ATGTAC AATT, SEQ ID NO 16; As: GCTAGC GCTtttttttttttttttt tttttttt ttttttGCT ATTCT GGTTAT AACAA ATTTA CCCCCA CCAG, SEQ ID NO 17). The amplified fragments were cloned step by step into the pANCR vector to give the final full-length cDNA clone, named pEV71-js1 (Fig. 1A).
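The four-segment design can be checked directly from the printed primers. The following is a minimal Python sketch, not part of the original disclosure: it tests whether each sense primer equals a shared 5' tail followed by the reverse complement of the preceding fragment's antisense primer, which is what lets adjacent fragments be joined at a common site. The helper names (`clean`, `revcomp`, `junction_ok`) and the interpretation of `GACGCGGCCGC` as a shared tail are assumptions made for illustration.

```python
def clean(seq: str) -> str:
    """Remove the cosmetic spacing used in the text and upper-case the sequence."""
    return "".join(seq.split()).upper()


def revcomp(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


# Primers as printed in the text (SEQ ID NO 11 to 16).
F1_AS = clean("GCACTG CACGT GGATGC AGAAC")
F2_S = clean("GACGCG GCCGCG TTCT GCAT CCAC GTGCA GTGC")
F2_AS = clean("AAGTC GCGAGAGCT GTCTTC CC")
F3_S = clean("GACGCG GCCGCG GGAA GACAG CTCTCG CGACTT")
F3_AS = clean("AATTG TACAT CATG GTGC GATGG GTAGG")
F4_S = clean("GACGC GGCCGCCCTAC CCATCG CACCATG ATGTAC AATT")

# Assumption: the first 11 nt of each sense primer form a shared 5' tail.
SHARED_TAIL = "GACGCGGCCGC"


def junction_ok(antisense_prev: str, sense_next: str) -> bool:
    """True if the next sense primer is the tail plus revcomp of the previous antisense primer."""
    return sense_next == SHARED_TAIL + revcomp(antisense_prev)


junctions = {
    "F1/F2": junction_ok(F1_AS, F2_S),
    "F2/F3": junction_ok(F2_AS, F3_S),
    "F3/F4": junction_ok(F3_AS, F4_S),
}

for name, ok in junctions.items():
    print(name, "overlap matches" if ok else "overlap differs")
```

A matching junction means the two neighbouring amplicons share an identical stretch of viral sequence at their boundary.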
After this infectious clone is linearized with HindIII in vitro, a T7 transcription kit is used to transcribe RNA containing the full-length viral genome and its 3' poly(A) tail. Once this in vitro transcribed viral RNA is introduced into host cells such as Vero cells by electroporation or transfection, it serves as the translation template: its ORF is translated to produce the viral polyprotein (protein sequence 4), which is processed into the viral structural and non-structural proteins, initiating the complete viral life cycle and producing progeny virus.
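As an illustration of the transcription step, the sketch below locates the T7 promoter (SEQ ID NO 7) in the F1 sense primer and derives the expected 5' end of the run-off transcript. It is a hedged sketch only: the rule that T7 RNA polymerase starts at the first G following the `TATA` element is general knowledge about T7 promoters rather than something stated in this document, and the function name `transcript_5prime` is hypothetical.

```python
def clean(seq: str) -> str:
    """Remove cosmetic spacing and upper-case the sequence."""
    return "".join(seq.split()).upper()


T7_PROMOTER = clean("TAA TAC GAC TCA CTA TAGG")  # SEQ ID NO 7
F1_SENSE = clean(
    "GACGC GGCCG CTAA TAC GACTC ACTATAG GTTAAAACAGC CTGT GGGT TGCAC CC"
)  # SEQ ID NO 10

# Assumption: transcription starts at the first G after TAATACGACTCACTATA,
# i.e. 17 nt into the 19-nt promoter sequence given as SEQ ID NO 7.
START_OFFSET = 17


def transcript_5prime(template: str) -> str:
    """Return the RNA (as a string with U) transcribed from the T7 start site onward."""
    index = template.find(T7_PROMOTER)
    if index < 0:
        raise ValueError("T7 promoter not found in template")
    return template[index + START_OFFSET:].replace("T", "U")


rna_start = transcript_5prime(F1_SENSE)
print(rna_start)
```

Under this assumption the transcript begins with two G residues contributed by the promoter, followed by the viral 5' terminal sequence carried in the primer.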
Because of the degeneracy of the genetic code, codons can be changed without changing the protein sequence, and the same functional protein product is still obtained; the present invention therefore includes other nucleic acid sequences and infectious clones that encode the same protein as "protein sequence 4".
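Codon degeneracy can be shown with a short Python sketch that translates two different DNA sequences with the standard genetic code and compares the resulting peptides. The two 15-nt sequences are hypothetical examples chosen for illustration and are not taken from the sequence listing; `translate` and `CODON_TABLE` are likewise illustrative names.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons of a DNA sequence, reading from position 0."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical synonymous sequences: every codon differs except ATG.
variant_a = "ATGGGTTCGCAAGTG"
variant_b = "ATGGGCAGCCAGGTC"

peptide_a = translate(variant_a)
peptide_b = translate(variant_b)
print(peptide_a, peptide_b, peptide_a == peptide_b)
```

The same comparison, applied to a full ORF, would indicate whether a recoded clone still encodes the polyprotein given as protein sequence 4.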
The virus produced from the infectious clone of the present invention (nucleic acid sequence 1) replicates efficiently in cells (Fig. 2). It can be used to infect cultured cell lines, neural tissue, mice (Fig. 6), monkeys, and other hosts in order to establish cell and animal models of viral infection for drug research and development.
This infectious clone (nucleic acid sequence 1) was modified by inserting a reporter gene at a specific position in the viral genome (immediately upstream of the VP4 coding region), so that translation of the reporter gene is initiated by the viral IRES. An additional amino acid sequence (AITTL) was added to the C-terminus of the reporter protein (Fig. 1B); this site is recognized and cleaved by the viral 3C protease, generating the normal N-terminus of VP4. We successfully inserted the luciferase reporter gene NanoLuc (Nluc) and the fluorescent protein gene EGFP into the infectious clone (nucleic acid sequence 1), yielding an infectious clone carrying Nluc (nucleic acid sequence 5) and an infectious clone carrying EGFP (nucleic acid sequence 6), respectively (Fig. 1B). Each reporter gene was joined into the pEV71-js1 plasmid (nucleic acid sequence 1) by fusion PCR, and the resulting plasmids were named pEV71-js1-Nluc (nucleic acid sequence 5) (Fig. 1B) and pEV71-js1-EGFP (nucleic acid sequence 6) (Fig. 1C). As above, these infectious clones are linearized with HindIII in vitro and the full-length viral RNA is transcribed with a T7 transcription kit; once the in vitro transcribed RNA is introduced into host cells such as Vero cells by electroporation or transfection, it initiates the viral life cycle and produces progeny virus (Fig. 2). The virus expresses the reporter genes Nluc and EGFP during replication. Nluc can be measured with a commercial luciferase activity assay kit (Fig. 5). EGFP expression can be observed by fluorescence microscopy (Fig. 4) or measured by flow cytometry. Progeny virus carrying the reporter gene can infect fresh cells and replicate efficiently in them. Because the reporter gene lies in the same open reading frame as the viral proteins, its expression level reflects the level of viral protein and therefore also the level of viral replication. In addition, the reporter gene is not lost from the recombinant virus during serial passage over a considerable period (Fig. 4). The reporter-containing recombinant viruses allow viral replication and packaging to be measured quickly and conveniently, and can be used to study the viral life cycle, virus-host interactions, and viral immunology, and to develop antiviral drugs.
By modifying this infectious clone (nucleic acid sequence 1) along the lines used for other members of the genus Enterovirus, for example by deleting the region encoding the structural proteins VP4-VP2-VP3-VP1, replication-competent non-infectious viruses such as a subgenomic replicon can be constructed. The subgenomic replicon can replicate the viral genome but, lacking the viral structural proteins, cannot package progeny virus. The subgenomic replicon RNA can, however, be trans-complemented by separately expressed structural proteins and packaged into recombinant subviral particles (RSPs) (Barclay, et al. J Gen Virol. 1998, 79:1725-1734; Jia, et al. J Virol. 1998, 72:7972-7977). Such subviral particles can carry out a single round of infection, but because the genome does not encode the structural proteins, no new virus particles are packaged after infection; they are therefore non-replicating virus particles (defective variants). These non-replicating virus particles can serve as one form of vaccine.
This infectious clone (nucleic acid sequence 1) can also be modified to produce a live-attenuated virus, which can be used as a vaccine. Following the attenuation strategy used for poliovirus, which also belongs to the family Picornaviridae, an attenuated vaccine can be constructed by introducing mutations into the 5'-NTR (Arita, et al. J Virol. 2008, 82:1787-1797).
Mice were infected with virus produced from the infectious clone to establish a convenient and stable animal infection model. In the present application, it was found that after more than three passages the originally isolated strain caused lower lethality in neonatal mice; sequencing showed that position 145 of VP1 in the passaged virus had mutated from E to G. A mouse model established with the isolated virus is therefore poorly reproducible, whereas producing the virus from the infectious clone ensures that the viral sequence is not affected by cell passage. In the present application, neonatal mice of different strains (ICR, Balb/c, C57) infected with virus obtained from the above infectious clone all showed 100% mortality within 9 days (Fig. 6B), whereas virus carrying the VP1 145G mutation did not kill infected mice (Fig. 6C). This animal model can be conveniently used to evaluate antiviral drugs and vaccines.
Enterovirus 71 (human enterovirus type 71, EV71) is a member of the enterovirus group within the family Picornaviridae. EV71 is the main pathogen causing hand, foot and mouth disease (HFMD) in children worldwide, and it can cause both mild and severe HFMD. The virus can infect and damage the central nervous system, but the mechanism is unknown. There is currently no effective antiviral drug for EV71. In the present invention, a clinical EV71 strain was isolated and a stable full-length cDNA clone of the virus was constructed by molecular cloning; in vitro transcription of RNA and transfection of Vero cells confirmed that viral RNA derived from the cDNA clone produces EV71 virus. Further, the present application constructed recombinant viruses containing the reporter genes Gluc (Gaussia luciferase) and EGFP and confirmed that these recombinant viruses can infect host cells and cause cytopathic effect. Infection of immunocompetent suckling ICR, Balb/c, and C57 mice with EV71 derived from the infectious clone caused 100% of the infected mice to develop symptoms of neurological damage and die within 10 days.
The present invention provides a stable infectious cDNA clone based on a clinically isolated EV71 strain, derivative clones containing various reporter genes, and various mutant clones constructed from it; the recombinant viruses and subunit virus particles produced from these clones; animal models established by infecting animals with the recombinant viruses produced from these clones; the use of these viruses or subunit virus particles in vaccine development and in diagnostic reagents; and the use of this virus as a gene therapy vector or expression vector.
Further advantages of the present invention are as follows:
The present invention includes plasmids for various recombinant viruses and subunit virus particles that are constructed by molecular biology methods using these clone plasmids as the parent.
The present invention also includes the various recombinant viruses and subunit virus particles that can be produced from these clones and that contain the above cDNA.
The present invention also includes animal infection models constructed with the various recombinant viruses that can be produced from these clones.
The present invention also includes the use of these viruses or subunit virus particles and animal models in vaccine development and diagnostic reagents.
The present invention also includes the use of animal models established with these viruses or subunit virus particles in the development of vaccines and antiviral drugs.
The present invention also includes the use of the plasmids for this virus or its subviral units as gene therapy vector or expression vector plasmids, and the viruses or subviral particles produced from these plasmids.
The present invention provides new tools and approaches for the detection and prevention of, and immunization against, EV71 infection, and makes it possible to use the infectious clone of this EV71 strain as a viral vector for gene therapy and vaccine development.
Brief Description of the Drawings
Figure 1: Construction of the infectious cDNA clone of EV71 strain js1.
(A) Strategy for constructing the infectious clone. Schematic of the complete EV71 genome; the black bars at the two ends represent the 5'-NTR and 3'-NTR, and the structural and non-structural protein regions are as indicated. The full-length viral sequence was amplified in four segments; the first segment, F1, contains the T7 sequence, and the fourth segment, F4, contains the poly(A)30 sequence introduced by the PCR primer. The amplified fragments were ligated sequentially into the pACNR vector using the restriction enzymes shown in the figure to give the full-length clone. (B) The Nluc or EGFP gene was fused in frame to the N-terminus of VP4 by fusion PCR, and an additional amino acid sequence, AITTL, was added to the C-terminus of Nluc or EGFP so that cleavage by the viral protease generates the correct N-terminus of VP4.
Figure 2: Replication and infectivity of virus produced from the infectious cDNA clone of EV71 strain js1.
The infectious clone plasmid was used as the template for in vitro transcription of viral RNA, which was introduced into Vero cells by electroporation. Virus in the supernatant was collected and titrated by plaque assay on Vero cells. Top: plaques formed by virus from the infectious clone (Clone-WT) compared with plaques formed by the parental virus (Parent). Bottom: Vero cells were infected (MOI=0.1) with equal titers of clone-derived virus or parental virus, supernatants were collected at different times after infection (h.p.i.) and titrated by plaque assay, and the growth curves of the two viruses are shown. Virus titers are expressed in PFU/ml.
Figure 3: Production of recombinant viruses containing the reporter genes Nluc and EGFP.
(A) Infectious clone plasmids containing the reporter gene Nluc or EGFP, and the infectious clone plasmid without a reporter gene, were transcribed in vitro into viral RNA, which was introduced into Vero cells by electroporation. Virus in the supernatant was collected and titrated by plaque assay on Vero cells; plaques formed by the reporter-containing recombinant viruses are compared with plaques formed by the virus without a reporter gene. (B) Vero cells were infected (MOI=0.1) with equal titers of each reporter-containing virus or the virus without a reporter gene; supernatants were collected on different days after infection and titrated by plaque assay to give the growth curves. Virus titers are expressed in PFU/ml.
Figure 4: Stability of the recombinant virus containing the EGFP reporter gene.
Supernatant from cells infected with the recombinant virus EV71-EGFP was diluted 1:10 and used to infect fresh Vero cells. Two days after infection, the cells were observed by fluorescence microscopy and the supernatant was collected, diluted 1:10 again, and used to infect fresh Vero cells (C+1). Two days later, the cells were again observed by fluorescence microscopy and the supernatant was collected for the next round of infection. The virus was passaged serially in this way, and EGFP expression in the infected cells was observed at each passage.
Figure 5: Nluc activity produced by the recombinant virus containing the Nluc reporter gene.
The infectious clone plasmid containing the reporter gene Nluc, and plasmids carrying the VP1 E145G or 3C C147A mutation, were transcribed in vitro into viral RNA, which was introduced into Vero cells by electroporation. Cells were collected at different time points after electroporation and intracellular Nluc activity was measured. C147A is a mutation that abolishes the enzymatic activity of the 3C protease.
Figure 6: Establishment of an animal infection model by infecting mice with virus produced from the infectious cDNA clone of EV71 strain js1.
(A) Three-day-old neonatal mice of different strains were infected with virus produced from the infectious cDNA clone (1.4×10^4 PFU per mouse) and observed 5 days after infection. (B) Survival curves of the mice after infection (n=5/group). (C) Survival curves of 3-day-old ICR mice infected with virus produced from the infectious cDNA clone (WT) or from the clone carrying the VP1 E145G mutation (n=5/group).
Detailed Description of the Embodiments
The methods used in the present invention are all conventional molecular biology methods, and the specific operating details are not repeated here.
Example 1: Construction of the infectious cDNA clone of EV71 strain js1
As shown in Fig. 1A, virus isolated from a stool specimen was cultured in RD cells. When clear cytopathic effect appeared, total cellular RNA was extracted and reverse transcribed with SuperScript II reverse transcriptase (Invitrogen) using the sequence-specific primer (GCTAG CGCTttt tttttttttttt tttttttttt ttttt). The resulting cDNA was used as the template for PCR amplification in four segments with the high-fidelity polymerase SuperFi (Invitrogen). The amplification primers were F1 (S: GACGC GGCCG CTAA TAC GACTC ACTATAG GTTAAA ACAGC CTGT GGGT TGCAC CC; As: GCACTG CACGT GGATGCAGAAC), F2 (S: GACGCG GCCGCG TTCT GCAT CCAC GTGCA GTGC; As: AAGTC GCGA GAGCTGTCTTC CC), F3 (S: GACGCG GCCGCG GGAA GACAG CTCTCG CGACTT; As: AATTG TACAT CATGGTGC GATGG GTAGG), and F4 (S: GACGC GGCCGCCCTAC CCATCG CACCATG ATGTAC AATT; As: GCTAGC GCTtttttttt tttttttt tttttttt ttttttGCT ATTCT GGTTAT AACAA ATTTA CCCCCACCAG). The amplified F4 fragment was first digested with the restriction enzymes NotI/AfeI and ligated into the pANCR vector digested with the same enzymes, giving plasmid pANCR-F4. The PCR-amplified F3 fragment was ligated into pANCR-F4 via NruI/BsrGI to give pANCR-F34; the PCR-amplified F2 fragment was ligated into pANCR-F34 via PmlI/NruI to give pANCR-F234; and the PCR-amplified F1 fragment was ligated into pANCR-F234 via NotI/PmlI to give the final full-length cDNA clone, named pEV71-js1.
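The enzyme pairs named for each cloning step can be cross-checked against the primers. The Python sketch below scans each primer for the recognition sequences of the five enzymes mentioned in this example. The recognition sequences are standard values for these enzymes and are not given in this document; the names `SITES` and `sites_in` are illustrative. Because all five sites are palindromic, searching one strand is sufficient.

```python
def clean(seq: str) -> str:
    """Remove cosmetic spacing and upper-case the sequence."""
    return "".join(seq.split()).upper()


# Recognition sequences (standard values, assumed here; all palindromic).
SITES = {
    "NotI": "GCGGCCGC",
    "PmlI": "CACGTG",
    "NruI": "TCGCGA",
    "BsrGI": "TGTACA",
    "AfeI": "AGCGCT",
}

# Primers as printed in Example 1.
PRIMERS = {
    "F1-S": clean("GACGC GGCCG CTAA TAC GACTC ACTATAG GTTAAA ACAGC CTGT GGGT TGCAC CC"),
    "F1-As": clean("GCACTG CACGT GGATGCAGAAC"),
    "F2-S": clean("GACGCG GCCGCG TTCT GCAT CCAC GTGCA GTGC"),
    "F2-As": clean("AAGTC GCGA GAGCTGTCTTC CC"),
    "F3-S": clean("GACGCG GCCGCG GGAA GACAG CTCTCG CGACTT"),
    "F3-As": clean("AATTG TACAT CATGGTGC GATGG GTAGG"),
    "F4-S": clean("GACGC GGCCGCCCTAC CCATCG CACCATG ATGTAC AATT"),
    "F4-As": clean(
        "GCTAGC GCTtttttttt tttttttt tttttttt ttttttGCT ATTCT GGTTAT AACAA ATTTA CCCCCACCAG"
    ),
}


def sites_in(seq: str) -> set:
    """Return the names of the enzymes whose recognition sequence occurs in seq."""
    return {name for name, site in SITES.items() if site in seq}


site_map = {name: sites_in(seq) for name, seq in PRIMERS.items()}

for name, found in site_map.items():
    print(f"{name}: {', '.join(sorted(found)) or 'none'}")
```

Read against the cloning order, the scan shows which primer supplies each site, for example PmlI at the F1/F2 boundary and AfeI at the 3' end of F4.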
To construct the infectious clone plasmid carrying the reporter gene EGFP (as shown in Fig. 1B), three fragments were joined by fusion PCR. EGFP-F1 is the EV71 5' UTR sequence, amplified with primers S: CCTGA CGTG TCGA CGCGG, SEQ ID NO 18, and As: cctc gccct tgctcac CATcatatgG TTTAGCTGT GTTAAG GGTCAAGA, SEQ ID NO 19. EGFP-F2 is the fragment containing EGFP, amplified with primers S: TCTT GACC CTTAAC ACAGC TAA ACcatatgATG gtga gcaag ggcg agg, SEQ ID NO 20, and As: CGCT GTGT AGACAC TTGCGA ACCAAGAGTGGTG ATCGC atgcat cttgtac agctcgt ccatgc cg, SEQ ID NO 21. EGFP-F3 is the fragment containing the VP4 and VP2 regions, amplified with primers S: cggca tggac gagct gtaca agatgc atGCGA TCACCACT CTTGG TTCGC AAGTG TCTA CACAG CG, SEQ ID NO 22, and As: CTGC ACGT GGAT GCA GAACCC, SEQ ID NO 23. After the three fragments were joined by fusion PCR, the product was ligated into the pEV71-js1 plasmid via NotI/PmlI, replacing the corresponding sequence of the original plasmid, to give plasmid pEV71-js1-EGFP.
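The in-frame junction between EGFP, the AITTL cleavage sequence, and the downstream viral sequence can be read from the EGFP-F3 sense primer (SEQ ID NO 22). The Python sketch below translates that primer with the standard genetic code. The reading frame is an assumption: it is chosen here so that the lower-case EGFP portion reads in frame, which corresponds to skipping the first nucleotide of the primer. The names `translate` and `FRAME_OFFSET` are illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, offset: int = 0) -> str:
    """Translate complete codons of dna starting at the given offset."""
    dna = "".join(dna.split()).upper()[offset:]
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# EGFP-F3 sense primer as printed (SEQ ID NO 22); spacing and case are cosmetic.
EGFP_F3_SENSE = (
    "cggca tggac gagct gtaca agatgc atGCGA TCACCACT CTTGG "
    "TTCGC AAGTG TCTA CACAG CG"
)

# Assumption: the EGFP reading frame starts at the second nucleotide of the primer.
FRAME_OFFSET = 1

junction_peptide = translate(EGFP_F3_SENSE, FRAME_OFFSET)
linker_start = junction_peptide.find("AITTL")
print(junction_peptide)
print("AITTL begins at residue", linker_start + 1)
```

In this frame the primer encodes the end of the reporter, two residues from the `atgcat` site, the AITTL sequence, and then the first residues of the downstream viral region, so a frame shift at the junction would be visible immediately as a changed peptide.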
To construct the infectious clone plasmid carrying the Nluc reporter gene (Figure 1B), two fragments were joined by fusion PCR. Nluc-F1 is the EV71 5' UTR sequence, amplified with primers S: CTGC ACGT GGAT GCA GAA CCC (SEQ ID NO 24) and As: gaaa tcttcg agtgtga agaccattct agaGTT TAGC TGTG TTA AGGG TCAAG (SEQ ID NO 25). Nluc-F2 is the fragment containing Nluc, amplified with primers S: CTTG ACCC TTAAC ACAGCTAA ACtct agaat ggtctt cacac tcgaa gatttc (SEQ ID NO 26) and As: CGCat gcatcg ccagaatgcgt tcgca (SEQ ID NO 27). After the two fragments were joined by fusion PCR, the product was ligated into plasmid pEV71-js1-EGFP via NotI/NsiI, replacing the corresponding sequence of that plasmid, to give plasmid pEV71-js1-Nluc.
Example 2: Replication capacity and infectivity of virus produced from the infectious cDNA clone of EV71 strain js1
The infectious clone plasmid pEV71-js1 was linearized by digestion with HindIII and then transcribed with a T7 in vitro transcription kit (Ambion). The in vitro transcribed RNA (3 μg) was introduced into Vero cells by electroporation. Two days after electroporation, once cytopathic effect appeared, the virus-containing supernatant was collected, centrifuged at 3000 g for 10 min, and passed through a 0.45 μm filter to remove cell debris. Virus in the supernatant was titrated by plaque assay. Plaques formed by virus produced from the infectious clone plasmid are compared with those of the originally isolated parental virus in Figure 2 (top); plaque morphology and size did not differ significantly. Vero cells were then infected (MOI = 0.1) with equal titers of clone-derived virus or parental virus, and supernatants collected at different times after infection were titrated by plaque assay (expressed in PFU/ml). The resulting growth curves are shown in Figure 2 (bottom) and did not differ significantly.
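The plaque-assay titer and the MOI = 0.1 infection used in this example rest on two simple relations: titer (PFU/ml) = plaques counted / (dilution × volume plated), and PFU required = MOI × number of cells. The Python sketch below works through both. The plaque count, dilution, plated volume and cell number are hypothetical illustration values and do not come from the patent.

```python
import math


def titer_pfu_per_ml(plaques: int, dilution: float, volume_ml: float) -> float:
    """Titer from a plaque count at a given dilution and plated volume."""
    return plaques / (dilution * volume_ml)


def inoculum_volume_ml(moi: float, cells: float, titer: float) -> float:
    """Volume of stock needed to infect `cells` cells at the given MOI."""
    return moi * cells / titer


if __name__ == "__main__":
    # Hypothetical values: 42 plaques from 0.1 ml of a 1e-5 dilution.
    titer = titer_pfu_per_ml(plaques=42, dilution=1e-5, volume_ml=0.1)
    print(f"titer: {titer:.2e} PFU/ml")

    # Hypothetical well of 1e6 Vero cells infected at MOI = 0.1.
    volume = inoculum_volume_ml(moi=0.1, cells=1e6, titer=titer)
    print(f"inoculum: {volume * 1000:.2f} microlitres of undiluted stock")
```

In practice such small volumes would be taken from a diluted stock; the sketch only shows the arithmetic.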
Example 3: Generation and stability of recombinant viruses carrying the Nluc and EGFP reporter genes
Infectious clone plasmids carrying the Nluc or EGFP reporter gene, together with the infectious clone plasmid without a reporter gene, were transcribed in vitro into viral RNA as above and electroporated into Vero cells. Two days later, virus in the cell supernatant was collected and titrated on Vero cells by plaque assay. As shown in Figure 3A, viruses carrying the EGFP or Nluc reporter gene formed plaques similar in morphology and size to those of the reporter-free virus. Vero cells were then infected (MOI = 0.1) with equal titers of each reporter virus or the reporter-free virus, and supernatants collected on different days after infection were titrated by plaque assay. In the resulting growth curves (Figure 3B), the growth of the reporter viruses lagged behind that of the wild-type virus, suggesting that the fused reporter gene delays the viral replication cycle.
The replication capacity of virus produced from the Nluc infectious clone plasmid can be assessed by measuring intracellular Nluc activity with an Nluc substrate (Promega). Plasmids carrying the VP1 E145G or 3C C147A mutation were transcribed in vitro into viral RNA, the viral RNA was electroporated into Vero cells, and intracellular Nluc activity was measured at different times. As shown in Figure 5, after transfection of viral RNA carrying the 3C protease-inactivating mutation (C147A), intracellular Nluc activity stopped rising after 8 hours; this activity reflects only the initial translation of the input viral RNA. After transfection of wild-type or VP1 E145G viral RNA, by contrast, Nluc activity rose steadily over time, indicating normal viral replication.
To demonstrate the stability of the reporter gene in the EGFP virus, supernatant from cells infected with EV71-EGFP virus was diluted 1:10 and used to infect fresh Vero cells. Two days after infection, the cells were examined by fluorescence microscopy and the supernatant was collected, diluted 1:10 again, and used to infect fresh Vero cells (C+1). Two days later the cells were again examined by fluorescence microscopy and the supernatant was collected for the next round of infection. Passaging was repeated in this way while EGFP expression in the infected cells was monitored. The EGFP gene remained stable after at least 6 consecutive passages.
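One way to see why the serial-passage experiment tests reporter stability is to compute how much of the original inoculum could still be present after repeated 1:10 dilutions. The Python sketch below does this with exact fractions. It is a simplified illustration under the assumption that carry-over is governed by dilution alone, ignoring virus decay and adsorption, and the function name is hypothetical.

```python
from fractions import Fraction


def carryover_fraction(passages: int, dilution: Fraction = Fraction(1, 10)) -> Fraction:
    """Fraction of the original inoculum remaining after serial dilution.

    Each passage transfers supernatant at the given dilution, so the
    original material is diluted by dilution ** passages.
    """
    if passages < 0:
        raise ValueError("passages must be non-negative")
    return dilution ** passages


if __name__ == "__main__":
    for n in range(1, 7):
        print(f"passage {n}: original inoculum diluted to {carryover_fraction(n)}")
```

After six passages the starting material is diluted a million-fold, so EGFP-positive cells at that point are consistent with virus that has replicated while retaining the reporter gene.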
Example 4: Establishing an animal infection model by infecting mice with virus produced from the infectious cDNA clone of EV71 strain js1
As shown in Figure 6A, 3-day-old neonatal mice of different strains were infected with virus produced from the infectious cDNA clone (1.4×10^4 PFU per mouse) and observed 5 days after infection. Unlike uninfected mice, all infected mice showed paralysis of the limbs. Survival curves for the different mouse strains after infection are shown in Figure 6B; mortality reached 100% within 10 days in every strain. Virus produced from the infectious clone carrying the VP1 E145G mutation, unlike the wild-type virus, did not kill 3-day-old ICR mice. This indicates that residue E145 is a decisive determinant of lethality in infected mice, and it also explains why the lethality of passaged virus declines as the number of passages increases.
Sequence Listing
<110> Fudan University
<120> Infectious cDNA Clone Based on EV71 Strain and Its Application
<130> 20190601
<160> 27
<170> SIPOSequenceListing 1.0
<210> 1
<211> 9446
<212> DNA
<213> Artificial
<400> 1
gctagcggag tgtatactgg cttactatgt tggcactgat gagggtgtca gtgaagtgct 60
tcatgtggca ggagaaaaaa ggctgcaccg gtgcgtcagc agaatatgtg atacaggata 120
tattccgctt cctcgctcac tgactcgcta cgctcggtcg ttcgactgcg gcgagcggaa 180
atggcttacg aacggggcgg agatttcctg gaagatgcca ggaagatact taacagggaa 240
gtgagagggc cgcggcaaag ccgtttttcc ataggctccg cccccctgac aagcatcacg 300
aaatctgacg ctcaaatcag tggtggcgaa acccgacagg actataaaga taccaggcgt 360
ttcccctggc ggctccctcg tgcgctctcc tgttcctgcc tttcggttta ccggtgtcat 420
tccgctgtta tggccgcgtt tgtctcattc cacgcctgac actcagttcc gggtaggcag 480
ttcgctccaa gctggactgt atgcacgaac cccccgttca gtccgaccgc tgcgccttat 540
ccggtaacta tcgtcttgag tccaacccgg aaagacatgc aaaagcacca ctggcagcag 600
ccactggtaa ttgatttaga ggagttagtc ttgaagtcat gcgccggtta aggctaaact 660
gaaaggacaa gttttggtga ctgcgctcct ccaagccagt tacctcggtt caaagagttg 720
gtagctcaga gaaccttcga aaaaccgccc tgcaaggcgg ttttttcgtt ttcagagcaa 780
gagattacgc gcagaccaaa acgatctcaa gaagatcatc ttattaaggg gtctgacgct 840
cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 900
acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa 960
acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta 1020
tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc 1080
ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat 1140
ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta 1200
tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt 1260
aatagtttgc gcaacgttgt tgccattgct gcaggcatcg tggtgtcacg ctcgtcgttt 1320
ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg 1380
ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc 1440
gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc 1500
gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg 1560
cggcgaccga gttgctcttg cccggcgtca acacgggata ataccgcgcc acatagcaga 1620
actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta 1680
ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct 1740
tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag 1800
ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca atattattga 1860
agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat 1920
aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt gtcgacgcgg 1980
ccgctaatac gactcactat aggttaaaac agcctgtggg ttgcacccac tcacagggcc 2040
tactgggcgc aagcactctg gtacctcggt acctttgtgc gcctgtttta cacccccccc 2100
ccaatgaaac ttagaagcaa taaaccacga tcaatagcag gcataacgct ccagttatgt 2160
cttgatcaag cacttctgtt tccccggact gagtatcaat agactgctcg cgcggttgaa 2220
ggagaaaacg ttcgttatcc ggctaactac ttcggaaaac ctagtaacac catgaaagtt 2280
gcggagagct tcgttcagca ctcccccagt gtagatcagg tcgatgagtc accgcgttcc 2340
ccacgggcga ccgtggcggt ggctgcgttg gcggcctgcc catggggtaa cccatggggc 2400
gctctaatac ggacatggtg tgaagagtct actgagctag ttggtagtcc tccggcccct 2460
gaatgcggct aatcccaact gcggagcaca cgcccacaag ccagcgggta gtgtgtcgta 2520
acgggtaact ctgcagcgga accgactact ttgggtgtcc gtgtttcctt ttatctttat 2580
attggctgct tatggtgaca attaaagaat tgttaccata tagctattgg attagccatc 2640
cggtgtgcaa cagagcaatt atttacctat ttattggttt tgtaccatta acctcgaatt 2700
ctgtgaccac ccttaattat atcttgaccc ttaacacagc taaacatggg ttcgcaagtg 2760
tctacacagc gctccggttc ttacgaaaac tcaaactcag ccactgaggg ttctaccata 2820
aactacacca ccattaatta ctacaaagac tcctatgctg ccacagcagg caaacagagt 2880
ctcaagcagg atccagacaa gtttgcaaat cctgttaaag acatattcac cgaaatggca 2940
gcgccactga agtccccatc cgctgaggca tgtggataca gtgatcgagt ggcgcaatta 3000
actattggca actccaccat cacgacgcaa gaagcggcta acatcatagt cggctatggt 3060
gagtggcctt cctactgctc agattctgac gctacagcag tggataaacc aacgcgcccg 3120
gatgtttcag tgaacaggtt ttacacattg gacactaaat tgtgggagaa atcgtccaag 3180
ggatggtact ggaagttccc ggatgtgtta actgaaactg gggtttttgg gcaaaatgca 3240
caattccact acctctaccg atcagggttc tgcatccacg tgcagtgcaa tgccagtaaa 3300
ttccaccaag gagcactcct agtcgctgtc ctaccagagt atgtcattgg gacagtggca 3360
ggcggtacag ggacggaaga cacccacccc ccctacaagc agacccaacc cggcgccgat 3420
ggtttcgagt tgcaacaccc gtacgtgctt gatgctggca tcccaatatc acagttaaca 3480
gtgtgcccac accagtggat taatttgagg accaacaatt gtgctacaat aatagtgcca 3540
tacattaacg cactgccttt tgattctgcc ttgaaccatt gcaactttgg cctgttagtt 3600
gtgcctatta gcccactaga ctacgaccaa ggagcaacgc cagtaatccc tataactatc 3660
acattggccc caatgtgctc tgaattcgca ggtcttaggc aggcagtcac gcaagggttc 3720
cccaccgagc taaaacctgg cacaaatcaa tttttaacca ccgatgatgg cgtctcagca 3780
cctattctac caaacttcca ccccaccccg tgtatccaca tacctggtga agttaggaac 3840
ttgctagagt tatgccaggt ggagaccatt ctggaggtta acaatgtgcc cacgaatgcc 3900
actagcttaa tggagagact gcgcttcccg gtctcagcac aagcagggaa aggtgaactg 3960
tgtgcggtgt ttagagccga tcctgggcga aatggaccat ggcaatccac cttactgggc 4020
cagttgtgcg ggtactacac ccaatggtca gggtcattgg aagtcacctt catgtttact 4080
ggatccttca tggctaccgg caagatgctc atagcctata caccgccagg gggtcctctg 4140
cccaaggacc gggcgaccgc catgttgggc acgcacgtca tctgggattt tgggctgcaa 4200
tcgtctgtta cccttgtaat accatggatc agtaacactc attatagagc acatgcccga 4260
gatggagtgt ttgactatta cactacaggg ttagtcagta tatggtacca gacaaattac 4320
gtggttccaa tcggtgcgcc caacacagcc tatataatag cactagcggc agcccaaaag 4380
aacttcacta tgaaattgtg caaggatgct agtgatatcc tgcagacggg caccatccag 4440
ggagataggg tggcagatgt aattgaaagt tccataggag atagcgtgag cagagccctc 4500
actcacgctc taccagcacc cacaggccaa aacacacagg tgagcagtca tcgactggat 4560
acaggcaagg ttccagcact ccaagctgct gaaattgggg catcatcaaa tgctagtgac 4620
gagagcatga ttgaaacacg ttgtgttctt aactcgcata gtacagctga gaccactctt 4680
gatagtttct tcagtagggc aggattagtt ggagagatag atctccctct tgagggcaca 4740
actaacccaa atggttatgc caactgggac atagatataa caggttacgc gcaaatgcgt 4800
agaaaggtag agctattcac ctacatgcgt tttgatgcag agttcacttt tgttgcgtgc 4860
acacccaccg gggaggttgt cccacaattg ctccaatata tgtttgtgcc acctggagcc 4920
cctaagccag attctaggga atcccttgca tggcaaaccg ccaccaaccc ctcagttttt 4980
gtcaagctgt cagaccctcc ggcgcaggtt tcagtgccat tcatgtcacc tgcgagtgct 5040
tatcaatggt tttatgacgg atatcccaca ttcggagaac acaaacagga gaaagacctt 5100
gaatacgggg catgtcctaa taacatgatg ggtacattct cagtgcggac tgtggggacc 5160
tccaagtcca agtacccttt agtggttagg atttacatga gaatgaagca cgtcagggcg 5220
tggatacctc gcccgatgcg caaccagaac tacctgttca aagccaaccc aaattatgct 5280
ggcaactcta ttaagccaac tggtgccagt cgcacagcga tcaccactct tgggaaattt 5340
ggacaacagt ctggggctat ttatgtgggc aactttagag tggtcaaccg acatcttgcc 5400
acccataatg attgggcaaa tcttgtttgg gaagacagct ctcgcgactt gctcgtgtca 5460
tccaccactg cccaaggttg tgacacgatt gcccgttgcg attgccagac aggggtgtac 5520
tactgtaact cgatgagaaa acactaccca gtcagttttt caaaacccag cctgatctat 5580
gtagaggcta gcgagtatta cccagccagg taccaatcac atctcatgct cgcacagggt 5640
cactcggaac ctggtgattg cggtggtatc cttaggtgcc aacatggcgt catcggcata 5700
gtgtctactg gtggcaatgg gctcgttggc tttgcagacg tcagagacct cttgtggtta 5760
gatgaagaag ctatggaaca gggcgtgtcc gactacatta agggtctcgg agatgctttt 5820
ggaacaggct tcactgacgc agtctcaagg gaggttgaag ctctcaagaa ctatcttata 5880
gggtctgaag gagcagttga gaaaattttg aaaaatctta ttaaactaat ctctgcactg 5940
gtgattgtga tcagaagtga ttacgacatg gttaccctca ctgcaacctt agcgctgata 6000
ggttgtcatg gcagtccttg ggcttggatt aaagccaaaa cagcctccat cttaggtatc 6060
cctatcgccc aaaagcagag cgcttcctgg ctcaagaagt tcaatgacat ggccaacgcc 6120
gctaaggggt tagagtgggt ttccaacaag atcagcaaat ttattgattg gcttaaggag 6180
aaaatagtac cagcagccag ggagaaggtt gaattcctaa ataacttgaa acagctgcca 6240
ctgctagaga atcagatctc gaacttggaa caatctgctg cttcacaaga ggaccttgaa 6300
gtcatgtttg ggaatgtgtc gtacctagct cacttctgtc gcaagtttca accgctatac 6360
gccacggaag ctaaaagagt ctatgccctg gagaagagaa tgaataacta tatgcagttc 6420
aagagcaaac accgaattga acctgtatgt ctcattatta ggggctcacc aggcaccggg 6480
aagtctctag ccactggtat tattgctcga gcaatcgctg ataagtacca ctccagcgtg 6540
tactcgctcc caccagaccc ggatcatttt gacggttaca agcaacaggt ggttacagtg 6600
atggatgatt tgtgtcaaaa ccccgatggt aaggatatgt ccttattctg tcaaatggta 6660
tccaccgtag atttcattcc accaatggct tctctcgagg agaagggagt ttccttcacc 6720
tctaagtttg tcatcgcatc cactaatgcc agtaatatca tagtaccaac agtgtctgat 6780
tctgacgcta ttcgccgcag gttctacatg gactgtgaca ttgaagtgac agactcgtac 6840
aaaacagatc taggtagact ggatgcaggg cgagccgcta aactgtgttc tgaaaataac 6900
actgcaaatt tcaaacgttg cagcccatta gtgtgtggga aagccatcca acttagagat 6960
agaaagtcta aagtcagata cagtgtggat acggtggttt cagaacttat tagggaatac 7020
agcaataggt ccgccattgg taacacaatc gaggctcttt tccaaggtcc acccaagttc 7080
aggccaatta ggattagcct tgaagaaaaa ccagccccag acgctattag cgatctcctt 7140
gctagtgtag atagtgaaga agtgcgccag tactgcaggg atcaaggctg gattattcct 7200
gaagctccca ccaatgtgga gcggcacctt aatagagcgg tgctcgtcat gcaatccatc 7260
accacagtag tggcggttgt ttcgttggtg tacgtcatct acaagctctt tgcagggttt 7320
cagggtgcat attctggtgc tcctaagcaa gtgcttaaga aacctgctct tcgcacagca 7380
acagtgcagg gtccgagcct tgactttgct ctctccctac tgagaaggaa catcaggcag 7440
gtccaaacag accaagggca tttcaccatg ttgggtgtta gggatcgctt agcagtcctc 7500
ccacgccact cacaacctgg caaaaccatt tggattgagc acaaactcgt gaacgtcctt 7560
gatgcagttg aactggtgga tgagcaagga gtcaacctgg aattaaccct catcactctt 7620
gacaccaacg agaagtttag ggatatcacc aaattcatcc cagaaaatat cagcactgct 7680
agcgatgcca ccctagtgat caacacggag cacatgccgt caatgtttgt cccggtgggt 7740
gacgttgtgc agtatggctt tttgaatctc agtggcaagc ctacccatcg caccatgatg 7800
tacaattttc ctactaaagc aggacagtgt ggaggagtgg tgacatctgt tgggaaggtt 7860
gtcggtattc acattggtgg caatggcaga caaggttttt gcgcaggcct caaaaggagt 7920
tactttgcta gtgaacaagg agagatccag tgggttaagc ccaataaaga aactggaaga 7980
ctcaacatca atggaccaac ccgcaccaag ttagaaccta gtgtattcca tgacatcttc 8040
gagggaaata aggaaccagc tgtcttgcac agtaaagacc cccgacttga ggtagatttt 8100
gaacaggccc tgttctctaa gtatgtggga aacacactac atgagcctga cgagtacatc 8160
aaagaggcag ctctacatta tgcaaaccaa ttaaagcaac tagaaatcaa tacctctcaa 8220
atgagcatgg aggaggcctg ctatggtact gagaatcttg aggctattga tcttcacact 8280
agtgcaggtt acccctatag tgccctaggg ataaagaaaa gagacatctt agaccctacc 8340
accagggacg tgagtagaat gaagttctac atggacaagt atggtcttga tcttccctac 8400
tccacttatg tcaaggacga gctacgctcg attgataaaa tcaagaaagg gaagtcccgc 8460
ctgatcgagg ccagtagtct aaatgattca gtgtacctca gaatggcttt cgggcatttg 8520
tatgaggctt tccacgcaaa tcctgggacg ataactggat cggccgtggg gtgtaaccct 8580
gacacattct ggagcaagct gccaattttg ctccctggtt cactctttgc ctttgactac 8640
tcaggctatg atgccagcct tagccctgtc tggttcagag cattagaatt ggttcttagg 8700
gagatagggt atagtgaaga ggcaatctca ctcattgagg gaatcaacca cacacatcat 8760
gtgtatcgta ataagaccta ttgcgtgctt ggtgggatgc cctcaggctg ttcaggaaca 8820
tccatcttca actcaatgat caacaacatt attatcagag cactgctcat aaaaacattt 8880
aagggcattg atttggatga actcaacatg gtcgcttatg gagacgatgt gctcgctagc 8940
tatcccttcc caattgattg cttggaacta gcaaagactg gtaaggagta tggtctgacc 9000
atgacccctg ctgataaatc tccttgcttt aatgaggtca attggggtaa tgcgaccttc 9060
ctcaaaaggg gctttttgcc cgatgaacag tttccatttt tgattcaccc tactatgcca 9120
atgagggaga tccatgagtc cattcgatgg accaaggacg cacggaacac tcaagatcat 9180
gtgcggtcct tgtgcctcct agcatggcat aatggtaagc aagaatacga gaagtttgtg 9240
agcacaatta ggtctgtccc agtagggaga gcgttggcta ttccaaatta tgaaaatctt 9300
agacgaaatt ggctcgagtt attttagagg ttatacacac ctcaacccca ccagaaatct 9360
ggtcgtgaat gtgactggtg ggggtaaatt tgttataacc agaatagcaa aaaaaaaaaa 9420
aaaaaaaaaa aaaaaaaaaa gcttat 9446
<210> 2
<211> 7405
<212> DNA
<213> Artificial
<400> 2
ttaaaacagc ctgtgggttg cacccactca cagggcctac tgggcgcaag cactctggta 60
cctcggtacc tttgtgcgcc tgttttacac ccccccccca atgaaactta gaagcaataa 120
accacgatca atagcaggca taacgctcca gttatgtctt gatcaagcac ttctgtttcc 180
ccggactgag tatcaataga ctgctcgcgc ggttgaagga gaaaacgttc gttatccggc 240
taactacttc ggaaaaccta gtaacaccat gaaagttgcg gagagcttcg ttcagcactc 300
ccccagtgta gatcaggtcg atgagtcacc gcgttcccca cgggcgaccg tggcggtggc 360
tgcgttggcg gcctgcccat ggggtaaccc atggggcgct ctaatacgga catggtgtga 420
agagtctact gagctagttg gtagtcctcc ggcccctgaa tgcggctaat cccaactgcg 480
gagcacacgc ccacaagcca gcgggtagtg tgtcgtaacg ggtaactctg cagcggaacc 540
gactactttg ggtgtccgtg tttcctttta tctttatatt ggctgcttat ggtgacaatt 600
aaagaattgt taccatatag ctattggatt agccatccgg tgtgcaacag agcaattatt 660
tacctattta ttggttttgt accattaacc tcgaattctg tgaccaccct taattatatc 720
ttgaccctta acacagctaa acatgggttc gcaagtgtct acacagcgct ccggttctta 780
cgaaaactca aactcagcca ctgagggttc taccataaac tacaccacca ttaattacta 840
caaagactcc tatgctgcca cagcaggcaa acagagtctc aagcaggatc cagacaagtt 900
tgcaaatcct gttaaagaca tattcaccga aatggcagcg ccactgaagt ccccatccgc 960
tgaggcatgt ggatacagtg atcgagtggc gcaattaact attggcaact ccaccatcac 1020
gacgcaagaa gcggctaaca tcatagtcgg ctatggtgag tggccttcct actgctcaga 1080
ttctgacgct acagcagtgg ataaaccaac gcgcccggat gtttcagtga acaggtttta 1140
cacattggac actaaattgt gggagaaatc gtccaaggga tggtactgga agttcccgga 1200
tgtgttaact gaaactgggg tttttgggca aaatgcacaa ttccactacc tctaccgatc 1260
agggttctgc atccacgtgc agtgcaatgc cagtaaattc caccaaggag cactcctagt 1320
cgctgtccta ccagagtatg tcattgggac agtggcaggc ggtacaggga cggaagacac 1380
ccaccccccc tacaagcaga cccaacccgg cgccgatggt ttcgagttgc aacacccgta 1440
cgtgcttgat gctggcatcc caatatcaca gttaacagtg tgcccacacc agtggattaa 1500
tttgaggacc aacaattgtg ctacaataat agtgccatac attaacgcac tgccttttga 1560
ttctgccttg aaccattgca actttggcct gttagttgtg cctattagcc cactagacta 1620
cgaccaagga gcaacgccag taatccctat aactatcaca ttggccccaa tgtgctctga 1680
attcgcaggt cttaggcagg cagtcacgca agggttcccc accgagctaa aacctggcac 1740
aaatcaattt ttaaccaccg atgatggcgt ctcagcacct attctaccaa acttccaccc 1800
caccccgtgt atccacatac ctggtgaagt taggaacttg ctagagttat gccaggtgga 1860
gaccattctg gaggttaaca atgtgcccac gaatgccact agcttaatgg agagactgcg 1920
cttcccggtc tcagcacaag cagggaaagg tgaactgtgt gcggtgttta gagccgatcc 1980
tgggcgaaat ggaccatggc aatccacctt actgggccag ttgtgcgggt actacaccca 2040
atggtcaggg tcattggaag tcaccttcat gtttactgga tccttcatgg ctaccggcaa 2100
gatgctcata gcctatacac cgccaggggg tcctctgccc aaggaccggg cgaccgccat 2160
gttgggcacg cacgtcatct gggattttgg gctgcaatcg tctgttaccc ttgtaatacc 2220
atggatcagt aacactcatt atagagcaca tgcccgagat ggagtgtttg actattacac 2280
tacagggtta gtcagtatat ggtaccagac aaattacgtg gttccaatcg gtgcgcccaa 2340
cacagcctat ataatagcac tagcggcagc ccaaaagaac ttcactatga aattgtgcaa 2400
ggatgctagt gatatcctgc agacgggcac catccaggga gatagggtgg cagatgtaat 2460
tgaaagttcc ataggagata gcgtgagcag agccctcact cacgctctac cagcacccac 2520
aggccaaaac acacaggtga gcagtcatcg actggataca ggcaaggttc cagcactcca 2580
agctgctgaa attggggcat catcaaatgc tagtgacgag agcatgattg aaacacgttg 2640
tgttcttaac tcgcatagta cagctgagac cactcttgat agtttcttca gtagggcagg 2700
attagttgga gagatagatc tccctcttga gggcacaact aacccaaatg gttatgccaa 2760
ctgggacata gatataacag gttacgcgca aatgcgtaga aaggtagagc tattcaccta 2820
catgcgtttt gatgcagagt tcacttttgt tgcgtgcaca cccaccgggg aggttgtccc 2880
acaattgctc caatatatgt ttgtgccacc tggagcccct aagccagatt ctagggaatc 2940
ccttgcatgg caaaccgcca ccaacccctc agtttttgtc aagctgtcag accctccggc 3000
gcaggtttca gtgccattca tgtcacctgc gagtgcttat caatggtttt atgacggata 3060
tcccacattc ggagaacaca aacaggagaa agaccttgaa tacggggcat gtcctaataa 3120
catgatgggt acattctcag tgcggactgt ggggacctcc aagtccaagt accctttagt 3180
ggttaggatt tacatgagaa tgaagcacgt cagggcgtgg atacctcgcc cgatgcgcaa 3240
ccagaactac ctgttcaaag ccaacccaaa ttatgctggc aactctatta agccaactgg 3300
tgccagtcgc acagcgatca ccactcttgg gaaatttgga caacagtctg gggctattta 3360
tgtgggcaac tttagagtgg tcaaccgaca tcttgccacc cataatgatt gggcaaatct 3420
tgtttgggaa gacagctctc gcgacttgct cgtgtcatcc accactgccc aaggttgtga 3480
cacgattgcc cgttgcgatt gccagacagg ggtgtactac tgtaactcga tgagaaaaca 3540
ctacccagtc agtttttcaa aacccagcct gatctatgta gaggctagcg agtattaccc 3600
agccaggtac caatcacatc tcatgctcgc acagggtcac tcggaacctg gtgattgcgg 3660
tggtatcctt aggtgccaac atggcgtcat cggcatagtg tctactggtg gcaatgggct 3720
cgttggcttt gcagacgtca gagacctctt gtggttagat gaagaagcta tggaacaggg 3780
cgtgtccgac tacattaagg gtctcggaga tgcttttgga acaggcttca ctgacgcagt 3840
ctcaagggag gttgaagctc tcaagaacta tcttataggg tctgaaggag cagttgagaa 3900
aattttgaaa aatcttatta aactaatctc tgcactggtg attgtgatca gaagtgatta 3960
cgacatggtt accctcactg caaccttagc gctgataggt tgtcatggca gtccttgggc 4020
ttggattaaa gccaaaacag cctccatctt aggtatccct atcgcccaaa agcagagcgc 4080
ttcctggctc aagaagttca atgacatggc caacgccgct aaggggttag agtgggtttc 4140
caacaagatc agcaaattta ttgattggct taaggagaaa atagtaccag cagccaggga 4200
gaaggttgaa ttcctaaata acttgaaaca gctgccactg ctagagaatc agatctcgaa 4260
cttggaacaa tctgctgctt cacaagagga ccttgaagtc atgtttggga atgtgtcgta 4320
cctagctcac ttctgtcgca agtttcaacc gctatacgcc acggaagcta aaagagtcta 4380
tgccctggag aagagaatga ataactatat gcagttcaag agcaaacacc gaattgaacc 4440
tgtatgtctc attattaggg gctcaccagg caccgggaag tctctagcca ctggtattat 4500
tgctcgagca atcgctgata agtaccactc cagcgtgtac tcgctcccac cagacccgga 4560
tcattttgac ggttacaagc aacaggtggt tacagtgatg gatgatttgt gtcaaaaccc 4620
cgatggtaag gatatgtcct tattctgtca aatggtatcc accgtagatt tcattccacc 4680
aatggcttct ctcgaggaga agggagtttc cttcacctct aagtttgtca tcgcatccac 4740
taatgccagt aatatcatag taccaacagt gtctgattct gacgctattc gccgcaggtt 4800
ctacatggac tgtgacattg aagtgacaga ctcgtacaaa acagatctag gtagactgga 4860
tgcagggcga gccgctaaac tgtgttctga aaataacact gcaaatttca aacgttgcag 4920
cccattagtg tgtgggaaag ccatccaact tagagataga aagtctaaag tcagatacag 4980
tgtggatacg gtggtttcag aacttattag ggaatacagc aataggtccg ccattggtaa 5040
cacaatcgag gctcttttcc aaggtccacc caagttcagg ccaattagga ttagccttga 5100
agaaaaacca gccccagacg ctattagcga tctccttgct agtgtagata gtgaagaagt 5160
gcgccagtac tgcagggatc aaggctggat tattcctgaa gctcccacca atgtggagcg 5220
gcaccttaat agagcggtgc tcgtcatgca atccatcacc acagtagtgg cggttgtttc 5280
gttggtgtac gtcatctaca agctctttgc agggtttcag ggtgcatatt ctggtgctcc 5340
taagcaagtg cttaagaaac ctgctcttcg cacagcaaca gtgcagggtc cgagccttga 5400
ctttgctctc tccctactga gaaggaacat caggcaggtc caaacagacc aagggcattt 5460
caccatgttg ggtgttaggg atcgcttagc agtcctccca cgccactcac aacctggcaa 5520
aaccatttgg attgagcaca aactcgtgaa cgtccttgat gcagttgaac tggtggatga 5580
gcaaggagtc aacctggaat taaccctcat cactcttgac accaacgaga agtttaggga 5640
tatcaccaaa ttcatcccag aaaatatcag cactgctagc gatgccaccc tagtgatcaa 5700
cacggagcac atgccgtcaa tgtttgtccc ggtgggtgac gttgtgcagt atggcttttt 5760
gaatctcagt ggcaagccta cccatcgcac catgatgtac aattttccta ctaaagcagg 5820
acagtgtgga ggagtggtga catctgttgg gaaggttgtc ggtattcaca ttggtggcaa 5880
tggcagacaa ggtttttgcg caggcctcaa aaggagttac tttgctagtg aacaaggaga 5940
gatccagtgg gttaagccca ataaagaaac tggaagactc aacatcaatg gaccaacccg 6000
caccaagtta gaacctagtg tattccatga catcttcgag ggaaataagg aaccagctgt 6060
cttgcacagt aaagaccccc gacttgaggt agattttgaa caggccctgt tctctaagta 6120
tgtgggaaac acactacatg agcctgacga gtacatcaaa gaggcagctc tacattatgc 6180
aaaccaatta aagcaactag aaatcaatac ctctcaaatg agcatggagg aggcctgcta 6240
tggtactgag aatcttgagg ctattgatct tcacactagt gcaggttacc cctatagtgc 6300
cctagggata aagaaaagag acatcttaga ccctaccacc agggacgtga gtagaatgaa 6360
gttctacatg gacaagtatg gtcttgatct tccctactcc acttatgtca aggacgagct 6420
acgctcgatt gataaaatca agaaagggaa gtcccgcctg atcgaggcca gtagtctaaa 6480
tgattcagtg tacctcagaa tggctttcgg gcatttgtat gaggctttcc acgcaaatcc 6540
tgggacgata actggatcgg ccgtggggtg taaccctgac acattctgga gcaagctgcc 6600
aattttgctc cctggttcac tctttgcctt tgactactca ggctatgatg ccagccttag 6660
ccctgtctgg ttcagagcat tagaattggt tcttagggag atagggtata gtgaagaggc 6720
aatctcactc attgagggaa tcaaccacac acatcatgtg tatcgtaata agacctattg 6780
cgtgcttggt gggatgccct caggctgttc aggaacatcc atcttcaact caatgatcaa 6840
caacattatt atcagagcac tgctcataaa aacatttaag ggcattgatt tggatgaact 6900
caacatggtc gcttatggag acgatgtgct cgctagctat cccttcccaa ttgattgctt 6960
ggaactagca aagactggta aggagtatgg tctgaccatg acccctgctg ataaatctcc 7020
ttgctttaat gaggtcaatt ggggtaatgc gaccttcctc aaaaggggct ttttgcccga 7080
tgaacagttt ccatttttga ttcaccctac tatgccaatg agggagatcc atgagtccat 7140
tcgatggacc aaggacgcac ggaacactca agatcatgtg cggtccttgt gcctcctagc 7200
atggcataat ggtaagcaag aatacgagaa gtttgtgagc acaattaggt ctgtcccagt 7260
agggagagcg ttggctattc caaattatga aaatcttaga cgaaattggc tcgagttatt 7320
ttagaggtta tacacacctc aaccccacca gaaatctggt cgtgaatgtg actggtgggg 7380
gtaaatttgt tataaccaga atagc 7405
<210> 3
<211> 1987
<212> DNA
<213> Artificial
<400> 3
agcgctagcg gagtgtatac tggcttacta tgttggcact gatgagggtg tcagtgaagt 60
gcttcatgtg gcaggagaaa aaaggctgca ccggtgcgtc agcagaatat gtgatacagg 120
atatattccg cttcctcgct cactgactcg ctacgctcgg tcgttcgact gcggcgagcg 180
gaaatggctt acgaacgggg cggagatttc ctggaagatg ccaggaagat acttaacagg 240
gaagtgagag ggccgcggca aagccgtttt tccataggct ccgcccccct gacaagcatc 300
acgaaatctg acgctcaaat cagtggtggc gaaacccgac aggactataa agataccagg 360
cgtttcccct ggcggctccc tcgtgcgctc tcctgttcct gcctttcggt ttaccggtgt 420
cattccgctg ttatggccgc gtttgtctca ttccacgcct gacactcagt tccgggtagg 480
cagttcgctc caagctggac tgtatgcacg aaccccccgt tcagtccgac cgctgcgcct 540
tatccggtaa ctatcgtctt gagtccaacc cggaaagaca tgcaaaagca ccactggcag 600
cagccactgg taattgattt agaggagtta gtcttgaagt catgcgccgg ttaaggctaa 660
actgaaagga caagttttgg tgactgcgct cctccaagcc agttacctcg gttcaaagag 720
ttggtagctc agagaacctt cgaaaaaccg ccctgcaagg cggttttttc gttttcagag 780
caagagatta cgcgcagacc aaaacgatct caagaagatc atcttattaa ggggtctgac 840
gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc 900
ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 960
taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 1020
ctatttcgtt catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag 1080
ggcttaccat ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca 1140
gatttatcag caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact 1200
ttatccgcct ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca 1260
gttaatagtt tgcgcaacgt tgttgccatt gctgcaggca tcgtggtgtc acgctcgtcg 1320
tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc 1380
atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg 1440
gccgcagtgt tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca 1500
tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt 1560
atgcggcgac cgagttgctc ttgcccggcg tcaacacggg ataataccgc gccacatagc 1620
agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc 1680
ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca 1740
tcttttactt tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa 1800
aagggaataa gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat 1860
tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa 1920
aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtgtcgacg 1980
cggccgc 1987
<210> 4
<211> 2193
<212> PRT
<213> Artificial
<400> 4
Met Gly Ser Gln Val Ser Thr Gln Arg Ser Gly Ser Tyr Glu Asn Ser
1 5 10 15
Asn Ser Ala Thr Glu Gly Ser Thr Ile Asn Tyr Thr Thr Ile Asn Tyr
20 25 30
Tyr Lys Asp Ser Tyr Ala Ala Thr Ala Gly Lys Gln Ser Leu Lys Gln
35 40 45
Asp Pro Asp Lys Phe Ala Asn Pro Val Lys Asp Ile Phe Thr Glu Met
50 55 60
Ala Ala Pro Leu Lys Ser Pro Ser Ala Glu Ala Cys Gly Tyr Ser Asp
65 70 75 80
Arg Val Ala Gln Leu Thr Ile Gly Asn Ser Thr Ile Thr Thr Gln Glu
85 90 95
Ala Ala Asn Ile Ile Val Gly Tyr Gly Glu Trp Pro Ser Tyr Cys Ser
100 105 110
Asp Ser Asp Ala Thr Ala Val Asp Lys Pro Thr Arg Pro Asp Val Ser
115 120 125
Val Asn Arg Phe Tyr Thr Leu Asp Thr Lys Leu Trp Glu Lys Ser Ser
130 135 140
Lys Gly Trp Tyr Trp Lys Phe Pro Asp Val Leu Thr Glu Thr Gly Val
145 150 155 160
Phe Gly Gln Asn Ala Gln Phe His Tyr Leu Tyr Arg Ser Gly Phe Cys
165 170 175
Ile His Val Gln Cys Asn Ala Ser Lys Phe His Gln Gly Ala Leu Leu
180 185 190
Val Ala Val Leu Pro Glu Tyr Val Ile Gly Thr Val Ala Gly Gly Thr
195 200 205
Gly Thr Glu Asp Thr His Pro Pro Tyr Lys Gln Thr Gln Pro Gly Ala
210 215 220
Asp Gly Phe Glu Leu Gln His Pro Tyr Val Leu Asp Ala Gly Ile Pro
225 230 235 240
Ile Ser Gln Leu Thr Val Cys Pro His Gln Trp Ile Asn Leu Arg Thr
245 250 255
Asn Asn Cys Ala Thr Ile Ile Val Pro Tyr Ile Asn Ala Leu Pro Phe
260 265 270
Asp Ser Ala Leu Asn His Cys Asn Phe Gly Leu Leu Val Val Pro Ile
275 280 285
Ser Pro Leu Asp Tyr Asp Gln Gly Ala Thr Pro Val Ile Pro Ile Thr
290 295 300
Ile Thr Leu Ala Pro Met Cys Ser Glu Phe Ala Gly Leu Arg Gln Ala
305 310 315 320
Val Thr Gln Gly Phe Pro Thr Glu Leu Lys Pro Gly Thr Asn Gln Phe
325 330 335
Leu Thr Thr Asp Asp Gly Val Ser Ala Pro Ile Leu Pro Asn Phe His
340 345 350
Pro Thr Pro Cys Ile His Ile Pro Gly Glu Val Arg Asn Leu Leu Glu
355 360 365
Leu Cys Gln Val Glu Thr Ile Leu Glu Val Asn Asn Val Pro Thr Asn
370 375 380
Ala Thr Ser Leu Met Glu Arg Leu Arg Phe Pro Val Ser Ala Gln Ala
385 390 395 400
Gly Lys Gly Glu Leu Cys Ala Val Phe Arg Ala Asp Pro Gly Arg Asn
405 410 415
Gly Pro Trp Gln Ser Thr Leu Leu Gly Gln Leu Cys Gly Tyr Tyr Thr
420 425 430
Gln Trp Ser Gly Ser Leu Glu Val Thr Phe Met Phe Thr Gly Ser Phe
435 440 445
Met Ala Thr Gly Lys Met Leu Ile Ala Tyr Thr Pro Pro Gly Gly Pro
450 455 460
Leu Pro Lys Asp Arg Ala Thr Ala Met Leu Gly Thr His Val Ile Trp
465 470 475 480
Asp Phe Gly Leu Gln Ser Ser Val Thr Leu Val Ile Pro Trp Ile Ser
485 490 495
Asn Thr His Tyr Arg Ala His Ala Arg Asp Gly Val Phe Asp Tyr Tyr
500 505 510
Thr Thr Gly Leu Val Ser Ile Trp Tyr Gln Thr Asn Tyr Val Val Pro
515 520 525
Ile Gly Ala Pro Asn Thr Ala Tyr Ile Ile Ala Leu Ala Ala Ala Gln
530 535 540
Lys Asn Phe Thr Met Lys Leu Cys Lys Asp Ala Ser Asp Ile Leu Gln
545 550 555 560
Thr Gly Thr Ile Gln Gly Asp Arg Val Ala Asp Val Ile Glu Ser Ser
565 570 575
Ile Gly Asp Ser Val Ser Arg Ala Leu Thr His Ala Leu Pro Ala Pro
580 585 590
Thr Gly Gln Asn Thr Gln Val Ser Ser His Arg Leu Asp Thr Gly Lys
595 600 605
Val Pro Ala Leu Gln Ala Ala Glu Ile Gly Ala Ser Ser Asn Ala Ser
610 615 620
Asp Glu Ser Met Ile Glu Thr Arg Cys Val Leu Asn Ser His Ser Thr
625 630 635 640
Ala Glu Thr Thr Leu Asp Ser Phe Phe Ser Arg Ala Gly Leu Val Gly
645 650 655
Glu Ile Asp Leu Pro Leu Glu Gly Thr Thr Asn Pro Asn Gly Tyr Ala
660 665 670
Asn Trp Asp Ile Asp Ile Thr Gly Tyr Ala Gln Met Arg Arg Lys Val
675 680 685
Glu Leu Phe Thr Tyr Met Arg Phe Asp Ala Glu Phe Thr Phe Val Ala
690 695 700
Cys Thr Pro Thr Gly Glu Val Val Pro Gln Leu Leu Gln Tyr Met Phe
705 710 715 720
Val Pro Pro Gly Ala Pro Lys Pro Asp Ser Arg Glu Ser Leu Ala Trp
725 730 735
Gln Thr Ala Thr Asn Pro Ser Val Phe Val Lys Leu Ser Asp Pro Pro
740 745 750
Ala Gln Val Ser Val Pro Phe Met Ser Pro Ala Ser Ala Tyr Gln Trp
755 760 765
Phe Tyr Asp Gly Tyr Pro Thr Phe Gly Glu His Lys Gln Glu Lys Asp
770 775 780
Leu Glu Tyr Gly Ala Cys Pro Asn Asn Met Met Gly Thr Phe Ser Val
785 790 795 800
Arg Thr Val Gly Thr Ser Lys Ser Lys Tyr Pro Leu Val Val Arg Ile
805 810 815
Tyr Met Arg Met Lys His Val Arg Ala Trp Ile Pro Arg Pro Met Arg
820 825 830
Asn Gln Asn Tyr Leu Phe Lys Ala Asn Pro Asn Tyr Ala Gly Asn Ser
835 840 845
Ile Lys Pro Thr Gly Ala Ser Arg Thr Ala Ile Thr Thr Leu Gly Lys
850 855 860
Phe Gly Gln Gln Ser Gly Ala Ile Tyr Val Gly Asn Phe Arg Val Val
865 870 875 880
Asn Arg His Leu Ala Thr His Asn Asp Trp Ala Asn Leu Val Trp Glu
885 890 895
Asp Ser Ser Arg Asp Leu Leu Val Ser Ser Thr Thr Ala Gln Gly Cys
900 905 910
Asp Thr Ile Ala Arg Cys Asp Cys Gln Thr Gly Val Tyr Tyr Cys Asn
915 920 925
Ser Met Arg Lys His Tyr Pro Val Ser Phe Ser Lys Pro Ser Leu Ile
930 935 940
Tyr Val Glu Ala Ser Glu Tyr Tyr Pro Ala Arg Tyr Gln Ser His Leu
945 950 955 960
Met Leu Ala Gln Gly His Ser Glu Pro Gly Asp Cys Gly Gly Ile Leu
965 970 975
Arg Cys Gln His Gly Val Ile Gly Ile Val Ser Thr Gly Gly Asn Gly
980 985 990
Leu Val Gly Phe Ala Asp Val Arg Asp Leu Leu Trp Leu Asp Glu Glu
995 1000 1005
Ala Met Glu Gln Gly Val Ser Asp Tyr Ile Lys Gly Leu Gly Asp Ala
1010 1015 1020
Phe Gly Thr Gly Phe Thr Asp Ala Val Ser Arg Glu Val Glu Ala Leu
1025 1030 1035 1040
Lys Asn Tyr Leu Ile Gly Ser Glu Gly Ala Val Glu Lys Ile Leu Lys
1045 1050 1055
Asn Leu Ile Lys Leu Ile Ser Ala Leu Val Ile Val Ile Arg Ser Asp
1060 1065 1070
Tyr Asp Met Val Thr Leu Thr Ala Thr Leu Ala Leu Ile Gly Cys His
1075 1080 1085
Gly Ser Pro Trp Ala Trp Ile Lys Ala Lys Thr Ala Ser Ile Leu Gly
1090 1095 1100
Ile Pro Ile Ala Gln Lys Gln Ser Ala Ser Trp Leu Lys Lys Phe Asn
1105 1110 1115 1120
Asp Met Ala Asn Ala Ala Lys Gly Leu Glu Trp Val Ser Asn Lys Ile
1125 1130 1135
Ser Lys Phe Ile Asp Trp Leu Lys Glu Lys Ile Val Pro Ala Ala Arg
1140 1145 1150
Glu Lys Val Glu Phe Leu Asn Asn Leu Lys Gln Leu Pro Leu Leu Glu
1155 1160 1165
Asn Gln Ile Ser Asn Leu Glu Gln Ser Ala Ala Ser Gln Glu Asp Leu
1170 1175 1180
Glu Val Met Phe Gly Asn Val Ser Tyr Leu Ala His Phe Cys Arg Lys
1185 1190 1195 1200
Phe Gln Pro Leu Tyr Ala Thr Glu Ala Lys Arg Val Tyr Ala Leu Glu
1205 1210 1215
Lys Arg Met Asn Asn Tyr Met Gln Phe Lys Ser Lys His Arg Ile Glu
1220 1225 1230
Pro Val Cys Leu Ile Ile Arg Gly Ser Pro Gly Thr Gly Lys Ser Leu
1235 1240 1245
Ala Thr Gly Ile Ile Ala Arg Ala Ile Ala Asp Lys Tyr His Ser Ser
1250 1255 1260
Val Tyr Ser Leu Pro Pro Asp Pro Asp His Phe Asp Gly Tyr Lys Gln
1265 1270 1275 1280
Gln Val Val Thr Val Met Asp Asp Leu Cys Gln Asn Pro Asp Gly LysGln Val Val Thr Val Met Asp Asp Asp Leu Cys Gln Asn Pro Asp Gly Lys
1285 1290 1295
Asp Met Ser Leu Phe Cys Gln Met Val Ser Thr Val Asp Phe Ile Pro
1300 1305 1310
Pro Met Ala Ser Leu Glu Glu Lys Gly Val Ser Phe Thr Ser Lys Phe
1315 1320 1325
Val Ile Ala Ser Thr Asn Ala Ser Asn Ile Ile Val Pro Thr Val Ser
1330 1335 1340
Asp Ser Asp Ala Ile Arg Arg Arg Phe Tyr Met Asp Cys Asp Ile Glu
1345 1350 1355 1360
Val Thr Asp Ser Tyr Lys Thr Asp Leu Gly Arg Leu Asp Ala Gly Arg
1365 1370 1375
Ala Ala Lys Leu Cys Ser Glu Asn Asn Thr Ala Asn Phe Lys Arg Cys
1380 1385 1390
Ser Pro Leu Val Cys Gly Lys Ala Ile Gln Leu Arg Asp Arg Lys Ser
1395 1400 1405
Lys Val Arg Tyr Ser Val Asp Thr Val Val Ser Glu Leu Ile Arg Glu
1410 1415 1420
Tyr Ser Asn Arg Ser Ala Ile Gly Asn Thr Ile Glu Ala Leu Phe Gln
1425 1430 1435 1440
Gly Pro Pro Lys Phe Arg Pro Ile Arg Ile Ser Leu Glu Glu Lys Pro
1445 1450 1455
Ala Pro Asp Ala Ile Ser Asp Leu Leu Ala Ser Val Asp Ser Glu Glu
1460 1465 1470
Val Arg Gln Tyr Cys Arg Asp Gln Gly Trp Ile Ile Pro Glu Ala Pro
1475 1480 1485
Thr Asn Val Glu Arg His Leu Asn Arg Ala Val Leu Val Met Gln Ser
1490 1495 1500
Ile Thr Thr Val Val Ala Val Val Ser Leu Val Tyr Val Ile Tyr Lys
1505 1510 1515 1520
Leu Phe Ala Gly Phe Gln Gly Ala Tyr Ser Gly Ala Pro Lys Gln Val
1525 1530 1535
Leu Lys Lys Pro Ala Leu Arg Thr Ala Thr Val Gln Gly Pro Ser Leu
1540 1545 1550
Asp Phe Ala Leu Ser Leu Leu Arg Arg Asn Ile Arg Gln Val Gln Thr
1555 1560 1565
Asp Gln Gly His Phe Thr Met Leu Gly Val Arg Asp Arg Leu Ala Val
1570 1575 1580
Leu Pro Arg His Ser Gln Pro Gly Lys Thr Ile Trp Ile Glu His Lys
1585 1590 1595 1600
Leu Val Asn Val Leu Asp Ala Val Glu Leu Val Asp Glu Gln Gly Val
1605 1610 1615
Asn Leu Glu Leu Thr Leu Ile Thr Leu Asp Thr Asn Glu Lys Phe Arg
1620 1625 1630
Asp Ile Thr Lys Phe Ile Pro Glu Asn Ile Ser Thr Ala Ser Asp Ala
1635 1640 1645
Thr Leu Val Ile Asn Thr Glu His Met Pro Ser Met Phe Val Pro Val
1650 1655 1660
Gly Asp Val Val Gln Tyr Gly Phe Leu Asn Leu Ser Gly Lys Pro Thr
1665 1670 1675 1680
His Arg Thr Met Met Tyr Asn Phe Pro Thr Lys Ala Gly Gln Cys Gly
1685 1690 1695
Gly Val Val Thr Ser Val Gly Lys Val Val Gly Ile His Ile Gly Gly
1700 1705 1710
Asn Gly Arg Gln Gly Phe Cys Ala Gly Leu Lys Arg Ser Tyr Phe Ala
1715 1720 1725
Ser Glu Gln Gly Glu Ile Gln Trp Val Lys Pro Asn Lys Glu Thr Gly
1730 1735 1740
Arg Leu Asn Ile Asn Gly Pro Thr Arg Thr Lys Leu Glu Pro Ser Val
1745 1750 1755 1760
Phe His Asp Ile Phe Glu Gly Asn Lys Glu Pro Ala Val Leu His Ser
1765 1770 1775
Lys Asp Pro Arg Leu Glu Val Asp Phe Glu Gln Ala Leu Phe Ser Lys
1780 1785 1790
Tyr Val Gly Asn Thr Leu His Glu Pro Asp Glu Tyr Ile Lys Glu Ala
1795 1800 1805
Ala Leu His Tyr Ala Asn Gln Leu Lys Gln Leu Glu Ile Asn Thr Ser
1810 1815 1820
Gln Met Ser Met Glu Glu Ala Cys Tyr Gly Thr Glu Asn Leu Glu Ala
1825 1830 1835 1840
Ile Asp Leu His Thr Ser Ala Gly Tyr Pro Tyr Ser Ala Leu Gly Ile
1845 1850 1855
Lys Lys Arg Asp Ile Leu Asp Pro Thr Thr Arg Asp Val Ser Arg Met
1860 1865 1870
Lys Phe Tyr Met Asp Lys Tyr Gly Leu Asp Leu Pro Tyr Ser Thr Tyr
1875 1880 1885
Val Lys Asp Glu Leu Arg Ser Ile Asp Lys Ile Lys Lys Gly Lys Ser
1890 1895 1900
Arg Leu Ile Glu Ala Ser Ser Leu Asn Asp Ser Val Tyr Leu Arg Met
1905 1910 1915 1920
Ala Phe Gly His Leu Tyr Glu Ala Phe His Ala Asn Pro Gly Thr Ile
1925 1930 1935
Thr Gly Ser Ala Val Gly Cys Asn Pro Asp Thr Phe Trp Ser Lys Leu
1940 1945 1950
Pro Ile Leu Leu Pro Gly Ser Leu Phe Ala Phe Asp Tyr Ser Gly Tyr
1955 1960 1965
Asp Ala Ser Leu Ser Pro Val Trp Phe Arg Ala Leu Glu Leu Val Leu
1970 1975 1980
Arg Glu Ile Gly Tyr Ser Glu Glu Ala Ile Ser Leu Ile Glu Gly Ile
1985 1990 1995 2000
Asn His Thr His His Val Tyr Arg Asn Lys Thr Tyr Cys Val Leu Gly
2005 2010 2015
Gly Met Pro Ser Gly Cys Ser Gly Thr Ser Ile Phe Asn Ser Met Ile
2020 2025 2030
Asn Asn Ile Ile Ile Arg Ala Leu Leu Ile Lys Thr Phe Lys Gly Ile
2035 2040 2045
Asp Leu Asp Glu Leu Asn Met Val Ala Tyr Gly Asp Asp Val Leu Ala
2050 2055 2060
Ser Tyr Pro Phe Pro Ile Asp Cys Leu Glu Leu Ala Lys Thr Gly Lys
2065 2070 2075 2080
Glu Tyr Gly Leu Thr Met Thr Pro Ala Asp Lys Ser Pro Cys Phe Asn
2085 2090 2095
Glu Val Asn Trp Gly Asn Ala Thr Phe Leu Lys Arg Gly Phe Leu Pro
2100 2105 2110
Asp Glu Gln Phe Pro Phe Leu Ile His Pro Thr Met Pro Met Arg Glu
2115 2120 2125
Ile His Glu Ser Ile Arg Trp Thr Lys Asp Ala Arg Asn Thr Gln Asp
2130 2135 2140
His Val Arg Ser Leu Cys Leu Leu Ala Trp His Asn Gly Lys Gln Glu
2145 2150 2155 2160
Tyr Glu Lys Phe Val Ser Thr Ile Arg Ser Val Pro Val Gly Arg Ala
2165 2170 2175
Leu Ala Ile Pro Asn Tyr Glu Asn Leu Arg Arg Asn Trp Leu Glu Leu
2180 2185 2190
Phe
<210> 5
<211> 9982
<212> DNA
<213> Artificial
<400> 5
gctagcggag tgtatactgg cttactatgt tggcactgat gagggtgtca gtgaagtgct 60
tcatgtggca ggagaaaaaa ggctgcaccg gtgcgtcagc agaatatgtg atacaggata 120
tattccgctt cctcgctcac tgactcgcta cgctcggtcg ttcgactgcg gcgagcggaa 180
atggcttacg aacggggcgg agatttcctg gaagatgcca ggaagatact taacagggaa 240
gtgagagggc cgcggcaaag ccgtttttcc ataggctccg cccccctgac aagcatcacg 300
aaatctgacg ctcaaatcag tggtggcgaa acccgacagg actataaaga taccaggcgt 360
ttcccctggc ggctccctcg tgcgctctcc tgttcctgcc tttcggttta ccggtgtcat 420
tccgctgtta tggccgcgtt tgtctcattc cacgcctgac actcagttcc gggtaggcag 480
ttcgctccaa gctggactgt atgcacgaac cccccgttca gtccgaccgc tgcgccttat 540
ccggtaacta tcgtcttgag tccaacccgg aaagacatgc aaaagcacca ctggcagcag 600
ccactggtaa ttgatttaga ggagttagtc ttgaagtcat gcgccggtta aggctaaact 660
gaaaggacaa gttttggtga ctgcgctcct ccaagccagt tacctcggtt caaagagttg 720
gtagctcaga gaaccttcga aaaaccgccc tgcaaggcgg ttttttcgtt ttcagagcaa 780
gagattacgc gcagaccaaa acgatctcaa gaagatcatc ttattaaggg gtctgacgct 840
cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 900
acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa 960
acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta 1020
tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc 1080
ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat 1140
ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta 1200
tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt 1260
aatagtttgc gcaacgttgt tgccattgct gcaggcatcg tggtgtcacg ctcgtcgttt 1320
ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg 1380
ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc 1440
gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc 1500
gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg 1560
cggcgaccga gttgctcttg cccggcgtca acacgggata ataccgcgcc acatagcaga 1620
actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta 1680
ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct 1740
tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag 1800
ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca atattattga 1860
agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat 1920
aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt gtcgacgcgg 1980
ccgctaatac gactcactat aggttaaaac agcctgtggg ttgcacccac tcacagggcc 2040
tactgggcgc aagcactctg gtacctcggt acctttgtgc gcctgtttta cacccccccc 2100
ccaatgaaac ttagaagcaa taaaccacga tcaatagcag gcataacgct ccagttatgt 2160
cttgatcaag cacttctgtt tccccggact gagtatcaat agactgctcg cgcggttgaa 2220
ggagaaaacg ttcgttatcc ggctaactac ttcggaaaac ctagtaacac catgaaagtt 2280
gcggagagct tcgttcagca ctcccccagt gtagatcagg tcgatgagtc accgcgttcc 2340
ccacgggcga ccgtggcggt ggctgcgttg gcggcctgcc catggggtaa cccatggggc 2400
gctctaatac ggacatggtg tgaagagtct actgagctag ttggtagtcc tccggcccct 2460
gaatgcggct aatcccaact gcggagcaca cgcccacaag ccagcgggta gtgtgtcgta 2520
acgggtaact ctgcagcgga accgactact ttgggtgtcc gtgtttcctt ttatctttat 2580
attggctgct tatggtgaca attaaagaat tgttaccata tagctattgg attagccatc 2640
cggtgtgcaa cagagcaatt atttacctat ttattggttt tgtaccatta acctcgaatt 2700
ctgtgaccac ccttaattat atcttgaccc ttaacacagc taaactctag aatggtcttc 2760
acactcgaag atttcgttgg ggactggcga cagacagccg gctacaacct ggaccaagtc 2820
cttgaacagg gaggtgtgtc cagtttgttt cagaatctcg gggtgtccgt aactccgatc 2880
caaaggattg tcctgagcgg tgaaaatggg ctgaagatcg acatccatgt catcatcccg 2940
tatgaaggtc tgagcggcga ccaaatgggc cagatcgaaa aaatttttaa ggtggtgtac 3000
cctgtggatg atcatcactt taaggtgatc ctgcactatg gcacactggt aatcgacggg 3060
gttacgccga acatgatcga ctatttcgga cggccgtatg aaggcatcgc cgtgttcgac 3120
ggcaaaaaga tcactgtaac agggaccctg tggaacggca acaaaattat cgacgagcgc 3180
ctgatcaacc ccgacggctc cctgctgttc cgagtaacca tcaacggagt gaccggctgg 3240
cggctgtgcg aacgcattct ggcgatgcat gcgatcacca ctcttggttc gcaagtgtct 3300
acacagcgct ccggttctta cgaaaactca aactcagcca ctgagggttc taccataaac 3360
tacaccacca ttaattacta caaagactcc tatgctgcca cagcaggcaa acagagtctc 3420
aagcaggatc cagacaagtt tgcaaatcct gttaaagaca tattcaccga aatggcagcg 3480
ccactgaagt ccccatccgc tgaggcatgt ggatacagtg atcgagtggc gcaattaact 3540
attggcaact ccaccatcac gacgcaagaa gcggctaaca tcatagtcgg ctatggtgag 3600
tggccttcct actgctcaga ttctgacgct acagcagtgg ataaaccaac gcgcccggat 3660
gtttcagtga acaggtttta cacattggac actaaattgt gggagaaatc gtccaaggga 3720
tggtactgga agttcccgga tgtgttaact gaaactgggg tttttgggca aaatgcacaa 3780
ttccactacc tctaccgatc agggttctgc atccacgtgc agtgcaatgc cagtaaattc 3840
caccaaggag cactcctagt cgctgtccta ccagagtatg tcattgggac agtggcaggc 3900
ggtacaggga cggaagacac ccaccccccc tacaagcaga cccaacccgg cgccgatggt 3960
ttcgagttgc aacacccgta cgtgcttgat gctggcatcc caatatcaca gttaacagtg 4020
tgcccacacc agtggattaa tttgaggacc aacaattgtg ctacaataat agtgccatac 4080
attaacgcac tgccttttga ttctgccttg aaccattgca actttggcct gttagttgtg 4140
cctattagcc cactagacta cgaccaagga gcaacgccag taatccctat aactatcaca 4200
ttggccccaa tgtgctctga attcgcaggt cttaggcagg cagtcacgca agggttcccc 4260
accgagctaa aacctggcac aaatcaattt ttaaccaccg atgatggcgt ctcagcacct 4320
attctaccaa acttccaccc caccccgtgt atccacatac ctggtgaagt taggaacttg 4380
ctagagttat gccaggtgga gaccattctg gaggttaaca atgtgcccac gaatgccact 4440
agcttaatgg agagactgcg cttcccggtc tcagcacaag cagggaaagg tgaactgtgt 4500
gcggtgttta gagccgatcc tgggcgaaat ggaccatggc aatccacctt actgggccag 4560
ttgtgcgggt actacaccca atggtcaggg tcattggaag tcaccttcat gtttactgga 4620
tccttcatgg ctaccggcaa gatgctcata gcctatacac cgccaggggg tcctctgccc 4680
aaggaccggg cgaccgccat gttgggcacg cacgtcatct gggattttgg gctgcaatcg 4740
tctgttaccc ttgtaatacc atggatcagt aacactcatt atagagcaca tgcccgagat 4800
ggagtgtttg actattacac tacagggtta gtcagtatat ggtaccagac aaattacgtg 4860
gttccaatcg gtgcgcccaa cacagcctat ataatagcac tagcggcagc ccaaaagaac 4920
ttcactatga aattgtgcaa ggatgctagt gatatcctgc agacgggcac catccaggga 4980
gatagggtgg cagatgtaat tgaaagttcc ataggagata gcgtgagcag agccctcact 5040
cacgctctac cagcacccac aggccaaaac acacaggtga gcagtcatcg actggataca 5100
ggcaaggttc cagcactcca agctgctgaa attggggcat catcaaatgc tagtgacgag 5160
agcatgattg aaacacgttg tgttcttaac tcgcatagta cagctgagac cactcttgat 5220
agtttcttca gtagggcagg attagttgga gagatagatc tccctcttga gggcacaact 5280
aacccaaatg gttatgccaa ctgggacata gatataacag gttacgcgca aatgcgtaga 5340
aaggtagagc tattcaccta catgcgtttt gatgcagagt tcacttttgt tgcgtgcaca 5400
cccaccgggg aggttgtccc acaattgctc caatatatgt ttgtgccacc tggagcccct 5460
aagccagatt ctagggaatc ccttgcatgg caaaccgcca ccaacccctc agtttttgtc 5520
aagctgtcag accctccggc gcaggtttca gtgccattca tgtcacctgc gagtgcttat 5580
caatggtttt atgacggata tcccacattc ggagaacaca aacaggagaa agaccttgaa 5640
tacggggcat gtcctaataa catgatgggt acattctcag tgcggactgt ggggacctcc 5700
aagtccaagt accctttagt ggttaggatt tacatgagaa tgaagcacgt cagggcgtgg 5760
atacctcgcc cgatgcgcaa ccagaactac ctgttcaaag ccaacccaaa ttatgctggc 5820
aactctatta agccaactgg tgccagtcgc acagcgatca ccactcttgg gaaatttgga 5880
caacagtctg gggctattta tgtgggcaac tttagagtgg tcaaccgaca tcttgccacc 5940
cataatgatt gggcaaatct tgtttgggaa gacagctctc gcgacttgct cgtgtcatcc 6000
accactgccc aaggttgtga cacgattgcc cgttgcgatt gccagacagg ggtgtactac 6060
tgtaactcga tgagaaaaca ctacccagtc agtttttcaa aacccagcct gatctatgta 6120
gaggctagcg agtattaccc agccaggtac caatcacatc tcatgctcgc acagggtcac 6180
tcggaacctg gtgattgcgg tggtatcctt aggtgccaac atggcgtcat cggcatagtg 6240
tctactggtg gcaatgggct cgttggcttt gcagacgtca gagacctctt gtggttagat 6300
gaagaagcta tggaacaggg cgtgtccgac tacattaagg gtctcggaga tgcttttgga 6360
acaggcttca ctgacgcagt ctcaagggag gttgaagctc tcaagaacta tcttataggg 6420
tctgaaggag cagttgagaa aattttgaaa aatcttatta aactaatctc tgcactggtg 6480
attgtgatca gaagtgatta cgacatggtt accctcactg caaccttagc gctgataggt 6540
tgtcatggca gtccttgggc ttggattaaa gccaaaacag cctccatctt aggtatccct 6600
atcgcccaaa agcagagcgc ttcctggctc aagaagttca atgacatggc caacgccgct 6660
aaggggttag agtgggtttc caacaagatc agcaaattta ttgattggct taaggagaaa 6720
atagtaccag cagccaggga gaaggttgaa ttcctaaata acttgaaaca gctgccactg 6780
ctagagaatc agatctcgaa cttggaacaa tctgctgctt cacaagagga ccttgaagtc 6840
atgtttggga atgtgtcgta cctagctcac ttctgtcgca agtttcaacc gctatacgcc 6900
acggaagcta aaagagtcta tgccctggag aagagaatga ataactatat gcagttcaag 6960
agcaaacacc gaattgaacc tgtatgtctc attattaggg gctcaccagg caccgggaag 7020
tctctagcca ctggtattat tgctcgagca atcgctgata agtaccactc cagcgtgtac 7080
tcgctcccac cagacccgga tcattttgac ggttacaagc aacaggtggt tacagtgatg 7140
gatgatttgt gtcaaaaccc cgatggtaag gatatgtcct tattctgtca aatggtatcc 7200
accgtagatt tcattccacc aatggcttct ctcgaggaga agggagtttc cttcacctct 7260
aagtttgtca tcgcatccac taatgccagt aatatcatag taccaacagt gtctgattct 7320
gacgctattc gccgcaggtt ctacatggac tgtgacattg aagtgacaga ctcgtacaaa 7380
acagatctag gtagactgga tgcagggcga gccgctaaac tgtgttctga aaataacact 7440
gcaaatttca aacgttgcag cccattagtg tgtgggaaag ccatccaact tagagataga 7500
aagtctaaag tcagatacag tgtggatacg gtggtttcag aacttattag ggaatacagc 7560
aataggtccg ccattggtaa cacaatcgag gctcttttcc aaggtccacc caagttcagg 7620
ccaattagga ttagccttga agaaaaacca gccccagacg ctattagcga tctccttgct 7680
agtgtagata gtgaagaagt gcgccagtac tgcagggatc aaggctggat tattcctgaa 7740
gctcccacca atgtggagcg gcaccttaat agagcggtgc tcgtcatgca atccatcacc 7800
acagtagtgg cggttgtttc gttggtgtac gtcatctaca agctctttgc agggtttcag 7860
ggtgcatatt ctggtgctcc taagcaagtg cttaagaaac ctgctcttcg cacagcaaca 7920
gtgcagggtc cgagccttga ctttgctctc tccctactga gaaggaacat caggcaggtc 7980
caaacagacc aagggcattt caccatgttg ggtgttaggg atcgcttagc agtcctccca 8040
cgccactcac aacctggcaa aaccatttgg attgagcaca aactcgtgaa cgtccttgat 8100
gcagttgaac tggtggatga gcaaggagtc aacctggaat taaccctcat cactcttgac 8160
accaacgaga agtttaggga tatcaccaaa ttcatcccag aaaatatcag cactgctagc 8220
gatgccaccc tagtgatcaa cacggagcac atgccgtcaa tgtttgtccc ggtgggtgac 8280
gttgtgcagt atggcttttt gaatctcagt ggcaagccta cccatcgcac catgatgtac 8340
aattttccta ctaaagcagg acagtgtgga ggagtggtga catctgttgg gaaggttgtc 8400
ggtattcaca ttggtggcaa tggcagacaa ggtttttgcg caggcctcaa aaggagttac 8460
tttgctagtg aacaaggaga gatccagtgg gttaagccca ataaagaaac tggaagactc 8520
aacatcaatg gaccaacccg caccaagtta gaacctagtg tattccatga catcttcgag 8580
ggaaataagg aaccagctgt cttgcacagt aaagaccccc gacttgaggt agattttgaa 8640
caggccctgt tctctaagta tgtgggaaac acactacatg agcctgacga gtacatcaaa 8700
gaggcagctc tacattatgc aaaccaatta aagcaactag aaatcaatac ctctcaaatg 8760
agcatggagg aggcctgcta tggtactgag aatcttgagg ctattgatct tcacactagt 8820
gcaggttacc cctatagtgc cctagggata aagaaaagag acatcttaga ccctaccacc 8880
agggacgtga gtagaatgaa gttctacatg gacaagtatg gtcttgatct tccctactcc 8940
acttatgtca aggacgagct acgctcgatt gataaaatca agaaagggaa gtcccgcctg 9000
atcgaggcca gtagtctaaa tgattcagtg tacctcagaa tggctttcgg gcatttgtat 9060
gaggctttcc acgcaaatcc tgggacgata actggatcgg ccgtggggtg taaccctgac 9120
acattctgga gcaagctgcc aattttgctc cctggttcac tctttgcctt tgactactca 9180
ggctatgatg ccagccttag ccctgtctgg ttcagagcat tagaattggt tcttagggag 9240
atagggtata gtgaagaggc aatctcactc attgagggaa tcaaccacac acatcatgtg 9300
tatcgtaata agacctattg cgtgcttggt gggatgccct caggctgttc aggaacatcc 9360
atcttcaact caatgatcaa caacattatt atcagagcac tgctcataaa aacatttaag 9420
ggcattgatt tggatgaact caacatggtc gcttatggag acgatgtgct cgctagctat 9480
cccttcccaa ttgattgctt ggaactagca aagactggta aggagtatgg tctgaccatg 9540
acccctgctg ataaatctcc ttgctttaat gaggtcaatt ggggtaatgc gaccttcctc 9600
aaaaggggct ttttgcccga tgaacagttt ccatttttga ttcaccctac tatgccaatg 9660
agggagatcc atgagtccat tcgatggacc aaggacgcac ggaacactca agatcatgtg 9720
cggtccttgt gcctcctagc atggcataat ggtaagcaag aatacgagaa gtttgtgagc 9780
acaattaggt ctgtcccagt agggagagcg ttggctattc caaattatga aaatcttaga 9840
cgaaattggc tcgagttatt ttagaggtta tacacacctc aaccccacca gaaatctggt 9900
cgtgaatgtg actggtgggg gtaaatttgt tataaccaga atagcaaaaa aaaaaaaaaa 9960
aaaaaaaaaa aaaaaaagct ta 9982
<210> 6
<211> 10187
<212> DNA
<213> Artificial
<400> 6
gctagcggag tgtatactgg cttactatgt tggcactgat gagggtgtca gtgaagtgct 60
tcatgtggca ggagaaaaaa ggctgcaccg gtgcgtcagc agaatatgtg atacaggata 120
tattccgctt cctcgctcac tgactcgcta cgctcggtcg ttcgactgcg gcgagcggaa 180
atggcttacg aacggggcgg agatttcctg gaagatgcca ggaagatact taacagggaa 240
gtgagagggc cgcggcaaag ccgtttttcc ataggctccg cccccctgac aagcatcacg 300
aaatctgacg ctcaaatcag tggtggcgaa acccgacagg actataaaga taccaggcgt 360
ttcccctggc ggctccctcg tgcgctctcc tgttcctgcc tttcggttta ccggtgtcat 420
tccgctgtta tggccgcgtt tgtctcattc cacgcctgac actcagttcc gggtaggcag 480
ttcgctccaa gctggactgt atgcacgaac cccccgttca gtccgaccgc tgcgccttat 540
ccggtaacta tcgtcttgag tccaacccgg aaagacatgc aaaagcacca ctggcagcag 600
ccactggtaa ttgatttaga ggagttagtc ttgaagtcat gcgccggtta aggctaaact 660
gaaaggacaa gttttggtga ctgcgctcct ccaagccagt tacctcggtt caaagagttg 720
gtagctcaga gaaccttcga aaaaccgccc tgcaaggcgg ttttttcgtt ttcagagcaa 780
gagattacgc gcagaccaaa acgatctcaa gaagatcatc ttattaaggg gtctgacgct 840
cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 900
acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa 960
acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta 1020
tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc 1080
ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat 1140
ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta 1200
tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt 1260
aatagtttgc gcaacgttgt tgccattgct gcaggcatcg tggtgtcacg ctcgtcgttt 1320
ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg 1380
ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc 1440
gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc 1500
gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg 1560
cggcgaccga gttgctcttg cccggcgtca acacgggata ataccgcgcc acatagcaga 1620
actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta 1680
ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct 1740
tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag 1800
ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca atattattga 1860
agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat 1920
aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt gtcgacgcgg 1980
ccgctaatac gactcactat aggttaaaac agcctgtggg ttgcacccac tcacagggcc 2040
tactgggcgc aagcactctg gtacctcggt acctttgtgc gcctgtttta cacccccccc 2100
ccaatgaaac ttagaagcaa taaaccacga tcaatagcag gcataacgct ccagttatgt 2160
cttgatcaag cacttctgtt tccccggact gagtatcaat agactgctcg cgcggttgaa 2220
ggagaaaacg ttcgttatcc ggctaactac ttcggaaaac ctagtaacac catgaaagtt 2280
gcggagagct tcgttcagca ctcccccagt gtagatcagg tcgatgagtc accgcgttcc 2340
ccacgggcga ccgtggcggt ggctgcgttg gcggcctgcc catggggtaa cccatggggc 2400
gctctaatac ggacatggtg tgaagagtct actgagctag ttggtagtcc tccggcccct 2460
gaatgcggct aatcccaact gcggagcaca cgcccacaag ccagcgggta gtgtgtcgta 2520
acgggtaact ctgcagcgga accgactact ttgggtgtcc gtgtttcctt ttatctttat 2580
attggctgct tatggtgaca attaaagaat tgttaccata tagctattgg attagccatc 2640
cggtgtgcaa cagagcaatt atttacctat ttattggttt tgtaccatta acctcgaatt 2700
ctgtgaccac ccttaattat atcttgaccc ttaacacagc taaaccatat gatggtgagc 2760
aagggcgagg agctgttcac cggggtggtg cccatcctgg tcgagctgga cggcgacgta 2820
aacggccaca agttcagcgt gtccggcgag ggcgagggcg atgccaccta cggcaagctg 2880
accctgaagt tcatctgcac caccggcaag ctgcccgtgc cctggcccac cctcgtgacc 2940
accctgacct acggcgtgca gtgcttcagc cgctaccccg accacatgaa gcagcacgac 3000
ttcttcaagt ccgccatgcc cgaaggctac gtccaggagc gcaccatctt cttcaaggac 3060
gacggcaact acaagacccg cgccgaggtg aagttcgagg gcgacaccct ggtgaaccgc 3120
atcgagctga agggcatcga cttcaaggag gacggcaaca tcctggggca caagctggag 3180
tacaactaca acagccacaa cgtctatatc atggccgaca agcagaagaa cggcatcaag 3240
gtgaacttca agatccgcca caacatcgag gacggcagcg tgcagctcgc cgaccactac 3300
cagcagaaca cccccatcgg cgacggcccc gtgctgctgc ccgacaacca ctacctgagc 3360
acccagtccg ccctgagcaa agaccccaac gagaagcgcg atcacatggt cctgctggag 3420
ttcgtgaccg ccgccgggat cactctcggc atggacgagc tgtacaagat gcatgcgatc 3480
accactcttg gttcgcaagt gtctacacag cgctccggtt cttacgaaaa ctcaaactca 3540
gccactgagg gttctaccat aaactacacc accattaatt actacaaaga ctcctatgct 3600
gccacagcag gcaaacagag tctcaagcag gatccagaca agtttgcaaa tcctgttaaa 3660
gacatattca ccgaaatggc agcgccactg aagtccccat ccgctgaggc atgtggatac 3720
agtgatcgag tggcgcaatt aactattggc aactccacca tcacgacgca agaagcggct 3780
aacatcatag tcggctatgg tgagtggcct tcctactgct cagattctga cgctacagca 3840
gtggataaac caacgcgccc ggatgtttca gtgaacaggt tttacacatt ggacactaaa 3900
ttgtgggaga aatcgtccaa gggatggtac tggaagttcc cggatgtgtt aactgaaact 3960
ggggtttttg ggcaaaatgc acaattccac tacctctacc gatcagggtt ctgcatccac 4020
gtgcagtgca atgccagtaa attccaccaa ggagcactcc tagtcgctgt cctaccagag 4080
tatgtcattg ggacagtggc aggcggtaca gggacggaag acacccaccc cccctacaag 4140
cagacccaac ccggcgccga tggtttcgag ttgcaacacc cgtacgtgct tgatgctggc 4200
atcccaatat cacagttaac agtgtgccca caccagtgga ttaatttgag gaccaacaat 4260
tgtgctacaa taatagtgcc atacattaac gcactgcctt ttgattctgc cttgaaccat 4320
tgcaactttg gcctgttagt tgtgcctatt agcccactag actacgacca aggagcaacg 4380
ccagtaatcc ctataactat cacattggcc ccaatgtgct ctgaattcgc aggtcttagg 4440
caggcagtca cgcaagggtt ccccaccgag ctaaaacctg gcacaaatca atttttaacc 4500
accgatgatg gcgtctcagc acctattcta ccaaacttcc accccacccc gtgtatccac 4560
atacctggtg aagttaggaa cttgctagag ttatgccagg tggagaccat tctggaggtt 4620
aacaatgtgc ccacgaatgc cactagctta atggagagac tgcgcttccc ggtctcagca 4680
caagcaggga aaggtgaact gtgtgcggtg tttagagccg atcctgggcg aaatggacca 4740
tggcaatcca ccttactggg ccagttgtgc gggtactaca cccaatggtc agggtcattg 4800
gaagtcacct tcatgtttac tggatccttc atggctaccg gcaagatgct catagcctat 4860
acaccgccag ggggtcctct gcccaaggac cgggcgaccg ccatgttggg cacgcacgtc 4920
atctgggatt ttgggctgca atcgtctgtt acccttgtaa taccatggat cagtaacact 4980
cattatagag cacatgcccg agatggagtg tttgactatt acactacagg gttagtcagt 5040
atatggtacc agacaaatta cgtggttcca atcggtgcgc ccaacacagc ctatataata 5100
gcactagcgg cagcccaaaa gaacttcact atgaaattgt gcaaggatgc tagtgatatc 5160
ctgcagacgg gcaccatcca gggagatagg gtggcagatg taattgaaag ttccatagga 5220
gatagcgtga gcagagccct cactcacgct ctaccagcac ccacaggcca aaacacacag 5280
gtgagcagtc atcgactgga tacaggcaag gttccagcac tccaagctgc tgaaattggg 5340
gcatcatcaa atgctagtga cgagagcatg attgaaacac gttgtgttct taactcgcat 5400
agtacagctg agaccactct tgatagtttc ttcagtaggg caggattagt tggagagata 5460
gatctccctc ttgagggcac aactaaccca aatggttatg ccaactggga catagatata 5520
acaggttacg cgcaaatgcg tagaaaggta gagctattca cctacatgcg ttttgatgca 5580
gagttcactt ttgttgcgtg cacacccacc ggggaggttg tcccacaatt gctccaatat 5640
atgtttgtgc cacctggagc ccctaagcca gattctaggg aatcccttgc atggcaaacc 5700
gccaccaacc cctcagtttt tgtcaagctg tcagaccctc cggcgcaggt ttcagtgcca 5760
ttcatgtcac ctgcgagtgc ttatcaatgg ttttatgacg gatatcccac attcggagaa 5820
cacaaacagg agaaagacct tgaatacggg gcatgtccta ataacatgat gggtacattc 5880
tcagtgcgga ctgtggggac ctccaagtcc aagtaccctt tagtggttag gatttacatg 5940
agaatgaagc acgtcagggc gtggatacct cgcccgatgc gcaaccagaa ctacctgttc 6000
aaagccaacc caaattatgc tggcaactct attaagccaa ctggtgccag tcgcacagcg 6060
atcaccactc ttgggaaatt tggacaacag tctggggcta tttatgtggg caactttaga 6120
gtggtcaacc gacatcttgc cacccataat gattgggcaa atcttgtttg ggaagacagc 6180
tctcgcgact tgctcgtgtc atccaccact gcccaaggtt gtgacacgat tgcccgttgc 6240
gattgccaga caggggtgta ctactgtaac tcgatgagaa aacactaccc agtcagtttt 6300
tcaaaaccca gcctgatcta tgtagaggct agcgagtatt acccagccag gtaccaatca 6360
catctcatgc tcgcacaggg tcactcggaa cctggtgatt gcggtggtat ccttaggtgc 6420
caacatggcg tcatcggcat agtgtctact ggtggcaatg ggctcgttgg ctttgcagac 6480
gtcagagacc tcttgtggtt agatgaagaa gctatggaac agggcgtgtc cgactacatt 6540
aagggtctcg gagatgcttt tggaacaggc ttcactgacg cagtctcaag ggaggttgaa 6600
gctctcaaga actatcttat agggtctgaa ggagcagttg agaaaatttt gaaaaatctt 6660
attaaactaa tctctgcact ggtgattgtg atcagaagtg attacgacat ggttaccctc 6720
actgcaacct tagcgctgat aggttgtcat ggcagtcctt gggcttggat taaagccaaa 6780
acagcctcca tcttaggtat ccctatcgcc caaaagcaga gcgcttcctg gctcaagaag 6840
ttcaatgaca tggccaacgc cgctaagggg ttagagtggg tttccaacaa gatcagcaaa 6900
tttattgatt ggcttaagga gaaaatagta ccagcagcca gggagaaggt tgaattccta 6960
aataacttga aacagctgcc actgctagag aatcagatct cgaacttgga acaatctgct 7020
gcttcacaag aggaccttga agtcatgttt gggaatgtgt cgtacctagc tcacttctgt 7080
cgcaagtttc aaccgctata cgccacggaa gctaaaagag tctatgccct ggagaagaga 7140
atgaataact atatgcagtt caagagcaaa caccgaattg aacctgtatg tctcattatt 7200
aggggctcac caggcaccgg gaagtctcta gccactggta ttattgctcg agcaatcgct 7260
gataagtacc actccagcgt gtactcgctc ccaccagacc cggatcattt tgacggttac 7320
aagcaacagg tggttacagt gatggatgat ttgtgtcaaa accccgatgg taaggatatg 7380
tccttattct gtcaaatggt atccaccgta gatttcattc caccaatggc ttctctcgag 7440
gagaagggag tttccttcac ctctaagttt gtcatcgcat ccactaatgc cagtaatatc 7500
atagtaccaa cagtgtctga ttctgacgct attcgccgca ggttctacat ggactgtgac 7560
attgaagtga cagactcgta caaaacagat ctaggtagac tggatgcagg gcgagccgct 7620
aaactgtgtt ctgaaaataa cactgcaaat ttcaaacgtt gcagcccatt agtgtgtggg 7680
aaagccatcc aacttagaga tagaaagtct aaagtcagat acagtgtgga tacggtggtt 7740
tcagaactta ttagggaata cagcaatagg tccgccattg gtaacacaat cgaggctctt 7800
ttccaaggtc cacccaagtt caggccaatt aggattagcc ttgaagaaaa accagcccca 7860
gacgctatta gcgatctcct tgctagtgta gatagtgaag aagtgcgcca gtactgcagg 7920
gatcaaggct ggattattcc tgaagctccc accaatgtgg agcggcacct taatagagcg 7980
gtgctcgtca tgcaatccat caccacagta gtggcggttg tttcgttggt gtacgtcatc 8040
tacaagctct ttgcagggtt tcagggtgca tattctggtg ctcctaagca agtgcttaag 8100
aaacctgctc ttcgcacagc aacagtgcag ggtccgagcc ttgactttgc tctctcccta 8160
ctgagaagga acatcaggca ggtccaaaca gaccaagggc atttcaccat gttgggtgtt 8220
agggatcgct tagcagtcct cccacgccac tcacaacctg gcaaaaccat ttggattgag 8280
cacaaactcg tgaacgtcct tgatgcagtt gaactggtgg atgagcaagg agtcaacctg 8340
gaattaaccc tcatcactct tgacaccaac gagaagttta gggatatcac caaattcatc 8400
ccagaaaata tcagcactgc tagcgatgcc accctagtga tcaacacgga gcacatgccg 8460
tcaatgtttg tcccggtggg tgacgttgtg cagtatggct ttttgaatct cagtggcaag 8520
cctacccatc gcaccatgat gtacaatttt cctactaaag caggacagtg tggaggagtg 8580
gtgacatctg ttgggaaggt tgtcggtatt cacattggtg gcaatggcag acaaggtttt 8640
tgcgcaggcc tcaaaaggag ttactttgct agtgaacaag gagagatcca gtgggttaag 8700
cccaataaag aaactggaag actcaacatc aatggaccaa cccgcaccaa gttagaacct 8760
agtgtattcc atgacatctt cgagggaaat aaggaaccag ctgtcttgca cagtaaagac 8820
ccccgacttg aggtagattt tgaacaggcc ctgttctcta agtatgtggg aaacacacta 8880
catgagcctg acgagtacat caaagaggca gctctacatt atgcaaacca attaaagcaa 8940
ctagaaatca atacctctca aatgagcatg gaggaggcct gctatggtac tgagaatctt 9000
gaggctattg atcttcacac tagtgcaggt tacccctata gtgccctagg gataaagaaa 9060
agagacatct tagaccctac caccagggac gtgagtagaa tgaagttcta catggacaag 9120
tatggtcttg atcttcccta ctccacttat gtcaaggacg agctacgctc gattgataaa 9180
atcaagaaag ggaagtcccg cctgatcgag gccagtagtc taaatgattc agtgtacctc 9240
agaatggctt tcgggcattt gtatgaggct ttccacgcaa atcctgggac gataactgga 9300
tcggccgtgg ggtgtaaccc tgacacattc tggagcaagc tgccaatttt gctccctggt 9360
tcactctttg cctttgacta ctcaggctat gatgccagcc ttagccctgt ctggttcaga 9420
gcattagaat tggttcttag ggagataggg tatagtgaag aggcaatctc actcattgag 9480
ggaatcaacc acacacatca tgtgtatcgt aataagacct attgcgtgct tggtgggatg 9540
ccctcaggct gttcaggaac atccatcttc aactcaatga tcaacaacat tattatcaga 9600
gcactgctca taaaaacatt taagggcatt gatttggatg aactcaacat ggtcgcttat 9660
ggagacgatg tgctcgctag ctatcccttc ccaattgatt gcttggaact agcaaagact 9720
ggtaaggagt atggtctgac catgacccct gctgataaat ctccttgctt taatgaggtc 9780
aattggggta atgcgacctt cctcaaaagg ggctttttgc ccgatgaaca gtttccattt 9840
ttgattcacc ctactatgcc aatgagggag atccatgagt ccattcgatg gaccaaggac 9900
gcacggaaca ctcaagatca tgtgcggtcc ttgtgcctcc tagcatggca taatggtaag 9960
caagaatacg agaagtttgt gagcacaatt aggtctgtcc cagtagggag agcgttggct 10020
attccaaatt atgaaaatct tagacgaaat tggctcgagt tattttagag gttatacaca 10080
cctcaacccc accagaaatc tggtcgtgaa tgtgactggt gggggtaaat ttgttataac 10140
cagaatagca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa agcttat 10187
<210> 7
<211> 19
<212> DNA
<213> Artificial
<400> 7
taatacgact cactatagg 19
<210> 8
<211> 30
<212> DNA
<213> Artificial
<400> 8
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 30
<210> 9
<211> 39
<212> DNA
<213> Artificial
<400> 9
gctagcgctt tttttttttt tttttttttt ttttttttt 39
<210> 10
<211> 55
<212> DNA
<213> Artificial
<400> 10
gacgcggccg ctaatacgac tcactatagg ttaaaacagc ctgtgggttg caccc 55
<210> 11
<211> 22
<212> DNA
<213> Artificial
<400> 11
gcactgcacg tggatgcaga ac 22
<210> 12
<211> 33
<212> DNA
<213> Artificial
<400> 12
gacgcggccg cgttctgcat ccacgtgcag tgc 33
<210> 13
<211> 22
<212> DNA
<213> Artificial
<400> 13
aagtcgcgag agctgtcttc cc 22
<210> 14
<211> 33
<212> DNA
<213> Artificial
<400> 14
gacgcggccg cgggaagaca gctctcgcga ctt 33
<210> 15
<211> 28
<212> DNA
<213> Artificial
<400> 15
aattgtacat catggtgcga tgggtagg 28
<210> 16
<211> 39
<212> DNA
<213> Artificial
<400> 16
gacgcggccg ccctacccat cgcaccatga tgtacaatt 39
<210> 17
<211> 73
<212> DNA
<213> Artificial
<400> 17
gctagcgctt tttttttttt tttttttttt tttttttttg ctattctggt tataacaaat 60
ttacccccac cag 73
<210> 18
<211> 18
<212> DNA
<213> Artificial
<400> 18
cctgacgtgt cgacgcgg 18
<210> 19
<211> 49
<212> DNA
<213> Artificial
<400> 19
cctcgccctt gctcaccatc atatggttta gctgtgttaa gggtcaaga 49
<210> 20
<211> 49
<212> DNA
<213> Artificial
<400> 20
tcttgaccct taacacagct aaaccatatg atggtgagca agggcgagg 49
<210> 21
<211> 66
<212> DNA
<213> Artificial
<400> 21
cgctgtgtag acacttgcga accaagagtg gtgatcgcat gcatcttgta cagctcgtcc 60
atgccg 66
<210> 22
<211> 66
<212> DNA
<213> Artificial
<400> 22
cggcatggac gagctgtaca agatgcatgc gatcaccact cttggttcgc aagtgtctac 60
acagcg 66
<210> 23
<211> 21
<212> DNA
<213> Artificial
<400> 23
ctgcacgtgg atgcagaacc c 21
<210> 24
<211> 21
<212> DNA
<213> Artificial
<400> 24
ctgcacgtgg atgcagaacc c 21
<210> 25
<211> 53
<212> DNA
<213> Artificial
<400> 25
gaaatcttcg agtgtgaaga ccattctaga gtttagctgt gttaagggtc aag 53
<210> 26
<211> 53
<212> DNA
<213> Artificial
<400> 26
cttgaccctt aacacagcta aactctagaa tggtcttcac actcgaagat ttc 53
<210> 27
<211> 27
<212> DNA
<213> Artificial
<400> 27
cgcatgcatc gccagaatgc gttcgca 27
Claims (20)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910474088.3A CN112094822A (en) | 2019-06-02 | 2019-06-02 | Infectious cDNA clone based on EV71 strain and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910474088.3A CN112094822A (en) | 2019-06-02 | 2019-06-02 | Infectious cDNA clone based on EV71 strain and application thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112094822A (en) | 2020-12-18 |
Family
ID=73748863
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910474088.3A Pending CN112094822A (en) | 2019-06-02 | 2019-06-02 | Infectious cDNA clone based on EV71 strain and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112094822A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115088674A (en) * | 2022-06-10 | 2022-09-23 | 桂林医学院第二附属医院 | Construction method and application of echovirus 30 type wild suckling mouse model |
CN116218907A (en) * | 2023-02-20 | 2023-06-06 | 复旦大学附属中山医院 | Enterovirus infectious clone with HiBiT novel reporter gene and its construction method and application |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102766607A (en) * | 2012-07-23 | 2012-11-07 | 哈尔滨医科大学 | Fusion protein for screening and evaluating anti-enterovirus 71 medicine and application of fusion protein |
CN103160475A (en) * | 2011-12-14 | 2013-06-19 | 北京微谷生物医药有限公司 | Enterovirus 71 type viral strain, its application, vaccine and preparation method |
CN103374580A (en) * | 2012-04-27 | 2013-10-30 | 中国医学科学院医药生物技术研究所 | Enterovirus 71 (EV 71) Fuyang strain and cDNA (deoxyribonucleic acid) infectious clone of attenuated strain of enterovirus 71 (EV 71) Fuyang strain as well as application of enterovirus 71 (EV 71) Fuyang strain |
CN103805634A (en) * | 2014-03-05 | 2014-05-21 | 中国科学院武汉病毒研究所 | CA16 infectious clone with green fluorescent protein gene as well as construction method and application of CA16 infectious clone |
US20180036398A1 (en) * | 2015-02-27 | 2018-02-08 | Novartis Ag | Flavivirus replicons |
CN107849540A (en) * | 2015-01-28 | 2018-03-27 | 淡马锡生命科学研究院有限公司 | Enterovirus 71 animal model |
Non-Patent Citations (2)
Title |
---|
HUIQIANG WANG et al.: "Recent Progress on Functional Genomics Research of Enterovirus 71", 《VIROLOGICA SINICA》, vol. 34, no. 1, pages 9-21, XP036728199, DOI: 10.1007/s12250-018-0071-9 * |
JIE SONG et al.: "Suppression of the toll-like receptor 7-dependent type I interferon production pathway by autophagy resulting from enterovirus 71 and coxsackievirus A16 infections facilitates their replication", 《ARCH VIROL》, vol. 163, no. 1, pages 135-144, XP036400088, DOI: 10.1007/s00705-017-3592-x * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115088674A (en) * | 2022-06-10 | 2022-09-23 | 桂林医学院第二附属医院 | Construction method and application of echovirus 30 type wild suckling mouse model |
CN116218907A (en) * | 2023-02-20 | 2023-06-06 | 复旦大学附属中山医院 | Enterovirus infectious clone with HiBiT novel reporter gene and its construction method and application |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
DK2788478T3 (en) | Multiplex immunoscreening assay | |
AU2023241391A1 (en) | Novel crispr enzymes and systems | |
AU2024216517A1 (en) | Enhanced systems for cell-mediated oncolytic viral therapy | |
KR102077131B1 (en) | Recombinant measles virus expressing chikungunya virus polypeptides and their applications | |
JP2023071855A (en) | CRISPR-Cas effector polypeptides and methods of use thereof | |
CN109312360B (en) | Transposon-based transfection system for primary cells | |
KR101227128B1 (en) | INFECTIOUS cDNA OF AN APPROVED VACCINE STRAIN OF MEASLES VIRUS, USE FOR IMMUNOGENIC COMPOSITIONS | |
US6168943B1 (en) | Methods for making modified recombinant vesiculoviruses | |
KR20070077140A (en) | How to analyze protein-protein interactions | |
CN101213203A (en) | Methods and compositions for modulating nucleic acid expression at the post-transcriptional level | |
KR20210126680A (en) | Compositions and methods for treating alpha-1 antitrypsin deficiency | |
KR20120034652A (en) | Method for generating a genetically modified microbe | |
KR20220007155A (en) | Modified S1 subunit of coronavirus spike protein | |
CN108949825A (en) | Preparation method and application of CAR-T cells targeting HER2 | |
CN112094822A (en) | Infectious cDNA clone based on EV71 strain and application thereof | |
CN107043783A (en) | Vector for in vivo localization of the mammalian cell genome based on the CRISPR-Cas9 system, and application thereof | |
KR20220016485A (en) | AAV vectors having myelin protein zero promoter, and their use for treating Schwann cell-associated diseases such as Charcot-Marie-Tooth disease | |
CN110343713A (en) | Multifunctional luciferase reporter gene vector based on the human TLR4 gene, and construction method and application thereof | |
CN112057611A (en) | Application of African swine fever virus E120R protein as immunosuppressant and construction of immunosuppressive site knockout strain | |
CN114703207B (en) | Preparation method of recombinant plasmid and recombinant virus | |
CN109468244A (en) | An acid-fast high-density Escherichia coli and its application | |
CN110777147A (en) | IKZF3 gene-silenced T cell and application thereof | |
CN110129340A (en) | Infectious Cloning and Application of Zika Virus MR766 Strain | |
CN114174321A (en) | Modified S2 subunit of coronavirus spike protein | |
CN114231513B (en) | A Short Peptide Inhibiting the Activity of Proteasome PSMB5 Subunit and Its Application in Anti-Rickettsia Infection |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||