CN112094822A - Infectious cDNA clone based on EV71 strain and application thereof - Google Patents
Infectious cDNA clone based on EV71 strain and application thereof Download PDFInfo
- Publication number
- CN112094822A CN112094822A CN201910474088.3A CN201910474088A CN112094822A CN 112094822 A CN112094822 A CN 112094822A CN 201910474088 A CN201910474088 A CN 201910474088A CN 112094822 A CN112094822 A CN 112094822A
- Authority
- CN
- China
- Prior art keywords
- virus
- leu
- ala
- ser
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 208000015181 infectious disease Diseases 0.000 title claims abstract description 121
- 230000002458 infectious effect Effects 0.000 title claims abstract description 83
- 241001529459 Enterovirus A71 Species 0.000 title claims abstract description 79
- 239000002299 complementary DNA Substances 0.000 title claims abstract description 59
- 241000700605 Viruses Species 0.000 claims abstract description 158
- 108700008625 Reporter Genes Proteins 0.000 claims abstract description 46
- 230000003612 virological effect Effects 0.000 claims abstract description 37
- 239000002245 particle Substances 0.000 claims abstract description 34
- 241001465754 Metazoa Species 0.000 claims abstract description 16
- 229960005486 vaccine Drugs 0.000 claims abstract description 13
- 238000010171 animal model Methods 0.000 claims abstract description 12
- 239000003814 drug Substances 0.000 claims abstract description 7
- 239000013612 plasmid Substances 0.000 claims description 54
- 150000007523 nucleic acids Chemical group 0.000 claims description 47
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 46
- 108020004414 DNA Proteins 0.000 claims description 35
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 24
- 108700026244 Open Reading Frames Proteins 0.000 claims description 16
- 238000000034 method Methods 0.000 claims description 11
- 102000053602 DNA Human genes 0.000 claims description 9
- 108010067390 Viral Proteins Proteins 0.000 claims description 6
- 238000010276 construction Methods 0.000 claims description 6
- 108091026890 Coding region Proteins 0.000 claims description 5
- 108091027544 Subgenomic mRNA Proteins 0.000 claims description 5
- 238000012216 screening Methods 0.000 claims description 5
- 108060001084 Luciferase Proteins 0.000 claims description 4
- 239000005089 Luciferase Substances 0.000 claims description 4
- 238000004519 manufacturing process Methods 0.000 claims description 4
- 108091006047 fluorescent proteins Proteins 0.000 claims description 3
- 102000034287 fluorescent proteins Human genes 0.000 claims description 3
- 239000013603 viral vector Substances 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 210000002845 virion Anatomy 0.000 claims 1
- 239000013598 vector Substances 0.000 abstract description 9
- 238000011161 development Methods 0.000 abstract description 7
- 238000001415 gene therapy Methods 0.000 abstract description 5
- 239000003153 chemical reaction reagent Substances 0.000 abstract description 4
- 238000001514 detection method Methods 0.000 abstract description 4
- 230000009385 viral infection Effects 0.000 abstract description 4
- 239000013604 expression vector Substances 0.000 abstract description 3
- 238000002649 immunization Methods 0.000 abstract description 2
- 230000003053 immunization Effects 0.000 abstract description 2
- 230000002265 prevention Effects 0.000 abstract description 2
- 108020004635 Complementary DNA Proteins 0.000 description 42
- 238000010804 cDNA synthesis Methods 0.000 description 42
- 210000004027 cell Anatomy 0.000 description 39
- 241000699670 Mus sp. Species 0.000 description 27
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 23
- 210000003501 vero cell Anatomy 0.000 description 20
- 238000000338 in vitro Methods 0.000 description 17
- 230000010076 replication Effects 0.000 description 16
- 239000006228 supernatant Substances 0.000 description 16
- 108090000623 proteins and genes Proteins 0.000 description 15
- 241000699666 Mus <mouse, genus> Species 0.000 description 11
- 230000035772 mutation Effects 0.000 description 11
- 241000709661 Enterovirus Species 0.000 description 10
- 239000012634 fragment Substances 0.000 description 10
- 101710172711 Structural protein Proteins 0.000 description 9
- 108020000999 Viral RNA Proteins 0.000 description 9
- 102000004169 proteins and genes Human genes 0.000 description 9
- 238000013518 transcription Methods 0.000 description 8
- 230000035897 transcription Effects 0.000 description 8
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 230000004927 fusion Effects 0.000 description 7
- 238000012408 PCR amplification Methods 0.000 description 6
- 239000003443 antiviral agent Substances 0.000 description 6
- 230000002238 attenuated effect Effects 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 108091005804 Peptidases Proteins 0.000 description 5
- 239000004365 Protease Substances 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- 230000029812 viral genome replication Effects 0.000 description 5
- 208000020061 Hand, Foot and Mouth Disease Diseases 0.000 description 4
- 208000025713 Hand-foot-and-mouth disease Diseases 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 229940079593 drug Drugs 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 108010003700 lysyl aspartic acid Proteins 0.000 description 4
- 238000010172 mouse model Methods 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 230000003362 replicative effect Effects 0.000 description 4
- 208000024891 symptom Diseases 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 230000014616 translation Effects 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 101710132601 Capsid protein Proteins 0.000 description 3
- 101710197658 Capsid protein VP1 Proteins 0.000 description 3
- 241000282693 Cercopithecidae Species 0.000 description 3
- 241000709687 Coxsackievirus Species 0.000 description 3
- 241000991587 Enterovirus C Species 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 3
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 3
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 3
- 208000028389 Nerve injury Diseases 0.000 description 3
- 241000709664 Picornaviridae Species 0.000 description 3
- 101710118046 RNA-directed RNA polymerase Proteins 0.000 description 3
- 101710108545 Viral protein 1 Proteins 0.000 description 3
- 150000001413 amino acids Chemical group 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 230000006378 damage Effects 0.000 description 3
- 230000001066 destructive effect Effects 0.000 description 3
- 241001493065 dsRNA viruses Species 0.000 description 3
- 108010092114 histidylphenylalanine Proteins 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 108010068488 methionylphenylalanine Proteins 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 230000004083 survival effect Effects 0.000 description 3
- 210000001519 tissue Anatomy 0.000 description 3
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 2
- 102100025230 2-amino-3-ketobutyrate coenzyme A ligase, mitochondrial Human genes 0.000 description 2
- 108010087522 Aeromonas hydrophilia lipase-acyltransferase Proteins 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- 101000651036 Arabidopsis thaliana Galactolipid galactosyltransferase SFR2, chloroplastic Proteins 0.000 description 2
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 2
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 2
- 241001466953 Echovirus Species 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 2
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 2
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 2
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 2
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 2
- 241000282560 Macaca mulatta Species 0.000 description 2
- 241000702318 Microviridae Species 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 2
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- BARPGRUZBKFJMA-SRVKXCTJSA-N Pro-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BARPGRUZBKFJMA-SRVKXCTJSA-N 0.000 description 2
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 2
- 238000011579 SCID mouse model Methods 0.000 description 2
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- 108700005077 Viral Genes Proteins 0.000 description 2
- 108010087302 Viral Structural Proteins Proteins 0.000 description 2
- 230000003044 adaptive effect Effects 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 230000002155 anti-virotic effect Effects 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 210000003169 central nervous system Anatomy 0.000 description 2
- 230000000120 cytopathologic effect Effects 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 238000009509 drug development Methods 0.000 description 2
- 230000001605 fetal effect Effects 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 108700043045 nanoluc Proteins 0.000 description 2
- 230000008764 nerve damage Effects 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108090000765 processed proteins & peptides Proteins 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 238000012827 research and development Methods 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 238000004448 titration Methods 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 230000017613 viral reproduction Effects 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 1
- BAAVRTJSLCSMNM-CMOCDZPBSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-4-carboxybutanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]pentanedioic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 BAAVRTJSLCSMNM-CMOCDZPBSA-N 0.000 description 1
- AWXGSYPUMWKTBR-UHFFFAOYSA-N 4-carbazol-9-yl-n,n-bis(4-carbazol-9-ylphenyl)aniline Chemical compound C12=CC=CC=C2C2=CC=CC=C2N1C1=CC=C(N(C=2C=CC(=CC=2)N2C3=CC=CC=C3C3=CC=CC=C32)C=2C=CC(=CC=2)N2C3=CC=CC=C3C3=CC=CC=C32)C=C1 AWXGSYPUMWKTBR-UHFFFAOYSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- WRDANSJTFOHBPI-FXQIFTODSA-N Ala-Arg-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N WRDANSJTFOHBPI-FXQIFTODSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 1
- OLDOLPWZEMHNIA-PJODQICGSA-N Arg-Ala-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OLDOLPWZEMHNIA-PJODQICGSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 1
- OCOZPTHLDVSFCZ-BPUTZDHNSA-N Arg-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N OCOZPTHLDVSFCZ-BPUTZDHNSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- QIWYWCYNUMJBTC-CIUDSAMLSA-N Arg-Cys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIWYWCYNUMJBTC-CIUDSAMLSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- 241001424309 Arita Species 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- LXTGAOAXPSJWOU-DCAQKATOSA-N Asn-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N LXTGAOAXPSJWOU-DCAQKATOSA-N 0.000 description 1
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- ZNYKKCADEQAZKA-FXQIFTODSA-N Asn-Ser-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O ZNYKKCADEQAZKA-FXQIFTODSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- YHXNKGKUDJCAHB-PBCZWWQYSA-N Asn-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YHXNKGKUDJCAHB-PBCZWWQYSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- APYNREQHZOGYHV-ACZMJKKPSA-N Asp-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N APYNREQHZOGYHV-ACZMJKKPSA-N 0.000 description 1
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- XAPPCWUWHNWCPQ-PBCZWWQYSA-N Asp-Thr-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XAPPCWUWHNWCPQ-PBCZWWQYSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 1
- SFUUYRSAJPWTGO-SRVKXCTJSA-N Cys-Asn-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SFUUYRSAJPWTGO-SRVKXCTJSA-N 0.000 description 1
- SBMGKDLRJLYZCU-BIIVOSGPSA-N Cys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N)C(=O)O SBMGKDLRJLYZCU-BIIVOSGPSA-N 0.000 description 1
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 1
- QADHATDBZXHRCA-ACZMJKKPSA-N Cys-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N QADHATDBZXHRCA-ACZMJKKPSA-N 0.000 description 1
- PRHGYQOSEHLDRW-VGDYDELISA-N Cys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N PRHGYQOSEHLDRW-VGDYDELISA-N 0.000 description 1
- UBHPUQAWSSNQLQ-DCAQKATOSA-N Cys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O UBHPUQAWSSNQLQ-DCAQKATOSA-N 0.000 description 1
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 1
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 1
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 241000988559 Enterovirus A Species 0.000 description 1
- 241000963438 Gaussia <copepod> Species 0.000 description 1
- 108700023863 Gene Components Proteins 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- OIIIRRTWYLCQNW-ACZMJKKPSA-N Gln-Cys-Asn Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O OIIIRRTWYLCQNW-ACZMJKKPSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 1
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 1
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 1
- YMCPEHDGTRUOHO-SXNHZJKMSA-N Gln-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N YMCPEHDGTRUOHO-SXNHZJKMSA-N 0.000 description 1
- CVRUVYDNRPSKBM-QEJZJMRPSA-N Gln-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N CVRUVYDNRPSKBM-QEJZJMRPSA-N 0.000 description 1
- NVHJGTGTUGEWCG-ZVZYQTTQSA-N Gln-Trp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O NVHJGTGTUGEWCG-ZVZYQTTQSA-N 0.000 description 1
- UGEZSPWLJABDAR-KKUMJFAQSA-N Gln-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N UGEZSPWLJABDAR-KKUMJFAQSA-N 0.000 description 1
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- QLPYYTDOUQNJGQ-AVGNSLFASA-N Glu-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N QLPYYTDOUQNJGQ-AVGNSLFASA-N 0.000 description 1
- YDJOULGWHQRPEV-SRVKXCTJSA-N Glu-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N YDJOULGWHQRPEV-SRVKXCTJSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- OLTHVCNYJAALPL-BHYGNILZSA-N Glu-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OLTHVCNYJAALPL-BHYGNILZSA-N 0.000 description 1
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- VUUOMYFPWDYETE-WDSKDSINSA-N Gly-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VUUOMYFPWDYETE-WDSKDSINSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 208000032843 Hemorrhage Diseases 0.000 description 1
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 1
- MJNWEIMBXKKCSF-XVYDVKMFSA-N His-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N MJNWEIMBXKKCSF-XVYDVKMFSA-N 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- JBJNKUOMNZGQIM-PYJNHQTQSA-N His-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JBJNKUOMNZGQIM-PYJNHQTQSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 1
- OMNVOTCFQQLEQU-CIUDSAMLSA-N His-Asn-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMNVOTCFQQLEQU-CIUDSAMLSA-N 0.000 description 1
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 1
- OSZUPUINVNPCOE-SDDRHHMPSA-N His-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OSZUPUINVNPCOE-SDDRHHMPSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- ORZGPQXISSXQGW-IHRRRGAJSA-N His-His-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O ORZGPQXISSXQGW-IHRRRGAJSA-N 0.000 description 1
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 1
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 1
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 1
- CHIAUHSHDARFBD-ULQDDVLXSA-N His-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 CHIAUHSHDARFBD-ULQDDVLXSA-N 0.000 description 1
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- 101000957437 Homo sapiens Mitochondrial carnitine/acylcarnitine carrier protein Proteins 0.000 description 1
- 101001128634 Homo sapiens NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 2, mitochondrial Proteins 0.000 description 1
- 101000837344 Homo sapiens T-cell leukemia translocation-altered gene protein Proteins 0.000 description 1
- 108700039609 IRW peptide Proteins 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- IXEFKXAGHRQFAF-HVTMNAMFSA-N Ile-Glu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IXEFKXAGHRQFAF-HVTMNAMFSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- MITYXXNZSZLHGG-OBAATPRFSA-N Ile-Trp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N MITYXXNZSZLHGG-OBAATPRFSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- 102000001617 Interferon Receptors Human genes 0.000 description 1
- 108010054267 Interferon Receptors Proteins 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- QKIBIXAQKAFZGL-GUBZILKMSA-N Leu-Cys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QKIBIXAQKAFZGL-GUBZILKMSA-N 0.000 description 1
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 1
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 1
- 208000004852 Lung Injury Diseases 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- CKSXSQUVEYCDIW-AVGNSLFASA-N Lys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N CKSXSQUVEYCDIW-AVGNSLFASA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- ZASPELYMPSACER-HOCLYGCPSA-N Lys-Gly-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZASPELYMPSACER-HOCLYGCPSA-N 0.000 description 1
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 241000282567 Macaca fascicularis Species 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 1
- GPVLSVCBKUCEBI-KKUMJFAQSA-N Met-Gln-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GPVLSVCBKUCEBI-KKUMJFAQSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 1
- JOYFULUKJRJCSX-IUCAKERBSA-N Met-Met-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O JOYFULUKJRJCSX-IUCAKERBSA-N 0.000 description 1
- QTMIXEQWGNIPBL-JYJNAYRXSA-N Met-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N QTMIXEQWGNIPBL-JYJNAYRXSA-N 0.000 description 1
- 102100038738 Mitochondrial carnitine/acylcarnitine carrier protein Human genes 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 102100032194 NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 2, mitochondrial Human genes 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 206010021888 Nervous system infections Diseases 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- PDUVELWDJZOUEI-IHRRRGAJSA-N Phe-Cys-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PDUVELWDJZOUEI-IHRRRGAJSA-N 0.000 description 1
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- FXYXBEZMRACDDR-KKUMJFAQSA-N Phe-His-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O FXYXBEZMRACDDR-KKUMJFAQSA-N 0.000 description 1
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 1
- DZVXMMSUWWUIQE-ACRUOGEOSA-N Phe-His-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N DZVXMMSUWWUIQE-ACRUOGEOSA-N 0.000 description 1
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 1
- OHIYMVFLQXTZAW-UFYCRDLUSA-N Phe-Met-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OHIYMVFLQXTZAW-UFYCRDLUSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 1
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 1
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 1
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 1
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 1
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- ZBAGOWGNNAXMOY-IHRRRGAJSA-N Pro-Cys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZBAGOWGNNAXMOY-IHRRRGAJSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 1
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 1
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 1
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- 206010037423 Pulmonary oedema Diseases 0.000 description 1
- 206010037714 Quadriplegia Diseases 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 208000035415 Reinfection Diseases 0.000 description 1
- 241001068295 Replication defective viruses Species 0.000 description 1
- 206010057190 Respiratory tract infections Diseases 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- RQXDSYQXBCRXBT-GUBZILKMSA-N Ser-Met-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RQXDSYQXBCRXBT-GUBZILKMSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- PZHJLTWGMYERRJ-SRVKXCTJSA-N Ser-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O PZHJLTWGMYERRJ-SRVKXCTJSA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- 102100028692 T-cell leukemia translocation-altered gene protein Human genes 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- JMQUAZXYFAEOIH-XGEHTFHBSA-N Thr-Arg-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O JMQUAZXYFAEOIH-XGEHTFHBSA-N 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- XUGYQLFEJYZOKQ-NGTWOADLSA-N Thr-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XUGYQLFEJYZOKQ-NGTWOADLSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- 206010069363 Traumatic lung injury Diseases 0.000 description 1
- BDWDMRSGCXEDMR-WFBYXXMGSA-N Trp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BDWDMRSGCXEDMR-WFBYXXMGSA-N 0.000 description 1
- HJWVPKJHHLZCNH-DVXDUOKCSA-N Trp-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=3C4=CC=CC=C4NC=3)C)C(O)=O)=CNC2=C1 HJWVPKJHHLZCNH-DVXDUOKCSA-N 0.000 description 1
- FNOQJVHFVLVMOS-AAEUAGOBSA-N Trp-Gly-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N FNOQJVHFVLVMOS-AAEUAGOBSA-N 0.000 description 1
- LDMUNXDDIDAPJH-VMBFOHBNSA-N Trp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N LDMUNXDDIDAPJH-VMBFOHBNSA-N 0.000 description 1
- SAKLWFSRZTZQAJ-GQGQLFGLSA-N Trp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SAKLWFSRZTZQAJ-GQGQLFGLSA-N 0.000 description 1
- WSMVEHPVOYXPAQ-XIRDDKMYSA-N Trp-Ser-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N WSMVEHPVOYXPAQ-XIRDDKMYSA-N 0.000 description 1
- DYIXEGROAOVQPK-VFAJRCTISA-N Trp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DYIXEGROAOVQPK-VFAJRCTISA-N 0.000 description 1
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 1
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 1
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- YRBHLWWGSSQICE-IHRRRGAJSA-N Tyr-Asp-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O YRBHLWWGSSQICE-IHRRRGAJSA-N 0.000 description 1
- KLGFILUOTCBNLJ-IHRRRGAJSA-N Tyr-Cys-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O KLGFILUOTCBNLJ-IHRRRGAJSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- JHORGUYURUBVOM-KKUMJFAQSA-N Tyr-His-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O JHORGUYURUBVOM-KKUMJFAQSA-N 0.000 description 1
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- BBSPTGPYIPGTKH-JYJNAYRXSA-N Tyr-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BBSPTGPYIPGTKH-JYJNAYRXSA-N 0.000 description 1
- YSGAPESOXHFTQY-IHRRRGAJSA-N Tyr-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N YSGAPESOXHFTQY-IHRRRGAJSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- LVILBTSHPTWDGE-PMVMPFDFSA-N Tyr-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CC=C(O)C=C1 LVILBTSHPTWDGE-PMVMPFDFSA-N 0.000 description 1
- ANHVRCNNGJMJNG-BZSNNMDCSA-N Tyr-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CS)C(=O)O)N)O ANHVRCNNGJMJNG-BZSNNMDCSA-N 0.000 description 1
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 1
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- OFQGGTGZTOTLGH-NHCYSSNCSA-N Val-Met-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N OFQGGTGZTOTLGH-NHCYSSNCSA-N 0.000 description 1
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- 108010067674 Viral Nonstructural Proteins Proteins 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 241000907316 Zika virus Species 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 229940031567 attenuated vaccine Drugs 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 108010020595 beta-casomorphin 4 Proteins 0.000 description 1
- 244000309464 bull Species 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 108091092328 cellular RNA Proteins 0.000 description 1
- 208000015114 central nervous system disease Diseases 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 231100000676 disease causative agent Toxicity 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 231100000225 lethality Toxicity 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 231100000515 lung injury Toxicity 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 210000004165 myocardium Anatomy 0.000 description 1
- 210000000822 natural killer cell Anatomy 0.000 description 1
- 210000005036 nerve Anatomy 0.000 description 1
- 230000000926 neurological effect Effects 0.000 description 1
- 231100000189 neurotoxic Toxicity 0.000 description 1
- 230000002887 neurotoxic effect Effects 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 208000005333 pulmonary edema Diseases 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 210000002027 skeletal muscle Anatomy 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 1
- 108010032276 tyrosyl-glutamyl-tyrosyl-glutamic acid Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000006656 viral protein synthesis Effects 0.000 description 1
- 230000006394 virus-host interaction Effects 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/08—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses
- C07K16/10—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses from RNA viruses
- C07K16/1009—Picornaviridae, e.g. hepatitis A virus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/569—Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
- G01N33/56983—Viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/32011—Picornaviridae
- C12N2770/32311—Enterovirus
- C12N2770/32321—Viruses as such, e.g. new isolates, mutants or their genomic sequences
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/32011—Picornaviridae
- C12N2770/32311—Enterovirus
- C12N2770/32334—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/32011—Picornaviridae
- C12N2770/32311—Enterovirus
- C12N2770/32351—Methods of production or purification of viral material
- C12N2770/32352—Methods of production or purification of viral material relating to complementing cells and packaging systems for producing virus or viral particles
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/106—Plasmid DNA for vertebrates
- C12N2800/107—Plasmid DNA for vertebrates for mammalian
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2333/00—Assays involving biological materials from specific organisms or of a specific nature
- G01N2333/005—Assays involving biological materials from specific organisms or of a specific nature from viruses
- G01N2333/01—DNA viruses
- G01N2333/015—Parvoviridae, e.g. feline panleukopenia virus, human Parvovirus
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Virology (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Immunology (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- Wood Science & Technology (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Hematology (AREA)
- Urology & Nephrology (AREA)
- Communicable Diseases (AREA)
- Pharmacology & Pharmacy (AREA)
- Biophysics (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- General Physics & Mathematics (AREA)
- Epidemiology (AREA)
- Mycology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Tropical Medicine & Parasitology (AREA)
- Pathology (AREA)
- Plant Pathology (AREA)
- Cell Biology (AREA)
- Analytical Chemistry (AREA)
- Food Science & Technology (AREA)
- Oncology (AREA)
- Chemical Kinetics & Catalysis (AREA)
Abstract
The invention belongs to the field of biological medicine, and provides a stable infectious cDNA clone based on an EV71 strain which is clinically separated, derivative clones containing various reporter genes, and various mutant clones constructed by taking the infectious cDNA clone as a female parent; and various recombinant viruses, subunit viral particles produced using these clones; and animal models established by infecting animals with various recombinant viruses produced by the clones; and the use of these viral or subunit viral particles for vaccine development and diagnostic reagents; and the use of the virus as a gene therapy vector or an expression vector. The invention provides a new tool and a new way for detection, prevention and immunization of EV71 virus infection, and provides possibility for gene therapy and vaccine development by using the EV71 strain infectious clone as a virus vector.
Description
Technical Field
The invention belongs to the field of biological medicines, and particularly relates to construction of infectious cDNA clone based on an EV71 strain (js1) clinically isolated, viruses generated by the cDNA clone and derivative clones thereof, viruses with reporter genes and application of established animal models in research and development of antiviral drugs, research and development of vaccines and virus diagnosis.
Background
The prior art discloses that enteroviruses are a generic term for a class of viruses, including 3 types of Poliovirus (Poliovirus), 23 types of Coxsackie virus (Coxsackie virus a), 6 types of Coxsackie virus (Coxsackie virus B), 31 types of echovirus (ECHO virus), 68-71 types of Enterovirus (Enterovirus), and 67 types in total. The enteroviruses discovered after the traditional typing are named according to the discovery sequence, and the novel enteroviruses discovered now comprise enteroviruses of types 68, 69, 70, 71 and 72. A novel enterovirus type71, abbreviated EV71, belongs to the family picornaviridae, the genus Enterovirus, which was isolated in 1969 from Australia and the United states and in 1973 in Japan and is considered to be the main pathogen of outbreaks of hand-foot-and-mouth disease in children (Schmidt et al.J. Infect Dis 1974,129: 304-309; Hagiwara et al.Intervirology 1978,9: 60-63.). Prior to 1988, the EV71 virus caused an outbreak of hand-foot-and-mouth disease in infants primarily in the United states, Japan, Europe, and Australia (Weng et al microbees insert 2010; 12: 505-10; Tagaya et al Jpn J Med Sci Biol 1975; 28: 231-4; Blumberg et al Lancet 1974; 2: 112; Nagy et al. Arch Virol 1982; 71: 217-27; Kennett et al Bull World Health Organ 1974; 51: 609-15; Gilbert et al Pediatr Infect J1988; 7: 484-8). Since 1990, the EV71 virus caused a series of outbreaks in Asia-Pacific regions (Chan et al. Clin Infect Dis 2000; 31: 678-83; Tu et al. Emerg Infect Dis 2007; 13: 1733-41; Jeong et al. Arch Virol 2010; 155: 1707-12). By 2014, EV71 infection has spread to various continents and countries worldwide. Studies have shown that EV71 infection is primarily distributed in asian-pacific regions, and there is also a distribution of EV71 infection in north america, south america, europe and australia.
It has been reported that the EV71 virus is a single-stranded positive-strand RNA virus, the genome of which can encode a single long Open Reading Frame (ORF) and which also contains two long noncoding regions, 5'TURs and 3' TURs, in both sections of the genome. 5' TURs contain an Internal Ribosome Entry Site (IRES) that initiates the viral translation process (Hellen et al. genes Dev.2001; 15, 1593-1612). The open reading frame encoded by the virus is processed post-translationally by proteases encoded by the virus itself into individual viral proteins, including the structural proteins VP4, VP2, VP3, VP1 that make up the viral particle and the non-structural proteins 2A,2B,2C,3A,3B,3C and 3D responsible for viral replication (Racaniello, et al.
Studies have reported that primates can be used as infection models for EV 71. Early in 1978, Hashimoto et al reported that 1.8-3.8kg of cynomolgus monkeys, after 9 weeks of isolation, could be infected with a strain of EV71 virus isolated from stool specimens from a3 year old child, with EV71 virus being neurotoxic to the monkey, and showing clinical symptoms of neurological damage on day four of infection, with the degree of damage being positively correlated with viral titer. And EV71 virus can induce the production of serum neutralizing antibodies in monkeys (Hashimoto et al. Arch Virol.1978; 56: 257-61). Zhang et al, using a rhesus monkey of 3-3.5 years old, can establish an animal infection model with symptoms of intracerebral infection, pulmonary edema, hemorrhage with nerve injury, etc., while venous and respiratory infections can directly cause nervous system infections. Thus, models of different research purposes can be obtained by different infection routes (Zhang et al Lab invest.2011; 91: 1337-50). Furthermore, there are rhesus monkey animal models that cause central nervous system diseases (Liu et al. virology.2011; 412: 91-100).
Non-primate animal models of EV71 have also been reported, e.g., mouse-adapted mutant EV71 strain EV71/MP4 can infect ICR mice, developing nerve and lung injury (Chen et al.J Virol.2007,81: 8996-9003; Wang et al.J Virol.2004,78: 7916-24). Arita et al used immunodeficient, non-obese, severely diabetic mice (NOD/SCID mice), passaged for virus to obtain mice that could be infected with 3 week-sized NOD/SCID mice, adapted to the EV71 strain, and this mouse model was inhibited in natural killer cell function and lacked functional T, B cells. Furthermore, the obtained adapted strain of mouse mainly infects central nervous system, heart and skeletal muscle of animals (Arita et al. J Virol.2008,82(4): 1787-97). Using interferon receptors alpha, beta and gamma deficient immunodeficient mouse AG129 mice, AG129 mice 2 weeks or less old can infect the natural strain of EV71 and exhibit symptoms of acroparalysis before the mice die (Khong et al.j virol.2012,86(4): 2121-31). Three weeks old transgenic mice expressing the EV71 receptor hscabr 2 were successfully infected with the EV71Isehara/Japan/99(Isehara) strain; studies have shown that constructing a mouse model of EV71 requires either a special mouse adapted strain or a genetically deleted or modified mouse.
Studies have also reported that genomic RNA of single positive-strand (positive-strand) RNA viruses is released and can be translated directly as an mRNA template after entering the host cell cytoplasm; the viral nonstructural proteins produced by translation recruit the viral genome to form replication complexes to initiate viral gene replication and life cycle, and therefore the genomic RNA of single positive strand RNA viruses is infectious and, when introduced into a host cell, can completely initiate the entire life cycle of the virus (Racaniello, et al, science.1981,214(4523): 916). Methods for constructing infectious clones generally use total RNA of virus-infected cells as a template, reverse transcribe it to complementary DNA (cDNA), and then clone viral fragments into a cloning vector to form infectious clones of the virus. The constructed infectious clones use in vitro transcription to produce intact viral RNA, which is then transfected into host cells to initiate the viral life cycle, producing progeny virus. Or if the constructed infectious clone has a eukaryotic cell promoter, the infectious clone can be directly transfected with plasmids, and RNA polymerase of host cells transcribes the full-length RNA of the virus, so that the life cycle of the virus is started, and progeny virus is generated.
Mouse model studies demonstrated that glutamate at VP 1145 of EV71is the major site of viral mouse death, and that methylation of lysine at VP 2149 synergistically promoted the ability of VP 1145E to cause mouse death (Huang et al virology.2012,422(1): 132-43). This viral site is susceptible to mutation during in vitro passage of the virus, not 145G, resulting in a reduction in the ability of the virus to infect animals (Yi et al.
Based on the foundation and the current status of the prior art, the inventors of the present application intend to provide infectious cDNA clones based on the EV71 strain and applications thereof.
Disclosure of Invention
The invention aims to provide infectious cDNA clone based on EV71 strain and application thereof based on the foundation and the current situation of the prior art. In particular to a stable infectious cDNA clone of EV71 strain, which can be automatically replicated in cells, generate progeny virus particles and express a reporter gene.
The invention also provides recombinant virus or subunit virus particles, plasmids and the like constructed on the basis of the clone, and provides support for constructing animal models, developing vaccines and developing antiviral drugs.
The EV71 strain (named as js1) is clinically isolated, mouse adaptive mutation is not needed, mice which are not changed in gene background can be infected, virus particles with stable gene sequences can be generated by constructing infectious clone of the strain, common mice are infected, and a simple and efficient EV71 animal infection model is established.
More specifically, the present invention is to provide a novel,
the invention provides a cDNA, which comprises a nucleic acid sequence of EV71 strain and a nucleic acid sequence of a low-copy plasmid skeleton; the nucleic acid sequence of the strain EV71 covers the 5 'to 3' forward polarity sequence of the EV71 virus, including the 5 'and 3' non-coding regions of the virus and one open reading frame encoding viral proteins.
Preferably, it also includes the sequence of reporter gene luciferase or fluorescent protein inserted into the nucleic acid sequence of EV71 strain.
The amino acid sequence of the virus protein open reading frame is shown as SEQ ID NO 4.
The coding sequence of the low-copy plasmid skeleton is shown as SEQ ID NO 3.
The nucleic acid sequence of the EV71 strain is shown as SEQ ID NO 2.
In a preferred embodiment of the invention, the sequence of the infectious cDNA clone of the EV71 strain is shown as SEQ ID NO 1.
In one embodiment of the invention, the construction of a stable infectious cDNA clone of a clinically isolated EV71 strain (nucleic acid sequence 1), derivative clones containing various reporter genes (nucleic acid sequence 5 and nucleic acid sequence 6) and various mutant clones constructed by taking the derivative clones as a parent are provided. The viral RNA produced by these clones is capable of self-replication in cells, producing progeny viral particles, and expressing reporter genes.
The invention also comprises a recombinant virus clone which is constructed by taking the sequence of the nucleic acid sequence 6 or the nucleic acid sequence 7 as a female parent and replacing Nluc or EGFP and contains a heterologous report sequence or a target gene, and a sequence thereof.
The invention also includes various chimeric viruses produced by infectious cloning of various chimeric viruses and cloning of recombinant viruses containing heterologous reporter sequences or target genes and various viral particles containing reporter genes or foreign genes.
The invention also includes the recombinant virus clone which is constructed by the full-length infectious clone sequence and has heterologous resistance sequence inserted into the same open reading frame in the virus protein and the sequence thereof.
Specifically, the invention provides an infectious cDNA clone (nucleic acid sequence 1) containing a clinically isolated EV71 strain (js1), wherein the infectious clone (nucleic acid sequence 1) comprises a nucleic acid sequence (nucleic acid sequence 2) of a full-length EV71 strain (js1) and a low-copy plasmid skeleton (nucleic acid sequence 3). The nucleic acid sequence 2 covers 5 'to 3' positive polarity (positive-sense) sequences of EV71 virus, which comprises 5 'and 3' non-coding regions of the virus and an open reading frame (open reading frame) for coding virus proteins, an open reading frame virus coding protein (protein sequence 4), wherein a reporter gene luciferase NanoLuc (Nluc) and a fluorescent protein EGFP are inserted into the infectious clone (nucleic acid sequence 1) to respectively form an infectious clone with Nluc (nucleic acid sequence 5) and an infectious clone with EGFP (nucleic acid sequence 6), and mutant virus clones (attached viruses), attenuated virus clones (live-attenuated viruses), replication defective virus clones (defective viruses) and replication non-infectious clones (replication-reactive-infectious clones) obtained by means of changing nucleic acids on the basis of the clones, such as a subgenomic replicon comprising a deletion of a structural protein.
The sequences 1 to 6 are specifically as follows:
GCTAGCGGAGTGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTGTCGACGCGGCCGCTAATACGACTCACTATAGGTTAAAACAGCCTGTGGGTTGCACCCACTCACAGGGCCTACTGGGCGCAAGCACTCTGGTACCTCGGTACCTTTGTGCGCCTGTTTTACACCCCCCCCCCAATGAAACTTAGAAGCAATAAACCACGATCAATAGCAGGCATAACGCTCCAGTTATGTCTTGATCAAGCACTTCTGTTTCCCCGGACTGAGTATCAATAGACTGCTCGCGCGGTTGAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGGAAAACCTAGTAACACCATGAAAGTTGCGGAGAGCTTCGTTCAGCACTCCCCCAGTGTAGATCAGGTCGATGAGTCACCGCGTTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCCATGGGGTAACCCATGGGGCGCTCTAATACGGACATGGTGTGAAGAGTCTACTGAGCTAGTTGGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCCAACTGCGGAGCACACGCCCACAAGCCAGCGGGTAGTGTGTCGTAACGGGTAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCCTTTTATCTTTATATTGGCTGCTTATGGTGACAATTAAAGAATTGTTACCATATAGCTATTGGATTAGCCATCCGGTGTGCAACAGAGCAATTATTTACCTATTTATTGGTTTTGTACCATTAACCTCGAATTCTGTGACCACCCTTAATTATATCTTGACCCTTAACACAGCTAAACATGGGTTCGCAAGTGTCTACACAGCGCTCCGGTTCTTACGAAAACTCAAACTCAGCCACTGAGGGTTCTACCATAAACTACACCACCATTAATTACTACAAAGACTCCTATGCTGCCACAGCAGGCAAaCAGAGTCTCAAGCAGGATCCAGACAAGTTTGCAAATCCTGTTAAAGACATATTCACcGAAATGGCAGCGCCACTGAAGTCCCCATCCGCTGAGGCATGTGGATACAGTGATCGAGTGGCGCAATTAACTATTGGCAACTCCACCATCACGACGCAAGAAGCGGCTAACATCATAGTCGGCTATGGTGAGTGGCCTTCCTACTGCTCAGATTCTGACGCTACAGCAGTGGATAAACCAACGCGCCCGGATGTTTCAGTGAACAGGTTTTACACATTGGACACTAAATTGTGGGAGAAATCGTCCAAGGGATGGTACTGGAAGTTCCCGGATGTGTTAACTGAAACTGGGGTTTTTGGGCAAAATGCACAATTCCACTACCTCTACCGATCAGGGTTCTGCATCCACGTGCAGTGCAATGCCAGTAAATTCCACCAAGGAgCACTcCtAgTCGCTGTCCTACCAGAGTATGTCATTGGGACAGTGGCAGGCGGTACAGGGACGGAAGACACCCACCCCCCCTACAAGCAGACCCAACCCGGCGCCGATGGTTTCGAGTTGCAACACCCGTACGTGCTTGATGCTGGCATCCCAATATCACAGTTAACAGTGTGCCCACACCAGTGGATTAATTTGAGGACCAACAATTGTGCTACAATAATAGTGCCATACATTAACGCACTGCCTTTTGATTCTGCCTTGAACCATTGCAACTTTGGCCTGTTAGTTGTGCCTATTAGCCCACTAGACTACGACCAAGGAGCAACGCCAGTAATCCCTATAACTATCACATTGGCCCCAATGTGCTCTGAATTCGCAGGTCTTAGGCAGGCAGTCACGCAAGGGTTCCCCACCGAGCTAAAACCTGGCACAAATCAATTTTTAACCACCGATGATGGCGTCTCAGCACCTATTCTACCAAACTTCCACCCCACCCCGTGTATCCACATACCTGGTGAAGTTAGGAACTTGCTAGAGTTATGCCAGGTGGAGACCATTCTGGAGGTTAACAATGTGCCCACGAATGCCACTAGCTTAATGGAGAGACTGCGCTTCCCGGTCTCAGCACAAGCAGGGAAAGGTGAACTGTGTGCGGTGTTTAGAGCCGATCCTGGGCGAAATGGACCATGGCAATCCACCTTACTGGGCCAGTTGTGCGGGTACTACACCCAATGGTCAGGGTCATTGGAAGTCACCTTCATGTTTACTGGATCCTTCATGGCTACCGGCAAGATGCTCATAGCCTATACACCGCCAGGGGGTCCTCTGCCCAAGGACCGGGCGACCGCCATGTTGGGCACGCACGTCATCTGGGATTTTGGGCTGCAATCGTCTGTTACCCTTGTAATACCATGGATCAGTAACACTCATTATAGAGCACATGCCCGAGATGGAGTGTTTGACTATTACACTACAGGGTTAGTCAGTATATGGTACCAGACAAATTACGTGGTTCCAATCGGTGCGCCCAACACAGCCTATATAATAGCACTAGCGGCAGCCCAAAAGAACTTCACTATGAAATTGTGCAAGGATGCTAGTGATATCCTGCAGACGGGCACCATCCAGGGAGATAGGGTGGCAGATGTAATTGAAAGTTCCATAGGAGATAGCGTGAGCAGAGCCCTCACTCACGCTCTACCAGCACCCACAGGCCAAAACACACAGGTGAGCAGTCATCGACTGGATACAGGCAAGGTTCCAGCACTCCAAGCTGCTGAAATTGGGGCATCATCAAATGCTAGTGACGAGAGCATGATTGAAACACGTTGTGTTCTTAACTCGCATAGTACAGCTGAGACCACTCTTGATAGTTTCTTCAGTAGGGCAGGATTAGTTGGAGAGATAGATCTCCCTCTTGAGGGCACAACTAACCCAAATGGTTATGCCAACTGGGACATAGATATAACAGGTTACGCGCAAATGCGTAGAAAGGTAGAGCTATTCACCTACATGCGTTTTGATGCAGAGTTCACTTTTGTTGCGTGCACACCCACCGGGGAGGTTGTCCCACAATTGCTCCAATATATGTTTGTGCCACCTGGAGCCCCTAAGCCAGATTCTAGGGAATCCCTTGCATGGCAAACCGCCACCAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCCGGCGCAGGTTTCAGTGCCATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGAGAACACAAACAGGAGAAAGACCTTGAATACGGGGCATGTCCTAATAACATGATGGGTACATTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGTTAGGATTTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGCAACCAGAACTACCTGTTCAAAGCCAACCCAAATTATGCTGGCAACTCTATTAAGCCAACTGGTGCCAGTCGCACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGGGCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACCCATAATGATTGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATCCACCACTGCCCAAGGTTGTGACACGATTGCCCGTTGCGATTGCCAGACAGGGGTGTACTACTGTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATGTAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAGGGTCACTCGGAACCTGGTGATTGCGGTGGTATCCTTAGGTGCCAACATGGCGTCATCGGCATAGTGTCTACTGGTGGCAATGGGCTCGTTGGCTTTGCAGACGTCAGAGACCTCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATTAAGGGTCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAAGCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATTTTGAAAAATCTTATTAAACTAATCTCTGCACTGGTGATTGTGATCAGAAGTGATTACGACATGGTTACCCTCACTGCAACCTTAGCGCTGATAGGTTGTCATGGCAGTCCTTGGGCTTGGATTAAAGCCAAAACAGCCTCCATCTTAGGTATCCCTATCGCCCAAAAGCAGAGCGCTTCCTGGCTCAAGAAGTTCAATGACATGGCCAACGCCGCTAAGGGGTTAGAGTGGGTTTCCAACAAGATCAGCAAATTTATTGATTGGCTTAAGGAGAAAATAGTACCAGCAGCCAGGGAGAAGGTTGAATTCCTAAATAACTTGAAACAGCTGCCACTGCTAGAGAATCAGATCTCGAACTTGGAACAATCTGCTGCTTCACAAGAGGACCTTGAAGTCATGTTTGGGAATGTGTCGTACCTAGCTCACTTCTGTCGCAAGTTTCAACCGCTATACGCCACGGAAGCTAAAAGAGTCTATGCCCTGGAGAAGAGAATGAATAACTATATGCAGTTCAAGAGCAAACACCGAATTGAACCTGTATGTCTCATTATTAGGGGCTCACCAGGCACCGGGAAGTCTCTAGCCACTGGTATTATTGCTCGAGCAATCGCTGATAAGTACCACTCCAGCGTGTACTCGCTCCCACCAGACCCGGATCATTTTGACGGTTACAAGCAACAGGTGGTTACAGTGATGGATGATTTGTGTCAAAACCCCGATGGTAAGGATATGTCCTTATTCTGTCAAATGGTATCCACCGTAGATTTCATTCCACCAATGGCTTCTCTCGAGGAGAAGGGAGTTTCCTTCACCTCTAAGTTTGTCATCGCATCCACTAATGCCAGTAATATCATAGTACCAACAGTGTCTGATTCTGACGCTATTCGCCGCAGGTTCTACATGGACTGTGACATTGAAGTGACAGACTCGTACAAAACAGATCTAGGTAGACTGGATGCAGGGCGAGCCGCTAAACTGTGTTCTGAAAATAACACTGCAAATTTCAAACGTTGCAGCCCATTAGTGTGTGGGAAAGCCATCCAACTTAGAGATAGAAAGTCTAAAGTCAGATACAGTGTGGATACGGTGGTTTCAGAACTTATTAGGGAATACAGCAATAGGTCCGCCATTGGTAACACAATCGAGGCTCTTTTCCAAGGTCCACCCAAGTTCAGGCCAATTAGGATTAGCCTTGAAGAAAAACCAGCCCCAGACGCTATTAGCGATCTCCTTGCTAGTGTAGATAGTGAAGAAGTGCGCCAGTACTGCAGGGATCAAGGCTGGATTATTCCTGAAGCTCCCACCAATGTGGAGCGGCACCTTAATAGAGCGGTGCTCGTCATGCAATCCATCACCACAGTAGTGGCGGTTGTTTCGTTGGTGTACGTCATCTACAAGCTCTTTGCAGGGTTTCAGGGTGCATATTCTGGTGCTCCTAAGCAAGTGCTTAAGAAACCTGCTCTTCGCACAGCAACAGTGCAGGGTCCGAGCCTTGACTTTGCTCTCTCCCTACTGAGAAGGAACATCAGGCAGGTCCAAACAGACCAAGGGCATTTCACCATGTTGGGTGTTAGGGATCGCTTAGCAGTCCTCCCACGCCACTCACAACCTGGCAAAACCATTTGGATTGAGCACAAACTCGTGAACGTCCTTGATGCAGTTGAACTGGTGGATGAGCAAGGAGTCAACCTGGAATTAACCCTCATCACTCTTGACACCAACGAGAAGTTTAGGGATATCACCAAATTCATCCCAGAAAATATCAGCACTGCTAGCGATGCCACCCTAGTGATCAACACGGAGCACATGCCGTCAATGTTTGTCCCGGTGGGTGACGTTGTGCAGTATGGCTTTTTGAATCTCAGTGGCAAGCCTACCCATCGCACCATGATGTACAATTTTCCTACTAAAGCAGGACAGTGTGGAGGAGTGGTGACATCTGTTGGGAAGGTTGTCGGTATTCACATTGGTGGCAATGGCAGACAAGGTTTTTGCGCAGGCCTCAAAAGGAGTTACTTTGCTAGTGAACAAGGAGAGATCCAGTGGGTTAAGCCCAATAAAGAAAcTggAAGACTCAACATCAATGGACCAACCCGCACCAAGTTAGAACCTAGTGTATTCCATGACATCTTCGAGGGAAATAAGGAACCAGCTGTCTTGCACAGTAAAGACCCCCGACTTGAGGTAGATTTTGAACAGGCCCTGTTCTCTAAGTATGTGGGAAACACACTACATGAGCCTGACGAGTACATCAAAGAGGCAGCTCTACATTATGCAAACCAATTAAAGCAACTAGAAATCAATACCTCTCAAATGAGCATGGAGGAGGCCTGCTATGGTACTGAGAATCTTGAGGCTATTGATCTTCACACTAGTGCAGGTTACCCCTATAGTGCCCTAGGGATAAAGAAAAGAGACATCTTAGACCCTACCACCAGGGACGTGAGTAGAATGAAGTTCTACATGGACAAGTATGGTCTTGATCTTCCCTACTCCACTTATGTCAAGGACGAGCTACGCTCGATTGATAAAATCAAGAAAGGGAAGTCCCGCCTGATCGAGGCCAGTAGTCTAAATGATTCAGTGTACCTCAGAATGGCTTTCGGGCATTTGTATGAGGCTTTCCACGCAAATCCTGGGACGATAACTGGATCGGCCGTGGGGTGTAACCCTGACACATTCTGGAGCAAGCTGCCAATTTTGCTCCCTGGTTCACTCTTTGCCTTTGACTACTCAGGCTATGATGCCAGCCTTAGCCCTGTCTGGTTCAGAGCATTAGAATTGGTTCTTAGGGAGATAGGGTATAGTGAAGAGGCAATCTCACTCATTGAGGGAATCAACCACACACATCATGTGTATCGTAATAAGACCTATTGCGTGCTTGGTGGGATGCCCTCAGGCTGTTCAGGAACATCCATCTTCAACTCAATGATCAACAACATTATTATCAGAGCACTGCTCATAAAAACATTTAAGGGCATTGATTTGGATGAACTCAACATGGTCGCTTATGGAGACGATGTGCTCGCTAGCTATCCCTTCCCAATTGATTGCTTGGAACTAGCAAAGACTGGTAAGGAGTATGGTCTGACCATGACCCCTGCTGATAAATCTCCTTGCTTTAATGAGGTCAATTGGGGTAATGCGACCTTCCTCAAAAGGGGCTTTTTGCCCGATGAACAGTTTCCATTTTTGATTCACCCTACTATGCCAATGAGGGAGATCCATGAGTCCATTCGATGGACCAAGGACGCACGGAACACTCAAGATCATGTGCGGTCCTTGTGCCTCCTAGCATGGCATAATGGTAAGCAAGAATACGAGAAGTTTGTGAGCACAATTAGGTCTGTCCCAGTAGGGAGAGCGTTGGCTATTCCAAATTATGAAAATCTTAGACGAAATTGGCTCGAGTTATTTTAGAGGTTATACACACCTCAACCCCACCAGAAATCTGGTCGTGAATGTGACTGGTGGGGGTAAATTTGTTATAACCAGAATAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAaagcttat
TTAAAACAGCCTGTGGGTTGCACCCACTCACAGGGCCTACTGGGCGCAAGCACTCTGGTACCTCGGTACCTTTGTGCGCCTGTTTTACACCCCCCCCCCAATGAAACTTAGAAGCAATAAACCACGATCAATAGCAGGCATAACGCTCCAGTTATGTCTTGATCAAGCACTTCTGTTTCCCCGGACTGAGTATCAATAGACTGCTCGCGCGGTTGAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGGAAAACCTAGTAACACCATGAAAGTTGCGGAGAGCTTCGTTCAGCACTCCCCCAGTGTAGATCAGGTCGATGAGTCACCGCGTTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCCATGGGGTAACCCATGGGGCGCTCTAATACGGACATGGTGTGAAGAGTCTACTGAGCTAGTTGGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCCAACTGCGGAGCACACGCCCACAAGCCAGCGGGTAGTGTGTCGTAACGGGTAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCCTTTTATCTTTATATTGGCTGCTTATGGTGACAATTAAAGAATTGTTACCATATAGCTATTGGATTAGCCATCCGGTGTGCAACAGAGCAATTATTTACCTATTTATTGGTTTTGTACCATTAACCTCGAATTCTGTGACCACCCTTAATTATATCTTGACCCTTAACACAGCTAAACATGGGTTCGCAAGTGTCTACACAGCGCTCCGGTTCTTACGAAAACTCAAACTCAGCCACTGAGGGTTCTACCATAAACTACACCACCATTAATTACTACAAAGACTCCTATGCTGCCACAGCAGGCAAaCAGAGTCTCAAGCAGGATCCAGACAAGTTTGCAAATCCTGTTAAAGACATATTCACcGAAATGGCAGCGCCACTGAAGTCCCCATCCGCTGAGGCATGTGGATACAGTGATCGAGTGGCGCAATTAACTATTGGCAACTCCACCATCACGACGCAAGAAGCGGCTAACATCATAGTCGGCTATGGTGAGTGGCCTTCCTACTGCTCAGATTCTGACGCTACAGCAGTGGATAAACCAACGCGCCCGGATGTTTCAGTGAACAGGTTTTACACATTGGACACTAAATTGTGGGAGAAATCGTCCAAGGGATGGTACTGGAAGTTCCCGGATGTGTTAACTGAAACTGGGGTTTTTGGGCAAAATGCACAATTCCACTACCTCTACCGATCAGGGTTCTGCATCCACGTGCAGTGCAATGCCAGTAAATTCCACCAAGGAgCACTcCtAgTCGCTGTCCTACCAGAGTATGTCATTGGGACAGTGGCAGGCGGTACAGGGACGGAAGACACCCACCCCCCCTACAAGCAGACCCAACCCGGCGCCGATGGTTTCGAGTTGCAACACCCGTACGTGCTTGATGCTGGCATCCCAATATCACAGTTAACAGTGTGCCCACACCAGTGGATTAATTTGAGGACCAACAATTGTGCTACAATAATAGTGCCATACATTAACGCACTGCCTTTTGATTCTGCCTTGAACCATTGCAACTTTGGCCTGTTAGTTGTGCCTATTAGCCCACTAGACTACGACCAAGGAGCAACGCCAGTAATCCCTATAACTATCACATTGGCCCCAATGTGCTCTGAATTCGCAGGTCTTAGGCAGGCAGTCACGCAAGGGTTCCCCACCGAGCTAAAACCTGGCACAAATCAATTTTTAACCACCGATGATGGCGTCTCAGCACCTATTCTACCAAACTTCCACCCCACCCCGTGTATCCACATACCTGGTGAAGTTAGGAACTTGCTAGAGTTATGCCAGGTGGAGACCATTCTGGAGGTTAACAATGTGCCCACGAATGCCACTAGCTTAATGGAGAGACTGCGCTTCCCGGTCTCAGCACAAGCAGGGAAAGGTGAACTGTGTGCGGTGTTTAGAGCCGATCCTGGGCGAAATGGACCATGGCAATCCACCTTACTGGGCCAGTTGTGCGGGTACTACACCCAATGGTCAGGGTCATTGGAAGTCACCTTCATGTTTACTGGATCCTTCATGGCTACCGGCAAGATGCTCATAGCCTATACACCGCCAGGGGGTCCTCTGCCCAAGGACCGGGCGACCGCCATGTTGGGCACGCACGTCATCTGGGATTTTGGGCTGCAATCGTCTGTTACCCTTGTAATACCATGGATCAGTAACACTCATTATAGAGCACATGCCCGAGATGGAGTGTTTGACTATTACACTACAGGGTTAGTCAGTATATGGTACCAGACAAATTACGTGGTTCCAATCGGTGCGCCCAACACAGCCTATATAATAGCACTAGCGGCAGCCCAAAAGAACTTCACTATGAAATTGTGCAAGGATGCTAGTGATATCCTGCAGACGGGCACCATCCAGGGAGATAGGGTGGCAGATGTAATTGAAAGTTCCATAGGAGATAGCGTGAGCAGAGCCCTCACTCACGCTCTACCAGCACCCACAGGCCAAAACACACAGGTGAGCAGTCATCGACTGGATACAGGCAAGGTTCCAGCACTCCAAGCTGCTGAAATTGGGGCATCATCAAATGCTAGTGACGAGAGCATGATTGAAACACGTTGTGTTCTTAACTCGCATAGTACAGCTGAGACCACTCTTGATAGTTTCTTCAGTAGGGCAGGATTAGTTGGAGAGATAGATCTCCCTCTTGAGGGCACAACTAACCCAAATGGTTATGCCAACTGGGACATAGATATAACAGGTTACGCGCAAATGCGTAGAAAGGTAGAGCTATTCACCTACATGCGTTTTGATGCAGAGTTCACTTTTGTTGCGTGCACACCCACCGGGGAGGTTGTCCCACAATTGCTCCAATATATGTTTGTGCCACCTGGAGCCCCTAAGCCAGATTCTAGGGAATCCCTTGCATGGCAAACCGCCACCAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCCGGCGCAGGTTTCAGTGCCATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGAGAACACAAACAGGAGAAAGACCTTGAATACGGGGCATGTCCTAATAACATGATGGGTACATTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGTTAGGATTTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGCAACCAGAACTACCTGTTCAAAGCCAACCCAAATTATGCTGGCAACTCTATTAAGCCAACTGGTGCCAGTCGCACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGGGCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACCCATAATGATTGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATCCACCACTGCCCAAGGTTGTGACACGATTGCCCGTTGCGATTGCCAGACAGGGGTGTACTACTGTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATGTAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAGGGTCACTCGGAACCTGGTGATTGCGGTGGTATCCTTAGGTGCCAACATGGCGTCATCGGCATAGTGTCTACTGGTGGCAATGGGCTCGTTGGCTTTGCAGACGTCAGAGACCTCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATTAAGGGTCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAAGCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATTTTGAAAAATCTTATTAAACTAATCTCTGCACTGGTGATTGTGATCAGAAGTGATTACGACATGGTTACCCTCACTGCAACCTTAGCGCTGATAGGTTGTCATGGCAGTCCTTGGGCTTGGATTAAAGCCAAAACAGCCTCCATCTTAGGTATCCCTATCGCCCAAAAGCAGAGCGCTTCCTGGCTCAAGAAGTTCAATGACATGGCCAACGCCGCTAAGGGGTTAGAGTGGGTTTCCAACAAGATCAGCAAATTTATTGATTGGCTTAAGGAGAAAATAGTACCAGCAGCCAGGGAGAAGGTTGAATTCCTAAATAACTTGAAACAGCTGCCACTGCTAGAGAATCAGATCTCGAACTTGGAACAATCTGCTGCTTCACAAGAGGACCTTGAAGTCATGTTTGGGAATGTGTCGTACCTAGCTCACTTCTGTCGCAAGTTTCAACCGCTATACGCCACGGAAGCTAAAAGAGTCTATGCCCTGGAGAAGAGAATGAATAACTATATGCAGTTCAAGAGCAAACACCGAATTGAACCTGTATGTCTCATTATTAGGGGCTCACCAGGCACCGGGAAGTCTCTAGCCACTGGTATTATTGCTCGAGCAATCGCTGATAAGTACCACTCCAGCGTGTACTCGCTCCCACCAGACCCGGATCATTTTGACGGTTACAAGCAACAGGTGGTTACAGTGATGGATGATTTGTGTCAAAACCCCGATGGTAAGGATATGTCCTTATTCTGTCAAATGGTATCCACCGTAGATTTCATTCCACCAATGGCTTCTCTCGAGGAGAAGGGAGTTTCCTTCACCTCTAAGTTTGTCATCGCATCCACTAATGCCAGTAATATCATAGTACCAACAGTGTCTGATTCTGACGCTATTCGCCGCAGGTTCTACATGGACTGTGACATTGAAGTGACAGACTCGTACAAAACAGATCTAGGTAGACTGGATGCAGGGCGAGCCGCTAAACTGTGTTCTGAAAATAACACTGCAAATTTCAAACGTTGCAGCCCATTAGTGTGTGGGAAAGCCATCCAACTTAGAGATAGAAAGTCTAAAGTCAGATACAGTGTGGATACGGTGGTTTCAGAACTTATTAGGGAATACAGCAATAGGTCCGCCATTGGTAACACAATCGAGGCTCTTTTCCAAGGTCCACCCAAGTTCAGGCCAATTAGGATTAGCCTTGAAGAAAAACCAGCCCCAGACGCTATTAGCGATCTCCTTGCTAGTGTAGATAGTGAAGAAGTGCGCCAGTACTGCAGGGATCAAGGCTGGATTATTCCTGAAGCTCCCACCAATGTGGAGCGGCACCTTAATAGAGCGGTGCTCGTCATGCAATCCATCACCACAGTAGTGGCGGTTGTTTCGTTGGTGTACGTCATCTACAAGCTCTTTGCAGGGTTTCAGGGTGCATATTCTGGTGCTCCTAAGCAAGTGCTTAAGAAACCTGCTCTTCGCACAGCAACAGTGCAGGGTCCGAGCCTTGACTTTGCTCTCTCCCTACTGAGAAGGAACATCAGGCAGGTCCAAACAGACCAAGGGCATTTCACCATGTTGGGTGTTAGGGATCGCTTAGCAGTCCTCCCACGCCACTCACAACCTGGCAAAACCATTTGGATTGAGCACAAACTCGTGAACGTCCTTGATGCAGTTGAACTGGTGGATGAGCAAGGAGTCAACCTGGAATTAACCCTCATCACTCTTGACACCAACGAGAAGTTTAGGGATATCACCAAATTCATCCCAGAAAATATCAGCACTGCTAGCGATGCCACCCTAGTGATCAACACGGAGCACATGCCGTCAATGTTTGTCCCGGTGGGTGACGTTGTGCAGTATGGCTTTTTGAATCTCAGTGGCAAGCCTACCCATCGCACCATGATGTACAATTTTCCTACTAAAGCAGGACAGTGTGGAGGAGTGGTGACATCTGTTGGGAAGGTTGTCGGTATTCACATTGGTGGCAATGGCAGACAAGGTTTTTGCGCAGGCCTCAAAAGGAGTTACTTTGCTAGTGAACAAGGAGAGATCCAGTGGGTTAAGCCCAATAAAGAAAcTggAAGACTCAACATCAATGGACCAACCCGCACCAAGTTAGAACCTAGTGTATTCCATGACATCTTCGAGGGAAATAAGGAACCAGCTGTCTTGCACAGTAAAGACCCCCGACTTGAGGTAGATTTTGAACAGGCCCTGTTCTCTAAGTATGTGGGAAACACACTACATGAGCCTGACGAGTACATCAAAGAGGCAGCTCTACATTATGCAAACCAATTAAAGCAACTAGAAATCAATACCTCTCAAATGAGCATGGAGGAGGCCTGCTATGGTACTGAGAATCTTGAGGCTATTGATCTTCACACTAGTGCAGGTTACCCCTATAGTGCCCTAGGGATAAAGAAAAGAGACATCTTAGACCCTACCACCAGGGACGTGAGTAGAATGAAGTTCTACATGGACAAGTATGGTCTTGATCTTCCCTACTCCACTTATGTCAAGGACGAGCTACGCTCGATTGATAAAATCAAGAAAGGGAAGTCCCGCCTGATCGAGGCCAGTAGTCTAAATGATTCAGTGTACCTCAGAATGGCTTTCGGGCATTTGTATGAGGCTTTCCACGCAAATCCTGGGACGATAACTGGATCGGCCGTGGGGTGTAACCCTGACACATTCTGGAGCAAGCTGCCAATTTTGCTCCCTGGTTCACTCTTTGCCTTTGACTACTCAGGCTATGATGCCAGCCTTAGCCCTGTCTGGTTCAGAGCATTAGAATTGGTTCTTAGGGAGATAGGGTATAGTGAAGAGGCAATCTCACTCATTGAGGGAATCAACCACACACATCATGTGTATCGTAATAAGACCTATTGCGTGCTTGGTGGGATGCCCTCAGGCTGTTCAGGAACATCCATCTTCAACTCAATGATCAACAACATTATTATCAGAGCACTGCTCATAAAAACATTTAAGGGCATTGATTTGGATGAACTCAACATGGTCGCTTATGGAGACGATGTGCTCGCTAGCTATCCCTTCCCAATTGATTGCTTGGAACTAGCAAAGACTGGTAAGGAGTATGGTCTGACCATGACCCCTGCTGATAAATCTCCTTGCTTTAATGAGGTCAATTGGGGTAATGCGACCTTCCTCAAAAGGGGCTTTTTGCCCGATGAACAGTTTCCATTTTTGATTCACCCTACTATGCCAATGAGGGAGATCCATGAGTCCATTCGATGGACCAAGGACGCACGGAACACTCAAGATCATGTGCGGTCCTTGTGCCTCCTAGCATGGCATAATGGTAAGCAAGAATACGAGAAGTTTGTGAGCACAATTAGGTCTGTCCCAGTAGGGAGAGCGTTGGCTATTCCAAATTATGAAAATCTTAGACGAAATTGGCTCGAGTTATTTTAGAGGTTATACACACCTCAACCCCACCAGAAATCTGGTCGTGAATGTGACTGGTGGGGGTAAATTTGTTATAACCAGAATAGC
AGCGCTAGCGGAGTGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTGTCGACGCGGCCGC
MGSQVSTQRSGSYENSNSATEGSTINYTTINYYKDSYAATAGKQSLKQDPDKFANPVKDIFTEMAAPLKSPSAEACGYSDRVAQLTIGNSTITTQEAANIIVGYGEWPSYCSDSDATAVDKPTRPDVSVNRFYTLDTKLWEKSSKGWYWKFPDVLTETGVFGQNAQFHYLYRSGFCIHVQCNASKFHQGALLVAVLPEYVIGTVAGGTGTEDTHPPYKQTQPGADGFELQHPYVLDAGIPISQLTVCPHQWINLRTNNCATIIVPYINALPFDSALNHCNFGLLVVPISPLDYDQGATPVIPITITLAPMCSEFAGLRQAVTQGFPTELKPGTNQFLTTDDGVSAPILPNFHPTPCIHIPGEVRNLLELCQVETILEVNNVPTNATSLMERLRFPVSAQAGKGELCAVFRADPGRNGPWQSTLLGQLCGYYTQWSGSLEVTFMFTGSFMATGKMLIAYTPPGGPLPKDRATAMLGTHVIWDFGLQSSVTLVIPWISNTHYRAHARDGVFDYYTTGLVSIWYQTNYVVPIGAPNTAYIIALAAAQKNFTMKLCKDASDILQTGTIQGDRVADVIESSIGDSVSRALTHALPAPTGQNTQVSSHRLDTGKVPALQAAEIGASSNASDESMIETRCVLNSHSTAETTLDSFFSRAGLVGEIDLPLEGTTNPNGYANWDIDITGYAQMRRKVELFTYMRFDAEFTFVACTPTGEVVPQLLQYMFVPPGAPKPDSRESLAWQTATNPSVFVKLSDPPAQVSVPFMSPASAYQWFYDGYPTFGEHKQEKDLEYGACPNNMMGTFSVRTVGTSKSKYPLVVRIYMRMKHVRAWIPRPMRNQNYLFKANPNYAGNSIKPTGASRTAITTLGKFGQQSGAIYVGNFRVVNRHLATHNDWANLVWEDSSRDLLVSSTTAQGCDTIARCDCQTGVYYCNSMRKHYPVSFSKPSLIYVEASEYYPARYQSHLMLAQGHSEPGDCGGILRCQHGVIGIVSTGGNGLVGFADVRDLLWLDEEAMEQGVSDYIKGLGDAFGTGFTDAVSREVEALKNYLIGSEGAVEKILKNLIKLISALVIVIRSDYDMVTLTATLALIGCHGSPWAWIKAKTASILGIPIAQKQSASWLKKFNDMANAAKGLEWVSNKISKFIDWLKEKIVPAAREKVEFLNNLKQLPLLENQISNLEQSAASQEDLEVMFGNVSYLAHFCRKFQPLYATEAKRVYALEKRMNNYMQFKSKHRIEPVCLIIRGSPGTGKSLATGIIARAIADKYHSSVYSLPPDPDHFDGYKQQVVTVMDDLCQNPDGKDMSLFCQMVSTVDFIPPMASLEEKGVSFTSKFVIASTNASNIIVPTVSDSDAIRRRFYMDCDIEVTDSYKTDLGRLDAGRAAKLCSENNTANFKRCSPLVCGKAIQLRDRKSKVRYSVDTVVSELIREYSNRSAIGNTIEALFQGPPKFRPIRISLEEKPAPDAISDLLASVDSEEVRQYCRDQGWIIPEAPTNVERHLNRAVLVMQSITTVVAVVSLVYVIYKLFAGFQGAYSGAPKQVLKKPALRTATVQGPSLDFALSLLRRNIRQVQTDQGHFTMLGVRDRLAVLPRHSQPGKTIWIEHKLVNVLDAVELVDEQGVNLELTLITLDTNEKFRDITKFIPENISTASDATLVINTEHMPSMFVPVGDVVQYGFLNLSGKPTHRTMMYNFPTKAGQCGGVVTSVGKVVGIHIGGNGRQGFCAGLKRSYFASEQGEIQWVKPNKETGRLNINGPTRTKLEPSVFHDIFEGNKEPAVLHSKDPRLEVDFEQALFSKYVGNTLHEPDEYIKEAALHYANQLKQLEINTSQMSMEEACYGTENLEAIDLHTSAGYPYSALGIKKRDILDPTTRDVSRMKFYMDKYGLDLPYSTYVKDELRSIDKIKKGKSRLIEASSLNDSVYLRMAFGHLYEAFHANPGTITGSAVGCNPDTFWSKLPILLPGSLFAFDYSGYDASLSPVWFRALELVLREIGYSEEAISLIEGINHTHHVYRNKTYCVLGGMPSGCSGTSIFNSMINNIIIRALLIKTFKGIDLDELNMVAYGDDVLASYPFPIDCLELAKTGKEYGLTMTPADKSPCFNEVNWGNATFLKRGFLPDEQFPFLIHPTMPMREIHESIRWTKDARNTQDHVRSLCLLAWHNGKQEYEKFVSTIRSVPVGRALAIPNYENLRRNWLELF
GCTAGCGGAGTGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTGTCGACGCGGCCGCTAATACGACTCACTATAGGTTAAAACAGCCTGTGGGTTGCACCCACTCACAGGGCCTACTGGGCGCAAGCACTCTGGTACCTCGGTACCTTTGTGCGCCTGTTTTACACCCCCCCCCCAATGAAACTTAGAAGCAATAAACCACGATCAATAGCAGGCATAACGCTCCAGTTATGTCTTGATCAAGCACTTCTGTTTCCCCGGACTGAGTATCAATAGACTGCTCGCGCGGTTGAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGGAAAACCTAGTAACACCATGAAAGTTGCGGAGAGCTTCGTTCAGCACTCCCCCAGTGTAGATCAGGTCGATGAGTCACCGCGTTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCCATGGGGTAACCCATGGGGCGCTCTAATACGGACATGGTGTGAAGAGTCTACTGAGCTAGTTGGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCCAACTGCGGAGCACACGCCCACAAGCCAGCGGGTAGTGTGTCGTAACGGGTAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCCTTTTATCTTTATATTGGCTGCTTATGGTGACAATTAAAGAATTGTTACCATATAGCTATTGGATTAGCCATCCGGTGTGCAACAGAGCAATTATTTACCTATTTATTGGTTTTGTACCATTAACCTCGAATTCTGTGACCACCCTTAATTATATCTTGACCCTTAACACAGCTAAACtctagaatggtcttcacactcgaagatttcgttggggactggcgacagacagccggctacaacctggaccaagtccttgaacagggaggtgtgtccagtttgtttcagaatctcggggtgtccgtaactccgatccaaaggattgtcctgagcggtgaaaatgggctgaagatcgacatccatgtcatcatcccgtatgaaggtctgagcggcgaccaaatgggccagatcgaaaaaatttttaaggtggtgtaccctgtggatgatcatcactttaaggtgatcctgcactatggcacactggtaatcgacggggttacgccgaacatgatcgactatttcggacggccgtatgaaggcatcgccgtgttcgacggcaaaaagatcactgtaacagggaccctgtggaacggcaacaaaattatcgacgagcgcctgatcaaccccgacggctccctgctgttccgagtaaccatcaacggagtgaccggctggcggctgtgcgaacgcattctggcgatgcatGCGATCACCACTCTTGGTTCGCAAGTGTCTACACAGCGCTCCGGTTCTTACGAAAACTCAAACTCAGCCACTGAGGGTTCTACCATAAACTACACCACCATTAATTACTACAAAGACTCCTATGCTGCCACAGCAGGCAAaCAGAGTCTCAAGCAGGATCCAGACAAGTTTGCAAATCCTGTTAAAGACATATTCACcGAAATGGCAGCGCCACTGAAGTCCCCATCCGCTGAGGCATGTGGATACAGTGATCGAGTGGCGCAATTAACTATTGGCAACTCCACCATCACGACGCAAGAAGCGGCTAACATCATAGTCGGCTATGGTGAGTGGCCTTCCTACTGCTCAGATTCTGACGCTACAGCAGTGGATAAACCAACGCGCCCGGATGTTTCAGTGAACAGGTTTTACACATTGGACACTAAATTGTGGGAGAAATCGTCCAAGGGATGGTACTGGAAGTTCCCGGATGTGTTAACTGAAACTGGGGTTTTTGGGCAAAATGCACAATTCCACTACCTCTACCGATCAGGGTTCTGCATCCACGTGCAGTGCAATGCCAGTAAATTCCACCAAGGAgCACTcCtAgTCGCTGTCCTACCAGAGTATGTCATTGGGACAGTGGCAGGCGGTACAGGGACGGAAGACACCCACCCCCCCTACAAGCAGACCCAACCCGGCGCCGATGGTTTCGAGTTGCAACACCCGTACGTGCTTGATGCTGGCATCCCAATATCACAGTTAACAGTGTGCCCACACCAGTGGATTAATTTGAGGACCAACAATTGTGCTACAATAATAGTGCCATACATTAACGCACTGCCTTTTGATTCTGCCTTGAACCATTGCAACTTTGGCCTGTTAGTTGTGCCTATTAGCCCACTAGACTACGACCAAGGAGCAACGCCAGTAATCCCTATAACTATCACATTGGCCCCAATGTGCTCTGAATTCGCAGGTCTTAGGCAGGCAGTCACGCAAGGGTTCCCCACCGAGCTAAAACCTGGCACAAATCAATTTTTAACCACCGATGATGGCGTCTCAGCACCTATTCTACCAAACTTCCACCCCACCCCGTGTATCCACATACCTGGTGAAGTTAGGAACTTGCTAGAGTTATGCCAGGTGGAGACCATTCTGGAGGTTAACAATGTGCCCACGAATGCCACTAGCTTAATGGAGAGACTGCGCTTCCCGGTCTCAGCACAAGCAGGGAAAGGTGAACTGTGTGCGGTGTTTAGAGCCGATCCTGGGCGAAATGGACCATGGCAATCCACCTTACTGGGCCAGTTGTGCGGGTACTACACCCAATGGTCAGGGTCATTGGAAGTCACCTTCATGTTTACTGGATCCTTCATGGCTACCGGCAAGATGCTCATAGCCTATACACCGCCAGGGGGTCCTCTGCCCAAGGACCGGGCGACCGCCATGTTGGGCACGCACGTCATCTGGGATTTTGGGCTGCAATCGTCTGTTACCCTTGTAATACCATGGATCAGTAACACTCATTATAGAGCACATGCCCGAGATGGAGTGTTTGACTATTACACTACAGGGTTAGTCAGTATATGGTACCAGACAAATTACGTGGTTCCAATCGGTGCGCCCAACACAGCCTATATAATAGCACTAGCGGCAGCCCAAAAGAACTTCACTATGAAATTGTGCAAGGATGCTAGTGATATCCTGCAGACGGGCACCATCCAGGGAGATAGGGTGGCAGATGTAATTGAAAGTTCCATAGGAGATAGCGTGAGCAGAGCCCTCACTCACGCTCTACCAGCACCCACAGGCCAAAACACACAGGTGAGCAGTCATCGACTGGATACAGGCAAGGTTCCAGCACTCCAAGCTGCTGAAATTGGGGCATCATCAAATGCTAGTGACGAGAGCATGATTGAAACACGTTGTGTTCTTAACTCGCATAGTACAGCTGAGACCACTCTTGATAGTTTCTTCAGTAGGGCAGGATTAGTTGGAGAGATAGATCTCCCTCTTGAGGGCACAACTAACCCAAATGGTTATGCCAACTGGGACATAGATATAACAGGTTACGCGCAAATGCGTAGAAAGGTAGAGCTATTCACCTACATGCGTTTTGATGCAGAGTTCACTTTTGTTGCGTGCACACCCACCGGGGAGGTTGTCCCACAATTGCTCCAATATATGTTTGTGCCACCTGGAGCCCCTAAGCCAGATTCTAGGGAATCCCTTGCATGGCAAACCGCCACCAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCCGGCGCAGGTTTCAGTGCCATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGAGAACACAAACAGGAGAAAGACCTTGAATACGGGGCATGTCCTAATAACATGATGGGTACATTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGTTAGGATTTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGCAACCAGAACTACCTGTTCAAAGCCAACCCAAATTATGCTGGCAACTCTATTAAGCCAACTGGTGCCAGTCGCACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGGGCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACCCATAATGATTGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATCCACCACTGCCCAAGGTTGTGACACGATTGCCCGTTGCGATTGCCAGACAGGGGTGTACTACTGTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATGTAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAGGGTCACTCGGAACCTGGTGATTGCGGTGGTATCCTTAGGTGCCAACATGGCGTCATCGGCATAGTGTCTACTGGTGGCAATGGGCTCGTTGGCTTTGCAGACGTCAGAGACCTCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATTAAGGGTCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAAGCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATTTTGAAAAATCTTATTAAACTAATCTCTGCACTGGTGATTGTGATCAGAAGTGATTACGACATGGTTACCCTCACTGCAACCTTAGCGCTGATAGGTTGTCATGGCAGTCCTTGGGCTTGGATTAAAGCCAAAACAGCCTCCATCTTAGGTATCCCTATCGCCCAAAAGCAGAGCGCTTCCTGGCTCAAGAAGTTCAATGACATGGCCAACGCCGCTAAGGGGTTAGAGTGGGTTTCCAACAAGATCAGCAAATTTATTGATTGGCTTAAGGAGAAAATAGTACCAGCAGCCAGGGAGAAGGTTGAATTCCTAAATAACTTGAAACAGCTGCCACTGCTAGAGAATCAGATCTCGAACTTGGAACAATCTGCTGCTTCACAAGAGGACCTTGAAGTCATGTTTGGGAATGTGTCGTACCTAGCTCACTTCTGTCGCAAGTTTCAACCGCTATACGCCACGGAAGCTAAAAGAGTCTATGCCCTGGAGAAGAGAATGAATAACTATATGCAGTTCAAGAGCAAACACCGAATTGAACCTGTATGTCTCATTATTAGGGGCTCACCAGGCACCGGGAAGTCTCTAGCCACTGGTATTATTGCTCGAGCAATCGCTGATAAGTACCACTCCAGCGTGTACTCGCTCCCACCAGACCCGGATCATTTTGACGGTTACAAGCAACAGGTGGTTACAGTGATGGATGATTTGTGTCAAAACCCCGATGGTAAGGATATGTCCTTATTCTGTCAAATGGTATCCACCGTAGATTTCATTCCACCAATGGCTTCTCTCGAGGAGAAGGGAGTTTCCTTCACCTCTAAGTTTGTCATCGCATCCACTAATGCCAGTAATATCATAGTACCAACAGTGTCTGATTCTGACGCTATTCGCCGCAGGTTCTACATGGACTGTGACATTGAAGTGACAGACTCGTACAAAACAGATCTAGGTAGACTGGATGCAGGGCGAGCCGCTAAACTGTGTTCTGAAAATAACACTGCAAATTTCAAACGTTGCAGCCCATTAGTGTGTGGGAAAGCCATCCAACTTAGAGATAGAAAGTCTAAAGTCAGATACAGTGTGGATACGGTGGTTTCAGAACTTATTAGGGAATACAGCAATAGGTCCGCCATTGGTAACACAATCGAGGCTCTTTTCCAAGGTCCACCCAAGTTCAGGCCAATTAGGATTAGCCTTGAAGAAAAACCAGCCCCAGACGCTATTAGCGATCTCCTTGCTAGTGTAGATAGTGAAGAAGTGCGCCAGTACTGCAGGGATCAAGGCTGGATTATTCCTGAAGCTCCCACCAATGTGGAGCGGCACCTTAATAGAGCGGTGCTCGTCATGCAATCCATCACCACAGTAGTGGCGGTTGTTTCGTTGGTGTACGTCATCTACAAGCTCTTTGCAGGGTTTCAGGGTGCATATTCTGGTGCTCCTAAGCAAGTGCTTAAGAAACCTGCTCTTCGCACAGCAACAGTGCAGGGTCCGAGCCTTGACTTTGCTCTCTCCCTACTGAGAAGGAACATCAGGCAGGTCCAAACAGACCAAGGGCATTTCACCATGTTGGGTGTTAGGGATCGCTTAGCAGTCCTCCCACGCCACTCACAACCTGGCAAAACCATTTGGATTGAGCACAAACTCGTGAACGTCCTTGATGCAGTTGAACTGGTGGATGAGCAAGGAGTCAACCTGGAATTAACCCTCATCACTCTTGACACCAACGAGAAGTTTAGGGATATCACCAAATTCATCCCAGAAAATATCAGCACTGCTAGCGATGCCACCCTAGTGATCAACACGGAGCACATGCCGTCAATGTTTGTCCCGGTGGGTGACGTTGTGCAGTATGGCTTTTTGAATCTCAGTGGCAAGCCTACCCATCGCACCATGATGTACAATTTTCCTACTAAAGCAGGACAGTGTGGAGGAGTGGTGACATCTGTTGGGAAGGTTGTCGGTATTCACATTGGTGGCAATGGCAGACAAGGTTTTTGCGCAGGCCTCAAAAGGAGTTACTTTGCTAGTGAACAAGGAGAGATCCAGTGGGTTAAGCCCAATAAAGAAAcTggAAGACTCAACATCAATGGACCAACCCGCACCAAGTTAGAACCTAGTGTATTCCATGACATCTTCGAGGGAAATAAGGAACCAGCTGTCTTGCACAGTAAAGACCCCCGACTTGAGGTAGATTTTGAACAGGCCCTGTTCTCTAAGTATGTGGGAAACACACTACATGAGCCTGACGAGTACATCAAAGAGGCAGCTCTACATTATGCAAACCAATTAAAGCAACTAGAAATCAATACCTCTCAAATGAGCATGGAGGAGGCCTGCTATGGTACTGAGAATCTTGAGGCTATTGATCTTCACACTAGTGCAGGTTACCCCTATAGTGCCCTAGGGATAAAGAAAAGAGACATCTTAGACCCTACCACCAGGGACGTGAGTAGAATGAAGTTCTACATGGACAAGTATGGTCTTGATCTTCCCTACTCCACTTATGTCAAGGACGAGCTACGCTCGATTGATAAAATCAAGAAAGGGAAGTCCCGCCTGATCGAGGCCAGTAGTCTAAATGATTCAGTGTACCTCAGAATGGCTTTCGGGCATTTGTATGAGGCTTTCCACGCAAATCCTGGGACGATAACTGGATCGGCCGTGGGGTGTAACCCTGACACATTCTGGAGCAAGCTGCCAATTTTGCTCCCTGGTTCACTCTTTGCCTTTGACTACTCAGGCTATGATGCCAGCCTTAGCCCTGTCTGGTTCAGAGCATTAGAATTGGTTCTTAGGGAGATAGGGTATAGTGAAGAGGCAATCTCACTCATTGAGGGAATCAACCACACACATCATGTGTATCGTAATAAGACCTATTGCGTGCTTGGTGGGATGCCCTCAGGCTGTTCAGGAACATCCATCTTCAACTCAATGATCAACAACATTATTATCAGAGCACTGCTCATAAAAACATTTAAGGGCATTGATTTGGATGAACTCAACATGGTCGCTTATGGAGACGATGTGCTCGCTAGCTATCCCTTCCCAATTGATTGCTTGGAACTAGCAAAGACTGGTAAGGAGTATGGTCTGACCATGACCCCTGCTGATAAATCTCCTTGCTTTAATGAGGTCAATTGGGGTAATGCGACCTTCCTCAAAAGGGGCTTTTTGCCCGATGAACAGTTTCCATTTTTGATTCACCCTACTATGCCAATGAGGGAGATCCATGAGTCCATTCGATGGACCAAGGACGCACGGAACACTCAAGATCATGTGCGGTCCTTGTGCCTCCTAGCATGGCATAATGGTAAGCAAGAATACGAGAAGTTTGTGAGCACAATTAGGTCTGTCCCAGTAGGGAGAGCGTTGGCTATTCCAAATTATGAAAATCTTAGACGAAATTGGCTCGAGTTATTTTAGAGGTTATACACACCTCAACCCCACCAGAAATCTGGTCGTGAATGTGACTGGTGGGGGTAAATTTGTTATAACCAGAATAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAaagcttat
GCTAGCGGAGTGTATACTGGCTTACTATGTTGGCACTGATGAGGGTGTCAGTGAAGTGCTTCATGTGGCAGGAGAAAAAAGGCTGCACCGGTGCGTCAGCAGAATATGTGATACAGGATATATTCCGCTTCCTCGCTCACTGACTCGCTACGCTCGGTCGTTCGACTGCGGCGAGCGGAAATGGCTTACGAACGGGGCGGAGATTTCCTGGAAGATGCCAGGAAGATACTTAACAGGGAAGTGAGAGGGCCGCGGCAAAGCCGTTTTTCCATAGGCTCCGCCCCCCTGACAAGCATCACGAAATCTGACGCTCAAATCAGTGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCTGGCGGCTCCCTCGTGCGCTCTCCTGTTCCTGCCTTTCGGTTTACCGGTGTCATTCCGCTGTTATGGCCGCGTTTGTCTCATTCCACGCCTGACACTCAGTTCCGGGTAGGCAGTTCGCTCCAAGCTGGACTGTATGCACGAACCCCCCGTTCAGTCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGAAAGACATGCAAAAGCACCACTGGCAGCAGCCACTGGTAATTGATTTAGAGGAGTTAGTCTTGAAGTCATGCGCCGGTTAAGGCTAAACTGAAAGGACAAGTTTTGGTGACTGCGCTCCTCCAAGCCAGTTACCTCGGTTCAAAGAGTTGGTAGCTCAGAGAACCTTCGAAAAACCGCCCTGCAAGGCGGTTTTTTCGTTTTCAGAGCAAGAGATTACGCGCAGACCAAAACGATCTCAAGAAGATCATCTTATTAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTGTCGACGCGGCCGCTAATACGACTCACTATAGGTTAAAACAGCCTGTGGGTTGCACCCACTCACAGGGCCTACTGGGCGCAAGCACTCTGGTACCTCGGTACCTTTGTGCGCCTGTTTTACACCCCCCCCCCAATGAAACTTAGAAGCAATAAACCACGATCAATAGCAGGCATAACGCTCCAGTTATGTCTTGATCAAGCACTTCTGTTTCCCCGGACTGAGTATCAATAGACTGCTCGCGCGGTTGAAGGAGAAAACGTTCGTTATCCGGCTAACTACTTCGGAAAACCTAGTAACACCATGAAAGTTGCGGAGAGCTTCGTTCAGCACTCCCCCAGTGTAGATCAGGTCGATGAGTCACCGCGTTCCCCACGGGCGACCGTGGCGGTGGCTGCGTTGGCGGCCTGCCCATGGGGTAACCCATGGGGCGCTCTAATACGGACATGGTGTGAAGAGTCTACTGAGCTAGTTGGTAGTCCTCCGGCCCCTGAATGCGGCTAATCCCAACTGCGGAGCACACGCCCACAAGCCAGCGGGTAGTGTGTCGTAACGGGTAACTCTGCAGCGGAACCGACTACTTTGGGTGTCCGTGTTTCCTTTTATCTTTATATTGGCTGCTTATGGTGACAATTAAAGAATTGTTACCATATAGCTATTGGATTAGCCATCCGGTGTGCAACAGAGCAATTATTTACCTATTTATTGGTTTTGTACCATTAACCTCGAATTCTGTGACCACCCTTAATTATATCTTGACCCTTAACACAGCTAAACcatatgATGgtgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggacggcgacgtaaacggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctacggcaagctgaccctgaagttcatctgcaccaccggcaagctgcccgtgccctggcccaccctcgtgaccaccctgacctacggcgtgcagtgcttcagccgctaccccgaccacatgaagcagcacgacttcttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttcttcaaggacgacggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcatcgagctgaagggcatcgacttcaaggaggacggcaacatcctggggcacaagctggagtacaactacaacagccacaacgtctatatcatggccgacaagcagaagaacggcatcaaggtgaacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgccgaccactaccagcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccactacctgagcacccagtccgccctgagcaaagaccccaacgagaagcgcgatcacatggtcctgctggagttcgtgaccgccgccgggatcactctcggcatggacgagctgtacaagatgcatGCGATCACCACTCTTGGTTCGCAAGTGTCTACACAGCGCTCCGGTTCTTACGAAAACTCAAACTCAGCCACTGAGGGTTCTACCATAAACTACACCACCATTAATTACTACAAAGACTCCTATGCTGCCACAGCAGGCAAaCAGAGTCTCAAGCAGGATCCAGACAAGTTTGCAAATCCTGTTAAAGACATATTCACcGAAATGGCAGCGCCACTGAAGTCCCCATCCGCTGAGGCATGTGGATACAGTGATCGAGTGGCGCAATTAACTATTGGCAACTCCACCATCACGACGCAAGAAGCGGCTAACATCATAGTCGGCTATGGTGAGTGGCCTTCCTACTGCTCAGATTCTGACGCTACAGCAGTGGATAAACCAACGCGCCCGGATGTTTCAGTGAACAGGTTTTACACATTGGACACTAAATTGTGGGAGAAATCGTCCAAGGGATGGTACTGGAAGTTCCCGGATGTGTTAACTGAAACTGGGGTTTTTGGGCAAAATGCACAATTCCACTACCTCTACCGATCAGGGTTCTGCATCCACGTGCAGTGCAATGCCAGTAAATTCCACCAAGGAgCACTcCtAgTCGCTGTCCTACCAGAGTATGTCATTGGGACAGTGGCAGGCGGTACAGGGACGGAAGACACCCACCCCCCCTACAAGCAGACCCAACCCGGCGCCGATGGTTTCGAGTTGCAACACCCGTACGTGCTTGATGCTGGCATCCCAATATCACAGTTAACAGTGTGCCCACACCAGTGGATTAATTTGAGGACCAACAATTGTGCTACAATAATAGTGCCATACATTAACGCACTGCCTTTTGATTCTGCCTTGAACCATTGCAACTTTGGCCTGTTAGTTGTGCCTATTAGCCCACTAGACTACGACCAAGGAGCAACGCCAGTAATCCCTATAACTATCACATTGGCCCCAATGTGCTCTGAATTCGCAGGTCTTAGGCAGGCAGTCACGCAAGGGTTCCCCACCGAGCTAAAACCTGGCACAAATCAATTTTTAACCACCGATGATGGCGTCTCAGCACCTATTCTACCAAACTTCCACCCCACCCCGTGTATCCACATACCTGGTGAAGTTAGGAACTTGCTAGAGTTATGCCAGGTGGAGACCATTCTGGAGGTTAACAATGTGCCCACGAATGCCACTAGCTTAATGGAGAGACTGCGCTTCCCGGTCTCAGCACAAGCAGGGAAAGGTGAACTGTGTGCGGTGTTTAGAGCCGATCCTGGGCGAAATGGACCATGGCAATCCACCTTACTGGGCCAGTTGTGCGGGTACTACACCCAATGGTCAGGGTCATTGGAAGTCACCTTCATGTTTACTGGATCCTTCATGGCTACCGGCAAGATGCTCATAGCCTATACACCGCCAGGGGGTCCTCTGCCCAAGGACCGGGCGACCGCCATGTTGGGCACGCACGTCATCTGGGATTTTGGGCTGCAATCGTCTGTTACCCTTGTAATACCATGGATCAGTAACACTCATTATAGAGCACATGCCCGAGATGGAGTGTTTGACTATTACACTACAGGGTTAGTCAGTATATGGTACCAGACAAATTACGTGGTTCCAATCGGTGCGCCCAACACAGCCTATATAATAGCACTAGCGGCAGCCCAAAAGAACTTCACTATGAAATTGTGCAAGGATGCTAGTGATATCCTGCAGACGGGCACCATCCAGGGAGATAGGGTGGCAGATGTAATTGAAAGTTCCATAGGAGATAGCGTGAGCAGAGCCCTCACTCACGCTCTACCAGCACCCACAGGCCAAAACACACAGGTGAGCAGTCATCGACTGGATACAGGCAAGGTTCCAGCACTCCAAGCTGCTGAAATTGGGGCATCATCAAATGCTAGTGACGAGAGCATGATTGAAACACGTTGTGTTCTTAACTCGCATAGTACAGCTGAGACCACTCTTGATAGTTTCTTCAGTAGGGCAGGATTAGTTGGAGAGATAGATCTCCCTCTTGAGGGCACAACTAACCCAAATGGTTATGCCAACTGGGACATAGATATAACAGGTTACGCGCAAATGCGTAGAAAGGTAGAGCTATTCACCTACATGCGTTTTGATGCAGAGTTCACTTTTGTTGCGTGCACACCCACCGGGGAGGTTGTCCCACAATTGCTCCAATATATGTTTGTGCCACCTGGAGCCCCTAAGCCAGATTCTAGGGAATCCCTTGCATGGCAAACCGCCACCAACCCCTCAGTTTTTGTCAAGCTGTCAGACCCTCCGGCGCAGGTTTCAGTGCCATTCATGTCACCTGCGAGTGCTTATCAATGGTTTTATGACGGATATCCCACATTCGGAGAACACAAACAGGAGAAAGACCTTGAATACGGGGCATGTCCTAATAACATGATGGGTACATTCTCAGTGCGGACTGTGGGGACCTCCAAGTCCAAGTACCCTTTAGTGGTTAGGATTTACATGAGAATGAAGCACGTCAGGGCGTGGATACCTCGCCCGATGCGCAACCAGAACTACCTGTTCAAAGCCAACCCAAATTATGCTGGCAACTCTATTAAGCCAACTGGTGCCAGTCGCACAGCGATCACCACTCTTGGGAAATTTGGACAACAGTCTGGGGCTATTTATGTGGGCAACTTTAGAGTGGTCAACCGACATCTTGCCACCCATAATGATTGGGCAAATCTTGTTTGGGAAGACAGCTCTCGCGACTTGCTCGTGTCATCCACCACTGCCCAAGGTTGTGACACGATTGCCCGTTGCGATTGCCAGACAGGGGTGTACTACTGTAACTCGATGAGAAAACACTACCCAGTCAGTTTTTCAAAACCCAGCCTGATCTATGTAGAGGCTAGCGAGTATTACCCAGCCAGGTACCAATCACATCTCATGCTCGCACAGGGTCACTCGGAACCTGGTGATTGCGGTGGTATCCTTAGGTGCCAACATGGCGTCATCGGCATAGTGTCTACTGGTGGCAATGGGCTCGTTGGCTTTGCAGACGTCAGAGACCTCTTGTGGTTAGATGAAGAAGCTATGGAACAGGGCGTGTCCGACTACATTAAGGGTCTCGGAGATGCTTTTGGAACAGGCTTCACTGACGCAGTCTCAAGGGAGGTTGAAGCTCTCAAGAACTATCTTATAGGGTCTGAAGGAGCAGTTGAGAAAATTTTGAAAAATCTTATTAAACTAATCTCTGCACTGGTGATTGTGATCAGAAGTGATTACGACATGGTTACCCTCACTGCAACCTTAGCGCTGATAGGTTGTCATGGCAGTCCTTGGGCTTGGATTAAAGCCAAAACAGCCTCCATCTTAGGTATCCCTATCGCCCAAAAGCAGAGCGCTTCCTGGCTCAAGAAGTTCAATGACATGGCCAACGCCGCTAAGGGGTTAGAGTGGGTTTCCAACAAGATCAGCAAATTTATTGATTGGCTTAAGGAGAAAATAGTACCAGCAGCCAGGGAGAAGGTTGAATTCCTAAATAACTTGAAACAGCTGCCACTGCTAGAGAATCAGATCTCGAACTTGGAACAATCTGCTGCTTCACAAGAGGACCTTGAAGTCATGTTTGGGAATGTGTCGTACCTAGCTCACTTCTGTCGCAAGTTTCAACCGCTATACGCCACGGAAGCTAAAAGAGTCTATGCCCTGGAGAAGAGAATGAATAACTATATGCAGTTCAAGAGCAAACACCGAATTGAACCTGTATGTCTCATTATTAGGGGCTCACCAGGCACCGGGAAGTCTCTAGCCACTGGTATTATTGCTCGAGCAATCGCTGATAAGTACCACTCCAGCGTGTACTCGCTCCCACCAGACCCGGATCATTTTGACGGTTACAAGCAACAGGTGGTTACAGTGATGGATGATTTGTGTCAAAACCCCGATGGTAAGGATATGTCCTTATTCTGTCAAATGGTATCCACCGTAGATTTCATTCCACCAATGGCTTCTCTCGAGGAGAAGGGAGTTTCCTTCACCTCTAAGTTTGTCATCGCATCCACTAATGCCAGTAATATCATAGTACCAACAGTGTCTGATTCTGACGCTATTCGCCGCAGGTTCTACATGGACTGTGACATTGAAGTGACAGACTCGTACAAAACAGATCTAGGTAGACTGGATGCAGGGCGAGCCGCTAAACTGTGTTCTGAAAATAACACTGCAAATTTCAAACGTTGCAGCCCATTAGTGTGTGGGAAAGCCATCCAACTTAGAGATAGAAAGTCTAAAGTCAGATACAGTGTGGATACGGTGGTTTCAGAACTTATTAGGGAATACAGCAATAGGTCCGCCATTGGTAACACAATCGAGGCTCTTTTCCAAGGTCCACCCAAGTTCAGGCCAATTAGGATTAGCCTTGAAGAAAAACCAGCCCCAGACGCTATTAGCGATCTCCTTGCTAGTGTAGATAGTGAAGAAGTGCGCCAGTACTGCAGGGATCAAGGCTGGATTATTCCTGAAGCTCCCACCAATGTGGAGCGGCACCTTAATAGAGCGGTGCTCGTCATGCAATCCATCACCACAGTAGTGGCGGTTGTTTCGTTGGTGTACGTCATCTACAAGCTCTTTGCAGGGTTTCAGGGTGCATATTCTGGTGCTCCTAAGCAAGTGCTTAAGAAACCTGCTCTTCGCACAGCAACAGTGCAGGGTCCGAGCCTTGACTTTGCTCTCTCCCTACTGAGAAGGAACATCAGGCAGGTCCAAACAGACCAAGGGCATTTCACCATGTTGGGTGTTAGGGATCGCTTAGCAGTCCTCCCACGCCACTCACAACCTGGCAAAACCATTTGGATTGAGCACAAACTCGTGAACGTCCTTGATGCAGTTGAACTGGTGGATGAGCAAGGAGTCAACCTGGAATTAACCCTCATCACTCTTGACACCAACGAGAAGTTTAGGGATATCACCAAATTCATCCCAGAAAATATCAGCACTGCTAGCGATGCCACCCTAGTGATCAACACGGAGCACATGCCGTCAATGTTTGTCCCGGTGGGTGACGTTGTGCAGTATGGCTTTTTGAATCTCAGTGGCAAGCCTACCCATCGCACCATGATGTACAATTTTCCTACTAAAGCAGGACAGTGTGGAGGAGTGGTGACATCTGTTGGGAAGGTTGTCGGTATTCACATTGGTGGCAATGGCAGACAAGGTTTTTGCGCAGGCCTCAAAAGGAGTTACTTTGCTAGTGAACAAGGAGAGATCCAGTGGGTTAAGCCCAATAAAGAAAcTggAAGACTCAACATCAATGGACCAACCCGCACCAAGTTAGAACCTAGTGTATTCCATGACATCTTCGAGGGAAATAAGGAACCAGCTGTCTTGCACAGTAAAGACCCCCGACTTGAGGTAGATTTTGAACAGGCCCTGTTCTCTAAGTATGTGGGAAACACACTACATGAGCCTGACGAGTACATCAAAGAGGCAGCTCTACATTATGCAAACCAATTAAAGCAACTAGAAATCAATACCTCTCAAATGAGCATGGAGGAGGCCTGCTATGGTACTGAGAATCTTGAGGCTATTGATCTTCACACTAGTGCAGGTTACCCCTATAGTGCCCTAGGGATAAAGAAAAGAGACATCTTAGACCCTACCACCAGGGACGTGAGTAGAATGAAGTTCTACATGGACAAGTATGGTCTTGATCTTCCCTACTCCACTTATGTCAAGGACGAGCTACGCTCGATTGATAAAATCAAGAAAGGGAAGTCCCGCCTGATCGAGGCCAGTAGTCTAAATGATTCAGTGTACCTCAGAATGGCTTTCGGGCATTTGTATGAGGCTTTCCACGCAAATCCTGGGACGATAACTGGATCGGCCGTGGGGTGTAACCCTGACACATTCTGGAGCAAGCTGCCAATTTTGCTCCCTGGTTCACTCTTTGCCTTTGACTACTCAGGCTATGATGCCAGCCTTAGCCCTGTCTGGTTCAGAGCATTAGAATTGGTTCTTAGGGAGATAGGGTATAGTGAAGAGGCAATCTCACTCATTGAGGGAATCAACCACACACATCATGTGTATCGTAATAAGACCTATTGCGTGCTTGGTGGGATGCCCTCAGGCTGTTCAGGAACATCCATCTTCAACTCAATGATCAACAACATTATTATCAGAGCACTGCTCATAAAAACATTTAAGGGCATTGATTTGGATGAACTCAACATGGTCGCTTATGGAGACGATGTGCTCGCTAGCTATCCCTTCCCAATTGATTGCTTGGAACTAGCAAAGACTGGTAAGGAGTATGGTCTGACCATGACCCCTGCTGATAAATCTCCTTGCTTTAATGAGGTCAATTGGGGTAATGCGACCTTCCTCAAAAGGGGCTTTTTGCCCGATGAACAGTTTCCATTTTTGATTCACCCTACTATGCCAATGAGGGAGATCCATGAGTCCATTCGATGGACCAAGGACGCACGGAACACTCAAGATCATGTGCGGTCCTTGTGCCTCCTAGCATGGCATAATGGTAAGCAAGAATACGAGAAGTTTGTGAGCACAATTAGGTCTGTCCCAGTAGGGAGAGCGTTGGCTATTCCAAATTATGAAAATCTTAGACGAAATTGGCTCGAGTTATTTTAGAGGTTATACACACCTCAACCCCACCAGAAATCTGGTCGTGAATGTGACTGGTGGGGGTAAATTTGTTATAACCAGAATAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAaagcttat。
the present invention also includes expression products of these cDNA clones.
The present invention also includes double-stranded DNA containing the above cDNA, double-stranded DNA (double stranded DNA) capable of generating a full-length infectious clone sequence, positive-sense cDNA (positive-sense cDNA) or negative-sense cDNA (negative-sense cDNA).
The present invention also includes plasmids containing the above-mentioned cDNA or double-stranded DNA.
Preferably, the plasmid can be used for transcribing the full-length infectious RNA of the EV71 strain or a mutant thereof, can be used for generating the plasmid containing the full-length infectious RNA of the full-length EV71 strain (js1) through in vitro transcription, and can be used for generating the plasmid containing the full-length infectious RNA of the full-length EV71 strain (js1) and a derivative plasmid.
Wherein the derivative plasmid comprises:
A. a recombinant virus clone obtained by replacing a partial sequence of the full-length infectious clone of the EV71 strain (js1) of claim 1 with a partial sequence of another isolate (isolates);
B. a mutant virus clone obtained by mutating a sequence in a full-length infection clone of the EV71 strain (js1) according to claim 1 or 2 by using gene mutation;
C. the virus generated by the full-length infection clone of the EV71 strain (js1) is subjected to attenuation (live-attenuated) generated by adaptive mutation, replication non-infection virus (replication reactive non-infections) and non-replication virus (destructive viruses) and other derivative clones.
The present invention provides a plasmid containing the above double-stranded DNA or a derivative thereof.
Preferably, it is capable of transcribing to produce a full-length infectious RNA of the EV71 strain or a mutant thereof.
The present invention also provides a vaccine or viral vector prepared according to the above plasmid;
the present invention provides a viral particle prepared from the above cDNA clone or plasmid;
for example, attenuated (live-infected) virus particles, non-infectious replicating virus (replication competent non-infectious) particles, and non-replicating virus (destructive viruses) particles;
the virus can be separated, purified and subjected to an anti-EV 71 virus antibody by an immune animal method, and can also be used for screening a human antibody library; alternatively, it can be used to prepare the reagent kit for detecting EV71 virus and various cell lines, tissues and animal infection models.
The cell line, tissue and animal infection model can be used for screening the anti-EV 71 virus drugs.
The invention also includes the detection method and preparation method of the virus vector and virus particle;
for example, animals are immunized with the virus particles and antibodies are isolated, or a human antibody library is screened.
In another aspect, the invention provides a kit for detecting EV71, which contains the above-mentioned cDNA or viral particle.
The invention also provides a preparation method of the anti-virus EV71 drug, which uses the cDNA or the virus particles to construct cells or animal models for screening the anti-virus EV71 drug; alternatively, the cDNA or the virus particles are used for constructing cells or animal models for screening the medicine for resisting the virus EV 71.
The infectious clone (nucleic acid sequence 1) of the invention is a complete plasmid (plasmid) consisting of a DNA sequence. The recombinant plasmid contains a nucleic acid sequence (nucleic acid sequence 2) of a full-length EV71 strain (js1) and a low-copy plasmid backbone sequence (nucleic acid sequence 3). Plasmids (plasmids) are closed double-stranded DNA (double stranded DNA) bound by covalent bonds. It contains a sense strand (positive-sense strand) that is identical to the mRNA sequence and a complementary antisense or negative strand (negative-sense strand).
The full-length nucleic acid sequence (nucleic acid sequence 2) of the EV71 strain (js1) contained in the infectious clone (nucleic acid sequence 1) of the present invention includes a non-translated region (NTR) at the 5' end of the positive strand (positive sense) sequence of the virus, an Open Reading Frame (ORF) and a 3' end untranslated region (3 ' -NTR). In this infectious clone, the 5' end of the full-length viral nucleic acid sequence contains a T7 promoter (TAA TAC GAC TCA CTA TAG G, SEQ ID NO 7) (FIG. 1A), and the full-length viral RNA can be transcribed in vitro by a commercial T7 transcription kit; the 3' end of the full-length viral nucleic acid sequence contained a 30 nucleotide polyA tail (AAAAA AAAAA AAAAA AAAAA AAAAA AAAAA, SEQ ID NO 8) (fig. 1A).
Clinically separating strain to infect RD cells, extracting total RNA of the cells when the cells have cytopathy, and utilizing EV71 specific primer (GCT)AG CGCTtt tttttttt tttttttt ttttttt ttttt, SEQ ID NO 9), and then PCR-amplified using cDNA obtained by reverse transcription (Superscript II reverse transcriptase). Full-length EV71 gene component 4-segment amplification (FIG. 1A), wherein the amplification primer is F1(S: GAC)GC GGCCG CTAA TAC GACTC ACTATAG GTTAAA ACAGC CTGT GGGT TGCAC CC,SEQ ID NO 10;As:GCACTG CACGT GGATGC AGAAC,SEQ ID NO 11),F2(S:GACGCG GCCGCG TTCT GCAT CCAC GTGCA GTGC,SEQ ID NO 12;As:AAGTC GCGA GAGCT GTCTTC CC,SEQ ID NO 13),F3(S:GACGCG GCCGCG GGAA GACAG CTCTCG CGACTT,SEQ ID NO 14;As:AATTG TACAT CATG GTGC GATGG GTAGG,SEQ ID NO 15),F4(S:GACGC GGCCGCCCTAC CCATCG CACCATG ATGTAC AATT,SEQ ID NO 16;As:GCTAGC GCTtttttttt tttttttt tttttttt ttttttGCT ATTCT GGTTAT AACAA ATTTA CCCCCA CCAG, SEQ ID NO 17), cloning the amplified fragment into pANCR vector by using a step-by-step cloning method to obtain a final full-length cDNA clone which is named as pEV71-js1 (FIG. 1A),
this infectious clone was linearized in vitro with HindIII and transcribed to give a polyA tail containing the full length RNA of the virus and its 3' end using the T7 transcription kit. After the virus RNA generated in vitro is introduced into a host cell such as a Vero cell by an electrotransfer or transfection method, the RNA of the virus is used as a translation template to translate ORF of the RNA, and virus polypeptide (protein sequence 4) is generated; the virus polypeptide is processed to form virus structural protein and non-structural protein, and the whole virus life cycle is started to produce progeny virus.
Due to the encoded degeneracy, the same functional protein product can still be obtained by changing the codon without changing the protein sequence; the invention includes other nucleic acid sequences and infectious clones encoding the same as "protein sequence 4".
The virus produced by the infectious clone (nucleic acid sequence 1) of the invention shows strong replication capacity in cells (figure 2), and can be used for infecting cell lines cultured in vitro, nervous tissues, mice (figure 6), monkeys and the like to establish cell models infected by the virus and animal infection models for drug development.
By engineering this infectious clone (nucleic acid sequence 1), a reporter gene is inserted in a specific region of the virus (preceding the VP4 protein coding region), the reporter gene being initiated by translation of the viral IRES. An extra amino acid site (AITTL) was added to the C-terminus of the reporter (FIG. 1B), which is recognized and cleaved by the viral 3C protease, yielding the normal N-terminus of VP 4. The reporter genes, namely, NanoLuc (Nluc) and EGFP (fluorescent protein), were successfully inserted into the infectious clone (nucleic acid sequence 1) to construct an infectious clone with Nluc (nucleic acid sequence 5) and an infectious clone with EGFP (nucleic acid sequence 6), respectively (FIG. 1B). The reporter genes were ligated into pEV71-js1 plasmid (SEQ ID NO: 1) by fusion PCR, designated pEV71-js1-Nluc (SEQ ID NO: 5) (FIG. 1B) and pEV71-js1-EGFP (SEQ ID NO: 6) (FIG. 1C), respectively; the infectious clone was as above, after in vitro HindIII linearization, transcribed viral full-length RNA from T7 transcription kit, and the in vitro transcribed viral RNA was introduced into host cells such as Vero cells by electrotransfer or transfection, and the viral life cycle was initiated to generate progeny virus (FIG. 2). The virus expresses reporter genes Nluc and EGFP in the replication process. Nluc can be detected using a commercial luciferase activity detection kit (fig. 5). The expression of EGFP can be observed with a fluorescence microscope (fig. 4) or detected with a flow cytometer. The resulting progeny virus containing the reporter gene segment re-infects new cells, where it can replicate efficiently. The reporter gene, being in the same open reading frame as the viral protein, is expressed at a level that is responsive to the viral protein level and also to the viral replication level. And the recombinant virus containing the reporter gene continued to pass the reporter gene without loss for a considerable period of time (FIG. 4). The recombinant virus containing the reporter gene can be used for quickly and conveniently detecting the virus replication and packaging level, and can be used for researching the life cycle of the virus, the virus-host interaction, the virus immunology, the development of antiviral drugs and the like.
This infectious clone (nucleic acid sequence 1) is modified so that replication non-infectious virus (replication competent non-infectious virus) such as subgenomic replicon (subgenomic replication) capable of viral gene replication can be constructed by referring to the region of structural proteins VP4-VP3-VP2-VP1 of other viruses of the Enterovirus genus, for example, a knockout virus, but the progeny virus cannot be packaged due to the absence of structural proteins of the virus. Meanwhile, the subgenomic replicon RNA can be trans-complemented (trans-complete) by the expressed structural protein to be packaged into Recombinant Subviral Particles (RSPs) (Barclay, et al.J. Gen Virol.1998,79: 1725-1734; Jia, et al.J. Virol.1998,72:7972-7977), and the subviral particles can be subjected to one-round infection, but can not be packaged again after infection because the genome does not encode the structural protein, so that the subviral particles are non-replicative virus (destructive viruses) particles. These non-replicating viral particles may be used as a form of vaccine.
The infectious clone (nucleic acid sequence 1) is modified to form attenuated virus, and the attenuated virus can be used as vaccine. An attenuated vaccine can be constructed by mutating the 5' NTR with reference to the attenuation strategy of Polio viruses, which are also the family picornaviridae (Arita, et al.J Virol.2008,82: 1787-1797).
The method comprises the steps of infecting a mouse with virus generated by infectious clone, establishing a convenient and stable animal infection model, finding that the lethality of the initially separated strain is reduced after infecting a newborn mouse after more than 3 passages, finding that mutation from E to G occurs at the 145 th position of VP1 of the passaged virus through sequencing, and therefore, the consistency of the mouse model established by infecting the separated virus is poor, and generating the virus by utilizing the infectious clone can ensure that a virus sequence is not influenced by cell passage. The virus obtained by the infectious clone can be used for infecting newborn mice of different strains (ICR, Balb/C, C57) to obtain 100% mortality within 9 days (figure 6B), but the virus carrying the 145G mutation of VP1 can not cause the death of the mice after infecting the mice (figure 6C), and the animal model can be conveniently used for antiviral drug vaccine evaluation and the like.
Novel Enterovirus 71 (Human enterovirus type71, EV71) belongs to the family Microviridae in the family Microviridae (picornaviridae) Is a member of the enterovirus group (enterovirus) of (1). EV71is the major causative agent of hand-foot-and-mouth disease in children worldwide, which can cause children to suffer from mild and severe hand-foot-and-mouth disease. The virus can infect and cause damage to the central nervous system, but the mechanism is unknown. There is currently no effective antiviral drug for the treatment of EV 71. The invention constructs the full-length cDNA clone of the stable virus by separating a clinical EV71 strain and utilizing molecular cloning, and the virus RNA from the cDNA clone can generate the EV71 virus through in vitro transcription RNA and transfection of Vero cells; further, the recombinant virus containing the reporter genes Gluc (Gaussia luciferase) and EGFP is constructed, and the recombinant virus containing the reporter genes Gluc and EGFP is proved to have the capacity of infecting host cells and causing cytopathic effect,the EV71 virus from infectious clone is used to infect the immune-competent ICR, Bab/C and C57 suckling mice, and 100 percent of infected mice die due to nerve injury symptoms within 10 days.
The invention provides a stable infectious cDNA clone based on an EV71 strain which is clinically separated, derivative clones containing various reporter genes, and various mutant clones constructed by taking the infectious cDNA clone as a female parent; and various recombinant viruses, subunit viral particles produced using these clones; and animal models established by infecting animals with various recombinant viruses produced by the clones; and the use of these viral or subunit viral particles for vaccine development and diagnostic reagents; and the use of the virus as a gene therapy vector or an expression vector.
The invention has the following advantages:
the present invention includes various recombinant virus and subunit virus particle plasmids constructed through molecular biology with these cloned plasmids as female parent.
The invention also includes various recombinant viruses, subunit viral particles that can be produced using these clones; which contains the above cDNA.
The invention also includes animal infection models constructed using various recombinant viruses that these clones can produce.
The invention also includes the use of these viral or subunit viral particles and animal models for vaccine development and diagnostic reagents.
The invention also includes the use of these viruses or subunit viral particles to establish animal models for vaccine development and antiviral drug development.
The invention also includes the use of the viral or subviral unit plasmids as gene therapy vectors or expression vector plasmids and the viral or subviral particles produced using these plasmids.
The invention provides a new tool and a new way for detection, prevention and immunization of EV71 virus infection, and provides possibility for gene therapy and vaccine development by using the EV71 strain infectious clone as a virus vector.
Drawings
FIG. 1 construction of an infectious cDNA clone of EV71 strain js1, in which,
(A) infectious clone construction strategy; zika virus complete genome pattern diagram, two ends of the black column respectively represent 5 '-NTR and 3' -NTR; the virus structural protein region and the non-structural protein region are shown in the figure; the full-length viral sequence is divided into 4 segments for amplification, wherein the first segment F1 contains a T7 sequence, and the fourth segment F4 contains polyA introduced by PCR primers30A sequence; the synthesized sequence is sequentially connected and connected into a pACNR vector through restriction endonucleases according to the figure to obtain full-length clone; (B) by fusion PCR, Nluc or EGFP gene is fused in-frame at the N-terminal of VP4, and an additional amino acid sequence AITTL is added at the C-terminal of the Nluc or EGFP gene, so that the N-terminal of VP4 can be conveniently cut by viral protease.
FIG. 2 infectious cDNA clone of EV71 strain js1 shows the replication and infection capabilities of the virus,
infectious Clone plasmid as template, through in vitro transcription into virus RNA, viral RNA through electrotransfer into Vero cell, collect supernatant virus, in Vero cell using plaque experiment to perform titer titration, infectious Clone produced (Clone-WT) plaque and Parent virus (Parent) produced by the comparison of the plaque as shown in figure (upper), infectious Clone produced virus and Parent virus again infected Vero cell (MOI 0.1), after infection, collecting different time (h.p.i) cell supernatant, using plaque experiment to titrate, get the two growth curves as shown in figure (lower), virus titer is expressed by PFU/ml.
FIG. 3 production of recombinant virus containing reporter genes Nluc and EGFP, wherein,
(A) the infectious clone plasmid containing the reporter genes Nluc and EGFP is transcribed into virus RNA in vitro with the infectious clone plasmid without the reporter genes, the virus RNA is introduced into Vero cells through electrotransfer, supernatant viruses are collected, titer titration is carried out on the Vero cells by utilizing a plaque experiment, and plaques generated by recombinant viruses containing the reporter genes are compared with plaques generated by viruses without the reporter genes; (B) infecting Vero cells again with the same titer of virus containing each reporter gene and the virus without the reporter gene (MOI ═ 0.1), collecting supernatants at different days after infection, titrating the supernatants by using a plaque experiment to obtain a growth curve; viral titers were expressed as PFU/ml.
FIG. 4 stability of recombinant viruses containing EGFP reporter gene, wherein,
diluting cell supernatant infected by the recombinant virus EV71-EGFP at a ratio of 1:10, re-infecting new Vero cells, observing the cells with a fluorescence microscope after two days of infection, collecting supernatant, re-infecting new Vero cells (C +1) at a ratio of 1:10, observing the cells with the fluorescence microscope after two days of infection, collecting supernatant, and re-infecting the cells; and (5) sequentially carrying out passage infection, and observing the expression condition of the EGFP in the infected cells.
FIG. 5 Nluc-producing Activity of recombinant viruses containing Nluc reporter gene, wherein,
the infectious clone plasmid containing the reporter gene Nluc and the plasmid containing VP 1E 145G and 3C C147A mutations are transcribed into virus RNA in vitro, the virus RNA is introduced into Vero cells through electrotransfer, the cells are collected at different time points after the electrotransfer, the activity of Nluc in the cells is detected, and C147A is 3C protease activity deletion mutation.
FIG. 6 infectious cDNA clone of EV71 strain js1 producing virus-infected mice an animal infection model was constructed in which,
(A) virus generated by infectious cDNA clones infected different strains of 3-day-old fetal mice (1.4X 10)4pfu/mouse), observed 5 days after infection. (B) Survival curves of mice after viral infection (n ═ 5/group). (C) Infectious cDNA clone (WT) and clone carrying the VP 1E 145G mutation gave a virus that infected 3-day-old ICR mice with a survival curve (n 5/group).
Detailed Description
The methods used in the present invention are all conventional molecular biology methods, and the details of the operation are not repeated.
Example 1 construction of an infectious cDNA clone of EV71 strain js1
As shown in FIG. 1A, the virus isolated from the stool specimen was cultured in RD cells, and when the cells showed significant cytopathic effects, total cellular RNA was extracted and reverse transcribed using superscript II (Invitrogen)Enzyme, with sequence specific primer (GCT)AG CGCTttt tttttttttttt tttttttttt ttttt) and performing reverse transcription, using the obtained cDNA as a template, performing PCR amplification in 4 sections by using high fidelity enzyme super Fi (Invitrogen), wherein the amplification primer is F1(S: GAC)GC GGCCG CTAA TAC GACTC ACTATAG GTTAAA ACAGC CTGT GGGT TGCAC CC;As:GCACTG CACGT GGATGC AGAAC),F2(S:GACGCG GCCGCG TTCT GCAT CCAC GTGCA GTGC;As:AAGTC GCGA GAGCT GTCTTC CC),F3(S:GACGCG GCCGCG GGAA GACAG CTCTCG CGACTT;As:AATTG TACAT CATG GTGC GATGG GTAGG),F4(S:GACGC GGCCGCCCTAC CCATCG CACCATG ATGTAC AATT;As:GCTAGC GCTtttttttt tttttttt tttttttt ttttttGCT ATTCT GGTTAT AACAA ATTTA CCCCCA CCAG), the amplified F4 fragment was digested with restriction enzymes NotI/AfeI, and ligated to pANCR vector digested with the same restriction enzymes to obtain pANCR-F4 plasmid, the PCR amplified F3 fragment was ligated to pANCR-F4 with NruI/BsrGI to obtain pANCR-F34 plasmid, the PCR amplified F2 fragment was ligated to pANCR-F34 with PmlI/NruI to obtain pANCR-F234, and the PCR amplified F1 fragment was ligated to pANCR-F234 with NotI/PmlI to obtain the final full-length cDNA clone designated as pEV71-js 1.
In order to construct an infectious clone plasmid with a reporter gene EGFP (As shown in FIG. 1B), three sequences are fused by utilizing fusion PCR, wherein EGFP-F1 is an EV715UTR sequence, PCR amplification primers are S: CCTGA CGTG TCGA CGCGG, SEQ ID NO 18, As: cctc gccct tgctcac CATcatatgG TTTAGCTGT GTTAAG GGTCAAGA, SEQ ID NO 19, EGFP-F2 is a segment containing EGFP, and the PCR amplification primers are S: TCTT GACC CTTAAC ACAGC TAA ACcata tgATG gtga gcaag ggcg agg, SEQ ID NO 20, As: CGCT GTGT AGACAC TTGCGA ACCAAG AGTGGTG ATCGC atgcat cttgtac agctcgt ccatgc cg, SEQ ID NO 21, EGFP-F3 is a fragment containing VP4 and VP2 regions, PCR amplification primers are S: cggca tggac gagct gtaca agatgc atGCGA TCAC CACT CTTGG TTCGC AAGTG TCTA CACAG CG, SEQ ID NO 22; CTGC ACGT GGAT GCA GAA CCC, SEQ ID NO 23, and after the three fragments are fused by fusion PCR, the NotI/PmlI is used for connecting into pEV71-js1 plasmid to replace the sequence in the original plasmid, thus obtaining pEV71-js1-EGFP plasmid.
In order to construct an infectious clone plasmid (shown in figure 1B) with a reporter gene Nluc, fusion PCR is utilized to fuse two sequences, wherein Nluc-F1 is an EV715UTR sequence, PCR amplification primers are S: CTGC ACGT GGAT GCA GAA CCC, SEQ ID NO 24, As: gaaa tcttcg agtgtga agaccattct agaGTT TAGC TGTG TTA AGGG TCA AG, SEQ ID NO 25, EGFP-F2 is a fragment containing Nluc, and PCR amplification primers are S: CTTG ACCC TTAAC ACAG CTAA ACtct agaat ggtctt cacac tcgaa gatttc, SEQ ID NO 26; as CGCat gcatcg ccaga atgcgt tcgca, SEQ ID NO 27. After the two fragments are fused by fusion PCR, NotI/NsiI is used for connecting into pEV71-js1-EGFP plasmid, and the sequence in the original plasmid is replaced, so that pEV71-js1-Nluc plasmid is obtained.
Example 2 infectious cDNA clone of EV71 strain js1 produces viral replication and infection
The infectious cloning plasmid pEV71-js1 was digested with HindIII, linearized, and then T7 using an in vitro transcription kit (Ambion). The in vitro transcribed RNA3g was transferred into Vero cells by the method of electrical transduction. After 2 days of electrotransfer, when the cells are diseased, the virus supernatant is collected, centrifuged at 3000g for 10min, and then filtered through a 0.45m filter membrane to remove cell debris. The virus in the supernatant was titrated using a plaque assay. The plaques formed by the virus produced by the infectious cloned plasmid were compared with the plaques of the original isolated parent virus as shown in FIG. 2 (top), and there was no significant difference in the morphology and size of the plaques. The Vero cells were re-infected with the same titer of virus produced by the infectious clone as the parent virus (MOI ═ 0.1), and cell supernatants at different times after infection were collected and titrated by plaque assay (expressed as PFU/ml), and the growth curves of both were shown in fig. 2 (bottom), and there was no significant difference between the growth curves.
Example 3 production of recombinant viruses containing reporter genes Nluc and EGFP and their stability
The infectious clone plasmid containing the reporter genes Nluc and EGFP and the infectious clone plasmid without the reporter genes are transcribed into virus RNA in vitro in the same way, are electrically transferred into Vero cells, the virus in the cell supernatant is collected after two days, and the virus titer is titrated on the Vero cells by using a plaque experiment. As shown in FIG. 3A, the plaques of the viruses containing the reporter genes EGFP and Nluc are similar in morphology and size to those of the viruses not containing the reporter genes. Vero cells (MOI 0.1) were re-infected with the same titer of viruses containing the reporter genes and viruses not containing the reporter genes, supernatants were collected at different days after infection, and titrated by plaque assay to obtain growth curves, as shown in FIG. 3B, in which the growth cycle of the viruses carrying the reporter genes was shown to be delayed compared to the wild viruses, suggesting that fusion of the reporter genes results in a delay in the replication cycle of the viruses. The replication ability of the virus produced by the infectious clone plasmid containing the reporter gene Nluc can be judged by measuring the intracellular Nluc activity using the Nluc substrate (Promega). The plasmid containing mutations of VP 1E 145G and 3C C147A was transcribed in vitro to viral RNA, which was then introduced into Vero cells by electroporation and the intracellular Nluc activity was measured at different times, as shown in fig. 5, viral RNA containing a 3C protease inactivation mutation (C147A) showed no increase in intracellular Nluc activity after 8 hours after transfection and only responded to the initial translation signal of viral RNA, whereas Nluc appeared to gradually increase with time after transfection of wild-type viral live VP145G viral RNA, indicating a normal viral replication signal. To demonstrate the stability of reporter genes of viruses containing EGFP reporter genes, cell supernatants infected by EV71-EGFP virus are diluted by 1:10, new Vero cells are re-infected, the cells are observed by a fluorescence microscope after two days of infection and the supernatants are collected, the new Vero cells (C +1) are re-infected by 1:10 dilution, and the cells are observed by the fluorescence microscope and the supernatants are collected for re-infection after two days of infection; after successive passages, the expression of EGFP in the infected cells is observed, and the EGFP gene is still stable after at least 6 passages.
Example 4 cloning of infectious cDNA of EV71 strain js1 Generation of Virus-infected mice to construct animal infection model
As shown in FIG. 6A, the virus produced by the infectious cDNA clone infected different strains of 3-day-old fetal mice (1.4X 10)4pfu/mouse), mice were observed 5 days after infection, and mice infected with virus all appeared quadriplegia compared to uninfected mice. Survival curves of mice of different strains after infection are shown in fig. 6B, and within 10 days, 100% mortality is achieved. Portable bagThe virus generated by the infectious clone with the VP 1E 145G mutation does not cause death of mice after infecting ICR mice of 3 days old, which indicates that the E145 site is a decisive site for death of virus infected mice and explains the reason that the death rate of infection of the passaged virus is reduced along with the increase of the passage number.
Sequence listing
<110> university of Compound Dan
<120> infectious cDNA clone based on EV71 strain and application thereof
<130> 20190601
<160> 27
<170> SIPOSequenceListing 1.0
<210> 1
<211> 9446
<212> DNA
<213> Artificial
<400> 1
gctagcggag tgtatactgg cttactatgt tggcactgat gagggtgtca gtgaagtgct 60
tcatgtggca ggagaaaaaa ggctgcaccg gtgcgtcagc agaatatgtg atacaggata 120
tattccgctt cctcgctcac tgactcgcta cgctcggtcg ttcgactgcg gcgagcggaa 180
atggcttacg aacggggcgg agatttcctg gaagatgcca ggaagatact taacagggaa 240
gtgagagggc cgcggcaaag ccgtttttcc ataggctccg cccccctgac aagcatcacg 300
aaatctgacg ctcaaatcag tggtggcgaa acccgacagg actataaaga taccaggcgt 360
ttcccctggc ggctccctcg tgcgctctcc tgttcctgcc tttcggttta ccggtgtcat 420
tccgctgtta tggccgcgtt tgtctcattc cacgcctgac actcagttcc gggtaggcag 480
ttcgctccaa gctggactgt atgcacgaac cccccgttca gtccgaccgc tgcgccttat 540
ccggtaacta tcgtcttgag tccaacccgg aaagacatgc aaaagcacca ctggcagcag 600
ccactggtaa ttgatttaga ggagttagtc ttgaagtcat gcgccggtta aggctaaact 660
gaaaggacaa gttttggtga ctgcgctcct ccaagccagt tacctcggtt caaagagttg 720
gtagctcaga gaaccttcga aaaaccgccc tgcaaggcgg ttttttcgtt ttcagagcaa 780
gagattacgc gcagaccaaa acgatctcaa gaagatcatc ttattaaggg gtctgacgct 840
cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 900
acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa 960
acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta 1020
tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc 1080
ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat 1140
ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta 1200
tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt 1260
aatagtttgc gcaacgttgt tgccattgct gcaggcatcg tggtgtcacg ctcgtcgttt 1320
ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg 1380
ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc 1440
gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc 1500
gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg 1560
cggcgaccga gttgctcttg cccggcgtca acacgggata ataccgcgcc acatagcaga 1620
actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta 1680
ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct 1740
tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag 1800
ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca atattattga 1860
agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat 1920
aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt gtcgacgcgg 1980
ccgctaatac gactcactat aggttaaaac agcctgtggg ttgcacccac tcacagggcc 2040
tactgggcgc aagcactctg gtacctcggt acctttgtgc gcctgtttta cacccccccc 2100
ccaatgaaac ttagaagcaa taaaccacga tcaatagcag gcataacgct ccagttatgt 2160
cttgatcaag cacttctgtt tccccggact gagtatcaat agactgctcg cgcggttgaa 2220
ggagaaaacg ttcgttatcc ggctaactac ttcggaaaac ctagtaacac catgaaagtt 2280
gcggagagct tcgttcagca ctcccccagt gtagatcagg tcgatgagtc accgcgttcc 2340
ccacgggcga ccgtggcggt ggctgcgttg gcggcctgcc catggggtaa cccatggggc 2400
gctctaatac ggacatggtg tgaagagtct actgagctag ttggtagtcc tccggcccct 2460
gaatgcggct aatcccaact gcggagcaca cgcccacaag ccagcgggta gtgtgtcgta 2520
acgggtaact ctgcagcgga accgactact ttgggtgtcc gtgtttcctt ttatctttat 2580
attggctgct tatggtgaca attaaagaat tgttaccata tagctattgg attagccatc 2640
cggtgtgcaa cagagcaatt atttacctat ttattggttt tgtaccatta acctcgaatt 2700
ctgtgaccac ccttaattat atcttgaccc ttaacacagc taaacatggg ttcgcaagtg 2760
tctacacagc gctccggttc ttacgaaaac tcaaactcag ccactgaggg ttctaccata 2820
aactacacca ccattaatta ctacaaagac tcctatgctg ccacagcagg caaacagagt 2880
ctcaagcagg atccagacaa gtttgcaaat cctgttaaag acatattcac cgaaatggca 2940
gcgccactga agtccccatc cgctgaggca tgtggataca gtgatcgagt ggcgcaatta 3000
actattggca actccaccat cacgacgcaa gaagcggcta acatcatagt cggctatggt 3060
gagtggcctt cctactgctc agattctgac gctacagcag tggataaacc aacgcgcccg 3120
gatgtttcag tgaacaggtt ttacacattg gacactaaat tgtgggagaa atcgtccaag 3180
ggatggtact ggaagttccc ggatgtgtta actgaaactg gggtttttgg gcaaaatgca 3240
caattccact acctctaccg atcagggttc tgcatccacg tgcagtgcaa tgccagtaaa 3300
ttccaccaag gagcactcct agtcgctgtc ctaccagagt atgtcattgg gacagtggca 3360
ggcggtacag ggacggaaga cacccacccc ccctacaagc agacccaacc cggcgccgat 3420
ggtttcgagt tgcaacaccc gtacgtgctt gatgctggca tcccaatatc acagttaaca 3480
gtgtgcccac accagtggat taatttgagg accaacaatt gtgctacaat aatagtgcca 3540
tacattaacg cactgccttt tgattctgcc ttgaaccatt gcaactttgg cctgttagtt 3600
gtgcctatta gcccactaga ctacgaccaa ggagcaacgc cagtaatccc tataactatc 3660
acattggccc caatgtgctc tgaattcgca ggtcttaggc aggcagtcac gcaagggttc 3720
cccaccgagc taaaacctgg cacaaatcaa tttttaacca ccgatgatgg cgtctcagca 3780
cctattctac caaacttcca ccccaccccg tgtatccaca tacctggtga agttaggaac 3840
ttgctagagt tatgccaggt ggagaccatt ctggaggtta acaatgtgcc cacgaatgcc 3900
actagcttaa tggagagact gcgcttcccg gtctcagcac aagcagggaa aggtgaactg 3960
tgtgcggtgt ttagagccga tcctgggcga aatggaccat ggcaatccac cttactgggc 4020
cagttgtgcg ggtactacac ccaatggtca gggtcattgg aagtcacctt catgtttact 4080
ggatccttca tggctaccgg caagatgctc atagcctata caccgccagg gggtcctctg 4140
cccaaggacc gggcgaccgc catgttgggc acgcacgtca tctgggattt tgggctgcaa 4200
tcgtctgtta cccttgtaat accatggatc agtaacactc attatagagc acatgcccga 4260
gatggagtgt ttgactatta cactacaggg ttagtcagta tatggtacca gacaaattac 4320
gtggttccaa tcggtgcgcc caacacagcc tatataatag cactagcggc agcccaaaag 4380
aacttcacta tgaaattgtg caaggatgct agtgatatcc tgcagacggg caccatccag 4440
ggagataggg tggcagatgt aattgaaagt tccataggag atagcgtgag cagagccctc 4500
actcacgctc taccagcacc cacaggccaa aacacacagg tgagcagtca tcgactggat 4560
acaggcaagg ttccagcact ccaagctgct gaaattgggg catcatcaaa tgctagtgac 4620
gagagcatga ttgaaacacg ttgtgttctt aactcgcata gtacagctga gaccactctt 4680
gatagtttct tcagtagggc aggattagtt ggagagatag atctccctct tgagggcaca 4740
actaacccaa atggttatgc caactgggac atagatataa caggttacgc gcaaatgcgt 4800
agaaaggtag agctattcac ctacatgcgt tttgatgcag agttcacttt tgttgcgtgc 4860
acacccaccg gggaggttgt cccacaattg ctccaatata tgtttgtgcc acctggagcc 4920
cctaagccag attctaggga atcccttgca tggcaaaccg ccaccaaccc ctcagttttt 4980
gtcaagctgt cagaccctcc ggcgcaggtt tcagtgccat tcatgtcacc tgcgagtgct 5040
tatcaatggt tttatgacgg atatcccaca ttcggagaac acaaacagga gaaagacctt 5100
gaatacgggg catgtcctaa taacatgatg ggtacattct cagtgcggac tgtggggacc 5160
tccaagtcca agtacccttt agtggttagg atttacatga gaatgaagca cgtcagggcg 5220
tggatacctc gcccgatgcg caaccagaac tacctgttca aagccaaccc aaattatgct 5280
ggcaactcta ttaagccaac tggtgccagt cgcacagcga tcaccactct tgggaaattt 5340
ggacaacagt ctggggctat ttatgtgggc aactttagag tggtcaaccg acatcttgcc 5400
acccataatg attgggcaaa tcttgtttgg gaagacagct ctcgcgactt gctcgtgtca 5460
tccaccactg cccaaggttg tgacacgatt gcccgttgcg attgccagac aggggtgtac 5520
tactgtaact cgatgagaaa acactaccca gtcagttttt caaaacccag cctgatctat 5580
gtagaggcta gcgagtatta cccagccagg taccaatcac atctcatgct cgcacagggt 5640
cactcggaac ctggtgattg cggtggtatc cttaggtgcc aacatggcgt catcggcata 5700
gtgtctactg gtggcaatgg gctcgttggc tttgcagacg tcagagacct cttgtggtta 5760
gatgaagaag ctatggaaca gggcgtgtcc gactacatta agggtctcgg agatgctttt 5820
ggaacaggct tcactgacgc agtctcaagg gaggttgaag ctctcaagaa ctatcttata 5880
gggtctgaag gagcagttga gaaaattttg aaaaatctta ttaaactaat ctctgcactg 5940
gtgattgtga tcagaagtga ttacgacatg gttaccctca ctgcaacctt agcgctgata 6000
ggttgtcatg gcagtccttg ggcttggatt aaagccaaaa cagcctccat cttaggtatc 6060
cctatcgccc aaaagcagag cgcttcctgg ctcaagaagt tcaatgacat ggccaacgcc 6120
gctaaggggt tagagtgggt ttccaacaag atcagcaaat ttattgattg gcttaaggag 6180
aaaatagtac cagcagccag ggagaaggtt gaattcctaa ataacttgaa acagctgcca 6240
ctgctagaga atcagatctc gaacttggaa caatctgctg cttcacaaga ggaccttgaa 6300
gtcatgtttg ggaatgtgtc gtacctagct cacttctgtc gcaagtttca accgctatac 6360
gccacggaag ctaaaagagt ctatgccctg gagaagagaa tgaataacta tatgcagttc 6420
aagagcaaac accgaattga acctgtatgt ctcattatta ggggctcacc aggcaccggg 6480
aagtctctag ccactggtat tattgctcga gcaatcgctg ataagtacca ctccagcgtg 6540
tactcgctcc caccagaccc ggatcatttt gacggttaca agcaacaggt ggttacagtg 6600
atggatgatt tgtgtcaaaa ccccgatggt aaggatatgt ccttattctg tcaaatggta 6660
tccaccgtag atttcattcc accaatggct tctctcgagg agaagggagt ttccttcacc 6720
tctaagtttg tcatcgcatc cactaatgcc agtaatatca tagtaccaac agtgtctgat 6780
tctgacgcta ttcgccgcag gttctacatg gactgtgaca ttgaagtgac agactcgtac 6840
aaaacagatc taggtagact ggatgcaggg cgagccgcta aactgtgttc tgaaaataac 6900
actgcaaatt tcaaacgttg cagcccatta gtgtgtggga aagccatcca acttagagat 6960
agaaagtcta aagtcagata cagtgtggat acggtggttt cagaacttat tagggaatac 7020
agcaataggt ccgccattgg taacacaatc gaggctcttt tccaaggtcc acccaagttc 7080
aggccaatta ggattagcct tgaagaaaaa ccagccccag acgctattag cgatctcctt 7140
gctagtgtag atagtgaaga agtgcgccag tactgcaggg atcaaggctg gattattcct 7200
gaagctccca ccaatgtgga gcggcacctt aatagagcgg tgctcgtcat gcaatccatc 7260
accacagtag tggcggttgt ttcgttggtg tacgtcatct acaagctctt tgcagggttt 7320
cagggtgcat attctggtgc tcctaagcaa gtgcttaaga aacctgctct tcgcacagca 7380
acagtgcagg gtccgagcct tgactttgct ctctccctac tgagaaggaa catcaggcag 7440
gtccaaacag accaagggca tttcaccatg ttgggtgtta gggatcgctt agcagtcctc 7500
ccacgccact cacaacctgg caaaaccatt tggattgagc acaaactcgt gaacgtcctt 7560
gatgcagttg aactggtgga tgagcaagga gtcaacctgg aattaaccct catcactctt 7620
gacaccaacg agaagtttag ggatatcacc aaattcatcc cagaaaatat cagcactgct 7680
agcgatgcca ccctagtgat caacacggag cacatgccgt caatgtttgt cccggtgggt 7740
gacgttgtgc agtatggctt tttgaatctc agtggcaagc ctacccatcg caccatgatg 7800
tacaattttc ctactaaagc aggacagtgt ggaggagtgg tgacatctgt tgggaaggtt 7860
gtcggtattc acattggtgg caatggcaga caaggttttt gcgcaggcct caaaaggagt 7920
tactttgcta gtgaacaagg agagatccag tgggttaagc ccaataaaga aactggaaga 7980
ctcaacatca atggaccaac ccgcaccaag ttagaaccta gtgtattcca tgacatcttc 8040
gagggaaata aggaaccagc tgtcttgcac agtaaagacc cccgacttga ggtagatttt 8100
gaacaggccc tgttctctaa gtatgtggga aacacactac atgagcctga cgagtacatc 8160
aaagaggcag ctctacatta tgcaaaccaa ttaaagcaac tagaaatcaa tacctctcaa 8220
atgagcatgg aggaggcctg ctatggtact gagaatcttg aggctattga tcttcacact 8280
agtgcaggtt acccctatag tgccctaggg ataaagaaaa gagacatctt agaccctacc 8340
accagggacg tgagtagaat gaagttctac atggacaagt atggtcttga tcttccctac 8400
tccacttatg tcaaggacga gctacgctcg attgataaaa tcaagaaagg gaagtcccgc 8460
ctgatcgagg ccagtagtct aaatgattca gtgtacctca gaatggcttt cgggcatttg 8520
tatgaggctt tccacgcaaa tcctgggacg ataactggat cggccgtggg gtgtaaccct 8580
gacacattct ggagcaagct gccaattttg ctccctggtt cactctttgc ctttgactac 8640
tcaggctatg atgccagcct tagccctgtc tggttcagag cattagaatt ggttcttagg 8700
gagatagggt atagtgaaga ggcaatctca ctcattgagg gaatcaacca cacacatcat 8760
gtgtatcgta ataagaccta ttgcgtgctt ggtgggatgc cctcaggctg ttcaggaaca 8820
tccatcttca actcaatgat caacaacatt attatcagag cactgctcat aaaaacattt 8880
aagggcattg atttggatga actcaacatg gtcgcttatg gagacgatgt gctcgctagc 8940
tatcccttcc caattgattg cttggaacta gcaaagactg gtaaggagta tggtctgacc 9000
atgacccctg ctgataaatc tccttgcttt aatgaggtca attggggtaa tgcgaccttc 9060
ctcaaaaggg gctttttgcc cgatgaacag tttccatttt tgattcaccc tactatgcca 9120
atgagggaga tccatgagtc cattcgatgg accaaggacg cacggaacac tcaagatcat 9180
gtgcggtcct tgtgcctcct agcatggcat aatggtaagc aagaatacga gaagtttgtg 9240
agcacaatta ggtctgtccc agtagggaga gcgttggcta ttccaaatta tgaaaatctt 9300
agacgaaatt ggctcgagtt attttagagg ttatacacac ctcaacccca ccagaaatct 9360
ggtcgtgaat gtgactggtg ggggtaaatt tgttataacc agaatagcaa aaaaaaaaaa 9420
aaaaaaaaaa aaaaaaaaaa gcttat 9446
<210> 2
<211> 7405
<212> DNA
<213> Artificial
<400> 2
ttaaaacagc ctgtgggttg cacccactca cagggcctac tgggcgcaag cactctggta 60
cctcggtacc tttgtgcgcc tgttttacac ccccccccca atgaaactta gaagcaataa 120
accacgatca atagcaggca taacgctcca gttatgtctt gatcaagcac ttctgtttcc 180
ccggactgag tatcaataga ctgctcgcgc ggttgaagga gaaaacgttc gttatccggc 240
taactacttc ggaaaaccta gtaacaccat gaaagttgcg gagagcttcg ttcagcactc 300
ccccagtgta gatcaggtcg atgagtcacc gcgttcccca cgggcgaccg tggcggtggc 360
tgcgttggcg gcctgcccat ggggtaaccc atggggcgct ctaatacgga catggtgtga 420
agagtctact gagctagttg gtagtcctcc ggcccctgaa tgcggctaat cccaactgcg 480
gagcacacgc ccacaagcca gcgggtagtg tgtcgtaacg ggtaactctg cagcggaacc 540
gactactttg ggtgtccgtg tttcctttta tctttatatt ggctgcttat ggtgacaatt 600
aaagaattgt taccatatag ctattggatt agccatccgg tgtgcaacag agcaattatt 660
tacctattta ttggttttgt accattaacc tcgaattctg tgaccaccct taattatatc 720
ttgaccctta acacagctaa acatgggttc gcaagtgtct acacagcgct ccggttctta 780
cgaaaactca aactcagcca ctgagggttc taccataaac tacaccacca ttaattacta 840
caaagactcc tatgctgcca cagcaggcaa acagagtctc aagcaggatc cagacaagtt 900
tgcaaatcct gttaaagaca tattcaccga aatggcagcg ccactgaagt ccccatccgc 960
tgaggcatgt ggatacagtg atcgagtggc gcaattaact attggcaact ccaccatcac 1020
gacgcaagaa gcggctaaca tcatagtcgg ctatggtgag tggccttcct actgctcaga 1080
ttctgacgct acagcagtgg ataaaccaac gcgcccggat gtttcagtga acaggtttta 1140
cacattggac actaaattgt gggagaaatc gtccaaggga tggtactgga agttcccgga 1200
tgtgttaact gaaactgggg tttttgggca aaatgcacaa ttccactacc tctaccgatc 1260
agggttctgc atccacgtgc agtgcaatgc cagtaaattc caccaaggag cactcctagt 1320
cgctgtccta ccagagtatg tcattgggac agtggcaggc ggtacaggga cggaagacac 1380
ccaccccccc tacaagcaga cccaacccgg cgccgatggt ttcgagttgc aacacccgta 1440
cgtgcttgat gctggcatcc caatatcaca gttaacagtg tgcccacacc agtggattaa 1500
tttgaggacc aacaattgtg ctacaataat agtgccatac attaacgcac tgccttttga 1560
ttctgccttg aaccattgca actttggcct gttagttgtg cctattagcc cactagacta 1620
cgaccaagga gcaacgccag taatccctat aactatcaca ttggccccaa tgtgctctga 1680
attcgcaggt cttaggcagg cagtcacgca agggttcccc accgagctaa aacctggcac 1740
aaatcaattt ttaaccaccg atgatggcgt ctcagcacct attctaccaa acttccaccc 1800
caccccgtgt atccacatac ctggtgaagt taggaacttg ctagagttat gccaggtgga 1860
gaccattctg gaggttaaca atgtgcccac gaatgccact agcttaatgg agagactgcg 1920
cttcccggtc tcagcacaag cagggaaagg tgaactgtgt gcggtgttta gagccgatcc 1980
tgggcgaaat ggaccatggc aatccacctt actgggccag ttgtgcgggt actacaccca 2040
atggtcaggg tcattggaag tcaccttcat gtttactgga tccttcatgg ctaccggcaa 2100
gatgctcata gcctatacac cgccaggggg tcctctgccc aaggaccggg cgaccgccat 2160
gttgggcacg cacgtcatct gggattttgg gctgcaatcg tctgttaccc ttgtaatacc 2220
atggatcagt aacactcatt atagagcaca tgcccgagat ggagtgtttg actattacac 2280
tacagggtta gtcagtatat ggtaccagac aaattacgtg gttccaatcg gtgcgcccaa 2340
cacagcctat ataatagcac tagcggcagc ccaaaagaac ttcactatga aattgtgcaa 2400
ggatgctagt gatatcctgc agacgggcac catccaggga gatagggtgg cagatgtaat 2460
tgaaagttcc ataggagata gcgtgagcag agccctcact cacgctctac cagcacccac 2520
aggccaaaac acacaggtga gcagtcatcg actggataca ggcaaggttc cagcactcca 2580
agctgctgaa attggggcat catcaaatgc tagtgacgag agcatgattg aaacacgttg 2640
tgttcttaac tcgcatagta cagctgagac cactcttgat agtttcttca gtagggcagg 2700
attagttgga gagatagatc tccctcttga gggcacaact aacccaaatg gttatgccaa 2760
ctgggacata gatataacag gttacgcgca aatgcgtaga aaggtagagc tattcaccta 2820
catgcgtttt gatgcagagt tcacttttgt tgcgtgcaca cccaccgggg aggttgtccc 2880
acaattgctc caatatatgt ttgtgccacc tggagcccct aagccagatt ctagggaatc 2940
ccttgcatgg caaaccgcca ccaacccctc agtttttgtc aagctgtcag accctccggc 3000
gcaggtttca gtgccattca tgtcacctgc gagtgcttat caatggtttt atgacggata 3060
tcccacattc ggagaacaca aacaggagaa agaccttgaa tacggggcat gtcctaataa 3120
catgatgggt acattctcag tgcggactgt ggggacctcc aagtccaagt accctttagt 3180
ggttaggatt tacatgagaa tgaagcacgt cagggcgtgg atacctcgcc cgatgcgcaa 3240
ccagaactac ctgttcaaag ccaacccaaa ttatgctggc aactctatta agccaactgg 3300
tgccagtcgc acagcgatca ccactcttgg gaaatttgga caacagtctg gggctattta 3360
tgtgggcaac tttagagtgg tcaaccgaca tcttgccacc cataatgatt gggcaaatct 3420
tgtttgggaa gacagctctc gcgacttgct cgtgtcatcc accactgccc aaggttgtga 3480
cacgattgcc cgttgcgatt gccagacagg ggtgtactac tgtaactcga tgagaaaaca 3540
ctacccagtc agtttttcaa aacccagcct gatctatgta gaggctagcg agtattaccc 3600
agccaggtac caatcacatc tcatgctcgc acagggtcac tcggaacctg gtgattgcgg 3660
tggtatcctt aggtgccaac atggcgtcat cggcatagtg tctactggtg gcaatgggct 3720
cgttggcttt gcagacgtca gagacctctt gtggttagat gaagaagcta tggaacaggg 3780
cgtgtccgac tacattaagg gtctcggaga tgcttttgga acaggcttca ctgacgcagt 3840
ctcaagggag gttgaagctc tcaagaacta tcttataggg tctgaaggag cagttgagaa 3900
aattttgaaa aatcttatta aactaatctc tgcactggtg attgtgatca gaagtgatta 3960
cgacatggtt accctcactg caaccttagc gctgataggt tgtcatggca gtccttgggc 4020
ttggattaaa gccaaaacag cctccatctt aggtatccct atcgcccaaa agcagagcgc 4080
ttcctggctc aagaagttca atgacatggc caacgccgct aaggggttag agtgggtttc 4140
caacaagatc agcaaattta ttgattggct taaggagaaa atagtaccag cagccaggga 4200
gaaggttgaa ttcctaaata acttgaaaca gctgccactg ctagagaatc agatctcgaa 4260
cttggaacaa tctgctgctt cacaagagga ccttgaagtc atgtttggga atgtgtcgta 4320
cctagctcac ttctgtcgca agtttcaacc gctatacgcc acggaagcta aaagagtcta 4380
tgccctggag aagagaatga ataactatat gcagttcaag agcaaacacc gaattgaacc 4440
tgtatgtctc attattaggg gctcaccagg caccgggaag tctctagcca ctggtattat 4500
tgctcgagca atcgctgata agtaccactc cagcgtgtac tcgctcccac cagacccgga 4560
tcattttgac ggttacaagc aacaggtggt tacagtgatg gatgatttgt gtcaaaaccc 4620
cgatggtaag gatatgtcct tattctgtca aatggtatcc accgtagatt tcattccacc 4680
aatggcttct ctcgaggaga agggagtttc cttcacctct aagtttgtca tcgcatccac 4740
taatgccagt aatatcatag taccaacagt gtctgattct gacgctattc gccgcaggtt 4800
ctacatggac tgtgacattg aagtgacaga ctcgtacaaa acagatctag gtagactgga 4860
tgcagggcga gccgctaaac tgtgttctga aaataacact gcaaatttca aacgttgcag 4920
cccattagtg tgtgggaaag ccatccaact tagagataga aagtctaaag tcagatacag 4980
tgtggatacg gtggtttcag aacttattag ggaatacagc aataggtccg ccattggtaa 5040
cacaatcgag gctcttttcc aaggtccacc caagttcagg ccaattagga ttagccttga 5100
agaaaaacca gccccagacg ctattagcga tctccttgct agtgtagata gtgaagaagt 5160
gcgccagtac tgcagggatc aaggctggat tattcctgaa gctcccacca atgtggagcg 5220
gcaccttaat agagcggtgc tcgtcatgca atccatcacc acagtagtgg cggttgtttc 5280
gttggtgtac gtcatctaca agctctttgc agggtttcag ggtgcatatt ctggtgctcc 5340
taagcaagtg cttaagaaac ctgctcttcg cacagcaaca gtgcagggtc cgagccttga 5400
ctttgctctc tccctactga gaaggaacat caggcaggtc caaacagacc aagggcattt 5460
caccatgttg ggtgttaggg atcgcttagc agtcctccca cgccactcac aacctggcaa 5520
aaccatttgg attgagcaca aactcgtgaa cgtccttgat gcagttgaac tggtggatga 5580
gcaaggagtc aacctggaat taaccctcat cactcttgac accaacgaga agtttaggga 5640
tatcaccaaa ttcatcccag aaaatatcag cactgctagc gatgccaccc tagtgatcaa 5700
cacggagcac atgccgtcaa tgtttgtccc ggtgggtgac gttgtgcagt atggcttttt 5760
gaatctcagt ggcaagccta cccatcgcac catgatgtac aattttccta ctaaagcagg 5820
acagtgtgga ggagtggtga catctgttgg gaaggttgtc ggtattcaca ttggtggcaa 5880
tggcagacaa ggtttttgcg caggcctcaa aaggagttac tttgctagtg aacaaggaga 5940
gatccagtgg gttaagccca ataaagaaac tggaagactc aacatcaatg gaccaacccg 6000
caccaagtta gaacctagtg tattccatga catcttcgag ggaaataagg aaccagctgt 6060
cttgcacagt aaagaccccc gacttgaggt agattttgaa caggccctgt tctctaagta 6120
tgtgggaaac acactacatg agcctgacga gtacatcaaa gaggcagctc tacattatgc 6180
aaaccaatta aagcaactag aaatcaatac ctctcaaatg agcatggagg aggcctgcta 6240
tggtactgag aatcttgagg ctattgatct tcacactagt gcaggttacc cctatagtgc 6300
cctagggata aagaaaagag acatcttaga ccctaccacc agggacgtga gtagaatgaa 6360
gttctacatg gacaagtatg gtcttgatct tccctactcc acttatgtca aggacgagct 6420
acgctcgatt gataaaatca agaaagggaa gtcccgcctg atcgaggcca gtagtctaaa 6480
tgattcagtg tacctcagaa tggctttcgg gcatttgtat gaggctttcc acgcaaatcc 6540
tgggacgata actggatcgg ccgtggggtg taaccctgac acattctgga gcaagctgcc 6600
aattttgctc cctggttcac tctttgcctt tgactactca ggctatgatg ccagccttag 6660
ccctgtctgg ttcagagcat tagaattggt tcttagggag atagggtata gtgaagaggc 6720
aatctcactc attgagggaa tcaaccacac acatcatgtg tatcgtaata agacctattg 6780
cgtgcttggt gggatgccct caggctgttc aggaacatcc atcttcaact caatgatcaa 6840
caacattatt atcagagcac tgctcataaa aacatttaag ggcattgatt tggatgaact 6900
caacatggtc gcttatggag acgatgtgct cgctagctat cccttcccaa ttgattgctt 6960
ggaactagca aagactggta aggagtatgg tctgaccatg acccctgctg ataaatctcc 7020
ttgctttaat gaggtcaatt ggggtaatgc gaccttcctc aaaaggggct ttttgcccga 7080
tgaacagttt ccatttttga ttcaccctac tatgccaatg agggagatcc atgagtccat 7140
tcgatggacc aaggacgcac ggaacactca agatcatgtg cggtccttgt gcctcctagc 7200
atggcataat ggtaagcaag aatacgagaa gtttgtgagc acaattaggt ctgtcccagt 7260
agggagagcg ttggctattc caaattatga aaatcttaga cgaaattggc tcgagttatt 7320
ttagaggtta tacacacctc aaccccacca gaaatctggt cgtgaatgtg actggtgggg 7380
gtaaatttgt tataaccaga atagc 7405
<210> 3
<211> 1987
<212> DNA
<213> Artificial
<400> 3
agcgctagcg gagtgtatac tggcttacta tgttggcact gatgagggtg tcagtgaagt 60
gcttcatgtg gcaggagaaa aaaggctgca ccggtgcgtc agcagaatat gtgatacagg 120
atatattccg cttcctcgct cactgactcg ctacgctcgg tcgttcgact gcggcgagcg 180
gaaatggctt acgaacgggg cggagatttc ctggaagatg ccaggaagat acttaacagg 240
gaagtgagag ggccgcggca aagccgtttt tccataggct ccgcccccct gacaagcatc 300
acgaaatctg acgctcaaat cagtggtggc gaaacccgac aggactataa agataccagg 360
cgtttcccct ggcggctccc tcgtgcgctc tcctgttcct gcctttcggt ttaccggtgt 420
cattccgctg ttatggccgc gtttgtctca ttccacgcct gacactcagt tccgggtagg 480
cagttcgctc caagctggac tgtatgcacg aaccccccgt tcagtccgac cgctgcgcct 540
tatccggtaa ctatcgtctt gagtccaacc cggaaagaca tgcaaaagca ccactggcag 600
cagccactgg taattgattt agaggagtta gtcttgaagt catgcgccgg ttaaggctaa 660
actgaaagga caagttttgg tgactgcgct cctccaagcc agttacctcg gttcaaagag 720
ttggtagctc agagaacctt cgaaaaaccg ccctgcaagg cggttttttc gttttcagag 780
caagagatta cgcgcagacc aaaacgatct caagaagatc atcttattaa ggggtctgac 840
gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc 900
ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 960
taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 1020
ctatttcgtt catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag 1080
ggcttaccat ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca 1140
gatttatcag caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact 1200
ttatccgcct ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca 1260
gttaatagtt tgcgcaacgt tgttgccatt gctgcaggca tcgtggtgtc acgctcgtcg 1320
tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc 1380
atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg 1440
gccgcagtgt tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca 1500
tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt 1560
atgcggcgac cgagttgctc ttgcccggcg tcaacacggg ataataccgc gccacatagc 1620
agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc 1680
ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca 1740
tcttttactt tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa 1800
aagggaataa gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat 1860
tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa 1920
aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtgtcgacg 1980
cggccgc 1987
<210> 4
<211> 2193
<212> PRT
<213> Artificial
<400> 4
Met Gly Ser Gln Val Ser Thr Gln Arg Ser Gly Ser Tyr Glu Asn Ser
1 5 10 15
Asn Ser Ala Thr Glu Gly Ser Thr Ile Asn Tyr Thr Thr Ile Asn Tyr
20 25 30
Tyr Lys Asp Ser Tyr Ala Ala Thr Ala Gly Lys Gln Ser Leu Lys Gln
35 40 45
Asp Pro Asp Lys Phe Ala Asn Pro Val Lys Asp Ile Phe Thr Glu Met
50 55 60
Ala Ala Pro Leu Lys Ser Pro Ser Ala Glu Ala Cys Gly Tyr Ser Asp
65 70 75 80
Arg Val Ala Gln Leu Thr Ile Gly Asn Ser Thr Ile Thr Thr Gln Glu
85 90 95
Ala Ala Asn Ile Ile Val Gly Tyr Gly Glu Trp Pro Ser Tyr Cys Ser
100 105 110
Asp Ser Asp Ala Thr Ala Val Asp Lys Pro Thr Arg Pro Asp Val Ser
115 120 125
Val Asn Arg Phe Tyr Thr Leu Asp Thr Lys Leu Trp Glu Lys Ser Ser
130 135 140
Lys Gly Trp Tyr Trp Lys Phe Pro Asp Val Leu Thr Glu Thr Gly Val
145 150 155 160
Phe Gly Gln Asn Ala Gln Phe His Tyr Leu Tyr Arg Ser Gly Phe Cys
165 170 175
Ile His Val Gln Cys Asn Ala Ser Lys Phe His Gln Gly Ala Leu Leu
180 185 190
Val Ala Val Leu Pro Glu Tyr Val Ile Gly Thr Val Ala Gly Gly Thr
195 200 205
Gly Thr Glu Asp Thr His Pro Pro Tyr Lys Gln Thr Gln Pro Gly Ala
210 215 220
Asp Gly Phe Glu Leu Gln His Pro Tyr Val Leu Asp Ala Gly Ile Pro
225 230 235 240
Ile Ser Gln Leu Thr Val Cys Pro His Gln Trp Ile Asn Leu Arg Thr
245 250 255
Asn Asn Cys Ala Thr Ile Ile Val Pro Tyr Ile Asn Ala Leu Pro Phe
260 265 270
Asp Ser Ala Leu Asn His Cys Asn Phe Gly Leu Leu Val Val Pro Ile
275 280 285
Ser Pro Leu Asp Tyr Asp Gln Gly Ala Thr Pro Val Ile Pro Ile Thr
290 295 300
Ile Thr Leu Ala Pro Met Cys Ser Glu Phe Ala Gly Leu Arg Gln Ala
305 310 315 320
Val Thr Gln Gly Phe Pro Thr Glu Leu Lys Pro Gly Thr Asn Gln Phe
325 330 335
Leu Thr Thr Asp Asp Gly Val Ser Ala Pro Ile Leu Pro Asn Phe His
340 345 350
Pro Thr Pro Cys Ile His Ile Pro Gly Glu Val Arg Asn Leu Leu Glu
355 360 365
Leu Cys Gln Val Glu Thr Ile Leu Glu Val Asn Asn Val Pro Thr Asn
370 375 380
Ala Thr Ser Leu Met Glu Arg Leu Arg Phe Pro Val Ser Ala Gln Ala
385 390 395 400
Gly Lys Gly Glu Leu Cys Ala Val Phe Arg Ala Asp Pro Gly Arg Asn
405 410 415
Gly Pro Trp Gln Ser Thr Leu Leu Gly Gln Leu Cys Gly Tyr Tyr Thr
420 425 430
Gln Trp Ser Gly Ser Leu Glu Val Thr Phe Met Phe Thr Gly Ser Phe
435 440 445
Met Ala Thr Gly Lys Met Leu Ile Ala Tyr Thr Pro Pro Gly Gly Pro
450 455 460
Leu Pro Lys Asp Arg Ala Thr Ala Met Leu Gly Thr His Val Ile Trp
465 470 475 480
Asp Phe Gly Leu Gln Ser Ser Val Thr Leu Val Ile Pro Trp Ile Ser
485 490 495
Asn Thr His Tyr Arg Ala His Ala Arg Asp Gly Val Phe Asp Tyr Tyr
500 505 510
Thr Thr Gly Leu Val Ser Ile Trp Tyr Gln Thr Asn Tyr Val Val Pro
515 520 525
Ile Gly Ala Pro Asn Thr Ala Tyr Ile Ile Ala Leu Ala Ala Ala Gln
530 535 540
Lys Asn Phe Thr Met Lys Leu Cys Lys Asp Ala Ser Asp Ile Leu Gln
545 550 555 560
Thr Gly Thr Ile Gln Gly Asp Arg Val Ala Asp Val Ile Glu Ser Ser
565 570 575
Ile Gly Asp Ser Val Ser Arg Ala Leu Thr His Ala Leu Pro Ala Pro
580 585 590
Thr Gly Gln Asn Thr Gln Val Ser Ser His Arg Leu Asp Thr Gly Lys
595 600 605
Val Pro Ala Leu Gln Ala Ala Glu Ile Gly Ala Ser Ser Asn Ala Ser
610 615 620
Asp Glu Ser Met Ile Glu Thr Arg Cys Val Leu Asn Ser His Ser Thr
625 630 635 640
Ala Glu Thr Thr Leu Asp Ser Phe Phe Ser Arg Ala Gly Leu Val Gly
645 650 655
Glu Ile Asp Leu Pro Leu Glu Gly Thr Thr Asn Pro Asn Gly Tyr Ala
660 665 670
Asn Trp Asp Ile Asp Ile Thr Gly Tyr Ala Gln Met Arg Arg Lys Val
675 680 685
Glu Leu Phe Thr Tyr Met Arg Phe Asp Ala Glu Phe Thr Phe Val Ala
690 695 700
Cys Thr Pro Thr Gly Glu Val Val Pro Gln Leu Leu Gln Tyr Met Phe
705 710 715 720
Val Pro Pro Gly Ala Pro Lys Pro Asp Ser Arg Glu Ser Leu Ala Trp
725 730 735
Gln Thr Ala Thr Asn Pro Ser Val Phe Val Lys Leu Ser Asp Pro Pro
740 745 750
Ala Gln Val Ser Val Pro Phe Met Ser Pro Ala Ser Ala Tyr Gln Trp
755 760 765
Phe Tyr Asp Gly Tyr Pro Thr Phe Gly Glu His Lys Gln Glu Lys Asp
770 775 780
Leu Glu Tyr Gly Ala Cys Pro Asn Asn Met Met Gly Thr Phe Ser Val
785 790 795 800
Arg Thr Val Gly Thr Ser Lys Ser Lys Tyr Pro Leu Val Val Arg Ile
805 810 815
Tyr Met Arg Met Lys His Val Arg Ala Trp Ile Pro Arg Pro Met Arg
820 825 830
Asn Gln Asn Tyr Leu Phe Lys Ala Asn Pro Asn Tyr Ala Gly Asn Ser
835 840 845
Ile Lys Pro Thr Gly Ala Ser Arg Thr Ala Ile Thr Thr Leu Gly Lys
850 855 860
Phe Gly Gln Gln Ser Gly Ala Ile Tyr Val Gly Asn Phe Arg Val Val
865 870 875 880
Asn Arg His Leu Ala Thr His Asn Asp Trp Ala Asn Leu Val Trp Glu
885 890 895
Asp Ser Ser Arg Asp Leu Leu Val Ser Ser Thr Thr Ala Gln Gly Cys
900 905 910
Asp Thr Ile Ala Arg Cys Asp Cys Gln Thr Gly Val Tyr Tyr Cys Asn
915 920 925
Ser Met Arg Lys His Tyr Pro Val Ser Phe Ser Lys Pro Ser Leu Ile
930 935 940
Tyr Val Glu Ala Ser Glu Tyr Tyr Pro Ala Arg Tyr Gln Ser His Leu
945 950 955 960
Met Leu Ala Gln Gly His Ser Glu Pro Gly Asp Cys Gly Gly Ile Leu
965 970 975
Arg Cys Gln His Gly Val Ile Gly Ile Val Ser Thr Gly Gly Asn Gly
980 985 990
Leu Val Gly Phe Ala Asp Val Arg Asp Leu Leu Trp Leu Asp Glu Glu
995 1000 1005
Ala Met Glu Gln Gly Val Ser Asp Tyr Ile Lys Gly Leu Gly Asp Ala
1010 1015 1020
Phe Gly Thr Gly Phe Thr Asp Ala Val Ser Arg Glu Val Glu Ala Leu
1025 1030 1035 1040
Lys Asn Tyr Leu Ile Gly Ser Glu Gly Ala Val Glu Lys Ile Leu Lys
1045 1050 1055
Asn Leu Ile Lys Leu Ile Ser Ala Leu Val Ile Val Ile Arg Ser Asp
1060 1065 1070
Tyr Asp Met Val Thr Leu Thr Ala Thr Leu Ala Leu Ile Gly Cys His
1075 1080 1085
Gly Ser Pro Trp Ala Trp Ile Lys Ala Lys Thr Ala Ser Ile Leu Gly
1090 1095 1100
Ile Pro Ile Ala Gln Lys Gln Ser Ala Ser Trp Leu Lys Lys Phe Asn
1105 1110 1115 1120
Asp Met Ala Asn Ala Ala Lys Gly Leu Glu Trp Val Ser Asn Lys Ile
1125 1130 1135
Ser Lys Phe Ile Asp Trp Leu Lys Glu Lys Ile Val Pro Ala Ala Arg
1140 1145 1150
Glu Lys Val Glu Phe Leu Asn Asn Leu Lys Gln Leu Pro Leu Leu Glu
1155 1160 1165
Asn Gln Ile Ser Asn Leu Glu Gln Ser Ala Ala Ser Gln Glu Asp Leu
1170 1175 1180
Glu Val Met Phe Gly Asn Val Ser Tyr Leu Ala His Phe Cys Arg Lys
1185 1190 1195 1200
Phe Gln Pro Leu Tyr Ala Thr Glu Ala Lys Arg Val Tyr Ala Leu Glu
1205 1210 1215
Lys Arg Met Asn Asn Tyr Met Gln Phe Lys Ser Lys His Arg Ile Glu
1220 1225 1230
Pro Val Cys Leu Ile Ile Arg Gly Ser Pro Gly Thr Gly Lys Ser Leu
1235 1240 1245
Ala Thr Gly Ile Ile Ala Arg Ala Ile Ala Asp Lys Tyr His Ser Ser
1250 1255 1260
Val Tyr Ser Leu Pro Pro Asp Pro Asp His Phe Asp Gly Tyr Lys Gln
1265 1270 1275 1280
Gln Val Val Thr Val Met Asp Asp Leu Cys Gln Asn Pro Asp Gly Lys
1285 1290 1295
Asp Met Ser Leu Phe Cys Gln Met Val Ser Thr Val Asp Phe Ile Pro
1300 1305 1310
Pro Met Ala Ser Leu Glu Glu Lys Gly Val Ser Phe Thr Ser Lys Phe
1315 1320 1325
Val Ile Ala Ser Thr Asn Ala Ser Asn Ile Ile Val Pro Thr Val Ser
1330 1335 1340
Asp Ser Asp Ala Ile Arg Arg Arg Phe Tyr Met Asp Cys Asp Ile Glu
1345 1350 1355 1360
Val Thr Asp Ser Tyr Lys Thr Asp Leu Gly Arg Leu Asp Ala Gly Arg
1365 1370 1375
Ala Ala Lys Leu Cys Ser Glu Asn Asn Thr Ala Asn Phe Lys Arg Cys
1380 1385 1390
Ser Pro Leu Val Cys Gly Lys Ala Ile Gln Leu Arg Asp Arg Lys Ser
1395 1400 1405
Lys Val Arg Tyr Ser Val Asp Thr Val Val Ser Glu Leu Ile Arg Glu
1410 1415 1420
Tyr Ser Asn Arg Ser Ala Ile Gly Asn Thr Ile Glu Ala Leu Phe Gln
1425 1430 1435 1440
Gly Pro Pro Lys Phe Arg Pro Ile Arg Ile Ser Leu Glu Glu Lys Pro
1445 1450 1455
Ala Pro Asp Ala Ile Ser Asp Leu Leu Ala Ser Val Asp Ser Glu Glu
1460 1465 1470
Val Arg Gln Tyr Cys Arg Asp Gln Gly Trp Ile Ile Pro Glu Ala Pro
1475 1480 1485
Thr Asn Val Glu Arg His Leu Asn Arg Ala Val Leu Val Met Gln Ser
1490 1495 1500
Ile Thr Thr Val Val Ala Val Val Ser Leu Val Tyr Val Ile Tyr Lys
1505 1510 1515 1520
Leu Phe Ala Gly Phe Gln Gly Ala Tyr Ser Gly Ala Pro Lys Gln Val
1525 1530 1535
Leu Lys Lys Pro Ala Leu Arg Thr Ala Thr Val Gln Gly Pro Ser Leu
1540 1545 1550
Asp Phe Ala Leu Ser Leu Leu Arg Arg Asn Ile Arg Gln Val Gln Thr
1555 1560 1565
Asp Gln Gly His Phe Thr Met Leu Gly Val Arg Asp Arg Leu Ala Val
1570 1575 1580
Leu Pro Arg His Ser Gln Pro Gly Lys Thr Ile Trp Ile Glu His Lys
1585 1590 1595 1600
Leu Val Asn Val Leu Asp Ala Val Glu Leu Val Asp Glu Gln Gly Val
1605 1610 1615
Asn Leu Glu Leu Thr Leu Ile Thr Leu Asp Thr Asn Glu Lys Phe Arg
1620 1625 1630
Asp Ile Thr Lys Phe Ile Pro Glu Asn Ile Ser Thr Ala Ser Asp Ala
1635 1640 1645
Thr Leu Val Ile Asn Thr Glu His Met Pro Ser Met Phe Val Pro Val
1650 1655 1660
Gly Asp Val Val Gln Tyr Gly Phe Leu Asn Leu Ser Gly Lys Pro Thr
1665 1670 1675 1680
His Arg Thr Met Met Tyr Asn Phe Pro Thr Lys Ala Gly Gln Cys Gly
1685 1690 1695
Gly Val Val Thr Ser Val Gly Lys Val Val Gly Ile His Ile Gly Gly
1700 1705 1710
Asn Gly Arg Gln Gly Phe Cys Ala Gly Leu Lys Arg Ser Tyr Phe Ala
1715 1720 1725
Ser Glu Gln Gly Glu Ile Gln Trp Val Lys Pro Asn Lys Glu Thr Gly
1730 1735 1740
Arg Leu Asn Ile Asn Gly Pro Thr Arg Thr Lys Leu Glu Pro Ser Val
1745 1750 1755 1760
Phe His Asp Ile Phe Glu Gly Asn Lys Glu Pro Ala Val Leu His Ser
1765 1770 1775
Lys Asp Pro Arg Leu Glu Val Asp Phe Glu Gln Ala Leu Phe Ser Lys
1780 1785 1790
Tyr Val Gly Asn Thr Leu His Glu Pro Asp Glu Tyr Ile Lys Glu Ala
1795 1800 1805
Ala Leu His Tyr Ala Asn Gln Leu Lys Gln Leu Glu Ile Asn Thr Ser
1810 1815 1820
Gln Met Ser Met Glu Glu Ala Cys Tyr Gly Thr Glu Asn Leu Glu Ala
1825 1830 1835 1840
Ile Asp Leu His Thr Ser Ala Gly Tyr Pro Tyr Ser Ala Leu Gly Ile
1845 1850 1855
Lys Lys Arg Asp Ile Leu Asp Pro Thr Thr Arg Asp Val Ser Arg Met
1860 1865 1870
Lys Phe Tyr Met Asp Lys Tyr Gly Leu Asp Leu Pro Tyr Ser Thr Tyr
1875 1880 1885
Val Lys Asp Glu Leu Arg Ser Ile Asp Lys Ile Lys Lys Gly Lys Ser
1890 1895 1900
Arg Leu Ile Glu Ala Ser Ser Leu Asn Asp Ser Val Tyr Leu Arg Met
1905 1910 1915 1920
Ala Phe Gly His Leu Tyr Glu Ala Phe His Ala Asn Pro Gly Thr Ile
1925 1930 1935
Thr Gly Ser Ala Val Gly Cys Asn Pro Asp Thr Phe Trp Ser Lys Leu
1940 1945 1950
Pro Ile Leu Leu Pro Gly Ser Leu Phe Ala Phe Asp Tyr Ser Gly Tyr
1955 1960 1965
Asp Ala Ser Leu Ser Pro Val Trp Phe Arg Ala Leu Glu Leu Val Leu
1970 1975 1980
Arg Glu Ile Gly Tyr Ser Glu Glu Ala Ile Ser Leu Ile Glu Gly Ile
1985 1990 1995 2000
Asn His Thr His His Val Tyr Arg Asn Lys Thr Tyr Cys Val Leu Gly
2005 2010 2015
Gly Met Pro Ser Gly Cys Ser Gly Thr Ser Ile Phe Asn Ser Met Ile
2020 2025 2030
Asn Asn Ile Ile Ile Arg Ala Leu Leu Ile Lys Thr Phe Lys Gly Ile
2035 2040 2045
Asp Leu Asp Glu Leu Asn Met Val Ala Tyr Gly Asp Asp Val Leu Ala
2050 2055 2060
Ser Tyr Pro Phe Pro Ile Asp Cys Leu Glu Leu Ala Lys Thr Gly Lys
2065 2070 2075 2080
Glu Tyr Gly Leu Thr Met Thr Pro Ala Asp Lys Ser Pro Cys Phe Asn
2085 2090 2095
Glu Val Asn Trp Gly Asn Ala Thr Phe Leu Lys Arg Gly Phe Leu Pro
2100 2105 2110
Asp Glu Gln Phe Pro Phe Leu Ile His Pro Thr Met Pro Met Arg Glu
2115 2120 2125
Ile His Glu Ser Ile Arg Trp Thr Lys Asp Ala Arg Asn Thr Gln Asp
2130 2135 2140
His Val Arg Ser Leu Cys Leu Leu Ala Trp His Asn Gly Lys Gln Glu
2145 2150 2155 2160
Tyr Glu Lys Phe Val Ser Thr Ile Arg Ser Val Pro Val Gly Arg Ala
2165 2170 2175
Leu Ala Ile Pro Asn Tyr Glu Asn Leu Arg Arg Asn Trp Leu Glu Leu
2180 2185 2190
Phe
<210> 5
<211> 9982
<212> DNA
<213> Artificial
<400> 5
gctagcggag tgtatactgg cttactatgt tggcactgat gagggtgtca gtgaagtgct 60
tcatgtggca ggagaaaaaa ggctgcaccg gtgcgtcagc agaatatgtg atacaggata 120
tattccgctt cctcgctcac tgactcgcta cgctcggtcg ttcgactgcg gcgagcggaa 180
atggcttacg aacggggcgg agatttcctg gaagatgcca ggaagatact taacagggaa 240
gtgagagggc cgcggcaaag ccgtttttcc ataggctccg cccccctgac aagcatcacg 300
aaatctgacg ctcaaatcag tggtggcgaa acccgacagg actataaaga taccaggcgt 360
ttcccctggc ggctccctcg tgcgctctcc tgttcctgcc tttcggttta ccggtgtcat 420
tccgctgtta tggccgcgtt tgtctcattc cacgcctgac actcagttcc gggtaggcag 480
ttcgctccaa gctggactgt atgcacgaac cccccgttca gtccgaccgc tgcgccttat 540
ccggtaacta tcgtcttgag tccaacccgg aaagacatgc aaaagcacca ctggcagcag 600
ccactggtaa ttgatttaga ggagttagtc ttgaagtcat gcgccggtta aggctaaact 660
gaaaggacaa gttttggtga ctgcgctcct ccaagccagt tacctcggtt caaagagttg 720
gtagctcaga gaaccttcga aaaaccgccc tgcaaggcgg ttttttcgtt ttcagagcaa 780
gagattacgc gcagaccaaa acgatctcaa gaagatcatc ttattaaggg gtctgacgct 840
cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 900
acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa 960
acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta 1020
tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc 1080
ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat 1140
ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta 1200
tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt 1260
aatagtttgc gcaacgttgt tgccattgct gcaggcatcg tggtgtcacg ctcgtcgttt 1320
ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg 1380
ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc 1440
gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc 1500
gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg 1560
cggcgaccga gttgctcttg cccggcgtca acacgggata ataccgcgcc acatagcaga 1620
actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta 1680
ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct 1740
tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag 1800
ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca atattattga 1860
agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat 1920
aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt gtcgacgcgg 1980
ccgctaatac gactcactat aggttaaaac agcctgtggg ttgcacccac tcacagggcc 2040
tactgggcgc aagcactctg gtacctcggt acctttgtgc gcctgtttta cacccccccc 2100
ccaatgaaac ttagaagcaa taaaccacga tcaatagcag gcataacgct ccagttatgt 2160
cttgatcaag cacttctgtt tccccggact gagtatcaat agactgctcg cgcggttgaa 2220
ggagaaaacg ttcgttatcc ggctaactac ttcggaaaac ctagtaacac catgaaagtt 2280
gcggagagct tcgttcagca ctcccccagt gtagatcagg tcgatgagtc accgcgttcc 2340
ccacgggcga ccgtggcggt ggctgcgttg gcggcctgcc catggggtaa cccatggggc 2400
gctctaatac ggacatggtg tgaagagtct actgagctag ttggtagtcc tccggcccct 2460
gaatgcggct aatcccaact gcggagcaca cgcccacaag ccagcgggta gtgtgtcgta 2520
acgggtaact ctgcagcgga accgactact ttgggtgtcc gtgtttcctt ttatctttat 2580
attggctgct tatggtgaca attaaagaat tgttaccata tagctattgg attagccatc 2640
cggtgtgcaa cagagcaatt atttacctat ttattggttt tgtaccatta acctcgaatt 2700
ctgtgaccac ccttaattat atcttgaccc ttaacacagc taaactctag aatggtcttc 2760
acactcgaag atttcgttgg ggactggcga cagacagccg gctacaacct ggaccaagtc 2820
cttgaacagg gaggtgtgtc cagtttgttt cagaatctcg gggtgtccgt aactccgatc 2880
caaaggattg tcctgagcgg tgaaaatggg ctgaagatcg acatccatgt catcatcccg 2940
tatgaaggtc tgagcggcga ccaaatgggc cagatcgaaa aaatttttaa ggtggtgtac 3000
cctgtggatg atcatcactt taaggtgatc ctgcactatg gcacactggt aatcgacggg 3060
gttacgccga acatgatcga ctatttcgga cggccgtatg aaggcatcgc cgtgttcgac 3120
ggcaaaaaga tcactgtaac agggaccctg tggaacggca acaaaattat cgacgagcgc 3180
ctgatcaacc ccgacggctc cctgctgttc cgagtaacca tcaacggagt gaccggctgg 3240
cggctgtgcg aacgcattct ggcgatgcat gcgatcacca ctcttggttc gcaagtgtct 3300
acacagcgct ccggttctta cgaaaactca aactcagcca ctgagggttc taccataaac 3360
tacaccacca ttaattacta caaagactcc tatgctgcca cagcaggcaa acagagtctc 3420
aagcaggatc cagacaagtt tgcaaatcct gttaaagaca tattcaccga aatggcagcg 3480
ccactgaagt ccccatccgc tgaggcatgt ggatacagtg atcgagtggc gcaattaact 3540
attggcaact ccaccatcac gacgcaagaa gcggctaaca tcatagtcgg ctatggtgag 3600
tggccttcct actgctcaga ttctgacgct acagcagtgg ataaaccaac gcgcccggat 3660
gtttcagtga acaggtttta cacattggac actaaattgt gggagaaatc gtccaaggga 3720
tggtactgga agttcccgga tgtgttaact gaaactgggg tttttgggca aaatgcacaa 3780
ttccactacc tctaccgatc agggttctgc atccacgtgc agtgcaatgc cagtaaattc 3840
caccaaggag cactcctagt cgctgtccta ccagagtatg tcattgggac agtggcaggc 3900
ggtacaggga cggaagacac ccaccccccc tacaagcaga cccaacccgg cgccgatggt 3960
ttcgagttgc aacacccgta cgtgcttgat gctggcatcc caatatcaca gttaacagtg 4020
tgcccacacc agtggattaa tttgaggacc aacaattgtg ctacaataat agtgccatac 4080
attaacgcac tgccttttga ttctgccttg aaccattgca actttggcct gttagttgtg 4140
cctattagcc cactagacta cgaccaagga gcaacgccag taatccctat aactatcaca 4200
ttggccccaa tgtgctctga attcgcaggt cttaggcagg cagtcacgca agggttcccc 4260
accgagctaa aacctggcac aaatcaattt ttaaccaccg atgatggcgt ctcagcacct 4320
attctaccaa acttccaccc caccccgtgt atccacatac ctggtgaagt taggaacttg 4380
ctagagttat gccaggtgga gaccattctg gaggttaaca atgtgcccac gaatgccact 4440
agcttaatgg agagactgcg cttcccggtc tcagcacaag cagggaaagg tgaactgtgt 4500
gcggtgttta gagccgatcc tgggcgaaat ggaccatggc aatccacctt actgggccag 4560
ttgtgcgggt actacaccca atggtcaggg tcattggaag tcaccttcat gtttactgga 4620
tccttcatgg ctaccggcaa gatgctcata gcctatacac cgccaggggg tcctctgccc 4680
aaggaccggg cgaccgccat gttgggcacg cacgtcatct gggattttgg gctgcaatcg 4740
tctgttaccc ttgtaatacc atggatcagt aacactcatt atagagcaca tgcccgagat 4800
ggagtgtttg actattacac tacagggtta gtcagtatat ggtaccagac aaattacgtg 4860
gttccaatcg gtgcgcccaa cacagcctat ataatagcac tagcggcagc ccaaaagaac 4920
ttcactatga aattgtgcaa ggatgctagt gatatcctgc agacgggcac catccaggga 4980
gatagggtgg cagatgtaat tgaaagttcc ataggagata gcgtgagcag agccctcact 5040
cacgctctac cagcacccac aggccaaaac acacaggtga gcagtcatcg actggataca 5100
ggcaaggttc cagcactcca agctgctgaa attggggcat catcaaatgc tagtgacgag 5160
agcatgattg aaacacgttg tgttcttaac tcgcatagta cagctgagac cactcttgat 5220
agtttcttca gtagggcagg attagttgga gagatagatc tccctcttga gggcacaact 5280
aacccaaatg gttatgccaa ctgggacata gatataacag gttacgcgca aatgcgtaga 5340
aaggtagagc tattcaccta catgcgtttt gatgcagagt tcacttttgt tgcgtgcaca 5400
cccaccgggg aggttgtccc acaattgctc caatatatgt ttgtgccacc tggagcccct 5460
aagccagatt ctagggaatc ccttgcatgg caaaccgcca ccaacccctc agtttttgtc 5520
aagctgtcag accctccggc gcaggtttca gtgccattca tgtcacctgc gagtgcttat 5580
caatggtttt atgacggata tcccacattc ggagaacaca aacaggagaa agaccttgaa 5640
tacggggcat gtcctaataa catgatgggt acattctcag tgcggactgt ggggacctcc 5700
aagtccaagt accctttagt ggttaggatt tacatgagaa tgaagcacgt cagggcgtgg 5760
atacctcgcc cgatgcgcaa ccagaactac ctgttcaaag ccaacccaaa ttatgctggc 5820
aactctatta agccaactgg tgccagtcgc acagcgatca ccactcttgg gaaatttgga 5880
caacagtctg gggctattta tgtgggcaac tttagagtgg tcaaccgaca tcttgccacc 5940
cataatgatt gggcaaatct tgtttgggaa gacagctctc gcgacttgct cgtgtcatcc 6000
accactgccc aaggttgtga cacgattgcc cgttgcgatt gccagacagg ggtgtactac 6060
tgtaactcga tgagaaaaca ctacccagtc agtttttcaa aacccagcct gatctatgta 6120
gaggctagcg agtattaccc agccaggtac caatcacatc tcatgctcgc acagggtcac 6180
tcggaacctg gtgattgcgg tggtatcctt aggtgccaac atggcgtcat cggcatagtg 6240
tctactggtg gcaatgggct cgttggcttt gcagacgtca gagacctctt gtggttagat 6300
gaagaagcta tggaacaggg cgtgtccgac tacattaagg gtctcggaga tgcttttgga 6360
acaggcttca ctgacgcagt ctcaagggag gttgaagctc tcaagaacta tcttataggg 6420
tctgaaggag cagttgagaa aattttgaaa aatcttatta aactaatctc tgcactggtg 6480
attgtgatca gaagtgatta cgacatggtt accctcactg caaccttagc gctgataggt 6540
tgtcatggca gtccttgggc ttggattaaa gccaaaacag cctccatctt aggtatccct 6600
atcgcccaaa agcagagcgc ttcctggctc aagaagttca atgacatggc caacgccgct 6660
aaggggttag agtgggtttc caacaagatc agcaaattta ttgattggct taaggagaaa 6720
atagtaccag cagccaggga gaaggttgaa ttcctaaata acttgaaaca gctgccactg 6780
ctagagaatc agatctcgaa cttggaacaa tctgctgctt cacaagagga ccttgaagtc 6840
atgtttggga atgtgtcgta cctagctcac ttctgtcgca agtttcaacc gctatacgcc 6900
acggaagcta aaagagtcta tgccctggag aagagaatga ataactatat gcagttcaag 6960
agcaaacacc gaattgaacc tgtatgtctc attattaggg gctcaccagg caccgggaag 7020
tctctagcca ctggtattat tgctcgagca atcgctgata agtaccactc cagcgtgtac 7080
tcgctcccac cagacccgga tcattttgac ggttacaagc aacaggtggt tacagtgatg 7140
gatgatttgt gtcaaaaccc cgatggtaag gatatgtcct tattctgtca aatggtatcc 7200
accgtagatt tcattccacc aatggcttct ctcgaggaga agggagtttc cttcacctct 7260
aagtttgtca tcgcatccac taatgccagt aatatcatag taccaacagt gtctgattct 7320
gacgctattc gccgcaggtt ctacatggac tgtgacattg aagtgacaga ctcgtacaaa 7380
acagatctag gtagactgga tgcagggcga gccgctaaac tgtgttctga aaataacact 7440
gcaaatttca aacgttgcag cccattagtg tgtgggaaag ccatccaact tagagataga 7500
aagtctaaag tcagatacag tgtggatacg gtggtttcag aacttattag ggaatacagc 7560
aataggtccg ccattggtaa cacaatcgag gctcttttcc aaggtccacc caagttcagg 7620
ccaattagga ttagccttga agaaaaacca gccccagacg ctattagcga tctccttgct 7680
agtgtagata gtgaagaagt gcgccagtac tgcagggatc aaggctggat tattcctgaa 7740
gctcccacca atgtggagcg gcaccttaat agagcggtgc tcgtcatgca atccatcacc 7800
acagtagtgg cggttgtttc gttggtgtac gtcatctaca agctctttgc agggtttcag 7860
ggtgcatatt ctggtgctcc taagcaagtg cttaagaaac ctgctcttcg cacagcaaca 7920
gtgcagggtc cgagccttga ctttgctctc tccctactga gaaggaacat caggcaggtc 7980
caaacagacc aagggcattt caccatgttg ggtgttaggg atcgcttagc agtcctccca 8040
cgccactcac aacctggcaa aaccatttgg attgagcaca aactcgtgaa cgtccttgat 8100
gcagttgaac tggtggatga gcaaggagtc aacctggaat taaccctcat cactcttgac 8160
accaacgaga agtttaggga tatcaccaaa ttcatcccag aaaatatcag cactgctagc 8220
gatgccaccc tagtgatcaa cacggagcac atgccgtcaa tgtttgtccc ggtgggtgac 8280
gttgtgcagt atggcttttt gaatctcagt ggcaagccta cccatcgcac catgatgtac 8340
aattttccta ctaaagcagg acagtgtgga ggagtggtga catctgttgg gaaggttgtc 8400
ggtattcaca ttggtggcaa tggcagacaa ggtttttgcg caggcctcaa aaggagttac 8460
tttgctagtg aacaaggaga gatccagtgg gttaagccca ataaagaaac tggaagactc 8520
aacatcaatg gaccaacccg caccaagtta gaacctagtg tattccatga catcttcgag 8580
ggaaataagg aaccagctgt cttgcacagt aaagaccccc gacttgaggt agattttgaa 8640
caggccctgt tctctaagta tgtgggaaac acactacatg agcctgacga gtacatcaaa 8700
gaggcagctc tacattatgc aaaccaatta aagcaactag aaatcaatac ctctcaaatg 8760
agcatggagg aggcctgcta tggtactgag aatcttgagg ctattgatct tcacactagt 8820
gcaggttacc cctatagtgc cctagggata aagaaaagag acatcttaga ccctaccacc 8880
agggacgtga gtagaatgaa gttctacatg gacaagtatg gtcttgatct tccctactcc 8940
acttatgtca aggacgagct acgctcgatt gataaaatca agaaagggaa gtcccgcctg 9000
atcgaggcca gtagtctaaa tgattcagtg tacctcagaa tggctttcgg gcatttgtat 9060
gaggctttcc acgcaaatcc tgggacgata actggatcgg ccgtggggtg taaccctgac 9120
acattctgga gcaagctgcc aattttgctc cctggttcac tctttgcctt tgactactca 9180
ggctatgatg ccagccttag ccctgtctgg ttcagagcat tagaattggt tcttagggag 9240
atagggtata gtgaagaggc aatctcactc attgagggaa tcaaccacac acatcatgtg 9300
tatcgtaata agacctattg cgtgcttggt gggatgccct caggctgttc aggaacatcc 9360
atcttcaact caatgatcaa caacattatt atcagagcac tgctcataaa aacatttaag 9420
ggcattgatt tggatgaact caacatggtc gcttatggag acgatgtgct cgctagctat 9480
cccttcccaa ttgattgctt ggaactagca aagactggta aggagtatgg tctgaccatg 9540
acccctgctg ataaatctcc ttgctttaat gaggtcaatt ggggtaatgc gaccttcctc 9600
aaaaggggct ttttgcccga tgaacagttt ccatttttga ttcaccctac tatgccaatg 9660
agggagatcc atgagtccat tcgatggacc aaggacgcac ggaacactca agatcatgtg 9720
cggtccttgt gcctcctagc atggcataat ggtaagcaag aatacgagaa gtttgtgagc 9780
acaattaggt ctgtcccagt agggagagcg ttggctattc caaattatga aaatcttaga 9840
cgaaattggc tcgagttatt ttagaggtta tacacacctc aaccccacca gaaatctggt 9900
cgtgaatgtg actggtgggg gtaaatttgt tataaccaga atagcaaaaa aaaaaaaaaa 9960
aaaaaaaaaa aaaaaaagct ta 9982
<210> 6
<211> 10187
<212> DNA
<213> Artificial
<400> 6
gctagcggag tgtatactgg cttactatgt tggcactgat gagggtgtca gtgaagtgct 60
tcatgtggca ggagaaaaaa ggctgcaccg gtgcgtcagc agaatatgtg atacaggata 120
tattccgctt cctcgctcac tgactcgcta cgctcggtcg ttcgactgcg gcgagcggaa 180
atggcttacg aacggggcgg agatttcctg gaagatgcca ggaagatact taacagggaa 240
gtgagagggc cgcggcaaag ccgtttttcc ataggctccg cccccctgac aagcatcacg 300
aaatctgacg ctcaaatcag tggtggcgaa acccgacagg actataaaga taccaggcgt 360
ttcccctggc ggctccctcg tgcgctctcc tgttcctgcc tttcggttta ccggtgtcat 420
tccgctgtta tggccgcgtt tgtctcattc cacgcctgac actcagttcc gggtaggcag 480
ttcgctccaa gctggactgt atgcacgaac cccccgttca gtccgaccgc tgcgccttat 540
ccggtaacta tcgtcttgag tccaacccgg aaagacatgc aaaagcacca ctggcagcag 600
ccactggtaa ttgatttaga ggagttagtc ttgaagtcat gcgccggtta aggctaaact 660
gaaaggacaa gttttggtga ctgcgctcct ccaagccagt tacctcggtt caaagagttg 720
gtagctcaga gaaccttcga aaaaccgccc tgcaaggcgg ttttttcgtt ttcagagcaa 780
gagattacgc gcagaccaaa acgatctcaa gaagatcatc ttattaaggg gtctgacgct 840
cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 900
acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa 960
acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta 1020
tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc 1080
ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat 1140
ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta 1200
tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt 1260
aatagtttgc gcaacgttgt tgccattgct gcaggcatcg tggtgtcacg ctcgtcgttt 1320
ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg 1380
ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc 1440
gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc 1500
gtaagatgct tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg 1560
cggcgaccga gttgctcttg cccggcgtca acacgggata ataccgcgcc acatagcaga 1620
actttaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta 1680
ccgctgttga gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct 1740
tttactttca ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag 1800
ggaataaggg cgacacggaa atgttgaata ctcatactct tcctttttca atattattga 1860
agcatttatc agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat 1920
aaacaaatag gggttccgcg cacatttccc cgaaaagtgc cacctgacgt gtcgacgcgg 1980
ccgctaatac gactcactat aggttaaaac agcctgtggg ttgcacccac tcacagggcc 2040
tactgggcgc aagcactctg gtacctcggt acctttgtgc gcctgtttta cacccccccc 2100
ccaatgaaac ttagaagcaa taaaccacga tcaatagcag gcataacgct ccagttatgt 2160
cttgatcaag cacttctgtt tccccggact gagtatcaat agactgctcg cgcggttgaa 2220
ggagaaaacg ttcgttatcc ggctaactac ttcggaaaac ctagtaacac catgaaagtt 2280
gcggagagct tcgttcagca ctcccccagt gtagatcagg tcgatgagtc accgcgttcc 2340
ccacgggcga ccgtggcggt ggctgcgttg gcggcctgcc catggggtaa cccatggggc 2400
gctctaatac ggacatggtg tgaagagtct actgagctag ttggtagtcc tccggcccct 2460
gaatgcggct aatcccaact gcggagcaca cgcccacaag ccagcgggta gtgtgtcgta 2520
acgggtaact ctgcagcgga accgactact ttgggtgtcc gtgtttcctt ttatctttat 2580
attggctgct tatggtgaca attaaagaat tgttaccata tagctattgg attagccatc 2640
cggtgtgcaa cagagcaatt atttacctat ttattggttt tgtaccatta acctcgaatt 2700
ctgtgaccac ccttaattat atcttgaccc ttaacacagc taaaccatat gatggtgagc 2760
aagggcgagg agctgttcac cggggtggtg cccatcctgg tcgagctgga cggcgacgta 2820
aacggccaca agttcagcgt gtccggcgag ggcgagggcg atgccaccta cggcaagctg 2880
accctgaagt tcatctgcac caccggcaag ctgcccgtgc cctggcccac cctcgtgacc 2940
accctgacct acggcgtgca gtgcttcagc cgctaccccg accacatgaa gcagcacgac 3000
ttcttcaagt ccgccatgcc cgaaggctac gtccaggagc gcaccatctt cttcaaggac 3060
gacggcaact acaagacccg cgccgaggtg aagttcgagg gcgacaccct ggtgaaccgc 3120
atcgagctga agggcatcga cttcaaggag gacggcaaca tcctggggca caagctggag 3180
tacaactaca acagccacaa cgtctatatc atggccgaca agcagaagaa cggcatcaag 3240
gtgaacttca agatccgcca caacatcgag gacggcagcg tgcagctcgc cgaccactac 3300
cagcagaaca cccccatcgg cgacggcccc gtgctgctgc ccgacaacca ctacctgagc 3360
acccagtccg ccctgagcaa agaccccaac gagaagcgcg atcacatggt cctgctggag 3420
ttcgtgaccg ccgccgggat cactctcggc atggacgagc tgtacaagat gcatgcgatc 3480
accactcttg gttcgcaagt gtctacacag cgctccggtt cttacgaaaa ctcaaactca 3540
gccactgagg gttctaccat aaactacacc accattaatt actacaaaga ctcctatgct 3600
gccacagcag gcaaacagag tctcaagcag gatccagaca agtttgcaaa tcctgttaaa 3660
gacatattca ccgaaatggc agcgccactg aagtccccat ccgctgaggc atgtggatac 3720
agtgatcgag tggcgcaatt aactattggc aactccacca tcacgacgca agaagcggct 3780
aacatcatag tcggctatgg tgagtggcct tcctactgct cagattctga cgctacagca 3840
gtggataaac caacgcgccc ggatgtttca gtgaacaggt tttacacatt ggacactaaa 3900
ttgtgggaga aatcgtccaa gggatggtac tggaagttcc cggatgtgtt aactgaaact 3960
ggggtttttg ggcaaaatgc acaattccac tacctctacc gatcagggtt ctgcatccac 4020
gtgcagtgca atgccagtaa attccaccaa ggagcactcc tagtcgctgt cctaccagag 4080
tatgtcattg ggacagtggc aggcggtaca gggacggaag acacccaccc cccctacaag 4140
cagacccaac ccggcgccga tggtttcgag ttgcaacacc cgtacgtgct tgatgctggc 4200
atcccaatat cacagttaac agtgtgccca caccagtgga ttaatttgag gaccaacaat 4260
tgtgctacaa taatagtgcc atacattaac gcactgcctt ttgattctgc cttgaaccat 4320
tgcaactttg gcctgttagt tgtgcctatt agcccactag actacgacca aggagcaacg 4380
ccagtaatcc ctataactat cacattggcc ccaatgtgct ctgaattcgc aggtcttagg 4440
caggcagtca cgcaagggtt ccccaccgag ctaaaacctg gcacaaatca atttttaacc 4500
accgatgatg gcgtctcagc acctattcta ccaaacttcc accccacccc gtgtatccac 4560
atacctggtg aagttaggaa cttgctagag ttatgccagg tggagaccat tctggaggtt 4620
aacaatgtgc ccacgaatgc cactagctta atggagagac tgcgcttccc ggtctcagca 4680
caagcaggga aaggtgaact gtgtgcggtg tttagagccg atcctgggcg aaatggacca 4740
tggcaatcca ccttactggg ccagttgtgc gggtactaca cccaatggtc agggtcattg 4800
gaagtcacct tcatgtttac tggatccttc atggctaccg gcaagatgct catagcctat 4860
acaccgccag ggggtcctct gcccaaggac cgggcgaccg ccatgttggg cacgcacgtc 4920
atctgggatt ttgggctgca atcgtctgtt acccttgtaa taccatggat cagtaacact 4980
cattatagag cacatgcccg agatggagtg tttgactatt acactacagg gttagtcagt 5040
atatggtacc agacaaatta cgtggttcca atcggtgcgc ccaacacagc ctatataata 5100
gcactagcgg cagcccaaaa gaacttcact atgaaattgt gcaaggatgc tagtgatatc 5160
ctgcagacgg gcaccatcca gggagatagg gtggcagatg taattgaaag ttccatagga 5220
gatagcgtga gcagagccct cactcacgct ctaccagcac ccacaggcca aaacacacag 5280
gtgagcagtc atcgactgga tacaggcaag gttccagcac tccaagctgc tgaaattggg 5340
gcatcatcaa atgctagtga cgagagcatg attgaaacac gttgtgttct taactcgcat 5400
agtacagctg agaccactct tgatagtttc ttcagtaggg caggattagt tggagagata 5460
gatctccctc ttgagggcac aactaaccca aatggttatg ccaactggga catagatata 5520
acaggttacg cgcaaatgcg tagaaaggta gagctattca cctacatgcg ttttgatgca 5580
gagttcactt ttgttgcgtg cacacccacc ggggaggttg tcccacaatt gctccaatat 5640
atgtttgtgc cacctggagc ccctaagcca gattctaggg aatcccttgc atggcaaacc 5700
gccaccaacc cctcagtttt tgtcaagctg tcagaccctc cggcgcaggt ttcagtgcca 5760
ttcatgtcac ctgcgagtgc ttatcaatgg ttttatgacg gatatcccac attcggagaa 5820
cacaaacagg agaaagacct tgaatacggg gcatgtccta ataacatgat gggtacattc 5880
tcagtgcgga ctgtggggac ctccaagtcc aagtaccctt tagtggttag gatttacatg 5940
agaatgaagc acgtcagggc gtggatacct cgcccgatgc gcaaccagaa ctacctgttc 6000
aaagccaacc caaattatgc tggcaactct attaagccaa ctggtgccag tcgcacagcg 6060
atcaccactc ttgggaaatt tggacaacag tctggggcta tttatgtggg caactttaga 6120
gtggtcaacc gacatcttgc cacccataat gattgggcaa atcttgtttg ggaagacagc 6180
tctcgcgact tgctcgtgtc atccaccact gcccaaggtt gtgacacgat tgcccgttgc 6240
gattgccaga caggggtgta ctactgtaac tcgatgagaa aacactaccc agtcagtttt 6300
tcaaaaccca gcctgatcta tgtagaggct agcgagtatt acccagccag gtaccaatca 6360
catctcatgc tcgcacaggg tcactcggaa cctggtgatt gcggtggtat ccttaggtgc 6420
caacatggcg tcatcggcat agtgtctact ggtggcaatg ggctcgttgg ctttgcagac 6480
gtcagagacc tcttgtggtt agatgaagaa gctatggaac agggcgtgtc cgactacatt 6540
aagggtctcg gagatgcttt tggaacaggc ttcactgacg cagtctcaag ggaggttgaa 6600
gctctcaaga actatcttat agggtctgaa ggagcagttg agaaaatttt gaaaaatctt 6660
attaaactaa tctctgcact ggtgattgtg atcagaagtg attacgacat ggttaccctc 6720
actgcaacct tagcgctgat aggttgtcat ggcagtcctt gggcttggat taaagccaaa 6780
acagcctcca tcttaggtat ccctatcgcc caaaagcaga gcgcttcctg gctcaagaag 6840
ttcaatgaca tggccaacgc cgctaagggg ttagagtggg tttccaacaa gatcagcaaa 6900
tttattgatt ggcttaagga gaaaatagta ccagcagcca gggagaaggt tgaattccta 6960
aataacttga aacagctgcc actgctagag aatcagatct cgaacttgga acaatctgct 7020
gcttcacaag aggaccttga agtcatgttt gggaatgtgt cgtacctagc tcacttctgt 7080
cgcaagtttc aaccgctata cgccacggaa gctaaaagag tctatgccct ggagaagaga 7140
atgaataact atatgcagtt caagagcaaa caccgaattg aacctgtatg tctcattatt 7200
aggggctcac caggcaccgg gaagtctcta gccactggta ttattgctcg agcaatcgct 7260
gataagtacc actccagcgt gtactcgctc ccaccagacc cggatcattt tgacggttac 7320
aagcaacagg tggttacagt gatggatgat ttgtgtcaaa accccgatgg taaggatatg 7380
tccttattct gtcaaatggt atccaccgta gatttcattc caccaatggc ttctctcgag 7440
gagaagggag tttccttcac ctctaagttt gtcatcgcat ccactaatgc cagtaatatc 7500
atagtaccaa cagtgtctga ttctgacgct attcgccgca ggttctacat ggactgtgac 7560
attgaagtga cagactcgta caaaacagat ctaggtagac tggatgcagg gcgagccgct 7620
aaactgtgtt ctgaaaataa cactgcaaat ttcaaacgtt gcagcccatt agtgtgtggg 7680
aaagccatcc aacttagaga tagaaagtct aaagtcagat acagtgtgga tacggtggtt 7740
tcagaactta ttagggaata cagcaatagg tccgccattg gtaacacaat cgaggctctt 7800
ttccaaggtc cacccaagtt caggccaatt aggattagcc ttgaagaaaa accagcccca 7860
gacgctatta gcgatctcct tgctagtgta gatagtgaag aagtgcgcca gtactgcagg 7920
gatcaaggct ggattattcc tgaagctccc accaatgtgg agcggcacct taatagagcg 7980
gtgctcgtca tgcaatccat caccacagta gtggcggttg tttcgttggt gtacgtcatc 8040
tacaagctct ttgcagggtt tcagggtgca tattctggtg ctcctaagca agtgcttaag 8100
aaacctgctc ttcgcacagc aacagtgcag ggtccgagcc ttgactttgc tctctcccta 8160
ctgagaagga acatcaggca ggtccaaaca gaccaagggc atttcaccat gttgggtgtt 8220
agggatcgct tagcagtcct cccacgccac tcacaacctg gcaaaaccat ttggattgag 8280
cacaaactcg tgaacgtcct tgatgcagtt gaactggtgg atgagcaagg agtcaacctg 8340
gaattaaccc tcatcactct tgacaccaac gagaagttta gggatatcac caaattcatc 8400
ccagaaaata tcagcactgc tagcgatgcc accctagtga tcaacacgga gcacatgccg 8460
tcaatgtttg tcccggtggg tgacgttgtg cagtatggct ttttgaatct cagtggcaag 8520
cctacccatc gcaccatgat gtacaatttt cctactaaag caggacagtg tggaggagtg 8580
gtgacatctg ttgggaaggt tgtcggtatt cacattggtg gcaatggcag acaaggtttt 8640
tgcgcaggcc tcaaaaggag ttactttgct agtgaacaag gagagatcca gtgggttaag 8700
cccaataaag aaactggaag actcaacatc aatggaccaa cccgcaccaa gttagaacct 8760
agtgtattcc atgacatctt cgagggaaat aaggaaccag ctgtcttgca cagtaaagac 8820
ccccgacttg aggtagattt tgaacaggcc ctgttctcta agtatgtggg aaacacacta 8880
catgagcctg acgagtacat caaagaggca gctctacatt atgcaaacca attaaagcaa 8940
ctagaaatca atacctctca aatgagcatg gaggaggcct gctatggtac tgagaatctt 9000
gaggctattg atcttcacac tagtgcaggt tacccctata gtgccctagg gataaagaaa 9060
agagacatct tagaccctac caccagggac gtgagtagaa tgaagttcta catggacaag 9120
tatggtcttg atcttcccta ctccacttat gtcaaggacg agctacgctc gattgataaa 9180
atcaagaaag ggaagtcccg cctgatcgag gccagtagtc taaatgattc agtgtacctc 9240
agaatggctt tcgggcattt gtatgaggct ttccacgcaa atcctgggac gataactgga 9300
tcggccgtgg ggtgtaaccc tgacacattc tggagcaagc tgccaatttt gctccctggt 9360
tcactctttg cctttgacta ctcaggctat gatgccagcc ttagccctgt ctggttcaga 9420
gcattagaat tggttcttag ggagataggg tatagtgaag aggcaatctc actcattgag 9480
ggaatcaacc acacacatca tgtgtatcgt aataagacct attgcgtgct tggtgggatg 9540
ccctcaggct gttcaggaac atccatcttc aactcaatga tcaacaacat tattatcaga 9600
gcactgctca taaaaacatt taagggcatt gatttggatg aactcaacat ggtcgcttat 9660
ggagacgatg tgctcgctag ctatcccttc ccaattgatt gcttggaact agcaaagact 9720
ggtaaggagt atggtctgac catgacccct gctgataaat ctccttgctt taatgaggtc 9780
aattggggta atgcgacctt cctcaaaagg ggctttttgc ccgatgaaca gtttccattt 9840
ttgattcacc ctactatgcc aatgagggag atccatgagt ccattcgatg gaccaaggac 9900
gcacggaaca ctcaagatca tgtgcggtcc ttgtgcctcc tagcatggca taatggtaag 9960
caagaatacg agaagtttgt gagcacaatt aggtctgtcc cagtagggag agcgttggct 10020
attccaaatt atgaaaatct tagacgaaat tggctcgagt tattttagag gttatacaca 10080
cctcaacccc accagaaatc tggtcgtgaa tgtgactggt gggggtaaat ttgttataac 10140
cagaatagca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa agcttat 10187
<210> 7
<211> 19
<212> DNA
<213> Artificial
<400> 7
taatacgact cactatagg 19
<210> 8
<211> 30
<212> DNA
<213> Artificial
<400> 8
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 30
<210> 9
<211> 39
<212> DNA
<213> Artificial
<400> 9
gctagcgctt tttttttttt tttttttttt ttttttttt 39
<210> 10
<211> 55
<212> DNA
<213> Artificial
<400> 10
gacgcggccg ctaatacgac tcactatagg ttaaaacagc ctgtgggttg caccc 55
<210> 11
<211> 22
<212> DNA
<213> Artificial
<400> 11
gcactgcacg tggatgcaga ac 22
<210> 12
<211> 33
<212> DNA
<213> Artificial
<400> 12
gacgcggccg cgttctgcat ccacgtgcag tgc 33
<210> 13
<211> 22
<212> DNA
<213> Artificial
<400> 13
aagtcgcgag agctgtcttc cc 22
<210> 14
<211> 33
<212> DNA
<213> Artificial
<400> 14
gacgcggccg cgggaagaca gctctcgcga ctt 33
<210> 15
<211> 28
<212> DNA
<213> Artificial
<400> 15
aattgtacat catggtgcga tgggtagg 28
<210> 16
<211> 39
<212> DNA
<213> Artificial
<400> 16
gacgcggccg ccctacccat cgcaccatga tgtacaatt 39
<210> 17
<211> 73
<212> DNA
<213> Artificial
<400> 17
gctagcgctt tttttttttt tttttttttt tttttttttg ctattctggt tataacaaat 60
ttacccccac cag 73
<210> 18
<211> 18
<212> DNA
<213> Artificial
<400> 18
cctgacgtgt cgacgcgg 18
<210> 19
<211> 49
<212> DNA
<213> Artificial
<400> 19
cctcgccctt gctcaccatc atatggttta gctgtgttaa gggtcaaga 49
<210> 20
<211> 49
<212> DNA
<213> Artificial
<400> 20
tcttgaccct taacacagct aaaccatatg atggtgagca agggcgagg 49
<210> 21
<211> 66
<212> DNA
<213> Artificial
<400> 21
cgctgtgtag acacttgcga accaagagtg gtgatcgcat gcatcttgta cagctcgtcc 60
atgccg 66
<210> 22
<211> 66
<212> DNA
<213> Artificial
<400> 22
cggcatggac gagctgtaca agatgcatgc gatcaccact cttggttcgc aagtgtctac 60
acagcg 66
<210> 23
<211> 21
<212> DNA
<213> Artificial
<400> 23
ctgcacgtgg atgcagaacc c 21
<210> 24
<211> 21
<212> DNA
<213> Artificial
<400> 24
ctgcacgtgg atgcagaacc c 21
<210> 25
<211> 53
<212> DNA
<213> Artificial
<400> 25
gaaatcttcg agtgtgaaga ccattctaga gtttagctgt gttaagggtc aag 53
<210> 26
<211> 53
<212> DNA
<213> Artificial
<400> 26
cttgaccctt aacacagcta aactctagaa tggtcttcac actcgaagat ttc 53
<210> 27
<211> 27
<212> DNA
<213> Artificial
<400> 27
cgcatgcatc gccagaatgc gttcgca 27
Claims (20)
1. A cDNA, characterized in that it comprises the nucleic acid sequence of the EV71 strain and the nucleic acid sequence of a low copy plasmid backbone;
the nucleic acid sequence of the strain EV71 covers the 5 'to 3' forward polarity sequence of the EV71 virus, including the 5 'and 3' non-coding regions of the virus and one open reading frame encoding viral proteins.
2. The cDNA of claim 1, further comprising a sequence of a reporter gene, luciferase or a fluorescent protein, inserted in the nucleic acid sequence of the EV71 strain.
3. The cDNA according to claim 1, wherein the amino acid sequence of the open reading frame of the viral protein is as shown in SEQ ID NO 4.
4. The cDNA according to claim 1, wherein the coding sequence of the low copy plasmid backbone is as shown in SEQ ID NO 3.
5. The cDNA of claim 1, wherein the EV71 strain has a nucleic acid sequence shown in SEQ ID NO 2.
6. The cDNA according to claim 1, characterized in that its sequence is as shown in SEQ ID NO 1.
7. The expression product of the cDNA according to any one of claims 1 to 6.
8. A recombinant virus comprising a cDNA according to any one of claims 1 to 6.
9. A subgenomic replicon having a cDNA sequence according to any one of claims 1 to 6.
10. A double-stranded DNA capable of producing the cDNA according to any one of claims 1 to 6.
11. A plasmid containing the double-stranded DNA according to claim 10 or a derivative thereof.
12. The plasmid of claim 11, which is capable of transcribing to produce the full-length infectious RNA of the EV71 strain, or a mutant thereof.
13. A vaccine prepared from the plasmid of claim 11 or 12.
14. A viral vector, characterized in that it is prepared according to the plasmid of claim 11 or 12.
15. A viral particle produced from a cDNA clone according to any one of claims 1 to 6 or prepared from a plasmid according to claim 11 or 12.
16. A method for detecting EV71 virus, which comprises using the virus particle according to claim 15.
17. A method for producing an EV71 virus antibody, which comprises using the cDNA according to any one of claims 1 to 6 or the viral particle according to claim 15.
18. The method of claim 17, wherein the viral particle of claim 15 is used to immunize an animal and isolate an antibody, or to screen a human antibody library.
19. A kit for detecting EV71, comprising the cDNA of any one of claims 1 to 6 or the viral particle of claim 15.
20. Use of the cDNA of any one of claims 1 to 6 or the virion construct of claim 15 for the construction of a cell or animal model for further screening for a medicament against viral EV 71.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910474088.3A CN112094822A (en) | 2019-06-02 | 2019-06-02 | Infectious cDNA clone based on EV71 strain and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910474088.3A CN112094822A (en) | 2019-06-02 | 2019-06-02 | Infectious cDNA clone based on EV71 strain and application thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112094822A true CN112094822A (en) | 2020-12-18 |
Family
ID=73748863
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910474088.3A Pending CN112094822A (en) | 2019-06-02 | 2019-06-02 | Infectious cDNA clone based on EV71 strain and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112094822A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115088674A (en) * | 2022-06-10 | 2022-09-23 | 桂林医学院第二附属医院 | Construction method and application of echovirus 30 type wild suckling mouse model |
CN116218907A (en) * | 2023-02-20 | 2023-06-06 | 复旦大学附属中山医院 | Enterovirus infectious clone with HiBiT novel reporter gene, construction method and application thereof |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102766607A (en) * | 2012-07-23 | 2012-11-07 | 哈尔滨医科大学 | Fusion protein for screening and evaluating anti-enterovirus 71 medicine and application of fusion protein |
CN103160475A (en) * | 2011-12-14 | 2013-06-19 | 北京微谷生物医药有限公司 | Enterovirus 71 type viral strain, its application, vaccine and preparation method |
CN103374580A (en) * | 2012-04-27 | 2013-10-30 | 中国医学科学院医药生物技术研究所 | Enterovirus 71 (EV 71) Fuyang strain and cDNA (deoxyribonucleic acid) infectious clone of attenuated strain of enterovirus 71 (EV 71) Fuyang strain as well as application of enterovirus 71 (EV 71) Fuyang strain |
CN103805634A (en) * | 2014-03-05 | 2014-05-21 | 中国科学院武汉病毒研究所 | CA16 infectious clone with green fluorescent protein gene as well as construction method and application of CA16 infectious clone |
US20180036398A1 (en) * | 2015-02-27 | 2018-02-08 | Novartis Ag | Flavivirus replicons |
CN107849540A (en) * | 2015-01-28 | 2018-03-27 | 淡马锡生命科学研究院有限公司 | Enterovirus 71 animal model |
-
2019
- 2019-06-02 CN CN201910474088.3A patent/CN112094822A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103160475A (en) * | 2011-12-14 | 2013-06-19 | 北京微谷生物医药有限公司 | Enterovirus 71 type viral strain, its application, vaccine and preparation method |
CN103374580A (en) * | 2012-04-27 | 2013-10-30 | 中国医学科学院医药生物技术研究所 | Enterovirus 71 (EV 71) Fuyang strain and cDNA (deoxyribonucleic acid) infectious clone of attenuated strain of enterovirus 71 (EV 71) Fuyang strain as well as application of enterovirus 71 (EV 71) Fuyang strain |
CN102766607A (en) * | 2012-07-23 | 2012-11-07 | 哈尔滨医科大学 | Fusion protein for screening and evaluating anti-enterovirus 71 medicine and application of fusion protein |
CN103805634A (en) * | 2014-03-05 | 2014-05-21 | 中国科学院武汉病毒研究所 | CA16 infectious clone with green fluorescent protein gene as well as construction method and application of CA16 infectious clone |
CN107849540A (en) * | 2015-01-28 | 2018-03-27 | 淡马锡生命科学研究院有限公司 | Enterovirus 71 animal model |
US20180036398A1 (en) * | 2015-02-27 | 2018-02-08 | Novartis Ag | Flavivirus replicons |
Non-Patent Citations (2)
Title |
---|
HUIQIANG WANG等: "Recent Progress on Functional Genomics Research of Enterovirus 71", 《VIROLOGICA SINICA》, vol. 34, no. 1, pages 9 - 21, XP036728199, DOI: 10.1007/s12250-018-0071-9 * |
JIE SONG等: "Suppression of the toll-like receptor 7-dependent type I interferon production pathway by autophagy resulting from enterovirus 71 and coxsackievirus A16 infections facilitates their replication", 《ARCH VIROL 》, vol. 163, no. 1, pages 135 - 144, XP036400088, DOI: 10.1007/s00705-017-3592-x * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115088674A (en) * | 2022-06-10 | 2022-09-23 | 桂林医学院第二附属医院 | Construction method and application of echovirus 30 type wild suckling mouse model |
CN116218907A (en) * | 2023-02-20 | 2023-06-06 | 复旦大学附属中山医院 | Enterovirus infectious clone with HiBiT novel reporter gene, construction method and application thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DK2788478T3 (en) | Multiplex IMMUNSCREENINGSASSAY | |
KR102077131B1 (en) | Recombinant measles virus expressing chikungunya virus polypeptides and their applications | |
CN109312360B (en) | Transposon-based transfection system for primary cells | |
AU2023241391A1 (en) | Novel crispr enzymes and systems | |
US6168943B1 (en) | Methods for making modified recombinant vesiculoviruses | |
AU2024216517A1 (en) | Enhanced systems for cell-mediated oncolytic viral therapy | |
KR101227128B1 (en) | INFECTIOUS cDNA OF AN APPROVED VACCINE STRAIN OF MEASLES VIRUS, USE FOR IMMUNOGENIC COMPOSITIONS | |
KR20230038804A (en) | Improved methods for modification of target nucleic acids | |
CN109439708B (en) | Method for producing kola acid by acid-resistant high-density growth escherichia coli | |
KR20070077140A (en) | Method for analyzing protein-protein interaction | |
WO1996034625A9 (en) | Recombinant vesiculoviruses and their uses | |
AU2023270345A1 (en) | Compositions and methods for nucleic acid expression and protein secretion in bacteroides | |
KR20120034652A (en) | Method for generating a genetically modified microbe | |
CN112094822A (en) | Infectious cDNA clone based on EV71 strain and application thereof | |
CN108949825A (en) | A kind of preparation method and application for the CAR-T cell targeting HER2 | |
CN107043783A (en) | A kind of carrier and its application for carrying out live body positioning to mammalian cell gene group based on CRISPRCas9 systems | |
CN110343713A (en) | It is a kind of based on the multi-functional luciferase reporter gene carrier and its construction method of source of people TLR4 gene and application | |
KR20220016485A (en) | AAV vectors having myelin protein zero promoter, and their use for treating Schwann cell-associated diseases such as Charcot-Marie-Tooth disease | |
CN109468244B (en) | Acid-resistant high-density-growth escherichia coli and application thereof | |
CN110777147A (en) | IKZF3 gene-silenced T cell and application thereof | |
US20030182668A1 (en) | Transgenic non-human mammals expressing constitutively activated tyrosine kinase receptors | |
CN114231513B (en) | Short peptide capable of inhibiting proteasome PSMB5 subunit activity and application thereof in resisting rickettsia infection | |
CN114231566B (en) | R26-e (CN 362-1) carrier and preparation method thereof | |
CN114317536B (en) | Preparation method for constructing uPA transgenic mice based on CRISPR/Cas9 | |
CN110129340A (en) | The infection clones of zika virus MR766 strain and its application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |