KR20200024296A - 비인간 대형 유인원 아데노바이러스 핵산-서열 및 아미노산-서열, 이를 포함하는 벡터 및 그의 용도 - Google Patents
비인간 대형 유인원 아데노바이러스 핵산-서열 및 아미노산-서열, 이를 포함하는 벡터 및 그의 용도 Download PDFInfo
- Publication number
- KR20200024296A KR20200024296A KR1020207003223A KR20207003223A KR20200024296A KR 20200024296 A KR20200024296 A KR 20200024296A KR 1020207003223 A KR1020207003223 A KR 1020207003223A KR 20207003223 A KR20207003223 A KR 20207003223A KR 20200024296 A KR20200024296 A KR 20200024296A
- Authority
- KR
- South Korea
- Prior art keywords
- seq
- adenovirus
- amino acid
- variant
- sequence identity
- Prior art date
Links
- 241000701161 unidentified adenovirus Species 0.000 title claims abstract description 415
- 239000013598 vector Substances 0.000 title claims abstract description 114
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 title description 3
- 239000002245 particle Substances 0.000 claims abstract description 51
- 238000000034 method Methods 0.000 claims abstract description 32
- 239000002773 nucleotide Substances 0.000 claims abstract description 19
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 19
- 238000011282 treatment Methods 0.000 claims abstract description 11
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 43
- 108090000623 proteins and genes Proteins 0.000 claims description 168
- 102000040430 polynucleotide Human genes 0.000 claims description 103
- 108091033319 polynucleotide Proteins 0.000 claims description 103
- 239000002157 polynucleotide Substances 0.000 claims description 103
- 102000004169 proteins and genes Human genes 0.000 claims description 84
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 65
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 60
- 229920001184 polypeptide Polymers 0.000 claims description 48
- 239000012634 fragment Substances 0.000 claims description 43
- 210000000234 capsid Anatomy 0.000 claims description 39
- 108091034131 VA RNA Proteins 0.000 claims description 35
- 239000002671 adjuvant Substances 0.000 claims description 25
- 101710094396 Hexon protein Proteins 0.000 claims description 18
- 101000666856 Homo sapiens Vasoactive intestinal polypeptide receptor 1 Proteins 0.000 claims description 16
- 102100038388 Vasoactive intestinal polypeptide receptor 1 Human genes 0.000 claims description 16
- 239000000203 mixture Substances 0.000 claims description 16
- 238000004519 manufacturing process Methods 0.000 claims description 14
- 201000010099 disease Diseases 0.000 claims description 13
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 13
- 108700026758 Adenovirus hexon capsid Proteins 0.000 claims description 8
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 8
- 108091027963 non-coding RNA Proteins 0.000 claims description 7
- 102000042567 non-coding RNA Human genes 0.000 claims description 7
- 238000000338 in vitro Methods 0.000 claims description 6
- 230000002265 prevention Effects 0.000 claims description 5
- 241000700605 Viruses Species 0.000 abstract description 47
- NTIZESTWPVYFNL-UHFFFAOYSA-N Methyl isobutyl ketone Chemical compound CC(C)CC(C)=O NTIZESTWPVYFNL-UHFFFAOYSA-N 0.000 abstract description 42
- 230000036039 immunity Effects 0.000 abstract description 11
- 230000002163 immunogen Effects 0.000 abstract description 9
- 108090000565 Capsid Proteins Proteins 0.000 abstract description 8
- 102100023321 Ceruloplasmin Human genes 0.000 abstract description 8
- 239000008194 pharmaceutical composition Substances 0.000 abstract description 4
- 230000002062 proliferating effect Effects 0.000 abstract description 4
- 230000006806 disease prevention Effects 0.000 abstract description 2
- 210000004027 cell Anatomy 0.000 description 144
- 150000001413 amino acids Chemical group 0.000 description 90
- 230000035772 mutation Effects 0.000 description 82
- 235000018102 proteins Nutrition 0.000 description 78
- 108020004414 DNA Proteins 0.000 description 60
- 235000001014 amino acid Nutrition 0.000 description 51
- 108091007491 NSP3 Papain-like protease domains Proteins 0.000 description 44
- 229940024606 amino acid Drugs 0.000 description 44
- 238000012217 deletion Methods 0.000 description 36
- 230000037430 deletion Effects 0.000 description 36
- 239000000427 antigen Substances 0.000 description 27
- 108091007433 antigens Proteins 0.000 description 27
- 102000036639 antigens Human genes 0.000 description 27
- 229960005486 vaccine Drugs 0.000 description 26
- 238000006467 substitution reaction Methods 0.000 description 22
- 230000003612 virological effect Effects 0.000 description 21
- 108010077245 asparaginyl-proline Proteins 0.000 description 20
- 239000013612 plasmid Substances 0.000 description 20
- 238000001415 gene therapy Methods 0.000 description 19
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 18
- 241001135569 Human adenovirus 5 Species 0.000 description 17
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 17
- -1 preferably E1A Proteins 0.000 description 17
- 239000000835 fiber Substances 0.000 description 16
- 230000006870 function Effects 0.000 description 16
- 241000598171 Human adenovirus sp. Species 0.000 description 15
- 238000002255 vaccination Methods 0.000 description 15
- 101710145505 Fiber protein Proteins 0.000 description 14
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 14
- 108010050848 glycylleucine Proteins 0.000 description 14
- 241000282412 Homo Species 0.000 description 13
- 101710087110 ORF6 protein Proteins 0.000 description 13
- 101710095001 Uncharacterized protein in nifU 5'region Proteins 0.000 description 13
- 108010044940 alanylglutamine Proteins 0.000 description 13
- 230000028993 immune response Effects 0.000 description 13
- 208000015181 infectious disease Diseases 0.000 description 13
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 12
- 210000003719 b-lymphocyte Anatomy 0.000 description 12
- 239000000047 product Substances 0.000 description 12
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 11
- 206010028980 Neoplasm Diseases 0.000 description 11
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 11
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 11
- 108010087924 alanylproline Proteins 0.000 description 11
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 11
- 108010078144 glutaminyl-glycine Proteins 0.000 description 11
- 238000004806 packaging method and process Methods 0.000 description 11
- 230000010076 replication Effects 0.000 description 11
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 10
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 10
- 241000880493 Leptailurus serval Species 0.000 description 10
- 210000001744 T-lymphocyte Anatomy 0.000 description 10
- 239000003814 drug Substances 0.000 description 10
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 10
- 108010085203 methionylmethionine Proteins 0.000 description 10
- 230000035755 proliferation Effects 0.000 description 10
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 10
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 9
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 9
- 241000124008 Mammalia Species 0.000 description 9
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 9
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 9
- 108010089804 glycyl-threonine Proteins 0.000 description 9
- 108010037850 glycylvaline Proteins 0.000 description 9
- 108010003700 lysyl aspartic acid Proteins 0.000 description 9
- 108010038320 lysylphenylalanine Proteins 0.000 description 9
- 108010073969 valyllysine Proteins 0.000 description 9
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 8
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 8
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 8
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 8
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 8
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 8
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 8
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 8
- 241001272567 Hominoidea Species 0.000 description 8
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 8
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 8
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 8
- 108091034117 Oligonucleotide Proteins 0.000 description 8
- 108010005233 alanylglutamic acid Proteins 0.000 description 8
- 230000000295 complement effect Effects 0.000 description 8
- 239000013604 expression vector Substances 0.000 description 8
- 108010057821 leucylproline Proteins 0.000 description 8
- 108010012058 leucyltyrosine Proteins 0.000 description 8
- 108010064235 lysylglycine Proteins 0.000 description 8
- 108010054155 lysyllysine Proteins 0.000 description 8
- 244000052769 pathogen Species 0.000 description 8
- 230000001717 pathogenic effect Effects 0.000 description 8
- 239000000126 substance Substances 0.000 description 8
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 7
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 7
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 7
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 7
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 7
- 241001465754 Metazoa Species 0.000 description 7
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 7
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 7
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 7
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 7
- 108700019146 Transgenes Proteins 0.000 description 7
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 7
- 230000000890 antigenic effect Effects 0.000 description 7
- 238000010367 cloning Methods 0.000 description 7
- 238000010276 construction Methods 0.000 description 7
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 7
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 7
- 230000002458 infectious effect Effects 0.000 description 7
- 108010053725 prolylvaline Proteins 0.000 description 7
- 210000002966 serum Anatomy 0.000 description 7
- 238000001890 transfection Methods 0.000 description 7
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 description 7
- 108010051110 tyrosyl-lysine Proteins 0.000 description 7
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 7
- 101000621943 Acholeplasma phage L2 Probable integrase/recombinase Proteins 0.000 description 6
- 101000768957 Acholeplasma phage L2 Uncharacterized 37.2 kDa protein Proteins 0.000 description 6
- 101000823746 Acidianus ambivalens Uncharacterized 17.7 kDa protein in bps2 3'region Proteins 0.000 description 6
- 101000916369 Acidianus ambivalens Uncharacterized protein in sor 5'region Proteins 0.000 description 6
- 101000769342 Acinetobacter guillouiae Uncharacterized protein in rpoN-murA intergenic region Proteins 0.000 description 6
- 101000823696 Actinobacillus pleuropneumoniae Uncharacterized glycosyltransferase in aroQ 3'region Proteins 0.000 description 6
- 101000786513 Agrobacterium tumefaciens (strain 15955) Uncharacterized protein outside the virF region Proteins 0.000 description 6
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 6
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 6
- 101000618005 Alkalihalobacillus pseudofirmus (strain ATCC BAA-2126 / JCM 17055 / OF4) Uncharacterized protein BpOF4_00885 Proteins 0.000 description 6
- 101000618348 Allochromatium vinosum (strain ATCC 17899 / DSM 180 / NBRC 103801 / NCIMB 10441 / D) Uncharacterized protein Alvin_0065 Proteins 0.000 description 6
- 102100020724 Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Human genes 0.000 description 6
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 6
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 6
- 101000781117 Autographa californica nuclear polyhedrosis virus Uncharacterized 12.4 kDa protein in CTL-LEF2 intergenic region Proteins 0.000 description 6
- 101000967489 Azorhizobium caulinodans (strain ATCC 43989 / DSM 5975 / JCM 20966 / LMG 6465 / NBRC 14845 / NCIMB 13405 / ORS 571) Uncharacterized protein AZC_3924 Proteins 0.000 description 6
- 101000708323 Azospirillum brasilense Uncharacterized 28.8 kDa protein in nifR3-like 5'region Proteins 0.000 description 6
- 101000770311 Azotobacter chroococcum mcd 1 Uncharacterized 19.8 kDa protein in nifW 5'region Proteins 0.000 description 6
- 101000823761 Bacillus licheniformis Uncharacterized 9.4 kDa protein in flaL 3'region Proteins 0.000 description 6
- 101000819719 Bacillus methanolicus Uncharacterized N-acetyltransferase in lysA 3'region Proteins 0.000 description 6
- 101000789586 Bacillus subtilis (strain 168) UPF0702 transmembrane protein YkjA Proteins 0.000 description 6
- 101000748761 Bacillus subtilis (strain 168) Uncharacterized MFS-type transporter YcxA Proteins 0.000 description 6
- 101000792624 Bacillus subtilis (strain 168) Uncharacterized protein YbxH Proteins 0.000 description 6
- 101000790792 Bacillus subtilis (strain 168) Uncharacterized protein YckC Proteins 0.000 description 6
- 101000765620 Bacillus subtilis (strain 168) Uncharacterized protein YlxP Proteins 0.000 description 6
- 101000819705 Bacillus subtilis (strain 168) Uncharacterized protein YlxR Proteins 0.000 description 6
- 101000916134 Bacillus subtilis (strain 168) Uncharacterized protein YqxJ Proteins 0.000 description 6
- 101000948218 Bacillus subtilis (strain 168) Uncharacterized protein YtxJ Proteins 0.000 description 6
- 101000718627 Bacillus thuringiensis subsp. kurstaki Putative RNA polymerase sigma-G factor Proteins 0.000 description 6
- 101000641200 Bombyx mori densovirus Putative non-structural protein Proteins 0.000 description 6
- 101000754349 Bordetella pertussis (strain Tohama I / ATCC BAA-589 / NCTC 13251) UPF0065 protein BP0148 Proteins 0.000 description 6
- 101000827633 Caldicellulosiruptor sp. (strain Rt8B.4) Uncharacterized 23.9 kDa protein in xynA 3'region Proteins 0.000 description 6
- 101000947628 Claviceps purpurea Uncharacterized 11.8 kDa protein Proteins 0.000 description 6
- 101000947633 Claviceps purpurea Uncharacterized 13.8 kDa protein Proteins 0.000 description 6
- 101000686796 Clostridium perfringens Replication protein Proteins 0.000 description 6
- 101000948901 Enterobacteria phage T4 Uncharacterized 16.0 kDa protein in segB-ipI intergenic region Proteins 0.000 description 6
- 101000805958 Equine herpesvirus 4 (strain 1942) Virion protein US10 homolog Proteins 0.000 description 6
- 241000588724 Escherichia coli Species 0.000 description 6
- 101000790442 Escherichia coli Insertion element IS2 uncharacterized 11.1 kDa protein Proteins 0.000 description 6
- 101000788129 Escherichia coli Uncharacterized protein in sul1 3'region Proteins 0.000 description 6
- 101000788370 Escherichia phage P2 Uncharacterized 12.9 kDa protein in GpA 3'region Proteins 0.000 description 6
- 101000788354 Escherichia phage P2 Uncharacterized 8.2 kDa protein in gpA 5'region Proteins 0.000 description 6
- 101000770304 Frankia alni UPF0460 protein in nifX-nifW intergenic region Proteins 0.000 description 6
- 101000797344 Geobacillus stearothermophilus Putative tRNA (cytidine(34)-2'-O)-methyltransferase Proteins 0.000 description 6
- 101000748410 Geobacillus stearothermophilus Uncharacterized protein in fumA 3'region Proteins 0.000 description 6
- 101000787096 Geobacillus stearothermophilus Uncharacterized protein in gldA 3'region Proteins 0.000 description 6
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 6
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 6
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 6
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 6
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 6
- 101000772675 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) UPF0438 protein HI_0847 Proteins 0.000 description 6
- 101000631019 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) Uncharacterized protein HI_0350 Proteins 0.000 description 6
- 101000976889 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 19.2 kDa protein in cox-rep intergenic region Proteins 0.000 description 6
- 101000768938 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 8.9 kDa protein in int-C1 intergenic region Proteins 0.000 description 6
- 101000785414 Homo sapiens Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Proteins 0.000 description 6
- 101000833492 Homo sapiens Jouberin Proteins 0.000 description 6
- 101000651236 Homo sapiens NCK-interacting protein with SH3 domain Proteins 0.000 description 6
- 102100024407 Jouberin Human genes 0.000 description 6
- 101000782488 Junonia coenia densovirus (isolate pBRJ/1990) Putative non-structural protein NS2 Proteins 0.000 description 6
- 101000827627 Klebsiella pneumoniae Putative low molecular weight protein-tyrosine-phosphatase Proteins 0.000 description 6
- 101000811523 Klebsiella pneumoniae Uncharacterized 55.8 kDa protein in cps region Proteins 0.000 description 6
- 101000818409 Lactococcus lactis subsp. lactis Uncharacterized HTH-type transcriptional regulator in lacX 3'region Proteins 0.000 description 6
- 101000878851 Leptolyngbya boryana Putative Fe(2+) transport protein A Proteins 0.000 description 6
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 6
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 6
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 6
- 101000758828 Methanosarcina barkeri (strain Fusaro / DSM 804) Uncharacterized protein Mbar_A1602 Proteins 0.000 description 6
- 101001122401 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF3 Proteins 0.000 description 6
- 101001130841 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF5 Proteins 0.000 description 6
- 101001055788 Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) Pentapeptide repeat protein MfpA Proteins 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 6
- 108700026244 Open Reading Frames Proteins 0.000 description 6
- 101000740670 Orgyia pseudotsugata multicapsid polyhedrosis virus Protein C42 Proteins 0.000 description 6
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 6
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 6
- 101000769182 Photorhabdus luminescens Uncharacterized protein in pnp 3'region Proteins 0.000 description 6
- 101710159752 Poly(3-hydroxyalkanoate) polymerase subunit PhaE Proteins 0.000 description 6
- 101710130262 Probable Vpr-like protein Proteins 0.000 description 6
- 101000961392 Pseudescherichia vulneris Uncharacterized 29.9 kDa protein in crtE 3'region Proteins 0.000 description 6
- 101000731030 Pseudomonas oleovorans Poly(3-hydroxyalkanoate) polymerase 2 Proteins 0.000 description 6
- 101001065485 Pseudomonas putida Probable fatty acid methyltransferase Proteins 0.000 description 6
- 101000711023 Rhizobium leguminosarum bv. trifolii Uncharacterized protein in tfuA 3'region Proteins 0.000 description 6
- 101000974028 Rhizobium leguminosarum bv. viciae (strain 3841) Putative cystathionine beta-lyase Proteins 0.000 description 6
- 101000756519 Rhodobacter capsulatus (strain ATCC BAA-309 / NBRC 16581 / SB1003) Uncharacterized protein RCAP_rcc00048 Proteins 0.000 description 6
- 101000948219 Rhodococcus erythropolis Uncharacterized 11.5 kDa protein in thcD 3'region Proteins 0.000 description 6
- 101000948156 Rhodococcus erythropolis Uncharacterized 47.3 kDa protein in thcA 5'region Proteins 0.000 description 6
- 101000917565 Rhodococcus fascians Uncharacterized 33.6 kDa protein in fasciation locus Proteins 0.000 description 6
- 101000790284 Saimiriine herpesvirus 2 (strain 488) Uncharacterized 9.5 kDa protein in DHFR 3'region Proteins 0.000 description 6
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 6
- 101000936719 Streptococcus gordonii Accessory Sec system protein Asp3 Proteins 0.000 description 6
- 101000936711 Streptococcus gordonii Accessory secretory protein Asp4 Proteins 0.000 description 6
- 101000929863 Streptomyces cinnamonensis Monensin polyketide synthase putative ketoacyl reductase Proteins 0.000 description 6
- 101000788499 Streptomyces coelicolor Uncharacterized oxidoreductase in mprA 5'region Proteins 0.000 description 6
- 101000788468 Streptomyces coelicolor Uncharacterized protein in mprR 3'region Proteins 0.000 description 6
- 101001102841 Streptomyces griseus Purine nucleoside phosphorylase ORF3 Proteins 0.000 description 6
- 101000708557 Streptomyces lincolnensis Uncharacterized 17.2 kDa protein in melC2-rnhH intergenic region Proteins 0.000 description 6
- 101000845085 Streptomyces violaceoruber Granaticin polyketide synthase putative ketoacyl reductase 1 Proteins 0.000 description 6
- 101000649826 Thermotoga neapolitana Putative anti-sigma factor antagonist TM1081 homolog Proteins 0.000 description 6
- 101000711771 Thiocystis violacea Uncharacterized 76.5 kDa protein in phbC 3'region Proteins 0.000 description 6
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 6
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 6
- DYIXEGROAOVQPK-VFAJRCTISA-N Trp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DYIXEGROAOVQPK-VFAJRCTISA-N 0.000 description 6
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 6
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 6
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 6
- 101000711318 Vibrio alginolyticus Uncharacterized 11.6 kDa protein in scrR 3'region Proteins 0.000 description 6
- 101000827562 Vibrio alginolyticus Uncharacterized protein in proC 3'region Proteins 0.000 description 6
- 101000778915 Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633) Uncharacterized membrane protein VP2115 Proteins 0.000 description 6
- 108020005202 Viral DNA Proteins 0.000 description 6
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 6
- 108010047495 alanylglycine Proteins 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 108010036533 arginylvaline Proteins 0.000 description 6
- 108010093581 aspartyl-proline Proteins 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 108010049041 glutamylalanine Proteins 0.000 description 6
- 230000003053 immunization Effects 0.000 description 6
- 238000002649 immunization Methods 0.000 description 6
- 230000005847 immunogenicity Effects 0.000 description 6
- 108010034529 leucyl-lysine Proteins 0.000 description 6
- 230000003472 neutralizing effect Effects 0.000 description 6
- 150000007523 nucleic acids Chemical group 0.000 description 6
- 108010024607 phenylalanylalanine Proteins 0.000 description 6
- 108010012581 phenylalanylglutamate Proteins 0.000 description 6
- 239000000523 sample Substances 0.000 description 6
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 5
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 5
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 5
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 5
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 5
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 5
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 5
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 5
- DDPKBJZLAXLQGZ-KBIXCLLPSA-N Ala-Val-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DDPKBJZLAXLQGZ-KBIXCLLPSA-N 0.000 description 5
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 5
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 5
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 5
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 5
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 5
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 5
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 5
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 5
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 5
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 5
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 5
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 5
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 5
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 5
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 5
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 5
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 5
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 5
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 5
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 5
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 5
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 5
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 5
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 5
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 5
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 5
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 5
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 5
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 5
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 5
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 5
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 5
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 5
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 5
- NHDMNXBBSGVYGP-PYJNHQTQSA-N Met-His-Ile Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)CC1=CN=CN1 NHDMNXBBSGVYGP-PYJNHQTQSA-N 0.000 description 5
- QEDGNYFHLXXIDC-DCAQKATOSA-N Met-Pro-Gln Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O QEDGNYFHLXXIDC-DCAQKATOSA-N 0.000 description 5
- 108010079364 N-glycylalanine Proteins 0.000 description 5
- 108010047562 NGR peptide Proteins 0.000 description 5
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 5
- LNIIRLODKOWQIY-IHRRRGAJSA-N Phe-Asn-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LNIIRLODKOWQIY-IHRRRGAJSA-N 0.000 description 5
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 description 5
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 5
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 5
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 5
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 5
- GBUNEGKQPSAMNK-QTKMDUPCSA-N Pro-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2)O GBUNEGKQPSAMNK-QTKMDUPCSA-N 0.000 description 5
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 5
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 5
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 5
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 5
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 5
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 5
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 5
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 5
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 5
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 5
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 5
- UTQBQJNSNXJNIH-IHPCNDPISA-N Trp-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N UTQBQJNSNXJNIH-IHPCNDPISA-N 0.000 description 5
- PHNBFZBKLWEBJN-BPUTZDHNSA-N Trp-Glu-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PHNBFZBKLWEBJN-BPUTZDHNSA-N 0.000 description 5
- ABRICLFKFRFDKS-IHPCNDPISA-N Trp-Ser-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ABRICLFKFRFDKS-IHPCNDPISA-N 0.000 description 5
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 5
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 5
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 5
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 5
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 5
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 5
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 5
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 5
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 5
- 108010092854 aspartyllysine Proteins 0.000 description 5
- 108010068265 aspartyltyrosine Proteins 0.000 description 5
- 230000001580 bacterial effect Effects 0.000 description 5
- 239000003795 chemical substances by application Substances 0.000 description 5
- 230000002068 genetic effect Effects 0.000 description 5
- 230000013595 glycosylation Effects 0.000 description 5
- 238000006206 glycosylation reaction Methods 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- 108010081551 glycylphenylalanine Proteins 0.000 description 5
- 210000000987 immune system Anatomy 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 5
- 239000002502 liposome Substances 0.000 description 5
- 210000004962 mammalian cell Anatomy 0.000 description 5
- 108010056582 methionylglutamic acid Proteins 0.000 description 5
- 108010068488 methionylphenylalanine Proteins 0.000 description 5
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 5
- 230000037452 priming Effects 0.000 description 5
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 5
- 108010070643 prolylglutamic acid Proteins 0.000 description 5
- 102000005962 receptors Human genes 0.000 description 5
- 108020003175 receptors Proteins 0.000 description 5
- 108010071207 serylmethionine Proteins 0.000 description 5
- 239000013605 shuttle vector Substances 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- 238000010361 transduction Methods 0.000 description 5
- 230000026683 transduction Effects 0.000 description 5
- 230000029812 viral genome replication Effects 0.000 description 5
- 210000002845 virion Anatomy 0.000 description 5
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 4
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 4
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 4
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 4
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 4
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 4
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 4
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 4
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 4
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 4
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 4
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 4
- 102100021277 Beta-secretase 2 Human genes 0.000 description 4
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 4
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 4
- NPMFDZGLKBNFOO-SRVKXCTJSA-N Gln-Pro-His Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NPMFDZGLKBNFOO-SRVKXCTJSA-N 0.000 description 4
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 4
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 4
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 4
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 4
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 4
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 4
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 4
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 4
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 4
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 4
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 4
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 4
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 4
- RCHFYMASWAZQQZ-ZANVPECISA-N Gly-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CN)=CNC2=C1 RCHFYMASWAZQQZ-ZANVPECISA-N 0.000 description 4
- 101150032643 IVa2 gene Proteins 0.000 description 4
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 4
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 4
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 4
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 4
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 4
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 4
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 4
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 4
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 4
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 4
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 4
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 4
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 4
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 4
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 4
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 4
- CTVJSFRHUOSCQQ-DCAQKATOSA-N Met-Arg-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTVJSFRHUOSCQQ-DCAQKATOSA-N 0.000 description 4
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 4
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 4
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 4
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 4
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 4
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 4
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 4
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 4
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 4
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 4
- YKQNVTOIYFQMLW-IHRRRGAJSA-N Pro-Cys-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 YKQNVTOIYFQMLW-IHRRRGAJSA-N 0.000 description 4
- BARPGRUZBKFJMA-SRVKXCTJSA-N Pro-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BARPGRUZBKFJMA-SRVKXCTJSA-N 0.000 description 4
- DGPGKMKUNGKHPK-QEJZJMRPSA-N Ser-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGPGKMKUNGKHPK-QEJZJMRPSA-N 0.000 description 4
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 4
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 4
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 4
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 4
- 102000008229 Toll-like receptor 1 Human genes 0.000 description 4
- 108010060889 Toll-like receptor 1 Proteins 0.000 description 4
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 4
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 4
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 4
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 4
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 4
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 230000006801 homologous recombination Effects 0.000 description 4
- 238000002744 homologous recombination Methods 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 238000002347 injection Methods 0.000 description 4
- 239000007924 injection Substances 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 102000039446 nucleic acids Human genes 0.000 description 4
- 108020004707 nucleic acids Proteins 0.000 description 4
- 239000013600 plasmid vector Substances 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 230000003362 replicative effect Effects 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 108010084932 tryptophyl-proline Proteins 0.000 description 4
- 108010009962 valyltyrosine Proteins 0.000 description 4
- 239000013603 viral vector Substances 0.000 description 4
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 3
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 3
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 3
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 3
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 3
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 3
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 3
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 3
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 3
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 3
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 3
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 3
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 3
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 3
- OCOZPTHLDVSFCZ-BPUTZDHNSA-N Arg-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N OCOZPTHLDVSFCZ-BPUTZDHNSA-N 0.000 description 3
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 3
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 3
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 3
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 3
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 3
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 3
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 3
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 3
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 3
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 3
- ZTRJUKDEALVRMW-SRVKXCTJSA-N Asn-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZTRJUKDEALVRMW-SRVKXCTJSA-N 0.000 description 3
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 3
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 3
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 3
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 description 3
- PBFXCUOEGVJTMV-QXEWZRGKSA-N Asn-Met-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O PBFXCUOEGVJTMV-QXEWZRGKSA-N 0.000 description 3
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 3
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 3
- YUUIAUXBNOHFRJ-IHRRRGAJSA-N Asn-Phe-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O YUUIAUXBNOHFRJ-IHRRRGAJSA-N 0.000 description 3
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 3
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 3
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 3
- YNQMEIJEWSHOEO-SRVKXCTJSA-N Asn-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YNQMEIJEWSHOEO-SRVKXCTJSA-N 0.000 description 3
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 3
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 3
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 3
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 3
- 102100031725 Cortactin-binding protein 2 Human genes 0.000 description 3
- SQJSYLDKQBZQTG-FXQIFTODSA-N Cys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N SQJSYLDKQBZQTG-FXQIFTODSA-N 0.000 description 3
- 206010059866 Drug resistance Diseases 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 3
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 3
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 3
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 3
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 3
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 3
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 3
- CTJRFALAOYAJBX-NWLDYVSISA-N Gln-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N)O CTJRFALAOYAJBX-NWLDYVSISA-N 0.000 description 3
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 3
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 3
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 3
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 3
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 3
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 3
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 3
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 3
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 3
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 3
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 3
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 3
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 3
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 3
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 3
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 3
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 3
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 3
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 3
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 3
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 3
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 3
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- AVQOSMRPITVTRB-CIUDSAMLSA-N His-Asn-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AVQOSMRPITVTRB-CIUDSAMLSA-N 0.000 description 3
- 101000908757 Human adenovirus C serotype 2 Early 4 ORF4 protein Proteins 0.000 description 3
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 3
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 3
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 3
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 description 3
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 3
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 3
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 3
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 3
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 3
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 3
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 3
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 3
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 3
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 3
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 3
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 3
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 3
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 3
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 3
- YWJQHDDBFAXNIR-MXAVVETBSA-N Lys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N YWJQHDDBFAXNIR-MXAVVETBSA-N 0.000 description 3
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 3
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 3
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 3
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 3
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 3
- MUYQDMBLDFEVRJ-LSJOCFKGSA-N Met-Ala-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 MUYQDMBLDFEVRJ-LSJOCFKGSA-N 0.000 description 3
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 3
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 3
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 3
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 3
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 3
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 3
- WYNIRYZIFZGWQD-BPUTZDHNSA-N Met-Trp-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WYNIRYZIFZGWQD-BPUTZDHNSA-N 0.000 description 3
- OOLVTRHJJBCJKB-IHRRRGAJSA-N Met-Tyr-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OOLVTRHJJBCJKB-IHRRRGAJSA-N 0.000 description 3
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 3
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- 208000031662 Noncommunicable disease Diseases 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 3
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 description 3
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 3
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 3
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 3
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 3
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 3
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 3
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 3
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 3
- ODGNUUUDJONJSC-UFYCRDLUSA-N Phe-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O ODGNUUUDJONJSC-UFYCRDLUSA-N 0.000 description 3
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 3
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 3
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 3
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 3
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 3
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 3
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 3
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 3
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 3
- 101710197985 Probable protein Rev Proteins 0.000 description 3
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 3
- 102100040307 Protein FAM3B Human genes 0.000 description 3
- 101500027983 Rattus norvegicus Octadecaneuropeptide Proteins 0.000 description 3
- 108700008625 Reporter Genes Proteins 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 3
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 3
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 3
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 3
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 3
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 3
- YXEYTHXDRDAIOJ-CWRNSKLLSA-N Ser-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N)C(=O)O YXEYTHXDRDAIOJ-CWRNSKLLSA-N 0.000 description 3
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 3
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 3
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 3
- CYVQBKQYQGEELV-NKIYYHGXSA-N Thr-His-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CYVQBKQYQGEELV-NKIYYHGXSA-N 0.000 description 3
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 3
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 3
- PZSDPRBZINDEJV-HTUGSXCWSA-N Thr-Phe-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PZSDPRBZINDEJV-HTUGSXCWSA-N 0.000 description 3
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 3
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 3
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 3
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 3
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 3
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 3
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 3
- NXAPHBHZCMQORW-FDARSICLSA-N Trp-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NXAPHBHZCMQORW-FDARSICLSA-N 0.000 description 3
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 3
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 3
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 3
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 3
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 3
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 3
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 3
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 3
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 3
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 3
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 3
- 101710198378 Uncharacterized 10.8 kDa protein in cox-rep intergenic region Proteins 0.000 description 3
- 101710110895 Uncharacterized 7.3 kDa protein in cox-rep intergenic region Proteins 0.000 description 3
- 101710134973 Uncharacterized 9.7 kDa protein in cox-rep intergenic region Proteins 0.000 description 3
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 3
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 3
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 3
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 3
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 3
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 3
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 3
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 3
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 3
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 3
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 3
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 3
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 3
- 230000002378 acidificating effect Effects 0.000 description 3
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 239000000969 carrier Substances 0.000 description 3
- 238000007385 chemical modification Methods 0.000 description 3
- 230000002950 deficient Effects 0.000 description 3
- 230000003111 delayed effect Effects 0.000 description 3
- 239000003085 diluting agent Substances 0.000 description 3
- 239000006185 dispersion Substances 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 108091005899 fibrous proteins Proteins 0.000 description 3
- 102000034240 fibrous proteins Human genes 0.000 description 3
- 210000003494 hepatocyte Anatomy 0.000 description 3
- 108010036413 histidylglycine Proteins 0.000 description 3
- 238000007918 intramuscular administration Methods 0.000 description 3
- 108010078274 isoleucylvaline Proteins 0.000 description 3
- 239000000314 lubricant Substances 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 108010005942 methionylglycine Proteins 0.000 description 3
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 230000008488 polyadenylation Effects 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 108010077112 prolyl-proline Proteins 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 239000012266 salt solution Substances 0.000 description 3
- 238000011856 somatic therapy Methods 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 230000001225 therapeutic effect Effects 0.000 description 3
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 3
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 3
- 108010001055 thymocartin Proteins 0.000 description 3
- 210000001519 tissue Anatomy 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 210000004881 tumor cell Anatomy 0.000 description 3
- PIDRBUDUWHBYSR-UHFFFAOYSA-N 1-[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O PIDRBUDUWHBYSR-UHFFFAOYSA-N 0.000 description 2
- PPINMSZPTPRQQB-NHCYSSNCSA-N 2-[[(2s)-1-[(2s)-2-[[(2s)-2-amino-3-methylbutanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PPINMSZPTPRQQB-NHCYSSNCSA-N 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 2
- YHOPXCAOTRUGLV-XAMCCFCMSA-N Ala-Ala-Asp-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YHOPXCAOTRUGLV-XAMCCFCMSA-N 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 2
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 2
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 2
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 2
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 2
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 2
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 2
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 2
- XMGVWQWEWWULNS-BPUTZDHNSA-N Arg-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XMGVWQWEWWULNS-BPUTZDHNSA-N 0.000 description 2
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 2
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 2
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 2
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 2
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 2
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 2
- HUAOKVVEVHACHR-CIUDSAMLSA-N Asn-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N HUAOKVVEVHACHR-CIUDSAMLSA-N 0.000 description 2
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 2
- DAYDURRBMDCCFL-AAEUAGOBSA-N Asn-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N DAYDURRBMDCCFL-AAEUAGOBSA-N 0.000 description 2
- KSZHWTRZPOTIGY-AVGNSLFASA-N Asn-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KSZHWTRZPOTIGY-AVGNSLFASA-N 0.000 description 2
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 2
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 2
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 2
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 2
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 2
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 description 2
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 2
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 2
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- 241000282472 Canis lupus familiaris Species 0.000 description 2
- 229920002134 Carboxymethyl cellulose Polymers 0.000 description 2
- 108010078791 Carrier Proteins Proteins 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 208000035473 Communicable disease Diseases 0.000 description 2
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 2
- JUNZLDGUJZIUCO-IHRRRGAJSA-N Cys-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O JUNZLDGUJZIUCO-IHRRRGAJSA-N 0.000 description 2
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 2
- 108090000695 Cytokines Proteins 0.000 description 2
- 102000004127 Cytokines Human genes 0.000 description 2
- 230000004544 DNA amplification Effects 0.000 description 2
- 241000702421 Dependoparvovirus Species 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 108010067770 Endopeptidase K Proteins 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- 101710177291 Gag polyprotein Proteins 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 2
- OIIIRRTWYLCQNW-ACZMJKKPSA-N Gln-Cys-Asn Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O OIIIRRTWYLCQNW-ACZMJKKPSA-N 0.000 description 2
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 2
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 2
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 2
- GQTNWYFWSUFFRA-KKUMJFAQSA-N Gln-Met-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GQTNWYFWSUFFRA-KKUMJFAQSA-N 0.000 description 2
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 2
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 2
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 2
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 2
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 2
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 2
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 2
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 2
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 2
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 2
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 2
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 2
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 2
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 2
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 2
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 2
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 2
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 2
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 2
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 2
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- ZKLYPEGLWFVRGF-IUCAKERBSA-N Gly-His-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZKLYPEGLWFVRGF-IUCAKERBSA-N 0.000 description 2
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 2
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 2
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 2
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- MXIULRKNFSCJHT-STQMWFEESA-N Gly-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 MXIULRKNFSCJHT-STQMWFEESA-N 0.000 description 2
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 2
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 102000002812 Heat-Shock Proteins Human genes 0.000 description 2
- 108010004889 Heat-Shock Proteins Proteins 0.000 description 2
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 description 2
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 2
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 2
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 2
- FBVHRDXSCYELMI-PBCZWWQYSA-N His-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O FBVHRDXSCYELMI-PBCZWWQYSA-N 0.000 description 2
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 2
- 101000669447 Homo sapiens Toll-like receptor 4 Proteins 0.000 description 2
- 101000669460 Homo sapiens Toll-like receptor 5 Proteins 0.000 description 2
- 101000669402 Homo sapiens Toll-like receptor 7 Proteins 0.000 description 2
- 101001082397 Human adenovirus B serotype 3 Hexon-associated protein Proteins 0.000 description 2
- 241000193096 Human adenovirus B3 Species 0.000 description 2
- 241000725303 Human immunodeficiency virus Species 0.000 description 2
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 2
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 2
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 2
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 2
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 2
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 2
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 2
- RENBRDSDKPSRIH-HJWJTTGWSA-N Ile-Phe-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O RENBRDSDKPSRIH-HJWJTTGWSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 2
- 108010074328 Interferon-gamma Proteins 0.000 description 2
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 2
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 2
- CBNMHRCLYBJIIZ-XUXIUFHCSA-N Lys-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N CBNMHRCLYBJIIZ-XUXIUFHCSA-N 0.000 description 2
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 2
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 2
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 2
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 2
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 2
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- 102000043129 MHC class I family Human genes 0.000 description 2
- 108091054437 MHC class I family Proteins 0.000 description 2
- 102000043131 MHC class II family Human genes 0.000 description 2
- 108091054438 MHC class II family Proteins 0.000 description 2
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 2
- FRWZTWWOORIIBA-FXQIFTODSA-N Met-Asn-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FRWZTWWOORIIBA-FXQIFTODSA-N 0.000 description 2
- HLYIDXAXQIJYIG-CIUDSAMLSA-N Met-Gln-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HLYIDXAXQIJYIG-CIUDSAMLSA-N 0.000 description 2
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 2
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 2
- LQTGGXSOMDSWTQ-UNQGMJICSA-N Met-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCSC)N)O LQTGGXSOMDSWTQ-UNQGMJICSA-N 0.000 description 2
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 2
- 108700011259 MicroRNAs Proteins 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- OXUMFAOVGFODPN-KKUMJFAQSA-N Phe-Asn-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OXUMFAOVGFODPN-KKUMJFAQSA-N 0.000 description 2
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 2
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 2
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 2
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- WECYCNFPGZLOOU-FXQIFTODSA-N Pro-Asn-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O WECYCNFPGZLOOU-FXQIFTODSA-N 0.000 description 2
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 2
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 2
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 2
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 2
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 2
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 2
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 2
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 2
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 2
- 101001120093 Pseudoalteromonas phage PM2 Protein P8 Proteins 0.000 description 2
- 241000283984 Rodentia Species 0.000 description 2
- 241000607142 Salmonella Species 0.000 description 2
- 241000710961 Semliki Forest virus Species 0.000 description 2
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 2
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 2
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 2
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 2
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 2
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 2
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 2
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 2
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 2
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 2
- 101710172711 Structural protein Proteins 0.000 description 2
- 230000005867 T cell response Effects 0.000 description 2
- 108020005038 Terminator Codon Proteins 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 2
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 2
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 2
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 2
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 2
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 2
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 2
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 2
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 2
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 2
- 108010060818 Toll-Like Receptor 9 Proteins 0.000 description 2
- 102000002689 Toll-like receptor Human genes 0.000 description 2
- 108020000411 Toll-like receptor Proteins 0.000 description 2
- 102100039360 Toll-like receptor 4 Human genes 0.000 description 2
- 102100039357 Toll-like receptor 5 Human genes 0.000 description 2
- 102100039390 Toll-like receptor 7 Human genes 0.000 description 2
- 102100033117 Toll-like receptor 9 Human genes 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- 102000040945 Transcription factor Human genes 0.000 description 2
- WACMTVIJWRNVSO-CWRNSKLLSA-N Trp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O WACMTVIJWRNVSO-CWRNSKLLSA-N 0.000 description 2
- RKISDJMICOREEL-QRTARXTBSA-N Trp-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RKISDJMICOREEL-QRTARXTBSA-N 0.000 description 2
- 108060008683 Tumor Necrosis Factor Receptor Proteins 0.000 description 2
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 2
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 2
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 2
- YSGAPESOXHFTQY-IHRRRGAJSA-N Tyr-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N YSGAPESOXHFTQY-IHRRRGAJSA-N 0.000 description 2
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 2
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 2
- 206010046865 Vaccinia virus infection Diseases 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 2
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 2
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 2
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 2
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 2
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 2
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 2
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 2
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 2
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 2
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 2
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 2
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 2
- 108700005077 Viral Genes Proteins 0.000 description 2
- 108010067390 Viral Proteins Proteins 0.000 description 2
- 230000001464 adherent effect Effects 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 238000010171 animal model Methods 0.000 description 2
- 210000000612 antigen-presenting cell Anatomy 0.000 description 2
- 239000002246 antineoplastic agent Substances 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 239000011230 binding agent Substances 0.000 description 2
- 229920001400 block copolymer Polymers 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 229960005084 calcitriol Drugs 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 239000002775 capsule Substances 0.000 description 2
- 239000001768 carboxy methyl cellulose Substances 0.000 description 2
- 235000010948 carboxy methyl cellulose Nutrition 0.000 description 2
- 239000008112 carboxymethyl-cellulose Substances 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- OSASVXMJTNOKOY-UHFFFAOYSA-N chlorobutanol Chemical compound CC(C)(O)C(Cl)(Cl)Cl OSASVXMJTNOKOY-UHFFFAOYSA-N 0.000 description 2
- 229940110456 cocoa butter Drugs 0.000 description 2
- 235000019868 cocoa butter Nutrition 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 239000007884 disintegrant Substances 0.000 description 2
- 238000001493 electron microscopy Methods 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 239000002158 endotoxin Substances 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 230000002550 fecal effect Effects 0.000 description 2
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- PQNFLJBBNBOBRQ-UHFFFAOYSA-N indane Chemical compound C1=CC=C2CCCC2=C1 PQNFLJBBNBOBRQ-UHFFFAOYSA-N 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000001802 infusion Methods 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 229920006008 lipopolysaccharide Polymers 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 229940035032 monophosphoryl lipid a Drugs 0.000 description 2
- 210000003205 muscle Anatomy 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 238000012856 packing Methods 0.000 description 2
- 244000045947 parasite Species 0.000 description 2
- 238000007911 parenteral administration Methods 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 150000004713 phosphodiesters Chemical class 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 229920001451 polypropylene glycol Polymers 0.000 description 2
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 2
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 2
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 239000003755 preservative agent Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 230000000069 prophylactic effect Effects 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 230000000638 stimulation Effects 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 102000003298 tumor necrosis factor receptor Human genes 0.000 description 2
- 102000042286 type I cytokine receptor family Human genes 0.000 description 2
- 108091052247 type I cytokine receptor family Proteins 0.000 description 2
- 102000042287 type II cytokine receptor family Human genes 0.000 description 2
- 108091052254 type II cytokine receptor family Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 208000007089 vaccinia Diseases 0.000 description 2
- 108010072644 valyl-alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010012050 valyl-aspartyl-prolyl-proline Proteins 0.000 description 2
- 102000009310 vitamin D receptors Human genes 0.000 description 2
- 108050000156 vitamin D receptors Proteins 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- MZOFCQQQCNRIBI-VMXHOPILSA-N (3s)-4-[[(2s)-1-[[(2s)-1-[[(1s)-1-carboxy-2-hydroxyethyl]amino]-4-methyl-1-oxopentan-2-yl]amino]-5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-3-[[2-[[(2s)-2,6-diaminohexanoyl]amino]acetyl]amino]-4-oxobutanoic acid Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN MZOFCQQQCNRIBI-VMXHOPILSA-N 0.000 description 1
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 1
- 108010042708 Acetylmuramyl-Alanyl-Isoglutamine Proteins 0.000 description 1
- 206010069754 Acquired gene mutation Diseases 0.000 description 1
- 208000010370 Adenoviridae Infections Diseases 0.000 description 1
- 108010057856 Adenovirus E2 Proteins Proteins 0.000 description 1
- 108010027410 Adenovirus E3 Proteins Proteins 0.000 description 1
- 108010056962 Adenovirus E4 Proteins Proteins 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 229910017119 AlPO Inorganic materials 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- FRFDXQWNDZMREB-ACZMJKKPSA-N Ala-Cys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRFDXQWNDZMREB-ACZMJKKPSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 1
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- XPBVBZPVNFIHOA-UVBJJODRSA-N Ala-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 XPBVBZPVNFIHOA-UVBJJODRSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 241000710929 Alphavirus Species 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- YLVGUOGAFAJMKP-JYJNAYRXSA-N Arg-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YLVGUOGAFAJMKP-JYJNAYRXSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- XUTOXNRSAGLAKO-UHFFFAOYSA-N Asn Val Asn Pro Chemical compound NC(=O)CC(N)C(=O)NC(C(C)C)C(=O)NC(CC(N)=O)C(=O)N1CCCC1C(O)=O XUTOXNRSAGLAKO-UHFFFAOYSA-N 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 1
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 1
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- WQSCVMQDZYTFQU-FXQIFTODSA-N Asn-Cys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WQSCVMQDZYTFQU-FXQIFTODSA-N 0.000 description 1
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 1
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 1
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- IPPFAOCLQSGHJV-WFBYXXMGSA-N Asn-Trp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O IPPFAOCLQSGHJV-WFBYXXMGSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- CNKAZIGBGQIHLL-GUBZILKMSA-N Asp-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N CNKAZIGBGQIHLL-GUBZILKMSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- HSGOFISJLFDMBJ-CIUDSAMLSA-N Asp-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N HSGOFISJLFDMBJ-CIUDSAMLSA-N 0.000 description 1
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- LGGHQRZIJSYRHA-GUBZILKMSA-N Asp-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N LGGHQRZIJSYRHA-GUBZILKMSA-N 0.000 description 1
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- CXEFNHOVIIDHFU-IHPCNDPISA-N Asp-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N CXEFNHOVIIDHFU-IHPCNDPISA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000005623 Carcinogenesis Diseases 0.000 description 1
- 206010007269 Carcinogenicity Diseases 0.000 description 1
- 241001217856 Chimpanzee adenovirus Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- 229920002785 Croscarmellose sodium Polymers 0.000 description 1
- FWYBFUDWUUFLDN-FXQIFTODSA-N Cys-Asp-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N FWYBFUDWUUFLDN-FXQIFTODSA-N 0.000 description 1
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 1
- GFMJUESGWILPEN-MELADBBJSA-N Cys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CS)N)C(=O)O GFMJUESGWILPEN-MELADBBJSA-N 0.000 description 1
- PXEGEYISOXISDV-XIRDDKMYSA-N Cys-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CS)=CNC2=C1 PXEGEYISOXISDV-XIRDDKMYSA-N 0.000 description 1
- BOMGEMDZTNZESV-QWRGUYRKSA-N Cys-Tyr-Gly Chemical compound SC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 BOMGEMDZTNZESV-QWRGUYRKSA-N 0.000 description 1
- UOEYKPDDHSFMLI-DCAQKATOSA-N Cys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N UOEYKPDDHSFMLI-DCAQKATOSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 101710135281 DNA polymerase III PolC-type Proteins 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 241000450599 DNA viruses Species 0.000 description 1
- 101150066038 E4 gene Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 102100038132 Endogenous retrovirus group K member 6 Pro protein Human genes 0.000 description 1
- 101710091045 Envelope protein Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 101001091269 Escherichia coli Hygromycin-B 4-O-kinase Proteins 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 108700039887 Essential Genes Proteins 0.000 description 1
- 102000016359 Fibronectins Human genes 0.000 description 1
- 108010067306 Fibronectins Proteins 0.000 description 1
- 108010040721 Flagellin Proteins 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 1
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 1
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- DQPOBSRQNWOBNA-GUBZILKMSA-N Gln-His-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O DQPOBSRQNWOBNA-GUBZILKMSA-N 0.000 description 1
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 1
- KHNJVFYHIKLUPD-SRVKXCTJSA-N Gln-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHNJVFYHIKLUPD-SRVKXCTJSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- FTTHLXOMDMLKKW-FHWLQOOXSA-N Gln-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTTHLXOMDMLKKW-FHWLQOOXSA-N 0.000 description 1
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- YJSCHRBERYWPQL-DCAQKATOSA-N Gln-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N YJSCHRBERYWPQL-DCAQKATOSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- QLNKFGTZOBVMCS-JBACZVJFSA-N Glu-Tyr-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QLNKFGTZOBVMCS-JBACZVJFSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- LJXWZPHEMJSNRC-KBPBESRZSA-N Gly-Gln-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LJXWZPHEMJSNRC-KBPBESRZSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- 208000028782 Hereditary disease Diseases 0.000 description 1
- 101710155188 Hexon-interlacing protein Proteins 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 1
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 1
- XJQDHFMUUBRCGA-KKUMJFAQSA-N His-Asn-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XJQDHFMUUBRCGA-KKUMJFAQSA-N 0.000 description 1
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 1
- LCNNHVQNFNJLGK-AVGNSLFASA-N His-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N LCNNHVQNFNJLGK-AVGNSLFASA-N 0.000 description 1
- WJGSTIMGSIWHJX-HVTMNAMFSA-N His-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WJGSTIMGSIWHJX-HVTMNAMFSA-N 0.000 description 1
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 1
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 1
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 1
- DMAPKBANYNZHNR-ULQDDVLXSA-N His-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DMAPKBANYNZHNR-ULQDDVLXSA-N 0.000 description 1
- 241000282418 Hominidae Species 0.000 description 1
- 241000701024 Human betaherpesvirus 5 Species 0.000 description 1
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 1
- 239000004354 Hydroxyethyl cellulose Substances 0.000 description 1
- 229920000663 Hydroxyethyl cellulose Polymers 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- DXUJSRIVSWEOAG-NAKRPEOUSA-N Ile-Arg-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N DXUJSRIVSWEOAG-NAKRPEOUSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- RWYCOSAAAJBJQL-KCTSRDHCSA-N Ile-Gly-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RWYCOSAAAJBJQL-KCTSRDHCSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 1
- IDMNOFVUXYYZPF-DKIMLUQUSA-N Ile-Lys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IDMNOFVUXYYZPF-DKIMLUQUSA-N 0.000 description 1
- UOPBQSJRBONRON-STECZYCISA-N Ile-Met-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOPBQSJRBONRON-STECZYCISA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- GMUYXHHJAGQHGB-TUBUOCAGSA-N Ile-Thr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMUYXHHJAGQHGB-TUBUOCAGSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- 229940125581 ImmunityBio COVID-19 vaccine Drugs 0.000 description 1
- 102100037850 Interferon gamma Human genes 0.000 description 1
- 102000008070 Interferon-gamma Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108090000171 Interleukin-18 Proteins 0.000 description 1
- 108090001005 Interleukin-6 Proteins 0.000 description 1
- 108010002586 Interleukin-7 Proteins 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 101000839464 Leishmania braziliensis Heat shock 70 kDa protein Proteins 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 1
- MJTOYIHCKVQICL-ULQDDVLXSA-N Leu-Met-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MJTOYIHCKVQICL-ULQDDVLXSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 1
- 108010028921 Lipopeptides Proteins 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- NLOZZWJNIKKYSC-WDSOQIARSA-N Lys-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 NLOZZWJNIKKYSC-WDSOQIARSA-N 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 1
- SQXUUGUCGJSWCK-CIUDSAMLSA-N Lys-Asp-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N SQXUUGUCGJSWCK-CIUDSAMLSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- AIPHUKOBUXJNKM-KKUMJFAQSA-N Lys-Cys-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AIPHUKOBUXJNKM-KKUMJFAQSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- GFWLIJDQILOEPP-HSCHXYMDSA-N Lys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N GFWLIJDQILOEPP-HSCHXYMDSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 1
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- VWJFOUBDZIUXGA-AVGNSLFASA-N Lys-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N VWJFOUBDZIUXGA-AVGNSLFASA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 241000282560 Macaca mulatta Species 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 241000712079 Measles morbillivirus Species 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 208000024556 Mendelian disease Diseases 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- ZAJNRWKGHWGPDQ-SDDRHHMPSA-N Met-Arg-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N ZAJNRWKGHWGPDQ-SDDRHHMPSA-N 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- OXHSZBRPUGNMKW-DCAQKATOSA-N Met-Gln-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OXHSZBRPUGNMKW-DCAQKATOSA-N 0.000 description 1
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 1
- GPVLSVCBKUCEBI-KKUMJFAQSA-N Met-Gln-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GPVLSVCBKUCEBI-KKUMJFAQSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- STTRPDDKDVKIDF-KKUMJFAQSA-N Met-Glu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 STTRPDDKDVKIDF-KKUMJFAQSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
- YLBUMXYVQCHBPR-ULQDDVLXSA-N Met-Leu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YLBUMXYVQCHBPR-ULQDDVLXSA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- ILKCLLLOGPDNIP-RCWTZXSCSA-N Met-Met-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ILKCLLLOGPDNIP-RCWTZXSCSA-N 0.000 description 1
- GRKPXCKLOOUDFG-UFYCRDLUSA-N Met-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 GRKPXCKLOOUDFG-UFYCRDLUSA-N 0.000 description 1
- FDGAMQVRGORBDV-GUBZILKMSA-N Met-Ser-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCSC FDGAMQVRGORBDV-GUBZILKMSA-N 0.000 description 1
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 229920000168 Microcrystalline cellulose Polymers 0.000 description 1
- 108020005196 Mitochondrial DNA Proteins 0.000 description 1
- MSFSPUZXLOGKHJ-UHFFFAOYSA-N Muraminsaeure Natural products OC(=O)C(C)OC1C(N)C(O)OC(CO)C1O MSFSPUZXLOGKHJ-UHFFFAOYSA-N 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 229930012538 Paclitaxel Natural products 0.000 description 1
- 241000282577 Pan troglodytes Species 0.000 description 1
- 241001631646 Papillomaviridae Species 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108010013639 Peptidoglycan Proteins 0.000 description 1
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 1
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- FKFCKDROTNIVSO-JYJNAYRXSA-N Phe-Pro-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O FKFCKDROTNIVSO-JYJNAYRXSA-N 0.000 description 1
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- 101710182846 Polyhedrin Proteins 0.000 description 1
- 108010076039 Polyproteins Proteins 0.000 description 1
- 101710101995 Pre-hexon-linking protein IIIa Proteins 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- NGNNPLJHUFCOMZ-FXQIFTODSA-N Pro-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 NGNNPLJHUFCOMZ-FXQIFTODSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- SHAQGFGGJSLLHE-BQBZGAKWSA-N Pro-Gln Chemical compound NC(=O)CC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 SHAQGFGGJSLLHE-BQBZGAKWSA-N 0.000 description 1
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 1
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 1
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 1
- APIAILHCTSBGLU-JYJNAYRXSA-N Pro-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@@H]2CCCN2 APIAILHCTSBGLU-JYJNAYRXSA-N 0.000 description 1
- WLJYLAQSUSIQNH-GUBZILKMSA-N Pro-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@@H]1CCCN1 WLJYLAQSUSIQNH-GUBZILKMSA-N 0.000 description 1
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 1
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 101710118538 Protease Proteins 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 101710188315 Protein X Proteins 0.000 description 1
- 101000584831 Pseudoalteromonas phage PM2 Protein P6 Proteins 0.000 description 1
- 206010037660 Pyrexia Diseases 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 101000999689 Saimiriine herpesvirus 2 (strain 11) Transcriptional regulator ICP22 homolog Proteins 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- JJUNLJTUIKFPRF-BPUTZDHNSA-N Ser-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N JJUNLJTUIKFPRF-BPUTZDHNSA-N 0.000 description 1
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- QNBVFKZSSRYNFX-CUJWVEQBSA-N Ser-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N)O QNBVFKZSSRYNFX-CUJWVEQBSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 101100289792 Squirrel monkey polyomavirus large T gene Proteins 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- 101001091268 Streptomyces hygroscopicus Hygromycin-B 7''-O-kinase Proteins 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 108010008038 Synthetic Vaccines Proteins 0.000 description 1
- 101150006914 TRP1 gene Proteins 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- OHDXOXIZXSFCDN-RCWTZXSCSA-N Thr-Met-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OHDXOXIZXSFCDN-RCWTZXSCSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 229940123560 Toll-like receptor 4 agonist Drugs 0.000 description 1
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 1
- XNRJFXBORWMIPY-DCPHZVHLSA-N Trp-Ala-Phe Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XNRJFXBORWMIPY-DCPHZVHLSA-N 0.000 description 1
- AVYVKJMBNLPWRX-WFBYXXMGSA-N Trp-Ala-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 AVYVKJMBNLPWRX-WFBYXXMGSA-N 0.000 description 1
- PNKDNKGMEHJTJQ-BPUTZDHNSA-N Trp-Arg-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PNKDNKGMEHJTJQ-BPUTZDHNSA-N 0.000 description 1
- CDPXXGFRDZVVGF-OYDLWJJNSA-N Trp-Arg-Trp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CDPXXGFRDZVVGF-OYDLWJJNSA-N 0.000 description 1
- NAQBQJOGGYGCOT-QEJZJMRPSA-N Trp-Asn-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NAQBQJOGGYGCOT-QEJZJMRPSA-N 0.000 description 1
- IXEGQBJZDIRRIV-QEJZJMRPSA-N Trp-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IXEGQBJZDIRRIV-QEJZJMRPSA-N 0.000 description 1
- LVTKHGUGBGNBPL-UHFFFAOYSA-N Trp-P-1 Chemical compound N1C2=CC=CC=C2C2=C1C(C)=C(N)N=C2C LVTKHGUGBGNBPL-UHFFFAOYSA-N 0.000 description 1
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 1
- HWCBFXAWVTXXHZ-NYVOZVTQSA-N Trp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N HWCBFXAWVTXXHZ-NYVOZVTQSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- 102100040247 Tumor necrosis factor Human genes 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- AKFLVKKWVZMFOT-IHRRRGAJSA-N Tyr-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AKFLVKKWVZMFOT-IHRRRGAJSA-N 0.000 description 1
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 description 1
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 1
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 1
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 description 1
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 1
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 1
- VPEFOFYNHBWFNQ-UFYCRDLUSA-N Tyr-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 VPEFOFYNHBWFNQ-UFYCRDLUSA-N 0.000 description 1
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- YMZYSCDRTXEOKD-IHPCNDPISA-N Tyr-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N YMZYSCDRTXEOKD-IHPCNDPISA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- XPYNXORPPVTVQK-SRVKXCTJSA-N Val-Arg-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N XPYNXORPPVTVQK-SRVKXCTJSA-N 0.000 description 1
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 1
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- MHAHQDBEIDPFQS-NHCYSSNCSA-N Val-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C MHAHQDBEIDPFQS-NHCYSSNCSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- SDSCOOZQQGUQFC-GVXVVHGQSA-N Val-His-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SDSCOOZQQGUQFC-GVXVVHGQSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 101001042198 Vicia sativa subsp. nigra Bowman-Birk type proteinase inhibitor Proteins 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 241000726445 Viroids Species 0.000 description 1
- 101710086987 X protein Proteins 0.000 description 1
- 229920000392 Zymosan Polymers 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- VJHCJDRQFCCTHL-UHFFFAOYSA-N acetic acid 2,3,4,5,6-pentahydroxyhexanal Chemical compound CC(O)=O.OCC(O)C(O)C(O)C(O)C=O VJHCJDRQFCCTHL-UHFFFAOYSA-N 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 229940027570 adenoviral vector vaccine Drugs 0.000 description 1
- 239000003463 adsorbent Substances 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 235000010419 agar Nutrition 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 239000000783 alginic acid Substances 0.000 description 1
- 235000010443 alginic acid Nutrition 0.000 description 1
- 229920000615 alginic acid Polymers 0.000 description 1
- 229960001126 alginic acid Drugs 0.000 description 1
- 150000004781 alginic acids Chemical class 0.000 description 1
- 229940037003 alum Drugs 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 230000005875 antibody response Effects 0.000 description 1
- 229940121375 antifungal agent Drugs 0.000 description 1
- 239000003429 antifungal agent Substances 0.000 description 1
- 101150010487 are gene Proteins 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 229940009098 aspartate Drugs 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 210000004666 bacterial spore Anatomy 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 239000000440 bentonite Substances 0.000 description 1
- 229910000278 bentonite Inorganic materials 0.000 description 1
- 235000012216 bentonite Nutrition 0.000 description 1
- SVPXDRXYRYOSEX-UHFFFAOYSA-N bentoquatam Chemical compound O.O=[Si]=O.O=[Al]O[Al]=O SVPXDRXYRYOSEX-UHFFFAOYSA-N 0.000 description 1
- 238000010256 biochemical assay Methods 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000001815 biotherapy Methods 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000000337 buffer salt Substances 0.000 description 1
- GMRQFYUYWCNGIN-NKMMMXOESA-N calcitriol Chemical compound C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@@H](CCCC(C)(C)O)C)=C\C=C1\C[C@@H](O)C[C@H](O)C1=C GMRQFYUYWCNGIN-NKMMMXOESA-N 0.000 description 1
- 235000020964 calcitriol Nutrition 0.000 description 1
- 239000011612 calcitriol Substances 0.000 description 1
- CJZGTCYPCWQAJB-UHFFFAOYSA-L calcium stearate Chemical compound [Ca+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O CJZGTCYPCWQAJB-UHFFFAOYSA-L 0.000 description 1
- 239000008116 calcium stearate Substances 0.000 description 1
- 235000013539 calcium stearate Nutrition 0.000 description 1
- 229940023860 canarypox virus HIV vaccine Drugs 0.000 description 1
- 230000036952 cancer formation Effects 0.000 description 1
- 231100000504 carcinogenesis Toxicity 0.000 description 1
- 231100000260 carcinogenicity Toxicity 0.000 description 1
- 230000007670 carcinogenicity Effects 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 239000013553 cell monolayer Substances 0.000 description 1
- 230000002032 cellular defenses Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- SQQXRXKYTKFFSM-UHFFFAOYSA-N chembl1992147 Chemical compound OC1=C(OC)C(OC)=CC=C1C1=C(C)C(C(O)=O)=NC(C=2N=C3C4=NC(C)(C)N=C4C(OC)=C(O)C3=CC=2)=C1N SQQXRXKYTKFFSM-UHFFFAOYSA-N 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 229960004926 chlorobutanol Drugs 0.000 description 1
- 238000011210 chromatographic step Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 239000013601 cosmid vector Substances 0.000 description 1
- 229960005168 croscarmellose Drugs 0.000 description 1
- 230000009260 cross reactivity Effects 0.000 description 1
- 239000001767 crosslinked sodium carboxy methyl cellulose Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 102000003675 cytokine receptors Human genes 0.000 description 1
- 108010057085 cytokine receptors Proteins 0.000 description 1
- 230000000120 cytopathologic effect Effects 0.000 description 1
- 230000007850 degeneration Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 239000001177 diphosphate Substances 0.000 description 1
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 1
- 235000011180 diphosphates Nutrition 0.000 description 1
- 239000002612 dispersion medium Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000012377 drug delivery Methods 0.000 description 1
- 206010014599 encephalitis Diseases 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 210000003743 erythrocyte Anatomy 0.000 description 1
- BEFDCLMNVWHSGT-UHFFFAOYSA-N ethenylcyclopentane Chemical compound C=CC1CCCC1 BEFDCLMNVWHSGT-UHFFFAOYSA-N 0.000 description 1
- MVPICKVDHDWCJQ-UHFFFAOYSA-N ethyl 3-pyrrolidin-1-ylpropanoate Chemical compound CCOC(=O)CCN1CCCC1 MVPICKVDHDWCJQ-UHFFFAOYSA-N 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 102000034287 fluorescent proteins Human genes 0.000 description 1
- 108091006047 fluorescent proteins Proteins 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 210000004602 germ cell Anatomy 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 235000004554 glutamine Nutrition 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 125000005456 glyceride group Chemical group 0.000 description 1
- 150000002337 glycosamines Chemical group 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 1
- 108010008671 glycyl-tryptophyl-methionine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 239000010440 gypsum Substances 0.000 description 1
- 229910052602 gypsum Inorganic materials 0.000 description 1
- 230000035931 haemagglutination Effects 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 230000004727 humoral immunity Effects 0.000 description 1
- 230000008348 humoral response Effects 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 239000008172 hydrogenated vegetable oil Substances 0.000 description 1
- 235000019447 hydroxyethyl cellulose Nutrition 0.000 description 1
- 229940124669 imidazoquinoline Drugs 0.000 description 1
- 210000002865 immune cell Anatomy 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 239000000568 immunological adjuvant Substances 0.000 description 1
- 239000007943 implant Substances 0.000 description 1
- 238000002513 implantation Methods 0.000 description 1
- 238000000126 in silico method Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000012678 infectious agent Substances 0.000 description 1
- 230000004941 influx Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 229960003130 interferon gamma Drugs 0.000 description 1
- 229940047124 interferons Drugs 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 150000002605 large molecules Chemical class 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 230000021633 leukocyte mediated immunity Effects 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 230000004777 loss-of-function mutation Effects 0.000 description 1
- 239000007937 lozenge Substances 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 230000003211 malignant effect Effects 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 229920000609 methyl cellulose Polymers 0.000 description 1
- 239000001923 methylcellulose Substances 0.000 description 1
- 235000010981 methylcellulose Nutrition 0.000 description 1
- 239000008108 microcrystalline cellulose Substances 0.000 description 1
- 235000019813 microcrystalline cellulose Nutrition 0.000 description 1
- 229940016286 microcrystalline cellulose Drugs 0.000 description 1
- 208000024191 minimally invasive lung adenocarcinoma Diseases 0.000 description 1
- BSOQXXWZTUDTEL-ZUYCGGNHSA-N muramyl dipeptide Chemical compound OC(=O)CC[C@H](C(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](C)O[C@H]1[C@H](O)[C@@H](CO)O[C@@H](O)[C@@H]1NC(C)=O BSOQXXWZTUDTEL-ZUYCGGNHSA-N 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 1
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 229960001592 paclitaxel Drugs 0.000 description 1
- 230000035515 penetration Effects 0.000 description 1
- 230000002688 persistence Effects 0.000 description 1
- 239000008177 pharmaceutical agent Substances 0.000 description 1
- 229940124531 pharmaceutical excipient Drugs 0.000 description 1
- 229960003742 phenol Drugs 0.000 description 1
- 238000002205 phenol-chloroform extraction Methods 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 101150088856 pix gene Proteins 0.000 description 1
- 229920000724 poly(L-arginine) polymer Polymers 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 244000144977 poultry Species 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 235000013930 proline Nutrition 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 229940124551 recombinant vaccine Drugs 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 235000004400 serine Nutrition 0.000 description 1
- 108010064927 seryl-glutaminyl-asparaginyl-tyrosyl-prolyl-isoleucyl-valyl-glutamine Proteins 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000002356 single layer Substances 0.000 description 1
- 229940045902 sodium stearyl fumarate Drugs 0.000 description 1
- HSFQBFMEWSTNOW-UHFFFAOYSA-N sodium;carbanide Chemical group [CH3-].[Na+] HSFQBFMEWSTNOW-UHFFFAOYSA-N 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 230000000392 somatic effect Effects 0.000 description 1
- 230000037439 somatic mutation Effects 0.000 description 1
- 229940075582 sorbic acid Drugs 0.000 description 1
- 235000010199 sorbic acid Nutrition 0.000 description 1
- 239000004334 sorbic acid Substances 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 235000010356 sorbitol Nutrition 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 230000003393 splenic effect Effects 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 208000003265 stomatitis Diseases 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000000829 suppository Substances 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 238000013268 sustained release Methods 0.000 description 1
- 239000012730 sustained-release form Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 239000006188 syrup Substances 0.000 description 1
- 235000020357 syrup Nutrition 0.000 description 1
- 239000003826 tablet Substances 0.000 description 1
- 239000000454 talc Substances 0.000 description 1
- 229910052623 talc Inorganic materials 0.000 description 1
- 235000012222 talc Nutrition 0.000 description 1
- RCINICONZNJXQF-MZXODVADSA-N taxol Chemical compound O([C@@H]1[C@@]2(C[C@@H](C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-MZXODVADSA-N 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 235000008521 threonine Nutrition 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 239000003970 toll like receptor agonist Substances 0.000 description 1
- 229940044655 toll-like receptor 9 agonist Drugs 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 239000013638 trimer Substances 0.000 description 1
- 238000005829 trimerization reaction Methods 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 230000034512 ubiquitination Effects 0.000 description 1
- 238000010798 ubiquitination Methods 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 239000008158 vegetable oil Substances 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 238000002424 x-ray crystallography Methods 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 238000003158 yeast two-hybrid assay Methods 0.000 description 1
- XOOUIPVCVHRTMJ-UHFFFAOYSA-L zinc stearate Chemical compound [Zn+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O XOOUIPVCVHRTMJ-UHFFFAOYSA-L 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
- A61K39/235—Adenoviridae
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/53—DNA (RNA) vaccination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10321—Viruses as such, e.g. new isolates, mutants or their genomic sequences
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10322—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10323—Virus like particles [VLP]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10341—Use of virus, viral particle or viral elements as a vector
- C12N2710/10343—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
Abstract
본 발명은 일반적인 인구 집단에서 면역원성이 높고, 기 존재하는 면역이 부재하는 신규한 아데노바이러스 균주에 관한 것이다. 기 존재하는 면역의 부재는 아데노바이러스 캡시드 단백질 헥손의 신규한 초가변 영역에 기인한다. 신규한 아데노바이러스 균주는 또한 증식 능력이 향상되었다. 본 발명은 이들 균주를 기반으로 한 재조합 바이러스, 바이러스-유사 입자 및 벡터뿐만 아니라, 이들 신규 아데노바이러스 균주의 뉴클레오타이드 및 아미노산 서열을 제공한다. 약제학적 조성물 및 질병의 치료 또는 예방에서의 의학적 용도, 및 신규 서열, 재조합 바이러스, 바이러스-유사 입자 및 벡터를 사용하는 아데노바이러스 또는 바이러스-유사 입자의 제조 방법이 추가로 제공된다.
Description
본 발명은 일반적인 인구 집단에서 면역원성이 높고, 기 존재하는 면역이 부재하는 신규 아데노바이러스 균주에 관한 것이다. 기 존재하는 면역의 부재는 아데노바이러스 캡시드 단백질 헥손의 신규한 초가변 영역에 기인한다. 신규 아데노바이러스 균주는 또한 증식 능력이 향상되었다. 본 발명은 이들 균주를 기반으로 한 재조합 바이러스, 바이러스-유사 입자 및 벡터뿐만 아니라, 이들 신규 아데노바이러스 균주의 뉴클레오타이드 및 아미노산 서열을 제공한다. 약제학적 조성물 및 질병의 치료 또는 예방에서의 의학적 용도, 및 신규 서열, 재조합 바이러스, 바이러스-유사 입자 및 벡터를 사용하는 아데노바이러스 또는 바이러스-유사 입자의 제조 방법이 추가로 제공된다.
아데노바이러스 (Ad)는 무 외피의 정이십면체형 캡시드 구조를 갖고, 양서류, 조류 및 포유류에서 발견되는 이중 가닥 DNA 바이러스의 대규모 패밀리를 포함한다 (문헌[Straus, Adenovirus infections in humans; The Adenoviruses, 451-498, 1984; Hierholzer et al., J. Infect.Dis.,158: 804-813,1988; Schnurr and Dondero, Intervirology., 36: 79-83,1993; Jong et al., J. Clin. Microbiol., 37: 3940-3945: 1999]). 레트로바이러스와는 달리, 아데노바이러스는 숙주 세포의 게놈으로 혼입되지 않으면서 분열 및 비 분열 세포 둘 모두를 포함하여 다수의 포유류 종의 수많은 세포 유형을 형질도입시킬 수 있다.
일반적으로 말하면, 아데노바이러스 DNA는 통상적으로 매우 안정하고, 형질전환 또는 종양 발생이 일어나지 않는 한, 에피솜 (예를 들어, 염색체외)으로 유지된다. 또한, 아데노바이러스 벡터는 임상 등급 조성물의 약제학적 규모의 생성에 용이한 적용 가능하고, 적절하게 규정되어 있는 생성 시스템에서 높은 수율로 증식될 수 있다. 이러한 특징 및 그의 적절하게 특성화된 분자 유전학은 재조합 아데노바이러스 벡터를 백신 전달체로서 사용하기에 적합한 후보가 되도록 한다. 재조합 아데노바이러스 벡터의 생성은 비기능성으로 설계되거나, 결실된 아데노바이러스 유전자 산물의 기능을 보완할 수 있는 패키징 (packaging) 세포주의 사용에 의존할 수 있다.
현재, 적절하게 특성화된 2개의 인간 서브 그룹 C 아데노바이러스 혈청형 (즉, hAd2 및 hAd5)은 유전자 요법에 사용되는 대부분의 아데노바이러스 벡터에 대한 바이러스 백본의 공급원으로서 광범위하게 사용된다. 복제-결함 인간 아데노바이러스 벡터는 또한 다양한 감염원으로부터 유래된 다양한 면역원의 전달을 위한 백신 전달체로서 시험되어 왔다. 실험 동물 (예를 들어, 설치류, 개 및 비인간 영장류)에서 수행된 연구에 의하면, 다른 항원뿐만 아니라, 면역원을 인코딩하는 전이유전자를 보유하는 재조합 복제-결함 인간 아데노바이러스 벡터가 전이유전자 산물에 대한 체액성 및 세포-매개 면역 반응을 모두 유발하는 것으로 나타난다. 일반적으로 말하면, 연구자들은 면역 반응을 유발할 것으로 예측되는 고 투여량의 재조합 아데노바이러스 벡터를 이용하는 면역화 프로토콜을 사용하거나; 상이한 혈청형으로부터 유래되지만 부스팅 면역화와 동일한 전이유전자 산물을 보유하는 아데노바이러스 벡터의 순차적 투여를 사용하는 면역화 프로토콜을 사용함으로써, 비인간 실험 시스템에서 백신 전달체로서 인간 아데노바이러스 벡터를 사용한 성공 예를 보고한 바 있다 (문헌[Mastrangeli, et al., Human Gene Therapy, 7: 79-87 (1996)]).
C 아데노바이러스종 (예를 들어, Ad5, Ad6 및 ChAd63)로부터 유래된 벡터는 가장 면역원성이 높다 (문헌[Colloca et al., Sci. Transl. Med. 4 (115), 2012]). 특히, 인간 아데노바이러스 5형 (Ad5)을 기반으로 한 바이러스 벡터가 유전자 요법 및 백신 적용을 위해 개발되었다. Ad5 기반 벡터는 동물 모델에서 매우 효율적이지만, 인간에서 Ad5 야생형 바이러스에 대한 기 존재하는 면역의 존재는 임상 실험에서 유전자 형질도입 효율을 감소시키는 것으로 나타났다 (문헌[Moore JP et al. Science. 2008 May 9; 320 (5877):753-5]). 따라서, 일반 인구에서의 면역은 Ad5를 기반으로 한 Ad 벡터 백신의 광범위한 적용을 제한한다. 한편, 희소 인간 아데노바이러스는 Ad5보다 면역원성이 더 적다 (문헌[Colloca et al., Sci. Transl. Med. 4 (115), 2012]). 비인간 아데노바이러스를 기반으로 한 벡터는 일반 인구 집단에서 기 존재하는 면역이 부재한다 (문헌[Farina et al., J. Virol. 75 (23), 11603-11613, 2001]).
따라서, 인간에서 면역원성이 높고, 기 존재하는 면역이 낮거나 부재하는 아데노바이러스 벡터가 필요한 상황이다. 바람직하게는, 이들 아데노바이러스 벡터는 복제 측면에서 높은 증식성을 갖는다.
[발명의 요약]
제1 양상에서, 본 발명은 다음을 포함하는 아데노바이러스 헥손 단백질을 인코딩하는 단리된 폴리뉴클레오타이드를 제공한다:
A) (i) 서열번호 11에 따른 아미노산 서열 또는 서열번호 11에 적어도 85% 서열 동일성을 갖고, 27 번이 A가 아니고 바람직하게는 V인 그의 변이체를 포함하는 제1 초가변 영역 HVR1,
(ii) 서열번호 12에 따른 아미노산 서열 또는 서열번호 12에 적어도 85% 서열 동일성을 갖고, 1 번이 L이 아니고 바람직하게는 I인 그의 변이체를 포함하는 제2 가변 영역 HVR2,
(iii) 서열번호 13에 따른 아미노산 서열 또는 서열번호 13에 적어도 85% 서열 동일성을 갖고, 7 번이 V가 아니고 바람직하게는 A인 그의 변이체를 포함하는 제3 가변 영역 HVR3,
(iv) 서열번호 14에 따른 아미노산 서열 또는 서열번호 14에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 제4 초가변 영역 HVR4,
(v) 서열번호 15에 따른 아미노산 서열 또는 서열번호 15에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 제5 초가변 영역 HVR5,
(vi) 서열번호 16에 따른 아미노산 서열 또는 서열번호 16에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 제6 초가변 영역 HVR6, 및
(vii) 서열번호 17에 따른 아미노산 서열 또는 서열번호 17에 적어도 85% 서열 동일성을 갖고, 1 번이 I가 아니고 바람직하게는 V인 그의 변이체를 포함하는 제7 초가변 영역 HVR7; 또는
B) (i) 서열번호 18에 따른 아미노산 서열 또는 서열번호 18에 적어도 85% 서열 동일성을 갖고, 8 번이 V가 아니고 바람직하게는 E이고/이거나, 12 번이 D가 아니고 바람직하게는 E이고/이거나, 13 번이 E가 아니고 바람직하게는 D이고/이거나, 14 번이 L이 아니고 바람직하게는 V인 그의 변이체를 포함하는 제1 초가변 영역 HVR1,
(ii) 서열번호 19에 따른 아미노산 서열 또는 서열번호 19에 적어도 85% 서열 동일성을 갖고, 10 번이 D가 아니고 바람직하게는 E인 그의 변이체를 포함하는 제2 가변 영역 HVR2,
(iii) 서열번호 20에 따른 아미노산 서열 또는 서열번호 20에 적어도 85% 서열 동일성을 갖고, 6 번이 T가 아니고 바람직하게는 A인 그의 변이체를 포함하는 제3 가변 영역 HVR3,
(iv) 서열번호 21에 따른 아미노산 서열 또는 서열번호 21에 적어도 85% 서열 동일성을 갖고, 9 번이 L이 아니고 바람직하게는 M인 그의 변이체를 포함하는 제4 초가변 영역 HVR4,
(v) 서열번호 22에 따른 아미노산 서열 또는 서열번호 22에 적어도 85% 서열 동일성을 갖고, 3 번이 T가 아니고 바람직하게는 S인 그의 변이체를 포함하는 제5 초가변 영역 HVR5,
(vi) 서열번호 23에 따른 아미노산 서열 또는 서열번호 23에 적어도 85% 서열 동일성을 갖고, 9 번이 I가 아니고 바람직하게는 V인 그의 변이체를 포함하는 제6 초가변 영역 HVR6 및
(vii) 서열번호 24에 따른 아미노산 서열 또는 서열번호 24에 적어도 85% 서열 동일성을 갖고, 8 번이 I가 아니고 바람직하게는 V인 그의 변이체를 포함하는 제7 초가변 영역 HVR7; 또는
C) (i) 서열번호 25에 따른 아미노산 서열 또는 서열번호 25에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 제1 초가변 영역 HVR1,
(ii) 서열번호 26에 따른 아미노산 서열 또는 서열번호 26에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 제2 가변 영역 HVR2,
(iii) 서열번호 27에 따른 아미노산 서열 또는 서열번호 27에 적어도 85% 서열 동일성을 갖고, 7 번이 V가 아니고 바람직하게는 A인 그의 변이체를 포함하는 제3 가변 영역 HVR3,
(iv) 서열번호 28에 따른 아미노산 서열 또는 서열번호 28에 적어도 85% 서열 동일성을 갖고, 10 번이 E가 아니고 바람직하게는 Q인 그의 변이체를 포함하는 제4 초가변 영역 HVR4,
(v) 서열번호 29에 따른 아미노산 서열 또는 서열번호 29에 적어도 85% 서열 동일성을 갖고, 3 번이 T가 아니고 바람직하게는 S인 그의 변이체를 포함하는 제5 초가변 영역 HVR5,
(vi) 서열번호 30에 따른 아미노산 서열 또는 서열번호 30에 적어도 85% 서열 동일성을 갖고, 9 번이 I가 아니고 바람직하게는 V인 그의 변이체를 포함하는 제6 초가변 영역 HVR6 및
(vii) 서열번호 31에 따른 아미노산 서열 또는 서열번호 31에 적어도 85% 서열 동일성을 갖고, 8 번이 I가 아니고 바람직하게는 V이고/이거나, 11 번이 T가 아니고 바람직하게는 S인 그의 변이체를 포함하는 제7 초가변 영역 HVR7; 또는
D) (i) 서열번호 32에 따른 아미노산 서열 또는 서열번호 32에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 제1 초가변 영역 HVR1,
(ii) 서열번호 33에 따른 아미노산 서열 또는 서열번호 33에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 제2 가변 영역 HVR2,
(iii) 서열번호34에 따른 아미노산 서열 또는 서열번호 34에 적어도 85% 서열 동일성을 갖고, 6 번이 T가 아니고 바람직하게는 A인 그의 변이체를 포함하는 제3 가변 영역 HVR3,
(iv) 서열번호 35에 따른 아미노산 서열 또는 서열번호 35에 적어도 85% 서열 동일성을 갖고, 6 번이 Q가 아니고 바람직하게는 K이고/이거나, 10 번이 E가 아니고 바람직하게는 Q인 그의 변이체를 포함하는 제4 초가변 영역 HVR4,
(v) 서열번호 36에 따른 아미노산 서열 또는 서열번호 36에 적어도 85% 서열 동일성을 갖고, 3 번이 T가 아니고 바람직하게는 S인 그의 변이체를 포함하는 제5 초가변 영역 HVR5,
(vi) 서열번호 37에 따른 아미노산 서열 또는 서열번호 37에 적어도 85% 서열 동일성을 갖고, 1 번이 K가 아니고 바람직하게는 T이고/이거나, 9 번이 I가 아니고 바람직하게는 V인 그의 변이체를 포함하는 제6 초가변 영역 HVR6 및
(vii) 서열번호 38에 따른 아미노산 서열 또는 서열번호 38에 적어도 85% 서열 동일성을 갖고, 8 번이 I가 아니고 바람직하게는 V인 그의 변이체를 포함하는 제7 초가변 영역 HVR7; 또는
E) (i) 서열번호 39에 따른 아미노산 서열 또는 서열번호 39에 적어도 85% 서열 동일성을 갖고, 27 번이 A가 아니고 바람직하게는 V인 그의 변이체를 포함하는 제1 초가변 영역 HVR1,
(ii) 서열번호 40에 따른 아미노산 서열 또는 서열번호 40에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 제2 가변 영역 HVR2,
(iii) 서열번호 41에 따른 아미노산 서열 또는 서열번호 41에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 제3 가변 영역 HVR3,
(iv) 서열번호 42에 따른 아미노산 서열 또는 서열번호 42에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 제4 초가변 영역 HVR4,
(v) 서열번호 43에 따른 아미노산 서열 또는 서열번호 43에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 제5 초가변 영역 HVR5,
(vi) 서열번호 44에 따른 아미노산 서열 또는 서열번호 44에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 제6 초가변 영역 HVR6 및
(vii) 서열번호 45에 따른 아미노산 서열 또는 서열번호 45에 적어도 85% 서열 동일성을 갖고, 1 번이 I가 아니고 바람직하게는 V인 그의 변이체를 포함하는 제7 초가변 영역 HVR7.
제2 양상에서, 본 발명은 제1 양상의 폴리뉴클레오타이드를 포함하는 아데노바이러스, 바람직하게는 복제-불능 아데노바이러스를 인코딩하는 단리된 폴리뉴클레오타이드를 제공한다.
제양상에서, 본 발명은 제1 양상의 단리된 폴리뉴클레오타이드에 의해 인코딩되는 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드를 제공한다.
제4 양상에서, 본 발명은 제1 양상의 단리된 폴리뉴클레오타이드에 의해 인코딩되는 아데노바이러스, 또는 제1 양상에 따른 단리된 폴리뉴클레오타이드 및/또는 제3 양상에 따른 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드를 포함하는 단리된 아데노바이러스, 바람직하게는 복제-불능 아데노바이러스를 제공한다.
제5 양상에서, 본 발명은 제1 양상의 단리된 폴리뉴클레오타이드에 의해 인코딩되는 바이러스-유사 입자를 제공한다.
제6 양상에서, 본 발명은 제1 양상의 단리된 폴리뉴클레오타이드를 포함하는 벡터를 제공한다.
제7 양상에서, 본 발명은 (i) 애주번트, (ii) 제1 또는 제2 양상의 단리된 폴리뉴클레오타이드, 제3 양상의 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드, 제4 양상의 아데노바이러스, 제5 양상의 바이러스-유사 입자 또는 제6 양상의 벡터 및 선택적으로 (iii) 약제학적으로 허용 가능한 부형제를 포함하는 조성물을 제공한다.
제8 양상에서, 본 발명은 제1 또는 제2 양상의 단리된 폴리뉴클레오타이드, 제3 양상의 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드, 제4 양상의 아데노바이러스, 제5 양상의 바이러스-유사 입자 또는 제6 양상의 벡터를 포함하는 세포를 제공한다.
제9 양상에서, 본 발명은 질병의 치료 또는 예방에 사용하기 위한 제1 또는 제2 양상의 단리된 폴리뉴클레오타이드, 제3 양상의 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드, 제4 양상의 아데노바이러스, 제5 양상의 바이러스-유사 입자, 또는 제6 양상의 벡터 및/또는 제7 양상의 조성물을 제공한다.
제10 양상에서, 본 발명은 (i) 세포에서 제1 또는 제2 양상의 단리된 폴리뉴클레오타이드를 발현시켜, 아데노바이러스 또는 아데노바이러스-유사 입자가 세포내에서 조립되는 단계; 및
(ii) 세포 또는 세포 주위의 배지로부터 아데노바이러스 또는 아데노바이러스-유사 입자를 단리하는 단계를 포함하는, 아데노바이러스 또는 아데노바이러스-유사 입자를 생성하는 시험관내(in vitro) 방법에 관한 것이다.
본 발명을 하기에 상세하게 기재하기 전에, 본 발명이 본 명세서에 기재된 특정 방법론, 프로토콜 및 시약에 제한되지 않는 것으로 이해되어야 한다. 또한, 본 명세서에서 사용된 용어는 특정 실시형태를 설명하기 위한 것이고, 본 발명의 범위를 제한하려는 것이 아니며, 본 발명의 범위는 첨부된 청구범위에 의해서만 제한됨을 이해해야 한다. 달리 정의되지 않는 한, 본 명세서에 사용된 모든 기술 및 과학 용어는 당업자가 일반적으로 이해하는 것과 동일한 의미를 갖는다.
바람직하게는, 본 명세서에 사용되는 바와 같이, 용어는 문헌["A multilingual glossary of biotechnological terms: (IUPAC Recommendations)", Leuenberger, H.G.W, Nagel, B. and Klbl, H. eds. (1995), Helvetica Chimica Acta, CH-4010 Basel, Switzerland)]에 기재된 바 및 Axel Kleemann 및 Jurgen Engel의 문헌["Pharmaceutical Substances: Syntheses, Patents, Applications"]; Susan Budavari 등에 의해 편집된 문헌[Thieme Medical Publishing, 1999; the "Merck Index: An Encyclopedia of Chemicals, Drugs, and Biologicals"], 2001년 미국 메릴랜드주 록빌 소재의 United States Pharmcopeial Convention, Inc.에 의해 발간된 문헌[CRC Press, 1996, and the United States Pharmacopeia-25/National Formulary-20]에 기재된 바와 같이 규정된다.
본 명세서 및 하기의 청구범위 전체에서, 문맥상 달리 요구하지 않는 한, 단어 "~들을 포함하다" 및 "~를 포함하다" 및 "~를 포함하는"과 같은 변용예는 언급된 특징, 정수 또는 단계 또는 특징, 정수 또는 단계의 그룹을 포함하지만, 임의의 다른 특징, 정수 또는 단계 또는 정수 또는 단계의 그룹을 배제하지 않는 것을 의미하는 것으로 이해될 것이다. 하기 구절에서, 본 발명의 상이한 양상이 더욱 상세하게 규정된다. 이렇게 규정된 각각의 양상은 달리 명확하게 지시되지 않는 한 임의의 다른 양상 또는 양상들과 조합될 수 있다. 특히, 바람직하거나 유리한 것으로 언급된 임의의 특징은 바람직하거나 유리한 것으로 언급된 임의의 다른 특징 또는 특징들과 조합될 수 있다.
본 명세서의 내용 전체에 걸쳐 다수의 문헌이 인용되어 있다. 본 명세서에 인용된 각각의 문헌 (모든 특허, 특허 출원, 과학 간행물, 제조업체의 설명서, 지침 등 포함)은 상기 또는 하기에 기재되었는지에 상관없이 그 전문이 본 명세서에 참조로 포함된다. 본 명세서 내의 문헌은 모두 선행 발명의 관점에서 그 개시내용이 본 발명을 선행한다는 인정으로 해석되어서는 안된다.
도면의 간단한 설명
도 1: BAC GAd-GAG A/L/S 셔틀 벡터 (shuttle vector)의 개략도.
도 2: E1-및 E3-결실된 GAdNou19 GAG (DE1E3) BAC 플라스미드의 개략도.
도 3: E1-및 E3-결실된 GAdNou20 GAG (DE1E3) BAC 플라스미드의 개략도.
도 4: 동일한 발현 카세트를 보유하는 벤치마크 Ad5 벡터와 비교한 Hek293에서의 GADNOU19 및 GADNOU20의 증식성.
도 5: GAG 항원을 인코딩하는 GADNOU19 및 GADNOU20 벡터의 면역원성. GADNOU19 GAG (DE1DE3) 및 GADNOU20 GAG (DE1DE3) 벡터의 면역 역가를 IFN-γ ELISpot에 의해 결정하였다. 각 벡터의 3x10^7 및 3x10^6 vp로 면역화한지 3주 후에 T 세포 반응을 측정한다. CD8+ 에피토프를 인코딩하는 면역 우성 gag 펩타이드에 의한 자극에 반응하여 수백만개의 비장 세포당 IFNγ를 생성하는 T 세포의 수가 도시되어 있다.
뉴클레오타이드 및 아미노산 서열
하기 표 1은 본 명세서에 언급된 서열에 대한 요약을 제공한다 (GADNOU+ 수: 단리된 아데노바이러스 균주; *: 아미노산 서열을 인코딩하는 GADNOU 게놈의 상응하는 뉴클레오타이드 서열, 서열 상응성은 나열된 순서에 따르고, 예를 들어 서열번호 11의 경우, HVR1 GADNOU 20은 서열번호 1의 x-x에 상응하고, HVR1 GADNOU 21은 서열번호 2의 x-x에 상응하고, HVR1 GADNOU 25는 서열번호 3의 x-x에 상응함). GADNOU는 본 발명자들의 균주명이다. 하기에 제공된 헥손, 펜톤, 섬유에 대한 게놈 좌표의 범위 (GADNOU 게놈에서 서열번호 46 내지 54)는 최종 종결 코돈을 포함하지 않으며, 이는 좌표를 사용하여 헥손, 펜톤 또는 섬유를 인코딩하는 폴리뉴클레오타이드를 언급할 때 본 개시내용에 선택적으로 포함/추가된다.
하기 표 2a 및 표 2b는 GADNOU 게놈에서 CDS, RNA 및 ITR의 게놈 범위/좌표를 제공한다. 이는 표에 열거되고 각각의 실시형태에 바람직하게 포함된 본 명세서의 게놈 요소에 대한 임의의 인용에 적용된다.
[표 2a]
[표 2b]
본 발명의 양상 및 그의 특정 실시형태
본 발명은 상기 발명의 요약에서 제시된 바와 같은 수 개의 양상에 관한 것이다. 이들 양상은 하기에 기재되는 대안적인 실시형태 및 바람직한 실시형태를 포함한다.
제1 양상에서, 본 발명은 상기의 발명의 내용에 규정된 바와 같은 아데노바이러스 헥손 단백질을 인코딩하는 단리된 폴리뉴클레오타이드를 제공한다.
바람직한 실시형태에서, HVR 변이체는 각각의 서열번호에 적어도 90% 이상, 바람직하게는 적어도 95% 서열 동일성을 갖는다. 서열 동일성 백분율 수준에 의한 규정에 대안적으로, HVR은 각각의 서열번호 내에 특정 수의 아미노산 돌연변이를 갖는 것으로 규정될 수 있다. 그 후 돌연변이 수는 다음과 같다: 적어도 85% 서열 동일성 대신, 임의의 HVR1에 최대 4개의 돌연변이, 임의의 HVR2에 최대 2개의 돌연변이, 임의의 HVR3에 최대 1개의 돌연변이, 임의의 HVR4에 최대 1개의 돌연변이, 임의의 HVR5에 최대 2개의 돌연변이, 임의의 HVR6에 최대 1개의 돌연변이 및 임의의 HVR7에 최대 3 돌연변이; 적어도 90% 서열 동일성 대신, 임의의 HVR1에 최대 2개의 돌연변이, 임의의 HVR2에 최대 1개의 돌연변이, 임의의 HVR3에 최대 1개의 돌연변이 및 바람직하게는 0개의 돌연변이, 임의의 HVR4에 최대 1개의 돌연변이, 임의의 HVR5에 최대 1개의 돌연변이, 임의의 HVR6에 최대 1개의 돌연변이 및 바람직하게는 0개의 돌연변이 및 임의의 HVR7에 최대 2개의 돌연변이; 적어도 95% 서열 동일성 대신, 임의의 HVR1에 최대 1개의 돌연변이, 임의의 HVR2에 최대 1개의 돌연변이 및 바람직하게는 0개의 돌연변이, 임의의 HVR3에 최대 1개의 돌연변이 및 바람직하게는 0개의 돌연변이, 임의의 HVR4에 최대 1개의 돌연변이 및 바람직하게는 0개의 돌연변이, 임의의 HVR5에 최대 1개의 돌연변이 및 바람직하게는 0개의 돌연변이, 임의의 HVR6에 최대 1개의 돌연변이 및 바람직하게는 0개의 돌연변이 및 임의의 HVR7에 최대 1개의 돌연변이.
당 업계, 예를 들어 문헌[Bradley et al. (J Virol., 2012 Jan;86(2):1267-72)]에 공지된 바와 같이, 아데노바이러스 중화 항체는 헥손 초가변 영역을 표적화하고, 아데노바이러스의 HVR 영역을 혈청 내에 다량 존재하는 것으로 대체함으로써, 아데노바이러스가 면역 숙주의 면역 시스템을 우회할 수 있다. 따라서, 상기 HVR은 하기에 규정된 각각의 헥손 단백질을 사용할 수 있지만, 이는 헥손 단백질 및 또한 하기 펜톤 및 섬유 단백질과는 별개의, 즉 다른 헥손, 펜톤 및/또는 섬유 단백질을 갖는 상이한 아데노바이러스의 헥손 HVR을 대체함으로써 유용성을 갖는다.
바람직한 실시형태에서, 헥손 단백질은 다음을 포함한다:
A) 서열번호 46에 따른 아미노산 서열 또는 서열번호 46에 적어도 85% 서열 동일성을 갖는 그의 변이체,
B) 서열번호 47에 따른 아미노산 서열 또는 서열번호 47에 적어도 85% 서열 동일성을 갖는 그의 변이체,
C) 서열번호 48에 따른 아미노산 서열 또는 서열번호 48에 적어도 85% 서열 동일성을 갖는 그의 변이체,
D) 서열번호 49에 따른 아미노산 서열 또는 서열번호 49에 적어도 85% 서열 동일성을 갖는 그의 변이체, 및/또는
E) 서열번호 50에 따른 아미노산 서열 또는 서열번호 50에 적어도 85% 서열 동일성을 갖는 그의 변이체.
바람직한 실시형태에서, 헥손 변이체는 각각의 서열번호에 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98% 또는 99% 서열 동일성을 갖는다. 서열 동일성 백분율 수준에 의한 규정에 대안적으로, 헥손 변이체는 각각의 서열번호내에 특정 수의 아미노산 돌연변이를 갖는 것으로 규정될 수 있다. 그 후 돌연변이 수는 다음과 같다: 적어도 85% 서열 동일성 대신, 임의의 헥손에 최대 143개의 돌연변이; 적어도 90% 서열 동일성 대신, 임의의 헥손에 최대 95개의 돌연변이; 적어도 95% 서열 동일성 대신, 임의의 헥손에 최대 47개의 돌연변이; 적어도 96% 서열 동일성 대신, 임의의 헥손에 최대 38개의 돌연변이; 적어도 97% 서열 동일성 대신, 임의의 헥손에 최대 28개의 돌연변이; 적어도 98% 서열 동일성 대신, 임의의 헥손에 최대 19개의 돌연변이; 적어도 99% 서열 동일성 대신, 임의의 헥손에 최대 9개의 돌연변이. 헥손 변이체이 상기에 각각의 HVR에 대해 규정된 것보다 더 적은 서열 동일성을 갖거나, 그의 HVR에 더 많은 돌연변이를 갖지 않음이 이해되어야 한다.
일 실시형태에서, 제1 양상의 단리된 폴리뉴클레오타이드는 서열번호 51 또는 52에 따른 아미노산 서열 또는 서열번호 51 또는 52에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 아데노바이러스 펜톤 단백질을 추가로 인코딩한다. 바람직한 실시형태에서, 펜톤 변이체는 각각의 서열번호에 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98% 또는 99% 서열 동일성을 갖는다. 서열 동일성 백분율 수준에 의한 규정에 대안적으로, 펜톤 변이체는 각각의 서열번호 내에 특정 수의 아미노산 돌연변이를 갖는 것으로 규정될 수 있다. 그 후 돌연변이 수는 다음과 같다: 적어도 85% 서열 동일성 대신, 임의의 펜톤에 최대 97개의 돌연변이; 적어도 90% 서열 동일성 대신, 임의의 펜톤에 최대 65개의 돌연변이; 적어도 95% 서열 동일성 대신, 임의의 펜톤에 최대 32개의 돌연변이; 적어도 96% 서열 동일성 대신, 임의의 펜톤에 최대 26개의 돌연변이; 적어도 97% 서열 동일성 대신, 임의의 펜톤에 최대 19개의 돌연변이; 적어도 98% 서열 동일성 대신, 임의의 펜톤에 최대 13개의 돌연변이; 적어도 99% 서열 동일성 대신, 임의의 펜톤에 최대 6개의 돌연변이.
바람직하게는, 서열번호 51 및 52의 펜톤 변이체는 각각 289 번이 D가 아니고, 바람직하게는 G이고, 341 번이 D가 아니고, 바람직하게는 N이다. 더욱 바람직하게는, 서열번호 52의 변이체는 또한 442 번이 A가 아니고, 더욱 바람직하게는 T이다.
또 다른 실시형태에서, 제1 양상의 단리된 폴리뉴클레오타이드 (즉, 헥손 및 가능하게는 펜톤 단백질 다음에 위치함)는 서열번호 53 또는 54에 따른 아미노산 서열 또는 서열번호 53 또는 54에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 아데노바이러스 섬유 단백질을 추가로 인코딩한다. 바람직한 실시형태에서, 섬유 변이체는 각각의 서열번호에 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98% 또는 99% 서열 동일성을 갖는다. 서열 동일성 백분율 수준에 의한 규정에 대안적으로, 섬유 변이체는 각각의 서열번호 내에 특정 수의 아미노산 돌연변이를 갖는 것으로 규정될 수 있다. 그 후 돌연변이 수는 다음과 같다: 적어도 85% 서열 동일성 대신, 임의의 섬유에 최대 89개의 돌연변이; 적어도 90% 서열 동일성 대신, 임의의 섬유에 최대 59개의 돌연변이; 적어도 95% 서열 동일성 대신, 임의의 섬유에 최대 29개의 돌연변이; 적어도 96% 서열 동일성 대신, 임의의 섬유에 최대 23개의 돌연변이; 적어도 97% 서열 동일성 대신, 임의의 섬유에 최대 17개의 돌연변이; 적어도 98% 서열 동일성 대신, 임의의 섬유에 최대 11개의 돌연변이; 적어도 99% 서열 동일성 대신, 임의의 섬유에 최대 5개의 돌연변이.
바람직하게는, 서열번호 53의 섬유 변이체는 181 번이 A가 아니고 바람직하게는 P이고/이거나, 474 번이 V가 아니고 바람직하게는 I이고/이거나, 4 번 및 5 번 사이에 S, 바람직하게는 아미노산이 삽입되어 있지 않다. 바람직하게는, 서열번호 54의 섬유 변이체는 90 번이 T가 아니고 바람직하게는 I이고/이거나, 7 번이 S이다 (바람직하게는 4 번 내지 7 번의 각각이 S임).
또 다른 실시형태에서, 제1 양상의 단리된 폴리뉴클레오타이드 (즉, 헥손 및 가능하게는 펜톤 및/또는 섬유 단백질 다음에 위치함)는 서열번호 57에 따른 뉴클레오타이드 서열 또는 서열번호 57에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 VA RNA II 비코딩 RNA를 추가로 인코딩한다. 대안적 또는 부가적으로, 이는 각각 서열번호 55 또는 56에 따른 뉴클레오타이드 서열 또는 서열번호 55 또는 56에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 VA RNA I 비코딩 RNA를 인코딩할 수 있다. 바람직한 실시형태에서, VA RNA 변이체는 각각의 서열번호에 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98% 또는 99% 서열 동일성을 갖는다. 서열 동일성 백분율 수준에 의한 규정에 대안적으로, VA RNA 변이체는 각각의 서열번호 내에 특정 수의 뉴클레오타이드 돌연변이를 갖는 것으로 규정될 수 있다. 그 후 돌연변이 수는 다음과 같다: 적어도 85% 서열 동일성 대신, VA RNA I에 최대 25개의 돌연변이 및 VA RNA II에 최대 26개의 돌연변이; 적어도 90% 서열 동일성 대신, VA RNA I에 최대 16개의 돌연변이 및 VA RNA II에 최대 17개의 돌연변이; 적어도 95% 서열 동일성 대신, 임의의 VA RNA에 최대 8개의 돌연변이; 적어도 96% 서열 동일성 대신, 임의의 VA RNA에 최대 6개의 돌연변이; 적어도 97% 서열 동일성 대신, 임의의 VA RNA에 최대 5개의 돌연변이; 적어도 98% 서열 동일성 대신, 임의의 VA RNA에 최대 3개의 돌연변이; 적어도 99% 서열 동일성 대신, 임의의 VA RNA에 최대 1개의 돌연변이.
바람직하게는, 서열번호 57의 VA RNA II 변이체는 (a) 79 번이 C가 아니고/아니거나, 80 번이 A가 아니고, 바람직하게는 79 번은 T이고/이거나, 80 번이 G이고, (b) 81 번이 A가 아니고, 바람직하게는 81 번은 G이다. 서열번호 55의 VA RNA I 변이체는 80 번이 G가 아니고, 바람직하게는 80 번은 A이다.
본 발명에 따른 VA RNA는 실시예 5에 제시된 바와 같이, 아데노바이러스 또는 아데노바이러스-유사 입자 생성을 향상시킨다.
제1 양상의 폴리뉴클레오타이드는 아데노바이러스 게놈에서 헥손, 펜톤 및/또는 섬유 유전자에 인접한, 다른 아데노바이러스 유전자 및 뉴클레오타이드 절편을 추가로 포함하는 것이 바람직하고, 참조번호로서 서열번호 1 내지 10을 사용한다. 상기 폴리뉴클레오타이드는 또한 아데노바이러스 입자 내로의 폴리뉴클레오타이드 패키징에 필요한 서열을 포함하는 것이 특히 바람직하다.
일반적으로, 제1 양상의 단리된 폴리뉴클레오타이드는 다음 중 적어도 하나를 포함하는 것이 바람직하다:
(a) 아데노바이러스 5' 말단, 바람직하게는 아데노바이러스 5' 역위 말단 반복;
(b) 아데노바이러스 Ela 영역 또는 13S, 12S 및 9S 영역 중의 것으로부터 선택되는 그의 단편;
(c) 아데노바이러스 Elb 영역 또는 소형 T, 대형 T 및 IX 영역으로 이루어진 군 중의 것으로부터 선택되는 그의 단편;
(d) 아데노바이러스 VA RNA 영역; 또는 VA RNA I 및 VA RNA II 영역으로 이루어진 군 중의 것으로부터 선택되는 그의 단편;
(e) 아데노바이러스 E2b 영역; 또는 소형 pTP, 중합효소 및 IVa2 영역으로 이루어진 군 중의 것으로부터 선택되는 그의 단편;
(f) 아데노바이러스 L1 영역 또는 그의 단편 (상기 단편은 28.1 kD 단백질, 중합효소, 아그노단백질, 52/55 kDa 단백질 및 IIIa 단백질로 이루어진 군으로부터 선택되는 아데노바이러스 단백질을 인코딩함);
(g) 아데노바이러스 L2 영역 또는 그의 단편 (상기 단편은 상기에 규정된 바와 같은 펜톤 단백질, VII, V 및 X 단백질로 이루어진 군으로부터 선택되는 아데노바이러스 단백질을 인코딩함);
(h) 아데노바이러스 L3 영역 또는 그의 단편 (상기 단편은 VI 단백질, 상기에 규정된 바와 같은 헥손 단백질 및 엔도프로테아제로 이루어진 군으로부터 선택되는 아데노바이러스 단백질을 인코딩함);
(i) 아데노바이러스 E2a 영역 또는 그의 단편 (상기 단편은 DBP 단백질로 이루어진 아데노바이러스 단백질을 인코딩함);
(j) 아데노바이러스 L4 영역 또는 그의 단편 (상기 단편은 100 kD 단백질, 22 kD 상동체, 33 kD 상동체 및 단백질 VIII로 이루어진 군으로부터 선택되는 아데노바이러스 단백질을 인코딩함);
(k) 아데노바이러스 E3 영역 또는 E3 ORF1, E3 ORF2, E3 ORF3, E3 ORF4, E3 ORF5, E3 ORF6, E3 ORF7, E3 ORF8 및 E3 ORF9로 이루어진 군으로부터 선택되는 그의 단편;
(l) 아데노바이러스 L5 영역 또는 그의 단편 (상기 단편은 상기에 규정된 바와 같은 섬유 단백질을 인코딩함);
(m) 아데노바이러스 E4 영역 또는 E4 ORF6/7, E4 ORF6, E4 ORF5, E4 ORF4, E4 ORF3, E4 ORF2 및 E4 ORF1로 이루어진 군으로부터 선택되는 그의 단편; 및/또는
(n) 아데노바이러스 3' 말단, 바람직하게는 아데노바이러스 3' 역위 말단 반복.
키메라 아데노바이러스를 형성하기 위한 이러한 요소는 표 1에 따른 (즉, 동일한 GADNOU로 부터) 제1 양상의 폴리뉴클레오타이드의 HVR 및/또는 헥손과 동일한 아데노바이러스 또는 상이한 아데노바이러스, 특히 상이한 종 중 하나, 예를 들어 인간 아데노바이러스 유래의 것일 수 있다.
상기 언급된 폴리뉴클레오타이드의 일부 실시형태에서, 폴리뉴클레오타이드는 상기 요약된 바와 같은 ((a) 내지 (m), 예를 들어 영역 E3 및/또는 E4에서와 같은) 하나 이상의 게놈 영역을 포함하지 않고/않거나, 적어도 하나의 유전자가 비기능성이 되도록 하는 결실 및/또는 돌연변이를 포함하는 아데노바이러스 유전자를 포함하는 것이 바람직할 수 있다. 이러한 바람직한 실시형태에서, 적합한 아데노바이러스 영역은 상기 언급된 영역 (들)/유전자 (들)를 포함하지 않거나, 선택된 영역 (들)/유전자 (들)가 비기능성이 되도록 변형된다. 이를 비기능성이 되도록 하는 한 가지 가능성은 하나 이상의 인위적 종결 코돈 (예를 들어, TAA)을 이들 유전자의 오픈 리딩 프레임에 도입하는 것이다. 바이러스가 복제 결함이 있도록 하는 방법은 당 업계에 널리 공지되어 있다 (예를 들어, 문헌[Brody et al., 1994 Ann NY Acad Sci., 716: 90-101] 참조). 결실은 바람직하게는 본 명세서에 기재된 바와 같은 미니유전자 카세트와 같은 발현 카세트 내에 전이유전자를 삽입하기 위한 공간을 생성시킬 수 있다. 또한, 결실을 사용하여, 당 업계에 널리 공지된 바와 같은 패키징 세포주 또는 보조 바이러스를 사용하지 않고 복제할 수 없는 아데노바이러스 벡터를 생성할 수 있다. 따라서, 명시된 유전자/영역 결실 또는 기능 상실 돌연변이 중 하나 이상을 포함하는 상기 요약된 폴리뉴클레오타이드를 포함하는 최종 재조합 아데노바이러스는 예를 들어유전자 요법 또는 예방접종을 위해 더욱 안전한 재조합 아데노바이러스를 제공할 수 있다.
폴리뉴클레오타이드는 본 명세서에 요약된 바와 같은 (예를 들어 영역 E3 및/또는 E4와 같은) 적어도 하나의 게놈 영역/유전자, 구체적으로 E1A, E1B, E2A, E2B, E3 ORF1, E3 ORF2, E3 ORF3, E3 ORF4, E3 ORF5, E3 ORF6, E3 ORF7, E3 ORF8, E3 ORF9, E4 ORF6/7, E4 ORF6, E4 ORF5, E4 ORF4, E4 ORF3, E4 ORF2 및/또는 E4 ORF1, 바람직하게는 E1A, E1B, E2A, E2B, E3 및/또는 E4를 포함하지 않을 수 있고/거나, 적어도 하나의 유전자가 비기능성이 되도록 하는 결실 및/또는 돌연변이를 포함하는 아데노바이러스 유전자를 포함하고, 완전한 Ela 및/또는 Elb 영역을 보유하는 것이 바람직하다. 이러한 완전한 El 영역은 아데노바이러스 게놈의 그의 원 위치에 위치할 수 있거나, 본래의 아데노바이러스 게놈 (예를 들어, E3 영역)의 결실 부위에 배치될 수 있다.
바람직한 실시형태에서, 제1 양상의 단리된 폴리뉴클레오타이드는 다음 아데노바이러스 단백질: 단백질 VI, 단백질 VIII, 단백질 IX, 단백질 IIIa 및 단백질 IVa2 중 하나 이상, 바람직하게는 그 모두를 추가로 인코딩한다.
아데노바이러스의 당업자는 상기 명시된 아데노바이러스 단백질을 인코딩하는 오픈 리딩 프레임을 결정하는 방법을 잘 인식하고 있다. 당업자는 또한 아데노바이러스 게놈의 구조를 알고 있으며, 과도한 부담없이, 본 명세서에 요약된 개별 아데노바이러스 영역 및 ORF를 임의의 아데노바이러스 게놈에 맵핑 (maping)할 수 있다.
또 다른 실시형태에서, 제1 양상의 단리된 폴리뉴클레오타이드는 하나 이상의 이종성 단백질 또는 그의 단편을 추가로 인코딩한다. 하나 이상의 이종성 단백질 또는 그의 단편은 바람직하게는 비-아데노바이러스 단백질 또는 그의 단편이다. 바람직한 실시형태에서, 하나 이상의 비-아데노바이러스 단백질 또는 그의 단편은 하나 이상의 항원성 단백질 또는 그의 단편이다. 바람직하게는, 하나 이상의 이종성 단백질 또는 그의 단편은 하나 이상의 발현 카세트의 일부이다. 이종성 단백질을 인코딩하는 서열 및 바람직하게는 이종성 단백질을 인코딩하는 이러한 서열(들)을 포함하는 발현 카세트는, 예를 들어 본 명세서에 규정된 아데노바이러스 게놈의 결실 영역 내에 삽입될 수 있다. 예시적 이종성 단백질은 서열번호 74에 따른 폴리펩타이드 또는 서열번호 74에 적어도 85% 서열 동일성을 갖는 그의 변이체이다.
제2 양상에서, 본 발명은 아데노바이러스를 인코딩하는 단리된 폴리뉴클레오타이드를 제공하고, 이는 바람직하게는 서열번호 1 내지 10 중 어느 하나에 따른 아데노바이러스 게놈 또는 각각 서열번호 1 내지 10에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하여, 제1 양상의 폴리뉴클레오타이드를 포함한다.
바람직한 실시형태에서, 이는 바람직하게는 게놈 영역/유전자 E1A, E1B, E2A, E2B, E3 및 E4 중 하나 이상이 부재하는 서열번호 1 내지 10 중 어느 하나에 따른 아데노바이러스 게놈을 포함하는 복제-불능 아데노바이러스를 인코딩한다.
가장 바람직하게는, 이는 바람직하게는 서열번호 1 내지 10 중 어느 하나에 따른 아데노바이러스 게놈 또는 각각 서열번호 1 내지 10에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하고, 바람직하게는 하나 이상의 이종성 단백질 또는 그의 단편이 삽입되는 (전달체 아데노바이러스) 재조합 아데노바이러스를 인코딩한다. 바람직하게는, 하나 이상의 이종성 단백질 또는 그의 단편은 게놈 영역/유전자 E1A, E1B, E2A, E2B, E3 ORF1, E3 ORF2, E3 ORF3, E3 ORF4, E3 ORF5, E3 ORF6, E3 ORF7, E3 ORF8, E3 ORF9, E4 ORF6/7, E4 ORF6, E4 ORF5, E4 ORF4, E4 ORF3, E4 ORF2 및 E4 ORF1, 더욱 바람직하게는 E1, E3 및/또는 E4 중 하나 이상을 대체함으로써, 삽입된다. 이종성 단백질 또는 그의 단편은 바람직하게는 발현 카세트의 일부로서 삽입된다. 선택적으로, 전달체 아데노바이러스는 또한 본원에 기재된 바와 같이 복제-불능, 즉, 게놈 영역/유전자 E1A, E1B, E2A, E2B, E3 및 E4 중 하나 이상이 부재한다.
예시적 실시형태에서, 본 발명은 아데노바이러스를 인코딩하는 단리된 폴리뉴클레오타이드를 제공하고, 이는 서열번호 72 또는 73에 따른 폴리뉴클레오타이드 또는 각각 서열번호 72 또는 73에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함한다.
바람직한 실시형태에서, 아데노바이러스 게놈 변이체 각각의 서열번호 1에 적어도 85%, 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98%, 99%, 99.5 또는 99.9% 서열 동일성 대신, 서열번호 2에 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98%, 99%, 99.5 또는 99.9% 서열 동일성, 서열번호 3에 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98%, 99%, 99.5 또는 99.9% 서열 동일성, 각각의 서열번호 4에 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98%, 99%, 99.5 또는 99.9% 서열 동일성, 각각의 서열번호 5에 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98%, 99%, 99.5 또는 99.9% 서열 동일성, 각각의 서열번호 6에 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98%, 99%, 99.5 또는 99.9% 서열 동일성, 각각의 서열번호 7에 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98%, 99%, 99.5 또는 99.9% 서열 동일성, 각각의 서열번호 8에 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98%, 99%, 99.5 또는 99.9% 서열 동일성, 각각의 서열번호 9에 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98%, 99%, 99.5 또는 99.9% 서열 동일성 또는 각각의 서열번호 10에 적어도 90% 및 바람직하게는 적어도 95%, 96%, 97%, 98%, 99%, 99.5 또는 99.9% 서열 동일성을 갖는다 (상기에 규정된 바와 같은 결실을 고려하는 경우).
일 실시형태에서, 제2 양상의 단리된 폴리뉴클레오타이드는 재조합 아데노바이러스를 인코딩하고, 재조합 아데노바이러스의 적어도 하나의 아데노바이러스 게놈 영역은 상기에 규정된 바와 같은 헥손 HVR 또는 헥손 단백질을 포함하지 않는 아데노바이러스로부터 유래된다 (키메라 아데노바이러스). 바람직하게는, 키메라 아데노바이러스는 대체로 또는 바람직하게는 본 명세서에 규정된 바와 같은 헥손 HVR 또는 헥손 단백질 및 선택적으로 또한 펜톤 및/또는 섬유 단백질에 대해서만 키메라이다. 즉, 폴리뉴클레오타이드는 상기에 규정된 바와 같은 헥손 HVR 또는 헥손 단백질 및 선택적으로 또한 상기에 규정된 바와 같은 펜톤 및/또는 섬유 단백질을 인코딩하나, 하나 이상의, 바람직하게는 다른 모든 게놈 영역은 상이한, 특히 서열번호 1 내지 10에 따른 아데노바이러스와 상이한 아데노바이러스로부터 유래한다. 상이한 아데노바이러스는 바람직하게는 상이한 숙주, 더욱 바람직하게는 인간 아데노바이러스에 자연적으로 존재하는 것이다. 이러한 폴리뉴클레오타이드는 바람직하게는 또한 상기에 규정된 바와 같은 하나 이상의 이종성 비-아데노바이러스 단백질 또는 그의 단편을 인코딩한다. 따라서, 하나 이상의 이종성 비-아데노바이러스 유전자는 키메라 아데노바이러스의 아데노바이러스 게놈 내에 삽입된다. 따라서, 키메라 아데노바이러스의 아데노바이러스 게놈은 상기에 규정된 바와 같은 헥손 HVR 또는 헥손 단백질을 인코딩하는 DNA 및 선택적으로 상기에 규정된 바와 같은 펜톤 및/또는 섬유 단백질을 인코딩하는 DNA를 제외하고, 비 유인원 아데노바이러스, 예를 들어 인간 아데노바이러스, 바람직하게는 비 유인원, 예를 들어 인간 전달체 아데노바이러스로부터 유래된 것이다.
일반적으로 아데노바이러스는 복제-불능인 것이 바람직하다. 결과적으로, 아데노바이러스가 게놈 영역 E1A, E1B, E2A, E2B, E3 및 E4 중 하나 이상이 부재하거나, 내부에 결실 및/또는 돌연변이를 포함함으로써, 그에 의해 인코딩되는 게놈 영역 또는 발현 산물이 비 기능성이 되는 것이 바람직하다.
특히 바람직한 일 실시형태에서, 그의 모든 변이체에서 제1 또는 제2 양상의 단리된 폴리뉴클레오타이드는 기능적으로 손상된 IVa2 유전자, 바람직하게는 그 내부에 결실 또는 널 돌연변이 (null-mutation)를 가질 수 있다. 이러한 유전자는 바이러스 DNA 패킹 (packing)에 관여하고, 그 손상은 바이러스-유사 입자의 생성을 유발한다. 이러한 실시형태에서, 제1 또는 제2 양상의 단리된 폴리뉴클레오타이드는 바람직하게는 하나 이상의 비-아데노바이러스 B 세포 에피토프 및/또는 T 세포 에피토프를 인코딩한다.
제3 양상에서, 본 발명은 제1 또는 제2 양상의 폴리뉴클레오타이드에 의해 인코딩되는 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드를 제공한다. 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드는 적어도, 상기에 규정된 바와 같은 HVR을 갖는 헥손, 바람직하게는 상기에 규정된 바와 같은 헥손 단백질 및 선택적으로 또한 상기에 규정된 펜톤 및/또는 섬유 단백질을 포함한다.
적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드는 세포내에서의 발현에 의해 획득될 수 있다. 발현된 폴리펩타이드(들)는 선택적으로 표준 기법을 사용하여 정제될 수 있다. 예를 들어, 세포를 물리적으로 또는 삼투압 충격에 의해 용해시킨 후, 침전 및 크로마토그래피 단계에 적용할 수 있으며, 그 특성 및 순서는 회수될 특정 재조합 물질에 따라 상이할 것이다. 대안적으로, 발현된 폴리펩타이드 (들)는 단백질 발현 기술에 공지된 바와 같이 재조합 세포가 배양된 배양 배지에 분비되고, 이로부터 회수될 수 있다.
제4 양상에서, 본 발명은 제1 또는 제2 양상의 단리된 폴리뉴클레오타이드 및/또는 제3 양상의 아데노바이러스 캡시드 폴리펩타이드를 포함하는 아데노바이러스 (또한 본 명세서에서 아데노바이러스 벡터 또는 아데노바이러스 벡터로 언급됨)를 제공한다. 따라서, 아데노바이러스는, 예를 들어, 상기에 규정된 바와 같이, 서열번호 1 내지 10 중 어느 하나에 의해 인코딩되는 아데노바이러스 또는 재조합 아데노바이러스, 예컨대, 전달체 또는 키메라 아데노바이러스일 수 있다. 바람직하게는, 아데노바이러스는 단리된 아데노바이러스이다.
예시적 실시형태에서, 본 발명은 각각 서열번호 72 또는 73에 따른 폴리뉴클레오타이드 또는 서열번호 72 또는 73에 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 아데노바이러스를 제공한다.
아데노바이러스는 제1 또는 제2 양상의 폴리뉴클레오타이드를 포함할 수 있거나 포함하지 않을 수 있다. 이러한 폴리뉴클레오타이드가 아데노바이러스에 포함되지 않은 경우, 이는 이식형으로 제공된다 (즉, 아데노바이러스 내에 혼입된 아데노바이러스 게놈에 존재하지 않는 유전적 요소에 의함). 이는 일반적으로 보조 작제물 (예를 들어, 플라스미드 또는 바이러스) 또는 패키징 숙주 세포 (본 명세서에 기재된 바와 같은 보완 세포)의 게놈 또는 그 내부의 보조 작제물에 의해 제공된다. 이식형으로 제공되는 폴리뉴클레오타이드가 이들 폴리뉴클레오타이드의 상동체 또는 다른 서열 변이체를 포함하여, 아데노바이러스 내에 혼입된 게놈 내에 포함되지 않은 것이 더욱 바람직하다. 예를 들어, 이식형으로 제공되는 폴리뉴클레오타이드가 헥손, 펜톤 및/또는 섬유 유전자를 포함하는 경우, 아데노바이러스에 혼입된 게놈은 각각 헥손, 펜톤 및/또는 섬유 단백질을 인코딩하는 임의의 폴리뉴클레오타이드를 포함하지 않는다. 가장 바람직하게는, 이식형으로 제공되는 폴리뉴클레오타이드는 제3 양상에서 규정된 바와 같은 적어도 하나의 아데노바이러스 캡시드 폴리펩타이드, 즉 제1 또는 제2 양상에서 규정된 바와 같은 HVR을 갖는 헥손, 바람직하게는 제1 또는 제2 양상에서 규정된 바와 같은 헥손 단백질, 및 선택적으로 제1 또는 제2 양상에서 규정된 바와 같은 펜톤 및/또는 섬유 단백질을 인코딩한다.
숙주, 예를 들어 인간 또는 다른 포유류 세포에 유전자를 전달하기 위한 아데노바이러스 벡터의 작제시, 다양한 아데노바이러스 핵산 서열이 사용될 수 있다. 예를 들어, 재조합 바이러스의 일부를 형성하는 아데노바이러스 서열로부터 아데노바이러스 지연 초기 유전자 E3의 전부 또는 일부가 제거될 수 있다. 유인원 E3의 기능은 재조합 바이러스 입자의 기능 및 생성과 관련이 없는 것으로 여겨진다. 일부 실시형태에서, 더욱 바람직하게는 이 영역, 전체 E4 영역의 기능에서의 중복성으로 인해 E4 유전자의 적어도 ORF6 영역이 결실된 아데노바이러스 벡터가 또한 작제될 수 있다. 본 발명의 또 다른 벡터는 지연 초기 유전자 E2a에 결실을 포함한다. 유인원 아데노바이러스 게놈의 후기 유전자 L1 내지 L5 중 임의의 것에도 결실이 이루어질 수 있다. 유사하게, 중간 유전자 IX 및 IVa2에서의 결실은 일부 목적에 유용할 수 있다. 다른 결실은 다른 구조적 또는 비 구조적 아데노바이러스 유전자에 이루어질 수 있다. 상기 논의된 결실은 개별적으로 사용될 수 있으며, 즉, 본 발명에 사용하기 위한 아데노바이러스 서열은 단일 영역에만 결실을 포함할 수 있다. 대안적으로, 생물학적 활성을 파괴하는데 효과적인 전체 유전자 또는 그의 일부의 결실은 임의의 조합으로 사용될 수 있다. 예를 들어, 아데노바이러스 서열은 E3 등의 결실의 유무에 관계없이 El 및 E4 영역 또는 El, E2a 및 E3 영역 또는 E1 및 E3 영역 또는 El, E2a 및 E4 영역의 결실을 가질 수 있다. 적절한 결과를 획득하기 위해, 이러한 결실은 온도 감응성 돌연변이와 같은 다른 아데노바이러스 유전자 돌연변이와 조합하여 사용될 수 있다.
임의의 필수 아데노바이러스 서열 (예를 들어, Ela, Elb, E2a, E2b, E4 ORF6, L1 또는 L4로부터 선택된 영역)이 부재하는 아데노바이러스 벡터는, 아데노바이러스 입자의 바이러스 감염성 및 증식에 필요한 결손 아데노바이러스 유전자 산물의 존재 하에서 배양될 수 있다. 이들 보조 기능은 하나 이상의 보조 작제물 (예를 들어, 플라스미드 또는 바이러스) 또는 패키징 숙주 세포 (본 명세서에 기재된 바와 같은 보완 세포)의 존재하에 아데노바이러스 벡터를 배양함으로써 제공될 수 있다. 예를 들어, WO96/13597의 "최소" 인간 아데노바이러스 벡터의 제조에 대해 기재된 기법을 참조한다.
유용한 보조 작제물은 벡터 및 벡터가 형질감염된 세포에 의해 발현되지 않고/거나, 결실된 각각의 유전자를 보완하는 선택된 아데노바이러스 유전자 서열을 포함한다. 일 실시형태에서, 보조 작제물은 복제 결함이고, 필수적이고 선택적으로 추가의 아데노바이러스 유전자를 포함한다.
보조 작제물은 또한 문헌[Wu et al, J. Biol. Chem., 264: 16985-16987 (1989); K. J. Fisher and J. M. Wilson, Biochem. J., 299: 49 (April 1, 1994)]에 기재된 바와 같은 다가 양이온성 접합체로 형성될 수 있다. 보조 작제물은 선택적으로 리포터 유전자를 포함할 수 있다. 다수의 이러한 리포터 유전자는 당 업계에 공지되어 있다. 아데노바이러스 벡터상의 전이유전자와 상이한 보조 작제물 상에 리포터 유전자의 존재에 의해 아데노바이러스 및 보조 작제물 둘 모두를 독립적으로 모니터링할 수 있다. 이러한 제2 리포터는 정제시에 생성된 재조합 아데노바이러스 및 보조 작제물 사이의 분리를 용이하게 하기 위해 사용될 수 있다. 바람직한 보조 작제물은 보조 바이러스이다.
본 명세서의 바람직한 실시형태와 관련하여 기재된 임의의 유전자에서 결실된 재조합 아데노바이러스 (Ad)를 생성하기 위해, 결실된 유전자 영역의 기능이 바이러스의 복제 및 감염성에 필수적인 경우, 바람직하게는 보조 작제물 또는 세포, 즉 보완 또는 패키징 세포에 의해 재조합 바이러스에 제공된다. 다양한 환경에서, 인간 E1을 발현하는 작제물/세포를 사용하여 재조합 아데노바이러스를 생성하는데 사용되는 벡터를 이식 보완할 수 있다. 이는 본 발명의 폴리뉴클레오타이드 서열과 현재 이용 가능한 패키징 작제물/세포에 존재하는 인간 아데노바이러스 E1 서열 사이의 다양성으로 인해, 현재 인간 E1 포함 작제물/세포의 사용이 복제 및 생성 과정에서 복제 가능한 아데노바이러스의 생성을 방지할 것이기 때문에 특히 유리하다. 그러나, 특정 상황에서, E1-결실된 재조합 아데노바이러스의 생성을 위해 E1 유전자 산물을 발현하는 작제물/세포를 이용하는 것이 바람직할 것이다.
적절한 경우, 본 명세서에 제공된 서열을 이용하여, 예를 들어 HeLa 세포와 같은 선택된 모 세포주에서 발현을 위한 프로모터의 전사 제어 하에, 최소한 서열번호 1 내지 10 중 어느 하나에 따른 아데노바이러스로부터 아데노바이러스 E1 유전자를 발현하는 보조 작제물/세포 또는 세포주를 생성할 수 있다. 이 목적을 위해 유도성 또는 항시적 프로모터가 사용될 수 있다. 프로모터의 예는 예를 들어 본 명세서에 기재된 실시예에 제공된다. 이러한 E1-발현 세포는 재조합 아데노바이러스 E1 결실 벡터의 생성에 유용하다. 추가적으로 또는 대안적으로, 본 발명은 하나 이상의 아데노바이러스 유전자 산물, 예를 들어, Ela, Elb, E2a 및/또는 E4 ORF6, 바람직하게는 Ad5 E4 ORF6을 발현하는 작제물/세포를 제공하며, 이는 필수적으로, 재조합 아데노바이러스 벡터의 생성에 사용되는 것과 동일한 절차를 사용하여 작제될 수 있다. 이러한 작제물/세포는 이러한 산물을 인코딩하는 필수 유전자가 결실된 아데노바이러스 벡터를 이식 보완하거나, 보조-의존적 바이러스 (예를 들어, 아데노-연관 바이러스)의 패키징에 필요한 보조 기능을 제공하는데 이용될 수 있다.
일반적으로, 형질감염에 의해 아데노바이러스 벡터를 전달할 때, 벡터는 약 0.1 μg 내지 약 100 μg DNA 및 바람직하게는 약 10 내지 약 50 μg DNA 내지 약 1 x 104개의 세포 내지 약 1 x 103개의 세포 및 바람직하게는 약 105개의 세포의 양으로 전달된다. 그러나, 선택된 벡터, 전달 방법 및 선택된 숙주 세포와 같은 인자를 고려하여 숙주 세포에 대한 벡터 DNA의 상대적인 양이 조절될 수 있다. 벡터를 숙주 세포 내로 도입하는 것은, 예를 들어 CaPO4 형질감염 또는 전기 천공 법을 사용한 형질감염 및 감염을 포함하여, 당 업계에 공지되어 있거나, 본 명세서에 개시된 바와 같은 임의의 수단에 의해 달성될 수 있다.
적절한 재조합 아데노바이러스의 작제 및 조립을 위해, 하나의 예에서, 보조 작제물의 존재하에 시험관내에서 아데노바이러스 벡터를 패키징 세포주 내로 형질감염시켜, 보조 및 아데노바이러스 벡터 서열 사이에 상동성 재조합이 유발되도록 할 수 있으며, 이에 의해, 당 업계에 널리 공지된 바와 같이, 벡터 내의 아데노바이러스-전이유전자 서열이 복제되고, 비리온 캡시드로 패키징될 수 있음으로서, 재조합 바이러스 벡터 입자를 생성할 수 있다. 본 발명의 재조합 아데노바이러스는 예를 들어 선택된 전이유전자를 선택된 숙주 세포 내로 전달하는 단계에 유용하다.
바람직한 실시형태에서, 제4 양상의 아데노바이러스는 인간 대상체의 5% 미만에서 혈청 내에 다량 존재하고, 바람직하게는 인간 대상체에서 혈청 내에 다량 존재하지 않으며, 가장 바람직하게는 비인간 대형 유인원 아데노바이러스, 더욱 바람직하게는 서열번호 1 내지 10에 따른 하나 이상의, 특히 모든 아데노바이러스와 이전에 접촉되지 않은 인간 대상체에서 혈청 내에 다량 존재하지 않는다. 이와 관련하여, 바람직하게는, 인간 대상체는 유럽인, 아프리카 원주민, 아시아인, 미국 원주민 및 오세아니아 원주민으로부터 선택되는 인종 그룹에 속한다. 인간 대상체의 인종 기원을 확인하는 방법은 당 업계에 포함된다 (예를 들어, WO2003/102236 참조).
재조합 아데노바이러스의 추가의 바람직한 실시형태에서, 아데노바이러스 DNA는 포유류 표적 세포에 유입될 수 있으며, 즉 감염성이다. 본 발명의 감염성 재조합 아데노바이러스는 또한 본 명세서에 기재된 바와 같은 백신 및 유전자 요법으로 사용될 수 있다. 따라서, 다른 실시형태에서, 재조합 아데노바이러스는 표적 세포로 전달하기 위한 분자를 포함하는 것이 바람직하다. 바람직하게는, 표적 세포는 포유류 세포, 예를 들어 비인간 대형 유인원 세포, 설치류 세포 또는 인간 세포이다. 예를 들어, 표적 세포 내로 전달하기 위한 분자는 바람직하게는 발현 카세트 내의 본 명세서에 규정된 이종성 단백질 (즉, 이종성 유전자)을 인코딩하는 폴리뉴클레오타이드일 수 있다. 아데노바이러스의 게놈 내로 발현 카세트를 도입하는 방법은 당 업계에 널리 공지되어 있다. 하나의 예에서, 예를 들어 이종성 유전자를 인코딩하는 발현 카세트를 포함하는 본 발명의 재조합 아데노바이러스는 E1A, E1B, E2A, E2B, E3 및 E4로부터 선택된 아데노바이러스의 게놈 영역을 상기 발현 카세트로 대체함으로써 생성될 수 있다. 본 발명의 아데노바이러스의 게놈 영역 E1A, E1B, E2A, E2B, E3 및 E4는 인간 Ad5와 같이 주석의 공지된 아데노바이러스 게놈과의 정렬에 의해 용이하게 확인될 수 있다 (문헌[Birgitt Taeuber 및 Thomas Dobner, Oncogene (2001) 20, p. 7847 -7854; 및 또한: Andrew J. Davison, et al., "Genetic content and evolution of adenoviruses", Journal of General Virology (2003), 84, p. 2895-2908] 참조).
표적 세포 내로 전달하기 위한 분자는 바람직하게는 이종성 폴리뉴클레오타이드이지만, 또한 바람직하게는 치료적 또는 진단적 활성을 갖는 폴리펩타이드 또는 작은 화학 화합물일 수 있다. 특히 바람직한 일 실시형태에서, 표적 세포로 전달하기 위한 분자는 아데노바이러스 5' 역위 말단 반복 서열 (ITR) 및 3' ITR을 포함하는 이종성 폴리뉴클레오타이드이다. 예를 들어 패키징 세포에서 재조합 아데노바이러스가 생성되는 경우, 캡시드가 분자 주위에 형성되고, 이를 패키징할 수 있는 분자의 분자 크기가 선택되어야 한다는 것이 당업자에게 명백할 것이다. 따라서, 바람직하게는 이종성 유전자는 예를 들어 최대 7000 및 최대 8000개의 염기 쌍을 가질 수 있는 미니유전자이다.
제5 양상에서, 본 발명은 제1 또는 제2 양상의 폴리뉴클레오타이드에 의해 인코딩되는 바이러스-유사 입자 (VLP)를 제공한다. 따라서, VLP는 제3 양상에 따른 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드를 포함한다. 일 실시형태에서, VLP를 인코딩하는 폴리뉴클레오타이드는 결실된 Iva2 유전자를 갖거나, Iva2 유전자 내에 널-돌연변이를 갖는다.
하기 VLP의 규정에 따르면, 제5 양상의 VLP는 바이러스 게놈 DNA를 실질적으로 포함하지 않는다. 아데노바이러스 VLP를 포함하는 VLP는 백신화, 유전자 요법 또는 예를 들어 항암제의 약물 직접 전달을 위해 사용되어 왔다 (문헌[Chroboczek et al., ACTA ABP BIOCHIMICA POLONICA, Vol. 61, No. 3/2014]). 따라서, 제5 양상의 VLP는 하나 이상의 비-아데노바이러스 B 세포 및/또는 비-아데노바이러스 T 세포 에피토프, 유전자 요법을 위한 하나 이상의 비-아데노바이러스 유전자 및/또는 하나 이상의 약제학적 제제, 예를 들어 항암 제제를 포함할 수 있다. 일 실시형태에서, VLP는 하나 이상의 비-아데노바이러스 B 세포 에피토프를 혼입시키고, 바람직하게는 제시하고/거나, 하나 이상의 비-아데노바이러스 T 세포 에피토프를 혼입시킨다.
제6 양상에서, 본 발명은 제1 또는 제2 양상의 폴리뉴클레오타이드를 포함하는 벡터를 제공한다. 바람직한 실시형태에서, 벡터는 플라스미드 벡터, 예를 들어 발현 벡터이다. 플라스미드 벡터는 본 명세서에 기재된 바와 같은 재조합 아데노바이러스를 생성하기 위해 유리하게 사용될 수 있다. 본 발명의 신규한 헥손, 펜톤 및 섬유 단백질 및 VA RNA의 서열 정보가 제공되므로, 상기 재조합 아데노바이러스는 예를 들어 제1 또는 제2 양상 및 임의의 다른 아데노바이러스 게놈 영역의 폴리뉴클레오타이드에 의해 인코딩되는 재조합 아데노바이러스를 작제함으로써 획득될 수 있다. 재조합 아데노바이러스의 작제 방법은 당 업계에 널리 공지되어 있다. 재조합 아데노바이러스의 제조를 위해 유용한 기법은, 예를 들어 문헌[Graham & Prevec, 1991 In Methods in Molecular Biology: Gene Transfer and Expression Protocols, (Ed. Murray, EJ.), p. 109; and Hitt et al., 1997 "Human Adenovirus Vectors for Gene Transfer into Mammalian Cells" Advances in Pharmacology 40:137-206]에 검토되어 있다. WO 2006/086284에 추가의 방법이 기재되어 있다.
제1 또는 제2 양상의 폴리뉴클레오타이드를 발현시키기 위해, 바람직하게는 발현 카세트에 의해, 전사를 유도하는 강한 프로모터를 포함하는 발현 벡터로 상기 폴리뉴클레오타이드를 서브클로닝 (subcloning)할 수 있다. 적합한 박테리아 프로모터는 예를 들어 E. 콜라이 (E. coli), 바실러스 (Bacillus) 종 및 살모넬라 (Salmonella)에서와 같이 당 업계에 널리 공지되어 있으며, 이러한 발현 시스템용 키트는 시중에서 구입할 수 있다. 유사하게 포유류 세포, 효모 및 곤충 세포에 대한 진핵생물 발현 시스템은 당 업계에 널리 공지되어 있고, 또한 시중에서 구입할 수 있다. 발현 카세트에 대한 자세한 내용은 하기를 참조한다.
유전자 정보를 세포 내로 운반하는데 유용한 특정 발현 벡터는 특별히 중요하지 않다. 진핵세포 또는 원핵세포에서의 발현에 사용되는 임의의 통상적인 벡터가 사용될 수 있다. 표준 박테리아 발현 벡터는 플라스미드 예컨대, pBR322 기반 플라스미드, pSKF, pET23D 및 융합 발현 시스템 예컨대, GST 및 LacZ를 포함하지만, 당업자에게 유용하게 사용될 수 있는 많은 것이 공지되어 있다. 진핵생물 바이러스로부터의 조절 요소를 포함하는 발현 벡터는 통상적으로 진핵생물 발현 벡터, 예를 들어 SV40 벡터, 유두종 바이러스 벡터 및 엡스타인-바르 바이러스 (Epstein-Barr virus)로부터 유래된 벡터에 사용된다. 다른 예시적인 진핵생물 벡터는 pMSG, pAV009/A.sup.+, pMTO10/A.sup.+, pMAMneo-5, 바큘로바이러스 (baculovirus) pDSVE, pcDNA3.1, pIRES 및 예를 들어 HCMV 즉시-초기 프로모터, SV40 조기 프로모터, SV40 후기 프로모터, 메탈로티오네인 프로모터, 쥣과동물 유선 종양 바이러스 프로모터, 라우스 육종 바이러스 프로모터, 폴리헤드린 프로모터, 또는 진핵 세포에서의 발현에 효과적인 것으로 밝혀진 다른 프로모터의 방향에 따라 단백질의 발현을 가능하게 하는 임의의 다른 벡터를 포함한다. 일부 발현 시스템은 티미딘 키나제, 하이그로마이신 B 포스포트랜스퍼라제 및 디하이드로폴레이트 환원효소와 같은 유전자 증폭을 제공하는 마커를 갖는다. 대안적으로, 유전자 증폭을 포함하지 않는 고 수율 발현 시스템이 또한 적합하다. 발현 벡터에 또한 포함될 수 있는 요소는 E. 콜라이에서 기능하는 레플리콘 (replicon), 재조합 플라스미드를 보유하는 박테리아의 선택을 가능하게 하는 약물 내성을 인코딩하는 유전자, 및 진핵생물 서열의 삽입을 가능하게 하기 위한 플라스미드의 비 필수 영역의 고유한 제한 부위를 포함한다. 선택된 특정 약물 내성 유전자는 중요하지 않으며, 당 업계에 공지된 많은 약물 내성 유전자 중 임의의 것이 적합하다. 원핵생물 서열은 필요에 따라 선택적으로, 진핵세포에서 DNA의 복제를 방해하지 않는 것으로 선택된다.
제7 양상에서, 본 발명은 (i) 애주번트, (ii) 제1 또는 제2 양상의 단리된 폴리뉴클레오타이드, 제3 양상의 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드, 제4 양상의 아데노바이러스, 제5 양상의 바이러스-유사 입자 또는 제6 양상의 벡터 및 선택적으로 (iii) 약제학적으로 허용 가능한 부형제를 포함하는 조성물을 제공한다. 바람직하게는, 애주번트는 I형 사이토카인 수용체, II형 사이토카인 수용체, TNF 수용체, 전사 인자로서 작용하는 비타민 D 수용체 및 Toll-유사 수용체 1 (TLR1), TLR-2, TLR 3, TLR4, TLR5, TLR-6, TLR7 및 TLR9로 이루어진 군으로부터 선택된 수용체에 대한 작용제이다.
애주번트를 포함하는 조성물은 예를 들어 인간 대상체를 위한 백신으로서 사용될 수 있다. 예를 들어, 특정 수용체의 활성화는 면역 반응을 자극할 수 있다. 이러한 수용체는 당업자에게 공지되어 있으며, 예를 들어 사이토카인 수용체, 특히 I형 사이토카인 수용체, II형 사이토카인 수용체, TNF 수용체; 및 전사 인자로서 작용하는 비타민 D 수용체; 및 Toll-유사 수용체 1 (TLR1), TLR-2, TLR 3, TLR4, TLR5, TLR-6, TLR7 및 TLR9를 포함한다. 이러한 수용체에 대한 작용제는 애주번트 활성을 가지며, 즉 면역 자극성이다. 바람직한 실시형태에서, 조성물의 애주번트는 하나 이상의 Toll-유사 수용체 작용제일 수 있다. 더욱 바람직한 실시형태에서, 애주번트는 Toll-유사 수용체 4 작용제이다. 특히 바람직한 실시형태에서, 애주번트는 Toll-유사 수용체 9 작용제이다. 애주번트의 예를 위해 다음을 참조한다. 또한, 바람직한 약제학적으로 허용 가능한 부형제가 다음에 언급되어 있다.
제8 양상에서, 본 발명은 제1 또는 제2 양상의 폴리뉴클레오타이드, 제3 양상의 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드, 제4 양상의 아데노바이러스, 제5 양상의 바이러스-유사 입자 또는 제6 양상의 벡터를 포함하는 세포를 제공한다.
바람직하게는, 세포는, 상기에 설명된 바와 같이, 결실되거나, 비 기능성이 되어, 아데노바이러스가 복제-불능이 되는 적어도 하나의 아데노바이러스 유전자 또는 바람직하게는 모든 아데노바이러스 유전자를 발현하는 숙주 세포이다. 이러한 적어도 하나의 유전자의 발현에 의해, 숙주 세포에서 바람직하게는 그외에는 복제-불능인 아데노바이러스의 복제가 가능할 수 있다. 일 실시형태에서, 숙주 세포는 E1A, E1B, E2A, E2B, E3 및 E4로 이루어진 군으로부터 선택되는 적어도 하나의 아데노바이러스 유전자를 발현한다. 특히, 이러한 적어도 하나의 아데노바이러스 유전자는 아데노바이러스 게놈에서 결실되거나, 비 기능성이 된다. 이러한 보완 세포는 예를 들어, 전술된 유전자 산물 중 하나가 부재하므로, 이는 복제 불능인 아데노바이러스의 증식 및 구제를 위해 사용될 수 있다.
세포는 박테리아 세포, 예컨대 E. 콜라이 세포, 효모 세포, 예컨대 사카로마이세스 세레비지아에 (Saccharomyces cerevisiae) 또는 피키아 파스토리스 (Pichia pastoris), 식물 세포, 곤충 세포, 예컨대 SF9 또는 Hi5 세포 또는 포유류 세포 중에서 선택될 수 있다. 포유류 세포의 바람직한 예는 중국 햄스터 난소 (CHO) 세포, 인간 배아 신장 (HEK 293) 세포, HELA 세포, 인간 간세포 (예를 들어, Huh7.5), Hep G2 인간 간세포, Hep 3B 인간 간세포 등이다.
세포가 제1 또는 제2 양상에 따른 폴리뉴클레오타이드를 포함하는 경우, 이 폴리뉴클레오타이드는 (i) 그 자체로 무작위 분산되거나, (ii) 세포 게놈 또는 미토콘드리아 DNA에 혼입되어 세포에 존재할 수 있다.
추가의 바람직한 실시형태에서, 세포는 E1a, E1b, E2a, E2b, E4, L1, L2, L3, L4 및 L5로 이루어진 군으로부터 선택된 적어도 하나의 아데노바이러스 유전자를 발현하는 숙주 세포, 바람직하게는 293 세포 또는 PER.C6™ 세포이다.
박테리아, 포유류, 효모 또는 곤충 세포주를 생성하기 위해 표준 형질감염 방법을 사용할 수 있다. 외래 폴리뉴클레오타이드 서열을 숙주 세포 내로 도입하기 위한 임의의 공지된 절차가 사용될 수 있다. 예를 들어, 시판되는 리포좀 기반 형질감염 키트, 예컨대 Lipofectamine™ (Invitrogen), 시판되는 지질 기반 형질감염 키트, 예컨대 Fugene (Roche Diagnostics), 폴리에틸렌 글리콜 기반 형질감염, 인산칼슘 침전, 유전자 총 (biolistic), 전기천공법, 또는 바이러스 감염 및 클로닝된 게놈 DNA, cDNA, 합성 DNA 또는 다른 외래 유전 물질을 숙주 세포 내로 도입하기 위한 임의의 다른 널리 공지된 방법이 사용될 수 있다. 사용된 특정 유전자 조작 절차는 수용체를 발현할 수 있는 숙주 세포 내로 적어도 하나의 유전자를 성공적으로 도입할 수 있으면 된다.
세포의 추가 실시형태는 상기 본 발명의 제4 양상과 관련하여 기재되어 있다.
제9 양상에서, 본 발명은 질병의 치료 또는 예방에 사용하기 위한 제1 또는 제2 양상의 폴리뉴클레오타이드, 제3 양상의 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드, 제4 양상의 아데노바이러스, 제5 양상의 바이러스-유사 입자 또는 제6 양상의 벡터 및/또는 제7 양상의 조성물을 제공한다. 일 실시형태에서, 치료 또는 예방은 백신화에 의한 것이다. 다른 실시형태에서, 치료는 유전자 요법에 의한 것이다. 백신화와 관련하여, 질병은 바람직하게는 본 명세서에 기재된 바와 같은 병원체에 의해 유발되는 감염성 질병, 또는 바람직하게는 건강한 세포에 의해 발현되지 않는 항원을 발현하는 질병 세포 (예를 들어, 종양 관련 항원을 발현하는 종양 세포)를 특징으로 하는 비 감염성 질병이다. 유전자 요법과 관련하여, 질병은 유전자 또는 단백질의 기능의 상실 또는 획득을 야기하는 하나 이상의 체세포 돌연변이에 의해 유발되는 유전성 질병이다.
아데노바이러스는 유전자 요법 및 백신으로서 유용하다는 것이 널리 공지되어 있다. 전임상 및 임상 연구에 의해 이 시스템을 사용하여 벡터 설계, 강력한 항원 발현 및 보호 면역의 가능성이 입증되었다. 따라서, 바람직한 사용 실시형태는 예를 들어 인단 대상체를 위한 백신화이다. 백신화를 위한 아데노바이러스의 사용 및 제조 방법에 대한 상세한 지침은 기술분야에 포함되어 있고, 당업자에게 공지된 방대한 문헌으로서 제공된다. 예를 들어, 비인간 대형 유인원 아데노바이러스를 기반으로 한 바이러스 벡터는 유전자 백신의 개발을 위한 인간 유래 Ad 벡터의 사용에 대한 대안으로 나타난다 (문헌[Farina SF, J Virol. 2001 Dec;75(23):11603-13.; Fattori E, Gene Ther. 2006 Jul;13(14):1088-96]). 비인간 대형 유인원으로부터 단리된 아데노바이러스는 인간 유래의 세포에서 그의 효율적인 증식에 의해 입증된 바와 같이 인간으로부터 단리된 아데노바이러스와 밀접하게 연관된다. 그러나, 인간 및 비인간 유인원 아데노바이러스는 연관되어 있기 때문에, 두 바이러스 종 사이에 일정 정도의 혈청 교차 반응성이 있을 수 있거나, 부재할 수 있다. 이러한 가정은 침팬지 아데노바이러스를 단리하여, 특성화한 경우에 확인되었다. 따라서, 본 발명에 따른 비인간 대형 유인원 아데노바이러스는 인간에서 인간 아데노바이러스의 통상적인 혈청형에 대한 기 존재하는 면역과 관련된 부작용을 감소시킴으로써, 예를 들어 면역화 및/또는 유전자 요법에 사용될 수 있는 중요한 의학적 툴을 위한 기반을 제공한다.
이는 헥손, 펜톤 및 섬유 단백질을 포함하는 아데노바이러스 캡시드 단백질의 신규한 서열, 특히 가장 표면에 노출된 아데노바이러스 에피토프를 나타내는 신규한 헥손 HVR 서열에 기인한다. 따라서, 캡시드 단백질 및 특히 본 발명에 따른 헥손 HVR에 특이적인 중화 항체는 인간 혈액 혈청에 존재하지 않거나, 매우 소수 존재할 것으로 예상된다. 따라서, 신규한 서열의 하나의 이점은 이것이, 예를 들어 의료 목적을 위해 조작된 종래의 아데노바이러스를 향상시키기 위해 사용될 수 있다는 것이다. 예를 들어, 이러한 서열은, 예를 들어, 인간에서 혈청 내 존재량이 감소된 개선된 재조합 아데노바이러스 (키메라 아데노바이러스)를 획득하기 위해, 상이한 아데노바이러스, 예를 들어 종래 아데노바이러스의 주요 구조적 캡시드 단백질 중 하나 이상 또는 특히 단지 헥손 HVR을 대체/치환하기 위해 사용될 수 있다. 기재된 바와 같이 재조작된 신규 서열 및 이에 따른 아데노바이러스는 투여시 인간에서 임의의 유의한 억제성 면역 반응을 유발하지 않을 것이기 때문에, 그의 전체 형질도입 효율 및 감염성이 향상될 것이다. 따라서, 이러한 개선된 아데노바이러스는 숙주 세포로의 유입 및 항원의 발현이 임의의 유의한 역가의 중화 항체에 의해 방해받지 않기 때문에 더욱 효과적인 백신이 될 것으로 예상된다.
백신은 애주번트를 포함하는 것이 바람직하다. 바람직한 면역학적 애주번트가 본 명세서에 언급되어 있으며, 이러한 백신에 사용될 수 있다.
백신화에 사용되는 경우, 본 발명의 재조합 아데노바이러스는 바람직하게는 1 x 108 내지 1 x 1011 바이러스 입자 (즉, 1 x 108, 5 x 108, 1 x 109, 5 x 109, 1 x 1010, 2.5 x 1010 또는 5 x 1010개의 입자)의 면역학적 및/또는 예방적으로 유효한 투여량으로 투여될 수 있다.
또한, 부스팅 (boosting)이 필요한 백신화의 경우, "이종성 프라이밍 (priming)-부스팅" 방법론을 적용하는 것이 바람직하다: 백신화에서, 제1 양상 또는 제2 양상의 폴리뉴클레오타이드, 제3 양상의 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드, 제4 양상의 아데노바이러스, 제5 양상의 바이러스-유사 입자 또는 제6 양상의 벡터 및/또는 제7 양상의 조성물은 프라이밍 또는 부스팅, 특히 이종성 프라이밍-부스팅 백신화에 사용될 수 있다. 이종성 프라이밍-부스팅의 2개의 상이한 백신의 바람직한 실시형태에서, 예를 들어 제1 또는 제2 양상의 폴리뉴클레오타이드, 제3 양상의 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드, 제4 양상의 아데노바이러스, 제5 양상의 바이러스-유사 입자 또는 제6 양상의 벡터 및/또는 제7 양상의 조성물은 예를 들어 인간에서 항체 부재 또는 중화 항체로 인한 부스팅 백신으로서 사용된다.
본 발명에 따른 폴리뉴클레오타이드 또는 재조합 아데노바이러스 단백질 또는 그의 단편을 사용하여 제조된 재조합 아데노바이러스는 숙주 세포를 폴리뉴클레오타이드, 예를 들어 DNA로 형질도입하는데 사용될 수 있다. 따라서, 숙주 세포에서 임의의 지정 단백질 또는 폴리펩타이드를 발현시키기 위해, 감염성 (즉, 숙주 세포로 유입될 수 있음)일지라도, 바람직하게는 복제 결합인 아데노바이러스를 제조할 수 있다. 따라서, 바람직한 실시형태에서, 본 발명에 따른 사용에 열거된 요법은 유전자 요법이다. 유전자 요법은 생체내, 생체외 또는 시험관내 유전자 요법일 수 있다. 바람직하게는 이는 체세포 유전자 요법이다. 본 발명에 따른 단리된 폴리뉴클레오타이드, 단리된 단백질, 벡터, 재조합 아데노바이러스 및/또는 약제학적 조성물이 유전자 요법에 사용되고, 치료될 대상체에 투여되는 경우, 이는 치료로 인해 환자의 하나 이상의 세포가 형질감염, 즉 형질도입되도록 충분히 큰 투여량으로 투여되는 것이 바람직하다. 본 발명에 따른 재조합 아데노바이러스 및/또는 약제학적 조성물이 본 명세서에 개시된 임의의 바람직한 투여 수단에 의해 투여되는 경우, 바람직하게는 1 x 108 내지 5 x 1011 바이러스 입자 (즉, 1 x 108, 5 x 108, 1 x 109, 5 x 109, 1 x 1010, 2.5 x 1010, 5 x 1010, 1 x 1011 또는 가장 바람직하게는, 5 x 1011개의 입자)의 유효한 투여량이 투여된다. 바람직한 실시형태에서, 본 발명의 재조합 아데노바이러스에 포함되는 바람직하게는 이종성 폴리뉴클레오타이드는 대상체의 숙주 세포에서 단백질 또는 폴리펩타이드를 발현할 수 있으며, 단백질 또는 폴리펩타이드는 상기 숙주 세포로부터의 단백질 또는 폴리펩타이드의 분비에 영향을 미치는 신호 펩타이드를 포함한다. 예를 들어, 특정 단백질이 필요한 환자는 이 단백질의 분비 가능한 형태를 인코딩하는 cDNA를 포함하는 본 발명의 아데노바이러스를 사용하여 치료될 수 있다.
본 발명의 사용의 추가 실시형태에서, 제1 또는 제2 양상의 폴리뉴클레오타이드, 제3 양상의 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드, 제4 양상의 아데노바이러스, 제5 양상의 바이러스-유사 입자 또는 제6 양상의 벡터 및/또는 제7 양상의 조성물 (이하, 본 발명에 따른 약제로 지칭됨)은 하나 이상의 약제학적으로 허용 가능한 희석제; 담체; 충전제, 결합제, 윤활제, 활택제, 붕해제 및 흡착제를 포함하는 부형제; 및/또는 보존제를 추가로 포함하도록 제형화된다.
본 발명에 따른 약제는 경구, 직장, 위내 및 비경구 투여, 예를 들어 정맥내, 근육내, 비강내, 피내, 피하 및 유사한 투여 경로를 포함하는 널리 공지된 다양한 경로에 의해 투여될 수 있다. 비경구, 근육내 및 정맥내 투여가 바람직하다. 바람직하게는 본 발명에 따른 약제는 시럽, 주입 또는 주사 용액, 정제, 캡슐, 캡슬렛, 로젠지 (lozenge), 리포좀, 좌약, 석고, 반창고, 지연 캡슐, 분말, 또는 서방출 제형으로서 제형화된다. 바람직하게는 희석제는 물, 완충액, 완충 염 용액 또는 염 용액이고, 담체는 바람직하게는 코코아 버터 및 비테베솔 (vitebesole)로 이루어진 군으로부터 선택된다.
본 발명의 사용 동안 본 발명에 따른 약제의 투여를 위한 특히 바람직한 약제학적 형태는 주사용으로 적합한 형태이며, 멸균 주사용 용액 또는 분산액의 즉석 제조를 위한 멸균 수용액 또는 분산액 및 멸균 분말을 포함한다. 통상적으로, 이러한 용액 또는 분산액은 예를 들어 물-완충 수용액, 예를 들어 생체적합성 완충액, 에탄올, 폴리올, 예컨대 글리세롤, 프로필렌 글리콜, 폴리에틸렌 글리콜, 이들의 적합한 혼합물, 계면활성제 또는 식물성 오일을 포함하는 용매 또는 분산 매질을 포함할 것이다.
주입 또는 주사 용액은, 항균제 또는 항진균제와 같은 보존제, 예를 들어, 파라벤, 클로로부탄올, 페놀, 소르브산 또는 티머살의 첨가에 제한되는 것은 아니지만 이를 포함한 수많은 당 업계에 공지된 기법에 의해 획득될 수 있다.
본 발명의 바람직한 희석제는 물, 생리학적으로 허용 가능한 완충액, 생리학적으로 허용 가능한 완충 염 용액 또는 염 용액이다. 바람직한 담체는 코코아 버터 및 비테베솔이다. 본 발명에 따른 다양한 약제학적 형태의 약제와 함께 사용될 수 있는 부형제는 하기 비 제한적 목록으로부터 선택될 수 있다:
a) 결합제, 예컨대 락토스, 만니톨, 결정질 소르비톨, 제2인산염, 인산칼슘, 당류, 미정질 셀룰로스, 카복시메틸 셀룰로스, 하이드록시에틸 셀룰로스, 폴리비닐 피롤리돈 등;
b) 윤활제, 예컨대 스테아르산마그네슘, 탈크, 스테아르산칼슘, 스테아르산아연, 스테아르산, 수소화된 식물성 오일, 류신, 글리세리드 및 스테아릴 푸마르산나트륨,
c) 붕해제, 예컨대 전분, 크로스카멜로스, 소듐 메틸 셀룰로스, 한천, 벤토나이트, 알긴산, 카복시메틸 셀룰로스, 폴리비닐 피롤리돈 등.
다른 적합한 부형제는 미극 약제학 협회에 의해 발행된 문헌[Handbook of Pharmaceutical Excipients]에서 찾을 수 있다.
본 발명에 따른 특정 양의 약제는 질병의 치료 또는 예방에 바람직하다. 그러나, 질병의 중증도, 질병의 유형뿐만 아니라, 치료될 각각의 환자, 예를 들어 환자의 일반적인 건강 상태 등에 따라, 치료 또는 예방 효과를 유도하기 위해 본 발명에 따른 약제의 상이한 투여량이 필요하다는 것을 이해한다. 적절한 투여량의 결정은 주치의의 재량에 의한다. 본 발명에 따른 약제가 예방적으로 사용될 경우, 이는 백신으로서 제형화될 수 있다. 이 경우, 본 발명에 따른 약제는 바람직하게는 상기에 요약된 바람직한 및 특히 바람직한 투여량으로 투여된다. 바람직하게는, 백신의 투여는 각각의 질병의 발병 위험이 감소하도록 하기 위해, 백신화 대상체가 본 발명에 따른 약제에 대해 충분한 항체를 생성할 때까지 소정의 기간 동안 적어도 2, 3, 4, 5, 6, 7, 8, 9 또는 적어도 10회 반복된다. 이 경우의 기간은 일반적으로 백신의 항원성에 따라 변동된다. 바람직하게는 기간은 4주, 3개월, 6개월 또는 3년 이하이다. 일 실시형태에서, 본 발명에 따른 아데노바이러스가 백신화 목적으로 사용되는 경우, 헥손 단백질의 초가변 도메인 중 적어도 하나는 백신화가 유도되는 각각의 질병 제제의 면역원성 에피토프로 대체될 수 있다. 백신은 통상적으로 상기 요약된 하나 이상의 애주번트를 포함한다. 백신화를 위한 아데노바이러스의 사용 및 이에 관한 방법에 대한 상세한 요약은 문헌[Bangari DS and Mittal SK (2006) Vaccine, 24(7), p. 849-862]에 제공되고; 또한: 문헌[Zhou D, et al., Expert Opin Biol Ther. 2006 Jan;6(1):63-72; 및 Folgori A, et al., Nat Med. 2006 Feb;12(2):190-7.]을 참조하며; 또한: 문헌[Draper SJ, et al., Nat Med. 2008 Aug;14(8):819-21. Epub 2008 Jul 27을 참조한다.
제10 양상에서, 본 발명은
(i) 세포에서 제1 또는 제2 양상의 단리된 폴리뉴클레오타이드를 발현시켜, 아데노바이러스 또는 아데노바이러스-유사 입자가 세포내에서 조립되는 단계; 및
(ii) 세포 또는 세포 주위의 배지로부터 아데노바이러스 또는 아데노바이러스-유사 입자를 단리하는 단계를 포함하는, 아데노바이러스 또는 아데노바이러스-유사 입자를 생성하는 시험관내 방법에 관한 것이다.
본 방법은 선택적으로 예를 들어 상기에 기재된 바와 같이, 세포내로 제1 또는 제2 양상의 단리된 폴리뉴클레오타이드 또는 제6 양상의 벡터를 도입시키는 단계 (i) 전에 추가의 단계를 포함한다.
일반적으로 단리된 폴리뉴클레오타이드는 제4 양상의 아데노바이러스 또는 제5 양상의 바이러스-유사 입자를 인코딩하는 것이 바람직하다. 아데노바이러스는 바람직하게는 복제-불능이다. 세포는 바람직하게는 제7 양상의 세포이다. 단리된 폴리뉴클레오타이드가 복제-불능 아데노바이러스를 인코딩하는 경우, 세포는 본 명세서에 기재된 바와 같은 보조 세포이거나, 보조 작제물 (예를 들어, 단계 (i) 전 또는 그 동안 보조 작제물에 의해 형질도입된, 예를 들어 보조 작제물에 의해 감염된 보조 플라스미드 또는 보조 바이러스)을 포함하는 것이 바람직하며, 보조 세포 또는 보조 작제물은 각각 아데노바이러스가 복제-불능이 되는 유전자/게놈 영역을 발현한다.
"아데노바이러스 또는 아데노바이러스 유사 입자가 세포 내에서 조립된다"는 단계 (i)에서, 본 명세서에 기재된 바와 같이 아데노바이러스 또는 아데노바이러스 유사 입자를 조립하는데 필요한 모든 유전자가 세포 내에서 발현된다는 것을 의미한다. 이는 아데노바이러스가 조립되는 경우 아데노바이러스를 패키징하는데 (즉, 게놈을 바이러스 캡시드로 패키징하는데) 필요한 모든 유전자를 포함한다.
바람직한 실시형태에서, 단리된 폴리뉴클레오타이드는 상기에 규정된 바와 같은 VA RNA II 비코딩 RNA 및/또는 VA RNA I 비코딩 RNA를 인코딩한다. 본 발명에 따른 VA RNA는 실시예 1에 도시된 바와 같은 방법의 개선된 아데노바이러스 또는 아데노바이러스 유사 입자 수율을 야기한다.
본 발명의 정의 및 추가의 실시형태
하기에서, 본 명세서에서 빈번하게 사용되는 용어의 일부 정의가 제공된다. 이들 용어는, 명세서의 나머지 부분에서 사용되는 각 경우에, 각각 규정된 의미 및 바람직한 의미를 가질 것이다.
본 명세서에 사용되는 바와 같이, 용어 "단리된"은 자연적으로 관련된 다른 분자가 실질적으로 부재하는 분자를 지칭한다. 특히, 단리된 것은 분자가 동물 신체 또는 동물 신체 샘플에 존재하지 않음을 의미한다. 따라서 단리된 분자에는 동물에서 연관되거나 접촉되는 다른 분자가 부재한다. 단리된 것은 본 명세서에 기재된 바와 같이 관련된 다른 성분으로부터 단리된 것을 의미하지 않으며, 예를 들어 이것이 포함된 세포 또는 벡터로부터 단리되거나, 포함된 분자가 조성물의 다른 성분으로부터 단리된 것을 의미하지 않는다.
용어 "폴리뉴클레오타이드"는 핵산, 즉 복수의 뉴클레오타이드로 구성된 생물학적 분자를 지칭하는 것으로 의도된다. 여기에는 DNA, RNA 및 합성 유사체, 예를 들어 PNA가 포함된다. DNA가 바람직하다.
용어 "오픈 리딩 프레임" (ORF)은 아미노산으로 번역될 수 있는 뉴클레오타이드 서열을 지칭한다. 일반적으로, ORF는 개시 코돈을 포함하며, 후속 영역이 일반적으로 다수의 3개의 뉴클레오타이드 길이를 갖지만, 소정의 리딩 프레임에 종결 코돈 (TAG, TAA, TGA, UAG, UAA 또는 UGA)을 포함하지 않는다. ORF는 번역될 수 있는 아미노산이 펩타이드 연결 사슬을 형성하는 단백질을 코딩한다.
본 명세서에 사용되는 바와 같이, 용어 "단백질", "펩타이드", "폴리펩타이드", "펩타이드들" 및 "폴리펩타이드들"은 전반에 걸쳐 상호 혼용된다. 이러한 용어는 천연 발생 펩타이드, 예를 들어 천연 발생 단백질 및 천연 또는 비 천연 발생 아미노산을 포함할 수 있는 합성 펩타이드 둘 모두를 지칭한다. 펩타이드는 또한 천연 또는 비 천연 발생 아미노산의 측쇄 또는 유리 아미노 또는 카복시-말단을 변형시킴으로써 화학적으로 변형될 수 있다. 이러한 화학적 변형은 글리코실화와 같은 아미노산의 측쇄에서 작용기의 변형뿐만 아니라, 추가의 화학적 모이어티의 첨가를 포함한다. 펩타이드는 바람직하게는 적어도 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95개 또는 적어도 100개의 아미노산, 가장 바람직하게는 적어도 8 또는 적어도 30개의 아미노산을 갖는 중합체이다. 본 명세서에 개시된 폴리펩타이드 및 단백질은 아데노바이러스로부터 유래되므로, 본 명세서에 사용되는 바와 같은 단리된 폴리펩타이드 또는 단백질의 분자량은 200 kDa를 초과하지 않는 것이 바람직하다.
아데노바이러스 (Ad)는 수종의 조류 및 포유류 숙주에서 확인된 무외피의 정이십면체형 바이러스이다. 인간 아데노바이러스 (hAd)는 공지된 모든 인간 및 동물 (예를 들어, 소, 돼지, 개, 쥐, 말, 유인원 및 양) 유래의 다수의 Ad를 포함하는 마스타데노바이러스 (Mastadenovirus) 속에 속한다. 인간 아데노바이러스는 일반적으로 랫트 및 붉은털 원숭이 적혈구의 적혈구 응집 특성, DNA 상동성, 제한효소 절단 패턴, G+C 함량 백분율 및 발암성을 포함하는 다수의 생물학적, 화학적, 면역학적 및 구조적 기준에 따라 6개의 하위그룹 (A 내지 F)으로 분류된다 (문헌[Straus, 1984, in The Adenoviruses, ed. H. Ginsberg, pps.451-498, New York: Plenus Press, and Horwitz, 1990; in Virology, eds. B. N. Fields and D. M. Knipe, pps. 1679-1721]).
아데노바이러스 비리온은 정이십면체 대칭을 가지며, 혈청형에 따라 직경이 60 내지 90 nm이다. 정이십면체 캡시드는 3개의 주요 단백질, 헥손 (II), 펜톤 염기 (III) 및 노브형 섬유 (knobbed fiber) (IV) 단백질을 포함한다 (문헌[W. C. Russel, J. Gen.Virol., 81: 2573-2604 (2000)]). 더욱 구체적으로, 아데노바이러스 캡시드는 252개의 캡소머를 포함하며, 이중 240개는 헥손이고, 12개는 펜톤이다. 헥손 및 펜톤은 3개의 상이한 바이러스 폴리펩타이드로부터 유래된다. 헥손은 3개의 동일한 폴리펩타이드, 즉 폴리펩타이드 II를 포함한다. 펜톤은 캡시드에 대한 부착 지점을 제공하는 펜톤 염기, 및 펜톤 염기에 비공유적으로 결합되고 돌출된 삼량체 섬유 단백질을 포함한다. 다른 단백질, 즉 단백질 IX, VI 및 IIIa도 일반적으로 아데노바이러스 캡시드에 존재한다. 이들 단백질은 바이러스 캡시드를 안정화시키는 것으로 여겨진다.
인간에서 관찰되는 기 존재하는 면역의 한 양상은 체액성 면역이며, 이는 아데노바이러스 단백질에 특이적인 항체의 생성 및 지속성을 야기할 수 있다. 아데노바이러스에 의해 유발된 체액성 반응은 주로 구조 단백질 헥손의 초가변 영역에 대해 유도된다. 비인간 대형 유인원으로부터 단리된 아데노바이러스는 인간 유래의 세포에서 그의 효율적인 증식에 의해 입증된 바와 같이 인간으로부터 단리된 아데노바이러스와 밀접하게 연관된다.
캡시드는 T- 및/또는 B 세포 에피토프와 같은 비-아데노바이러스 폴리펩타이드를 혼입시킴으로써 본 명세서에 기재된 바와 같이 변형될 수 있다.
용어 "헥손 단백질"은 아데노바이러스에 포함된 헥손 (II) 단백질을 지칭한다. 본 발명에 따른 헥손 단백질 또는 그의 변이체는 감염성 아데노바이러스 비리온에서 헥손 단백질 또는 그의 단편과 동일한 기능을 한다. 따라서, 바람직하게는 캡시드 단백질로서 상기 헥손 또는 그의 변이체를 포함하는 아데노바이러스는 숙주 세포에 유입될 수 있다. 헥손 단백질의 변이체를 생성하기 위한 적합한 방법은 미국 특허 제 5,922,315호에 기재되어 있다. 이 방법에서, 아데노바이러스 헥손의 적어도 하나의 루프 영역을 다른 아데노바이러스 혈청형의 적어도 하나의 루프 영역으로 변경시킨다. 재조합 아데노바이러스가 숙주 세포에 유입될 수 있는지는 용이하게 결정될 수 있다. 예를 들어, 숙주 세포를 아데노바이러스와 접촉시킨 후, 재조합 숙주 세포를 세척하고, 용해시킬 수 있으며, 예를 들어 아데노바이러스 RNA 및/또는 DNA에 특이적인 적절한 혼성화 프로브를 사용하여, 아데노바이러스 RNA 및/또는 DNA가 숙주 세포에서 존재하는지를 결정할 수 있다. 대안적으로 또는 추가로, 재조합 아데노바이러스와 접촉된 후 숙주 세포를 세척하고, 용해시키고, 예를 들어 웨스턴 블롯을 사용하여, 아데노바이러스 특이적 항체에 의해 프로빙할 수 있다. 또 다른 대안에서, 숙주 세포에서 유전자 산물을 발현시키기에 적합한 발현 카세트를 포함하는 재조합 아데노바이러스로 감염시 숙주 세포가 유전자 산물, 예를 들어 형광 단백질을 발현하는지를 예를 들어 생체내에서 관찰한다.
용어 "초가변 영역"은 헥손 단백질의 용매-노출된 표면에 위치하여 바이러스 캡시드의 외부에 노출된 것으로, 균주 사이에 서열 변화가 큰 도메인을 지칭한다. 이는 중화 항체의 주요 결정 인자이다. HVR은 예를 들어 다른 헥손 단백질과의 서열 정렬에 의해 확인될 수 있다.
"아데노바이러스 펜톤 단백질"은 아데노바이러스에 포함된 펜톤 염기 (III) 단백질을 의미한다. 아데노바이러스 펜톤 단백질은 이것이 캡시드의 정이십면체 대칭의 모서리에 위치하는 것을 특징으로 한다. 본 발명에 따른 펜톤 단백질 또는 그의 변이체는 감염성 아데노바이러스 비리온에서 펜톤 단백질과 동일한 기능을 한다. 따라서, 바람직하게는 캡시드 단백질로서 상기 펜톤 또는 그의 변이체를 포함하는 아데노바이러스는 숙주 세포에 유입될 수 있으며, 이는 상기 기재된 바와 같이 시험될 수 있다. 또한, 기능성 펜톤은 아데노바이러스 섬유 단백질에 친화도를 갖는다. 당업자는 단백질-단백질 친화도를 시험하는 방법을 잘 알고 있다. 제1 단백질이 제2 단백질에 결합할 수 있는지를 결정하기 위해, 예를 들어, 유전자 효모 2-하이브리드 분석 또는 생화학적 분석, 예컨대 풀다운 (pull-down), 효소-결합 면역흡착 분석 (ELISA), 형광-활성화 세포 분류 (FACS) 기반 분석 또는 플라스몬 공명 분석이 사용될 수 있다. 풀다운 또는 플라스몬 공명 분석이 사용되는 경우, 생화학 분야에 널리 공지된 바와 같이, HIS-태그, GST-태그 또는 다른 친화도 태그에 적어도 하나의 단백질을 융합시키는 것이 유용하다.
용어 "섬유 단백질"은 아데노바이러스에 포함된 노브형 섬유 (IV) 단백질을 지칭한다. 본 발명에 따른 섬유 단백질 또는 그의 변이체는 감염성 아데노바이러스 비리온에서 섬유 단백질 또는 그의 단편과 동일한 기능을 한다. 따라서, 바람직하게는 캡시드 단백질로서 상기 섬유 또는 섬유 변이체를 포함하는 아데노바이러스는 숙주 세포에 유입될 수 있으며, 이는 상기 기재된 바와 같이 시험될 수 있다. 또한, 기능성 섬유 단백질은 아데노바이러스 펜톤 단백질에 친화도를 갖는다. 또한, 글리코실화된 형태의 기능성 아데노바이러스 섬유 단백질은 삼량화할 수 있다. 따라서, 변이체가 글리코실화되고/거나 삼량체를 형성할 수 있는 것이 또한 바람직하다. 삼량체화를 포함한 친화도는 상기 기재된 바와 같이 시험될 수 있으며, 글리코실화 분석은 또한 당 업계에 널리 공지되어 있다.
"VA (바이러스 연관) RNA"는 아데노바이러스에 존재하는 비코딩 유형이다. 이는 번역을 조절하는 역할을 한다. 이 RNA에는 VAI 또는 VA RNA I 및 VAII 또는 VA RNA II라는 2개의 카피가 있다. 2개의 VA RNA 유전자는 아데노바이러스 게놈에서 별개의 유전자이다. VA RNA I이 주요 종이며, VA RN AII는 더 낮은 수준으로 발현된다. 전사체는 폴리아데닐화되지 않으며, 둘 모두 PolIII에 의해 전사된다.
폴리뉴클레오타이드, 폴리펩타이드 또는 단백질 서열과 관련하여 용어 "동일성" 또는 "동일한"은 최대 대응성을 위해 정렬될 때 동일한 두 서열의 잔기의 수를 지칭한다. 구체적으로, 핵산 또는 아미노산 서열에 관계없이, 2개의 서열의 서열 동일성 백분율은 정렬된 2개의 서열 사이의 정확한 매칭 수를 더 짧은 서열의 길이로 나눈 후 100을 곱한 것이다. 2개의 서열을 정렬하는데 사용될 수 있는 정렬 툴은 당업자에게 널리 공지되어 있고, 예를 들어 월드 와이드 웹, 예를 들어 폴리펩타이드 정렬의 경우 Clustal Omega (http://www.ebi.ac.uk/Tools/msa/clustalo/) 또는 폴리뉴클레오타이드 정렬의 경우 MUSCLE (http://www.ebi.ac.uk/Tools/msa/muscle/) 또는 MAFFT (http://www.ebi.ac.uk/Tools/msa/ mafft/) 또는 폴리뉴클레오타이드 및 폴리펩타이드 정렬의 경우 WATER (http://www.ebi.ac.uk/Tools/psa/ emboss_water/)에서 입수할 수 있다. 2개의 서열 사이의 정렬은, 예를 들어 MAFFT의 경우 바람직하게는: Matrix: Blosum62, Gap Open 1.53, Gap Extend 0.123, WATER 폴리뉴클레오타이드의 경우 바람직하게는: MATRIX: DNAFULL, Gap Open: 10.0, Gap Extend 0.5 및 WATER 폴리펩타이드의 경우 바람직하게는 MATRIX: BLOSUM62, Gap Open: 10.0, Gap Extend: 0.5의 디폴트 매개변수 설정을 사용하여 수행할 수 있다. 당업자는 만족스러운 정렬을 생성하기 위해 임의의 순서로 갭을 도입할 필요가 있을 수 있음을 이해한다. "최적의 서열 정렬"은 갭의 수가 가장 적고, 정렬된 잔기 중 동일한 것의 수가 가장 많은 경우의 정렬로 규정된다. 바람직하게는, 이는 정렬 내의 모든 서열의 모든 잔기를 포함하는 전체 정렬이다.
용어 "변이체"는 폴리펩타이드와 관련하여 일반적으로, 폴리펩타이드의 하나 이상의 아미노산이 결실, 삽입, 변형 및/또는 치환될 수 있는 폴리펩타이드의 변형된 형태, 예를 들어 돌연변이를 지칭한다. 일반적으로, 변이체는 기능적이며, 이는 기능적 변이체를 포함하는 아데노바이러스가 숙주 세포를 감염시킬 수 있음을 의미한다. 더욱 구체적인 기능이 본 명세서에 규정되어 있으며, 일반 정의보다 우선한다. "돌연변이" 또는 "아미노산 돌연변이"는 아미노산 치환, 결실 및/또는 삽입일 수 있다 ("그리고" 하나 이상의 돌연변이가 존재하는 경우 적용될 수 있음). 바람직하게는 이는 치환 (즉, 보존적 또는 비 보존적 아미노산 치환), 더욱 바람직하게는 보존적 아미노산 치환이다. 일부 실시형태에서, 치환은 또한 천연 발생 아미노산과 비 천연 발생 아미노산의 교체를 포함한다. 보존적 치환은 하나의 아미노산이, 치환된 아미노산과 유사한 화학적 특성을 갖는 다른 아미노산으로 치환된 것을 포함한다. 바람직하게는, 보존적 치환은 다음으로 이루어진 군으로부터 선택되는 치환이다:
(i) 염기성 아미노산의 또 다른 상이한 염기성 아미노산으로의 치환;
(ii) 산성 아미노산의 또 다른 상이한 산성 아미노산으로의 치환;
(iii) 방향족 아미노산의 또 다른 상이한 방향족 아미노산으로의 치환;
(iv) 비극성 지방족 아미노산의 또 다른 상이한 비극성 지방족 아미노산으로의 치환; 및
(v) 극성 비하전 아미노산의 또 다른 상이한 극성 비하전 아미노산으로의 치환.
염기성 아미노산은 바람직하게는 아르기닌, 히스티딘 및 리신으로 이루어진 군으로부터 선택된다. 산성 아미노산은 바람직하게는 아스파테이트 또는 글루타메이트이다. 방향족 아미노산은 바람직하게는 페닐알라닌, 티로신 및 트립토판으로 이루어진 군으로부터 선택된다. 비 극성 지방족 아미노산은 바람직하게는 글리신, 알라닌, 발린, 류신, 메티오닌 및 이소류신으로 이루어진 군으로부터 선택된다. 비하전된 극성 아미노산은 바람직하게는 세린, 트레오닌, 시스테인, 프롤린, 아스파라긴 및 글루타민으로 이루어진 군으로부터 선택된다. 보존적 아미노산 치환과 달리, 비 보존적 아미노산 치환은 상기에 요약된 보존적 치환 (i) 내지 (v)가 아닌 하나의 아미노산에서 임의의 아미노산으로의 교체이다.
서열 동일성을 결정하기 위한 수단은 상기에 기재되어 있다.
단백질의 아미노산은 또한 변형, 예를 들어 화학적으로 변형될 수 있다. 예를 들어, 단백질 또는 폴리펩타이드의 아미노산의 측쇄 또는 유리 아미노 또는 카복시-말단은 예를 들어 글리코실화, 아미드화, 인산화, 유비퀴틴화 등에 의해 변형될 수 있다. 화학적 변형은 생체내, 예를 들어 숙주 세포 내에서 이루어질 수 있으며, 이는 당 업계에 널리 공지된 바와 같다. 예를 들어, 단백질의 아미노산 서열에 존재하는 적절한 화학적 변형 모티프, 예를 들어 글리코실화 서열 모티프는 단백질의 글리코실화를 유발할 것이다. 변형이 변형된 아미노산의 동일성을 변화시키지 않으면 (예를 들어, 치환 또는 결실), 변형된 폴리펩타이드는 특정 서열번호와 관련하여 언급된 바와 같이 폴리펩타이드의 범위 내에 있으며, 즉 이는 본 명세서에 규정된 바와 같은 변이체가 아니다.
용어 "변이체"는 폴리뉴클레오타이드와 관련하여, 일반적으로, 폴리뉴클레오타이드의 하나 이상의 뉴클레오타이드가 결실, 삽입, 변형 및/또는 치환될 수 있는 폴리뉴클레오타이드의 변형된 형태, 예를 들어 돌연변이를 지칭한다. 일반적으로, 변이체는 기능적이며, 이는 기능적 변이체를 포함하는 아데노바이러스가 숙주 세포를 감염시킬 수 있음을 의미한다. 더욱 구체적인 기능이 본원에 규정되어 있으며, 일반 정의보다 우선한다. "돌연변이"는 뉴클레오타이드의 치환, 결실 및/또는 삽입일 수 있다 ("그리고" 하나 이상의 돌연변이가 존재하는 경우 적용될 수 있음). 바람직하게는 이는 치환이며, 더욱 바람직하게는 이는 아미노산 치환, 가장 바람직하게는 보존적 아미노산 치환을 야기한다.
"항원성 단백질 또는 그의 단편" (단편은 또한 항원성임)은 포유류에서 면역 반응을 유발할 수 있다. 바람직하게는 이는 종양 항원 또는 병원체 유래의 항원이다. 용어 "병원체"는 대상체에서 질병을 유발할 수 있는 임의의 유기체를 지칭한다. 이는 다음에 제한되는 것은 아니나, 박테리아, 원생동물, 진균, 선충, 비로이드, 바이러스 및 기생충을 포함 하며, 각 병원체는 자체적으로 또는 다른 병원체와 함께, 다음에 제한되는 것은 아니나 포유류를 포함하고, 다음에 제한되는 것은 아니나 인간을 포함하는 척추동물에서 질병을 유발할 수 있다. 본 명세서에 사용되는 바와 같이, 용어 "병원체"는 또한 일반적으로 비 면역 손상 숙주에서 병원성일 수 없지만, 면역 손상 숙주에서는 병원성인 유기체를 포함한다.
일반적으로 말하면, 아데노바이러스 게놈은 잘 특성화되어 있다. 아데노바이러스 게놈의 전체 체계에서, 특정 오픈 리딩 프레임이 유사하게 위치하는 것과 관련하여, 예를 들어 각 바이러스의 E1A, E1B, E2A, E2B, E3, E4, LI, L2, L3, L4 및 L5 유전자의 위치가 일반적으로 보존된다. 아데노바이러스 게놈의 각 말단은 바이러스 복제에 필요한 역위 말단 반복 (ITR)으로 공지된 서열을 포함한다. 바이러스는 또한 바이러스 인코딩 프로테아제를 포함하는데, 이는 감염성 비리온을 생성하는데 필요한 일부 구조 단백질을 처리하는데 필요하다. 아데노바이러스 게놈의 구조는 숙주 세포 형질도입 후에 바이러스 유전자가 발현되는 순서를 기반으로 하여 기재된다. 더욱 구체적으로, 바이러스 유전자는 전사가 DNA 복제의 개시 이전 또는 이후에 발생하는지에 따라 초기 (E) 또는 후기 (L) 유전자로 지칭된다. 형질도입의 초기 단계에서, 바이러스 복제를 위해 숙주 세포를 제조하기 위해, 아데노바이러스의 E1A, E1B, E2A, E2B, E3 및 E4 유전자가 발현된다. 감염의 후기 단계 동안, 바이러스 입자의 구조적 성분을 인코딩하는 후기 유전자 L1-L5의 발현이 활성화된다.
본 명세서에 사용되는 바와 같이, 용어 "벡터"는 플라스미드 벡터, 코스미드 벡터, 파지 벡터 예컨대, 람다 파지, 바이러스 벡터 예컨대, 아데노바이러스 (Ad) 벡터 (예를 들어, 당 업계, 예를 들어 WO 2005/071093 A2에 공지되어 있는 비 복제 Ad5, Ad11, Ad26, Ad35, Ad49, ChAd3, ChAd4, ChAd5, ChAd7, ChAd8, ChAd9, ChAd10, ChAd11, ChAd16, ChAd17, ChAd19, ChAd20, ChAd22, ChAd24, ChAd26, ChAd30, ChAd31, ChAd37, ChAd38, ChAd44, ChAd63 및 ChAd82 벡터 또는 복제 가능 Ad4 및 Ad7 벡터), 아데노-연관 바이러스 (AAV) 벡터 (예를 들어, AAV 5형), 알파바이러스 벡터 (예를 들어, 베네수엘라 말 뇌염 바이러스 (VEE), 신드비스열 바이러스 (SIN), 셈리키삼림열바이러스 (semliki forest virus) (SFV) 및 VEE-SIN 키메라), 헤르페스 바이러스 벡터, 홍역 바이러스 벡터, 폭스 바이러스 벡터 (예를 들어, 백시니아 바이러스, 변형된 백시니아 바이러스 앙카라 (MVA), NYVAC (백시니아 코펜하겐 균주로부터 유래) 및 조류폭스 (avipox) 벡터: 카나리폭스 (canarypox) (ALVAC) 및 계두 (FPV) 벡터), 및 수포성 구내염 바이러스 벡터, 바이러스성 입자 또는 박테리아 포자를 포함하여, 당업자에 공지된 임의의 벡터를 포함한다. 벡터는 또한 발현 벡터, 클로닝 벡터 및 숙주 세포에서 재조합 아데노 바이러스를 생성하는데 유용한 벡터를 포함한다.
상기 언급된 바와 같이, "이종성 단백질 또는 그의 단편"은 비-아데노바이러스 단백질 또는 그의 단편, 특히 항원성 단백질 또는 그의 단편일 수 있다. 이를 위해, 이종성 단백질을 인코딩하는 폴리뉴클레오타이드는 예를 들어 표적 세포로 전달될 분자, 예를 들어 항원성 단백질 또는 그의 단편, 바람직하게는 병원성 바이러스, 박테리아, 진균, 원생동물 또는 기생충과 같은 병원체 또는 종양 항원의 항원성 단백질 또는 그의 단편을 인코딩하는 폴리뉴클레오타이드일 수 있다. "항원"은 포유류에서 면역 반응을 유발할 수 있는 임의의 단백질 또는 펩타이드를 지칭한다. 항원은 바람직하게는 적어도 8개의 아미노산을 포함하고, 가장 바람직하게는 8 내지 12개의 아미노산을 포함한다.
용어 "발현 카세트"는 그의 전사 및 번역 제어 서열과 함께 발현될 적어도 하나의 핵산 서열을 포함하는 핵산 분자를 지칭한다. 발현 카세트를 변화시키면, 이를 포함하는 벡터가 상이한 서열 또는 서열 조합의 발현을 유도하게 할 것이다. 제한 부위는 5' 및 3' a말단에 존재하도록 조작되어 있기 때문에, 카세트를 용이하게 삽입되거나, 제거되거나, 다른 카세트로 교체될 수 있다. 바람직하게는, 발현 카세트는 소정의 유전자, 예컨대 프로모터, 개시 부위 및/또는 폴리아데닐화 부위의 효율적인 발현을 위한 시스형 (cis) 조절 요소를 포함한다. 본 발명과 관련하여 더욱 구체적으로, 발현 카세트는 숙주 세포에서 제1 또는 제2 양상의 폴리뉴클레오타이드의 발현에 필요한 모든 추가 요소를 포함한다. 따라서, 통상적인 발현 카세트는 제1 또는 제2 양상의 폴리뉴클레오타이드에 작동 가능하게 연결된 프로모터 및 전사체의 효율적인 폴리아데닐화에 필요한 신호, 리보솜 결합 부위 및 번역 종결 인자를 포함한다. 카세트의 추가 요소는 예를 들어 인핸서를 포함할 수 있다. 발현 카세트는 또한 효율적인 종결을 제공하기 위해 구조 유전자의 하류에 전사 종결 영역을 포함해야 한다. 종결 영역은 프로모터 서열과 동일한 유전자로부터 획득될 수 있거나, 상이한 유전자로부터 획득될 수 있다.
본 명세서에 사용되는 바와 같이, 용어 "미니유전자"는 천연 발생 유전자와 비교하여 유전자의 하나 이상의 기능적으로 비 필수적인 절편이 결실된 이종성 유전자 작제물을 지칭한다. "미니유전자 카세트"는 발현을 위한 미니유전자를 포함하는 발현 카세트이다.
용어 "복제-가능" 재조합 아데노바이러스 (AdV)는 세포에 포함된 임의의 재조합 보조 단백질의 부재 하에 숙주 세포에서 복제될 수 있는 아데노바이러스를 지칭한다. 바람직하게는, "복제-가능" 아데노바이러스는 다음과 같은 온전한 또는 기능적 필수 초기 유전자: E1A, E1B, E2A, E2B, E3 및 E4를 포함한다. 특정 동물로부터 단리된 야생형 아데노바이러스는 해당 동물에서 복제 가능할 것이다.
용어 "복제-결함" 또는 "복제-불능" 재조합 AdV는 적어도 기능적 결실, 즉, 그 전체가 제거되지 않은 상태에서 유전자의 기능을 손상시키는 결실, 예를 들어 인위적 종결 코돈의 도입, 활성 부위 또는 상호작용 도메인의 결실 또는 돌연변이, 유전자의 조절 서열의 돌연변이 또는 결실 등 또는 E1, E2, E3 및 E4로부터 선택되는 하나 이상의 아데노바이러스 유전자와 같이, 바이러스 복제에 필수적인 유전자 산물을 인코딩하는 유전자의 완전한 제거를 포함하도록 조작되어, 복제가 불가능하게 된 아데노바이러스를 지칭한다. 본 발명의 재조합 아데노바이러스 바이러스는 바람직하게는 복제 결함이다.
용어 "재조합 아데노바이러스"는 특히, 이종성 폴리뉴클레오타이드 및/또는 폴리펩타이드 서열을 포함하도록 변형된 아데노바이러스를 지칭한다. "이종성"은 다른 아데노바이러스 균주, 특히 상이한 숙주 (예를 들어, 인간 숙주, 따라서 인간 아데노바이러스, 예컨대 Ad3 또는 Ad5), 또는 비-아데노바이러스 유기체, 예컨대 본 명세서에 기재된 바와 같은 병원체, 또는 인간 종양 항원과 같은 인간으로부터의 균주를 의미할 수 있다. 이와 같이, 본 용어는 각각 키메라 및 전달체 아데노바이러스를 포함한다. 재조합 아데노바이러스는 다른 아데노바이러스 또는 비-아데노바이러스 유기체로부터의 이종성 폴리뉴클레오타이드 및/또는 폴리펩타이드 서열을 포함할 수 있으며, 즉 이는 키메라 및 전달체 아데노바이러스 둘 모두일 수 있다.
본 명세서에 사용되는 바와 같이, 용어 "바이러스-유사 입자" 또는 "VLP"는 이 경우 아데노바이러스로부터 유래된 비어 있는 비 복제 바이러스 껍질을 지칭한다. VLP는 일반적으로 다음에 제한되는 것은 아니나, 캡시드, 코팅, 껍질, 표면 및/또는 외피 단백질로 지칭되는 단백질과 같은 하나 이상의 바이러스 단백질로 구성된다. 이는 바이러스에 의한 세포 침투에 관여하는 기능성 바이러스 단백질을 포함하고 있어, 효율적인 세포 유입을 가능하게 한다. VLP는 적절한 발현 시스템에서 단백질의 재조합 발현시 자발적으로 형성될 수 있다. 특정 VLP를 생성하는 방법은 당 업계에 공지되어 있다. 아데노바이러스 VLP는 특히 기능적 손상, 예를 들어 바이러스 DNA 패킹에 관여하는 아데노바이러스의 Iva2 유전자로의 널 돌연변이의 도입 또는 결실에 의해 생성될 수 있다 (문헌[Ostapchuk et al. J Virol. 2011 Jun; 85(11): 5524-5531]). VLP의 존재는 전자 현미경법, X-선 결정학 등과 같은 당 업계에 공지된 통상적인 기법을 사용하여 검출될 수 있다. 예를 들어, 문헌[Baker et al., Biophys. J. (1991) 60:1445-1456; Hagensee et al., J. Virol. (1994) 68:4503-4505]을 참조한다. 예를 들어, 해당 VLP 제조물의 수성 샘플에 대해 글라스 하에 극저온 전자 현미경법을 수행할 수 있으며, 적절한 노출 조건 하에서 이미지를 기록하였다.
VLP에 포함된 "바이러스 게놈 DNA의 실질적 부재"는 VLP에 바이러스 게놈 DNA가 존재하지 않거나, VLP에 감염된 세포에서 바이러스 복제를 가능하게 하는 VLP 내의 바이러스 DNA가 충분하지 않고, 바이러스 복제가 이루어질 수 있도록 하는 VLP 내의 DNA를 보완하는 DNA를 발현하지 않는 것을 의미한다.
상기에 더하여, 항원 결정 인자로도 언급되는 "에피토프"는 면역 시스템, 구체적으로 항체, B 세포 또는 T 세포에 의해 인식되는 거대 분자의 절편이다. 본 발명과 관련하여, 바람직하게는, 용어 "에피토프"는 면역 시스템에 의해 인식되는 단백질 또는 폴리단백질의 절편을 지칭한다. 에피토프는 일반적으로 아미노산 또는 당 측쇄와 같은 분자의 화학적 활성 표면 기로 구성되며, 일반적으로 특정의 3차원 구조 특성뿐만 아니라, 특정 전하 특성을 갖는다. 입체형태 및 비 입체형태 에피토프는 변성 용매의 존재하에 전자의 경우 결합능이 존재하나, 후자의 경우 결합능을 상실한다는 점에서 구별된다.
"비-아데노바이러스 T 세포 에피토프"는 항원-제시 세포의 표면 상에 제시될 수 있는 에피토프이며, 이는 MHC 분자에 결합한다. 인간에서, 전문 항원-제시 세포는 MHC 클래스 II 펩타이드를 제시하도록 특화되어 있으며, 대부분의 핵 형성 체세포는 MHC 클래스 I 펩타이드를 제시한다. MHC 클래스 I 분자에 의해 제시된 T 세포 에피토프는 통상적으로 8 내지 11개의 아미노산 길이의 펩타이드이며, MHC 클래스 II 분자는 13 내지 17개의 아미노산 길이의 더 긴 펩타이드를 제시한다.
"비-아데노바이러스 B 세포 에피토프"는 B 세포에 의해 고유 항원의 표면 상의 3차원 구조로 인식되는 에피토프이다.
B-세포 및 T-세포 에피토프는 인실리코 툴 (in silico tool), 예를 들어 IEDB 분석 리소스의 온라인 B-세포 및 T-세포 예측 툴로 예측될 수 있다.
용어 "하나 이상의 비-아데노바이러스 B 세포 에피토프를 제시하다"는 하나 이상의 에피토프가 캡시드 내로 혼입되어, B 세포에 의해 인식되도록 하는 것을 의미한다. 용어 "하나 이상의 비-아데노바이러스 B/T 세포 에피토프를 혼입시키다"는 에피토프가 캡시드에 혼입되지 않았거나, 캡시드에 혼입된 상태에서 VLP에 포함됨을 의미한다. 이것이 캡시드에 혼입되어 있는 경우, 이는 면역 세포에 의해 인식될 수 있도록 외부에 제시되거나 제시되지 않을 수 있다.
"면역 애주번트" 또는 간단히 "애주번트"는 항원 단독의 투여와 비교하여, 항원/면역원에 대한 면역 반응의 특질 및/또는 강도를 가속화, 연장 및/또는 향상시킴으로써, 소정의 임의의 백신에 필요한 항원/면역원의 양 및/또는 대상 항원/면역원에 대한 적절한 면역 반응을 생성하기 위해 필요한 주사 횟수를 감소시키는 물질이다. 본 발명에 따른 조성물과 관련하여 사용될 수 있는 애주번트의 예는 수산화알루미늄 (alum)의 겔형 침전물; AlPO4; 알하이드로겔; 그람 음성 박테리아의 외막으로부터의 박테리아 산물, 특히 모노포스포릴 지질 A (MPLA), 리포다당류 (LPS), 뮤라밀 디펩타이드 (muramyl dipeptide) 및 이들의 유도체; Freund의 불완전 애주번트; 리포좀, 특히 중성 리포좀, 조성물 및 선택적으로 사이토카인을 포함하는 리포좀; 비이온성 블록 공중합체; ISCOMATRIX 애주번트 (문헌[Drane et al., 2007]); CpG 디뉴클레오타이드 (CpG 모티프), 특히 포스포로티오에이트 (PTO) 골격 (CpG PTO ODN) 또는 포스포디에스테르 (PO) 백본 (CpG PO ODN)을 갖는 CpG ODN을 포함하는 비 메틸화 DNA; 합성 리포펩타이드 유도체, 특히 Pam3Cys; 리포아라비노 만난; 펩티도 글리칸; 자이모산; 열 충격 단백질 (HSP), 특히 HSP 70; dsRNA 및 이들의 합성 유도체, 특히 폴리 I:폴리 C; 다가 양이온성 펩타이드, 특히 폴리-L-아르기닌; 탁솔; 피브로넥틴; 플라겔린; 이미다조퀴놀린; 애주번트 활성을 갖는 사이토카인, 특히 GM-CSF, 인터루킨-(IL-)2, IL-6, IL-7, IL-18, I 및 II형 인터페론, 특히 인터페론-감마, TNF-알파; 25-디하이드록시비타민 D3 (칼시트리올); 및 합성 올리고펩타이드, 특히 MHCII-제시 펩타이드이다. 폴리옥시에틸렌 (POE) 및 폴리옥시프로필렌 (POP)을 포함하는 비이온성 블록 중합체, 예컨대 POE-POP-POE 블록 공중합체가 애주번트로서 사용될 수 있다 (문헌[Newman et al., 1998]). 이러한 유형의 애주번트는 활성 구성성분으로서 핵산을 포함하는 조성물에 특히 유용하다.
본 발명과 관련하여 용어 "백신화"는 활성 면역화, 즉, 적합한 면역원성 제형 중 항원 (외래 물질임에 따라 면역원으로 인식되므로, 면역 시스템이 각각 백신화하는 물질)의 (예를 들어, 피하, 피내, 근육내, 경구, 비강) 투여에 의한 특이적 면역 반응의 유도이다. 따라서, 항원은 면역 시스템이 항원에 대한 특이적 면역 반응을 형성하기 위한 촉발 인자로 사용된다. 본 발명의 범주 내에서 백신화는 원칙적으로 치료적 의미뿐만 아니라, 예방적 의미로 수행될 수 있다. 여기에는 감염성 질병을 치료 또는 예방하기 위한 본 명세서에 기재된 병원체에 대한 백신화, 또는 암과 같은 비 감염성 질병을 치료 또는 예방하기 위한 백신화가 포함된다. 비 감염성 질병의 경우, 항원은 바람직하게는 세포막 항원, 특히 단지 발병 세포에 의해서는 발현되나, 비 발병 세포에 의해서는 발현되지 않는 것이 바람직하다. 예에는 종양 관련 항원이 있다. 이와 관련하여, 용어 "종양 관련 항원"은 주로 종양 세포에 의해 제시됨에 따라, 비 악성 조직으로부터의 분화를 가능하게 하는 구조를 의미한다. 바람직하게는, 이러한 종양 관련 항원은 종양 세포의 세포막 상 또는 그 내부에 위치한다. 종양 관련 항원의 예는 예를 들어 문헌[DeVita et al. (Eds., "Biological Therapy of Cancer", 2. Edition, Chapter 3: Biology of Tumor Antigens, Lippincott Company, ISBN 0-397-51416-6 (1995))]에 기재되어 있다.
본 명세서에 사용되는 바와 같이, "프라이밍(priming)"은 포유류에서 면역 반응을 유도/생성하기 위한 백신의 투여를 지칭하고, "부스팅"은 포유류에서 면역 반응을 향상시키기 위한 백신의 투여를 지칭한다. 어구 "이종성 프라이밍-부스팅"은 포유류에서 면역 반응을 유도/생성하기 위한 백신 (프라이밍) 및 포유류에서 면역 반응을 향상시키기 위한 백신 (부스팅)이 상이하다는 것을 의미한다. 이종성 프라이밍-부스팅은 대상체, 예를 들어 환자가 제1 벡터에 대한 항체를 발생시켰으며, 부스팅이 필요한 경우 유용하다. 이와 관련하여, 제1 백신에 의한 프라이밍 동안 유도되는 항체 반응이, 프라이밍 및 부스팅에 적용되는 동물 세포의 핵으로의 유입으로부터 부스팅을 위해 투여되는 제2 백신 입자의 70% 초과 또는 바람직하게는 80% 초과를 방지하지 않는 경우, 제1 (프라이밍) 및 제2 (부스팅) 백신, 예를 들어 아데노바이러스는 충분히 상이하다.
용어 "유전자 요법"은 환자의 임상 상태를 개선시키기 위한 목적으로 결함 유전자의 교정을 위해 외래 유전 물질을 세포, 조직 또는 기관에 직접 도입하는 개념으로 광범위하게 규정될 수 있다. 본 명세서에 사용되는 바와 같이, 용어 "유전자 요법"은 바람직하게는 "체성 요법"을 지칭하고, 세대에서 세대로 전달되는 유전적 변화를 유도하는 "생식선 요법"을 지칭하지 않으며, 체성 요법은 치료 효과를 치료되는 개체로 국한시킨다. 유전자 요법, 바람직하게는 체성 요법은 유기체로의 빠르고 쉬운 직접 유전자 전달 ("생체 내") 또는 치료 후 재이식되는 외식된 세포 또는 조직으로의 정교하지만 더욱 구체적이고 제어 가능한 유전자 전달 ("생체외" 또는 "시험관내")에 의해 추가로 구별될 수 있다.
용어 "중화 항체"는 아데노바이러스의 에피토프에 결합하여 숙주 세포에서 증식성 감염을 유발하는 것을 방지하거나, 전이유전자를 발현하는 복제 불능 벡터에 의한 표적 세포의 형질도입을 방지하는 항체를 지칭하고, 예를 들어 아데노바이러스 DNA는 세포, 특히 숙주 세포에 유입될 수 있다.
본 발명의 범위를 벗어남이 없이 본 발명의 다양한 수정 및 변형이 당업자에게 자명할 것이다. 본 발명은 바람직한 특정 실시형태와 관련하여 기재되었지만, 청구된 본 발명은 이러한 특정 실시형태에 과도하게 제한되어서는 안된다는 것을 이해해야 한다. 실제로, 당업자에 자명한 것으로, 본 발명을 수행하기 위한 기재된 방식의 다양한 변형이 본 발명에 의해 포함되는 것으로 의도된다.
본 발명은 하기 실시예에 의해 설명되며, 이는 단지 예시적이고, 본 발명의 범위를 제한하지 않는 것으로 해석되어야 한다.
실시예
실시예 1: 신규한 아데노바이러스 벡터의 단리
pGADNOU19 및 pGADNOU20 벡터의 작제를 하기 제공된 단계를 통해 진행하였다. pGADNOU19 및 pGADNOU20 벡터는 표준 절차를 사용하여 건강한 비인간 대형 유인원으로부터 채취한 분변 샘플로부터 단리된 야생형 아데노바이러스 균주로부터 유래하였다. 분변 추출물로 HEK 293 및 A549 세포의 단층을 접종하여 야생형 바이러스를 단리하였다. 세포 단층을 세포 변성 효과의 출현에 대해 매일 관찰하였다. 현미경으로 관찰하여 양의 값의 샘플을 수집한 후, 세포를 동결-해동 (-700℃/37℃)에 의해 용해시켰다. 이어서, 정화된 세포 용해물을 사용하여 바이러스 증식을 위해 단층의 새로운 세포를 감염시켰다. 바이러스 증폭의 2회 계대 후, 표준 절차를 사용하여 아데노바이러스를 정제하였다. 바이러스 게놈을 SDS/프로테아제 K 분해 및 페놀-클로로포름 추출에 의해 정제된 바이러스로부터 추출하였다. 정제된 아데노바이러스 DNA를 셔틀 플라스미드 벡터에 클로닝하여 다음과 같은 바이러스 게놈 결실을 수행함으로써 변형시켰다:
1) 바이러스 게놈의 E1 영역 (bp 461 내지 bp 3402)의 결실
2) 바이러스 게놈의 E3 영역 (bp 28472 내지 bp 31996)의 결실
실시예 2: GADNOU 셔틀 벡터의 생성
GADNOU 바이러스의 정제된 DNA 게놈을 먼저 시퀀싱 (sequencing)한 다음, DNA 서열 정보를 사용하여 상동성 재조합에 의해 GAd의 전체 게놈을 클로닝하기 위한 셔틀 벡터를 작제하였다. E1 영역 결실 (뉴클레오타이드 좌표: 461 내지 3402)을 도입시키기 위해 셔틀 벡터를 설계하였다. 간략하게, GADNOU 바이러스를 클로닝하는데 사용된 셔틀 벡터 (본 명세서에서 pGAd-GAG 셔틀로 지칭됨)를 다음과 같이 작제하였다:
a. EcoRI 및 SfiI로 분해한 올리고뉴클레오타이드 FW 5'-GAACTCCgaattcgtttaaaccatcatcaataatataccttattttggattgaggccaatatgataatgaggtgggcggggcgaggcggggcgggtgacgtagg-3' (서열번호 58) 및 RV 5'-cataatcGGCCGCAGCGGCCCGTCAG ATGACGGCGACAATAAA-3' (서열번호 59)에 의한 PCR에 의해 GAd-GAG 좌측 말단을 증폭시킨 후, EcoRI 및 SfiI로 분해한 pUC19 rc_MCS_좌측 말단_PIX_우측 말단_V1에 결찰시킴으로써, pUC19 L-ITR GAd-GAG를 생성시켰다.
b. PshA1 및 BamHI로 분해한 올리고뉴클레오타이드 FW 5'-Cataatcgacccgagtcgcactctcacagcaccagca-3' (서열번호 60) 및 RV 5'-GAACTCCggatccgtttaaacCATCATCAATAATATACCTTATTTTG-3' (서열번호 61)에 의한 PCR에 의해, GAd 우측 말단을 증폭시킨 후, PshA1 및 BamHI로 분해한 pUC19 L-ITR GAd-GAG에 결찰시킴으로써, pUC19 L/R-ITR GAd-GAG를 생성시켰다.
c. AsisI-AscI로 분해한 올리고뉴클레오타이드 FW 5'-Cataatcgcgatcgcgcttaggcctgaccatctgg-3' (서열번호 62) 및 RV 5'-GAACTCCggcgcgccTTAGGGGGAGGCAAGGCTG-3' (서열번호 63)에 의한 PCR에 의해 pIX 코딩 영역을 포함하는 DNA 단편을 증폭시킨 후, AsisI-AscI로 분해한 플라스미드 pUC19 L/R-ITR GAd-GAG에 클로닝시킴으로써, pUC19 L/R-ITR pIX GAd-GAG를 생성시켰다.
d. MscI-SfiI로 분해한 플라스미드 phCMV-GAG로부터 HCMV-GAG-BGH폴리A 카세트를 획득하였다. MscI-SfiI로 분해한 pUC19 L/R-ITR pIX GAd-GAG에 카세트를 클로닝시킨 후, 평활 말단이 되도록 함으로써, pGAd-GAG 셔틀을 생성시켰다.
e. 플라스미드 영역을 BAC 영역으로 대체함에 의한 셔틀 BAC의 작제: PmeI로 분해한 플라스미드 pBELO BAC RDL로부터 BAC 영역을 획득한 후, PmeI로 분해한 플라스미드 pGAd-GAG 셔틀에 클로닝시킴으로써, BAC GAd-GAG 셔틀을 생성시켰다.
f. 우측 말단 및 pIX 영역 사이에의 Amp-LacZ-SacB 선택 카세트의 삽입: AscI로 분해한 올리고뉴클레오타이드 FW (5'-GAACTCCGGCGCGCCTAGG GATAACAGGGTAAT ACCCCTATTTGTTTATTTTTCT-3', 서열번호 64) 및 RV (5'-CATAATCGGCGCGCCATTACCCTGTTATCCCTATTATTTGTTAACTGTTAA TTGTC-3', 서열번호 65)을 사용한 PCR에 의해 플라스미드 pChAd 셔틀 플라스미드로부터 Amp-LacZ-SacB 선택 카세트를 획득하였다. AscI로 분해한 BAC GAd-GAG 셔틀에 선택 카세트를 클로닝시킴으로써, BAC GAd-GAG A/L/S 셔틀을 생성시켰다 (도 1).
플라스미드 DNA로부터 바이러스 DNA가 방출되도록 하기 위해, 두 ITR의 말단에만 존재하는 제한효소 부위 (PmeI)를 포함하는 셔틀 플라스미드를 설계하였다.
실시예 3: ΔE1 벡터의 작제
GADNOU wt 게놈 DNA를 프로테아제 K 분해 후 페놀/클로로포름 추출에 의해 단리하였다. E. 콜라이 균주 BJ5183에서 상동성 재조합에 의해 pGADNOU19 및 pGADNOU20 벡터를 획득하였다. 정제된 WT 바이러스 DNA 및 BAC GAd-GAG A/L/S 셔틀로 E. 콜라이 균주 BJ5183 세포를 공동-형질전환시킴으로써 바이러스 DNA의 클로닝을 획득하였다. 셔틀 BAC (AscI로 분해함)의 말단에 존재하는 ITR DNA 서열 우측의 pIX 유전자 및 바이러스 게놈 DNA 사이의 상동성 재조합에 의해, 이를 BAC 벡터 내에 삽입하고, 동시에 E1 영역을 결실시키고, 발현 카세트로 치환함으로써, △E1/GAG (BAC) 벡터 GADNOU19 GAG BAC 및 GADNOU20 GAG BAC를 생성시킬 수 있다. 헥손 영역에서의 제한 분석 및 PCR 시퀀싱에 의해 스크리닝을 수행하였다.
실시예 4: △E3 벡터의 작제
작제 전략은 다음에 기재된 바와 같은 2개의 연속 단계를 기반으로 하였다:
a) Amp-LacZ-SacB 선택 카세트에 의한 E3 영역의 치환:
올리고뉴클레오타이드 FW (5'-GGATTACACCAAGATCTTTGCTGTC ATTTGTGTGCTGAGTATAATAAAGGCTGAGATCAGAATCTACTCGACCCCTATTTGTTTATTTTTCT-3', 서열번호 66) 및 RV (5'-CTTGCTATCAGA TTTCAAGTAAGTGATTTTTTATTGATTACAGTTATGATCAATTGAAAGGGATAAGGTCTTATTTGTTAACTGTTAATTGTC-3', 서열번호 67)를 사용하여 PCR에 의해 BAC GAd-GAG A/L/S 셔틀로부터 Amp-LacZ-SacB 선택 카세트를 획득하였다. PCR에 의해 획득된 DNA 단편을 그 후 GAdNou19 GAG BAC 및 GAdNou20 GAG BAC에 클로닝하고, 재조합 조작 기법에 의해 GAdNou19 GAG (DE1E3) A/L/S BAC 및 GAdNou20 GAG (DE1E3) A/L/S BAC를 획득하였다.
b) E3 영역 결실을 위한 Amp-lacZ-SacB 선택 카세트의 결실:
단일 가닥 올리고뉴클레오타이드 (5' ctgtcatttgtgtgctgagtataataaaggctgagatcagaatctactcggaccttatccctttcaattgatcataactgtaatcaataaaaaatcactt-3', 서열번호 68)를 사용하여, Amp-LacZ-SacB 선택 카세트를 결실시키고, Amp-LacZ-SacB 선택 카세트를 ss 올리고로 치환한 후, E3 영역을 결실시켰다. ss 올리고를 사용하여, 재조합 조작 기법에 의해 GADNOU19 GAG (DE1E3) A/L/S BAC 및 GADNOU20 GAG (DE1E3) A/L/S BAC에 선택 카세트를 대체함으로써, 최종 플라스미드 GADNOU19 GAG (DE1E3) BAC (도 2) 및 GADNOU20 GAG (DE1E3) BAC (도 3)를 생성시켰다.
실시예 5: 신규한 아데노바이러스 벡터의 증식성 향상
E1 결실을 보유하고, GAG 항원을 발현하는 2개의 비인간 대형 유인원 아데노바이러스 벡터, GADNOU19 및 GADNOU20의 증식성을 Hek293 부착성 세포에서 평가하였다. 동일한 발현 카세트를 보유하는 벤치마크 Ad5 벡터와 비교하여, MOI 100 및 MOI 300 vp/세포에서 정제된 바이러스로 T25 부착성 세포를 감염시켜 증식성을 평가하였다. 감염된 세포를 전체 세포 변성 효과가 명백한 감염 후 3일째에 수확하고; 3회 사이클의 동결/해동 (-70℃/37℃)에 의해 감염된 세포로부터 바이러스를 방출시키고, 이어서 용해물을 원심분리에 의해 정화시켰다. CMV 프로모터 영역에 상보적인 프로브 및 프라이머에 의한 정량적 PCR에 의해 정화된 용해물을 정량화하였다. 올리고 뉴클레오타이드 서열은 다음과 같다: CMVfw 5'-CATCTACGTATTAGTCATCGCTATTACCA-3' (서열번호 69), CMVrv 5'-GACTTGGAAATCCCCGTGAGT-3' (서열번호 70), CMVFAM-TAMRA 프로브 5'-ACATCAATGGGCGTGGATAGCGGTT-3' (서열번호 71). ABI Prism 7900 서열 검출기 - Applied Biosystem에서 QPCR을 구동시켰다. GAG를 발현하는 GADNOU 및 GADNOU19의 세포당 바이러스 입자 (vp/세포)에서 나타난 결과적인 특정 증식성은 동일한 발현 카세트를 보유하는 벤치마크 Ad5 벡터보다 유의하게 더 높았다 (도 1).
증식성 향상의 이론적 근거: 본 발명의 아데노바이러스 게놈은 면역 역가가 높은 것으로 공지된 아데노바이러스 그룹 C에 속한다. 동시에, 그룹 C 바이러스는 상대적으로 저조한 증식성을 특징으로 한다. 본 발명자들은 본 발명의 아데노바이러스 게놈이 많은 다른 그룹 C 아데노바이러스와 상이한 특정 게놈 특징을 포함한다는 것을 발견하였다. 이 특징은 게놈에 존재하는 한 쌍의 비코딩 RNA (이른바 바이러스 연관 (VA) RNA I 및 II)로 표시되고, 각각 약 170개의 뉴클레오타이드 길이이며, 약 60개의 뉴클레오타이드로 분리되어 있다. 일반적으로, VA RNA I 및 II가 모두 존재하지만, VA RNA I만이 존재하는 경우 (그룹 A 바이러스 및 일부 그룹 B 바이러스)가 존재한다. 이들 RNA는 바이러스의 세포 방어 메커니즘 간섭과 관련된 것으로 공지되어 있다. 또한, VA RNA I 및 II는 세포 효소에 의해 마이크로RNA로 추가 가공된다. 그러나 이들 마이크로RNA의 정확한 기능은 공지되어 있지 않다.
공지된 아데노바이러스의 서열을 분석함으로써, 본 발명자들은 본 발명의 게놈의 VA RNA I 및 II가 다른 그룹 C 아데노바이러스의 VA RNA I 및 II 서열 (예를 들어 인간 Ad5 및 Ad2뿐만 아니라, 그룹 C에 속하는 많은 침팬지 단리물)과 유사한 것이 아니라, 그룹 B 및 E의 VA RNA I 및 VA RNA II와 더 근사하게 유사함을 발견하였다. 그룹 내 및 그룹 간 VA RNA I 및 II 서열의 평균 서열 동일성을 계산하였으며, 이는 하기 표 3에 제시되어 있다.
따라서, 이들 RNA는 바이러스의 더 높은 복제를 야기하는 것으로 여겨진다. 본 발명자들의 최적 정보에 따르면, VA RNA는 현재까지는 아데노바이러스의 증식성 향상과 관련된 것으로 밝혀진 바 없다.
실시예 6: Gad 벡터 면역원성
HIV-1 gag (서열번호 74)를 인코딩하는 2개의 GADNOU 벡터 (GADNOU19 GAG (DE1E3), 서열번호 72 및 GADNOU20 GAG (DE1E3), 서열번호 73)의 면역원성을 BALB/c 마우스에서 평가하였다. 그룹당 6마리의 동물을 각 GADNOU 벡터의 증가된 투여량으로 근육내 면역화시켰다. 3주 후 HIV gag 메이저 H-2 Kd CD8+ 에피토프 (AMQMLKETI)를 인코딩하는 9량체 펩타이드를 항원으로 사용하여 수집된 비장 세포에서 ELISpot을 수행하였다. 데이터는 시험된 3x10^7 vp (바이러스 입자)의 최고 투여량에서 두 벡터에 의해 강한 면역원성이 유도되었음을 보여준다. 또한 3x10^6 vp의 저투여량에서, 두 벡터는 여전히 백신화된 마우스의 50%에서 HIV-1 gag 특이적 T 세포 반응을 유도할 수 있었다 (도 5).
SEQUENCE LISTING
<110> NOUSCOM
<120> GREAT APES ADENOVIRUS NUCLEIC ACID- AND AMINO ACID-SEQUENCES,
VECTORS CONTAINING SAME, AND USES THEREOF
<130> 854-10 PCT
<150> EP17179825.9
<151> 2017-07-05
<160> 74
<170> PatentIn version 3.5
<210> 1
<211> 37184
<212> DNA
<213> Great Ape Adenovirus
<400> 1
catcatcaat aatatacctt attttggatt gaggccaata tgataatgag gtgggcgggg 60
cgaggcgggg cgggtgacgt aggacgcgcg agtagggttg ggaggtgtgg cggaagtgtg 120
gcatttgcaa gtgggaggag ctgacatgca atcttccgtc gcggaaaatg tgacgttttt 180
gatgagcgcc gcctacctcc ggaagtgcca attttcgcgc gcttttcacc ggatatcgta 240
gtaattttgg gcgggaccat gtaagatttg gccattttcg cgcgaaaagt gaaacgggga 300
agtgaaaact gaataatagg gcgttagtca tagcgcgtaa tatttaccga gggccgaggg 360
actttgaccg attacgtgga ggactcgccc aggtgttttt tacgtgaatt tccgcgttcc 420
gggtcaaagt ctccgttttt attgtcgccg tcatctgacg cggagggtat ttaaacccgc 480
tgcgctccta aagaggccac tcttgagtgc cagcgagaag agttttctcc tccgctccgt 540
ttcggcgatc gaaaaatgag acatttagcc tgcactccgg gtcttttgtc cggccgggcg 600
gcgtccgagc ttttggacgc tttgctcaat gaggttctga gcgatgattt tccgtctact 660
acccacttta gcccacctac tcttcacgaa ctgtacgatc tggatgtact ggtggatgtg 720
aacgatccca acgaggaggc ggtttctacg ttttttcccg agtctgcgct tttggctgcc 780
caggagggat ttgacctaca cactccgccg ctgcctattt tagagtctcc gctgccggag 840
cccagtggta taccttatat gcctgaactg cttcccgaag tggtagacct gacctgccac 900
gagccgggct ttccgcccag cgacgatgag ggtgagcctt ttgctttaga ctatgctgag 960
atacctgggc tcggttgcag gtcttgtgca tatcatcaga gggttaccgg agaccccgag 1020
gttaagtgtt cgctgtgcta tatgaggctg acctcttcct ttatctacag taagtttttt 1080
tgtgtaggtg ggctttttgg gtaggtgggt tttgtggcag gacaggtgta aatgttgctt 1140
gtgttttttg tacctgcagg tccggtgtcc gagccagacc cggagcccga ccgcgatccc 1200
gagccggatc ccgagcctcc tcgcaggcca aggaaattac cttccatttt gtgcaagcct 1260
aagacacctg tgaggaccag cgaggcggac agcactgact ctggcacttc tacctctcct 1320
cctgaaattc acccagtggt tcctctgggt atacatagac ctgttgctgt tagagtttgc 1380
gggcgacgcc ctgcagtaga gtgcattgag gacttgctta acgatcccga gggacctttg 1440
gacttgagca ttaaacgccc taggcaataa accccaccta agtaataaac cccacctaag 1500
taataaactt taccgccctt ggttattgag atgacgccca atgtttgctt ttgaatgact 1560
tcatgtgtat aataaaagtg agtgtggtca taggtctctt gtttgtctgg gcggggttta 1620
agggtatata agtttctcgg ggctaaactt ggttacactt gaccccaatg gaggcgtggg 1680
ggtgcttgga ggagtttgcg gacgtgcgcc gtttgctgga cgagagctct agcaatacct 1740
atagtatttg gaggtatctg tggggctcta ctcaggccaa gttggtcttc agaattaagc 1800
aggattacaa gtgcgatttt gaagagcttt ttagttcctg tggtgagctt ttgcaatcct 1860
tgaatctggg ccaccaggct atcttccagg aaaaggttct ctcgactttg gatttttcca 1920
ctcccgggcg caccgccgct tgtgtggctt ttgtgtcttt tgtgcaagat aaatggagcg 1980
gggagaccca cctgagtcac ggctacgtgc tggatttcat ggcgatggct ctttggaggg 2040
cttacaacaa atggaagatt cagaaggaac tgtacggttc cgccctacgt cgtccacttc 2100
tgcagcggca ggggctgatg tttcccgacc atcgccagca tcagaatctg gaagacgagc 2160
gagcggagaa gatcagcttg agagccggcc tggaccctcc tcaggaggaa tgaatctccc 2220
gcaggtggtt gagctgtttc ccgaactgag acgggtcctg actatcaggg aggatggtca 2280
gtttgtgaag aagctgaaga gggatcgggg tgagggagat gatgaggcgg ctagcaattt 2340
agcttttagt ctgataactc gccaccgacc ggaatgtatt acctatcagc agattaagga 2400
gagttgtgcc aacgagctgg atcttttggg tcagaagtat agcatagaac agcttaccac 2460
ttactggctt cagcccgggg atgattggga agaggcgatt agggtgtatg caaaggtggc 2520
cctgcggccc gattgcaagt ataagattac taagttggtt aatattagaa actgctgcta 2580
tatttctgga aacggggccg aagtggagat agatactgag gacagggtgg ctattaggtg 2640
ttgcatgata aacatgtggc ccgggatact ggggatggat ggggtgatat ttatgaatgt 2700
gaggttcacg ggccccaact ttaatggtac ggtgttcatg ggcaacacca acttgctcct 2760
gcatggtgcg agtttctatg ggtttaacaa cacctgtata gaggcctgga ccgatgtaaa 2820
ggttcgaggt tgttcctttt atagctgttg gaaggcggtg gtgtgtcgcc ctaaaagcag 2880
gggttctgtg aagaaatgct tgtttgaaag gtgcacccta ggtatccttt ctgagggcaa 2940
ctccagggtg cgccataatg tggcttcgaa ctgcggttgc ttcatgcaag tgaagggggt 3000
gagcgttatc aagcataact cggtctgtgg aaactgcgag gatcgcgcct ctcagatgct 3060
gacctgcttt gatggcaact gtcacctgtt gaagaccatt catataagca gtcaccccag 3120
aaaggcctgg cccgtgtttg agcataacat tctgacccgc tgttccttgc atctgggggt 3180
caggaggggt atgttcctgc cttaccagtg taactttagc cacactaaaa tcctgctgga 3240
acccgagtgc atgactaagg tcagcctgaa tggtgtgttt gatgtgagtc tgaagatttg 3300
gaaggtgctg aggtatgatg agaccaggac caggtgccga ccctgcgagt gcggcggcaa 3360
gcacatgaga aatcagcctg tgatgttgga tgtgaccgag gagcttaggc ctgaccatct 3420
ggtgctggcc tgcaccaggg ccgagtttgg gtctagcgat gaggataccg attgaggtgg 3480
gtaaggtggg cgtggctagc agggtgggcg tgtataaatt gggggtctaa ggggtctctc 3540
tgtttgtctt gcaacagccg ccgccatgag cgacaccggc aacagctttg atggaagcat 3600
ctttagtccc tatctgacag tgcgcatgcc tcactgggcc ggagtgcgtc agaatgtgat 3660
gggttccaac gtggatggac gtcccgttct gccttcaaat tcgtctacta tggcctacgc 3720
gaccgtggga ggaactccgc tggacgccgc gacctccgcc gccgcctccg ccgccgccgc 3780
gaccgcgcgc agcatggcta cggaccttta cagctctttg gtggcgagca gcgcggcctc 3840
tcgcgcgtct gctcgggatg agaaactgac tgctctgctg cttaaactgg aagacttgac 3900
ccgggagctg ggtcaactga cccagcaggt ttccagcttg cgtgagagca gccttgcctc 3960
cccctaatgg cccataatat aaataaaagc cagtctgttt ggattaagca agtgtatgtt 4020
ctttatttaa ctctccgcgc gcggtaagcc cgggaccagc ggtctcggtc gtttagggtg 4080
cggtggattt tttccaacac gtggtacagg tggctctgga tgtttagata catgggcatg 4140
agtccatccc tggggtggag gtagcaccac tgcagagctt cgtgctcggg ggtggtgttg 4200
tatatgatcc agtcgtagca ggagcgctgg gcgtggtgct gaaaaatgtc cttaagcaag 4260
aggcttatag ctagggggag gcccttggtg taagtgttta caaatctgct tagctgggag 4320
gggtgcatcc ggggggatat gatgtgcatc ttggactgga tttttaggtt ggctatgttc 4380
ccgcccagat cccttctggg attcatgttg tgcaggacca ccagcacggt atatccagtg 4440
cacttgggaa atttatcgtg gagcttagac gggaatgcat ggaagaactt ggagacgccc 4500
ttgtggcctc ccagattttc catacattcg tccatgatga tggcaatggg cccgtgggaa 4560
gctgcctgag caaaaacgtt tctggcatcg ctcacatcgt agttatgttc cagggtgagg 4620
tcatcatagg acatctttac gaatcggggg cgaagggtcc cggactgggg gatgatggta 4680
ccctcgggcc ccggggcgta gttcccctca cagatctgca tctcccaggc tttcatttca 4740
gagggaggga tcatatccac ctgcggggcg atgaaaaaga cagtttctgg cgcaggggag 4800
attaactggg atgagagcag gtttctgagc agctgtgact ttccacagcc ggtgggccca 4860
tatatcacgc ctatcaccgg ctgcagctgg tagttaagag agctgcagct gccgtcctcc 4920
cggagcaggg gggccacctc gttgagcata tccctgacgt ggatgttctc cctgaccagt 4980
tccgccagaa ggcgctcgcc gcccagcgaa agcagctctt gcaaggaagc aaaatttttc 5040
agcggtttca ggccatcggc cgtgggcatg tttttcagcg tctgggtcag cagctccagc 5100
ctgtcccaga gctcggtgat gtgctctacg gcatctcgat ccagcagatc tcctcgtttc 5160
gcgggttggg gcggctttcg ctgtagggca ccagccgatg ggcgtccagc ggggccagag 5220
tcatgtcctt ccatgggcgc agggtcctcg tcagggtggt ctgggtcacg gtgaaggggt 5280
gcgctccggg ttgggcactg gccagggtgc gcttgaggct ggttctgctg gtgctgaatc 5340
gctgccgctc ttcgccctgc gcgtcggcca ggtagcattt gaccatggtc tcgtagtcga 5400
gaccctcggc ggcgtgcccc ttggcgcgga gctttccctt ggaggtggcg ccgcacgagg 5460
ggcactgcag gctcttcagg gcgtagagct tgggagcgag aaacacggac tctggggagt 5520
aggcgtccgc gccgcaggcc gagcagaccg tctcgcattc caccagccaa gtgagttccg 5580
ggcggtcagg gtcaaaaacc aggttgcccc catgcttttt gatgcgtttc ttaccttggc 5640
tctccatgag gcggtgtccc ttctcggtga cgaagaggct gtccgtgtcc ccgtagaccg 5700
acttcagggg cctgtcttcc agcggagtgc ctctgtcctc ctcgtagaga aactctgacc 5760
actctgagac gaaggcccgc gtccaggcca ggacgaagga ggccacgtgg gaggggtagc 5820
ggtcgttgtc cactagcggg tccaccttct ccagggtgtg caggcacatg tccccctcct 5880
ccgcgtccag aaaagtgatt ggcttgtagg tgtaggacac gtgaccgggg gttcccaacg 5940
ggggggtata aaagggggtg ggtgcccttt catcttcact ctcttccgca tcgctgtctg 6000
cgagagccag ctgctggggt aagtattccc tctcgaaggc gggcatgacc tcagcgctca 6060
ggttgtcagt ttctaaaaat gaggaggatt tgatgttcac ctgtccggag gtgatacctt 6120
tgagggtacc tgggtccatc tggtcagaaa acactatttt tttgttatca agcttggtgg 6180
cgaatgaccc gtagagggcg ttggagagca gcttggcgat ggagcgcagg gtctggtttt 6240
tgtcgcggtc ggctcgctcc ttggccgcga tgttgagttg cacgtactcg cgggccacgc 6300
acttccactc ggggaacacg gtggtgcgct cgtctgggat caggcgcacc ctccagccgc 6360
ggttgtgcag ggtgaccatg tcgacgctgg tggcgacctc accgcgcaga cgctcgttgg 6420
tccagcagag gcggccgccc ttgcgcgagc agaagggggg tagggggtcc agctggtcct 6480
cgtttggggg gtccgcgtcg atggtaaaga ccccggggag caggcgcggg tcaaagtagt 6540
cgatcttgca agcttgcatg tccagagccc gctgccattc gcgggcggcg agcgcgcgct 6600
cgtaggggtt gaggggcggg ccccagggca tggggtgggt gagcgcggag gcgtacatgc 6660
cgcagatgtc atacacgtac aggggttccc tgaggatacc gaggtaggtg gggtagcagc 6720
gccccccgcg gatgctggcg cgcacgtagt catagagctc gtgggagggg gccagcatgt 6780
tgggcccgag gttggtgcgc tgggggcgct cggcgcggaa gacgatctgc ctgaagatgg 6840
cgtgggagtt ggaggagatg gtgggccgct ggaagacgtt gaagcttgct tcttgcaagc 6900
ccacggagtc cctgacgaag gaggcgtagg actcgcgcag cttgtgcacc agctcggcgg 6960
tgacctggac gtcgagcgca cagtagtcga gggtctcgcg gatgatgtca tacctatcct 7020
cccccttctt tttccacagc tcgcggttga ggacgaactc ttcgcggtct ttccagtact 7080
cttggagggg aaacccgtcc gtgtccgaac ggtaagagcc tagcatgtag aactggttga 7140
cggcctggta ggggcagcag cccttctcca cgggcagcgc gtaggcctgc gccgccttgc 7200
ggagggaggt gtgggtgagg gcgaaagtgt ccctgaccat gactttgagg tattgatgtc 7260
tgaagtctgt gtcatcgcag ccgccctgtt cccacagggt gtagtccgtg cgctttttgg 7320
agcgcgggtt gggcagggag aaggtgaggt cattgaagag gatcttcccc gctcgaggca 7380
tgaagtttct ggtgatgcga aagggccctg ggaccgagga gcggttgttg atgacctggg 7440
cggccaggac gatctcgtca aagccgttta tgttgtgtcc cacgatgtag agctccagga 7500
agcggggctg gcccttgatg gaggggagct ttttaagttc ctcgtaggta agctcctcgg 7560
gcgattccag gccgtgctcc tccagggccc agtcttgcaa gtgagggttg gccgccagga 7620
aggatcgcca gaggtcgcgg gccatgaggg tctgcaggcg gtcgcggaag gttctgaact 7680
gccgccccac ggccattttt tcgggggtga tgcagtagaa ggtgaggggg tctttctccc 7740
aggggtccca tctgagctct cgggcgaggt cgcgcgcggc agcgaccaga gcctcgtcgc 7800
cccccagttt catgaccagc atgaagggca cgagttgctt gccaaaggct cccatccaag 7860
tgtaggtttc tacatcgtag gtgacaaaga ggcgctccgt gcgaggatga gagccgattg 7920
ggaagaactg gatctcccgc caccagttgg aggattggct gttgatgtgg tgaaagtaga 7980
agtcccgtct gcgggccgag cactcgtgct ggcttttgta aaagcgaccg cagtactggc 8040
agcgctgcac gggttgtata tcttgcacga ggtgaacctg gcgacctctg acgaggaagc 8100
gcagcgggaa tctaagtccc ccgcctgggg tcccgtgtgg ctggtggtct tttactttgg 8160
ttgtctggcc gccagcatct gtctcctgga gggcgatggt ggaacagacc accacgccgc 8220
gagagccgca ggtccagatc tcggcgctcg gcgggcggag tttgatgacg acatcgcgca 8280
cattggagct gtccatggtc tccagctccc gcggcggcag gtcagccggg agttcctgga 8340
ggttcacctc gcagagacgg gtcaaggcgc ggacagtgtt gagatggtat ctgatttcaa 8400
ggggcatgtt ggaggcggag tcgatggctt gcaggaggcc gcagccccgg ggggccacga 8460
tggttccccg cggggcgcga ggggaggcgg aagctggggg tgtgttcaga agcggtgacg 8520
cgggcgggcc cccggaggta gggggggttc cggccccaca ggcatgggcg gcaggggcac 8580
gtcttcgccg cgcgcgggca ggggctggtg ctggctccga agagcgcttg cgtgcgcgac 8640
gacgcgacgg ttggtgtcct gtatctggcg cctctgagtg aagaccacgg gtcccgtgac 8700
cttgaacctg aaagagagtt cgacagaatc aatctcggca tcgttgacag cggcctggcg 8760
caggatctcc tgcacgtcgc ccgagttgtc ctggtaggcg atttctgcca tgaactgctc 8820
gatctcttcc tcctggagat ctcctcgtcc ggcgcgctcc acggtggccg ccaggtcgtt 8880
ggagatgcga cccatgagct gcgagaaggc gttgagtccg ccctcgttcc agacccggct 8940
gtagaccacg cccccctcgg cgtcgcgggc gcgcatgacc acctgggcca ggttgagctc 9000
cacgtgtcgc gtgaagacgg cgtagttgcg caggcgctgg aaaaggtagt tcagggtggt 9060
ggcggtgtgc tcggcgacga agaagtacat gacccagcgc cgcaacgtgg attcattgat 9120
gtcccccaag gcctccaggc gctccatggc ctcgtagaag tccacggcga agttgaaaaa 9180
ctgggagttg cgagcggaca cggtcaactc ctcctccaga agacggatga gctcggcgac 9240
agtgtcgcgc acctcgcgct cgaaggccac ggggggcgct tcttcctctt ccacctcttc 9300
ttccatgatt gcttcttctt cttcctcagc cgggacggga gggggcggcg gcgggggagg 9360
ggcgcggcgg cggcggcggc gcaccgggag gcggtcgatg aagcgctcga tcatctcccc 9420
ccgcatgcgg cgcatggtct cggtgacggc gcggccgttc tcccgggggc gcagctcgaa 9480
gacgccgcct ctcatttcgc cgcggggcgg gcggccgtga ggtagcgaga cggcgctgac 9540
tatgcatctt aacaattgct gtgtaggtac gccgccaagg gacctgattg agtccagatc 9600
caccggatcc gaaaaccttt ggaggaaagc gtctatccag tcgcagtcgc aaggtaggct 9660
gagcaccgtg gcgggcgggg gcgggtcggg agagttcctg gcggagatgc tgctgatgat 9720
gtaattaaag taggcggtct tgagaaggcg gatggtggac aggagcacca tgtctttggg 9780
tccggcctgt tggatgcgga ggcggtcggc catgccccag gcctcgttct gacaccggcg 9840
caggtctttg tagtaatctt gcatgagtct ttccaccggc acttcttctc cttcctcttc 9900
ttcatctcgc cggtggtttc tcgcgccgcc catgcgcgtg accccaaagc ccctgagcgg 9960
ctgcagcagg gccaggtcgg cgaccacgcg ctcggccaag atggcctgct gtacctgagt 10020
gagggtcctc tcgaagtcat ccatgtccac gaagcggtgg taggcacccg tgttgatggt 10080
gtaggtgcag ttggccatga cggaccagtt gacggtctgg tgtcccggct gcgagagctc 10140
cgtgtaccgc aggcgcgaga aggcgcggga atcgaacacg tagtcgttgc aagtccgcac 10200
cagatactgg tagcccacca ggaagtgcgg cggaggttgg cgatagaggg gccagcgctg 10260
ggtggcgggg gcgccgggcg ccaggtcttc cagcatgagg cggtggtatc cgtagatgta 10320
cctggacatc caggtgatgc ctgcggcggt ggtggtggcg cgcgcgtagt cgcggacccg 10380
gttccagatg tttcgcaggg gcgagaagtg ttccatggtc ggcacgctct ggccggtgag 10440
gcgcgcgcag tcgttgacgc tctatacaca cacaaaaacg aaagcgttta cagggctttc 10500
gttctgtagc ctggaggaaa gtaaatgggt tgggttgcgg tgtgccccgg ttcgagacca 10560
agctgagctc agccggctga agccgcagct aacgtggtat tggcagtccc gtctcgaccc 10620
aggccctgta tcctccagga tacggtcgag agcccttttg ctttcttggc caagcgcccg 10680
tggcgcgatc tgggatagat ggtcgcgatg agaggacaaa agcggctcgc ttccgtagtc 10740
tggagaaaca atcgccaggg ttgcgttgcg gcgtaccccg gttcgagccc ctatggcggc 10800
ttggatcggc cggaaccgcg gctaacgtgg gctgtggcag ccccgtcctc aggaccccgc 10860
cagccgactt ctccagttac gggagcgagc cccttttgtt tttttatttt ttagatgcat 10920
cccgtgctgc ggcagatgcg cccctcgccc cggcccgatc agcagcagca acagcaggca 10980
tgcagacccc cctctcctct ccccgccccg gtcaccacgg ccgcggcggc cgtgtccggt 11040
gcggggggcg cgctggagtc agatgagcca ccgcggcggc gacctaggca gtatctggac 11100
ttggaagagg gcgagggact ggcgcggctg ggggcgagct ctccagagcg ccacccgcgg 11160
gtgcagttga aaagggacgc gcgtgaggcg tacctgccgc ggcaaaacct gtttcgcgac 11220
cgcgggggcg aggagcccga ggagatgcgg gactgcaggt tccaagcggg gcgcgagctg 11280
cgccgcggct tggacagaca gcgcctgctg cgcgaggagg actttgagcc cgacacgcag 11340
acgggcatca gccccgcgcg cgcgcacgtg gccgcggccg acctggtgac cgcctacgag 11400
cagacggtga accaggagcg caacttccaa aaaagcttca acaaccacgt gcgcacgctg 11460
gtggcgcgcg aggaggtgac cctgggtctc atgcatctgt gggacctggt ggaggcgatc 11520
gtgcagaacc ccagcagcaa gcccctgacc gcgcagctgt tcctggtggt gcagcacagc 11580
agggacaacg aggccttcag ggaggcgctg ctgaacatca ccgagccgga ggggcgctgg 11640
ctcctggacc tgataaacat cctgcagagc atagtggtgc aggagcgcag cctgagcctg 11700
gccgagaagg tggcggccat taactattct atgctgagcc tgggcaagtt ctacgctcgc 11760
aagatctaca agacccccta cgtgcccata gacaaggagg tgaagataga cagcttctac 11820
atgcgcatgg cgctgaaggt gctaaccctg agcgacgacc tgggagtgta ccgcaacgag 11880
cgcatccaca aggccgtgag cgccagccgg cggcgcgagc tgagcgaccg cgaactgatg 11940
cacagtctgc agcgcgcgct gaccggcgcg ggcgagggcg acagggaggt cgagtcctac 12000
tttgacatgg gggccgacct gcactggcag ccgagccgcc gcgccctgga agcggcgggg 12060
gcgtacggcg gccccctggc ggccgatgac gaggaagagg aggactatga gctagaggag 12120
ggcgagtacc tggaggactg acctggctgg tggtgttttg gtatagatgc aagatccgaa 12180
cgtggcggac ccggcggtcc gggcggcgct gcagagccag ccgtccggca ttaactcctc 12240
tgacgactgg gccgcggcca tgggtcgcat catggccctg accgcgcgca accccgaggc 12300
cttcaggcag cagcctcagg ctaaccggct ggcggccatc ttggaagcgg tagtgcccgc 12360
gcgctccaac cccacccacg agaaggtgct ggccatagtc aacgcgctgg cggagagcag 12420
ggccatccgg gcagacgagg ccggactggt gtacgatgcg ctgctgcagc gggtggcgcg 12480
gtacaacagc ggcaacgtgc agaccaacct ggaccgcctg gtgacggacg tgcgcgaggc 12540
cgtggcgcag cgcgagcgct tgcatcagga cggcaacctg ggctcgctgg tggcgctaaa 12600
cgccttcctt agcacccagc cggccaacgt accgcggggg caggaggact acaccaactt 12660
cttgagcgcg ctgcggctga tggtgaccga ggtccctcag agcgaggtgt accagtcggg 12720
gcccgactac ttcttccaga ccagcagaca gggcttgcaa accgtgaacc tgagccaggc 12780
tttcaagaac ctgcgggggc tgtggggagt gaaggcgccc accggcgacc gggctacggt 12840
gtccagcctg ctaaccccca actcgcgcct gctgctgctg ctgatcgcgc ccttcacgga 12900
cagcgggagc gtctcgcggg agacctatct gggccacctg ctgacgctgt accgcgaggc 12960
catcgggcag gcgcaggtgg acgagcacac cttccaggag atcaccagcg tgagccacgc 13020
gctggggcag gaggacacgg gcagcctgca ggcgaccctg aactacctgc tgaccaacag 13080
gcggcagaag attcccacgc tgcacagcct gacccaggag gaggagcgca tcttgcgcta 13140
cgtgcagcag agcgtgagcc tgaacctgat gcgcgacggc gtgacgccca gcgtggcgct 13200
ggacatgacc gcgcgcaaca tggaaccggg catgtacgct tcccagcggc cgttcatcaa 13260
ccgcctgatg gactacttgc atcgggcggc ggccgtgaac cccgagtact tcaccaatgc 13320
cattctgaat ccccactgga tgccccctcc gggtttctac aacggggact tcgaggtgcc 13380
tgaggtcaac gatgggttcc tctgggatga catggatgac agtgtgttct cccccaaccc 13440
gctgcgcgcc gcgtctctgc gattgaagga gggctctgac agggaaggac caaggagtct 13500
ggcctcctcc ctggctctgg gggcggtggg cgccacgggc gcggcggcgc ggggcagcag 13560
ccccttcccc agcctggcgg actctctgaa tagcgggcgg gtgagcaggc cccgcttgct 13620
aggcgaggag gagtatctga acaactccct gctgcagccc gtgagggaca aaaacgctca 13680
gcggcagcag tttcccaaca atgggataga gagcctggtg gacaagatgt ccagatggaa 13740
gacgtatgcg caggagtaca aggagtggga ggaccgccag ccgcggcccc tgccgccccc 13800
tagacagcgc tggcagcggc gcgcgtccaa ccgccgctgg aggcaggggc ccgaggacga 13860
tgatgactct gcagatgaca gcagcgtgtt ggacctgggc gggagcggga accccttttc 13920
gcacctgcgc ccacgcctgg gcaagatgtt ttaaaagaga aaaataaaaa ctcaccaagg 13980
ccatggcgac gagcgttggt tttttgttcc cttccttagt atgcggcgcg cggcgatgtt 14040
cgaggagggg cctcccccct cttacgagag cgcgatggga atttctcctg cggcgcccct 14100
gcagcctccc tacgtgcctc ctcggtacct gcaacctaca ggggggagaa atagcatctg 14160
ttactctgag ctgcagcccc tgtacgatac caccagactg tacctggtgg acaacaagtc 14220
cgcggacgtg gcctccctga actaccagaa cgaccacagc gattttttga ccacggtgat 14280
ccaaaacaac gacttcaccc caaccgaggc cagtacccag accataaacc tggacaacag 14340
gtcgaactgg ggcggcgacc tgaagactat cctgcacacc aatatgccca acgtgaacga 14400
gttcatgttc accaactctt ttaaggcgcg ggtgatggtg gcgcgcgagc agggggaggc 14460
gaagtacgag tgggtggact tcacgctgcc cgagggcaac tactcagaga ccatgactct 14520
cgacctgatg aacaatgcga tcgtggaaca ctatctgaaa gtgggcaggc agaacggggt 14580
gaaggagagc gatatcgggg tcaagtttga caccagaaac ttccgtctgg gctgggaccc 14640
tgtgaccggg ctggtcatgc cgggggtcta caccaacgag gcctttcatc ccgatatagt 14700
gctcctgccc ggctgtgggg tggacttcac ccagagccgg ctgagcaacc tgctgggcgt 14760
tcgcaagcgg caacctttcc aggagggttt caagatcacc tatgaggatc tggagggggg 14820
caacattccc gcgctccttg atctggacgc ctacgaggag agcttgaaac ccgaggagag 14880
cgctggcgac agcggcgaga gtggcgagga gcaagccggc ggcggcggca gcgcgtcggt 14940
agaaaacgaa agtactcccg cagtggcggc ggacgctgcg gaggtcgagc cggaggccat 15000
gcagcaggac gcagaggagg gcgcgcagga ggacatgaac aatggggaga tcaggggcga 15060
cactttcgcc acccggggcg aagaaaaaga ggcagaggcg gcggcggcga cggcggaagc 15120
cgaaaccgag gcagaggcag agcccgagac cgaagttatg gaagacatga atgatggaga 15180
acgtaggggt gacacgtttg ccacccgggg cgaagagaag gcggcggagg cagaagccgc 15240
ggctgaggag gcggctgcgg ctgcggccaa ggctgaggct gcggctgagg ctaaggtcga 15300
agccgatgtt gcggttgagg ctcaggctga ggaggaggcg gcggctgaag cagttaagga 15360
aaaggcccag gcagagcagg aagagaaaaa acctgtcatt caacctctaa aagaagatag 15420
caaaaagcgc agttacaacg tcattgaggg cagcaccttt acccaatacc gcagctggta 15480
cctggcttac aactacggcg acccggtcaa gggggtgcgc tcgtggaccc tgctctgcac 15540
gccggacgtc acctgcggct ccgagcagat gtactggtcg ctgccaaaca tgatgcaaga 15600
cccggtgacc ttccgttcca cgcggcaggt tagcaacttt ccggtggtgg gcgccgaact 15660
gctgccagta cactccaaga gtttttacaa cgagcaggcc gtctactccc agctgatccg 15720
ccaggccacc tctctgaccc acgtgttcaa tcgctttccc gagaaccaga ttttggcgcg 15780
cccgccggcc cccaccatca ccaccgtcag tgaaaacgtt cctgccctca cagatcacgg 15840
gacgctaccg ctgcgcaaca gcatctcagg agtccagcga gtgaccatta ctgacgccag 15900
acgccggacc tgcccctacg tttacaaggc cttgggcata gtctcgccgc gcgtcctctc 15960
cagtcgcact ttttaaaaca catccaccca cacgctccaa aatcatgtcc gtactcatct 16020
cgcccagcaa caacaccggc tgggggctgc gcgcacccag caagatgttt ggaggggcaa 16080
ggaagcgctc cgaccagcac cccgtgcgcg tgcgcggcca ctaccgcgcg ccctggggtg 16140
cgcacaagcg cgggcgcaca gggcgcacca ctgtggatga tgtcattgac tccgtagtgg 16200
agcaggcgcg ccactacaca cccggcgcgc cgaccgcctc cgccgtgtcc accgtggacc 16260
aggcgatcga aagcgtggta cagggggcgc ggcactatgc caaccttaaa agtcgccgcc 16320
gccgcgtggc gcgccgccat cgccggagac cccgggctac tgccgccgcg cgccttacca 16380
aggctctgct caagcgcgcc aggcgaactg gccaccgggc cgccatgagg gccgcacggc 16440
gggctgccgc tgccgcgagc gccgtggccc cgcgggcacg aaggcgcgcg gccgctgccg 16500
ccgccgccgc catttccagc ttggcctcga cgcggcgcgg taacatatac tgggtgcgcg 16560
actcggtgag cggcacacgt gtgcccgtgc gctttcgccc cccacggaat tagcacaaga 16620
caacatacac actgagtctc ctgctgttgt gtatcccagc ggcgaccgtc agcagcggcg 16680
acatgtccaa gcgcaaaatt aaagaagaga tgctccaggt catcgcgccg gagatctatg 16740
ggcccccgaa gaaggaggag gaggattaca agccccgcaa gctaaagcgg gtcaaaaaga 16800
aaaagaaaga tgatgacgtt gacgaggcgg tggagtttgt ccgccgcatg gcgcccaggc 16860
gccctgtgca gtggaagggt cggcgcgtgc agcgagtcct gcgccccggc accgcggtgg 16920
tctttacgcc cggcgagcgt tccacgcgca ctttcaagcg ggtgtacgat gaggtgtacg 16980
gcgacgagga tctgttggag caggccaacc atcgatttgg ggagtttgca tatgggaaac 17040
ggcctcgcga gagtctaaaa gaggacctgc tggcgctacc gctggacgag ggcaatccca 17100
ccccgagtct gaagccggtg accctgcaac aggtgctgcc tttgagcgcg cccagcgagc 17160
agaagcgagg gttaaagcgc gagggcgggg acctggcacc caccgtgcag ttgatggtgc 17220
ccaagcggca gaagctggag gacgtgctgg agaaaatgaa agtagagccc gggatccagc 17280
ccgagatcaa ggtccgccct atcaagcagg tggcgcccgg cgtgggagtc cagaccgtgg 17340
acgttaggat tcccacggag gagatggaaa cccaaaccgc cactccctct tcggcagcaa 17400
gcgccaccac cggcgccgct tcggtagagg tgcagacgga cccctggcta cccgccgcca 17460
ctatcgccgt cgccgccgcc ccccgttcgc gcggacgcaa gagaaattat ccagcggcca 17520
gcgcgcttat gccccagtat gcgctgcatc catccatcgc gcccaccccc ggctaccgcg 17580
ggtactcgta ccgcccgcgc agatcagccg gcactcgcgg ccgccgccgc cgtgcgacca 17640
caaccagccg ccgccgtcgc cgccgccgcc agccagtgct gacccccgtg tctgtaagga 17700
aggtggctcg ctcggggagc acgctggtgg tgcccagagc gcgctaccac cccagcatcg 17760
tttaaagccg gtctctgtat ggttcttgca gatatggccc tcacttgtcg ccttcgcttc 17820
ccggtgccgg gataccgagg aagaactcac cgccgcaggg gcatggcggg cagcggtctc 17880
cgcggcggcc gtcgccatcg ccggcgcgca aagagcaggc gcatgcgcgg cggtgtgttg 17940
cccctgctgg tcccgctact cgccgcggcg atcggcgccg tgcccgggat cgcctccgtg 18000
gccctgcagg cgtcccagaa acattgactc ttgcaacctt gcaagcttgc atttttggag 18060
gaaaaaataa aaaagtctag actctcacgc tcgcttggtc ctgtgactat tttgtagaaa 18120
aaagatggaa gacatcaact ttgcgtcgct ggccccgcgt cacggctcgc gcccgttcat 18180
gggagactgg acagatatcg gcaccagcaa tatgagcggt ggcgccttca gctggggcag 18240
tctgtggagc ggccttaaaa attttggttc caccattaag aactatggca acaaagcgtg 18300
gaacagcagc acgggtcaga tgctgagaga caagttgaaa gagcagaact tccaggagaa 18360
ggtggcgcag ggcctggcct ctggcatcag cggggtggtg gacatagcta accaggccgt 18420
gcagaaaaag ataaacagtc atctggaccc ccgccctcag gtggaggaaa cgcctccagc 18480
catggagacg gtgtctcccg agggcaaagg cgaaaagcgc ccgcggcccg acagggaaga 18540
gaccctggtg tcacacaccg aggagccgcc ctcttacgag gaggcagtca aggccggcct 18600
gcccaccact cgccccatag ctcccatggc caccggtgtg gtgggtcaca ggcaacacac 18660
ccccgcaaca ctagatctgc ccccgccgtc cgagccgact cgccagccaa aggcggtgac 18720
ggtgtccgct ccctccactt ccgccgccaa cagagtgcct ctgcgccgcg ctgcgagcgg 18780
cccccgggcc tcgcgagtca gcggcaactg gcagagcaca ctgaacagca tcgtgggcct 18840
gggagtgagg agtgtgaagc gccgccgttg ctactgaatg agcaagctag ctaacgtgtt 18900
gtatgtgtgt atgcgtccta tgtcgccgcc agaggagctg ttgagccgcc ggcgccgtct 18960
gcactccagc gaatttcaag atggcgaccc catcgatgat gcctcagtgg tcgtacatgc 19020
acatctcggg ccaggacgct tcggagtacc tgagccccgg gctggtgcag ttcgcccgcg 19080
ccacagacac ctacttcaac atgagtaaca agttcaggaa ccccactgtg gcgcccaccc 19140
acgatgtgac cacggaccgg tcgcagcgcc tgacgctgcg gttcatcccc gtggatcggg 19200
aggacaccgc ttactcttac aaggcgcggt tcacgctggc cgtgggcgac aaccgcgtgc 19260
tggacatggc ctccacttac tttgacatcc ggggggtgct ggacaggggc cccactttta 19320
agccctactc gggcactgcc tacaaccccc tggcccccaa gggcgccccc aattcttgtg 19380
agtgggaaca agaggaaaat caggtggtcg ctgcagatga tgaacttgaa gatgaagaag 19440
cgcaagcaca agaggaagcc cctgtgaaaa aaattcatgt atatgctcag gcgcctcttt 19500
ctggcgaaaa gatttccaag gatggtatcc aaataggtac tgaagtcgta ggagatacat 19560
ctaaggacac ttttgcagat aaaacattcc aacccgaacc tcagataggc gagtctcagt 19620
ggaacgaggc tgatgccaca gcagcaggag gtagagtttt gaaaaagact acccctatga 19680
gaccttgcta tggatcctat gccaggccta ccaatgccaa cgggggtcaa ggaattatgg 19740
ttgccaatga acaaggagtg ttggagtcta aagtagaaat gcaatttttc tctaacacca 19800
caacccttaa tgcgcgggat ggaaccggca atcccgaacc aaaggtggtg ttgtacagcg 19860
aagatgtcca cttggaatct cccgatactc atctgtctta caagcccaaa aaggatgatg 19920
ttaatgccaa aatcatgttg ggtcagcaag ccatgcccaa cagacccaac ctcattggat 19980
ttagagataa tttcattggg cttatgtttt acaacagcac cggtaacatg ggagtgctgg 20040
cgggtcaggc ctctcagttg aatgctgtgg tggacttgca ggatagaaac acagaactgt 20100
catatcagct tctgcttgat tcaattgggg atagaaccag atacttctcc atgtggaacc 20160
aggcagtgga tagctatgat ccagatgtca gaattattga aaaccatggg actgaggatg 20220
aactgcccaa ctactgcttc cctttgggcg gcataggagt tactgatact tatcaaggga 20280
taaaaaatac caatggcaat ggtcagtgga ccaaagatga tcagttcgcg gaccgcaacg 20340
aaataggggt gggaaacaac ttcgccatgg agatcaacat ccaggccaac ctttggagaa 20400
acttcctcta tgcaaacgtg gggctctacc tgccagacaa gctcaagtac aaccccacca 20460
acgtggacat ctctgacaac cccaacacct atgactacat gaacaagcgg gtggtggccc 20520
ctggcctggt ggactgcttt gtcaatgtgg gagccaggtg gtccctggac tacatggaca 20580
acgtcaaccc cttcaaccac caccgcaatg cgggtctgcg ctaccgctcc atgatcctgg 20640
gcaacgggcg ctatgtgccc tttcacatcc aggtacccca gaagttcttt gccatcaaga 20700
acctcctgct cctgcccggc tcctacacct acgagtggaa cttcaggaag gatgtgaaca 20760
tggtcctaca gagctctctg ggcaatgacc ttagggtgga tggggccagc atcaagtttg 20820
acagcatcac cctctatgct acatttttcc ccatggccca caacaccgcc tccacgcttg 20880
aggccatgct gagaaacgac accaacgacc agtcctttaa tgactacctc tctggggcca 20940
acatgctcta cccaatccca gccaaggcca ccaacgtgcc catctccatc ccctctcgca 21000
actgggccgc ctttagaggc tgggccttta cccgccttaa gaccaaggag accccctccc 21060
tgggctcggg ttttgatccc tactttgttt actcgggatc catcccctac ctggatggca 21120
ccttctacct caaccacact ttcaagaaga tatccatcat gtatgactcc tccgtcagct 21180
ggccgggcaa cgaccgcttg ctcaccccca atgagttcga ggtcaagcgc gccgtggacg 21240
gcgagggcta caacgtggcc cagtgcaaca tgaccaagga ctggttcctg gtgcagatgc 21300
tggccaacta caacataggc taccagggct tttacatccc agagagctac aaggacagga 21360
tgtactcctt cttcagaaat ttccaaccca tgagccgaca ggtggtggac gagaccaatt 21420
acaaggacta tcaagccatt ggcatcaccc accagcacaa caactcgggt ttcgtgggct 21480
acctggcgcc caccatgcgc gagggtcagg cctaccccgc caacttcccc taccccttga 21540
taggcaagac cgcggtcgac agcgtcaccc agaaaaagtt cctctgcgac cgcaccctct 21600
ggcgcatccc cttctctagc aacttcatgt ccatgggtgc gctcacggac ctgggccaaa 21660
acctgcttta tgccaactct gcccatgcgc tggacatgac ttttgaggtg gaccccatgg 21720
acgagcccac ccttctctat attgtgtttg aagtgttcga cgtggtcaga gtgcaccagc 21780
cgcaccgcgg tgtcatcgag accgtgtacc tgcgtacgcc cttctcagcc ggcaacgcca 21840
ccacctaagg agacagcgcc gccgccgcct gcatgacggg ttccaccgag caagagctca 21900
gggccattgc cagagacctg ggatgcggac cctatttttt gggcacctat gacaaacgct 21960
tcccgggctt tatctcccga gacaagctcg cctgcgccat tgtcaacacg gccgcgcgcg 22020
agaccggggg cgtgcactgg ctggcctttg gctgggaccc gcgctccaaa acttgctacc 22080
tctttgaccc ctttggcttc tccgatcagc gcctcaggca gatttatgag tttgagtacg 22140
aggggctgct gcgccgcagc gcgctcgcct cctcgcccga ccgctgcatc acccttgaga 22200
agtccaccga aaccgtgcag gggccccact cggccgcctg cggtctcttc tgttgcatgt 22260
ttttgcacgc ctttgtgcac tggcctcaga gtcccatgga ttgcaacccc accatgaact 22320
tgctaaaggg agtgcccaac gccatgctcc agagccccca ggtccagccc accctgcgcc 22380
gcaaccagga acagctttac cgcttcctgg agcgccactc cccctacttc cgcagccaca 22440
gcgcgcgcat ccggggggcc acctcttttt gccacttgca agaaaacatg caagacggaa 22500
aatgatgtac agcatgcttt taataaatgt aaagactgtg cactttaatt atacacgggc 22560
tctttctggt tatttattca acaccgccgt cgccatttag aaatcgaaag ggttctgccg 22620
tgcgtcgccg tgcgccacgg gcagagacac gttgcgatac tggaagcggc tcgcccactt 22680
gaactcgggc accaccatgc ggggcagtgg ttcctcgggg aagttctcgc tccacagggt 22740
gcgggtcagc tgcagcgcgc tcaggaggtc gggagccgag atcttgaagt cgcagttggg 22800
gccggaaccc tgcgcgcgcg agttgcggta cacggggttg cagcactgga acaccagcag 22860
ggccggatta ttcacgctgg ccagcaggct ctcgtcgctg atcatgtcgc tgtccagatc 22920
ctccgcgttg ctcagggcga atggggtcat cttgcagacc tgcctgccca ggaaaggcgg 22980
gagcccaggc ttgccgttgc agtcgcagcg caggggcatt agcaggtgcc cacggcccga 23040
ctgcgcctgc gggtacaacg cgcgcatgaa ggcttcgatc tgcctaaaag ccacctgggt 23100
cttggctccc tccgaaaaga acatcccaca ggacttgctg gagaactggt tcgcgggaca 23160
gctggcatcg tgcaggcagc agcgcgcgtc agtgttggca atctgcacca cgttgcgacc 23220
ccaccggttt ttcactatct tggccttgga agcctgctcc tttagcgcgc gctggccgtt 23280
ctcgctggtc acatccatct ctatcacctg ttccttgttg atcatgtttg tcccgtgcag 23340
acactttagg tcgccctccg tctgggtgca gcggtgctcc cacagcgcgc aaccggtggg 23400
ctcccaattc ttgtgggtca cccccgcgta ggcctgcagg taggcctgca ggaagcgccc 23460
catcatggtc ataaaggtct tctggctcgt aaaggtcagc tgcaggccgc gatgctcttc 23520
gttcagccag gtcttgcaga tggcggccag cgcctcggtc tgctcgggca gcatcttaaa 23580
atttgtcttc aggtcgttat ccacgtggta cttgtccatc atggcacgcg ccgcctccat 23640
gcccttctcc caggcggaca ccatgggcag gcttaggggg tttatcactt ccagcggcga 23700
ggacaccgta ctttcgattt cttcttcctc cccctcttcc cggcgcgcgc ccccgctgtt 23760
gcgcgctctt accgcctgca ccaaggggtc gtcttcaggc aagcgccgca ccgagcgctt 23820
gccgcccttg acctgcttga tcagtaccgg cgggttgctg aagcccacca tggtcagcgc 23880
cgcctgctct tcttcgtctt cgctgtctac cactatttct ggggaggggc ttctccgctc 23940
tgcggcaaag gcggcggatc gcttcttttt tttcttggga gccgccgcga tggagtccgc 24000
cacggcgacc gaggtcgagg gcgtggggct gggggtgcgc ggtaccaggg cctcgtcgcc 24060
ctcggactct tcctctgact ccaggcggcg gcggagtcgc ttctttgggg gcgcgcgcgt 24120
cagcggcggc ggagacgggg acggggacgg ggacgggacg ccctccacag ggggtggtct 24180
tcgcgcagac ccgcggccgc gctcgggggt cttctcgcgc tggtcttggt cccgactggc 24240
cattgtatcc tcctcctcct aggcagagag acataaggag tctatcatgc aagtcgagaa 24300
ggaggagagc ttaaccaccc cctcagagac cgccgatgcg cccgccgtcg ccgtcgcccc 24360
cgctaccgcc gacgcgcccg ccacaccgag cgacaccccc acggaccccc ccgccgacgc 24420
acccctgttc gaggaagcgg ccgtggagca ggacccgggc tttgtctcgg cagaggagga 24480
tttgcaagag gaggagaata aggaggagaa gccctcagtg ccaaaagatc ataaagagca 24540
agacgagcac gacgcagacg cacaccaggg tgaagtcggg cggggggacg gagggcatgg 24600
cggcgccgac tacctagacg aaggaaacga cgtgctcttg aagcacctgc atcgtcagtg 24660
cgccatcgtc tgcgacgctc tgcaggagcg cagcgaggtg cccctcagcg tggcggaggt 24720
cagccgcgcc tacgagctca gcctcttttc cccccgggtg cccccccgcc gccgcgaaaa 24780
cggcacatgc gagcccaacc cgcgcctcaa cttctacccc gcctttgtgg tgcccgaggt 24840
cctggccacc tatcacatct tctttcaaaa ttgcaagatc cccatctcgt gccgcgccaa 24900
ccgtagccgc gccgataaga tgctggccct gcgccagggc gaccacatac ctgatatcgc 24960
cgctttggaa gatgtgccaa agatcttcga gggtctgggg cgcaacgaga agcgggcagc 25020
aaactctctg caacaggaaa acagcgaaaa tgagagtcac actggagcgc tggtggagct 25080
ggagggcgac aacgcccgcc tggcggtgct caagcgcagc atcgaggtca cccactttgc 25140
ctaccccgcg ctcaacctgc cccccaaagt catgaacgcg gtcatggacg ggctgatcat 25200
gcgccgcggc cggcccctcg ctccagatgc aaacttgcat gaggagaccg aggacggtca 25260
gcccgtggtc agcgacgagc agctgacgcg ctggctggag agcgcggacc ccgccgaact 25320
ggaggagcgg cgcaagatga tgatggccgc ggtgctggtc accgtagagc tggagtgtct 25380
gcagcgcttc ttcggtgacc ccgagatgca gagaaaggtc gaggagaccc tacactacac 25440
cttccgccag ggctacgtgc gccaggcttg caagatctcc aacgtggagc tcagcaacct 25500
ggtgtcctac ctgggcatct tgcatgaaaa ccgccttggg cagagcgtgc tacactccac 25560
cctgcgcggg gaggcgcgcc gcgactacgt gcgcgactgc gtttacctct tcctctgcta 25620
cacctggcag acggccatgg gggtctggca gcagtgcctg gaggagcgca acctcaagga 25680
gctggagaag cttctgcagc gcgcgctcaa agacctctgg acgggcttca acgagcgctc 25740
ggtggccgcc gcgctagccg acctcatctt ccccgagcgc ctgctcaaaa ccctccagca 25800
ggggctgccc gacttcacca gccaaagcat gttgcaaaat tttaggaact ttatcctgga 25860
gcgttctggc atcctacccg ccacctgctg cgccctgccc agcgactttg tccccctcgt 25920
gtaccgcgag tgccccccgc cgctgtgggg ccactgctac ctgttccaac tggccaacta 25980
cctgtcctac cacgcggacc tcatggagga ctccagcggc gaggggctca tggagtgcca 26040
ctgccgctgc aacctctgca cgccccaccg ctccctggtc tgcaacaccc aactgctcag 26100
cgagagtcag attatcggta ccttcgagct acagggtccg tcctcctcag acgagaagtc 26160
cgcggctccg gggctaaaac tcactccggg gctgtggact tccgcctacc tgcgcaaatt 26220
tgtacctgaa gactaccacg cccacgaaat caggttttac gaggaccaat cccgcccgcc 26280
caaggcggag ctgaccgcct gcgtcatcac ccagggcgag atcctaggcc aattgcaagc 26340
catccaaaaa gcccgccaag agtttttgct gaagaggggt cggggggtgt atctggaccc 26400
ccagtcgggt gaggagctca acccggttcc cccgctgcca ccgccgcggg accttgcttc 26460
ccaggataag catcgccatg gctcccagaa agaagcagca gcggccgccg ctgccgccgc 26520
cccacatgct ggaggaagag gaggaatact gggacagtca ggcagaggag gtttcggacg 26580
aggaggagcc ggagacggag atggaagagt gggaggagga cagcttagac gaggaggctt 26640
ccgaagccga agaggcaggc gcaacaccgt caccctcggc cgcagccccc tcgcaggcgc 26700
ccccgaagtc cgctcccagc atcagcagca acagcagcgc tataacctcc gctcctccac 26760
cgccgcgacc cacggccgac cgcagaccca accgtagatg ggacaccacc ggaaccgggg 26820
ccggtaagtc ctccgggaga ggcaagcaag cgcagcgcca aggctaccgc tcgtggcgcg 26880
ctcacaagaa cgccatagtc gcttgcttgc aagactgcgg ggggaacatc tccttcgccc 26940
gccgcttcct gctcttccac cacggtgtgg ccttcccccg taacgtcctg cattactacc 27000
gtcatctcta cagcccctac tgcggcggca gtgagccaga ggcggccagc ggcggcggcg 27060
cccgtttcgg tgcctaggaa gacccagggc aagacttcag ccaagaaact cgcggcgacc 27120
gcggcgaacg cggtcgcggg ggccctgcgc ctgacggtga acgaacccct gtcgacccgc 27180
gaactgagga accgaatctt ccccactctc tatgccatct tccagcagag cagagggcag 27240
gatcaggaac tgaaagtaaa aaacaggtct ctgcgctccc tcacccgcag ctgtctgtat 27300
cacaagagcg aagaccagct tcggcgcacg ctggaggacg ctgaggcact cttcagcaaa 27360
tactgcgcgc tcactcttaa ggactagctc cgcgcccttc tcgaatttag gcgggaacgc 27420
ctacgtcatc gcagcgccgc cgtcatgagc aaggacattc ccacgccata catgtggagc 27480
tatcagccgc agatgggact cgcggcgggc gcctcccaag actactccac ccgcatgaac 27540
tggctcagtg ccggcccaca catgatctca caggttaatg acatccgcac ccatcgaaac 27600
caaatattgg tgaagcaggc ggcaattacc accacgcccc gcaataatcc caaccccagg 27660
gagtggcccg cgtccctggt gtatcaggaa attcccggcc ccaccaccgt actacttccg 27720
cgtgattccc aggccgaagt ccaaatgact aactcagggg cacagctcgc gggcggctgt 27780
cgtcacaggg tgcggcctcc tcgccagggt ataactcacc tggagatccg aggcagaggt 27840
attcagctca acgacgagtc ggtgagctcc tcgctcggtc tcagacctga cgggaccttc 27900
cagatagccg gagccggccg atcttccttc acgccccgcc aggcgtacct gactctgcag 27960
agctcgtcct cggcgccgcg ctcgggcggc atcgggactc tccagttcgt gcaggagttt 28020
gtgccctcgg tctacttcaa ccccttctcg ggctctcccg gtcgctaccc ggaccagttt 28080
atcccgaact ttgacgccgc gagggactcg gtggacggct acgactgaat gtcgggtgga 28140
cccggtgcag agcaacttcg cctgaagcac cttgaccact gccgccgccc tcagtgcttt 28200
gcccgctgtc agaccggtga gttccagtac ttttccctgc ccgactcgca cccggacggc 28260
ccggcgcacg gggtgcgctt tttcatcccg agtcaggtcc gctctaccct aatcagggag 28320
ttcaccgccc gtcccctact ggcggagttg gaaaaggggc cttctatcct aaccattgcc 28380
tgcatttgct ctaaccctgg attacaccaa gatctttgct gtcatttgtg tgctgagtat 28440
aataaaggct gagatcagaa tctactcggg ctcctgtcgc catcctgtca acgccaccgt 28500
ccaagcccgg cccgatcagc ccgaggtgaa cctcacctgt ggtctgcacc ggcgcctgag 28560
gaaataccta gcttggtact acaacagcac tccctttgtg gtttacaaca gctttgacca 28620
ggacggggtc tcactgaggg ataacctctc gaacctgagc tactccatca ggaagaacaa 28680
caccctcgag ctacttcctc cttacctgcc cgggacttac cagtgtgtca ccggcccctg 28740
cacccacacc cacctgttga tcgtaaacga ctctcttccg agaacagacc tcaataactc 28800
ctctccgcag ttccccagaa caggaggtga gctcaggaaa ccccgggtaa agaagggtgg 28860
acaagagtta acacttgtgg ggtttctggt atatgtgacg ctggtggtgg ctcttttgat 28920
taaggctttt ccttccatgt ctgaactatc cctcttcttt tatgaacaac tcgactagtg 28980
ctaacgggac cctacccaac gaatcgggat tgaatatcgg taaccaggtt gcagtttcac 29040
ttttgattac cttcatagtc ctcttcctgc tagtgctgtc gcttctgtgc ctgcggatcg 29100
ggggctgctg catccacgtt tatatctggt gctggctgtt tagaaggttc ggagaccacc 29160
gcaggtagaa taatgctgct taccctcttt gtcctggcgc tggctgccag ctgccaagcc 29220
ttttccgagg ctgacttcat agagccccag tgcaatatca cttataaatc tgaacgtgcc 29280
atctgtacta ttctaatcaa atgtgttact caacacgata aggtgactgt taaatacaaa 29340
gatcaattaa aaaaagacgc actttacagc agctggcaac caggagatga tcaaaaatac 29400
aatgtaaccg tcttccaggg caaactctcc aaaacttaca attacaattt cccatttgag 29460
cagatgtgtg actttgtcat gtacatggaa aagcagtaca agctgtggcc tccaactccc 29520
cagggctgtg tggaaaatcc aggctctttc tgtatgatct ctctctgtgt aactgtgctg 29580
gcactaatac tcacgcttct gtatctcaga tttaaatcaa ggcaaagctt cattgatgaa 29640
aagaaaatgc cataatcgct caacgcttga ttgctaacac cgggttttta tccgcagaat 29700
gattggaatc accctactaa tcacctccct ccttgcgatt gcccatgggt tggaacgaat 29760
cgaagtccct gtgggggcca atgttaccct ggtggggcct gtcggcaatg ctacattaat 29820
gtgggaaaaa tatactaaaa atcaatgggt ttcttactgc actaacaaaa acagccacaa 29880
gcccagagcc atctgcgatg ggcaaaatct aaccttgatt gatgttcaat tgctggatgc 29940
gggctactat tatgggcagc tgggtacaat gattaattac tggagacccc acagagatta 30000
catgcttcac gtagtaaagg gtcccattag cagcccaacc accacctcta ccacacccac 30060
taccaccact actcccacca ccagcactgc cgcccagcct cctcatagca gaacaaccac 30120
ttttatcaat tccaagtccc actcccccca cattgccggc gggccctccg cctcagactc 30180
cgagaccacc gagatctgct tctgcaaatg ctctgacgcc attgcccagg atttggaaga 30240
tcacgaggaa gatgagcatg actacgcaga tgcatgccag gcatcagagg cagaagcgct 30300
accggtggcc ctaaaacagt atgcagactc ccacaccacc cccaaccttc ctccaccttc 30360
ccagaagcca agtttcctgg gggaaaatga aactctgcct ctttccatac tagctctgac 30420
atctgttgct attttggccg ctctgctggt gcttctatgc tctatatgct acctgatctg 30480
ctgcagaaag aaaaaatctc acggccatgc tcaccagccc ctcatgcact tcccttaccc 30540
tccagagctg ggcgaccaca aactttaagt ctgcagtagc tatctgccca tcccttgtca 30600
gtcgacagcg atgagcccca ctaatctaac agcctctgga cttacaacat tgtctcttaa 30660
tgagaccacc gctcctcaag acctgtacga tggtgtctcc gcgctggtta accagtggga 30720
tcacctgggc atatggtggc tcctcatagg agcagtgacc ctgtgcctaa tcctggtctg 30780
gatcatctgc tgcatcaaaa gcagaagacc caggcggcgg cccatctaca ggcccttcgt 30840
catcacacct gaagataatg atgatgatga caccacctcc aggctgcaga gcctaaagca 30900
gctactcttc tcttttacag catggtaaat tgaatcatgc cccgcatttt catctacttg 30960
cttctccttc cactttttct gggctcctct acattggcca ctgtgtccca catcgaggta 31020
gactgcctca cgcccttcac agtctacctg cttttcggct ttgtcatctg cacctttgtc 31080
tgcagcgtta tcactgtagt gatctgcttc atacagtgca tcgactacat ctgtgtgcgg 31140
gtggcctact ttagacacca cccccagtat cgcaacaggg acatagcggc tctcctaaga 31200
cttgtttaaa tcatggccaa attacctgtg attggtcttc tgattatctg ctgcgtccta 31260
gccgcgattg ggactcaacc taataccacc accagcgctc ccagaaagag acatgtatcc 31320
tgcagcttca agcgtccctg gaatataccc caatgcttta ctgatgaacc tgaaatctct 31380
ttggcttggt acttcagcgt caccgccctt ctcatcttct gcagtacggt tattgctctt 31440
gccatctacc cttcccttaa cctgggctgg aatgctgtca actctatgga atatcccacc 31500
ttcccagaac cagacctgcc agacctggtt gttctaaacg cgtttcctcc tcctccagtt 31560
caaaatcagt ttcgccctcc gtcccctacg cccactgagg tcagctactt taatctaaca 31620
ggcggagatg actgaaaacc tagacctaga aatggacggt ctctgcagcg agcaacgcac 31680
actagagagg cgccggcaaa aagcagagct cgagcgtctt aaacaagagc tccaagacgc 31740
cgtggccata caccagtgca aaaaagggct cttctgtctg gtaaaacagg ccacgctcac 31800
ctatgaaaaa acaggtgaca cccaccgcct aggatacaag ctgcccacac agcgccaaaa 31860
gtttgccctt atgataggtg aacaacccat caccgtcacc cagcactccg tggagacaga 31920
aggctgcatt catgctccct gcaggggcgc tgactgcctc tacaccttga tcaaaaccct 31980
ctgcggtctc agagacctta tccctttcaa ttgatcataa ctgtaatcaa taaaaaatca 32040
cttacttgaa atctgatagc aagactctgt ccaatttttt cagcaacact tccttcccct 32100
cctcccaact ctggtactct aggcgcctcc tagctgcaaa cttcctccac agtctgaagg 32160
gaatgtcaga ttcctcctcc tgtccctccg cacccacgat cttcatgttg ttacagatga 32220
aacgcgcgag atcgtctgac gagaccttca accccgtgta cccctacgat accgagatcg 32280
ctccgacttc tgtccctttc cttacccctc cctttgtatc atccgcagga atgcaagaaa 32340
atccagctgg ggtgctgtcc ctgcacctgt cagagcccct taccacccac aatggggccc 32400
tgactctaaa aatggggggc ggcctgaccc tggacaagga agggaatctc acttcccaaa 32460
acatcaccag tgtcgatccc cctctcaaaa aaagcaagaa caacatcagc cttcagaccg 32520
ccgcacccct cgccgtcagc tccggggccc taaccctttt tgccactccc cccctagcgg 32580
tcagtggcga caaccttact gtgcagtctc aggcccctct tactttggaa gactcaaaac 32640
taactctggc caccaaagga cccctaactg tgtccgaagg caaacttgtc ctagaaacag 32700
agcctcccct gcatgcaagt gacagcagta gcctgggcct tagcgtcacg gccccactta 32760
gcattaacaa tgacagccta ggactagaca tgcaagcgcc catcagctct cgagatggaa 32820
aactggctct aacagtggcg gcccccctaa ctgtggccga gggtatcaat gctttggcag 32880
tagccacagg taatggtatt ggactaaatg aaaccaacac acacctgcag gcaaaactgg 32940
tcgcgcccct aggctttgat accaacggca acattaagct aagcgtcgca ggaggcatga 33000
ggctaaacaa taacacactg atactagatg taaactaccc atttgaggct caaggccaac 33060
tgagcctaag agtgggctcg ggcccactat atgtagattc tagtagtcat aacctaacca 33120
ttagatgcct taggggattg tatgtaacat cttctaacaa ccaaaacggt ctagaggcca 33180
acattaaact aacaaaaggc cttgtgtatg acggaaatgc catagcagtt aatgttggca 33240
aagggctgga atacagccct actggcacaa cagaaaaacc tatacagact aaaataggtc 33300
taggcatgga gtatgacact gagggagcca tgatgacaaa actaggctct ggactaagct 33360
ttgacaattc aggagccatt gtggtgggaa acaaaaatga tgacaggctt actttgtgga 33420
ccacaccgga cccatcgccc aactgtcaga tttactctga aaaagatgct aaactaacct 33480
tggtactgac taaatgtggc agtcaggttg taggcacagt atctattgcc gctcttaaag 33540
gtagccttgt gccaatcact agtgcaatca gtgtggttca gatataccta aggtttgatg 33600
aaaatggggt gctgatgagt aactcttcac ttaatggcga atactggaat tttagaaacg 33660
gagactcaac taatggcaca ccatatacaa acgcagtggg ttttatgcct aatctactgg 33720
cctatcctaa aggtcaaact acaactgcaa aaagtaacat tgtcagccag gtctacatga 33780
acggggacga tactaaaccc atgacattta caatcaactt caatggcctt agtgaaacag 33840
gggatacccc tgtcagtaaa tattccatga cattctcatg gaggtggcca aatggaagct 33900
acatagggca caattttgta acaaactcct ttactttctc ctacatcgcc caagaataaa 33960
gaaagcacag agatgcttgt ttttgatttc aaaattgtgt gcttttattt attttcaagc 34020
ttacagtatt tccagtagtc attagaatag agcttaatta aactgcatga gaacccttcc 34080
acatagctta aattatcacc agtgcaaatg gaaaaaaatc aacatacctt tttatccaga 34140
tatcaaagaa ctctagtggt cagttttccc ccaccctccc agctcacaga atacacagtc 34200
ctttcccccc ggctggcttt aaacaacact atctcattgg taacagacat atttttaggt 34260
gtaataatcc acacggtctc ttggcgggcc aaacgctggt ctgtgatgtt aataaactcc 34320
ccaggcagct ctttcaagtt cacgtcgctg tccaactgct gaagcgctcg cggctccgac 34380
tgcgcctcta gcggaggcaa cggcagcacc cgatccttga tctataaagg agtagagtca 34440
taatccccca taagaatagg gcggtgatgc agcaacaagg cgcgcagcaa ctcctgccgc 34500
cgcctctccg tacgacagga atgcaacggg gtggtggtct cctccgcgat aatccgcacc 34560
gctcgcagca tcagcatcct cgtcctccgg gcacagcagc gcatcctgat ctcactgaga 34620
tcggcgcagt aagtgcagca caacaccaag atgttattta agatcccaca gtgcaaagca 34680
ctgtacccaa agctcatggc gggaaggaca gcccccacgt gaccatcgta ccagatcctc 34740
aggtaaatca aatgacgacc tctcataaac acgctggaca tatacatcac ctccttgggc 34800
atgagctgat tcaccacctc tcgataccac aggcatcgct gattaattaa agacccctcg 34860
agcaccatcc tgaaccagga agccagcacc tgaccccccg ccaggcactg cagggacccc 34920
ggtgaatcgc agtggcagtg aagactccag cgctcgtagc cgtgaaccat agagctggtc 34980
attatatcca cattggcaca acacagacac actttcatac actttttcat gattagcagc 35040
tcctctctag tcaagaccat atcccaagga atcacccact cttgaatcaa ggtaaatccc 35100
acacagcagg gcaggcctct cacataactc acgttatgca tagtgagcgt gtcgcaatct 35160
ggaaataccg gatgatcttc catcaccgaa gcccgggtct ccgtctcaaa gggaggtaaa 35220
cggtccctcg tgtagggaca gtggcgggat aatcgagatc gtgttgaacg tagagtcatg 35280
ccaaagggaa cagcggacgt actcatattt cctccagcag aaccaagtgc gcgcgtggca 35340
gctatccctg cgtcttctgt ctcgccgcct gccccgctcg gtgtagtagt tgtaatacag 35400
ccactccctc agaccgtcaa ggcgctccct ggcgtccgga tctataacaa caccgtcctg 35460
cagcgccgcc ctgatgacat ccaccaccgt agagtatgcc aagcccagcc acgaaatgca 35520
ctcactttga cagcgagaga taggaggagc gggaagagat ggaagaacca tgatagtaaa 35580
agaactttta ttccaatcga tcctctacaa tgtcaaagtg tagatctatc agatggcact 35640
ggtctcctcc gctgagtcga tcaaaaataa cagctaaacc acaaacaaca cgattggtca 35700
aatgctgcac aagggcttgc agcataaaat cgcctcgaaa gtccaccgca agcataacat 35760
caaagccacc gcccctatca tgatctatga taaaaacccc acagctatcc accagaccca 35820
tatagttttc atctctccat cgtgaaaaaa tatttacaag ctcctccttt aaatcacctc 35880
caaccaattc aaaaagttga gccagaccgc cctccacctt cattttcagc atgcgcatca 35940
tgattgcaaa aattcaggct cctcagacac ctgtataaga ttgagaagcg gaacgttaac 36000
atcaatgttt cgctcgcgaa gatcgcgcct cagtgcaagc atgatataat cccacaggtc 36060
ggagcggatc agcgaggaca tctccccgcc aggaaccaac tcaacggagc ctatgctgat 36120
tataatacgc atattcgggg ctatgctaac cagcacggcc cccaaatagg cgtactgcat 36180
aggcggcgac aaaaagtgaa cagtttgggt taaaaaatca ggcaaacact cgcgcaaaaa 36240
agcaagaaca tcataaccat gctcatgcaa atagatgcaa gtaagctcag gaacgaccac 36300
agaaaaatgc acaatttttc tctcaaacat gactgcgagc cctgcaaaaa ataaaaaaga 36360
aacattacac aagagtagcc tgtcttacaa tgggatagac tactctaacc aacataagac 36420
gggccacgac atcgcccgcg tggccataaa aaaaattatc cgtgtgatta aaaagaagca 36480
cagatagctg gccagtcata tccggagtca tcacgtgcga acccgtgtag acccccgggt 36540
tggacacatc ggccaaacaa agaaagcggc caatgtatcc cggaggaatg ataacactaa 36600
gacgaagata caacagaata accccatggg ggggaataac aaagttagta ggtgaataaa 36660
aacgataaac acccgaaact ccctcctgcg taggcaaaat agcgccctcc ccttccaaaa 36720
caacatacag cgcttccaca gcagccatga caaaagactc aaaacactca aaagactcag 36780
tcttaccagg aaaataaaag cactctcaca gcaccagcac taatcagagt gtgaagaggg 36840
ccaagtgccg aacgagtata tataggaatt aaaaatgacg taaatgtgta aaggtcaaaa 36900
aacgcccaga aaaatacaca gaccaacgcc cgaaacgaaa acccgcgaaa aaatacccag 36960
aagttcctca acaaccgcca cttccgcttt cccacgatac gtcacttcct caaaaatagc 37020
aaactacatt tcccacatgt acaaaaccaa aacccctccc cttgtcaccg cccacaactt 37080
acataatcac aaacgtcaaa gcctacgtca cccgccccgc ctcgccccgc ccacctcatt 37140
atcatattgg cctcaatcca aaataaggta tattattgat gatg 37184
<210> 2
<211> 37184
<212> DNA
<213> Great Ape Adenovirus
<400> 2
catcatcaat aatatacctt attttggatt gaggccaata tgataatgag gtgggcgggg 60
cgaggcgggg cgggtgacgt aggacgcgcg agtagggttg ggaggtgtgg cggaagtgtg 120
gcatttgcaa gtgggaggag ctgacatgca atcttccgtc gcggaaaatg tgacgttttt 180
gatgagcgcc gcctacctcc ggaagtgcca attttcgcgc gcttttcacc ggatatcgta 240
gtaattttgg gcgggaccat gtaagatttg gccattttcg cgcgaaaagt gaaacgggga 300
agtgaaaact gaataatagg gcgttagtca tagcgcgtaa tatttaccga gggccgaggg 360
actttgaccg attacgtgga ggactcgccc aggtgttttt tacgtgaatt tccgcgttcc 420
gggtcaaagt ctccgttttt attgtcgccg tcatctgacg cggagggtat ttaaacccgc 480
tgcgctccta aagaggccac tcttgagtgc cagcgagaag agttttctcc tccgctccgt 540
ttcggcgatc gaaaaatgag acatttagcc tgcactccgg gtcttttgtc cggccgggcg 600
gcgtccgagc ttttggacgc tttgctcaat gaggttctga gcgatgattt tccgtctact 660
acccacttta gcccacctac tcttcacgaa ctgtacgatc tggatgtact ggtggatgtg 720
aacgatccca acgaggaggc ggtttctacg ttttttcccg agtctgcgct tttggctgcc 780
caggagggat ttgacctaca cactccgccg ctgcctattt tagagtctcc gctgccggag 840
cccagtggta taccttatat gcctgaactg cttcccgaag tggtagacct gacctgccac 900
gagccgggct ttccgcccag cgacgatgag ggtgagcctt ttgctttaga ctatgctgag 960
atacctgggc tcggttgcag gtcttgtgca tatcatcaga gggttaccgg agaccccgag 1020
gttaagtgtt cgctgtgcta tatgaggctg acctcttcct ttatctacag taagtttttt 1080
tgtgtaggtg ggctttttgg gtaggtgggt tttgtggcag gacaggtgta aatgttgctt 1140
gtgttttttg tacctgcagg tccggtgtcc gagccagacc cggagcccga ccgcgatccc 1200
gagccggatc ccgagcctcc tcgcaggcca aggaaattac cttccatttt gtgcaagcct 1260
aagacacctg tgaggaccag cgaggcggac agcactgact ctggcacttc tacctctcct 1320
cctgaaattc acccagtggt tcctctgggt atacatagac ctgttgctgt tagagtttgc 1380
gggcgacgcc ctgcagtaga gtgcattgag gacttgctta acgatcccga gggacctttg 1440
gacttgagca ttaaacgccc taggcaataa accccaccta agtaataaac cccacctaag 1500
taataaactt taccgccctt ggttattgag atgacgccca atgtttgctt ttgaatgact 1560
tcatgtgtat aataaaagtg agtgtggtca taggtctctt gtttgtctgg gcggggttta 1620
agggtatata agtttctcgg ggctaaactt ggttacactt gaccccaatg gaggcgtggg 1680
ggtgcttgga ggagtttgcg gacgtgcgcc gtttgctgga cgagagctct agcaatacct 1740
atagtatttg gaggtatctg tggggctcta ctcaggccaa gttggtcttc agaattaagc 1800
aggattacaa gtgcgatttt gaagagcttt ttagttcctg tggtgagctt ttgcaatcct 1860
tgaatctggg ccaccaggct atcttccagg aaaaggttct ctcgactttg gatttttcca 1920
ctcccgggcg caccgccgct tgtgtggctt ttgtgtcttt tgtgcaagat aaatggagcg 1980
gggagaccca cctgagtcac ggctacgtgc tggatttcat ggcgatggct ctttggaggg 2040
cttacaacaa atggaagatt cagaaggaac tgtacggttc cgccctacgt cgtccacttc 2100
tgcagcggca ggggctgatg tttcccgacc atcgccagca tcagaatctg gaagacgagc 2160
gagcggagaa gatcagcttg agagccggcc tggaccctcc tcaggaggaa tgaatctccc 2220
gcaggtggtt gagctgtttc ccgaactgag acgggtcctg actatcaggg aggatggtca 2280
gtttgtgaag aagctgaaga gggatcgggg tgagggagat gatgaggcgg ctagcaattt 2340
agcttttagt ctgataactc gccaccgacc ggaatgtatt acctatcagc agattaagga 2400
gagttgtgcc aacgagctgg atcttttggg tcagaagtat agcatagaac agcttaccac 2460
ttactggctt cagcccgggg atgattggga agaggcgatt agggtgtatg caaaggtggc 2520
cctgcggccc gattgcaagt ataagattac taagttggtt aatattagaa actgctgcta 2580
tatttctgga aacggggccg aagtggagat agatactgag gacagggtgg ctattaggtg 2640
ttgcatgata aacatgtggc ccgggatact ggggatggat ggggtgatat ttatgaatgt 2700
gaggttcacg ggccccaact ttaatggtac ggtgttcatg ggcaacacca acttgctcct 2760
gcatggtgcg agtttctatg ggtttaacaa cacctgtata gaggcctgga ccgatgtaaa 2820
ggttcgaggt tgttcctttt atagctgttg gaaggcggtg gtgtgtcgcc ctaaaagcag 2880
gggttctgtg aagaaatgct tgtttgaaag gtgcacccta ggtatccttt ctgagggcaa 2940
ctccagggtg cgccataatg tggcttcgaa ctgcggttgc ttcatgcaag tgaagggggt 3000
gagcgttatc aagcataact cggtctgtgg aaactgcgag gatcgcgcct ctcagatgct 3060
gacctgcttt gatggcaact gtcacctgtt gaagaccatt catataagca gtcaccccag 3120
aaaggcctgg cccgtgtttg agcataacat tctgacccgc tgttccttgc atctgggggt 3180
caggaggggt atgttcctgc cttaccagtg taactttagc cacactaaaa tcctgctgga 3240
acccgagtgc atgactaagg tcagcctgaa tggtgtgttt gatgtgagtc tgaagatttg 3300
gaaggtgctg aggtatgatg agaccaggac caggtgccga ccctgcgagt gcggcggcaa 3360
gcacatgaga aatcagcctg tgatgttgga tgtgaccgag gagcttaggc ctgaccatct 3420
ggtgctggcc tgcaccaggg ccgagtttgg gtctagcgat gaggataccg attgaggtgg 3480
gtaaggtggg cgtggctagc agggtgggcg tgtataaatt gggggtctaa ggggtctctc 3540
tgtttgtctt gcaacagccg ccgccatgag cgacaccggc aacagctttg atggaagcat 3600
ctttagtccc tatctgacag tgcgcatgcc tcactgggcc ggagtgcgtc agaatgtgat 3660
gggttccaac gtggatggac gtcccgttct gccttcaaat tcgtctacta tggcctacgc 3720
gaccgtggga ggaactccgc tggacgccgc gacctccgcc gccgcctccg ccgccgccgc 3780
gaccgcgcgc agcatggcta cggaccttta cagctctttg gtggcgagca gcgcggcctc 3840
tcgcgcgtct gctcgggatg agaaactgac tgctctgctg cttaaactgg aagacttgac 3900
ccgggagctg ggtcaactga cccagcaggt ttccagcttg cgtgagagca gccttgcctc 3960
cccctaatgg cccataatat aaataaaagc cagtctgttt ggattaagca agtgtatgtt 4020
ctttatttaa ctctccgcgc gcggtaagcc cgggaccagc ggtctcggtc gtttagggtg 4080
cggtggattt tttccaacac gtggtacagg tggctctgga tgtttagata catgggcatg 4140
agtccatccc tggggtggag gtagcaccac tgcagagctt cgtgctcggg ggtggtgttg 4200
tatatgatcc agtcgtagca ggagcgctgg gcgtggtgct gaaaaatgtc cttaagcaag 4260
aggcttatag ctagggggag gcccttggtg taagtgttta caaatctgct tagctgggag 4320
gggtgcatcc ggggggatat gatgtgcatc ttggactgga tttttaggtt ggctatgttc 4380
ccgcccagat cccttctggg attcatgttg tgcaggacca ccagcacggt atatccagtg 4440
cacttgggaa atttatcgtg gagcttagac gggaatgcat ggaagaactt ggagacgccc 4500
ttgtggcctc ccagattttc catacattcg tccatgatga tggcaatggg cccgtgggaa 4560
gctgcctgag caaaaacgtt tctggcatcg ctcacatcgt agttatgttc cagggtgagg 4620
tcatcatagg acatctttac gaatcggggg cgaagggtcc cggactgggg gatgatggta 4680
ccctcgggcc ccggggcgta gttcccctca cagatctgca tctcccaggc tttcatttca 4740
gagggaggga tcatatccac ctgcggggcg atgaaaaaga cagtttctgg cgcaggggag 4800
attaactggg atgagagcag gtttctgagc agctgtgact ttccacagcc ggtgggccca 4860
tatatcacgc ctatcaccgg ctgcagctgg tagttaagag agctgcagct gccgtcctcc 4920
cggagcaggg gggccacctc gttgagcata tccctgacgt ggatgttctc cctgaccagt 4980
tccgccagaa ggcgctcgcc gcccagcgaa agcagctctt gcaaggaagc aaaatttttc 5040
agcggtttca ggccatcggc cgtgggcatg tttttcagcg tctgggtcag cagctccagc 5100
ctgtcccaga gctcggtgat gtgctctacg gcatctcgat ccagcagatc tcctcgtttc 5160
gcgggttggg gcggctttcg ctgtagggca ccagccgatg ggcgtccagc ggggccagag 5220
tcatgtcctt ccatgggcgc agggtcctcg tcagggtggt ctgggtcacg gtgaaggggt 5280
gcgctccggg ttgggcactg gccagggtgc gcttgaggct ggttctgctg gtgctgaatc 5340
gctgccgctc ttcgccctgc gcgtcggcca ggtagcattt gaccatggtc tcgtagtcga 5400
gaccctcggc ggcgtgcccc ttggcgcgga gctttccctt ggaggtggcg ccgcacgagg 5460
ggcactgcag gctcttcagg gcgtagagct tgggagcgag aaacacggac tctggggagt 5520
aggcgtccgc gccgcaggcc gagcagaccg tctcgcattc caccagccaa gtgagttccg 5580
ggcggtcagg gtcaaaaacc aggttgcccc catgcttttt gatgcgtttc ttaccttggc 5640
tctccatgag gcggtgtccc ttctcggtga cgaagaggct gtccgtgtcc ccgtagaccg 5700
acttcagggg cctgtcttcc agcggagtgc ctctgtcctc ctcgtagaga aactctgacc 5760
actctgagac gaaggcccgc gtccaggcca ggacgaagga ggccacgtgg gaggggtagc 5820
ggtcgttgtc cactagcggg tccaccttct ccagggtgtg caggcacatg tccccctcct 5880
ccgcgtccag aaaagtgatt ggcttgtagg tgtaggacac gtgaccgggg gttcccaacg 5940
ggggggtata aaagggggtg ggtgcccttt catcttcact ctcttccgca tcgctgtctg 6000
cgagagccag ctgctggggt aagtattccc tctcgaaggc gggcatgacc tcagcgctca 6060
ggttgtcagt ttctaaaaat gaggaggatt tgatgttcac ctgtccggag gtgatacctt 6120
tgagggtacc tgggtccatc tggtcagaaa acactatttt tttgttatca agcttggtgg 6180
cgaatgaccc gtagagggcg ttggagagca gcttggcgat ggagcgcagg gtctggtttt 6240
tgtcgcggtc ggctcgctcc ttggccgcga tgttgagttg cacgtactcg cgggccacgc 6300
acttccactc ggggaacacg gtggtgcgct cgtctgggat caggcgcacc ctccagccgc 6360
ggttgtgcag ggtgaccatg tcgacgctgg tggcgacctc accgcgcaga cgctcgttgg 6420
tccagcagag gcggccgccc ttgcgcgagc agaagggggg tagggggtcc agctggtcct 6480
cgtttggggg gtccgcgtcg atggtaaaga ccccggggag caggcgcggg tcaaagtagt 6540
cgatcttgca agcttgcatg tccagagccc gctgccattc gcgggcggcg agcgcgcgct 6600
cgtaggggtt gaggggcggg ccccagggca tggggtgggt gagcgcggag gcgtacatgc 6660
cgcagatgtc atacacgtac aggggttccc tgaggatacc gaggtaggtg gggtagcagc 6720
gccccccgcg gatgctggcg cgcacgtagt catagagctc gtgggagggg gccagcatgt 6780
tgggcccgag gttggtgcgc tgggggcgct cggcgcggaa gacgatctgc ctgaagatgg 6840
cgtgggagtt ggaggagatg gtgggccgct ggaagacgtt gaagcttgct tcttgcaagc 6900
ccacggagtc cctgacgaag gaggcgtagg actcgcgcag cttgtgcacc agctcggcgg 6960
tgacctggac gtcgagcgca cagtagtcga gggtctcgcg gatgatgtca tacctatcct 7020
cccccttctt tttccacagc tcgcggttga ggacgaactc ttcgcggtct ttccagtact 7080
cttggagggg aaacccgtcc gtgtccgaac ggtaagagcc tagcatgtag aactggttga 7140
cggcctggta ggggcagcag cccttctcca cgggcagcgc gtaggcctgc gccgccttgc 7200
ggagggaggt gtgggtgagg gcgaaagtgt ccctgaccat gactttgagg tattgatgtc 7260
tgaagtctgt gtcatcgcag ccgccctgtt cccacagggt gtagtccgtg cgctttttgg 7320
agcgcgggtt gggcagggag aaggtgaggt cattgaagag gatcttcccc gctcgaggca 7380
tgaagtttct ggtgatgcga aagggccctg ggaccgagga gcggttgttg atgacctggg 7440
cggccaggac gatctcgtca aagccgttta tgttgtgtcc cacgatgtag agctccagga 7500
agcggggctg gcccttgatg gaggggagct ttttaagttc ctcgtaggta agctcctcgg 7560
gcgattccag gccgtgctcc tccagggccc agtcttgcaa gtgagggttg gccgccagga 7620
aggatcgcca gaggtcgcgg gccatgaggg tctgcaggcg gtcgcggaag gttctgaact 7680
gccgccccac ggccattttt tcgggggtga tgcagtagaa ggtgaggggg tctttctccc 7740
aggggtccca tctgagctct cgggcgaggt cgcgcgcggc agcgaccaga gcctcgtcgc 7800
cccccagttt catgaccagc atgaagggca cgagttgctt gccaaaggct cccatccaag 7860
tgtaggtttc tacatcgtag gtgacaaaga ggcgctccgt gcgaggatga gagccgattg 7920
ggaagaactg gatctcccgc caccagttgg aggattggct gttgatgtgg tgaaagtaga 7980
agtcccgtct gcgggccgag cactcgtgct ggcttttgta aaagcgaccg cagtactggc 8040
agcgctgcac gggttgtata tcttgcacga ggtgaacctg gcgacctctg acgaggaagc 8100
gcagcgggaa tctaagtccc ccgcctgggg tcccgtgtgg ctggtggtct tttactttgg 8160
ttgtctggcc gccagcatct gtctcctgga gggcgatggt ggaacagacc accacgccgc 8220
gagagccgca ggtccagatc tcggcgctcg gcgggcggag tttgatgacg acatcgcgca 8280
cattggagct gtccatggtc tccagctccc gcggcggcag gtcagccggg agttcctgga 8340
ggttcacctc gcagagacgg gtcaaggcgc ggacagtgtt gagatggtat ctgatttcaa 8400
ggggcatgtt ggaggcggag tcgatggctt gcaggaggcc gcagccccgg ggggccacga 8460
tggttccccg cggggcgcga ggggaggcgg aagctggggg tgtgttcaga agcggtgacg 8520
cgggcgggcc cccggaggta gggggggttc cggccccaca ggcatgggcg gcaggggcac 8580
gtcttcgccg cgcgcgggca ggggctggtg ctggctccga agagcgcttg cgtgcgcgac 8640
gacgcgacgg ttggtgtcct gtatctggcg cctctgagtg aagaccacgg gtcccgtgac 8700
cttgaacctg aaagagagtt cgacagaatc aatctcggca tcgttgacag cggcctggcg 8760
caggatctcc tgcacgtcgc ccgagttgtc ctggtaggcg atttctgcca tgaactgctc 8820
gatctcttcc tcctggagat ctcctcgtcc ggcgcgctcc acggtggccg ccaggtcgtt 8880
ggagatgcga cccatgagct gcgagaaggc gttgagtccg ccctcgttcc agacccggct 8940
gtagaccacg cccccctcgg cgtcgcgggc gcgcatgacc acctgggcca ggttgagctc 9000
cacgtgtcgc gtgaagacgg cgtagttgcg caggcgctgg aaaaggtagt tcagggtggt 9060
ggcggtgtgc tcggcgacga agaagtacat gacccagcgc cgcaacgtgg attcattgat 9120
gtcccccaag gcctccaggc gctccatggc ctcgtagaag tccacggcga agttgaaaaa 9180
ctgggagttg cgagcggaca cggtcaactc ctcctccaga agacggatga gctcggcgac 9240
agtgtcgcgc acctcgcgct cgaaggccac ggggggcgct tcttcctctt ccacctcttc 9300
ttccatgatt gcttcttctt cttcctcagc cgggacggga gggggcggcg gcgggggagg 9360
ggcgcggcgg cggcggcggc gcaccgggag gcggtcgatg aagcgctcga tcatctcccc 9420
ccgcatgcgg cgcatggtct cggtgacggc gcggccgttc tcccgggggc gcagctcgaa 9480
gacgccgcct ctcatttcgc cgcggggcgg gcggccgtga ggtagcgaga cggcgctgac 9540
tatgcatctt aacaattgct gtgtaggtac gccgccaagg gacctgattg agtccagatc 9600
caccggatcc gaaaaccttt ggaggaaagc gtctatccag tcgcagtcgc aaggtaggct 9660
gagcaccgtg gcgggcgggg gcgggtcggg agagttcctg gcggagatgc tgctgatgat 9720
gtaattaaag taggcggtct tgagaaggcg gatggtggac aggagcacca tgtctttggg 9780
tccggcctgt tggatgcgga ggcggtcggc catgccccag gcctcgttct gacaccggcg 9840
caggtctttg tagtaatctt gcatgagtct ttccaccggc acttcttctc cttcctcttc 9900
ttcatctcgc cggtggtttc tcgcgccgcc catgcgcgtg accccaaagc ccctgagcgg 9960
ctgcagcagg gccaggtcgg cgaccacgcg ctcggccaag atggcctgct gtacctgagt 10020
gagggtcctc tcgaagtcat ccatgtccac gaagcggtgg taggcacccg tgttgatggt 10080
gtaggtgcag ttggccatga cggaccagtt gacggtctgg tgtcccggct gcgagagctc 10140
cgtgtaccgc aggcgcgaga aggcgcggga atcgaacacg tagtcgttgc aagtccgcac 10200
cagatactgg tagcccacca ggaagtgcgg cggaggttgg cgatagaggg gccagcgctg 10260
ggtggcgggg gcgccgggcg ccaggtcttc cagcatgagg cggtggtatc cgtagatgta 10320
cctggacatc caggtgatgc ctgcggcggt ggtggtggcg cgcgcgtagt cgcggacccg 10380
gttccagatg tttcgcaggg gcgagaagtg ttccatggtc ggcacgctct ggccggtgag 10440
gcgcgcgcag tcgttgacgc tctatacaca cacaaaaacg aaagcgttta cagggctttc 10500
gttctgtagc ctggaggaaa gtaaatgggt tgggttgcgg tgtgccccgg ttcgagacca 10560
agctgagctc agccggctga agccgcagct aacgtggtat tggcagtccc gtctcgaccc 10620
aggccctgta tcctccagga tacggtcgag agcccttttg ctttcttggc caagcgcccg 10680
tggcgcgatc tgggatagat ggtcgcgatg agaggacaaa agcggctcgc ttccgtagtc 10740
tggagaaaca atcgccaggg ttgcgttgcg gcgtaccccg gttcgagccc ctatggcggc 10800
ttggatcggc cggaaccgcg gctaacgtgg gctgtggcag ccccgtcctc aggaccccgc 10860
cagccgactt ctccagttac gggagcgagc cccttttgtt tttttatttt ttagatgcat 10920
cccgtgctgc ggcagatgcg cccctcgccc cggcccgatc agcagcagca acagcaggca 10980
tgcagacccc cctctcctct ccccgccccg gtcaccacgg ccgcggcggc cgtgtccggt 11040
gcggggggcg cgctggagtc agatgagcca ccgcggcggc gacctaggca gtatctggac 11100
ttggaagagg gcgagggact ggcgcggctg ggggcgagct ctccagagcg ccacccgcgg 11160
gtgcagttga aaagggacgc gcgtgaggcg tacctgccgc ggcaaaacct gtttcgcgac 11220
cgcgggggcg aggagcccga ggagatgcgg gactgcaggt tccaagcggg gcgcgagctg 11280
cgccgcggct tggacagaca gcgcctgctg cgcgaggagg actttgagcc cgacacgcag 11340
acgggcatca gccccgcgcg cgcgcacgtg gccgcggccg acctggtgac cgcctacgag 11400
cagacggtga accaggagcg caacttccaa aaaagcttca acaaccacgt gcgcacgctg 11460
gtggcgcgcg aggaggtgac cctgggtctc atgcatctgt gggacctggt ggaggcgatc 11520
gtgcagaacc ccagcagcaa gcccctgacc gcgcagctgt tcctggtggt gcagcacagc 11580
agggacaacg aggccttcag ggaggcgctg ctgaacatca ccgagccgga ggggcgctgg 11640
ctcctggacc tgataaacat cctgcagagc atagtggtgc aggagcgcag cctgagcctg 11700
gccgagaagg tggcggccat taactattct atgctgagcc tgggcaagtt ctacgctcgc 11760
aagatctaca agacccccta cgtgcccata gacaaggagg tgaagataga cagcttctac 11820
atgcgcatgg cgctgaaggt gctaaccctg agcgacgacc tgggagtgta ccgcaacgag 11880
cgcatccaca aggccgtgag cgccagccgg cggcgcgagc tgagcgaccg cgaactgatg 11940
cacagtctgc agcgcgcgct gaccggcgcg ggcgagggcg acagggaggt cgagtcctac 12000
tttgacatgg gggccgacct gcactggcag ccgagccgcc gcgccctgga agcggcgggg 12060
gcgtacggcg gccccctggc ggccgatgac gaggaagagg aggactatga gctagaggag 12120
ggcgagtacc tggaggactg acctggctgg tggtgttttg gtatagatgc aagatccgaa 12180
cgtggcggac ccggcggtcc gggcggcgct gcagagccag ccgtccggca ttaactcctc 12240
tgacgactgg gccgcggcca tgggtcgcat catggccctg accgcgcgca accccgaggc 12300
cttcaggcag cagcctcagg ctaaccggct ggcggccatc ttggaagcgg tagtgcccgc 12360
gcgctccaac cccacccacg agaaggtgct ggccatagtc aacgcgctgg cggagagcag 12420
ggccatccgg gcagacgagg ccggactggt gtacgatgcg ctgctgcagc gggtggcgcg 12480
gtacaacagc ggcaacgtgc agaccaacct ggaccgcctg gtgacggacg tgcgcgaggc 12540
cgtggcgcag cgcgagcgct tgcatcagga cggcaacctg ggctcgctgg tggcgctaaa 12600
cgccttcctt agcacccagc cggccaacgt accgcggggg caggaggact acaccaactt 12660
cttgagcgcg ctgcggctga tggtgaccga ggtccctcag agcgaggtgt accagtcggg 12720
gcccgactac ttcttccaga ccagcagaca gggcttgcaa accgtgaacc tgagccaggc 12780
tttcaagaac ctgcgggggc tgtggggagt gaaggcgccc accggcgacc gggctacggt 12840
gtccagcctg ctaaccccca actcgcgcct gctgctgctg ctgatcgcgc ccttcacgga 12900
cagcgggagc gtctcgcggg agacctatct gggccacctg ctgacgctgt accgcgaggc 12960
catcgggcag gcgcaggtgg acgagcacac cttccaggag atcaccagcg tgagccacgc 13020
gctggggcag gaggacacgg gcagcctgca ggcgaccctg aactacctgc tgaccaacag 13080
gcggcagaag attcccacgc tgcacagcct gacccaggag gaggagcgca tcttgcgcta 13140
cgtgcagcag agcgtgagcc tgaacctgat gcgcgacggc gtgacgccca gcgtggcgct 13200
ggacatgacc gcgcgcaaca tggaaccggg catgtacgct tcccagcggc cgttcatcaa 13260
ccgcctgatg gactacttgc atcgggcggc ggccgtgaac cccgagtact tcaccaatgc 13320
cattctgaat ccccactgga tgccccctcc gggtttctac aacggggact tcgaggtgcc 13380
tgaggtcaac gatgggttcc tctgggatga catggatgac agtgtgttct cccccaaccc 13440
gctgcgcgcc gcgtctctgc gattgaagga gggctctgac agggaaggac caaggagtct 13500
ggcctcctcc ctggctctgg gggcggtggg cgccacgggc gcggcggcgc ggggcagcag 13560
ccccttcccc agcctggcgg actctctgaa tagcgggcgg gtgagcaggc cccgcttgct 13620
aggcgaggag gagtatctga acaactccct gctgcagccc gtgagggaca aaaacgctca 13680
gcggcagcag tttcccaaca atgggataga gagcctggtg gacaagatgt ccagatggaa 13740
gacgtatgcg caggagtaca aggagtggga ggaccgccag ccgcggcccc tgccgccccc 13800
tagacagcgc tggcagcggc gcgcgtccaa ccgccgctgg aggcaggggc ccgaggacga 13860
tgatgactct gcagatgaca gcagcgtgtt ggacctgggc gggagcggga accccttttc 13920
gcacctgcgc ccacgcctgg gcaagatgtt ttaaaagaga aaaataaaaa ctcaccaagg 13980
ccatggcgac gagcgttggt tttttgttcc cttccttagt atgcggcgcg cggcgatgtt 14040
cgaggagggg cctcccccct cttacgagag cgcgatggga atttctcctg cggcgcccct 14100
gcagcctccc tacgtgcctc ctcggtacct gcaacctaca ggggggagaa atagcatctg 14160
ttactctgag ctgcagcccc tgtacgatac caccagactg tacctggtgg acaacaagtc 14220
cgcggacgtg gcctccctga actaccagaa cgaccacagc gattttttga ccacggtgat 14280
ccaaaacaac gacttcaccc caaccgaggc cagtacccag accataaacc tggacaacag 14340
gtcgaactgg ggcggcgacc tgaagactat cctgcacacc aatatgccca acgtgaacga 14400
gttcatgttc accaactctt ttaaggcgcg ggtgatggtg gcgcgcgagc agggggaggc 14460
gaagtacgag tgggtggact tcacgctgcc cgagggcaac tactcagaga ccatgactct 14520
cgacctgatg aacaatgcga tcgtggaaca ctatctgaaa gtgggcaggc agaacggggt 14580
gaaggagagc gatatcgggg tcaagtttga caccagaaac ttccgtctgg gctgggaccc 14640
tgtgaccggg ctggtcatgc cgggggtcta caccaacgag gcctttcatc ccgatatagt 14700
gctcctgccc ggctgtgggg tggacttcac ccagagccgg ctgagcaacc tgctgggcgt 14760
tcgcaagcgg caacctttcc aggagggttt caagatcacc tatgaggatc tggagggggg 14820
caacattccc gcgctccttg atctggacgc ctacgaggag agcttgaaac ccgaggagag 14880
cgctggcgac agcggcgaga gtggcgagga gcaagccggc ggcggcggca gcgcgtcggt 14940
agaaaacgaa agtactcccg cagtggcggc ggacgctgcg gaggtcgagc cggaggccat 15000
gcagcaggac gcagaggagg gcgcgcagga ggacatgaac aatggggaga tcaggggcga 15060
cactttcgcc acccggggcg aagaaaaaga ggcagaggcg gcggcggcga cggcggaagc 15120
cgaaaccgag gcagaggcag agcccgagac cgaagttatg gaagacatga atgatggaga 15180
acgtaggggt gacacgtttg ccacccgggg cgaagagaag gcggcggagg cagaagccgc 15240
ggctgaggag gcggctgcgg ctgcggccaa ggctgaggct gcggctgagg ctaaggtcga 15300
agccgatgtt gcggttgagg ctcaggctga ggaggaggcg gcggctgaag cagttaagga 15360
aaaggcccag gcagagcagg aagagaaaaa acctgtcatt caacctctaa aagaagatag 15420
caaaaagcgc agttacaacg tcattgaggg cagcaccttt acccaatacc gcagctggta 15480
cctggcttac aactacggcg acccggtcaa gggggtgcgc tcgtggaccc tgctctgcac 15540
gccggacgtc acctgcggct ccgagcagat gtactggtcg ctgccaaaca tgatgcaaga 15600
cccggtgacc ttccgttcca cgcggcaggt tagcaacttt ccggtggtgg gcgccgaact 15660
gctgccagta cactccaaga gtttttacaa cgagcaggcc gtctactccc agctgatccg 15720
ccaggccacc tctctgaccc acgtgttcaa tcgctttccc gagaaccaga ttttggcgcg 15780
cccgccggcc cccaccatca ccaccgtcag tgaaaacgtt cctgccctca cagatcacgg 15840
gacgctaccg ctgcgcaaca gcatctcagg agtccagcga gtgaccatta ctgacgccag 15900
acgccggacc tgcccctacg tttacaaggc cttgggcata gtctcgccgc gcgtcctctc 15960
cagtcgcact ttttaaaaca catccaccca cacgctccaa aatcatgtcc gtactcatct 16020
cgcccagcaa caacaccggc tgggggctgc gcgcacccag caagatgttt ggaggggcaa 16080
ggaagcgctc cgaccagcac cccgtgcgcg tgcgcggcca ctaccgcgcg ccctggggtg 16140
cgcacaagcg cgggcgcaca gggcgcacca ctgtggatga tgtcattgac tccgtagtgg 16200
agcaggcgcg ccactacaca cccggcgcgc cgaccgcctc cgccgtgtcc accgtggacc 16260
aggcgatcga aagcgtggta cagggggcgc ggcactatgc caaccttaaa agtcgccgcc 16320
gccgcgtggc gcgccgccat cgccggagac cccgggctac tgccgccgcg cgccttacca 16380
aggctctgct caagcgcgcc aggcgaactg gccaccgggc cgccatgagg gccgcacggc 16440
gggctgccgc tgccgcgagc gccgtggccc cgcgggcacg aaggcgcgcg gccgctgccg 16500
ccgccgccgc catttccagc ttggcctcga cgcggcgcgg taacatatac tgggtgcgcg 16560
actcggtgag cggcacacgt gtgcccgtgc gctttcgccc cccacggaat tagcacaaga 16620
caacatacac actgagtctc ctgctgttgt gtatcccagc ggcgaccgtc agcagcggcg 16680
acatgtccaa gcgcaaaatt aaagaagaga tgctccaggt catcgcgccg gagatctatg 16740
ggcccccgaa gaaggaggag gaggattaca agccccgcaa gctaaagcgg gtcaaaaaga 16800
aaaagaaaga tgatgacgtt gacgaggcgg tggagtttgt ccgccgcatg gcgcccaggc 16860
gccctgtgca gtggaagggt cggcgcgtgc agcgagtcct gcgccccggc accgcggtgg 16920
tctttacgcc cggcgagcgt tccacgcgca ctttcaagcg ggtgtacgat gaggtgtacg 16980
gcgacgagga tctgttggag caggccaacc atcgatttgg ggagtttgca tatgggaaac 17040
ggcctcgcga gagtctaaaa gaggacctgc tggcgctacc gctggacgag ggcaatccca 17100
ccccgagtct gaagccggtg accctgcaac aggtgctgcc tttgagcgcg cccagcgagc 17160
agaagcgagg gttaaagcgc gagggcgggg acctggcacc caccgtgcag ttgatggtgc 17220
ccaagcggca gaagctggag gacgtgctgg agaaaatgaa agtagagccc gggatccagc 17280
ccgagatcaa ggtccgccct atcaagcagg tggcgcccgg cgtgggagtc cagaccgtgg 17340
acgttaggat tcccacggag gagatggaaa cccaaaccgc cactccctct tcggcagcaa 17400
gcgccaccac cggcgccgct tcggtagagg tgcagacgga cccctggcta cccgccgcca 17460
ctatcgccgt cgccgccgcc ccccgttcgc gcggacgcaa gagaaattat ccagcggcca 17520
gcgcgcttat gccccagtat gcgctgcatc catccatcgc gcccaccccc ggctaccgcg 17580
ggtactcgta ccgcccgcgc agatcagccg gcactcgcgg ccgccgccgc cgtgcgacca 17640
caaccagccg ccgccgtcgc cgccgccgcc agccagtgct gacccccgtg tctgtaagga 17700
aggtggctcg ctcggggagc acgctggtgg tgcccagagc gcgctaccac cccagcatcg 17760
tttaaagccg gtctctgtat ggttcttgca gatatggccc tcacttgtcg ccttcgcttc 17820
ccggtgccgg gataccgagg aagaactcac cgccgcaggg gcatggcggg cagcggtctc 17880
cgcggcggcc gtcgccatcg ccggcgcgca aagagcaggc gcatgcgcgg cggtgtgttg 17940
cccctgctgg tcccgctact cgccgcggcg atcggcgccg tgcccgggat cgcctccgtg 18000
gccctgcagg cgtcccagaa acattgactc ttgcaacctt gcaagcttgc atttttggag 18060
gaaaaaataa aaaagtctag actctcacgc tcgcttggtc ctgtgactat tttgtagaaa 18120
aaagatggaa gacatcaact ttgcgtcgct ggccccgcgt cacggctcgc gcccgttcat 18180
gggagactgg acagatatcg gcaccagcaa tatgagcggt ggcgccttca gctggggcag 18240
tctgtggagc ggccttaaaa attttggttc caccattaag aactatggca acaaagcgtg 18300
gaacagcagc acgggtcaga tgctgagaga caagttgaaa gagcagaact tccaggagaa 18360
ggtggcgcag ggcctggcct ctggcatcag cggggtggtg gacatagcta accaggccgt 18420
gcagaaaaag ataaacagtc atctggaccc ccgccctcag gtggaggaaa cgcctccagc 18480
catggagacg gtgtctcccg agggcaaagg cgaaaagcgc ccgcggcccg acagggaaga 18540
gaccctggtg tcacacaccg aggagccgcc ctcttacgag gaggcagtca aggccggcct 18600
gcccaccact cgccccatag ctcccatggc caccggtgtg gtgggtcaca ggcaacacac 18660
ccccgcaaca ctagatctgc ccccgccgtc cgagccgact cgccagccaa aggcggtgac 18720
ggtgtccgct ccctccactt ccgccgccaa cagagtgcct ctgcgccgcg ctgcgagcgg 18780
cccccgggcc tcgcgagtca gcggcaactg gcagagcaca ctgaacagca tcgtgggcct 18840
gggagtgagg agtgtgaagc gccgccgttg ctactgaatg agcaagctag ctaacgtgtt 18900
gtatgtgtgt atgcgtccta tgtcgccgcc agaggagctg ttgagccgcc ggcgccgtct 18960
gcactccagc gaatttcaag atggcgaccc catcgatgat gcctcagtgg tcgtacatgc 19020
acatctcggg ccaggacgct tcggagtacc tgagccccgg gctggtgcag ttcgcccgcg 19080
ccacagacac ctacttcaac atgagtaaca agttcaggaa ccccactgtg gcgcccaccc 19140
acgatgtgac cacggaccgg tcgcagcgcc tgacgctgcg gttcatcccc gtggatcggg 19200
aggacaccgc ttactcttac aaggcgcggt tcacgctggc cgtgggcgac aaccgcgtgc 19260
tggacatggc ctccacttac tttgacatcc ggggggtgct ggacaggggt cccactttta 19320
agccctactc gggcactgcc tacaaccccc tggctcctaa gggcgccccc aattcttgtg 19380
agtgggaaca agaggaaaac caggtggtcg ctgcagatga tgaacttgag gatgaggaag 19440
cgcaggcaca agaggaagcc cctgtgaaaa aaattcatgt gtatgctcag gcgcctcttt 19500
ccggcgaaaa gatttccaag gatggcatcc aaataggcac tgaagtcgta ggagatacat 19560
ctaaggacac ttttgccgac aaaacattcc agcccgaacc tcagataggc gaatctcagt 19620
ggaatgaagc tgatgccaca gcagcaggag gtagggtttt gaagaagact actcccatga 19680
ggccttgtta tggatcttat gctaggccta ccaatgccaa cggaggccaa ggaattatgg 19740
ttgccaatga acaaggagtg ttggagtcta aagtagaaat gcagtttttc tctaacacca 19800
caacccttaa tgcgcgggat ggaaccggca atcccgaacc aaaggtggtg ttgtatagtg 19860
aagatgtcca cttggaatct cccgatactc acctgtctta caagcccaaa aaggatgatg 19920
ttaatgccaa aatcatgttg ggccaacagg ctatgcccaa taggcccaat cttattggat 19980
ttagagataa tttcattggg ctcatgtttt acaacagcac cggtaacatg ggagtgctgg 20040
cgggtcaagc ctctcagttg aatgctgtgg tggacttgca ggatagaaac acagaactgt 20100
cgtatcagct tttgcttgat tccattgggg atagaaccag atatttctcc atgtggaacc 20160
aggcagtgga tagttatgac ccagatgtca gaatcattga aaaccacggg actgaggacg 20220
aactgcctaa ctactgtttt cctctgggcg gcattggagt tacagatact tatcaaggga 20280
taaaaaatac taatggcaat ggtcaatgga ccaaagatga tcagtttgcg gaccgcaatg 20340
aaataggggt gggaaacaac tttgccatgg agatcaacat ccaggccaac ctctggagaa 20400
acttcctcta tgcaaacgtg gggctctacc tgccagacaa gctcaagtac aaccccacca 20460
acgtggacat ctctgacaac cccaacacct atgactacat gaacaagcgg gtggtggccc 20520
ctggcctggt ggactgcttt gtcaatgtgg gagccaggtg gtccctggac tacatggaca 20580
acgtcaaccc cttcaaccac caccgcaatg cgggtctgcg ctaccgctcc atgatcctgg 20640
gcaacgggcg ctatgtgccc tttcacatcc aggtacccca gaagttcttt gccatcaaga 20700
acctcctgct cctgcccggc tcctacacct acgagtggaa cttcaggaag gatgtgaaca 20760
tggtcctaca gagctctctg ggcaatgacc ttagggtgga tggggccagc atcaagtttg 20820
acagcatcac cctctatgct acatttttcc ccatggccca caacaccgcc tccacgcttg 20880
aggccatgct gagaaacgac accaacgacc agtcctttaa tgactacctc tctggggcca 20940
acatgctcta cccaatccca gccaaggcca ccaacgtgcc catctccatc ccctctcgca 21000
actgggccgc ctttagaggc tgggccttta cccgccttaa gaccaaggag accccctccc 21060
tgggctcggg ttttgatccc tactttgttt actcgggatc catcccctac ctggatggca 21120
ccttctacct caaccacact ttcaagaaga tatccatcat gtatgactcc tccgtcagct 21180
ggccgggcaa cgaccgcttg ctcaccccca atgagttcga ggtcaagcgc gccgtggacg 21240
gcgagggcta caacgtggcc cagtgcaaca tgaccaagga ctggttcctg gtgcagatgc 21300
tggccaacta caacataggc taccagggct tttacatccc agagagctac aaggacagga 21360
tgtactcctt cttcagaaat ttccaaccca tgagccgaca ggtggtggac gagaccaatt 21420
acaaggacta tcaagccatt ggcatcaccc accagcacaa caactcgggt ttcgtgggct 21480
acctggcgcc caccatgcgc gagggtcagg cctaccccgc caacttcccc taccccttga 21540
taggcaagac cgcggtcgac agcgtcaccc agaaaaagtt cctctgcgac cgcaccctct 21600
ggcgcatccc cttctctagc aacttcatgt ccatgggtgc gctcacggac ctgggccaaa 21660
acctgcttta tgccaactct gcccatgcgc tggacatgac ttttgaggtg gaccccatgg 21720
acgagcccac ccttctctat attgtgtttg aagtgttcga cgtggtcaga gtgcaccagc 21780
cgcaccgcgg tgtcatcgag accgtgtacc tgcgtacgcc cttctcagcc ggcaacgcca 21840
ccacctaagg agacagcgcc gccgccgcct gcatgacggg ttccaccgag caagagctca 21900
gggccattgc cagagacctg ggatgcggac cctatttttt gggcacctat gacaaacgct 21960
tcccgggctt tatctcccga gacaagctcg cctgcgccat tgtcaacacg gccgcgcgcg 22020
agaccggggg cgtgcactgg ctggcctttg gctgggaccc gcgctccaaa acttgctacc 22080
tctttgaccc ctttggcttc tccgatcagc gcctcaggca gatttatgag tttgagtacg 22140
aggggctgct gcgccgcagc gcgctcgcct cctcgcccga ccgctgcatc acccttgaga 22200
agtccaccga aaccgtgcag gggccccact cggccgcctg cggtctcttc tgttgcatgt 22260
ttttgcacgc ctttgtgcac tggcctcaga gtcccatgga ttgcaacccc accatgaact 22320
tgctaaaggg agtgcccaac gccatgctcc agagccccca ggtccagccc accctgcgcc 22380
gcaaccagga acagctttac cgcttcctgg agcgccactc cccctacttc cgcagccaca 22440
gcgcgcgcat ccggggggcc acctcttttt gccacttgca agaaaacatg caagacggaa 22500
aatgatgtac agcatgcttt taataaatgt aaagactgtg cactttaatt atacacgggc 22560
tctttctggt tatttattca acaccgccgt cgccatttag aaatcgaaag ggttctgccg 22620
tgcgtcgccg tgcgccacgg gcagagacac gttgcgatac tggaagcggc tcgcccactt 22680
gaactcgggc accaccatgc ggggcagtgg ttcctcgggg aagttctcgc tccacagggt 22740
gcgggtcagc tgcagcgcgc tcaggaggtc gggagccgag atcttgaagt cgcagttggg 22800
gccggaaccc tgcgcgcgcg agttgcggta cacggggttg cagcactgga acaccagcag 22860
ggccggatta ttcacgctgg ccagcaggct ctcgtcgctg atcatgtcgc tgtccagatc 22920
ctccgcgttg ctcagggcga atggggtcat cttgcagacc tgcctgccca ggaaaggcgg 22980
gagcccaggc ttgccgttgc agtcgcagcg caggggcatt agcaggtgcc cacggcccga 23040
ctgcgcctgc gggtacaacg cgcgcatgaa ggcttcgatc tgcctaaaag ccacctgggt 23100
cttggctccc tccgaaaaga acatcccaca ggacttgctg gagaactggt tcgcgggaca 23160
gctggcatcg tgcaggcagc agcgcgcgtc agtgttggca atctgcacca cgttgcgacc 23220
ccaccggttt ttcactatct tggccttgga agcctgctcc tttagcgcgc gctggccgtt 23280
ctcgctggtc acatccatct ctatcacctg ttccttgttg atcatgtttg tcccgtgcag 23340
acactttagg tcgccctccg tctgggtgca gcggtgctcc cacagcgcgc aaccggtggg 23400
ctcccaattc ttgtgggtca cccccgcgta ggcctgcagg taggcctgca ggaagcgccc 23460
catcatggtc ataaaggtct tctggctcgt aaaggtcagc tgcaggccgc gatgctcttc 23520
gttcagccag gtcttgcaga tggcggccag cgcctcggtc tgctcgggca gcatcttaaa 23580
atttgtcttc aggtcgttat ccacgtggta cttgtccatc atggcacgcg ccgcctccat 23640
gcccttctcc caggcggaca ccatgggcag gcttaggggg tttatcactt ccagcggcga 23700
ggacaccgta ctttcgattt cttcttcctc cccctcttcc cggcgcgcgc ccccgctgtt 23760
gcgcgctctt accgcctgca ccaaggggtc gtcttcaggc aagcgccgca ccgagcgctt 23820
gccgcccttg acctgcttga tcagtaccgg cgggttgctg aagcccacca tggtcagcgc 23880
cgcctgctct tcttcgtctt cgctgtctac cactatttct ggggaggggc ttctccgctc 23940
tgcggcaaag gcggcggatc gcttcttttt tttcttggga gccgccgcga tggagtccgc 24000
cacggcgacc gaggtcgagg gcgtggggct gggggtgcgc ggtaccaggg cctcgtcgcc 24060
ctcggactct tcctctgact ccaggcggcg gcggagtcgc ttctttgggg gcgcgcgcgt 24120
cagcggcggc ggagacgggg acggggacgg ggacgggacg ccctccacag ggggtggtct 24180
tcgcgcagac ccgcggccgc gctcgggggt cttctcgcgc tggtcttggt cccgactggc 24240
cattgtatcc tcctcctcct aggcagagag acataaggag tctatcatgc aagtcgagaa 24300
ggaggagagc ttaaccaccc cctcagagac cgccgatgcg cccgccgtcg ccgtcgcccc 24360
cgctaccgcc gacgcgcccg ccacaccgag cgacaccccc acggaccccc ccgccgacgc 24420
acccctgttc gaggaagcgg ccgtggagca ggacccgggc tttgtctcgg cagaggagga 24480
tttgcaagag gaggagaata aggaggagaa gccctcagtg ccaaaagatc ataaagagca 24540
agacgagcac gacgcagacg cacaccaggg tgaagtcggg cggggggacg gagggcatgg 24600
cggcgccgac tacctagacg aaggaaacga cgtgctcttg aagcacctgc atcgtcagtg 24660
cgccatcgtc tgcgacgctc tgcaggagcg cagcgaggtg cccctcagcg tggcggaggt 24720
cagccgcgcc tacgagctca gcctcttttc cccccgggtg cccccccgcc gccgcgaaaa 24780
cggcacatgc gagcccaacc cgcgcctcaa cttctacccc gcctttgtgg tgcccgaggt 24840
cctggccacc tatcacatct tctttcaaaa ttgcaagatc cccatctcgt gccgcgccaa 24900
ccgtagccgc gccgataaga tgctggccct gcgccagggc gaccacatac ctgatatcgc 24960
cgctttggaa gatgtgccaa agatcttcga gggtctgggg cgcaacgaga agcgggcagc 25020
aaactctctg caacaggaaa acagcgaaaa tgagagtcac actggagcgc tggtggagct 25080
ggagggcgac aacgcccgcc tggcggtgct caagcgcagc atcgaggtca cccactttgc 25140
ctaccccgcg ctcaacctgc cccccaaagt catgaacgcg gtcatggacg ggctgatcat 25200
gcgccgcggc cggcccctcg ctccagatgc aaacttgcat gaggagaccg aggacggtca 25260
gcccgtggtc agcgacgagc agctgacgcg ctggctggag agcgcggacc ccgccgaact 25320
ggaggagcgg cgcaagatga tgatggccgc ggtgctggtc accgtagagc tggagtgtct 25380
gcagcgcttc ttcggtgacc ccgagatgca gagaaaggtc gaggagaccc tacactacac 25440
cttccgccag ggctacgtgc gccaggcttg caagatctcc aacgtggagc tcagcaacct 25500
ggtgtcctac ctgggcatct tgcatgaaaa ccgccttggg cagagcgtgc tacactccac 25560
cctgcgcggg gaggcgcgcc gcgactacgt gcgcgactgc gtttacctct tcctctgcta 25620
cacctggcag acggccatgg gggtctggca gcagtgcctg gaggagcgca acctcaagga 25680
gctggagaag cttctgcagc gcgcgctcaa agacctctgg acgggcttca acgagcgctc 25740
ggtggccgcc gcgctagccg acctcatctt ccccgagcgc ctgctcaaaa ccctccagca 25800
ggggctgccc gacttcacca gccaaagcat gttgcaaaat tttaggaact ttatcctgga 25860
gcgttctggc atcctacccg ccacctgctg cgccctgccc agcgactttg tccccctcgt 25920
gtaccgcgag tgccccccgc cgctgtgggg ccactgctac ctgttccaac tggccaacta 25980
cctgtcctac cacgcggacc tcatggagga ctccagcggc gaggggctca tggagtgcca 26040
ctgccgctgc aacctctgca cgccccaccg ctccctggtc tgcaacaccc aactgctcag 26100
cgagagtcag attatcggta ccttcgagct acagggtccg tcctcctcag acgagaagtc 26160
cgcggctccg gggctaaaac tcactccggg gctgtggact tccgcctacc tgcgcaaatt 26220
tgtacctgaa gactaccacg cccacgaaat caggttttac gaggaccaat cccgcccgcc 26280
caaggcggag ctgaccgcct gcgtcatcac ccagggcgag atcctaggcc aattgcaagc 26340
catccaaaaa gcccgccaag agtttttgct gaagaggggt cggggggtgt atctggaccc 26400
ccagtcgggt gaggagctca acccggttcc cccgctgcca ccgccgcggg accttgcttc 26460
ccaggataag catcgccatg gctcccagaa agaagcagca gcggccgccg ctgccgccgc 26520
cccacatgct ggaggaagag gaggaatact gggacagtca ggcagaggag gtttcggacg 26580
aggaggagcc ggagacggag atggaagagt gggaggagga cagcttagac gaggaggctt 26640
ccgaagccga agaggcaggc gcaacaccgt caccctcggc cgcagccccc tcgcaggcgc 26700
ccccgaagtc cgctcccagc atcagcagca acagcagcgc tataacctcc gctcctccac 26760
cgccgcgacc cacggccgac cgcagaccca accgtagatg ggacaccacc ggaaccgggg 26820
ccggtaagtc ctccgggaga ggcaagcaag cgcagcgcca aggctaccgc tcgtggcgcg 26880
ctcacaagaa cgccatagtc gcttgcttgc aagactgcgg ggggaacatc tccttcgccc 26940
gccgcttcct gctcttccac cacggtgtgg ccttcccccg taacgtcctg cattactacc 27000
gtcatctcta cagcccctac tgcggcggca gtgagccaga ggcggccagc ggcggcggcg 27060
cccgtttcgg tgcctaggaa gacccagggc aagacttcag ccaagaaact cgcggcgacc 27120
gcggcgaacg cggtcgcggg ggccctgcgc ctgacggtga acgaacccct gtcgacccgc 27180
gaactgagga accgaatctt ccccactctc tatgccatct tccagcagag cagagggcag 27240
gatcaggaac tgaaagtaaa aaacaggtct ctgcgctccc tcacccgcag ctgtctgtat 27300
cacaagagcg aagaccagct tcggcgcacg ctggaggacg ctgaggcact cttcagcaaa 27360
tactgcgcgc tcactcttaa ggactagctc cgcgcccttc tcgaatttag gcgggaacgc 27420
ctacgtcatc gcagcgccgc cgtcatgagc aaggacattc ccacgccata catgtggagc 27480
tatcagccgc agatgggact cgcggcgggc gcctcccaag actactccac ccgcatgaac 27540
tggctcagtg ccggcccaca catgatctca caggttaatg acatccgcac ccatcgaaac 27600
caaatattgg tgaagcaggc ggcaattacc accacgcccc gcaataatcc caaccccagg 27660
gagtggcccg cgtccctggt gtatcaggaa attcccggcc ccaccaccgt actacttccg 27720
cgtgattccc aggccgaagt ccaaatgact aactcagggg cacagctcgc gggcggctgt 27780
cgtcacaggg tgcggcctcc tcgccagggt ataactcacc tggagatccg aggcagaggt 27840
attcagctca acgacgagtc ggtgagctcc tcgctcggtc tcagacctga cgggaccttc 27900
cagatagccg gagccggccg atcttccttc acgccccgcc aggcgtacct gactctgcag 27960
agctcgtcct cggcgccgcg ctcgggcggc atcgggactc tccagttcgt gcaggagttt 28020
gtgccctcgg tctacttcaa ccccttctcg ggctctcccg gtcgctaccc ggaccagttt 28080
atcccgaact ttgacgccgc gagggactcg gtggacggct acgactgaat gtcgggtgga 28140
cccggtgcag agcaacttcg cctgaagcac cttgaccact gccgccgccc tcagtgcttt 28200
gcccgctgtc agaccggtga gttccagtac ttttccctgc ccgactcgca cccggacggc 28260
ccggcgcacg gggtgcgctt tttcatcccg agtcaggtcc gctctaccct aatcagggag 28320
ttcaccgccc gtcccctact ggcggagttg gaaaaggggc cttctatcct aaccattgcc 28380
tgcatttgct ctaaccctgg attacaccaa gatctttgct gtcatttgtg tgctgagtat 28440
aataaaggct gagatcagaa tctactcggg ctcctgtcgc catcctgtca acgccaccgt 28500
ccaagcccgg cccgatcagc ccgaggtgaa cctcacctgt ggtctgcacc ggcgcctgag 28560
gaaataccta gcttggtact acaacagcac tccctttgtg gtttacaaca gctttgacca 28620
ggacggggtc tcactgaggg ataacctctc gaacctgagc tactccatca ggaagaacaa 28680
caccctcgag ctacttcctc cttacctgcc cgggacttac cagtgtgtca ccggcccctg 28740
cacccacacc cacctgttga tcgtaaacga ctctcttccg agaacagacc tcaataactc 28800
ctctccgcag ttccccagaa caggaggtga gctcaggaaa ccccgggtaa agaagggtgg 28860
acaagagtta acacttgtgg ggtttctggt atatgtgacg ctggtggtgg ctcttttgat 28920
taaggctttt ccttccatgt ctgaactatc cctcttcttt tatgaacaac tcgactagtg 28980
ctaacgggac cctacccaac gaatcgggat tgaatatcgg taaccaggtt gcagtttcac 29040
ttttgattac cttcatagtc ctcttcctgc tagtgctgtc gcttctgtgc ctgcggatcg 29100
ggggctgctg catccacgtt tatatctggt gctggctgtt tagaaggttc ggagaccacc 29160
gcaggtagaa taatgctgct taccctcttt gtcctggcgc tggctgccag ctgccaagcc 29220
ttttccgagg ctgacttcat agagccccag tgcaatatca cttataaatc tgaacgtgcc 29280
atctgtacta ttctaatcaa atgtgttact caacacgata aggtgactgt taaatacaaa 29340
gatcaattaa aaaaagacgc actttacagc agctggcaac caggagatga tcaaaaatac 29400
aatgtaaccg tcttccaggg caaactctcc aaaacttaca attacaattt cccatttgag 29460
cagatgtgtg actttgtcat gtacatggaa aagcagtaca agctgtggcc tccaactccc 29520
cagggctgtg tggaaaatcc aggctctttc tgtatgatct ctctctgtgt aactgtgctg 29580
gcactaatac tcacgcttct gtatctcaga tttaaatcaa ggcaaagctt cattgatgaa 29640
aagaaaatgc cataatcgct caacgcttga ttgctaacac cgggttttta tccgcagaat 29700
gattggaatc accctactaa tcacctccct ccttgcgatt gcccatgggt tggaacgaat 29760
cgaagtccct gtgggggcca atgttaccct ggtggggcct gtcggcaatg ctacattaat 29820
gtgggaaaaa tatactaaaa atcaatgggt ttcttactgc actaacaaaa acagccacaa 29880
gcccagagcc atctgcgatg ggcaaaatct aaccttgatt gatgttcaat tgctggatgc 29940
gggctactat tatgggcagc tgggtacaat gattaattac tggagacccc acagagatta 30000
catgcttcac gtagtaaagg gtcccattag cagcccaacc accacctcta ccacacccac 30060
taccaccact actcccacca ccagcactgc cgcccagcct cctcatagca gaacaaccac 30120
ttttatcaat tccaagtccc actcccccca cattgccggc gggccctccg cctcagactc 30180
cgagaccacc gagatctgct tctgcaaatg ctctgacgcc attgcccagg atttggaaga 30240
tcacgaggaa gatgagcatg actacgcaga tgcatgccag gcatcagagg cagaagcgct 30300
accggtggcc ctaaaacagt atgcagactc ccacaccacc cccaaccttc ctccaccttc 30360
ccagaagcca agtttcctgg gggaaaatga aactctgcct ctttccatac tagctctgac 30420
atctgttgct attttggccg ctctgctggt gcttctatgc tctatatgct acctgatctg 30480
ctgcagaaag aaaaaatctc acggccatgc tcaccagccc ctcatgcact tcccttaccc 30540
tccagagctg ggcgaccaca aactttaagt ctgcagtagc tatctgccca tcccttgtca 30600
gtcgacagcg atgagcccca ctaatctaac agcctctgga cttacaacat tgtctcttaa 30660
tgagaccacc gctcctcaag acctgtacga tggtgtctcc gcgctggtta accagtggga 30720
tcacctgggc atatggtggc tcctcatagg agcagtgacc ctgtgcctaa tcctggtctg 30780
gatcatctgc tgcatcaaaa gcagaagacc caggcggcgg cccatctaca ggcccttcgt 30840
catcacacct gaagataatg atgatgatga caccacctcc aggctgcaga gcctaaagca 30900
gctactcttc tcttttacag catggtaaat tgaatcatgc cccgcatttt catctacttg 30960
cttctccttc cactttttct gggctcctct acattggcca ctgtgtccca catcgaggta 31020
gactgcctca cgcccttcac agtctacctg cttttcggct ttgtcatctg cacctttgtc 31080
tgcagcgtta tcactgtagt gatctgcttc atacagtgca tcgactacat ctgtgtgcgg 31140
gtggcctact ttagacacca cccccagtat cgcaacaggg acatagcggc tctcctaaga 31200
cttgtttaaa tcatggccaa attacctgtg attggtcttc tgattatctg ctgcgtccta 31260
gccgcgattg ggactcaacc taataccacc accagcgctc ccagaaagag acatgtatcc 31320
tgcagcttca agcgtccctg gaatataccc caatgcttta ctgatgaacc tgaaatctct 31380
ttggcttggt acttcagcgt caccgccctt ctcatcttct gcagtacggt tattgctctt 31440
gccatctacc cttcccttaa cctgggctgg aatgctgtca actctatgga atatcccacc 31500
ttcccagaac cagacctgcc agacctggtt gttctaaacg cgtttcctcc tcctccagtt 31560
caaaatcagt ttcgccctcc gtcccctacg cccactgagg tcagctactt taatctaaca 31620
ggcggagatg actgaaaacc tagacctaga aatggacggt ctctgcagcg agcaacgcac 31680
actagagagg cgccggcaaa aagcagagct cgagcgtctt aaacaagagc tccaagacgc 31740
cgtggccata caccagtgca aaaaagggct cttctgtctg gtaaaacagg ccacgctcac 31800
ctatgaaaaa acaggtgaca cccaccgcct aggatacaag ctgcccacac agcgccaaaa 31860
gtttgccctt atgataggtg aacaacccat caccgtcacc cagcactccg tggagacaga 31920
aggctgcatt catgctccct gcaggggcgc tgactgcctc tacaccttga tcaaaaccct 31980
ctgcggtctc agagacctta tccctttcaa ttgatcataa ctgtaatcaa taaaaaatca 32040
cttacttgaa atctgatagc aagactctgt ccaatttttt cagcaacact tccttcccct 32100
cctcccaact ctggtactct aggcgcctcc tagctgcaaa cttcctccac agtctgaagg 32160
gaatgtcaga ttcctcctcc tgtccctccg cacccacgat cttcatgttg ttacagatga 32220
aacgcgcgag atcgtctgac gagaccttca accccgtgta cccctacgat accgagatcg 32280
ctccgacttc tgtccctttc cttacccctc cctttgtatc atccgcagga atgcaagaaa 32340
atccagctgg ggtgctgtcc ctgcacctgt cagagcccct taccacccac aatggggccc 32400
tgactctaaa aatggggggc ggcctgaccc tggacaagga agggaatctc acttcccaaa 32460
acatcaccag tgtcgatccc cctctcaaaa aaagcaagaa caacatcagc cttcagaccg 32520
ccgcacccct cgccgtcagc tccggggccc taaccctttt tgccactccc cccctagcgg 32580
tcagtggcga caaccttact gtgcagtctc aggcccctct tactttggaa gactcaaaac 32640
taactctggc caccaaagga cccctaactg tgtccgaagg caaacttgtc ctagaaacag 32700
agcctcccct gcatgcaagt gacagcagta gcctgggcct tagcgtcacg gccccactta 32760
gcattaacaa tgacagccta ggactagaca tgcaagcgcc catcagctct cgagatggaa 32820
aactggctct aacagtggcg gcccccctaa ctgtggccga gggtatcaat gctttggcag 32880
tagccacagg taatggtatt ggactaaatg aaaccaacac acacctgcag gcaaaactgg 32940
tcgcgcccct aggctttgat accaacggca acattaagct aagcgtcgca ggaggcatga 33000
ggctaaacaa taacacactg atactagatg taaactaccc atttgaggct caaggccaac 33060
tgagcctaag agtgggctcg ggcccactat atgtagattc tagtagtcat aacctaacca 33120
ttagatgcct taggggattg tatgtaacat cttctaacaa ccaaaacggt ctagaggcca 33180
acattaaact aacaaaaggc cttgtgtatg acggaaatgc catagcagtt aatgttggca 33240
aagggctgga atacagccct actggcacaa cagaaaaacc tatacagact aaaataggtc 33300
taggcatgga gtatgacact gagggagcca tgatgacaaa actaggctct ggactaagct 33360
ttgacaattc aggagccatt gtggtgggaa acaaaaatga tgacaggctt actttgtgga 33420
ccacaccgga cccatcgccc aactgtcaga tttactctga aaaagatgct aaactaacct 33480
tggtactgac taaatgtggc agtcaggttg taggcacagt atctattgcc gctcttaaag 33540
gtagccttgt gccaatcact agtgcaatca gtgtggttca gatataccta aggtttgatg 33600
aaaatggggt gctgatgagt aactcttcac ttaatggcga atactggaat tttagaaacg 33660
gagactcaac taatggcaca ccatatacaa acgcagtggg ttttatgcct aatctactgg 33720
cctatcctaa aggtcaaact acaactgcaa aaagtaacat tgtcagccag gtctacatga 33780
acggggacga tactaaaccc atgacattta caatcaactt caatggcctt agtgaaacag 33840
gggatacccc tgtcagtaaa tattccatga cattctcatg gaggtggcca aatggaagct 33900
acatagggca caattttgta acaaactcct ttactttctc ctacatcgcc caagaataaa 33960
gaaagcacag agatgcttgt ttttgatttc aaaattgtgt gcttttattt attttcaagc 34020
ttacagtatt tccagtagtc attagaatag agcttaatta aactgcatga gaacccttcc 34080
acatagctta aattatcacc agtgcaaatg gaaaaaaatc aacatacctt tttatccaga 34140
tatcaaagaa ctctagtggt cagttttccc ccaccctccc agctcacaga atacacagtc 34200
ctttcccccc ggctggcttt aaacaacact atctcattgg taacagacat atttttaggt 34260
gtaataatcc acacggtctc ttggcgggcc aaacgctggt ctgtgatgtt aataaactcc 34320
ccaggcagct ctttcaagtt cacgtcgctg tccaactgct gaagcgctcg cggctccgac 34380
tgcgcctcta gcggaggcaa cggcagcacc cgatccttga tctataaagg agtagagtca 34440
taatccccca taagaatagg gcggtgatgc agcaacaagg cgcgcagcaa ctcctgccgc 34500
cgcctctccg tacgacagga atgcaacggg gtggtggtct cctccgcgat aatccgcacc 34560
gctcgcagca tcagcatcct cgtcctccgg gcacagcagc gcatcctgat ctcactgaga 34620
tcggcgcagt aagtgcagca caacaccaag atgttattta agatcccaca gtgcaaagca 34680
ctgtacccaa agctcatggc gggaaggaca gcccccacgt gaccatcgta ccagatcctc 34740
aggtaaatca aatgacgacc tctcataaac acgctggaca tatacatcac ctccttgggc 34800
atgagctgat tcaccacctc tcgataccac aggcatcgct gattaattaa agacccctcg 34860
agcaccatcc tgaaccagga agccagcacc tgaccccccg ccaggcactg cagggacccc 34920
ggtgaatcgc agtggcagtg aagactccag cgctcgtagc cgtgaaccat agagctggtc 34980
attatatcca cattggcaca acacagacac actttcatac actttttcat gattagcagc 35040
tcctctctag tcaagaccat atcccaagga atcacccact cttgaatcaa ggtaaatccc 35100
acacagcagg gcaggcctct cacataactc acgttatgca tagtgagcgt gtcgcaatct 35160
ggaaataccg gatgatcttc catcaccgaa gcccgggtct ccgtctcaaa gggaggtaaa 35220
cggtccctcg tgtagggaca gtggcgggat aatcgagatc gtgttgaacg tagagtcatg 35280
ccaaagggaa cagcggacgt actcatattt cctccagcag aaccaagtgc gcgcgtggca 35340
gctatccctg cgtcttctgt ctcgccgcct gccccgctcg gtgtagtagt tgtaatacag 35400
ccactccctc agaccgtcaa ggcgctccct ggcgtccgga tctataacaa caccgtcctg 35460
cagcgccgcc ctgatgacat ccaccaccgt agagtatgcc aagcccagcc acgaaatgca 35520
ctcactttga cagcgagaga taggaggagc gggaagagat ggaagaacca tgatagtaaa 35580
agaactttta ttccaatcga tcctctacaa tgtcaaagtg tagatctatc agatggcact 35640
ggtctcctcc gctgagtcga tcaaaaataa cagctaaacc acaaacaaca cgattggtca 35700
aatgctgcac aagggcttgc agcataaaat cgcctcgaaa gtccaccgca agcataacat 35760
caaagccacc gcccctatca tgatctatga taaaaacccc acagctatcc accagaccca 35820
tatagttttc atctctccat cgtgaaaaaa tatttacaag ctcctccttt aaatcacctc 35880
caaccaattc aaaaagttga gccagaccgc cctccacctt cattttcagc atgcgcatca 35940
tgattgcaaa aattcaggct cctcagacac ctgtataaga ttgagaagcg gaacgttaac 36000
atcaatgttt cgctcgcgaa gatcgcgcct cagtgcaagc atgatataat cccacaggtc 36060
ggagcggatc agcgaggaca tctccccgcc aggaaccaac tcaacggagc ctatgctgat 36120
tataatacgc atattcgggg ctatgctaac cagcacggcc cccaaatagg cgtactgcat 36180
aggcggcgac aaaaagtgaa cagtttgggt taaaaaatca ggcaaacact cgcgcaaaaa 36240
agcaagaaca tcataaccat gctcatgcaa atagatgcaa gtaagctcag gaacgaccac 36300
agaaaaatgc acaatttttc tctcaaacat gactgcgagc cctgcaaaaa ataaaaaaga 36360
aacattacac aagagtagcc tgtcttacaa tgggatagac tactctaacc aacataagac 36420
gggccacgac atcgcccgcg tggccataaa aaaaattatc cgtgtgatta aaaagaagca 36480
cagatagctg gccagtcata tccggagtca tcacgtgcga acccgtgtag acccccgggt 36540
tggacacatc ggccaaacaa agaaagcggc caatgtatcc cggaggaatg ataacactaa 36600
gacgaagata caacagaata accccatggg ggggaataac aaagttagta ggtgaataaa 36660
aacgataaac acccgaaact ccctcctgcg taggcaaaat agcgccctcc ccttccaaaa 36720
caacatacag cgcttccaca gcagccatga caaaagactc aaaacactca aaagactcag 36780
tcttaccagg aaaataaaag cactctcaca gcaccagcac taatcagagt gtgaagaggg 36840
ccaagtgccg aacgagtata tataggaatt aaaaatgacg taaatgtgta aaggtcaaaa 36900
aacgcccaga aaaatacaca gaccaacgcc cgaaacgaaa acccgcgaaa aaatacccag 36960
aagttcctca acaaccgcca cttccgcttt cccacgatac gtcacttcct caaaaatagc 37020
aaactacatt tcccacatgt acaaaaccaa aacccctccc cttgtcaccg cccacaactt 37080
acataatcac aaacgtcaaa gcctacgtca cccgccccgc ctcgccccgc ccacctcatt 37140
atcatattgg cctcaatcca aaataaggta tattattgat gatg 37184
<210> 3
<211> 37169
<212> DNA
<213> Great Ape Adenovirus
<400> 3
catcatcaat aatatacctt attttggatt gtggccaata tgataatgag gtgggcgggg 60
cgggtgacgt aggacgcgcg agtagggttg ggaggtgtgc ggaagtgtgg catttgcaag 120
tgggaggagc tcacatgtaa gcttccgtcg cggaaaatgt gacgttttta atgagcgccg 180
cctacctccg gaagtgccaa ttttcgcgcg cttttcaccg gatatcgtag taattttggg 240
cgggaccatg taagatttgg ccattttcgc gcgaaaagtg aaacggggaa gtgaaaactg 300
aataataggg cgttagtcat agcgcgtaat atttaccgag ggccgaggga ctttgaccga 360
ttacgtggag gactcgccca ggtgtttttt acgtgaattt ccgcgttccg ggtcaaagtc 420
tccgttttta ttgtcaccgt catctgacgc ggagggtatt taaacccgct gcgctcctaa 480
agaggccact cttgagtgcc agcgagaaga gttttctcct ccgctccgtt tcggcgatcg 540
aaaaatgaga cacttagcct gcactccggg tcttttgtcc ggccgggcgg cgtccgagct 600
tttggacgct ttgctcaatg aggttctgag cgatgatttt ccgtctacta cccactttag 660
cccacctact cttcacgaac tgtacgatct ggatgtactg gtggatgtga acgatcccaa 720
cgaggaggcg gtttctacgt tttttcccga gtctgcgctt ttggccgccc aggagggatt 780
tgacctacac actccgccgc tgcctatttt agagtctccg ctgccggagc ccagtggtat 840
accttatatg cctgaactgc ttcccgaagt ggtagacctg acctgccacg agccgggctt 900
tccgcccagc gacgatgagg gtgagccttt tgctttagac tatgctgaga tacctgggct 960
cggttgcagg tcttgtgcat atcatcagag ggttaccgga gaccccgagg ttaagtgttc 1020
gctgtgctat atgaggctga cctcttcctt tatctacagt aagttttttg tgtaggtggg 1080
ctttttgggt aggtgggttt tgtggcagga caggtgtaaa tgttgcttgt gttttttgta 1140
cctgcaggtc cggtgtccga gccagacccg gagcccgacc gcgatcccga gccggatccc 1200
gagcctcctc gcagggcaag gaaattacct tccattttgt gcaagcctaa gacacctgtg 1260
aggaccagcg aggcggacag cactgactct ggcacttcta cctctcctcc tgaaattcac 1320
ccagtggttc ctttgggtat acataaacct gttgctatta gagtttgcgg gcgacgccct 1380
gcagtagagt gcattgagga cttgcttaac gatcccgagg gacctttgga cttgagcatt 1440
aaacgcccta ggcaataaac cccacctaag taataaaccc cacctaagta ataaacttta 1500
ccgcccttgg ttattgagat gacgcccaat gtttgctttt gaatgacttc atgtgtataa 1560
taaaagtgag tgtggtcata ggtctcttgt ttgtctgggc ggggcttaag ggtatataag 1620
tttctcgggg ctaaacttgg ttacacttga ccccaatgga ggcgtggggg tgcttggagg 1680
agtttgcgga cgtgcgccgt ttgctggacg agagctctag caatacctat agtatttgga 1740
ggtatctgtg gggctctact caggccaagt tggtctccag aattaagcag gattacaagt 1800
gcgattttga agagcttttt agttcctgtg gtgagctttt gcaatccttg aatctgggcc 1860
accaggctat cttccaggaa aaggttctct cgactttgga tttttccact cccgggcgca 1920
ccgccgcttg tgtggctttt gtgtcttttg tgcaagataa atggagcggg gagacccacc 1980
tgagtcacgg ctacgtgctg gatttcatgg cgatggctct ttggagggct tacaacaaat 2040
ggaagattca gaaggaactg tacggttccg ccctacgtcg tccacttctg cagcggcagg 2100
ggctgatgtt tcccgaccat cgccagcatc agaatctgga agacgagtcg gaggagcgag 2160
cggagaagat cagcttgaga gccggcctgg accctcctca ggaggaatga atctcccgca 2220
ggtggttgac ctgtttcccg aactgagacg ggtcctgact atcagggaag atggtcagtt 2280
tgtgaagaag ctgaagaggg atcggggtga gggagatgat gaggcggcta gcaatttagc 2340
ttttagtctg ataacccgcc accgaccgga atgtattacc tatcagcaga ttaaggagag 2400
ttgtgccaac gagctggatc ttttgggtca gaagtatagc atagaacagc ttaccactta 2460
ctggcttcag cccggggatg attgggaaga ggcgatcagg gtgtatgcaa aggtggccct 2520
gcggcccgat tgcaagtata agattactaa gttggttaat attagaaact gctgctatat 2580
ttctgggaac ggggccgaag tggagataga tactgaggac agggtggcta ttaggtgttg 2640
catgataaac atgtggcccg ggatactggg gatggatggg gtgatattta tgaatgtaag 2700
gttcacgggc cccaacttta atggtacggt gttcatgggc aacaccaact tgctcctgca 2760
tggtgcgagt ttctatgggt ttaacaacac ctgtatagag gcctggaccg atgtaaaggt 2820
tcgaggttgt tccttttata gctgttggaa ggcggtggtg tgtcgcccta aaagcagggg 2880
ttctgtgaag aaatgcttgt ttgaaaggtg caccctaggt atcctttctg agggcaactc 2940
cagggtgcgc cataatgtgg cttcgaactg cggttgcttc atgcaagtga agggggtgag 3000
cgttatcaag cataactcgg tctgtggaaa ctgcgaggat cgcgcctctc agatgctgac 3060
ctgctttgat ggcaactgtc acctgttgaa gaccattcat ataagcagtc accccagaaa 3120
ggcctggccc gtgtttgagc ataacattct gacccgctgt tccttgcatc tgggggtcag 3180
gaggggtatg ttcctgcctt accagtgtaa cttcagccac actaaaatcc tgctggaacc 3240
cgagtgcatg actaaggtca gcctgaatgg tgtgtttgat gtgagtctga agatttggaa 3300
ggtgctgagg tatgatgaga ccaggaccag gtgccgaccc tgcgagtgcg gcggcaagca 3360
catgagaaat cagcctgtga tgttggatgt gaccgaggag cttaggcctg accatctggt 3420
gctggcctgc accagggccg agtttgggtc tagcgatgag gataccgatt gaggtgggta 3480
aggtgggcgt ggctagcagg gtgggcgtgt ataaattggg ggtctaaggg gtctctctgt 3540
ttgtcttgca acagccgccg ccatgagcga caccggcaac agctttgatg gaagcatctt 3600
tagcccctat ctgacagtgc gcatgcctca ctgggccgga gtgcgtcaga atgtgatggg 3660
ttccaacgtg gatggacgtc ccgttctgcc ttcaaattcg tctacgatgg cctacgcgac 3720
cgtgggagga actccgttgg acgccgcgac ctccgccgcc gcctccgccg ccgccgcgac 3780
cgcgcgcagc atggctacgg acctttacag ctctttggtg gcgagcagcg cggcctctcg 3840
cgcgtctgct cgggatgaga aactgactgc tctgctgctt aaactggaag acttgacccg 3900
ggagctgggt caactgaccc agcaggtctc cagcttgcgt gagagcagcc ttgcctcccc 3960
ctaatggccc ataatataaa taaaagccag tctgtttgga ttaagcaagt gtatgttctt 4020
tatttaactc tccgcgcgcg gtaagcccgg gaccagcggt ctcggtcgtt tagggtgcgg 4080
tggattcttt ccaacacgtg gtacaggtgg ctctggatgt ttagatacat gggcatgagt 4140
ccatccctgg ggtggaggta gcaccactgc agagcttcgt gctcgggggt ggtgttgtat 4200
atgatccagt cgtagcagga gcgctgggcg tggtgctgaa aaatgtcctt aagcaagagg 4260
cttatagcta gggggaggcc cttggtgtaa gtgtttacaa atctgctcag ctgggagggg 4320
tgcatccggg gggatatgat gtgcatcttg gactggattt ttaggttggc tatgttccca 4380
cccagatccc ttctgggatt catgttgtgc aggaccacca gcacggtata tccagtgcac 4440
ttgggaaatt tatcgtggag cttagacggg aatgcatgga agaacttgga gacgcccttg 4500
tggcctccca gattttccat acattcgtcc atgatgatgg caatgggccc gtgggaagct 4560
gcctgagcaa aaacgtttct gggatcgctc acatcgtagt tatgttccag ggtgaggtca 4620
tcataggaca tctttacgaa tcgggggcgg agggtcccgg actgggggat gatggtaccc 4680
tcgggccccg gggcgtagtt cccctcacag atctgcatct cccaggcttt catttcagag 4740
ggagggatca tatccacctg cggggcgatg aaaaagacag tttctggcgc aggggagatt 4800
aactgggatg agagcaggtt tctgagcagc tgtgactttc cacagccggt gggcccatat 4860
atcacgccta tcaccggctg cagctggtag ttaagagagc tgcagctgcc gtcctcccgg 4920
agcagggggg ccacctcgtt gagcatatcc ctgacgtgga tgttttccct gaccagttcc 4980
gccagaaggc gctcgccgcc cagcgaaagc agctcttgca aggaagcaaa atttttcagc 5040
ggtttcaggc catcggccgt gggcatgttt ttcagcgtct gggtcagcag ctccagcctg 5100
tcccagagct cggtgatgtg ctctacggca tctcgatcca gcagatctcc tcgtttcgcg 5160
ggttggggcg gctttcgctg tagggcacca gccgatgggc gtccagcggg gccagagtca 5220
tgtccttcca tgggcgcaga gtcctcgtca gggtggtctg ggtcacggtg aaggggtgcg 5280
ctccgggttg ggcgctggcc agggtgcgct tgaggctggt tctgctggtg ctgaatcgct 5340
gccgctcttc gccctgcgcg tcggccaggt agcatttgac catggtctcg tagtcgagac 5400
cctcggcggc gtgccccttg gcgcggagct ttcccttgga ggtggcgccg cacgaggggc 5460
actgcaggct cttcagggcg tagagcttgg gagcgagaaa cacggactct ggggagtagg 5520
cgtccgcgcc gcaggccgag cagaccgtct cgcattccac cagccaagtg agttccgggc 5580
ggtcagggtc aaaaaccagg ctgcccccat gctttttgat gcgtttctta cctcggctct 5640
ccatgaggcg gtgtcccttc tcggtgacga agaggctgtc cgtgtccccg tagaccgatt 5700
tcaggggcct gtcttccagc ggagtgcctc tgtcctcctc gtagagaaac tctgaccact 5760
ctgagacaaa ggcccgtgtc caggccagga cgaaggaggc cacgtgggag gggtagcggt 5820
cgttgtccac tagcgggtcc accttctcca gggtgtgcag gcacatgtcc ccctcctccg 5880
cgtccagaaa agtgattggc ttgtaggtgt aggacacgtg accgggggtt cccgacgggg 5940
gggtataaaa gggggtgggt gccctttcat cttcactctc ttccgcatcg ctgtctgcga 6000
gagccagctg ctggggtaag tattcccttt cgaaggcggg catgacctca gcgctcaggt 6060
tgtcagtttc taaaaatgag gaagatttga tgttcacctg tccggaggtg atacctttga 6120
gggtacctgg gtctatctgg tcagaaaaca ctattttttt gttatcaagc ttggtggcga 6180
acgacccgta gagggcgttg gagagcagct tggcgatgga gcgcagggtc tggtttttgt 6240
cgcggtcggc tcgctccttg gccgcgatgt tgagttgcac gtactcgcgg gccacgcact 6300
tccactcggg gaagacggtg gtgcgctcgt ctgggatcag gcgcaccctc cagccgcggt 6360
tgtgcagggt gaccatgtcg acgctggtgg cgacctcacc gcgcaggcgc tcgttggtcc 6420
agcagaggcg gccgcccttg cgcgagcaga aggggggtag ggggtccagc tggtcctcgt 6480
tcggggggtc cgcgtcgatg gtaaagaccc cggggagcag acgcgggtca aagtagtcga 6540
tcttgcaagc ttgcatgtcc agagcccgct gccattcgcg ggcggcgagc gcgcgctcgt 6600
aggggttgag gggcgggccc cagggcatgg ggtgggtgag cgcagaggcg tacatgccgc 6660
agatgtcata cacgtacagg ggttccctga ggatgccgag gtaggtgggg tagcagcgcc 6720
ccccgcggat gctggcgcgc acgtagtcat agagttcgtg ggagggggcc agcatgttgg 6780
gcccgaggtt ggtgcgctgg gggcgctcgg cgcggaagac gatctgcctg aagatggcgt 6840
gggagttgga ggagatggtg ggccgctgga agacgttgaa gcttgcttct tgcaagccca 6900
cggagtccct gacgaaggag gcgtaggact cgcgcagctt gtgcaccagc tcggcggtga 6960
cctggacgtc gagcgcacag tagtcgaggg tctcacggat gatgtcatac ttatcctccc 7020
ccttcttttt ccacagctcg cggttgagga cgaactcttc gcggtctttc cagtactctt 7080
ggaggggaaa cccgtccgtg tccgaacggt aagagcctag catgtagaac tggttgacgg 7140
cctggtaggg gcagcagccc ttctccacgg gcagcgcgta ggcctgcgcc gccttgcgga 7200
gggaggtgtg ggtgagggcg aaagtgtccc tgaccatgac tttgaggtat tgatgtctga 7260
agtctgtgtc atcgcagccg ccctgttccc acagggtgta gtccgtgcgc tttttggagc 7320
gcgggttggg cagggagaag gtgaggtcat tgaagaggat cttccccgct cgaggcatga 7380
agtttctggt gatgcgaaag ggccctggga ccgaggagcg gttgttgatg acctgggcgg 7440
ccaggacgat ctcgtcaaag ccgtttatgt tgtggcccac gatgtagagc tccaggaagc 7500
ggggctggcc cttgatggag gggagctttt taagttcctc gtaggtgagc tcctcgggcg 7560
attccaggcc gtgctcctcc agggcccagt cttgcaagtg agggttggcc gccaggaagg 7620
atcgccagag gtcgcgggcc atgagggtct gcaggcggtc gcggaaggtt ctgaactgtc 7680
gccccacggc catcttttcg ggggtgatgc aatagaaggt gagggggtct ttctcccagg 7740
ggtcccatct gagctctcgg gcgaggtcgc gtgcggcggc gaccagagcc tcgtcgcccc 7800
ccagtttcat gaccagcatg aagggcacga gctgcttgcc aaaggctccc atccaagtgt 7860
aggtctctac atcgtaggtg acaaagaggc gctccgtgcg aggatgagag ccgatcggga 7920
agaactggat ctcccgccac cagttggagg attggctgtt gatgtggtga aagtagaagt 7980
cccgtctgcg ggccgagcac tcgtgctggc ttttgtaaaa gcgaccgcag tactggcagc 8040
gctgcacggg ttgtatatct tgcacgaggt gaacctggcg acctctgacg aggaagcgca 8100
gcgggaatct aagtcccccg cctggggtcc cgtgtggctg gtggtcttct actttggttg 8160
tctggccgcc agcatctgtc tcctggaggg cgatggtgga acagaccacc acgccgcgag 8220
agccgcaggt ccagatctcg gcgctcggcg ggcggagttt gatgacgaca tcgcgcacat 8280
tggagctgtc catggtctcc agctcccgcg gcggcaggtc agccgggagt tcctggaggt 8340
ttacctcgca gagacgggtc aacgcacggg cagtgttaag atggtatctg atttcaaggg 8400
gcgtgttggc ggcggagtcg atggcttgca ggaggccgca gccccggggg gccacgatgg 8460
ttccccgtgg ggcgcgaggg gaggcggaag ctgggggtgt gttcagaagc ggtgacgcgg 8520
gcgggccccc ggaggtaggg ggggttccgg ccccacaggc atgggcggca ggggcacgtc 8580
ttcgccgcgc gcgggcaggg gctggtgctg gctccgaaga gcgcttgcgt gcgcgacgac 8640
gcgacggttg gtgtcctgta tctggcgcct ctgagtgaag accacgggtc ccgtgacctt 8700
gaacctgaaa gagagttcga cagaatcaat ctcggcatcg ttgacagcgg cctggcgcag 8760
gatctcctgc acgtcgcccg agttgtcctg gtaggcgatc tctgccatga actgctcgat 8820
ctcttcctcc tggagatctc ctcgtccggc gcgctccacg gtggccgcca ggtcgttgga 8880
gatgcgaccc atgagctgcg agaaggcgtt gagtccgccc tcgttccaga cccggctgta 8940
gaccacgccc ccctcggcgt cgcgggcgcg catgaccacc tgggccaggt tgagctccac 9000
gtgtcgcgtg aagacggcgt agttgcgcag gcgctggaaa aggtagttca gggtggtggc 9060
ggtgtgctcg gcgacaaaga agtacatgac ccagcgccgc aacgtggatt cattgatgtc 9120
ccccaaggcc tccaggcgct ccatggcctc gtagaagtcc acggcgaagt tgaaaaactg 9180
ggagttgcga gcggacacgg tcaactcctc ctccagaaga cggatgagct cggcgacagt 9240
gtcgcgcacc tcgcgctcga aggccacggg gggcgcttct tcctcttcca cctcttcttc 9300
catgattgct tcttcttctt cctcagccgg gacgggaggg ggcggcggcg ggggaggggc 9360
gcggcggcgg cggcggcgca ccgggaggcg gtcgatgaag cgctcgatca tctccccccg 9420
catgcggcgc atggtctcgg tgacggcgcg gccgttctcc cgggggcgca gctcgaagac 9480
gccgcctttc atctcgccgc ggggcgggcg gccgtgaggt agcgagacgg cgctgactat 9540
gcatcttaac aattgctgtg taggtacgcc gccaagggac ctgattgagt ccagatccac 9600
cggatccgaa aacctttgga ggaaagcgtc tatccagtcg cagtcgcaag gtaggctgag 9660
caccgtggcg ggcgggggcg ggtcgggaga gttcctggcg gagatgctgc tgatgatgta 9720
attaaagtag gcggtcttga gaaggcggat ggtggacagg agcaccatgt ctttgggtcc 9780
ggcctgttgg atgcggaggc ggtcggccat gccccaggcc tcgttctgac accggcgcag 9840
gtctttgtag tagtcttgca tgagtctttc caccggcacc tcttctcctt cctcttctcc 9900
atctcgccgg tggtttctcg cgccgcccat gcgcgtgacc ccaaagcccc tgagcggctg 9960
cagcagggcc aggtcggcga ccacgcgctc ggccaagatg gcctgctgta cctgagtgag 10020
ggtcctctcg aagtcatcca tgtccacgaa gcggtggtag gcgcccgtgt tgatggtgta 10080
ggtgcagttg gccatgacgg accagttgac ggtctggtgt cccggctgcg agagctccgt 10140
gtaccgcagg cgcgagaagg cgcgggaatc gaacacgtag tcgttgcaag tccgcaccag 10200
atactggtag cccaccagga agtgcggcgg aggttggcga tagaggggcc agcgctgggt 10260
ggcgggggcg ccgggcgcca ggtcttccag catgaggcgg tggtatccgt agatgtacct 10320
ggacatccag gtgatgccgg cggcggtggt ggtggcgcgc gcgtagtcgc ggacccggtt 10380
ccagatgttt cgcaggggcg agaagtgttc catggtcggc acgctctggc cggtgaggcg 10440
cgcgcagtcg ttgacgctct atacacacac aaaaacgaaa gcgtttacag ggctttcgtt 10500
ctgtagcctg gaggaaagta aatgggttgg gttgcggtgt gccccggttc gagaccaagc 10560
tgagctcggc cggctgaagc cgcagctaac gtggtattgg cagtcccgtc tcgacccagg 10620
ccctgtatcc tccaggatac ggtcgagagc ccttttgctt tcttggccaa gcgcccgtgg 10680
cgcgatctgg gatagatggt cgcgatgaga ggacaaaagc ggctcgcttc cgtagtctgg 10740
agaaacaatc gccagggttg cgttgcggcg taccccggtt cgagccccta tggcggcttg 10800
gatcggccgg aaccgcggct aacgtgggct gtggcagccc cgtcctcagg accccgccag 10860
ccgacttctc cagttacggg agcgagcccc ttttgttttt tattttttag atgcatcccg 10920
tgctgcggca gatgcgcccc tcgccccggc ccgatcagca gcagcaacag caggcatgca 10980
gacccccctc tcctctcccc gccccggtca ccacggccgc ggcggccgtg tccggcgcgg 11040
ggggtgcgct ggagtcagat gagccaccgc ggcggcgacc taggcagtat ctggacttgg 11100
aagagggcga gggactggcg cggctggggg cgagctcccc agagcgtcac ccgcgggtgc 11160
agttgaaaag ggacgcgcgc gaggcgtacc tgccgcggca aaacctgttt cgcgaccgcg 11220
ggggcgagga gcccgaggag atgcgagact gcaggttcca agcagggcgc gagctgcgcc 11280
gcggcttgga cagagagcgc ttgctgcgcg aggaggactt tgagcccgac acgcagacgg 11340
gcatcagccc cgcgcgcgcg cacgtggccg cggccgacct ggtgaccgcc tacgagcaga 11400
cggtgaacca ggagcgcaac ttccaaaaaa gcttcaacaa ccacgtgcgc acgctggtgg 11460
cgcgcgagga ggtgaccctg ggtctcatgc atctgtggga cctggtggag gcgatcgtgc 11520
agaaccccag cagcaagccc ctgaccgcgc agctgttcct ggtggtgcag cacagcaggg 11580
acaacgatgc cttcagggag gcgctgctga acatcaccga gccggagggg cgctggctcc 11640
tggacctgat aaacatcctg cagagcatag tggtgcagga gcgcagcctg agcctggccg 11700
agaaggtggc ggccattaac tattctatgc tgagcctggg caagttctac gcccgcaaga 11760
tctacaagac cccctacgtg cccatagaca aggaggtgaa gatagacagc ttctacatgc 11820
gcatggcgct aaaggtgctg accctgagcg acgacctggg agtgtaccgc aacgagcgca 11880
tccacaaggc cgtgagcgcc agccggcggc gcgagctgag cgaccgcgag ctgatgcaca 11940
gtctgcaacg cgcgctgacc ggcgcgggcg agggcgacag ggaggtcgag tcctacttcg 12000
acatgggggc cgacctgcac tggcagccga gccgccgcgc cctggaggcg gcgggggcgt 12060
atggcggccc cctggcggcc gatggcgagg aagaggagga ctatgagcta gaggagggcg 12120
agtacctgga ggactgacct ggctggtggt gttttggtat agatgcaaga tccgaacgtg 12180
gcggacccgg cggtccgggc ggcgctgcag agccagccgt ccggcattaa ctcctctgac 12240
gactgggccg cggccatggg tcgcatcatg gccctgaccg cgcgcaaccc cgaggccttc 12300
aggcagcagc ctcaggctaa ccggctggcg gccatcttgg aagcggtagt gcccgcgcgc 12360
tccaacccca cccacgagaa ggtgctggcc atagtcaacg cgctggcgga gagcagggcc 12420
atccgggcgg acgaggccgg actggtgtac gatgcgctgc tgcagcgggt ggcgcggtac 12480
aacagcggca acgtgcaaac caacctggac cgcctggtga cggacgtgcg cgaggccgtg 12540
gcgcagcgcg agcgcttgca tcaggacggt aacctgggct cgctggtggc gctaaacgcc 12600
ttcctcagca cccagccggc caacgtaccg cgggggcagg aggactacac caacttcttg 12660
agcgcgctgc ggctgatggt gaccgaggtc cctcagagcg aagtgtacca gtcggggccc 12720
gactacttct tccagaccag cagacagggc ttgcaaaccg tgaacctgag ccaggctttc 12780
aagaacctgc gggggctgtg gggagtgaag gcgcccaccg gcgaccgggc tacggtgtcc 12840
agcctgctaa cccccaactc gcgcctgctg ctgctgctga tcgcgccctt cacggacagc 12900
gggagcgtct cgcgggagac ctatctgggc cacctgctga cgctgtaccg cgaggccatc 12960
gggcaggcgc aggtggacga gcacaccttc caggagatca ccagcgtgag ccacgcgctg 13020
gggcaggagg acacgggcag cctgcaggcg accctgaact acctgctgac caacaggcgg 13080
cagaagattc ccacgctgca cagcctgacc caggaggagg agcgcatctt gcgctacgtg 13140
cagcagagcg tgagcctgaa cctgatgcgc gacggcgtga cgcccagcgt ggcgctggac 13200
atgaccgcgc gcaacatgga accgggcatg tacgcttccc agcggccgtt catcaaccgc 13260
ctgatggact acttgcatcg ggcggcggcc gtgaaccccg agtacttcac caatgccatt 13320
ctgaatcccc actggatgcc ccctccgggt ttctacaacg gggactttga ggtgcccgag 13380
gtcaacgacg ggttcctctg ggatgacatg gatgacagtg tgttctcccc caacccgctg 13440
cgcgccgcgt ctctgcgatt gaaggagggc tctgacaggg aaggaccgag gagtttggcc 13500
tcctccctgg ctctgggggc ggtgggcgcc acgggcgcgg cggcgcgggg cagcagcccc 13560
ttccccagcc tggcggactc tctgaatagc gggcgggtga gcaggccccg cttgctaggc 13620
gaggaggagt atctgaacaa ctccctgcta cagcccgtga gggacaaaaa cgctcagcgg 13680
cagcagtttc ccaacaacgg gatagagagc ctggtggaca agatgtccag atggaagacg 13740
tatgcgcagg agtacaagga gtgggaggac cgacagccgc ggcccctgcc gccccctaga 13800
cagcgctggc agcggcgtgc gtccaaccgc cgctggaggc aggggcccga ggacgatgat 13860
gactctgcag atgacagcag cgtgttggat ctgggcggga gcgggaaccc cttttcgcac 13920
ctgcgcccac gcctgggcaa gatgttttaa aagagaaaaa taaaaactca ccaaggccat 13980
ggcgacgagc gttggttttt ttgttccctt ccttagtatg cggcgcgcgg cgatgttcga 14040
ggaggggcct cccccctctt acgagagcgc gatgggaatt tctcctgcgg cgcccctgca 14100
gcctccctac gtgcctcctc ggtacctgca acctacaggg gggagaaata gcatctgtta 14160
ctctgagctg cagcccctgt acgataccac cagactgtac ctggtggaca acaagtccgc 14220
ggacgtggcc tccctgaact accagaacga ccacagcgat tttttgacca cggtgatcca 14280
aaacaacgac ttcaccccaa ccgaggccag tacccagacc ataaacctgg acaacaggtc 14340
gaactggggc ggcgacctga agactatcct gcacaccaat atgcccaacg tgaacgagtt 14400
catgttcacc aactctttta aggcgcgggt gatggtggcg cgcgagcagg gggaggcgaa 14460
gtacgagtgg gtggacttca cgctgcccga gggcaactat tcagagacca tgactctcga 14520
cctgatgaac aatgcgatcg tggaacacta tctgaaagtg ggcaggcaga acggggtgaa 14580
ggagagcgat atcggggtca agtttgacac cagaaacttt cgtctgggct gggaccccgt 14640
gaccgggctg gtcatgccgg gggtctacac caacgaggcc tttcatcccg atatagtgct 14700
cctgcccggc tgtggggtgg actttaccca gagccggctg agcaacctgc tgggcgttcg 14760
caagcggcaa cctttccagg agggtttcaa gatcacctat gaggatctgg aggggggcaa 14820
cattcccgcg ctccttgatc tggacgccta cgaggagagc ttgaaacccg aggagagcgc 14880
tggcgacagc ggcgagagtg gcgaggagca agccggcggc ggtggcagcg cgtcggtaga 14940
aaacgaaagt actcccgcag tggcggcgga cgctgcggag gtcgagccgg aggccatgca 15000
gcaggacgca gaggagggcg cgcaggagga catgaacaat ggggagatca ggggcgacac 15060
tttcgccacc cggggcgaag aaaaagaggc agaggcggcg gcggcgacgg cggaagccga 15120
aaccgaggca gaggcagagc ccgagaccga agttatggaa gacatgaatg atggagaacg 15180
taggggtgac acgtttgcca cccggggcga agagaaggcg gcggaggcag aagccgcggc 15240
tgaggaggcg gctgcggctg cggccaaggc tgaggctgcg gctgaggcta aggtcgaagc 15300
cgatgttgcg gttgaggctc aggctgagga ggaggcggcg actgaagcag ttaaggaaaa 15360
ggcccaggca gagcaggaag agaaaaaacc tgtcattcaa cctctaaaag aagatagcaa 15420
aaagcgcagt tacaacgtca tcgagggcag cacctttacc caataccgca gctggtacct 15480
ggcttacaac tacggcgacc cggtcaaggg ggtgcgctcg tggaccctgc tctgcacgcc 15540
ggacgtcacc tgcggctccg agcagatgta ctggtcgctg ccaaacatga tgcaagaccc 15600
ggtgaccttc cgttccacgc ggcaggttag caactttccg gtggtgggcg ccgaactgct 15660
gccagtgcac tccaagagtt tttacaacga gcaggccgtc tactcccagc tgatccgcca 15720
ggccacctct ctgacccacg tgttcaatcg ctttcccgag aaccagattt tggcgcgccc 15780
gccggccccc accatcacca ccgtcagtga aaacgttcct gccctcacag atcacgggac 15840
gctaccgctg cgcaacagca tctcaggagt ccagcgagtg accattactg acgccagacg 15900
ccggacctgc ccctacgttt acaaggcctt gggcatagtc tcgccgcgcg tcctctccag 15960
tcgcactttt taaaacacat ctaccctcac gctccaaaat catgtccgta ctcatctcgc 16020
ccagcaacaa caccggctgg gggctgcgcg cgcccagcaa gatgtttgga ggggcgagga 16080
aacgctccga acagcaccca gtgcgcgtgc gcggccacta ccgcgcgccc tggggtgcgc 16140
acaagcgcgg gcgcacaggg cgcaccactg tggatgatgt cattgactcc gtagtggagc 16200
aggcgcgcca ctacacaccc ggcgcgccga ccgcctccgc cgtgtccacc gtggaccagg 16260
cgatcgaaag cgtggtacag ggggcgcggc actatgccaa ccttaaaagt cgccgccgcc 16320
gcgtggcgcg ccgccatcgc cggagacccc gggctactgc cgccgcgcgc cttaccaagg 16380
ctctgctcaa gcgcgccagg cgaactggcc accgggccgc catgagggcc gcacggcggg 16440
ctgccgctgc cgcgagcgcc gtggccccgc gggcacgaag gcgcgcggcc gctgccgccg 16500
ccgccgccat ttccagcttg gcctcgacgc ggcgcggtaa catatactgg gtgcgcgact 16560
cggtaagcgg cacacgggtg cccgtgcgct ttcgcccccc acggaattag cacaaaacaa 16620
catacacact gagtctcctg ctgttgtgta tcccagcggc gaccgtcagc agcggcgaca 16680
tgtccaagcg caaaattaaa gaagagatgc tccaggtcat cgcgccggag atctatgggc 16740
ccccgaagaa ggaggaggat gattacaagc cccgcaagct aaagcgggtc aaaaagaaaa 16800
agaaagatga tgacgttgac gaggcggtgg agtttgtccg ccgcatggcg cccaggcgcc 16860
ccgtgcagtg gaagggtcgg cgcgtgcagc gagtcctgcg ccccggcacc gcggtggtct 16920
ttacgcccgg cgagcgttcc acgcgcactt tcaagcgggt gtacgatgag gtgtacggcg 16980
acgaggatct gttggagcag gccaaccatc gctttgggga gtttgcatat gggaaacggc 17040
cccgcgagag cctaaaagag gacctgctgg cgctaccgct ggacgagggc aatcccaccc 17100
cgagtctgaa gccggtaacc ctgcaacagg tgctgccttt gagcgcgccc agcgagcaga 17160
agcgagggtt gaagcgcgag ggcggggacc tggcacccac cgtgcagttg atggtgccca 17220
agcggcagaa gctggaggac gtgctggaga aaatgaaagt agagcccggg atccagcccg 17280
aaatcaaggt ccgccccatc aagcaggtgg cgcccggcgt gggagtccag accgtggacg 17340
ttaggattcc cacggaggag atggaaaccc aaaccgccac tccctcttcg gcggctagcg 17400
ccaccaccgg cgccgcttcg gtagaggtgc agacggaccc ctggctacct gccgccactg 17460
tcgccgccgc cgccgccgcc ccccgttcgc gcgggcgcaa gagaaattat ccagcggcca 17520
gcgcgctcat gccccagtac gcactgcatc catccatcgc gcccaccccc ggctaccgcg 17580
ggtactcgta ccgcccgcgc agatcagccg gcacccgcgg ccgccgccgc cgtgcgacca 17640
caaccagccg ccgccgtcgc cgccgccgcc agccagtgct gacccccgtg tctgtaagga 17700
aggtggctcg ctcggggagc acgctggtgg tgcccagagc gcgctaccac cccagcattg 17760
tttaaagccg gtctctgtat ggttcttgca gatatggccc tcacttgtcg cctccgcttc 17820
ccggtgccgg gataccgagg aagaactcac cgccgcagag gcatggcggg cagcggtctc 17880
cgcggcggcc gtcgccatcg ccggcgcgca aagagcaggc gcatgcgcgg cggtgtgctg 17940
cccttcctaa tcccgctaat cgccgcggcg atcggtgccg tgcccgggat cgcctccgtg 18000
gccctgcagg cgtcccagaa acattgactc ttgcaacctt gcaagcttgc attttttgga 18060
ggaaaaaata aaaagtctag actctcacgc tcgcttggtc ctgtgactat tttgtagaaa 18120
aaagatggaa gacatcaact ttgcgtcgct ggccccgcgt cacggctcgc gcccgttcat 18180
gggagactgg acagatatcg gcaccagcaa tatgagcggt ggcgccttca gctggggcag 18240
tctgtggagt ggccttaaaa attttggttc caccattaag aactatggca acaaagcgtg 18300
gaacagcagc acgggccaga tgctgagaga caagttgaaa gagcagaact tccaggaaaa 18360
ggtggcgcag ggcctggcct ctggcatcag cggggtggtg gacatagcta accaggccgt 18420
gcagaaaaag ataaacagtc atctggaccc ccggcctcag gtggaggaaa cgcctccagc 18480
aatggagacg gtgtctcccg agggcaaagg cgaaaagcgc ccgcggcccg acagggaaga 18540
gaccctggtg tcacacaccg aggagccgcc ctcttacgag gaggcagtca aggccggcct 18600
gcctaccact cgccccatag cccccatggc caccggtgtg gtgggacaca ggcaacacac 18660
ccccgcaaca ctagatctgc ccccgccgtc cgatccgcct cgccagccaa aggcggcgac 18720
ggtgtccgct ccctccactt ccgccgccaa cagagtgccc ctgcgccgcg ctgcaagcgg 18780
cccccgggcc tcgcgagtca gcggcaactg gcagagcaca ctgaacagca tcgtgggcct 18840
gggagtgagg agtgtgaagc gccgccgttg ctactgaatg agcaagctag ctaacgtgtt 18900
gtatgtgtgt atgcgtccta tgtcgccgcc agaggagctg ttgagccgcc ggcgccgtct 18960
gcactccagc gaatttcaag atggcgaccc catcgatgat gcctcagtgg tcgtacatgc 19020
acatctcggg ccaggacgct tcggagtacc tgagccccgg gctggtgcag ttcgcccgcg 19080
ccacagacac ctacttcaac atgagtaaca agttcaggaa ccccactgtg gcgcccaccc 19140
acgatgtgac cacggaccgg tcgcagcgcc tgacgctgcg gtttatcccc gtggatcggg 19200
aggacaccgc ctactcttac aaggcgcggt ttacgctggc cgtgggcgac aatcgcgtgc 19260
tggacatggc ctccacttac tttgacatcc ggggggtgct ggacaggggt cccactttta 19320
agccctactc gggcactgcc tacaaccccc tggctcctaa gggcgccccc aattcttgtg 19380
agtgggaaca agaggaaaac caggtggtcg ctgcagatga tgaacttgag gatgaggaag 19440
cgcaggcaca agaggaagcc cctgtgaaaa aaattcatgt gtatgctcag gcgcctcttt 19500
ccggcgaaaa gatttccaag gatggcatcc aaataggcac tgaagtcgta ggagatacat 19560
ctaaggacac ttttgccgac aaaacattcc agcccgaacc tcagataggc gaatctcagt 19620
ggaatgaagc tgatgccaca gcagcaggag gtagggtttt gaagaagact actcccatga 19680
ggccttgtta tggatcttat gctaggccta ccaatgccaa cggaggccaa ggaattatgg 19740
ttgccaatga acaaggagtg ttggagtcta aagtagaaat gcagtttttc tctaacacca 19800
caacccttaa tgcgcgggat ggaaccggca atcccgaacc aaaggtggtg ttgtatagtg 19860
aagatgtcca cttggaatct cccgatactc acctgtctta caagcccaaa aaggatgatg 19920
ttaatgccaa aatcatgttg ggccaacagg ctatgcccaa taggcccaat cttattggat 19980
ttagagataa tttcattggg ctcatgtttt acaacagcac cggtaacatg ggagtgctgg 20040
cgggtcaagc ctctcagttg aatgctgtgg tggacttgca ggatagaaac acagaactgt 20100
cgtatcagct tttgcttgat tccattgggg atagaaccag atatttctcc atgtggaacc 20160
aggcagtgga tagttatgac ccagatgtca gaatcattga aaaccacggg actgaggacg 20220
aactgcctaa ctactgtttt cctctgggcg gcattggagt tacagatact tatcaaggga 20280
taaaaaatac taatggcaat ggtcaatgga ccaaagatga tcagtttgcg gaccgcaatg 20340
aaataggggt gggaaacaac tttgccatgg agatcaacat ccaggccaac ctctggagaa 20400
acttcctcta tgcaaacgtg gggctctacc tgccagacaa gctcaagtac aaccccacca 20460
acgtggacat ctctgacaac cccaacacct atgactacat gaacaagcgg gtggtggccc 20520
ctggcctggt ggactgcttt gtcaatgtgg gagccaggtg gtccctggac tacatggaca 20580
acgtcaaccc cttcaaccac caccgcaatg cgggtctgcg ctaccgctcc atgatcctgg 20640
gcaacgggcg ctatgtgccc tttcacatcc aggtacccca gaagttcttt gccatcaaga 20700
acctcctgct cctgcccggc tcctacacct acgagtggaa cttcaggaag gatgtgaaca 20760
tggtcctaca gagctctctg ggcaatgacc ttagggtgga tggggccagc atcaagtttg 20820
acagcatcac cctctatgct acatttttcc ccatggccca caacaccgcc tccacgcttg 20880
aggccatgct gagaaacgac accaacgacc agtcctttaa tgactacctc tctggggcca 20940
acatgctcta cccaatccca gccaaggcca ccaacgtgcc catctccatc ccctctcgca 21000
actgggccgc ctttagaggc tgggccttta cccgccttaa gaccaaggag accccctccc 21060
tgggctcggg ttttgatccc tactttgttt actcgggatc catcccctac ctggatggca 21120
ccttctacct caaccacact ttcaagaaga tatccatcat gtatgactcc tccgtcagct 21180
ggccgggcaa cgaccgcttg ctcaccccca atgagttcga ggtcaagcgc gccgtggacg 21240
gcgagggcta caacgtggcc cagtgcaaca tgaccaagga ctggttcctg gtgcagatgc 21300
tggccaacta caacataggc taccagggct tttacatccc agagagctac aaggacagga 21360
tgtactcctt cttcagaaat ttccaaccca tgagccgaca ggtggtggac gagaccaatt 21420
acaaggacta tcaagccatt ggcatcaccc accagcacaa caactcgggt ttcgtgggct 21480
acctggcacc cactatgcgc gagggacagg cctaccccgc caacttcccc taccccttga 21540
taggcaagac cgcggtcgac agcgtcaccc agaaaaagtt cctctgcgac cgcaccctct 21600
ggcgcatccc cttctctagc aacttcatgt ccatgggtgc gctcacggac ctgggccaaa 21660
acctgcttta tgccaactct gcccatgcgc tggacatgac tttcgaggtg gaccccatgg 21720
acgagcccac ccttctctat attgtgtttg aagtgttcga cgtggtcaga gtgcaccagc 21780
cgcaccgcgg tgtcatcgag accgtgtacc tgcgtacgcc cttctcagcc ggcaacgcca 21840
ccacctaagg agacagcgcc gccgcctgca tgactggttc caccgagcaa gagctcaggg 21900
ccatcgccag agacctggga tgcggaccct actttttggg cacctatgac aaacgcttcc 21960
cgggtttcat ctcccgagac aagctcgcct gcgccatcgt caacacggcc gcgcgcgaga 22020
ccgggggcgt gcactggctg gcctttggct gggacccgcg ctctaaaact tgctacctct 22080
ttgacccctt tggcttctct gatcagcgcc tcaggcagat ttatgagttt gagtacgagg 22140
ggctgctgcg ccgcagcgcg cttgcctcct cgcccgaccg ctgcatcacc cttgagaagt 22200
ccaccgagac cgtgcagggg ccccactcgg ccgcctgcgg tctcttctgt tgcatgtttt 22260
tgcacgcctt tgtacactgg cctcagagtc ccatggatcg caaccccacc atgaacttgc 22320
taaagggagt gcccaacgcc atgctccaga gcccccaggt cctgcccacc ctgcgccgca 22380
accaggaaca gctctaccgc ttcctggagc gccactcccc ctacttccgc agccacagcg 22440
cgcgcatccg gggggccacc tctttttgcc acttgcaaga aaacatgcaa gacggaaaat 22500
gatgtacagc atgcttttaa taaatgtaaa gactgtgcac tttatttata cacgggctct 22560
ttctggttat ttattcaaca ccgccgtcgc catctagaaa tcgaaagggt tctgccgcgc 22620
gtcgccgtgc gccacgggca gagacacgtt gcgatactgg aagcggctcg cccacttgaa 22680
ctcgggcacc accatgcggg gcagtggttc ctcggggaaa ttctcgctcc acagggtgcg 22740
ggtcagctgc agcgcgctca ggaggtcggg agccgagatc ttgaagtcgc agttggggcc 22800
ggaaccctgc gcgcgcgagt tgcggtacac ggggttgcag cactggaaca ccagcagggc 22860
cggattattc acgctggcca gcaggctctc gtcgctgatc atgtcgctgt ccagatcctc 22920
cgcgttgctc agggcgaatg gggtcatctt gcagacctgc ctgcccagga aaggcgggag 22980
cccaggcttg ccgttgcagt cgcagcgcag gggcattagc aggtgcccac ggcccgactg 23040
cgcctgcggg tacaacgcgc gcatgaaggc ttcgatctgc ctaaaagcca cctgggtctt 23100
ggctccctcc gaaaagaaca tcccacagga cttgctggag aactgattcg cgggacagct 23160
ggcatcgtgc aggcagcagc gcgcgtcagt gttggcgatc tgcaccacgt tgcgacccca 23220
ccggtttttc actatcttgg ccttggaagc ctgctccttt agcgcgcgct ggccgttctc 23280
gctggtcaca tccatctcta tcacctgttc cttgttgatc atgtttgtcc cgtgcagaca 23340
ctttaggtcg ccctccgtct gggtgcagcg gtgctcccac agcgcgcaac cggtgggctc 23400
ccaattcttg tgggtcaccc ccgcgtaggc ctgcaggtag gcctgcagga agcgccccat 23460
catggtcata aaggtcttct ggctcgtaaa ggtcagctgc aggccgcgat gctcttcgtt 23520
cagccaggtc ttgcagatgg cggccagcgc ctcggtctgc tcgggcagca tcttaaaatt 23580
tgtcttcagg tcgttatcca cgtggtactt gtccatcatg gcacgcgccg cctccatgcc 23640
cttctcccag gcggacacca tgggcaggct tagggggttt atcacttcca gcggcgagga 23700
caccgtactt tcgatttctt cttcctcccc ctcttcccgg cgcgcgcccc cgctgttgcg 23760
cgctcttacc gcctgcacca aggggtcgtc ttcaggcaag cgccgcaccg agcgcttgcc 23820
gcccttgacc tgcttgatca gtaccggcgg gttgctgaag cccaccatag tcagcgccgc 23880
ctgctcttct tcgtcttcgc tgtctaccac tatttctggg gaggggcttc tccgctctgc 23940
ggcaaaggcg gcggatcgct tctttttttt cttgggagcc gccgcgatgg agtccgccac 24000
ggcgaccgag gtcgagggcg tggggctggg ggtgcgcggc accagggcct cgtcgccctc 24060
ggactcttcc tctgactcca ggcggcggcg gagtcgcttc tttgggggcg cgcgcgtcag 24120
cggcggcgga gacggggacg gggacgggga cgggacgccc tccacagggg gcggtcttcg 24180
cgcagacccg cggccgcgct cgggggtctt ctcgcgctgg tcttggtccc gactggccat 24240
tgtatcctcc tcctcctagg cagagagaca taaggagtct atcatgcaag tcgagaagga 24300
ggagagctta accaccccct ctgagaccgc cgtcgccgtc gcccccgcta ccgccgacgc 24360
gcccgccaca ccgagcgaca cccccgcgga cccccccgcc gacgcacccc tgttcgagga 24420
agcggccgtg gagcaggacc cgggctttgt ctcggcagag gaggatttgc aagaggagga 24480
ggataaggag gagaagccct cagtgccaaa agatcataaa gagcaagacg agcacgacgc 24540
agacgcacac cagggtgaag tcgggcgggg ggacggaggg catggcggcg ccgactacct 24600
agacgaagga aacgacgtgc tcttgaagca cctgcatcgt cagtgcgcca tcgtctgcga 24660
cgctctgcag gagcgcagcg aggtgcccct cagcgtggcg gaggtcagcc gcgcctacga 24720
gctcagcctc ttttcccccc gggtgccccc ccgccgccgc gaaaacggca catgcgagcc 24780
caacccgcgc ctcaacttct accccgcctt tgtggtgccc gaggtcctgg ccacctatca 24840
catcttcttt caaaattgca agatccccat ctcgtgccgc gccaaccgta gccgcgccga 24900
taagatgctg gccctgcgcc agggcgacca catacctgat atcgccgctt tggaagatgt 24960
accaaagatc ttcgagggtc tgggtcgcaa cgaaaagcgg gcagcaaact ctctgcaaca 25020
ggaaaacagc gaaaatgaga gtcacaccgg ggtgctggtg gagctcgagg gcgacaacgc 25080
ccgcctggcg gtgctcaagc gcagcatcga ggtcacccac tttgcctacc ccgcgctcaa 25140
cctgcccccc aaagtcatga acgcggtcat ggacgggctg atcatgcgcc gcggccagcc 25200
ccttgctcca gatgcaaact tgcatgagga gaccgaggac ggccagcccg tggtcagcga 25260
cgagcagctg gcgcgctggc tggaaaccgc ggaccccgcc gaactggagg agcggcgcaa 25320
gatgatgatg gccgcggtgc tggtcaccgt agagctggag tgtctgcagc gcttcttcgg 25380
tgaccccgag atgcagagaa aggtcgagga gaccctacac tacaccttcc gccagggcta 25440
cgtgcgccag gcttgcaaga tctccaacgt ggagctcagc aacctggtgt cctacctggg 25500
catcttgcat gagaaccgcc ttgggcagag cgtgctgcac tccaccctgc gcggggaagc 25560
gcgccgcgac tacgtgcgcg actgcgttta ccttttcctc tgctacacct ggcagacggc 25620
catgggggtc tggcagcagt gcctggagga gcgcaacctc aaggagctgg agaagctcct 25680
gcagcgcgcg ctcaaagacc tctggacggg cttcaacgag cgctcggtgg ccgccgcgct 25740
ggccgacctc atcttccccg agcgcctgct caaaactctc cagcaggggc tgcccgactt 25800
caccagccaa agcatgttgc aaaactttag gaactttatc ctggagcgtt ctggcatcct 25860
acccgccacc tgctgcgccc tgcccagtga ctttgttccc ctcgtgtacc gcgagtgccc 25920
cccgccgctg tggggccact gctacctgtt ccaactggcc aactacctgt cctaccacgc 25980
ggacctcatg gaggactcca gcggcgaggg gctcatggag tgccactgcc gctgcaacct 26040
ctgcacgccc caccgctccc tggtctgcaa cacccaactg ctcagcgaga gtcagattat 26100
cggtaccttc gagctacagg gtccgtcctc ctcagacgag aagtccgcgg ctccggggct 26160
aaaactcact ccggggctgt ggacttccgc ctacctgcgc aaatttgtac ctgaagacta 26220
ccacgcccac gagatcaggt tttacgagga ccaatcccgc ccgcccaagg cggagctgac 26280
cgcctgcgtc atcacccagg gcgagatcct aggccaattg caagccatcc aaaaagcccg 26340
ccaagagttt ttgctgagaa agggtcgggg ggtgtatctg gacccccagt cgggtgagga 26400
gctcaacccg gttcccccgc tgccgccgcc gcgggacctt gcttcccagg ataagcatcg 26460
ccatggctcc cagaaagaag cagcagcggc cgccactgcc gccaccccac atgctggagg 26520
aagaggagta ctgggacagt caggcagagg aggtttcgga cgaggaggag ccggagacgg 26580
agatggaaga gtgggaggag gacagcttag acgaggaggc ttccgaagcc gaagaggcag 26640
gcgcaacacc gtcaccctcg gccgcagccc cctcgcaggc gcccccgaag tccgctccca 26700
gcatcagcag caacagcagc gctataacct ccgctcctcc accgccgcga cccacggccg 26760
accgcagacc caaccgtaga tgggacacca ccggaaccgg ggccggtaag tcctccggga 26820
aaggcaagca agcgcagcgc caaggctacc gctcgtggcg cgctcacaag aacgccatag 26880
tcgcttgctt gcaagactgc ggggggaaca tctccttcgc ccgccgcttc ctgctcttcc 26940
accacggtgt ggccttcccc cgtaacgtcc tgcattacta ccgtcatctc tacagcccct 27000
actgcggcgg cagtgagcca gaggcggccg gcggcagcgg cgcccgtttc ggtgcctagg 27060
aagacccagg gcaagacttc agccaagaaa ctcgcggcgg ccgcggcgaa cgcggtcgcg 27120
ggggccctgc gcctgacggt gaacgaaccc ctgtcgaccc gcgaactgag gaaccgaatc 27180
ttccccactc tctatgccat cttccagcag agcagagggc aggatcagga actgaaagta 27240
aaaaacaggt ctctgcgctc cctcacccgc agctgtctgt atcacaagag cgaagaccag 27300
cttcggcgca cgctggagga cgctgaggca ctcttcagca aatactgcgc gctcactctt 27360
aaggactagc tccgcgccct tctcgaattt aggcgggaac gcctacgtca tcgcagcgcc 27420
gccgtcatga gcaaggacat tcccacgcca tacatgtgga gctatcagcc gcagatggga 27480
ctcgcggcgg gcgcctccca agactactcc acccgcatga actggctcag tgccggccca 27540
cacatgatct cacaggttaa tgacatccgc acccatcgaa accaaatatt ggtggagcag 27600
gcggcaatta ccaccacgcc ccgcaataat cccaacccca gggagtggcc cgcgtccctg 27660
gtgtatcagg aaattcccgg ccccaccacc gtactacttc cgcgtgattc ccaggccgaa 27720
gtccaaatga ctaactcagg ggcacagctc gcgggcggct gtcgtcacag ggtgcggcct 27780
cctcgccagg gtataactca cctggagatc cgaggcagag gtattcagct caacgacgag 27840
tcggtgagct cctcgctcgg tctcagacct gacgggacct tccagatagc cggagccggc 27900
cgatcttcct tcacgccccg ccaggcgtac ctgactctgc agagctcgtc ctcggcgccg 27960
cgctcgggcg gcatcgggac tctccagttc gtgcaggagt ttgtgccctc ggtctacttc 28020
aaccccttct cgggctctcc cggtcgctac ccggaccagt tcatcccgaa ctttgacgcc 28080
gcgagggact cggtggacgg ctacgactga atgtcgggtg gacccggtgc agagcaactt 28140
cgcctgaagc accttgacca ctgccgccgc cctcagtgct ttgcccgctg tcagaccggt 28200
gagttccagt acttttccct gcccgactcg cacccggacg gcccggcaca cggggtgcgc 28260
tttttcatcc cgagtcaggt ccgctctacc ctaatcaggg agtttacagc ccgtccccta 28320
ctggcggagt tggaaaaggg gccttctatc ctaaccattg cctgcatctg ctctaaccct 28380
ggattacacc aagatctttg ctgtcatttg tgtgctgagt ataataaagg ctgagatcag 28440
aatctactcg ggctcctgtc gccatcctgt caacgccacc gtccaagccc ggcccgatca 28500
gcccgaggtg aacctcacct gcggtctgca ccggcgcctg aggaaatacc tagcttggta 28560
ctacaacagc actccctttg tggtttacaa cagctttgac caggacgggg tctcactgag 28620
ggataacctc tcgaacctga gctactccat caggaagaac aacaccctcg agctacttcc 28680
tccttacctg cccgggactt accagtgtgt caccggtccc tgcacccaca cccacctgtt 28740
gatcgtaaac gactctcttc cgagaacaga cctcaataac tcctctccgc agttccccag 28800
aacaggaggt gagctcagga aaccccgggt aaagaagggt ggacaagagt taacacttgt 28860
ggggtttctg gtgtatgtga cgctggtggt ggctcttttg attaaggctt ttccttccat 28920
gtctgaactc tccctcttct tttatgaaca actcgactag tgctaacgag accctaccca 28980
acgaatcggg attgaatatc ggtaaccagg ttgcagtttc acttttgatt acctttatag 29040
tcctcttcct gctagtgctg tcgcttctgt gcctgcggat cgggggctgc tgcatccacg 29100
tttatatctg gtgctggctg tttagaaggt tcggagacca ccgcaggtag aataatgctg 29160
cttaccctct ttgtcctggc gctggctgcc agctgccaag ccttttccga ggctgacttc 29220
atagagcccc agtgcaatat cacttataaa tctgaacgtg ccatctgtac tatcctaatc 29280
aaatgtgtta ctcaacacga taaggtaact gttaaataca aagatcaatt aaaaaaagac 29340
gcactttaca gcagctggca accaggagat gaacaaaaat acaatgtaac cgtcttccag 29400
ggcaaactct ccaaaactta caattacact ttcccatttg agcagatgtg tgactttgtc 29460
atgtacatgg aaaagcagta caagctgtgg cctccaactc cccagggctg tgtggaaaat 29520
ccaggctctt tctgtatgat ctctctctgt gtaactgtgc tggcactaat actcacgctt 29580
ctgtatatca gatttaaatc aaggcaaagc tttattgatg aaaagaaaat gccttaatcg 29640
ctttcacgct tgattgctaa caccgggttt ttatccgcag aatgattgga atcaccctac 29700
taatcacctc cctccttgcg attgcccatg ggttggaacg aatcgaagtc cctgtggggg 29760
ccaatgttac cctggtgggg cctgtcggca atgctacatt aatgtgggaa aaatatacta 29820
aaaatcaatg ggtctcttac tgcactaaca aaaacagcca caagcccaga gccatctgcg 29880
atgggcaaaa tttaaccttg attgatgttc aattgctgga tgcgggctac tattatgggc 29940
agctgggtac aatgattaat tactggagac cccacagaga ttacatgctt cacgtagtaa 30000
agggtcccat tagcagccca accaccacct ctaccacccc cactaccacc actactccca 30060
ccaccagcac tgccgcccag cctcctcata gcagaacaac cacttttatc aattccaagt 30120
cccactcccc ccacattgcc ggcgggccct ccgcctcaga ctccgagacc accgagatct 30180
gcttctgcaa atgctctgac gccattgccc aggatttgga agatcacgag gaagatgagc 30240
atgactacgc agatgcatgc caggcatcag agtcagaagc gctgccggtg gccctaaaac 30300
agtatgcaga cccccacacc acccccgacc ttcctccacc ttcccagaag ccaagtttcc 30360
tgggggaaaa tgaaactctg cctctctcca tactagctct gacatctgtt gctattttgg 30420
ccgctctgct ggtgcttcta tgctctatat gctacctgat ctgctgcaga aagaaaaaat 30480
ctcacggcca tgctcaccag cccctcatgc acttccctta ccctccagag ctgggcgacc 30540
acaaacttta agtctgcagt agctatctgc ccatcccttg tcagtcgaca gcgatgagcc 30600
ccactaatct aacagcctct ggacttacaa cattgtctct taatgagacc accgctcctc 30660
aagacctgta cgatggtgtc tccgcgctgg ttaaccagtg ggatcacctg ggcatatggt 30720
ggctcctcat aggagcagtg accctgtgcc taatcctggt ctggatcatc tgctgcatca 30780
aaagcagaag acccaggcgg cggcccatct acaggccctt cgtcatcaca cctgaagata 30840
atgatgatga tgacaccacc tccaggctgc agagcctaaa gcagctactc ttctctttta 30900
cagcatggta aattgaatca tgccccgcat tttcatctac ttgcttctcc ttccactttt 30960
tctgggctcc tctacattgg ccgctgtgtc ccacatcgag gtagactgcc tcacgccctt 31020
cacagtctac ctgcttttcg gctttgtcat ctgcaccttt gtctgcagcg ttatcactgt 31080
agtgatctgc ttcatacagt gcatcgacta catctgtgtg cgggtggcct actttagaca 31140
ccacccccag tatcgcaaca gggacatagc ggctctccta agacttgttt aaatcatggc 31200
caaattacct gtgattggtc ttctgattat ctgctgcgtc ctagccgcga ttgggactca 31260
acctaatacc accaccagcg ctcccagaaa gagacatgta tcctgcagct tcaagcgtcc 31320
ctggaatata ccccaatgct ttactgatga acctgaaatc tctttggctt ggtacttcag 31380
cgtcaccgcc cttctcatct tctgcagtac ggttattgct cttgccatct acccttccct 31440
taacctgggc tggaatgctg tcaactctat ggaatatccc accttcccag aaccagacct 31500
gccagacctg gttgttctaa acgcgtttcc tcctcctcca gttcaaaatc agtttcgccc 31560
tccgtcccct acgcccactg aggtcagcta ctttaatcta acaggcggag atgactgaaa 31620
acctagacct agaaatggac ggtctctgca gcgagcaacg cacactagag aggcgccggc 31680
aaaaagcaga gctcgagcgt cttaaacaag agctccaaga cgccgtggcc atacaccagt 31740
gcaaaaaagg gctcttctgt ctggtaaaac aggccacgct cacctatgaa aaaacaggtg 31800
acacccaccg cctaggatac aagctgccca cacagcgcca aaagtttgcc cttatgatag 31860
gtgaacaacc catcaccgtc acccagcact ccgtggagac agaaggctgc attcatgctc 31920
cctgcagggg cgctgactgc ctctacacct tgatcaaaac cctctgcggt ctcagagacc 31980
ttatcccttt caattgatca taactgtaat caataaaaaa tcacttactt gaaatctgat 32040
agcaagcctc tgtccaattt tttcagcaac acttccttcc cctcttccca actctggtac 32100
tctaggcgcc tcctagctgc aaacttcctc cacagtctga agggaatgtc agattcctcc 32160
tcctcctgtc cctccgcacc cacaatcttc atgttgttgc agatgaaacg cgcgagatcg 32220
tctgacgaga ccttcaaccc cgtgtacccc tacgataccg agatcgctcc gacttctgtc 32280
cctttcctta cccctccctt tgtgtcaccc gcaggaatgc aagaaaatcc agctggggtg 32340
ctgtccctgc acctgtcaga gccccttacc acccacaatg gggccctgac tctaaaaatg 32400
gggggcggcc tgaccctgga caaggaaggg aatctcactt cccaaaacat caccagtgtc 32460
gatccccctc tcaaaaaaag caagaacaac atcagccttc agaccgccgc acccctcgcc 32520
gtcagctccg gggccctaac cctttttgcc actccccccc tagcggtcag tggcgacaac 32580
cttactgtgc agtctcaggc ccctcttact ttggaagact caaaactaac tctggccacc 32640
aaaggacccc taactgtgtc cgaaggcaaa cttgtcctag aaacagaggc tcccctgcat 32700
gcaagtgaca gcagtagcct gggccttagc gtcacggccc cacttagcat taacaatgac 32760
agcctaggac tagatctgca ggcacccatt gtctctcaaa atggaaaact ggctctaaat 32820
atagcaggcc ccctagctgt agccgatagc attaatgctt tgacagtagg cactggcaaa 32880
ggtattggac taaatgaaac cagcactcac ttgcaagcaa aattggttgc ccccctaggc 32940
tttgatacca atggcaatat taagctaagc gttgcaggag gcatgaggct aaacaatgac 33000
acactgatac tagatgtaaa ctacccattt gaagctcaag gtcaactaag cctaagagtg 33060
ggcacaggtc cactgtatgt agattctagc agtcataatc taaccattag atgccttagg 33120
ggattgtata taacatcatc taacaaccaa aacggtctag aggccaacat taaactaaca 33180
aaaggccttg tgtatgaagg aaatgccata gcagttaatg ttggtcaagg attgcaatac 33240
agcactactg ccacatcgga aggtgtgtat cctatacagt ctaagatagg tttgggaatg 33300
gaatatgata ccaacggagc catgatggca aaactaggct ccggtctaag ctttgataat 33360
tcaggagcca ttgtggtggg aaacaaaaat gatgacaaac ttaccctatg gaccacacct 33420
gacccgtctc ctaactgtag aatttattct gaaaaagata ctaaactaac cttggtgctg 33480
actaagtgtg gcagtcaaat cctaggcaca gtatctgccc ttgctgtcag aggcagcctt 33540
gcgcccatca ctaacgcatc cagcatagtc caaatatttc tacgatttga tgaaaatgga 33600
ctattgatga gcaactcatc gctagacggt gattactgga attacagaaa tggggactcc 33660
actaatggca caccatatac aaatgcagta ggctttatgc ctaatctagc tgcctatcct 33720
aaaggtcagg ctacaactgc aaaaagcagt attgtaagcc aggtatacat ggatggtgat 33780
actactaaac ctataacact aaaaataaac tttaatggca ttgatgaaac aacagaaaat 33840
acccctgtta gtaaatattc catgacattc tcatggagct ggcccaccgc aagctacata 33900
ggccacactt ttgcaacaaa ctcttttact ttctcctaca tcgcccaaga ataaagaaag 33960
cacagagatg cttgtttttg atttcaaaat tgtgtgcttt tatttatttt caagcttaca 34020
gtatttccag tagtcattca aatagagctt aatgaaactg catgagaacc cttccacata 34080
gcttaaatta tcaccagtgc aaatggagaa aaaatcaaca taccttttta tccagatatc 34140
atagaactct agtggtcagt tttcccccac cctcccagct cacagaatac acagtccttt 34200
ccccccggct ggctttaaac aacactatct cattggtaac agacatattc ttaggtgtaa 34260
taatccacac ggtctcttgg cgggccaaac gctggtcagt gatgttaata aactccccag 34320
gcagctcttt caagttcacg tcgctgtcca actgctgaag cgctcgcggc tccgactgcg 34380
cctctagcgg aggcaacggc aacacccgat ccttgatcta taaaggagta gagtcataat 34440
cccccataag aatagggcgg tgatgctgca acaaggcgcg cagcaactcc tgccgccgcc 34500
tttccgtacg acaggaatgc aacggggtgg tggtctcctc cgcgataatc cgcaccgctc 34560
gcaacatcag cgtcctcgtc ctccgggcac agcagcgcat cctgatctca ctgagatcgg 34620
cgcagtaagt gcagcacaac accaagatgt tatttaagat cccacagtgc aaagcactgt 34680
acccaaagct catggcggga aggacagccc ccacgtgacc atcataccag atcctcaggt 34740
aaatcaaatg acgacctctc atgaacacgc tggacatgta catcacctcc ttaggcatgt 34800
gctgattcac cacctctcga taccacaggc atcgctgatt aattaaagac ccctcgagca 34860
ccatcctgaa ccaggaagcc agcacctgac cccccgccag gcactgcagg gaccccggtg 34920
aatcgcagtg gcagtgaaga ctccagcgct cgtagccgtg aaccatagag ctggtcatta 34980
tatccacatt ggcacaacac agacacactt tcatacactt tttcatgatt agcagctcct 35040
ctctagtcag gaccatatcc caaggaatca cccactcttg aatcaaggta aatcccacac 35100
agcagggcag gcctctcaca taactcacgt tatgcatagt gagcgtgtcg caatctggaa 35160
ataccggatg atcttccatc accgaagccc gggtctccgt ctcaaaggga ggtaaacggt 35220
ccctcgtgta gggacagtgg cgggataatc gagatcgtgt tgaacgtaga gtcatgccaa 35280
agggaacagc ggacgtactc atatttcctc cagcagaacc aagtgcgcgc gtggcagcta 35340
tccttgcgtc ttctgtctcg ccgcctgccc cgctcggtgt agtagttgta atacagccac 35400
tccctcagac cgtcaaggcg ctccctggcg tccggatcta taacaacacc atcctgcagc 35460
gccgccctga tgacatccac caccgtagag tatgccaagc ccagccagga aatgcactca 35520
ctttgacagc gagagatagg aggagcggga agagatggaa gaaccatgat agtaaaagaa 35580
cttttattcc aatcgatcct ctacaatgtc aaagtgtaga tctatcagat ggcactggtc 35640
tcctccgctg agtcgatcaa aaataacagc taaaccacaa acaacacgat tggtcaaatg 35700
ctgcacaagg gcttgcagca taaaatcgcc tcgaaagtcc accgcaagca taacatcaaa 35760
gccaccgccc ctatcatgat ctatgataaa aaccccacag ctatccacca gacccatata 35820
gttttcatct ctccatcgtg aaaaaatatt tacaagctcc tcctttaaat cacctccaac 35880
caattcaaaa agttgagcca gaccgccctc caccttcatt ttcagcatgc gcatcatgat 35940
tgcaaaaatt caggctcctc agacacctgt ataagattga gaagcggaac gttaacatca 36000
atgtttcgct cgcgaagatc gcgcctcagt gcaagcatga tataatccca caggtcggag 36060
cggatcagcg aggacatctc cccgccagga accaactcaa cggagcctat gctgattata 36120
atacgcatat tcggggctat gctaaccagc acggccccca aataggcgta ctgcataggc 36180
ggcgacaaaa agtgaacagt ttgggttaaa aaatcaggca aacactcgcg caaaaaagca 36240
agaacatcat aaccatgctc atgcaaatag atgcaagtaa gctcaggaac gaccacagaa 36300
aaatgcacaa tttttctctc aaacatgact gcgagccctg caaaaataaa aaagaaacat 36360
tacacaagag tagcctgtct tacaatggga tagactactc taaccaacat aagacgggcc 36420
acaacatcgc ccgcgtggcc ataaaaaaaa ttatccgtgt gattaaaaag aagcacagat 36480
agctggccag tcatatccgg agtcatcacg tgcgaacccg tgtagacccc cgggttggac 36540
acatcggcca aacaaagaaa gcggccaatg tatcccggag gaatgataac actaagacga 36600
agatacaaca gaataacccc atggggggga ataacaaagt tagtaggtga ataaaaacga 36660
taaacacccg aaactccctc ctgcgtaggc aaaatagcgc cctccccttc caaaacaaca 36720
tatagcgctt ccacagcagc catgacaaaa gactcaaaac actcaaaaga ctcagtctta 36780
ccaggaaaat aaaagcactc tcacagcacc agcactaatc agagtgtgaa aaaggccaag 36840
tgccgaacga gtatatatag gaattaaaaa tgacgtaaat gtgtaaaggt cagaaaacgc 36900
ccagaaaaat acacagacca acgcccgaaa cgaaaacccg cgaaaaaata cccagaagtt 36960
cctcaacaac cgccacttcc gctttcccac gagacgtcac ttcctcaaaa atagcaaact 37020
acatttccca catatacaaa accaaaaccc ctccccttgt caccgcccac aacttacatc 37080
atcacaaacg tcaaagccta cgtcacccgc cccgcccacc tcattatcat attggccaca 37140
atccaaaata aggtatatta ttgatgatg 37169
<210> 4
<211> 37169
<212> DNA
<213> Great Ape Adenovirus
<400> 4
catcatcaat aatatacctt attttggatt gtggccaata tgataatgag gtgggcgggg 60
cgggtgacgt aggacgcgcg agtagggttg ggaggtgtgc ggaagtgtgg catttgcaag 120
tgggaggagc tcacatgtaa gcttccgtcg cggaaaatgt gacgttttta atgagcgccg 180
cctacctccg gaagtgccaa ttttcgcgcg cttttcaccg gatatcgtag taattttggg 240
cgggaccatg taagatttgg ccattttcgc gcgaaaagtg aaacggggaa gtgaaaactg 300
aataataggg cgttagtcat agcgcgtaat atttaccgag ggccgaggga ctttgaccga 360
ttacgtggag gactcgccca ggtgtttttt acgtgaattt ccgcgttccg ggtcaaagtc 420
tccgttttta ttgtcaccgt catctgacgc ggagggtatt taaacccgct gcgctcctaa 480
agaggccact cttgagtgcc agcgagaaga gttttctcct ccgctccgtt tcggcgatcg 540
aaaaatgaga cacttagcct gcactccggg tcttttgtcc ggccgggcgg cgtccgagct 600
tttggacgct ttgctcaatg aggttctgag cgatgatttt ccgtctacta cccactttag 660
cccacctact cttcacgaac tgtacgatct ggatgtactg gtggatgtga acgatcccaa 720
cgaggaggcg gtttctacgt tttttcccga gtctgcgctt ttggccgccc aggagggatt 780
tgacctacac actccgccgc tgcctatttt agagtctccg ctgccggagc ccagtggtat 840
accttatatg cctgaactgc ttcccgaagt ggtagacctg acctgccacg agccgggctt 900
tccgcccagc gacgatgagg gtgagccttt tgctttagac tatgctgaga tacctgggct 960
cggttgcagg tcttgtgcat atcatcagag ggttaccgga gaccccgagg ttaagtgttc 1020
gctgtgctat atgaggctga cctcttcctt tatctacagt aagttttttg tgtaggtggg 1080
ctttttgggt aggtgggttt tgtggcagga caggtgtaaa tgttgcttgt gttttttgta 1140
cctgcaggtc cggtgtccga gccagacccg gagcccgacc gcgatcccga gccggatccc 1200
gagcctcctc gcagggcaag gaaattacct tccattttgt gcaagcctaa gacacctgtg 1260
aggaccagcg aggcggacag cactgactct ggcacttcta cctctcctcc tgaaattcac 1320
ccagtggttc ctttgggtat acataaacct gttgctatta gagtttgcgg gcgacgccct 1380
gcagtagagt gcattgagga cttgcttaac gatcccgagg gacctttgga cttgagcatt 1440
aaacgcccta ggcaataaac cccacctaag taataaaccc cacctaagta ataaacttta 1500
ccgcccttgg ttattgagat gacgcccaat gtttgctttt gaatgacttc atgtgtataa 1560
taaaagtgag tgtggtcata ggtctcttgt ttgtctgggc ggggcttaag ggtatataag 1620
tttctcgggg ctaaacttgg ttacacttga ccccaatgga ggcgtggggg tgcttggagg 1680
agtttgcgga cgtgcgccgt ttgctggacg agagctctag caatacctat agtatttgga 1740
ggtatctgtg gggctctact caggccaagt tggtctccag aattaagcag gattacaagt 1800
gcgattttga agagcttttt agttcctgtg gtgagctttt gcaatccttg aatctgggcc 1860
accaggctat cttccaggaa aaggttctct cgactttgga tttttccact cccgggcgca 1920
ccgccgcttg tgtggctttt gtgtcttttg tgcaagataa atggagcggg gagacccacc 1980
tgagtcacgg ctacgtgctg gatttcatgg cgatggctct ttggagggct tacaacaaat 2040
ggaagattca gaaggaactg tacggttccg ccctacgtcg tccacttctg cagcggcagg 2100
ggctgatgtt tcccgaccat cgccagcatc agaatctgga agacgagtcg gaggagcgag 2160
cggagaagat cagcttgaga gccggcctgg accctcctca ggaggaatga atctcccgca 2220
ggtggttgac ctgtttcccg aactgagacg ggtcctgact atcagggaag atggtcagtt 2280
tgtgaagaag ctgaagaggg atcggggtga gggagatgat gaggcggcta gcaatttagc 2340
ttttagtctg ataacccgcc accgaccgga atgtattacc tatcagcaga ttaaggagag 2400
ttgtgccaac gagctggatc ttttgggtca gaagtatagc atagaacagc ttaccactta 2460
ctggcttcag cccggggatg attgggaaga ggcgatcagg gtgtatgcaa aggtggccct 2520
gcggcccgat tgcaagtata agattactaa gttggttaat attagaaact gctgctatat 2580
ttctgggaac ggggccgaag tggagataga tactgaggac agggtggcta ttaggtgttg 2640
catgataaac atgtggcccg ggatactggg gatggatggg gtgatattta tgaatgtaag 2700
gttcacgggc cccaacttta atggtacggt gttcatgggc aacaccaact tgctcctgca 2760
tggtgcgagt ttctatgggt ttaacaacac ctgtatagag gcctggaccg atgtaaaggt 2820
tcgaggttgt tccttttata gctgttggaa ggcggtggtg tgtcgcccta aaagcagggg 2880
ttctgtgaag aaatgcttgt ttgaaaggtg caccctaggt atcctttctg agggcaactc 2940
cagggtgcgc cataatgtgg cttcgaactg cggttgcttc atgcaagtga agggggtgag 3000
cgttatcaag cataactcgg tctgtggaaa ctgcgaggat cgcgcctctc agatgctgac 3060
ctgctttgat ggcaactgtc acctgttgaa gaccattcat ataagcagtc accccagaaa 3120
ggcctggccc gtgtttgagc ataacattct gacccgctgt tccttgcatc tgggggtcag 3180
gaggggtatg ttcctgcctt accagtgtaa cttcagccac actaaaatcc tgctggaacc 3240
cgagtgcatg actaaggtca gcctgaatgg tgtgtttgat gtgagtctga agatttggaa 3300
ggtgctgagg tatgatgaga ccaggaccag gtgccgaccc tgcgagtgcg gcggcaagca 3360
catgagaaat cagcctgtga tgttggatgt gaccgaggag cttaggcctg accatctggt 3420
gctggcctgc accagggccg agtttgggtc tagcgatgag gataccgatt gaggtgggta 3480
aggtgggcgt ggctagcagg gtgggcgtgt ataaattggg ggtctaaggg gtctctctgt 3540
ttgtcttgca acagccgccg ccatgagcga caccggcaac agctttgatg gaagcatctt 3600
tagcccctat ctgacagtgc gcatgcctca ctgggccgga gtgcgtcaga atgtgatggg 3660
ttccaacgtg gatggacgtc ccgttctgcc ttcaaattcg tctacgatgg cctacgcgac 3720
cgtgggagga actccgttgg acgccgcgac ctccgccgcc gcctccgccg ccgccgcgac 3780
cgcgcgcagc atggctacgg acctttacag ctctttggtg gcgagcagcg cggcctctcg 3840
cgcgtctgct cgggatgaga aactgactgc tctgctgctt aaactggaag acttgacccg 3900
ggagctgggt caactgaccc agcaggtctc cagcttgcgt gagagcagcc ttgcctcccc 3960
ctaatggccc ataatataaa taaaagccag tctgtttgga ttaagcaagt gtatgttctt 4020
tatttaactc tccgcgcgcg gtaagcccgg gaccagcggt ctcggtcgtt tagggtgcgg 4080
tggattcttt ccaacacgtg gtacaggtgg ctctggatgt ttagatacat gggcatgagt 4140
ccatccctgg ggtggaggta gcaccactgc agagcttcgt gctcgggggt ggtgttgtat 4200
atgatccagt cgtagcagga gcgctgggcg tggtgctgaa aaatgtcctt aagcaagagg 4260
cttatagcta gggggaggcc cttggtgtaa gtgtttacaa atctgctcag ctgggagggg 4320
tgcatccggg gggatatgat gtgcatcttg gactggattt ttaggttggc tatgttccca 4380
cccagatccc ttctgggatt catgttgtgc aggaccacca gcacggtata tccagtgcac 4440
ttgggaaatt tatcgtggag cttagacggg aatgcatgga agaacttgga gacgcccttg 4500
tggcctccca gattttccat acattcgtcc atgatgatgg caatgggccc gtgggaagct 4560
gcctgagcaa aaacgtttct gggatcgctc acatcgtagt tatgttccag ggtgaggtca 4620
tcataggaca tctttacgaa tcgggggcgg agggtcccgg actgggggat gatggtaccc 4680
tcgggccccg gggcgtagtt cccctcacag atctgcatct cccaggcttt catttcagag 4740
ggagggatca tatccacctg cggggcgatg aaaaagacag tttctggcgc aggggagatt 4800
aactgggatg agagcaggtt tctgagcagc tgtgactttc cacagccggt gggcccatat 4860
atcacgccta tcaccggctg cagctggtag ttaagagagc tgcagctgcc gtcctcccgg 4920
agcagggggg ccacctcgtt gagcatatcc ctgacgtgga tgttttccct gaccagttcc 4980
gccagaaggc gctcgccgcc cagcgaaagc agctcttgca aggaagcaaa atttttcagc 5040
ggtttcaggc catcggccgt gggcatgttt ttcagcgtct gggtcagcag ctccagcctg 5100
tcccagagct cggtgatgtg ctctacggca tctcgatcca gcagatctcc tcgtttcgcg 5160
ggttggggcg gctttcgctg tagggcacca gccgatgggc gtccagcggg gccagagtca 5220
tgtccttcca tgggcgcaga gtcctcgtca gggtggtctg ggtcacggtg aaggggtgcg 5280
ctccgggttg ggcgctggcc agggtgcgct tgaggctggt tctgctggtg ctgaatcgct 5340
gccgctcttc gccctgcgcg tcggccaggt agcatttgac catggtctcg tagtcgagac 5400
cctcggcggc gtgccccttg gcgcggagct ttcccttgga ggtggcgccg cacgaggggc 5460
actgcaggct cttcagggcg tagagcttgg gagcgagaaa cacggactct ggggagtagg 5520
cgtccgcgcc gcaggccgag cagaccgtct cgcattccac cagccaagtg agttccgggc 5580
ggtcagggtc aaaaaccagg ctgcccccat gctttttgat gcgtttctta cctcggctct 5640
ccatgaggcg gtgtcccttc tcggtgacga agaggctgtc cgtgtccccg tagaccgatt 5700
tcaggggcct gtcttccagc ggagtgcctc tgtcctcctc gtagagaaac tctgaccact 5760
ctgagacaaa ggcccgtgtc caggccagga cgaaggaggc cacgtgggag gggtagcggt 5820
cgttgtccac tagcgggtcc accttctcca gggtgtgcag gcacatgtcc ccctcctccg 5880
cgtccagaaa agtgattggc ttgtaggtgt aggacacgtg accgggggtt cccgacgggg 5940
gggtataaaa gggggtgggt gccctttcat cttcactctc ttccgcatcg ctgtctgcga 6000
gagccagctg ctggggtaag tattcccttt cgaaggcggg catgacctca gcgctcaggt 6060
tgtcagtttc taaaaatgag gaagatttga tgttcacctg tccggaggtg atacctttga 6120
gggtacctgg gtctatctgg tcagaaaaca ctattttttt gttatcaagc ttggtggcga 6180
acgacccgta gagggcgttg gagagcagct tggcgatgga gcgcagggtc tggtttttgt 6240
cgcggtcggc tcgctccttg gccgcgatgt tgagttgcac gtactcgcgg gccacgcact 6300
tccactcggg gaagacggtg gtgcgctcgt ctgggatcag gcgcaccctc cagccgcggt 6360
tgtgcagggt gaccatgtcg acgctggtgg cgacctcacc gcgcaggcgc tcgttggtcc 6420
agcagaggcg gccgcccttg cgcgagcaga aggggggtag ggggtccagc tggtcctcgt 6480
tcggggggtc cgcgtcgatg gtaaagaccc cggggagcag acgcgggtca aagtagtcga 6540
tcttgcaagc ttgcatgtcc agagcccgct gccattcgcg ggcggcgagc gcgcgctcgt 6600
aggggttgag gggcgggccc cagggcatgg ggtgggtgag cgcagaggcg tacatgccgc 6660
agatgtcata cacgtacagg ggttccctga ggatgccgag gtaggtgggg tagcagcgcc 6720
ccccgcggat gctggcgcgc acgtagtcat agagttcgtg ggagggggcc agcatgttgg 6780
gcccgaggtt ggtgcgctgg gggcgctcgg cgcggaagac gatctgcctg aagatggcgt 6840
gggagttgga ggagatggtg ggccgctgga agacgttgaa gcttgcttct tgcaagccca 6900
cggagtccct gacgaaggag gcgtaggact cgcgcagctt gtgcaccagc tcggcggtga 6960
cctggacgtc gagcgcacag tagtcgaggg tctcacggat gatgtcatac ttatcctccc 7020
ccttcttttt ccacagctcg cggttgagga cgaactcttc gcggtctttc cagtactctt 7080
ggaggggaaa cccgtccgtg tccgaacggt aagagcctag catgtagaac tggttgacgg 7140
cctggtaggg gcagcagccc ttctccacgg gcagcgcgta ggcctgcgcc gccttgcgga 7200
gggaggtgtg ggtgagggcg aaagtgtccc tgaccatgac tttgaggtat tgatgtctga 7260
agtctgtgtc atcgcagccg ccctgttccc acagggtgta gtccgtgcgc tttttggagc 7320
gcgggttggg cagggagaag gtgaggtcat tgaagaggat cttccccgct cgaggcatga 7380
agtttctggt gatgcgaaag ggccctggga ccgaggagcg gttgttgatg acctgggcgg 7440
ccaggacgat ctcgtcaaag ccgtttatgt tgtggcccac gatgtagagc tccaggaagc 7500
ggggctggcc cttgatggag gggagctttt taagttcctc gtaggtgagc tcctcgggcg 7560
attccaggcc gtgctcctcc agggcccagt cttgcaagtg agggttggcc gccaggaagg 7620
atcgccagag gtcgcgggcc atgagggtct gcaggcggtc gcggaaggtt ctgaactgtc 7680
gccccacggc catcttttcg ggggtgatgc aatagaaggt gagggggtct ttctcccagg 7740
ggtcccatct gagctctcgg gcgaggtcgc gtgcggcggc gaccagagcc tcgtcgcccc 7800
ccagtttcat gaccagcatg aagggcacga gctgcttgcc aaaggctccc atccaagtgt 7860
aggtctctac atcgtaggtg acaaagaggc gctccgtgcg aggatgagag ccgatcggga 7920
agaactggat ctcccgccac cagttggagg attggctgtt gatgtggtga aagtagaagt 7980
cccgtctgcg ggccgagcac tcgtgctggc ttttgtaaaa gcgaccgcag tactggcagc 8040
gctgcacggg ttgtatatct tgcacgaggt gaacctggcg acctctgacg aggaagcgca 8100
gcgggaatct aagtcccccg cctggggtcc cgtgtggctg gtggtcttct actttggttg 8160
tctggccgcc agcatctgtc tcctggaggg cgatggtgga acagaccacc acgccgcgag 8220
agccgcaggt ccagatctcg gcgctcggcg ggcggagttt gatgacgaca tcgcgcacat 8280
tggagctgtc catggtctcc agctcccgcg gcggcaggtc agccgggagt tcctggaggt 8340
ttacctcgca gagacgggtc aacgcacggg cagtgttaag atggtatctg atttcaaggg 8400
gcgtgttggc ggcggagtcg atggcttgca ggaggccgca gccccggggg gccacgatgg 8460
ttccccgtgg ggcgcgaggg gaggcggaag ctgggggtgt gttcagaagc ggtgacgcgg 8520
gcgggccccc ggaggtaggg ggggttccgg ccccacaggc atgggcggca ggggcacgtc 8580
ttcgccgcgc gcgggcaggg gctggtgctg gctccgaaga gcgcttgcgt gcgcgacgac 8640
gcgacggttg gtgtcctgta tctggcgcct ctgagtgaag accacgggtc ccgtgacctt 8700
gaacctgaaa gagagttcga cagaatcaat ctcggcatcg ttgacagcgg cctggcgcag 8760
gatctcctgc acgtcgcccg agttgtcctg gtaggcgatc tctgccatga actgctcgat 8820
ctcttcctcc tggagatctc ctcgtccggc gcgctccacg gtggccgcca ggtcgttgga 8880
gatgcgaccc atgagctgcg agaaggcgtt gagtccgccc tcgttccaga cccggctgta 8940
gaccacgccc ccctcggcgt cgcgggcgcg catgaccacc tgggccaggt tgagctccac 9000
gtgtcgcgtg aagacggcgt agttgcgcag gcgctggaaa aggtagttca gggtggtggc 9060
ggtgtgctcg gcgacaaaga agtacatgac ccagcgccgc aacgtggatt cattgatgtc 9120
ccccaaggcc tccaggcgct ccatggcctc gtagaagtcc acggcgaagt tgaaaaactg 9180
ggagttgcga gcggacacgg tcaactcctc ctccagaaga cggatgagct cggcgacagt 9240
gtcgcgcacc tcgcgctcga aggccacggg gggcgcttct tcctcttcca cctcttcttc 9300
catgattgct tcttcttctt cctcagccgg gacgggaggg ggcggcggcg ggggaggggc 9360
gcggcggcgg cggcggcgca ccgggaggcg gtcgatgaag cgctcgatca tctccccccg 9420
catgcggcgc atggtctcgg tgacggcgcg gccgttctcc cgggggcgca gctcgaagac 9480
gccgcctttc atctcgccgc ggggcgggcg gccgtgaggt agcgagacgg cgctgactat 9540
gcatcttaac aattgctgtg taggtacgcc gccaagggac ctgattgagt ccagatccac 9600
cggatccgaa aacctttgga ggaaagcgtc tatccagtcg cagtcgcaag gtaggctgag 9660
caccgtggcg ggcgggggcg ggtcgggaga gttcctggcg gagatgctgc tgatgatgta 9720
attaaagtag gcggtcttga gaaggcggat ggtggacagg agcaccatgt ctttgggtcc 9780
ggcctgttgg atgcggaggc ggtcggccat gccccaggcc tcgttctgac accggcgcag 9840
gtctttgtag tagtcttgca tgagtctttc caccggcacc tcttctcctt cctcttctcc 9900
atctcgccgg tggtttctcg cgccgcccat gcgcgtgacc ccaaagcccc tgagcggctg 9960
cagcagggcc aggtcggcga ccacgcgctc ggccaagatg gcctgctgta cctgagtgag 10020
ggtcctctcg aagtcatcca tgtccacgaa gcggtggtag gcgcccgtgt tgatggtgta 10080
ggtgcagttg gccatgacgg accagttgac ggtctggtgt cccggctgcg agagctccgt 10140
gtaccgcagg cgcgagaagg cgcgggaatc gaacacgtag tcgttgcaag tccgcaccag 10200
atactggtag cccaccagga agtgcggcgg aggttggcga tagaggggcc agcgctgggt 10260
ggcgggggcg ccgggcgcca ggtcttccag catgaggcgg tggtatccgt agatgtacct 10320
ggacatccag gtgatgccgg cggcggtggt ggtggcgcgc gcgtagtcgc ggacccggtt 10380
ccagatgttt cgcaggggcg agaagtgttc catggtcggc acgctctggc cggtgaggcg 10440
cgcgcagtcg ttgacgctct atacacacac aaaaacgaaa gcgtttacag ggctttcgtt 10500
ctgtagcctg gaggaaagta aatgggttgg gttgcggtgt gccccggttc gagaccaagc 10560
tgagctcggc cggctgaagc cgcagctaac gtggtattgg cagtcccgtc tcgacccagg 10620
ccctgtatcc tccaggatac ggtcgagagc ccttttgctt tcttggccaa gcgcccgtgg 10680
cgcgatctgg gatagatggt cgcgatgaga ggacaaaagc ggctcgcttc cgtagtctgg 10740
agaaacaatc gccagggttg cgttgcggcg taccccggtt cgagccccta tggcggcttg 10800
gatcggccgg aaccgcggct aacgtgggct gtggcagccc cgtcctcagg accccgccag 10860
ccgacttctc cagttacggg agcgagcccc ttttgttttt tattttttag atgcatcccg 10920
tgctgcggca gatgcgcccc tcgccccggc ccgatcagca gcagcaacag caggcatgca 10980
gacccccctc tcctctcccc gccccggtca ccacggccgc ggcggccgtg tccggcgcgg 11040
ggggtgcgct ggagtcagat gagccaccgc ggcggcgacc taggcagtat ctggacttgg 11100
aagagggcga gggactggcg cggctggggg cgagctcccc agagcgtcac ccgcgggtgc 11160
agttgaaaag ggacgcgcgc gaggcgtacc tgccgcggca aaacctgttt cgcgaccgcg 11220
ggggcgagga gcccgaggag atgcgagact gcaggttcca agcagggcgc gagctgcgcc 11280
gcggcttgga cagagagcgc ttgctgcgcg aggaggactt tgagcccgac acgcagacgg 11340
gcatcagccc cgcgcgcgcg cacgtggccg cggccgacct ggtgaccgcc tacgagcaga 11400
cggtgaacca ggagcgcaac ttccaaaaaa gcttcaacaa ccacgtgcgc acgctggtgg 11460
cgcgcgagga ggtgaccctg ggtctcatgc atctgtggga cctggtggag gcgatcgtgc 11520
agaaccccag cagcaagccc ctgaccgcgc agctgttcct ggtggtgcag cacagcaggg 11580
acaacgatgc cttcagggag gcgctgctga acatcaccga gccggagggg cgctggctcc 11640
tggacctgat aaacatcctg cagagcatag tggtgcagga gcgcagcctg agcctggccg 11700
agaaggtggc ggccattaac tattctatgc tgagcctggg caagttctac gcccgcaaga 11760
tctacaagac cccctacgtg cccatagaca aggaggtgaa gatagacagc ttctacatgc 11820
gcatggcgct aaaggtgctg accctgagcg acgacctggg agtgtaccgc aacgagcgca 11880
tccacaaggc cgtgagcgcc agccggcggc gcgagctgag cgaccgcgag ctgatgcaca 11940
gtctgcaacg cgcgctgacc ggcgcgggcg agggcgacag ggaggtcgag tcctacttcg 12000
acatgggggc cgacctgcac tggcagccga gccgccgcgc cctggaggcg gcgggggcgt 12060
atggcggccc cctggcggcc gatggcgagg aagaggagga ctatgagcta gaggagggcg 12120
agtacctgga ggactgacct ggctggtggt gttttggtat agatgcaaga tccgaacgtg 12180
gcggacccgg cggtccgggc ggcgctgcag agccagccgt ccggcattaa ctcctctgac 12240
gactgggccg cggccatggg tcgcatcatg gccctgaccg cgcgcaaccc cgaggccttc 12300
aggcagcagc ctcaggctaa ccggctggcg gccatcttgg aagcggtagt gcccgcgcgc 12360
tccaacccca cccacgagaa ggtgctggcc atagtcaacg cgctggcgga gagcagggcc 12420
atccgggcgg acgaggccgg actggtgtac gatgcgctgc tgcagcgggt ggcgcggtac 12480
aacagcggca acgtgcaaac caacctggac cgcctggtga cggacgtgcg cgaggccgtg 12540
gcgcagcgcg agcgcttgca tcaggacggt aacctgggct cgctggtggc gctaaacgcc 12600
ttcctcagca cccagccggc caacgtaccg cgggggcagg aggactacac caacttcttg 12660
agcgcgctgc ggctgatggt gaccgaggtc cctcagagcg aagtgtacca gtcggggccc 12720
gactacttct tccagaccag cagacagggc ttgcaaaccg tgaacctgag ccaggctttc 12780
aagaacctgc gggggctgtg gggagtgaag gcgcccaccg gcgaccgggc tacggtgtcc 12840
agcctgctaa cccccaactc gcgcctgctg ctgctgctga tcgcgccctt cacggacagc 12900
gggagcgtct cgcgggagac ctatctgggc cacctgctga cgctgtaccg cgaggccatc 12960
gggcaggcgc aggtggacga gcacaccttc caggagatca ccagcgtgag ccacgcgctg 13020
gggcaggagg acacgggcag cctgcaggcg accctgaact acctgctgac caacaggcgg 13080
cagaagattc ccacgctgca cagcctgacc caggaggagg agcgcatctt gcgctacgtg 13140
cagcagagcg tgagcctgaa cctgatgcgc gacggcgtga cgcccagcgt ggcgctggac 13200
atgaccgcgc gcaacatgga accgggcatg tacgcttccc agcggccgtt catcaaccgc 13260
ctgatggact acttgcatcg ggcggcggcc gtgaaccccg agtacttcac caatgccatt 13320
ctgaatcccc actggatgcc ccctccgggt ttctacaacg gggactttga ggtgcccgag 13380
gtcaacgacg ggttcctctg ggatgacatg gatgacagtg tgttctcccc caacccgctg 13440
cgcgccgcgt ctctgcgatt gaaggagggc tctgacaggg aaggaccgag gagtttggcc 13500
tcctccctgg ctctgggggc ggtgggcgcc acgggcgcgg cggcgcgggg cagcagcccc 13560
ttccccagcc tggcggactc tctgaatagc gggcgggtga gcaggccccg cttgctaggc 13620
gaggaggagt atctgaacaa ctccctgcta cagcccgtga gggacaaaaa cgctcagcgg 13680
cagcagtttc ccaacaacgg gatagagagc ctggtggaca agatgtccag atggaagacg 13740
tatgcgcagg agtacaagga gtgggaggac cgacagccgc ggcccctgcc gccccctaga 13800
cagcgctggc agcggcgtgc gtccaaccgc cgctggaggc aggggcccga ggacgatgat 13860
gactctgcag atgacagcag cgtgttggat ctgggcggga gcgggaaccc cttttcgcac 13920
ctgcgcccac gcctgggcaa gatgttttaa aagagaaaaa taaaaactca ccaaggccat 13980
ggcgacgagc gttggttttt ttgttccctt ccttagtatg cggcgcgcgg cgatgttcga 14040
ggaggggcct cccccctctt acgagagcgc gatgggaatt tctcctgcgg cgcccctgca 14100
gcctccctac gtgcctcctc ggtacctgca acctacaggg gggagaaata gcatctgtta 14160
ctctgagctg cagcccctgt acgataccac cagactgtac ctggtggaca acaagtccgc 14220
ggacgtggcc tccctgaact accagaacga ccacagcgat tttttgacca cggtgatcca 14280
aaacaacgac ttcaccccaa ccgaggccag tacccagacc ataaacctgg acaacaggtc 14340
gaactggggc ggcgacctga agactatcct gcacaccaat atgcccaacg tgaacgagtt 14400
catgttcacc aactctttta aggcgcgggt gatggtggcg cgcgagcagg gggaggcgaa 14460
gtacgagtgg gtggacttca cgctgcccga gggcaactat tcagagacca tgactctcga 14520
cctgatgaac aatgcgatcg tggaacacta tctgaaagtg ggcaggcaga acggggtgaa 14580
ggagagcgat atcggggtca agtttgacac cagaaacttt cgtctgggct gggaccccgt 14640
gaccgggctg gtcatgccgg gggtctacac caacgaggcc tttcatcccg atatagtgct 14700
cctgcccggc tgtggggtgg actttaccca gagccggctg agcaacctgc tgggcgttcg 14760
caagcggcaa cctttccagg agggtttcaa gatcacctat gaggatctgg aggggggcaa 14820
cattcccgcg ctccttgatc tggacgccta cgaggagagc ttgaaacccg aggagagcgc 14880
tggcgacagc ggcgagagtg gcgaggagca agccggcggc ggtggcagcg cgtcggtaga 14940
aaacgaaagt actcccgcag tggcggcgga cgctgcggag gtcgagccgg aggccatgca 15000
gcaggacgca gaggagggcg cgcaggagga catgaacaat ggggagatca ggggcgacac 15060
tttcgccacc cggggcgaag aaaaagaggc agaggcggcg gcggcgacgg cggaagccga 15120
aaccgaggca gaggcagagc ccgagaccga agttatggaa gacatgaatg atggagaacg 15180
taggggtgac acgtttgcca cccggggcga agagaaggcg gcggaggcag aagccgcggc 15240
tgaggaggcg gctgcggctg cggccaaggc tgaggctgcg gctgaggcta aggtcgaagc 15300
cgatgttgcg gttgaggctc aggctgagga ggaggcggcg actgaagcag ttaaggaaaa 15360
ggcccaggca gagcaggaag agaaaaaacc tgtcattcaa cctctaaaag aagatagcaa 15420
aaagcgcagt tacaacgtca tcgagggcag cacctttacc caataccgca gctggtacct 15480
ggcttacaac tacggcgacc cggtcaaggg ggtgcgctcg tggaccctgc tctgcacgcc 15540
ggacgtcacc tgcggctccg agcagatgta ctggtcgctg ccaaacatga tgcaagaccc 15600
ggtgaccttc cgttccacgc ggcaggttag caactttccg gtggtgggcg ccgaactgct 15660
gccagtgcac tccaagagtt tttacaacga gcaggccgtc tactcccagc tgatccgcca 15720
ggccacctct ctgacccacg tgttcaatcg ctttcccgag aaccagattt tggcgcgccc 15780
gccggccccc accatcacca ccgtcagtga aaacgttcct gccctcacag atcacgggac 15840
gctaccgctg cgcaacagca tctcaggagt ccagcgagtg accattactg acgccagacg 15900
ccggacctgc ccctacgttt acaaggcctt gggcatagtc tcgccgcgcg tcctctccag 15960
tcgcactttt taaaacacat ctaccctcac gctccaaaat catgtccgta ctcatctcgc 16020
ccagcaacaa caccggctgg gggctgcgcg cgcccagcaa gatgtttgga ggggcgagga 16080
aacgctccga acagcaccca gtgcgcgtgc gcggccacta ccgcgcgccc tggggtgcgc 16140
acaagcgcgg gcgcacaggg cgcaccactg tggatgatgt cattgactcc gtagtggagc 16200
aggcgcgcca ctacacaccc ggcgcgccga ccgcctccgc cgtgtccacc gtggaccagg 16260
cgatcgaaag cgtggtacag ggggcgcggc actatgccaa ccttaaaagt cgccgccgcc 16320
gcgtggcgcg ccgccatcgc cggagacccc gggctactgc cgccgcgcgc cttaccaagg 16380
ctctgctcaa gcgcgccagg cgaactggcc accgggccgc catgagggcc gcacggcggg 16440
ctgccgctgc cgcgagcgcc gtggccccgc gggcacgaag gcgcgcggcc gctgccgccg 16500
ccgccgccat ttccagcttg gcctcgacgc ggcgcggtaa catatactgg gtgcgcgact 16560
cggtaagcgg cacacgggtg cccgtgcgct ttcgcccccc acggaattag cacaaaacaa 16620
catacacact gagtctcctg ctgttgtgta tcccagcggc gaccgtcagc agcggcgaca 16680
tgtccaagcg caaaattaaa gaagagatgc tccaggtcat cgcgccggag atctatgggc 16740
ccccgaagaa ggaggaggat gattacaagc cccgcaagct aaagcgggtc aaaaagaaaa 16800
agaaagatga tgacgttgac gaggcggtgg agtttgtccg ccgcatggcg cccaggcgcc 16860
ccgtgcagtg gaagggtcgg cgcgtgcagc gagtcctgcg ccccggcacc gcggtggtct 16920
ttacgcccgg cgagcgttcc acgcgcactt tcaagcgggt gtacgatgag gtgtacggcg 16980
acgaggatct gttggagcag gccaaccatc gctttgggga gtttgcatat gggaaacggc 17040
cccgcgagag cctaaaagag gacctgctgg cgctaccgct ggacgagggc aatcccaccc 17100
cgagtctgaa gccggtaacc ctgcaacagg tgctgccttt gagcgcgccc agcgagcaga 17160
agcgagggtt gaagcgcgag ggcggggacc tggcacccac cgtgcagttg atggtgccca 17220
agcggcagaa gctggaggac gtgctggaga aaatgaaagt agagcccggg atccagcccg 17280
aaatcaaggt ccgccccatc aagcaggtgg cgcccggcgt gggagtccag accgtggacg 17340
ttaggattcc cacggaggag atggaaaccc aaaccgccac tccctcttcg gcggctagcg 17400
ccaccaccgg cgccgcttcg gtagaggtgc agacggaccc ctggctacct gccgccactg 17460
tcgccgccgc cgccgccgcc ccccgttcgc gcgggcgcaa gagaaattat ccagcggcca 17520
gcgcgctcat gccccagtac gcactgcatc catccatcgc gcccaccccc ggctaccgcg 17580
ggtactcgta ccgcccgcgc agatcagccg gcacccgcgg ccgccgccgc cgtgcgacca 17640
caaccagccg ccgccgtcgc cgccgccgcc agccagtgct gacccccgtg tctgtaagga 17700
aggtggctcg ctcggggagc acgctggtgg tgcccagagc gcgctaccac cccagcattg 17760
tttaaagccg gtctctgtat ggttcttgca gatatggccc tcacttgtcg cctccgcttc 17820
ccggtgccgg gataccgagg aagaactcac cgccgcagag gcatggcggg cagcggtctc 17880
cgcggcggcc gtcgccatcg ccggcgcgca aagagcaggc gcatgcgcgg cggtgtgctg 17940
cccttcctaa tcccgctaat cgccgcggcg atcggtgccg tgcccgggat cgcctccgtg 18000
gccctgcagg cgtcccagaa acattgactc ttgcaacctt gcaagcttgc attttttgga 18060
ggaaaaaata aaaagtctag actctcacgc tcgcttggtc ctgtgactat tttgtagaaa 18120
aaagatggaa gacatcaact ttgcgtcgct ggccccgcgt cacggctcgc gcccgttcat 18180
gggagactgg acagatatcg gcaccagcaa tatgagcggt ggcgccttca gctggggcag 18240
tctgtggagt ggccttaaaa attttggttc caccattaag aactatggca acaaagcgtg 18300
gaacagcagc acgggccaga tgctgagaga caagttgaaa gagcagaact tccaggaaaa 18360
ggtggcgcag ggcctggcct ctggcatcag cggggtggtg gacatagcta accaggccgt 18420
gcagaaaaag ataaacagtc atctggaccc ccggcctcag gtggaggaaa cgcctccagc 18480
aatggagacg gtgtctcccg agggcaaagg cgaaaagcgc ccgcggcccg acagggaaga 18540
gaccctggtg tcacacaccg aggagccgcc ctcttacgag gaggcagtca aggccggcct 18600
gcctaccact cgccccatag cccccatggc caccggtgtg gtgggacaca ggcaacacac 18660
ccccgcaaca ctagatctgc ccccgccgtc cgatccgcct cgccagccaa aggcggcgac 18720
ggtgtccgct ccctccactt ccgccgccaa cagagtgccc ctgcgccgcg ctgcaagcgg 18780
cccccgggcc tcgcgagtca gcggcaactg gcagagcaca ctgaacagca tcgtgggcct 18840
gggagtgagg agtgtgaagc gccgccgttg ctactgaatg agcaagctag ctaacgtgtt 18900
gtatgtgtgt atgcgtccta tgtcgccgcc agaggagctg ttgagccgcc ggcgccgtct 18960
gcactccagc gaatttcaag atggcgaccc catcgatgat gcctcagtgg tcgtacatgc 19020
acatctcggg ccaggacgct tcggagtacc tgagccccgg gctggtgcag ttcgcccgcg 19080
ccacagacac ctacttcaac atgagtaaca agttcaggaa ccccactgtg gcgcccaccc 19140
acgatgtgac cacggaccgg tcgcagcgcc tgacgctgcg gtttatcccc gtggatcggg 19200
aggacaccgc ctactcttac aaggcgcggt ttacgctggc cgtgggcgac aatcgcgtgc 19260
tggacatggc ctccacttac tttgacatcc ggggggtgct ggacaggggc cccactttta 19320
agccctactc gggcactgcc tacaaccccc tggcccccaa gggcgccccc aattcttgtg 19380
agtgggaaca agaggaaaat caggtggagg ctgcagatga ggacgtcgaa gatgaagaag 19440
cgcaagcaca agaggaagcc cctgttaaaa aaattcatgt atatgctcag gcgcctcttg 19500
ctggcgaaaa gattaccaag gatggtttgc aaataggtac tgaagtcgta ggagagacat 19560
ctaaggacac ttttgcagat aaaacattcc aacccgaacc tcagataggc gagtctcagt 19620
ggaacgaggc tgatgccgca gtagcaggag gtagagtttt gaaaaagact acccctatga 19680
gaccttgcta tggatcctat gccaggccta ccaatgccaa cgggggtcaa ggaattctgg 19740
ttgccaatga acaaggagtg atggagtcta aagtagaaat gcaatttttc tctaacacct 19800
caacccttaa tgcgcgggat ggaaccggca atcccgaacc aaaggtggtg ttgtacagcg 19860
aagatgtcca cttggaatct cccgatactc atctgtctta caagcccaaa aaggatgatg 19920
ttaatgccaa agtcatgttg ggtcagcaag ccatgcccaa cagacccaac ctcattggat 19980
ttagagataa tttcattggg cttatgtttt acaacagcac cggtaacatg ggagtgctgg 20040
cgggtcaggc ctctcagttg aatgctgtgg tggacttgca ggatagaaac acagaactgt 20100
catatcagct tatgcttgat tcaattgggg atagaaccag atacttctcc atgtggaacc 20160
aggcagtgga tagctatgat ccagatgtca gaattattga aaaccatggg gttgaggatg 20220
aactgcccaa ctactgcttc cctttgggcg gcataggaat tactgatact tatcaagggg 20280
tgaaaaatac caatggcaat ggtcagtgga ccaaagatga tcagttcgcg gaccgcaacg 20340
aaataggggt gggaaacaac ttcgccatgg agatcaacat ccaggccaac ctttggagaa 20400
acttcctcta tgcaaacgtg gggctctacc tgccagacaa gctcaagtac aaccccacca 20460
acgtggacat ctctgacaac cccaacacct atgactacat gaacaagcgg gtggtggccc 20520
ctggcctggt ggactgcttt gtcaatgtgg gagccaggtg gtccctggac tacatggaca 20580
acgtcaaccc cttcaaccac caccgcaatg cgggtctgcg ctaccgctcc atgatcctgg 20640
gcaacgggcg ctatgtgccc tttcacatcc aggtacccca gaagttcttt gccatcaaga 20700
acctcctgct cctgcccggc tcctacacct acgagtggaa cttcaggaag gatgtgaaca 20760
tggtcctaca gagctctctg ggcaatgacc ttagggtgga tggggccagc atcaagtttg 20820
acagcatcac cctctatgct acatttttcc ccatggccca caacaccgcc tccacgcttg 20880
aggccatgct gagaaacgac accaacgacc agtcctttaa tgactacctc tctggggcca 20940
acatgctcta cccaatccca gccaaggcca ccaacgtgcc catctccatc ccctctcgca 21000
actgggccgc ctttagaggc tgggccttta cccgccttaa gaccaaggag accccctccc 21060
tgggctcggg ttttgatccc tactttgttt actcgggatc catcccctac ctggatggca 21120
ccttctacct caaccacact ttcaagaaga tatccatcat gtatgactcc tccgtcagct 21180
ggccgggcaa cgaccgcttg ctcaccccca atgagttcga ggtcaagcgc gccgtggacg 21240
gcgagggcta caacgtggcc cagtgcaaca tgaccaagga ctggttcctg gtgcagatgc 21300
tggccaacta caacataggc taccagggct tttacatccc agagagctac aaggacagga 21360
tgtactcctt cttcagaaat ttccaaccca tgagccgaca ggtggtggac gagaccaatt 21420
acaaggacta tcaagccatt ggcatcaccc accagcacaa caactcgggt ttcgtgggct 21480
acctggcgcc caccatgcgc gagggtcagg cctaccccgc caacttcccc taccccttga 21540
taggcaagac cgcggtcgac agcgtcaccc agaaaaagtt cctctgcgac cgcaccctct 21600
ggcgcatccc cttctctagc aacttcatgt ccatgggtgc gctcacggac ctgggccaaa 21660
acctgcttta tgccaactct gcccatgcgc tggacatgac tttcgaggtg gaccccatgg 21720
acgagcccac ccttctctat attgtgtttg aagtgttcga cgtggtcaga gtgcaccagc 21780
cgcaccgcgg tgtcatcgag accgtgtacc tgcgtacgcc cttctcagcc ggcaacgcca 21840
ccacctaagg agacagcgcc gccgcctgca tgactggttc caccgagcaa gagctcaggg 21900
ccatcgccag agacctggga tgcggaccct actttttggg cacctatgac aaacgcttcc 21960
cgggtttcat ctcccgagac aagctcgcct gcgccatcgt caacacggcc gcgcgcgaga 22020
ccgggggcgt gcactggctg gcctttggct gggacccgcg ctctaaaact tgctacctct 22080
ttgacccctt tggcttctct gatcagcgcc tcaggcagat ttatgagttt gagtacgagg 22140
ggctgctgcg ccgcagcgcg cttgcctcct cgcccgaccg ctgcatcacc cttgagaagt 22200
ccaccgagac cgtgcagggg ccccactcgg ccgcctgcgg tctcttctgt tgcatgtttt 22260
tgcacgcctt tgtacactgg cctcagagtc ccatggatcg caaccccacc atgaacttgc 22320
taaagggagt gcccaacgcc atgctccaga gcccccaggt cctgcccacc ctgcgccgca 22380
accaggaaca gctctaccgc ttcctggagc gccactcccc ctacttccgc agccacagcg 22440
cgcgcatccg gggggccacc tctttttgcc acttgcaaga aaacatgcaa gacggaaaat 22500
gatgtacagc atgcttttaa taaatgtaaa gactgtgcac tttatttata cacgggctct 22560
ttctggttat ttattcaaca ccgccgtcgc catctagaaa tcgaaagggt tctgccgcgc 22620
gtcgccgtgc gccacgggca gagacacgtt gcgatactgg aagcggctcg cccacttgaa 22680
ctcgggcacc accatgcggg gcagtggttc ctcggggaaa ttctcgctcc acagggtgcg 22740
ggtcagctgc agcgcgctca ggaggtcggg agccgagatc ttgaagtcgc agttggggcc 22800
ggaaccctgc gcgcgcgagt tgcggtacac ggggttgcag cactggaaca ccagcagggc 22860
cggattattc acgctggcca gcaggctctc gtcgctgatc atgtcgctgt ccagatcctc 22920
cgcgttgctc agggcgaatg gggtcatctt gcagacctgc ctgcccagga aaggcgggag 22980
cccaggcttg ccgttgcagt cgcagcgcag gggcattagc aggtgcccac ggcccgactg 23040
cgcctgcggg tacaacgcgc gcatgaaggc ttcgatctgc ctaaaagcca cctgggtctt 23100
ggctccctcc gaaaagaaca tcccacagga cttgctggag aactgattcg cgggacagct 23160
ggcatcgtgc aggcagcagc gcgcgtcagt gttggcgatc tgcaccacgt tgcgacccca 23220
ccggtttttc actatcttgg ccttggaagc ctgctccttt agcgcgcgct ggccgttctc 23280
gctggtcaca tccatctcta tcacctgttc cttgttgatc atgtttgtcc cgtgcagaca 23340
ctttaggtcg ccctccgtct gggtgcagcg gtgctcccac agcgcgcaac cggtgggctc 23400
ccaattcttg tgggtcaccc ccgcgtaggc ctgcaggtag gcctgcagga agcgccccat 23460
catggtcata aaggtcttct ggctcgtaaa ggtcagctgc aggccgcgat gctcttcgtt 23520
cagccaggtc ttgcagatgg cggccagcgc ctcggtctgc tcgggcagca tcttaaaatt 23580
tgtcttcagg tcgttatcca cgtggtactt gtccatcatg gcacgcgccg cctccatgcc 23640
cttctcccag gcggacacca tgggcaggct tagggggttt atcacttcca gcggcgagga 23700
caccgtactt tcgatttctt cttcctcccc ctcttcccgg cgcgcgcccc cgctgttgcg 23760
cgctcttacc gcctgcacca aggggtcgtc ttcaggcaag cgccgcaccg agcgcttgcc 23820
gcccttgacc tgcttgatca gtaccggcgg gttgctgaag cccaccatag tcagcgccgc 23880
ctgctcttct tcgtcttcgc tgtctaccac tatttctggg gaggggcttc tccgctctgc 23940
ggcaaaggcg gcggatcgct tctttttttt cttgggagcc gccgcgatgg agtccgccac 24000
ggcgaccgag gtcgagggcg tggggctggg ggtgcgcggc accagggcct cgtcgccctc 24060
ggactcttcc tctgactcca ggcggcggcg gagtcgcttc tttgggggcg cgcgcgtcag 24120
cggcggcgga gacggggacg gggacgggga cgggacgccc tccacagggg gcggtcttcg 24180
cgcagacccg cggccgcgct cgggggtctt ctcgcgctgg tcttggtccc gactggccat 24240
tgtatcctcc tcctcctagg cagagagaca taaggagtct atcatgcaag tcgagaagga 24300
ggagagctta accaccccct ctgagaccgc cgtcgccgtc gcccccgcta ccgccgacgc 24360
gcccgccaca ccgagcgaca cccccgcgga cccccccgcc gacgcacccc tgttcgagga 24420
agcggccgtg gagcaggacc cgggctttgt ctcggcagag gaggatttgc aagaggagga 24480
ggataaggag gagaagccct cagtgccaaa agatcataaa gagcaagacg agcacgacgc 24540
agacgcacac cagggtgaag tcgggcgggg ggacggaggg catggcggcg ccgactacct 24600
agacgaagga aacgacgtgc tcttgaagca cctgcatcgt cagtgcgcca tcgtctgcga 24660
cgctctgcag gagcgcagcg aggtgcccct cagcgtggcg gaggtcagcc gcgcctacga 24720
gctcagcctc ttttcccccc gggtgccccc ccgccgccgc gaaaacggca catgcgagcc 24780
caacccgcgc ctcaacttct accccgcctt tgtggtgccc gaggtcctgg ccacctatca 24840
catcttcttt caaaattgca agatccccat ctcgtgccgc gccaaccgta gccgcgccga 24900
taagatgctg gccctgcgcc agggcgacca catacctgat atcgccgctt tggaagatgt 24960
accaaagatc ttcgagggtc tgggtcgcaa cgaaaagcgg gcagcaaact ctctgcaaca 25020
ggaaaacagc gaaaatgaga gtcacaccgg ggtgctggtg gagctcgagg gcgacaacgc 25080
ccgcctggcg gtgctcaagc gcagcatcga ggtcacccac tttgcctacc ccgcgctcaa 25140
cctgcccccc aaagtcatga acgcggtcat ggacgggctg atcatgcgcc gcggccagcc 25200
ccttgctcca gatgcaaact tgcatgagga gaccgaggac ggccagcccg tggtcagcga 25260
cgagcagctg gcgcgctggc tggaaaccgc ggaccccgcc gaactggagg agcggcgcaa 25320
gatgatgatg gccgcggtgc tggtcaccgt agagctggag tgtctgcagc gcttcttcgg 25380
tgaccccgag atgcagagaa aggtcgagga gaccctacac tacaccttcc gccagggcta 25440
cgtgcgccag gcttgcaaga tctccaacgt ggagctcagc aacctggtgt cctacctggg 25500
catcttgcat gagaaccgcc ttgggcagag cgtgctgcac tccaccctgc gcggggaagc 25560
gcgccgcgac tacgtgcgcg actgcgttta ccttttcctc tgctacacct ggcagacggc 25620
catgggggtc tggcagcagt gcctggagga gcgcaacctc aaggagctgg agaagctcct 25680
gcagcgcgcg ctcaaagacc tctggacggg cttcaacgag cgctcggtgg ccgccgcgct 25740
ggccgacctc atcttccccg agcgcctgct caaaactctc cagcaggggc tgcccgactt 25800
caccagccaa agcatgttgc aaaactttag gaactttatc ctggagcgtt ctggcatcct 25860
acccgccacc tgctgcgccc tgcccagtga ctttgttccc ctcgtgtacc gcgagtgccc 25920
cccgccgctg tggggccact gctacctgtt ccaactggcc aactacctgt cctaccacgc 25980
ggacctcatg gaggactcca gcggcgaggg gctcatggag tgccactgcc gctgcaacct 26040
ctgcacgccc caccgctccc tggtctgcaa cacccaactg ctcagcgaga gtcagattat 26100
cggtaccttc gagctacagg gtccgtcctc ctcagacgag aagtccgcgg ctccggggct 26160
aaaactcact ccggggctgt ggacttccgc ctacctgcgc aaatttgtac ctgaagacta 26220
ccacgcccac gagatcaggt tttacgagga ccaatcccgc ccgcccaagg cggagctgac 26280
cgcctgcgtc atcacccagg gcgagatcct aggccaattg caagccatcc aaaaagcccg 26340
ccaagagttt ttgctgagaa agggtcgggg ggtgtatctg gacccccagt cgggtgagga 26400
gctcaacccg gttcccccgc tgccgccgcc gcgggacctt gcttcccagg ataagcatcg 26460
ccatggctcc cagaaagaag cagcagcggc cgccactgcc gccaccccac atgctggagg 26520
aagaggagta ctgggacagt caggcagagg aggtttcgga cgaggaggag ccggagacgg 26580
agatggaaga gtgggaggag gacagcttag acgaggaggc ttccgaagcc gaagaggcag 26640
gcgcaacacc gtcaccctcg gccgcagccc cctcgcaggc gcccccgaag tccgctccca 26700
gcatcagcag caacagcagc gctataacct ccgctcctcc accgccgcga cccacggccg 26760
accgcagacc caaccgtaga tgggacacca ccggaaccgg ggccggtaag tcctccggga 26820
aaggcaagca agcgcagcgc caaggctacc gctcgtggcg cgctcacaag aacgccatag 26880
tcgcttgctt gcaagactgc ggggggaaca tctccttcgc ccgccgcttc ctgctcttcc 26940
accacggtgt ggccttcccc cgtaacgtcc tgcattacta ccgtcatctc tacagcccct 27000
actgcggcgg cagtgagcca gaggcggccg gcggcagcgg cgcccgtttc ggtgcctagg 27060
aagacccagg gcaagacttc agccaagaaa ctcgcggcgg ccgcggcgaa cgcggtcgcg 27120
ggggccctgc gcctgacggt gaacgaaccc ctgtcgaccc gcgaactgag gaaccgaatc 27180
ttccccactc tctatgccat cttccagcag agcagagggc aggatcagga actgaaagta 27240
aaaaacaggt ctctgcgctc cctcacccgc agctgtctgt atcacaagag cgaagaccag 27300
cttcggcgca cgctggagga cgctgaggca ctcttcagca aatactgcgc gctcactctt 27360
aaggactagc tccgcgccct tctcgaattt aggcgggaac gcctacgtca tcgcagcgcc 27420
gccgtcatga gcaaggacat tcccacgcca tacatgtgga gctatcagcc gcagatggga 27480
ctcgcggcgg gcgcctccca agactactcc acccgcatga actggctcag tgccggccca 27540
cacatgatct cacaggttaa tgacatccgc acccatcgaa accaaatatt ggtggagcag 27600
gcggcaatta ccaccacgcc ccgcaataat cccaacccca gggagtggcc cgcgtccctg 27660
gtgtatcagg aaattcccgg ccccaccacc gtactacttc cgcgtgattc ccaggccgaa 27720
gtccaaatga ctaactcagg ggcacagctc gcgggcggct gtcgtcacag ggtgcggcct 27780
cctcgccagg gtataactca cctggagatc cgaggcagag gtattcagct caacgacgag 27840
tcggtgagct cctcgctcgg tctcagacct gacgggacct tccagatagc cggagccggc 27900
cgatcttcct tcacgccccg ccaggcgtac ctgactctgc agagctcgtc ctcggcgccg 27960
cgctcgggcg gcatcgggac tctccagttc gtgcaggagt ttgtgccctc ggtctacttc 28020
aaccccttct cgggctctcc cggtcgctac ccggaccagt tcatcccgaa ctttgacgcc 28080
gcgagggact cggtggacgg ctacgactga atgtcgggtg gacccggtgc agagcaactt 28140
cgcctgaagc accttgacca ctgccgccgc cctcagtgct ttgcccgctg tcagaccggt 28200
gagttccagt acttttccct gcccgactcg cacccggacg gcccggcaca cggggtgcgc 28260
tttttcatcc cgagtcaggt ccgctctacc ctaatcaggg agtttacagc ccgtccccta 28320
ctggcggagt tggaaaaggg gccttctatc ctaaccattg cctgcatctg ctctaaccct 28380
ggattacacc aagatctttg ctgtcatttg tgtgctgagt ataataaagg ctgagatcag 28440
aatctactcg ggctcctgtc gccatcctgt caacgccacc gtccaagccc ggcccgatca 28500
gcccgaggtg aacctcacct gcggtctgca ccggcgcctg aggaaatacc tagcttggta 28560
ctacaacagc actccctttg tggtttacaa cagctttgac caggacgggg tctcactgag 28620
ggataacctc tcgaacctga gctactccat caggaagaac aacaccctcg agctacttcc 28680
tccttacctg cccgggactt accagtgtgt caccggtccc tgcacccaca cccacctgtt 28740
gatcgtaaac gactctcttc cgagaacaga cctcaataac tcctctccgc agttccccag 28800
aacaggaggt gagctcagga aaccccgggt aaagaagggt ggacaagagt taacacttgt 28860
ggggtttctg gtgtatgtga cgctggtggt ggctcttttg attaaggctt ttccttccat 28920
gtctgaactc tccctcttct tttatgaaca actcgactag tgctaacgag accctaccca 28980
acgaatcggg attgaatatc ggtaaccagg ttgcagtttc acttttgatt acctttatag 29040
tcctcttcct gctagtgctg tcgcttctgt gcctgcggat cgggggctgc tgcatccacg 29100
tttatatctg gtgctggctg tttagaaggt tcggagacca ccgcaggtag aataatgctg 29160
cttaccctct ttgtcctggc gctggctgcc agctgccaag ccttttccga ggctgacttc 29220
atagagcccc agtgcaatat cacttataaa tctgaacgtg ccatctgtac tatcctaatc 29280
aaatgtgtta ctcaacacga taaggtaact gttaaataca aagatcaatt aaaaaaagac 29340
gcactttaca gcagctggca accaggagat gaacaaaaat acaatgtaac cgtcttccag 29400
ggcaaactct ccaaaactta caattacact ttcccatttg agcagatgtg tgactttgtc 29460
atgtacatgg aaaagcagta caagctgtgg cctccaactc cccagggctg tgtggaaaat 29520
ccaggctctt tctgtatgat ctctctctgt gtaactgtgc tggcactaat actcacgctt 29580
ctgtatatca gatttaaatc aaggcaaagc tttattgatg aaaagaaaat gccttaatcg 29640
ctttcacgct tgattgctaa caccgggttt ttatccgcag aatgattgga atcaccctac 29700
taatcacctc cctccttgcg attgcccatg ggttggaacg aatcgaagtc cctgtggggg 29760
ccaatgttac cctggtgggg cctgtcggca atgctacatt aatgtgggaa aaatatacta 29820
aaaatcaatg ggtctcttac tgcactaaca aaaacagcca caagcccaga gccatctgcg 29880
atgggcaaaa tttaaccttg attgatgttc aattgctgga tgcgggctac tattatgggc 29940
agctgggtac aatgattaat tactggagac cccacagaga ttacatgctt cacgtagtaa 30000
agggtcccat tagcagccca accaccacct ctaccacccc cactaccacc actactccca 30060
ccaccagcac tgccgcccag cctcctcata gcagaacaac cacttttatc aattccaagt 30120
cccactcccc ccacattgcc ggcgggccct ccgcctcaga ctccgagacc accgagatct 30180
gcttctgcaa atgctctgac gccattgccc aggatttgga agatcacgag gaagatgagc 30240
atgactacgc agatgcatgc caggcatcag agtcagaagc gctgccggtg gccctaaaac 30300
agtatgcaga cccccacacc acccccgacc ttcctccacc ttcccagaag ccaagtttcc 30360
tgggggaaaa tgaaactctg cctctctcca tactagctct gacatctgtt gctattttgg 30420
ccgctctgct ggtgcttcta tgctctatat gctacctgat ctgctgcaga aagaaaaaat 30480
ctcacggcca tgctcaccag cccctcatgc acttccctta ccctccagag ctgggcgacc 30540
acaaacttta agtctgcagt agctatctgc ccatcccttg tcagtcgaca gcgatgagcc 30600
ccactaatct aacagcctct ggacttacaa cattgtctct taatgagacc accgctcctc 30660
aagacctgta cgatggtgtc tccgcgctgg ttaaccagtg ggatcacctg ggcatatggt 30720
ggctcctcat aggagcagtg accctgtgcc taatcctggt ctggatcatc tgctgcatca 30780
aaagcagaag acccaggcgg cggcccatct acaggccctt cgtcatcaca cctgaagata 30840
atgatgatga tgacaccacc tccaggctgc agagcctaaa gcagctactc ttctctttta 30900
cagcatggta aattgaatca tgccccgcat tttcatctac ttgcttctcc ttccactttt 30960
tctgggctcc tctacattgg ccgctgtgtc ccacatcgag gtagactgcc tcacgccctt 31020
cacagtctac ctgcttttcg gctttgtcat ctgcaccttt gtctgcagcg ttatcactgt 31080
agtgatctgc ttcatacagt gcatcgacta catctgtgtg cgggtggcct actttagaca 31140
ccacccccag tatcgcaaca gggacatagc ggctctccta agacttgttt aaatcatggc 31200
caaattacct gtgattggtc ttctgattat ctgctgcgtc ctagccgcga ttgggactca 31260
acctaatacc accaccagcg ctcccagaaa gagacatgta tcctgcagct tcaagcgtcc 31320
ctggaatata ccccaatgct ttactgatga acctgaaatc tctttggctt ggtacttcag 31380
cgtcaccgcc cttctcatct tctgcagtac ggttattgct cttgccatct acccttccct 31440
taacctgggc tggaatgctg tcaactctat ggaatatccc accttcccag aaccagacct 31500
gccagacctg gttgttctaa acgcgtttcc tcctcctcca gttcaaaatc agtttcgccc 31560
tccgtcccct acgcccactg aggtcagcta ctttaatcta acaggcggag atgactgaaa 31620
acctagacct agaaatggac ggtctctgca gcgagcaacg cacactagag aggcgccggc 31680
aaaaagcaga gctcgagcgt cttaaacaag agctccaaga cgccgtggcc atacaccagt 31740
gcaaaaaagg gctcttctgt ctggtaaaac aggccacgct cacctatgaa aaaacaggtg 31800
acacccaccg cctaggatac aagctgccca cacagcgcca aaagtttgcc cttatgatag 31860
gtgaacaacc catcaccgtc acccagcact ccgtggagac agaaggctgc attcatgctc 31920
cctgcagggg cgctgactgc ctctacacct tgatcaaaac cctctgcggt ctcagagacc 31980
ttatcccttt caattgatca taactgtaat caataaaaaa tcacttactt gaaatctgat 32040
agcaagcctc tgtccaattt tttcagcaac acttccttcc cctcttccca actctggtac 32100
tctaggcgcc tcctagctgc aaacttcctc cacagtctga agggaatgtc agattcctcc 32160
tcctcctgtc cctccgcacc cacaatcttc atgttgttgc agatgaaacg cgcgagatcg 32220
tctgacgaga ccttcaaccc cgtgtacccc tacgataccg agatcgctcc gacttctgtc 32280
cctttcctta cccctccctt tgtgtcaccc gcaggaatgc aagaaaatcc agctggggtg 32340
ctgtccctgc acctgtcaga gccccttacc acccacaatg gggccctgac tctaaaaatg 32400
gggggcggcc tgaccctgga caaggaaggg aatctcactt cccaaaacat caccagtgtc 32460
gatccccctc tcaaaaaaag caagaacaac atcagccttc agaccgccgc acccctcgcc 32520
gtcagctccg gggccctaac cctttttgcc actccccccc tagcggtcag tggcgacaac 32580
cttactgtgc agtctcaggc ccctcttact ttggaagact caaaactaac tctggccacc 32640
aaaggacccc taactgtgtc cgaaggcaaa cttgtcctag aaacagaggc tcccctgcat 32700
gcaagtgaca gcagtagcct gggccttagc gtcacggccc cacttagcat taacaatgac 32760
agcctaggac tagatctgca ggcacccatt gtctctcaaa atggaaaact ggctctaaat 32820
atagcaggcc ccctagctgt agccgatagc attaatgctt tgacagtagg cactggcaaa 32880
ggtattggac taaatgaaac cagcactcac ttgcaagcaa aattggttgc ccccctaggc 32940
tttgatacca atggcaatat taagctaagc gttgcaggag gcatgaggct aaacaatgac 33000
acactgatac tagatgtaaa ctacccattt gaagctcaag gtcaactaag cctaagagtg 33060
ggcacaggtc cactgtatgt agattctagc agtcataatc taaccattag atgccttagg 33120
ggattgtata taacatcatc taacaaccaa aacggtctag aggccaacat taaactaaca 33180
aaaggccttg tgtatgaagg aaatgccata gcagttaatg ttggtcaagg attgcaatac 33240
agcactactg ccacatcgga aggtgtgtat cctatacagt ctaagatagg tttgggaatg 33300
gaatatgata ccaacggagc catgatggca aaactaggct ccggtctaag ctttgataat 33360
tcaggagcca ttgtggtggg aaacaaaaat gatgacaaac ttaccctatg gaccacacct 33420
gacccgtctc ctaactgtag aatttattct gaaaaagata ctaaactaac cttggtgctg 33480
actaagtgtg gcagtcaaat cctaggcaca gtatctgccc ttgctgtcag aggcagcctt 33540
gcgcccatca ctaacgcatc cagcatagtc caaatatttc tacgatttga tgaaaatgga 33600
ctattgatga gcaactcatc gctagacggt gattactgga attacagaaa tggggactcc 33660
actaatggca caccatatac aaatgcagta ggctttatgc ctaatctagc tgcctatcct 33720
aaaggtcagg ctacaactgc aaaaagcagt attgtaagcc aggtatacat ggatggtgat 33780
actactaaac ctataacact aaaaataaac tttaatggca ttgatgaaac aacagaaaat 33840
acccctgtta gtaaatattc catgacattc tcatggagct ggcccaccgc aagctacata 33900
ggccacactt ttgcaacaaa ctcttttact ttctcctaca tcgcccaaga ataaagaaag 33960
cacagagatg cttgtttttg atttcaaaat tgtgtgcttt tatttatttt caagcttaca 34020
gtatttccag tagtcattca aatagagctt aatgaaactg catgagaacc cttccacata 34080
gcttaaatta tcaccagtgc aaatggagaa aaaatcaaca taccttttta tccagatatc 34140
atagaactct agtggtcagt tttcccccac cctcccagct cacagaatac acagtccttt 34200
ccccccggct ggctttaaac aacactatct cattggtaac agacatattc ttaggtgtaa 34260
taatccacac ggtctcttgg cgggccaaac gctggtcagt gatgttaata aactccccag 34320
gcagctcttt caagttcacg tcgctgtcca actgctgaag cgctcgcggc tccgactgcg 34380
cctctagcgg aggcaacggc aacacccgat ccttgatcta taaaggagta gagtcataat 34440
cccccataag aatagggcgg tgatgctgca acaaggcgcg cagcaactcc tgccgccgcc 34500
tttccgtacg acaggaatgc aacggggtgg tggtctcctc cgcgataatc cgcaccgctc 34560
gcaacatcag cgtcctcgtc ctccgggcac agcagcgcat cctgatctca ctgagatcgg 34620
cgcagtaagt gcagcacaac accaagatgt tatttaagat cccacagtgc aaagcactgt 34680
acccaaagct catggcggga aggacagccc ccacgtgacc atcataccag atcctcaggt 34740
aaatcaaatg acgacctctc atgaacacgc tggacatgta catcacctcc ttaggcatgt 34800
gctgattcac cacctctcga taccacaggc atcgctgatt aattaaagac ccctcgagca 34860
ccatcctgaa ccaggaagcc agcacctgac cccccgccag gcactgcagg gaccccggtg 34920
aatcgcagtg gcagtgaaga ctccagcgct cgtagccgtg aaccatagag ctggtcatta 34980
tatccacatt ggcacaacac agacacactt tcatacactt tttcatgatt agcagctcct 35040
ctctagtcag gaccatatcc caaggaatca cccactcttg aatcaaggta aatcccacac 35100
agcagggcag gcctctcaca taactcacgt tatgcatagt gagcgtgtcg caatctggaa 35160
ataccggatg atcttccatc accgaagccc gggtctccgt ctcaaaggga ggtaaacggt 35220
ccctcgtgta gggacagtgg cgggataatc gagatcgtgt tgaacgtaga gtcatgccaa 35280
agggaacagc ggacgtactc atatttcctc cagcagaacc aagtgcgcgc gtggcagcta 35340
tccttgcgtc ttctgtctcg ccgcctgccc cgctcggtgt agtagttgta atacagccac 35400
tccctcagac cgtcaaggcg ctccctggcg tccggatcta taacaacacc atcctgcagc 35460
gccgccctga tgacatccac caccgtagag tatgccaagc ccagccagga aatgcactca 35520
ctttgacagc gagagatagg aggagcggga agagatggaa gaaccatgat agtaaaagaa 35580
cttttattcc aatcgatcct ctacaatgtc aaagtgtaga tctatcagat ggcactggtc 35640
tcctccgctg agtcgatcaa aaataacagc taaaccacaa acaacacgat tggtcaaatg 35700
ctgcacaagg gcttgcagca taaaatcgcc tcgaaagtcc accgcaagca taacatcaaa 35760
gccaccgccc ctatcatgat ctatgataaa aaccccacag ctatccacca gacccatata 35820
gttttcatct ctccatcgtg aaaaaatatt tacaagctcc tcctttaaat cacctccaac 35880
caattcaaaa agttgagcca gaccgccctc caccttcatt ttcagcatgc gcatcatgat 35940
tgcaaaaatt caggctcctc agacacctgt ataagattga gaagcggaac gttaacatca 36000
atgtttcgct cgcgaagatc gcgcctcagt gcaagcatga tataatccca caggtcggag 36060
cggatcagcg aggacatctc cccgccagga accaactcaa cggagcctat gctgattata 36120
atacgcatat tcggggctat gctaaccagc acggccccca aataggcgta ctgcataggc 36180
ggcgacaaaa agtgaacagt ttgggttaaa aaatcaggca aacactcgcg caaaaaagca 36240
agaacatcat aaccatgctc atgcaaatag atgcaagtaa gctcaggaac gaccacagaa 36300
aaatgcacaa tttttctctc aaacatgact gcgagccctg caaaaataaa aaagaaacat 36360
tacacaagag tagcctgtct tacaatggga tagactactc taaccaacat aagacgggcc 36420
acaacatcgc ccgcgtggcc ataaaaaaaa ttatccgtgt gattaaaaag aagcacagat 36480
agctggccag tcatatccgg agtcatcacg tgcgaacccg tgtagacccc cgggttggac 36540
acatcggcca aacaaagaaa gcggccaatg tatcccggag gaatgataac actaagacga 36600
agatacaaca gaataacccc atggggggga ataacaaagt tagtaggtga ataaaaacga 36660
taaacacccg aaactccctc ctgcgtaggc aaaatagcgc cctccccttc caaaacaaca 36720
tatagcgctt ccacagcagc catgacaaaa gactcaaaac actcaaaaga ctcagtctta 36780
ccaggaaaat aaaagcactc tcacagcacc agcactaatc agagtgtgaa aaaggccaag 36840
tgccgaacga gtatatatag gaattaaaaa tgacgtaaat gtgtaaaggt cagaaaacgc 36900
ccagaaaaat acacagacca acgcccgaaa cgaaaacccg cgaaaaaata cccagaagtt 36960
cctcaacaac cgccacttcc gctttcccac gagacgtcac ttcctcaaaa atagcaaact 37020
acatttccca catatacaaa accaaaaccc ctccccttgt caccgcccac aacttacatc 37080
atcacaaacg tcaaagccta cgtcacccgc cccgcccacc tcattatcat attggccaca 37140
atccaaaata aggtatatta ttgatgatg 37169
<210> 5
<211> 37184
<212> DNA
<213> Great Ape Adenovirus
<400> 5
catcatcaat aatatacctt attttggatt gaggccaata tgataatgag gtgggcgggg 60
cgaggcgggg cgggtgacgt aggacgcgcg agtagggttg ggaggtgtgg cggaagtgtg 120
gcatttgcaa gtgggaggag ctgacatgca atcttccgtc gcggaaaatg tgacgttttt 180
gatgagcgcc gcctacctcc ggaagtgcca attttcgcgc gcttttcacc ggatatcgta 240
gtaattttgg gcgggaccat gtaagatttg gccattttcg cgcgaaaagt gaaacgggga 300
agtgaaaact gaataatagg gcgttagtca tagcgcgtaa tatttaccga gggccgaggg 360
actttgaccg attacgtgga ggactcgccc aggtgttttt tacgtgaatt tccgcgttcc 420
gggtcaaagt ctccgttttt attgtcgccg tcatctgacg cggagggtat ttaaacccgc 480
tgcgctccta aagaggccac tcttgagtgc cagcgagaag agttttctcc tccgctccgt 540
ttcggcgatc gaaaaatgag acatttagcc tgcactccgg gtcttttgtc cggccgggcg 600
gcgtccgagc ttttggacgc tttgctcaat gaggttctga gcgatgattt tccgtctact 660
acccacttta gcccacctac tcttcacgaa ctgtacgatc tggatgtact ggtggatgtg 720
aacgatccca acgaggaggc ggtttctacg ttttttcccg agtctgcgct tttggctgcc 780
caggagggat ttgacctaca cactccgccg ctgcctattt tagagtctcc gctgccggag 840
cccagtggta taccttatat gcctgaactg cttcccgaag tggtagacct gacctgccac 900
gagccgggct ttccgcccag cgacgatgag ggtgagcctt ttgctttaga ctatgctgag 960
atacctgggc tcggttgcag gtcttgtgca tatcatcaga gggttaccgg agaccccgag 1020
gttaagtgtt cgctgtgcta tatgaggctg acctcttcct ttatctacag taagtttttt 1080
tgtgtaggtg ggctttttgg gtaggtgggt tttgtggcag gacaggtgta aatgttgctt 1140
gtgttttttg tacctgcagg tccggtgtcc gagccagacc cggagcccga ccgcgatccc 1200
gagccggatc ccgagcctcc tcgcaggcca aggaaattac cttccatttt gtgcaagcct 1260
aagacacctg tgaggaccag cgaggcggac agcactgact ctggcacttc tacctctcct 1320
cctgaaattc acccagtggt tcctctgggt atacatagac ctgttgctgt tagagtttgc 1380
gggcgacgcc ctgcagtaga gtgcattgag gacttgctta acgatcccga gggacctttg 1440
gacttgagca ttaaacgccc taggcaataa accccaccta agtaataaac cccacctaag 1500
taataaactt taccgccctt ggttattgag atgacgccca atgtttgctt ttgaatgact 1560
tcatgtgtat aataaaagtg agtgtggtca taggtctctt gtttgtctgg gcggggttta 1620
agggtatata agtttctcgg ggctaaactt ggttacactt gaccccaatg gaggcgtggg 1680
ggtgcttgga ggagtttgcg gacgtgcgcc gtttgctgga cgagagctct agcaatacct 1740
atagtatttg gaggtatctg tggggctcta ctcaggccaa gttggtcttc agaattaagc 1800
aggattacaa gtgcgatttt gaagagcttt ttagttcctg tggtgagctt ttgcaatcct 1860
tgaatctggg ccaccaggct atcttccagg aaaaggttct ctcgactttg gatttttcca 1920
ctcccgggcg caccgccgct tgtgtggctt ttgtgtcttt tgtgcaagat aaatggagcg 1980
gggagaccca cctgagtcac ggctacgtgc tggatttcat ggcgatggct ctttggaggg 2040
cttacaacaa atggaagatt cagaaggaac tgtacggttc cgccctacgt cgtccacttc 2100
tgcagcggca ggggctgatg tttcccgacc atcgccagca tcagaatctg gaagacgagc 2160
gagcggagaa gatcagcttg agagccggcc tggaccctcc tcaggaggaa tgaatctccc 2220
gcaggtggtt gagctgtttc ccgaactgag acgggtcctg actatcaggg aggatggtca 2280
gtttgtgaag aagctgaaga gggatcgggg tgagggagat gatgaggcgg ctagcaattt 2340
agcttttagt ctgataactc gccaccgacc ggaatgtatt acctatcagc agattaagga 2400
gagttgtgcc aacgagctgg atcttttggg tcagaagtat agcatagaac agcttaccac 2460
ttactggctt cagcccgggg atgattggga agaggcgatt agggtgtatg caaaggtggc 2520
cctgcggccc gattgcaagt ataagattac taagttggtt aatattagaa actgctgcta 2580
tatttctgga aacggggccg aagtggagat agatactgag gacagggtgg ctattaggtg 2640
ttgcatgata aacatgtggc ccgggatact ggggatggat ggggtgatat ttatgaatgt 2700
gaggttcacg ggccccaact ttaatggtac ggtgttcatg ggcaacacca acttgctcct 2760
gcatggtgcg agtttctatg ggtttaacaa cacctgtata gaggcctgga ccgatgtaaa 2820
ggttcgaggt tgttcctttt atagctgttg gaaggcggtg gtgtgtcgcc ctaaaagcag 2880
gggttctgtg aagaaatgct tgtttgaaag gtgcacccta ggtatccttt ctgagggcaa 2940
ctccagggtg cgccataatg tggcttcgaa ctgcggttgc ttcatgcaag tgaagggggt 3000
gagcgttatc aagcataact cggtctgtgg aaactgcgag gatcgcgcct ctcagatgct 3060
gacctgcttt gatggcaact gtcacctgtt gaagaccatt catataagca gtcaccccag 3120
aaaggcctgg cccgtgtttg agcataacat tctgacccgc tgttccttgc atctgggggt 3180
caggaggggt atgttcctgc cttaccagtg taactttagc cacactaaaa tcctgctgga 3240
acccgagtgc atgactaagg tcagcctgaa tggtgtgttt gatgtgagtc tgaagatttg 3300
gaaggtgctg aggtatgatg agaccaggac caggtgccga ccctgcgagt gcggcggcaa 3360
gcacatgaga aatcagcctg tgatgttgga tgtgaccgag gagcttaggc ctgaccatct 3420
ggtgctggcc tgcaccaggg ccgagtttgg gtctagcgat gaggataccg attgaggtgg 3480
gtaaggtggg cgtggctagc agggtgggcg tgtataaatt gggggtctaa ggggtctctc 3540
tgtttgtctt gcaacagccg ccgccatgag cgacaccggc aacagctttg atggaagcat 3600
ctttagtccc tatctgacag tgcgcatgcc tcactgggcc ggagtgcgtc agaatgtgat 3660
gggttccaac gtggatggac gtcccgttct gccttcaaat tcgtctacta tggcctacgc 3720
gaccgtggga ggaactccgc tggacgccgc gacctccgcc gccgcctccg ccgccgccgc 3780
gaccgcgcgc agcatggcta cggaccttta cagctctttg gtggcgagca gcgcggcctc 3840
tcgcgcgtct gctcgggatg agaaactgac tgctctgctg cttaaactgg aagacttgac 3900
ccgggagctg ggtcaactga cccagcaggt ttccagcttg cgtgagagca gccttgcctc 3960
cccctaatgg cccataatat aaataaaagc cagtctgttt ggattaagca agtgtatgtt 4020
ctttatttaa ctctccgcgc gcggtaagcc cgggaccagc ggtctcggtc gtttagggtg 4080
cggtggattt tttccaacac gtggtacagg tggctctgga tgtttagata catgggcatg 4140
agtccatccc tggggtggag gtagcaccac tgcagagctt cgtgctcggg ggtggtgttg 4200
tatatgatcc agtcgtagca ggagcgctgg gcgtggtgct gaaaaatgtc cttaagcaag 4260
aggcttatag ctagggggag gcccttggtg taagtgttta caaatctgct tagctgggag 4320
gggtgcatcc ggggggatat gatgtgcatc ttggactgga tttttaggtt ggctatgttc 4380
ccgcccagat cccttctggg attcatgttg tgcaggacca ccagcacggt atatccagtg 4440
cacttgggaa atttatcgtg gagcttagac gggaatgcat ggaagaactt ggagacgccc 4500
ttgtggcctc ccagattttc catacattcg tccatgatga tggcaatggg cccgtgggaa 4560
gctgcctgag caaaaacgtt tctggcatcg ctcacatcgt agttatgttc cagggtgagg 4620
tcatcatagg acatctttac gaatcggggg cgaagggtcc cggactgggg gatgatggta 4680
ccctcgggcc ccggggcgta gttcccctca cagatctgca tctcccaggc tttcatttca 4740
gagggaggga tcatatccac ctgcggggcg atgaaaaaga cagtttctgg cgcaggggag 4800
attaactggg atgagagcag gtttctgagc agctgtgact ttccacagcc ggtgggccca 4860
tatatcacgc ctatcaccgg ctgcagctgg tagttaagag agctgcagct gccgtcctcc 4920
cggagcaggg gggccacctc gttgagcata tccctgacgt ggatgttctc cctgaccagt 4980
tccgccagaa ggcgctcgcc gcccagcgaa agcagctctt gcaaggaagc aaaatttttc 5040
agcggtttca ggccatcggc cgtgggcatg tttttcagcg tctgggtcag cagctccagc 5100
ctgtcccaga gctcggtgat gtgctctacg gcatctcgat ccagcagatc tcctcgtttc 5160
gcgggttggg gcggctttcg ctgtagggca ccagccgatg ggcgtccagc ggggccagag 5220
tcatgtcctt ccatgggcgc agggtcctcg tcagggtggt ctgggtcacg gtgaaggggt 5280
gcgctccggg ttgggcactg gccagggtgc gcttgaggct ggttctgctg gtgctgaatc 5340
gctgccgctc ttcgccctgc gcgtcggcca ggtagcattt gaccatggtc tcgtagtcga 5400
gaccctcggc ggcgtgcccc ttggcgcgga gctttccctt ggaggtggcg ccgcacgagg 5460
ggcactgcag gctcttcagg gcgtagagct tgggagcgag aaacacggac tctggggagt 5520
aggcgtccgc gccgcaggcc gagcagaccg tctcgcattc caccagccaa gtgagttccg 5580
ggcggtcagg gtcaaaaacc aggttgcccc catgcttttt gatgcgtttc ttaccttggc 5640
tctccatgag gcggtgtccc ttctcggtga cgaagaggct gtccgtgtcc ccgtagaccg 5700
acttcagggg cctgtcttcc agcggagtgc ctctgtcctc ctcgtagaga aactctgacc 5760
actctgagac gaaggcccgc gtccaggcca ggacgaagga ggccacgtgg gaggggtagc 5820
ggtcgttgtc cactagcggg tccaccttct ccagggtgtg caggcacatg tccccctcct 5880
ccgcgtccag aaaagtgatt ggcttgtagg tgtaggacac gtgaccgggg gttcccaacg 5940
ggggggtata aaagggggtg ggtgcccttt catcttcact ctcttccgca tcgctgtctg 6000
cgagagccag ctgctggggt aagtattccc tctcgaaggc gggcatgacc tcagcgctca 6060
ggttgtcagt ttctaaaaat gaggaggatt tgatgttcac ctgtccggag gtgatacctt 6120
tgagggtacc tgggtccatc tggtcagaaa acactatttt tttgttatca agcttggtgg 6180
cgaatgaccc gtagagggcg ttggagagca gcttggcgat ggagcgcagg gtctggtttt 6240
tgtcgcggtc ggctcgctcc ttggccgcga tgttgagttg cacgtactcg cgggccacgc 6300
acttccactc ggggaacacg gtggtgcgct cgtctgggat caggcgcacc ctccagccgc 6360
ggttgtgcag ggtgaccatg tcgacgctgg tggcgacctc accgcgcaga cgctcgttgg 6420
tccagcagag gcggccgccc ttgcgcgagc agaagggggg tagggggtcc agctggtcct 6480
cgtttggggg gtccgcgtcg atggtaaaga ccccggggag caggcgcggg tcaaagtagt 6540
cgatcttgca agcttgcatg tccagagccc gctgccattc gcgggcggcg agcgcgcgct 6600
cgtaggggtt gaggggcggg ccccagggca tggggtgggt gagcgcggag gcgtacatgc 6660
cgcagatgtc atacacgtac aggggttccc tgaggatacc gaggtaggtg gggtagcagc 6720
gccccccgcg gatgctggcg cgcacgtagt catagagctc gtgggagggg gccagcatgt 6780
tgggcccgag gttggtgcgc tgggggcgct cggcgcggaa gacgatctgc ctgaagatgg 6840
cgtgggagtt ggaggagatg gtgggccgct ggaagacgtt gaagcttgct tcttgcaagc 6900
ccacggagtc cctgacgaag gaggcgtagg actcgcgcag cttgtgcacc agctcggcgg 6960
tgacctggac gtcgagcgca cagtagtcga gggtctcgcg gatgatgtca tacctatcct 7020
cccccttctt tttccacagc tcgcggttga ggacgaactc ttcgcggtct ttccagtact 7080
cttggagggg aaacccgtcc gtgtccgaac ggtaagagcc tagcatgtag aactggttga 7140
cggcctggta ggggcagcag cccttctcca cgggcagcgc gtaggcctgc gccgccttgc 7200
ggagggaggt gtgggtgagg gcgaaagtgt ccctgaccat gactttgagg tattgatgtc 7260
tgaagtctgt gtcatcgcag ccgccctgtt cccacagggt gtagtccgtg cgctttttgg 7320
agcgcgggtt gggcagggag aaggtgaggt cattgaagag gatcttcccc gctcgaggca 7380
tgaagtttct ggtgatgcga aagggccctg ggaccgagga gcggttgttg atgacctggg 7440
cggccaggac gatctcgtca aagccgttta tgttgtgtcc cacgatgtag agctccagga 7500
agcggggctg gcccttgatg gaggggagct ttttaagttc ctcgtaggta agctcctcgg 7560
gcgattccag gccgtgctcc tccagggccc agtcttgcaa gtgagggttg gccgccagga 7620
aggatcgcca gaggtcgcgg gccatgaggg tctgcaggcg gtcgcggaag gttctgaact 7680
gccgccccac ggccattttt tcgggggtga tgcagtagaa ggtgaggggg tctttctccc 7740
aggggtccca tctgagctct cgggcgaggt cgcgcgcggc agcgaccaga gcctcgtcgc 7800
cccccagttt catgaccagc atgaagggca cgagttgctt gccaaaggct cccatccaag 7860
tgtaggtttc tacatcgtag gtgacaaaga ggcgctccgt gcgaggatga gagccgattg 7920
ggaagaactg gatctcccgc caccagttgg aggattggct gttgatgtgg tgaaagtaga 7980
agtcccgtct gcgggccgag cactcgtgct ggcttttgta aaagcgaccg cagtactggc 8040
agcgctgcac gggttgtata tcttgcacga ggtgaacctg gcgacctctg acgaggaagc 8100
gcagcgggaa tctaagtccc ccgcctgggg tcccgtgtgg ctggtggtct tttactttgg 8160
ttgtctggcc gccagcatct gtctcctgga gggcgatggt ggaacagacc accacgccgc 8220
gagagccgca ggtccagatc tcggcgctcg gcgggcggag tttgatgacg acatcgcgca 8280
cattggagct gtccatggtc tccagctccc gcggcggcag gtcagccggg agttcctgga 8340
ggttcacctc gcagagacgg gtcaaggcgc ggacagtgtt gagatggtat ctgatttcaa 8400
ggggcatgtt ggaggcggag tcgatggctt gcaggaggcc gcagccccgg ggggccacga 8460
tggttccccg cggggcgcga ggggaggcgg aagctggggg tgtgttcaga agcggtgacg 8520
cgggcgggcc cccggaggta gggggggttc cggccccaca ggcatgggcg gcaggggcac 8580
gtcttcgccg cgcgcgggca ggggctggtg ctggctccga agagcgcttg cgtgcgcgac 8640
gacgcgacgg ttggtgtcct gtatctggcg cctctgagtg aagaccacgg gtcccgtgac 8700
cttgaacctg aaagagagtt cgacagaatc aatctcggca tcgttgacag cggcctggcg 8760
caggatctcc tgcacgtcgc ccgagttgtc ctggtaggcg atttctgcca tgaactgctc 8820
gatctcttcc tcctggagat ctcctcgtcc ggcgcgctcc acggtggccg ccaggtcgtt 8880
ggagatgcga cccatgagct gcgagaaggc gttgagtccg ccctcgttcc agacccggct 8940
gtagaccacg cccccctcgg cgtcgcgggc gcgcatgacc acctgggcca ggttgagctc 9000
cacgtgtcgc gtgaagacgg cgtagttgcg caggcgctgg aaaaggtagt tcagggtggt 9060
ggcggtgtgc tcggcgacga agaagtacat gacccagcgc cgcaacgtgg attcattgat 9120
gtcccccaag gcctccaggc gctccatggc ctcgtagaag tccacggcga agttgaaaaa 9180
ctgggagttg cgagcggaca cggtcaactc ctcctccaga agacggatga gctcggcgac 9240
agtgtcgcgc acctcgcgct cgaaggccac ggggggcgct tcttcctctt ccacctcttc 9300
ttccatgatt gcttcttctt cttcctcagc cgggacggga gggggcggcg gcgggggagg 9360
ggcgcggcgg cggcggcggc gcaccgggag gcggtcgatg aagcgctcga tcatctcccc 9420
ccgcatgcgg cgcatggtct cggtgacggc gcggccgttc tcccgggggc gcagctcgaa 9480
gacgccgcct ctcatttcgc cgcggggcgg gcggccgtga ggtagcgaga cggcgctgac 9540
tatgcatctt aacaattgct gtgtaggtac gccgccaagg gacctgattg agtccagatc 9600
caccggatcc gaaaaccttt ggaggaaagc gtctatccag tcgcagtcgc aaggtaggct 9660
gagcaccgtg gcgggcgggg gcgggtcggg agagttcctg gcggagatgc tgctgatgat 9720
gtaattaaag taggcggtct tgagaaggcg gatggtggac aggagcacca tgtctttggg 9780
tccggcctgt tggatgcgga ggcggtcggc catgccccag gcctcgttct gacaccggcg 9840
caggtctttg tagtaatctt gcatgagtct ttccaccggc acttcttctc cttcctcttc 9900
ttcatctcgc cggtggtttc tcgcgccgcc catgcgcgtg accccaaagc ccctgagcgg 9960
ctgcagcagg gccaggtcgg cgaccacgcg ctcggccaag atggcctgct gtacctgagt 10020
gagggtcctc tcgaagtcat ccatgtccac gaagcggtgg taggcacccg tgttgatggt 10080
gtaggtgcag ttggccatga cggaccagtt gacggtctgg tgtcccggct gcgagagctc 10140
cgtgtaccgc aggcgcgaga aggcgcggga atcgaacacg tagtcgttgc aagtccgcac 10200
cagatactgg tagcccacca ggaagtgcgg cggaggttgg cgatagaggg gccagcgctg 10260
ggtggcgggg gcgccgggcg ccaggtcttc cagcatgagg cggtggtatc cgtagatgta 10320
cctggacatc caggtgatgc ctgcggcggt ggtggtggcg cgcgcgtagt cgcggacccg 10380
gttccagatg tttcgcaggg gcgagaagtg ttccatggtc ggcacgctct ggccggtgag 10440
gcgcgcgcag tcgttgacgc tctatacaca cacaaaaacg aaagcgttta cagggctttc 10500
gttctgtagc ctggaggaaa gtaaatgggt tgggttgcgg tgtgccccgg ttcgagacca 10560
agctgagctc agccggctga agccgcagct aacgtggtat tggcagtccc gtctcgaccc 10620
aggccctgta tcctccagga tacggtcgag agcccttttg ctttcttggc caagcgcccg 10680
tggcgcgatc tgggatagat ggtcgcgatg agaggacaaa agcggctcgc ttccgtagtc 10740
tggagaaaca atcgccaggg ttgcgttgcg gcgtaccccg gttcgagccc ctatggcggc 10800
ttggatcggc cggaaccgcg gctaacgtgg gctgtggcag ccccgtcctc aggaccccgc 10860
cagccgactt ctccagttac gggagcgagc cccttttgtt tttttatttt ttagatgcat 10920
cccgtgctgc ggcagatgcg cccctcgccc cggcccgatc agcagcagca acagcaggca 10980
tgcagacccc cctctcctct ccccgccccg gtcaccacgg ccgcggcggc cgtgtccggt 11040
gcggggggcg cgctggagtc agatgagcca ccgcggcggc gacctaggca gtatctggac 11100
ttggaagagg gcgagggact ggcgcggctg ggggcgagct ctccagagcg ccacccgcgg 11160
gtgcagttga aaagggacgc gcgtgaggcg tacctgccgc ggcaaaacct gtttcgcgac 11220
cgcgggggcg aggagcccga ggagatgcgg gactgcaggt tccaagcggg gcgcgagctg 11280
cgccgcggct tggacagaca gcgcctgctg cgcgaggagg actttgagcc cgacacgcag 11340
acgggcatca gccccgcgcg cgcgcacgtg gccgcggccg acctggtgac cgcctacgag 11400
cagacggtga accaggagcg caacttccaa aaaagcttca acaaccacgt gcgcacgctg 11460
gtggcgcgcg aggaggtgac cctgggtctc atgcatctgt gggacctggt ggaggcgatc 11520
gtgcagaacc ccagcagcaa gcccctgacc gcgcagctgt tcctggtggt gcagcacagc 11580
agggacaacg aggccttcag ggaggcgctg ctgaacatca ccgagccgga ggggcgctgg 11640
ctcctggacc tgataaacat cctgcagagc atagtggtgc aggagcgcag cctgagcctg 11700
gccgagaagg tggcggccat taactattct atgctgagcc tgggcaagtt ctacgctcgc 11760
aagatctaca agacccccta cgtgcccata gacaaggagg tgaagataga cagcttctac 11820
atgcgcatgg cgctgaaggt gctaaccctg agcgacgacc tgggagtgta ccgcaacgag 11880
cgcatccaca aggccgtgag cgccagccgg cggcgcgagc tgagcgaccg cgaactgatg 11940
cacagtctgc agcgcgcgct gaccggcgcg ggcgagggcg acagggaggt cgagtcctac 12000
tttgacatgg gggccgacct gcactggcag ccgagccgcc gcgccctgga agcggcgggg 12060
gcgtacggcg gccccctggc ggccgatgac gaggaagagg aggactatga gctagaggag 12120
ggcgagtacc tggaggactg acctggctgg tggtgttttg gtatagatgc aagatccgaa 12180
cgtggcggac ccggcggtcc gggcggcgct gcagagccag ccgtccggca ttaactcctc 12240
tgacgactgg gccgcggcca tgggtcgcat catggccctg accgcgcgca accccgaggc 12300
cttcaggcag cagcctcagg ctaaccggct ggcggccatc ttggaagcgg tagtgcccgc 12360
gcgctccaac cccacccacg agaaggtgct ggccatagtc aacgcgctgg cggagagcag 12420
ggccatccgg gcagacgagg ccggactggt gtacgatgcg ctgctgcagc gggtggcgcg 12480
gtacaacagc ggcaacgtgc agaccaacct ggaccgcctg gtgacggacg tgcgcgaggc 12540
cgtggcgcag cgcgagcgct tgcatcagga cggcaacctg ggctcgctgg tggcgctaaa 12600
cgccttcctt agcacccagc cggccaacgt accgcggggg caggaggact acaccaactt 12660
cttgagcgcg ctgcggctga tggtgaccga ggtccctcag agcgaggtgt accagtcggg 12720
gcccgactac ttcttccaga ccagcagaca gggcttgcaa accgtgaacc tgagccaggc 12780
tttcaagaac ctgcgggggc tgtggggagt gaaggcgccc accggcgacc gggctacggt 12840
gtccagcctg ctaaccccca actcgcgcct gctgctgctg ctgatcgcgc ccttcacgga 12900
cagcgggagc gtctcgcggg agacctatct gggccacctg ctgacgctgt accgcgaggc 12960
catcgggcag gcgcaggtgg acgagcacac cttccaggag atcaccagcg tgagccacgc 13020
gctggggcag gaggacacgg gcagcctgca ggcgaccctg aactacctgc tgaccaacag 13080
gcggcagaag attcccacgc tgcacagcct gacccaggag gaggagcgca tcttgcgcta 13140
cgtgcagcag agcgtgagcc tgaacctgat gcgcgacggc gtgacgccca gcgtggcgct 13200
ggacatgacc gcgcgcaaca tggaaccggg catgtacgct tcccagcggc cgttcatcaa 13260
ccgcctgatg gactacttgc atcgggcggc ggccgtgaac cccgagtact tcaccaatgc 13320
cattctgaat ccccactgga tgccccctcc gggtttctac aacggggact tcgaggtgcc 13380
tgaggtcaac gatgggttcc tctgggatga catggatgac agtgtgttct cccccaaccc 13440
gctgcgcgcc gcgtctctgc gattgaagga gggctctgac agggaaggac caaggagtct 13500
ggcctcctcc ctggctctgg gggcggtggg cgccacgggc gcggcggcgc ggggcagcag 13560
ccccttcccc agcctggcgg actctctgaa tagcgggcgg gtgagcaggc cccgcttgct 13620
aggcgaggag gagtatctga acaactccct gctgcagccc gtgagggaca aaaacgctca 13680
gcggcagcag tttcccaaca atgggataga gagcctggtg gacaagatgt ccagatggaa 13740
gacgtatgcg caggagtaca aggagtggga ggaccgccag ccgcggcccc tgccgccccc 13800
tagacagcgc tggcagcggc gcgcgtccaa ccgccgctgg aggcaggggc ccgaggacga 13860
tgatgactct gcagatgaca gcagcgtgtt ggacctgggc gggagcggga accccttttc 13920
gcacctgcgc ccacgcctgg gcaagatgtt ttaaaagaga aaaataaaaa ctcaccaagg 13980
ccatggcgac gagcgttggt tttttgttcc cttccttagt atgcggcgcg cggcgatgtt 14040
cgaggagggg cctcccccct cttacgagag cgcgatggga atttctcctg cggcgcccct 14100
gcagcctccc tacgtgcctc ctcggtacct gcaacctaca ggggggagaa atagcatctg 14160
ttactctgag ctgcagcccc tgtacgatac caccagactg tacctggtgg acaacaagtc 14220
cgcggacgtg gcctccctga actaccagaa cgaccacagc gattttttga ccacggtgat 14280
ccaaaacaac gacttcaccc caaccgaggc cagtacccag accataaacc tggacaacag 14340
gtcgaactgg ggcggcgacc tgaagactat cctgcacacc aatatgccca acgtgaacga 14400
gttcatgttc accaactctt ttaaggcgcg ggtgatggtg gcgcgcgagc agggggaggc 14460
gaagtacgag tgggtggact tcacgctgcc cgagggcaac tactcagaga ccatgactct 14520
cgacctgatg aacaatgcga tcgtggaaca ctatctgaaa gtgggcaggc agaacggggt 14580
gaaggagagc gatatcgggg tcaagtttga caccagaaac ttccgtctgg gctgggaccc 14640
tgtgaccggg ctggtcatgc cgggggtcta caccaacgag gcctttcatc ccgatatagt 14700
gctcctgccc ggctgtgggg tggacttcac ccagagccgg ctgagcaacc tgctgggcgt 14760
tcgcaagcgg caacctttcc aggagggttt caagatcacc tatgaggatc tggagggggg 14820
caacattccc gcgctccttg atctggacgc ctacgaggag agcttgaaac ccgaggagag 14880
cgctggcgac agcggcgaga gtggcgagga gcaagccggc ggcggcggca gcgcgtcggt 14940
agaaaacgaa agtactcccg cagtggcggc ggacgctgcg gaggtcgagc cggaggccat 15000
gcagcaggac gcagaggagg gcgcgcagga ggacatgaac aatggggaga tcaggggcga 15060
cactttcgcc acccggggcg aagaaaaaga ggcagaggcg gcggcggcga cggcggaagc 15120
cgaaaccgag gcagaggcag agcccgagac cgaagttatg gaagacatga atgatggaga 15180
acgtaggggt gacacgtttg ccacccgggg cgaagagaag gcggcggagg cagaagccgc 15240
ggctgaggag gcggctgcgg ctgcggccaa ggctgaggct gcggctgagg ctaaggtcga 15300
agccgatgtt gcggttgagg ctcaggctga ggaggaggcg gcggctgaag cagttaagga 15360
aaaggcccag gcagagcagg aagagaaaaa acctgtcatt caacctctaa aagaagatag 15420
caaaaagcgc agttacaacg tcattgaggg cagcaccttt acccaatacc gcagctggta 15480
cctggcttac aactacggcg acccggtcaa gggggtgcgc tcgtggaccc tgctctgcac 15540
gccggacgtc acctgcggct ccgagcagat gtactggtcg ctgccaaaca tgatgcaaga 15600
cccggtgacc ttccgttcca cgcggcaggt tagcaacttt ccggtggtgg gcgccgaact 15660
gctgccagta cactccaaga gtttttacaa cgagcaggcc gtctactccc agctgatccg 15720
ccaggccacc tctctgaccc acgtgttcaa tcgctttccc gagaaccaga ttttggcgcg 15780
cccgccggcc cccaccatca ccaccgtcag tgaaaacgtt cctgccctca cagatcacgg 15840
gacgctaccg ctgcgcaaca gcatctcagg agtccagcga gtgaccatta ctgacgccag 15900
acgccggacc tgcccctacg tttacaaggc cttgggcata gtctcgccgc gcgtcctctc 15960
cagtcgcact ttttaaaaca catccaccca cacgctccaa aatcatgtcc gtactcatct 16020
cgcccagcaa caacaccggc tgggggctgc gcgcacccag caagatgttt ggaggggcaa 16080
ggaagcgctc cgaccagcac cccgtgcgcg tgcgcggcca ctaccgcgcg ccctggggtg 16140
cgcacaagcg cgggcgcaca gggcgcacca ctgtggatga tgtcattgac tccgtagtgg 16200
agcaggcgcg ccactacaca cccggcgcgc cgaccgcctc cgccgtgtcc accgtggacc 16260
aggcgatcga aagcgtggta cagggggcgc ggcactatgc caaccttaaa agtcgccgcc 16320
gccgcgtggc gcgccgccat cgccggagac cccgggctac tgccgccgcg cgccttacca 16380
aggctctgct caagcgcgcc aggcgaactg gccaccgggc cgccatgagg gccgcacggc 16440
gggctgccgc tgccgcgagc gccgtggccc cgcgggcacg aaggcgcgcg gccgctgccg 16500
ccgccgccgc catttccagc ttggcctcga cgcggcgcgg taacatatac tgggtgcgcg 16560
actcggtgag cggcacacgt gtgcccgtgc gctttcgccc cccacggaat tagcacaaga 16620
caacatacac actgagtctc ctgctgttgt gtatcccagc ggcgaccgtc agcagcggcg 16680
acatgtccaa gcgcaaaatt aaagaagaga tgctccaggt catcgcgccg gagatctatg 16740
ggcccccgaa gaaggaggag gaggattaca agccccgcaa gctaaagcgg gtcaaaaaga 16800
aaaagaaaga tgatgacgtt gacgaggcgg tggagtttgt ccgccgcatg gcgcccaggc 16860
gccctgtgca gtggaagggt cggcgcgtgc agcgagtcct gcgccccggc accgcggtgg 16920
tctttacgcc cggcgagcgt tccacgcgca ctttcaagcg ggtgtacgat gaggtgtacg 16980
gcgacgagga tctgttggag caggccaacc atcgatttgg ggagtttgca tatgggaaac 17040
ggcctcgcga gagtctaaaa gaggacctgc tggcgctacc gctggacgag ggcaatccca 17100
ccccgagtct gaagccggtg accctgcaac aggtgctgcc tttgagcgcg cccagcgagc 17160
agaagcgagg gttaaagcgc gagggcgggg acctggcacc caccgtgcag ttgatggtgc 17220
ccaagcggca gaagctggag gacgtgctgg agaaaatgaa agtagagccc gggatccagc 17280
ccgagatcaa ggtccgccct atcaagcagg tggcgcccgg cgtgggagtc cagaccgtgg 17340
acgttaggat tcccacggag gagatggaaa cccaaaccgc cactccctct tcggcagcaa 17400
gcgccaccac cggcgccgct tcggtagagg tgcagacgga cccctggcta cccgccgcca 17460
ctatcgccgt cgccgccgcc ccccgttcgc gcggacgcaa gagaaattat ccagcggcca 17520
gcgcgcttat gccccagtat gcgctgcatc catccatcgc gcccaccccc ggctaccgcg 17580
ggtactcgta ccgcccgcgc agatcagccg gcactcgcgg ccgccgccgc cgtgcgacca 17640
caaccagccg ccgccgtcgc cgccgccgcc agccagtgct gacccccgtg tctgtaagga 17700
aggtggctcg ctcggggagc acgctggtgg tgcccagagc gcgctaccac cccagcatcg 17760
tttaaagccg gtctctgtat ggttcttgca gatatggccc tcacttgtcg ccttcgcttc 17820
ccggtgccgg gataccgagg aagaactcac cgccgcaggg gcatggcggg cagcggtctc 17880
cgcggcggcc gtcgccatcg ccggcgcgca aagagcaggc gcatgcgcgg cggtgtgttg 17940
cccctgctgg tcccgctact cgccgcggcg atcggcgccg tgcccgggat cgcctccgtg 18000
gccctgcagg cgtcccagaa acattgactc ttgcaacctt gcaagcttgc atttttggag 18060
gaaaaaataa aaaagtctag actctcacgc tcgcttggtc ctgtgactat tttgtagaaa 18120
aaagatggaa gacatcaact ttgcgtcgct ggccccgcgt cacggctcgc gcccgttcat 18180
gggagactgg acagatatcg gcaccagcaa tatgagcggt ggcgccttca gctggggcag 18240
tctgtggagc ggccttaaaa attttggttc caccattaag aactatggca acaaagcgtg 18300
gaacagcagc acgggtcaga tgctgagaga caagttgaaa gagcagaact tccaggagaa 18360
ggtggcgcag ggcctggcct ctggcatcag cggggtggtg gacatagcta accaggccgt 18420
gcagaaaaag ataaacagtc atctggaccc ccgccctcag gtggaggaaa cgcctccagc 18480
catggagacg gtgtctcccg agggcaaagg cgaaaagcgc ccgcggcccg acagggaaga 18540
gaccctggtg tcacacaccg aggagccgcc ctcttacgag gaggcagtca aggccggcct 18600
gcccaccact cgccccatag ctcccatggc caccggtgtg gtgggtcaca ggcaacacac 18660
ccccgcaaca ctagatctgc ccccgccgtc cgagccgact cgccagccaa aggcggtgac 18720
ggtgtccgct ccctccactt ccgccgccaa cagagtgcct ctgcgccgcg ctgcgagcgg 18780
cccccgggcc tcgcgagtca gcggcaactg gcagagcaca ctgaacagca tcgtgggcct 18840
gggagtgagg agtgtgaagc gccgccgttg ctactgaatg agcaagctag ctaacgtgtt 18900
gtatgtgtgt atgcgtccta tgtcgccgcc agaggagctg ttgagccgcc ggcgccgtct 18960
gcactccagc gaatttcaag atggcgaccc catcgatgat gcctcagtgg tcgtacatgc 19020
acatctcggg ccaggacgct tcggagtacc tgagccccgg gctggtgcag ttcgcccgcg 19080
ccacagacac ctacttcaac atgagtaaca agttcaggaa ccccactgtg gcgcccaccc 19140
acgatgtgac cacggaccgg tcgcagcgcc tgacgctgcg gttcatcccc gtggatcggg 19200
aggacaccgc ttactcttac aaggcgcggt tcacgctggc cgtgggcgac aaccgcgtgc 19260
tggacatggc ctccacttac tttgacatcc ggggggtgct ggacaggggc cccactttta 19320
agccctactc gggcactgcc tacaaccccc tggcccccaa gggcgccccc aattcttgtg 19380
agtgggaaca agaggaaaat caggtggagg ctgcagatga ggacgtcgaa gatgaagaag 19440
cgcaagcaca agaggaagcc cctgttaaaa aaattcatgt atatgctcag gcgcctcttg 19500
ctggcgaaaa gattaccaag gatggtttgc aaataggtac tgaagtcgta ggagagacat 19560
ctaaggacac ttttgcagat aaaacattcc aacccgaacc tcagataggc gagtctcagt 19620
ggaacgaggc tgatgccgca gtagcaggag gtagagtttt gaaaaagact acccctatga 19680
gaccttgcta tggatcctat gccaggccta ccaatgccaa cgggggtcaa ggaattctgg 19740
ttgccaatga acaaggagtg atggagtcta aagtagaaat gcaatttttc tctaacacct 19800
caacccttaa tgcgcgggat ggaaccggca atcccgaacc aaaggtggtg ttgtacagcg 19860
aagatgtcca cttggaatct cccgatactc atctgtctta caagcccaaa aaggatgatg 19920
ttaatgccaa agtcatgttg ggtcagcaag ccatgcccaa cagacccaac ctcattggat 19980
ttagagataa tttcattggg cttatgtttt acaacagcac cggtaacatg ggagtgctgg 20040
cgggtcaggc ctctcagttg aatgctgtgg tggacttgca ggatagaaac acagaactgt 20100
catatcagct tatgcttgat tcaattgggg atagaaccag atacttctcc atgtggaacc 20160
aggcagtgga tagctatgat ccagatgtca gaattattga aaaccatggg gttgaggatg 20220
aactgcccaa ctactgcttc cctttgggcg gcataggaat tactgatact tatcaagggg 20280
tgaaaaatac caatggcaat ggtcagtgga ccaaagatga tcagttcgcg gaccgcaacg 20340
aaataggggt gggaaacaac ttcgccatgg agatcaacat ccaggccaac ctttggagaa 20400
acttcctcta tgcaaacgtg gggctctacc tgccagacaa gctcaagtac aaccccacca 20460
acgtggacat ctctgacaac cccaacacct atgactacat gaacaagcgg gtggtggccc 20520
ctggcctggt ggactgcttt gtcaatgtgg gagccaggtg gtccctggac tacatggaca 20580
acgtcaaccc cttcaaccac caccgcaatg cgggtctgcg ctaccgctcc atgatcctgg 20640
gcaacgggcg ctatgtgccc tttcacatcc aggtacccca gaagttcttt gccatcaaga 20700
acctcctgct cctgcccggc tcctacacct acgagtggaa cttcaggaag gatgtgaaca 20760
tggtcctaca gagctctctg ggcaatgacc ttagggtgga tggggccagc atcaagtttg 20820
acagcatcac cctctatgct acatttttcc ccatggccca caacaccgcc tccacgcttg 20880
aggccatgct gagaaacgac accaacgacc agtcctttaa tgactacctc tctggggcca 20940
acatgctcta cccaatccca gccaaggcca ccaacgtgcc catctccatc ccctctcgca 21000
actgggccgc ctttagaggc tgggccttta cccgccttaa gaccaaggag accccctccc 21060
tgggctcggg ttttgatccc tactttgttt actcgggatc catcccctac ctggatggca 21120
ccttctacct caaccacact ttcaagaaga tatccatcat gtatgactcc tccgtcagct 21180
ggccgggcaa cgaccgcttg ctcaccccca atgagttcga ggtcaagcgc gccgtggacg 21240
gcgagggcta caacgtggcc cagtgcaaca tgaccaagga ctggttcctg gtgcagatgc 21300
tggccaacta caacataggc taccagggct tttacatccc agagagctac aaggacagga 21360
tgtactcctt cttcagaaat ttccaaccca tgagccgaca ggtggtggac gagaccaatt 21420
acaaggacta tcaagccatt ggcatcaccc accagcacaa caactcgggt ttcgtgggct 21480
acctggcgcc caccatgcgc gagggtcagg cctaccccgc caacttcccc taccccttga 21540
taggcaagac cgcggtcgac agcgtcaccc agaaaaagtt cctctgcgac cgcaccctct 21600
ggcgcatccc cttctctagc aacttcatgt ccatgggtgc gctcacggac ctgggccaaa 21660
acctgcttta tgccaactct gcccatgcgc tggacatgac ttttgaggtg gaccccatgg 21720
acgagcccac ccttctctat attgtgtttg aagtgttcga cgtggtcaga gtgcaccagc 21780
cgcaccgcgg tgtcatcgag accgtgtacc tgcgtacgcc cttctcagcc ggcaacgcca 21840
ccacctaagg agacagcgcc gccgccgcct gcatgacggg ttccaccgag caagagctca 21900
gggccattgc cagagacctg ggatgcggac cctatttttt gggcacctat gacaaacgct 21960
tcccgggctt tatctcccga gacaagctcg cctgcgccat tgtcaacacg gccgcgcgcg 22020
agaccggggg cgtgcactgg ctggcctttg gctgggaccc gcgctccaaa acttgctacc 22080
tctttgaccc ctttggcttc tccgatcagc gcctcaggca gatttatgag tttgagtacg 22140
aggggctgct gcgccgcagc gcgctcgcct cctcgcccga ccgctgcatc acccttgaga 22200
agtccaccga aaccgtgcag gggccccact cggccgcctg cggtctcttc tgttgcatgt 22260
ttttgcacgc ctttgtgcac tggcctcaga gtcccatgga ttgcaacccc accatgaact 22320
tgctaaaggg agtgcccaac gccatgctcc agagccccca ggtccagccc accctgcgcc 22380
gcaaccagga acagctttac cgcttcctgg agcgccactc cccctacttc cgcagccaca 22440
gcgcgcgcat ccggggggcc acctcttttt gccacttgca agaaaacatg caagacggaa 22500
aatgatgtac agcatgcttt taataaatgt aaagactgtg cactttaatt atacacgggc 22560
tctttctggt tatttattca acaccgccgt cgccatttag aaatcgaaag ggttctgccg 22620
tgcgtcgccg tgcgccacgg gcagagacac gttgcgatac tggaagcggc tcgcccactt 22680
gaactcgggc accaccatgc ggggcagtgg ttcctcgggg aagttctcgc tccacagggt 22740
gcgggtcagc tgcagcgcgc tcaggaggtc gggagccgag atcttgaagt cgcagttggg 22800
gccggaaccc tgcgcgcgcg agttgcggta cacggggttg cagcactgga acaccagcag 22860
ggccggatta ttcacgctgg ccagcaggct ctcgtcgctg atcatgtcgc tgtccagatc 22920
ctccgcgttg ctcagggcga atggggtcat cttgcagacc tgcctgccca ggaaaggcgg 22980
gagcccaggc ttgccgttgc agtcgcagcg caggggcatt agcaggtgcc cacggcccga 23040
ctgcgcctgc gggtacaacg cgcgcatgaa ggcttcgatc tgcctaaaag ccacctgggt 23100
cttggctccc tccgaaaaga acatcccaca ggacttgctg gagaactggt tcgcgggaca 23160
gctggcatcg tgcaggcagc agcgcgcgtc agtgttggca atctgcacca cgttgcgacc 23220
ccaccggttt ttcactatct tggccttgga agcctgctcc tttagcgcgc gctggccgtt 23280
ctcgctggtc acatccatct ctatcacctg ttccttgttg atcatgtttg tcccgtgcag 23340
acactttagg tcgccctccg tctgggtgca gcggtgctcc cacagcgcgc aaccggtggg 23400
ctcccaattc ttgtgggtca cccccgcgta ggcctgcagg taggcctgca ggaagcgccc 23460
catcatggtc ataaaggtct tctggctcgt aaaggtcagc tgcaggccgc gatgctcttc 23520
gttcagccag gtcttgcaga tggcggccag cgcctcggtc tgctcgggca gcatcttaaa 23580
atttgtcttc aggtcgttat ccacgtggta cttgtccatc atggcacgcg ccgcctccat 23640
gcccttctcc caggcggaca ccatgggcag gcttaggggg tttatcactt ccagcggcga 23700
ggacaccgta ctttcgattt cttcttcctc cccctcttcc cggcgcgcgc ccccgctgtt 23760
gcgcgctctt accgcctgca ccaaggggtc gtcttcaggc aagcgccgca ccgagcgctt 23820
gccgcccttg acctgcttga tcagtaccgg cgggttgctg aagcccacca tggtcagcgc 23880
cgcctgctct tcttcgtctt cgctgtctac cactatttct ggggaggggc ttctccgctc 23940
tgcggcaaag gcggcggatc gcttcttttt tttcttggga gccgccgcga tggagtccgc 24000
cacggcgacc gaggtcgagg gcgtggggct gggggtgcgc ggtaccaggg cctcgtcgcc 24060
ctcggactct tcctctgact ccaggcggcg gcggagtcgc ttctttgggg gcgcgcgcgt 24120
cagcggcggc ggagacgggg acggggacgg ggacgggacg ccctccacag ggggtggtct 24180
tcgcgcagac ccgcggccgc gctcgggggt cttctcgcgc tggtcttggt cccgactggc 24240
cattgtatcc tcctcctcct aggcagagag acataaggag tctatcatgc aagtcgagaa 24300
ggaggagagc ttaaccaccc cctcagagac cgccgatgcg cccgccgtcg ccgtcgcccc 24360
cgctaccgcc gacgcgcccg ccacaccgag cgacaccccc acggaccccc ccgccgacgc 24420
acccctgttc gaggaagcgg ccgtggagca ggacccgggc tttgtctcgg cagaggagga 24480
tttgcaagag gaggagaata aggaggagaa gccctcagtg ccaaaagatc ataaagagca 24540
agacgagcac gacgcagacg cacaccaggg tgaagtcggg cggggggacg gagggcatgg 24600
cggcgccgac tacctagacg aaggaaacga cgtgctcttg aagcacctgc atcgtcagtg 24660
cgccatcgtc tgcgacgctc tgcaggagcg cagcgaggtg cccctcagcg tggcggaggt 24720
cagccgcgcc tacgagctca gcctcttttc cccccgggtg cccccccgcc gccgcgaaaa 24780
cggcacatgc gagcccaacc cgcgcctcaa cttctacccc gcctttgtgg tgcccgaggt 24840
cctggccacc tatcacatct tctttcaaaa ttgcaagatc cccatctcgt gccgcgccaa 24900
ccgtagccgc gccgataaga tgctggccct gcgccagggc gaccacatac ctgatatcgc 24960
cgctttggaa gatgtgccaa agatcttcga gggtctgggg cgcaacgaga agcgggcagc 25020
aaactctctg caacaggaaa acagcgaaaa tgagagtcac actggagcgc tggtggagct 25080
ggagggcgac aacgcccgcc tggcggtgct caagcgcagc atcgaggtca cccactttgc 25140
ctaccccgcg ctcaacctgc cccccaaagt catgaacgcg gtcatggacg ggctgatcat 25200
gcgccgcggc cggcccctcg ctccagatgc aaacttgcat gaggagaccg aggacggtca 25260
gcccgtggtc agcgacgagc agctgacgcg ctggctggag agcgcggacc ccgccgaact 25320
ggaggagcgg cgcaagatga tgatggccgc ggtgctggtc accgtagagc tggagtgtct 25380
gcagcgcttc ttcggtgacc ccgagatgca gagaaaggtc gaggagaccc tacactacac 25440
cttccgccag ggctacgtgc gccaggcttg caagatctcc aacgtggagc tcagcaacct 25500
ggtgtcctac ctgggcatct tgcatgaaaa ccgccttggg cagagcgtgc tacactccac 25560
cctgcgcggg gaggcgcgcc gcgactacgt gcgcgactgc gtttacctct tcctctgcta 25620
cacctggcag acggccatgg gggtctggca gcagtgcctg gaggagcgca acctcaagga 25680
gctggagaag cttctgcagc gcgcgctcaa agacctctgg acgggcttca acgagcgctc 25740
ggtggccgcc gcgctagccg acctcatctt ccccgagcgc ctgctcaaaa ccctccagca 25800
ggggctgccc gacttcacca gccaaagcat gttgcaaaat tttaggaact ttatcctgga 25860
gcgttctggc atcctacccg ccacctgctg cgccctgccc agcgactttg tccccctcgt 25920
gtaccgcgag tgccccccgc cgctgtgggg ccactgctac ctgttccaac tggccaacta 25980
cctgtcctac cacgcggacc tcatggagga ctccagcggc gaggggctca tggagtgcca 26040
ctgccgctgc aacctctgca cgccccaccg ctccctggtc tgcaacaccc aactgctcag 26100
cgagagtcag attatcggta ccttcgagct acagggtccg tcctcctcag acgagaagtc 26160
cgcggctccg gggctaaaac tcactccggg gctgtggact tccgcctacc tgcgcaaatt 26220
tgtacctgaa gactaccacg cccacgaaat caggttttac gaggaccaat cccgcccgcc 26280
caaggcggag ctgaccgcct gcgtcatcac ccagggcgag atcctaggcc aattgcaagc 26340
catccaaaaa gcccgccaag agtttttgct gaagaggggt cggggggtgt atctggaccc 26400
ccagtcgggt gaggagctca acccggttcc cccgctgcca ccgccgcggg accttgcttc 26460
ccaggataag catcgccatg gctcccagaa agaagcagca gcggccgccg ctgccgccgc 26520
cccacatgct ggaggaagag gaggaatact gggacagtca ggcagaggag gtttcggacg 26580
aggaggagcc ggagacggag atggaagagt gggaggagga cagcttagac gaggaggctt 26640
ccgaagccga agaggcaggc gcaacaccgt caccctcggc cgcagccccc tcgcaggcgc 26700
ccccgaagtc cgctcccagc atcagcagca acagcagcgc tataacctcc gctcctccac 26760
cgccgcgacc cacggccgac cgcagaccca accgtagatg ggacaccacc ggaaccgggg 26820
ccggtaagtc ctccgggaga ggcaagcaag cgcagcgcca aggctaccgc tcgtggcgcg 26880
ctcacaagaa cgccatagtc gcttgcttgc aagactgcgg ggggaacatc tccttcgccc 26940
gccgcttcct gctcttccac cacggtgtgg ccttcccccg taacgtcctg cattactacc 27000
gtcatctcta cagcccctac tgcggcggca gtgagccaga ggcggccagc ggcggcggcg 27060
cccgtttcgg tgcctaggaa gacccagggc aagacttcag ccaagaaact cgcggcgacc 27120
gcggcgaacg cggtcgcggg ggccctgcgc ctgacggtga acgaacccct gtcgacccgc 27180
gaactgagga accgaatctt ccccactctc tatgccatct tccagcagag cagagggcag 27240
gatcaggaac tgaaagtaaa aaacaggtct ctgcgctccc tcacccgcag ctgtctgtat 27300
cacaagagcg aagaccagct tcggcgcacg ctggaggacg ctgaggcact cttcagcaaa 27360
tactgcgcgc tcactcttaa ggactagctc cgcgcccttc tcgaatttag gcgggaacgc 27420
ctacgtcatc gcagcgccgc cgtcatgagc aaggacattc ccacgccata catgtggagc 27480
tatcagccgc agatgggact cgcggcgggc gcctcccaag actactccac ccgcatgaac 27540
tggctcagtg ccggcccaca catgatctca caggttaatg acatccgcac ccatcgaaac 27600
caaatattgg tgaagcaggc ggcaattacc accacgcccc gcaataatcc caaccccagg 27660
gagtggcccg cgtccctggt gtatcaggaa attcccggcc ccaccaccgt actacttccg 27720
cgtgattccc aggccgaagt ccaaatgact aactcagggg cacagctcgc gggcggctgt 27780
cgtcacaggg tgcggcctcc tcgccagggt ataactcacc tggagatccg aggcagaggt 27840
attcagctca acgacgagtc ggtgagctcc tcgctcggtc tcagacctga cgggaccttc 27900
cagatagccg gagccggccg atcttccttc acgccccgcc aggcgtacct gactctgcag 27960
agctcgtcct cggcgccgcg ctcgggcggc atcgggactc tccagttcgt gcaggagttt 28020
gtgccctcgg tctacttcaa ccccttctcg ggctctcccg gtcgctaccc ggaccagttt 28080
atcccgaact ttgacgccgc gagggactcg gtggacggct acgactgaat gtcgggtgga 28140
cccggtgcag agcaacttcg cctgaagcac cttgaccact gccgccgccc tcagtgcttt 28200
gcccgctgtc agaccggtga gttccagtac ttttccctgc ccgactcgca cccggacggc 28260
ccggcgcacg gggtgcgctt tttcatcccg agtcaggtcc gctctaccct aatcagggag 28320
ttcaccgccc gtcccctact ggcggagttg gaaaaggggc cttctatcct aaccattgcc 28380
tgcatttgct ctaaccctgg attacaccaa gatctttgct gtcatttgtg tgctgagtat 28440
aataaaggct gagatcagaa tctactcggg ctcctgtcgc catcctgtca acgccaccgt 28500
ccaagcccgg cccgatcagc ccgaggtgaa cctcacctgt ggtctgcacc ggcgcctgag 28560
gaaataccta gcttggtact acaacagcac tccctttgtg gtttacaaca gctttgacca 28620
ggacggggtc tcactgaggg ataacctctc gaacctgagc tactccatca ggaagaacaa 28680
caccctcgag ctacttcctc cttacctgcc cgggacttac cagtgtgtca ccggcccctg 28740
cacccacacc cacctgttga tcgtaaacga ctctcttccg agaacagacc tcaataactc 28800
ctctccgcag ttccccagaa caggaggtga gctcaggaaa ccccgggtaa agaagggtgg 28860
acaagagtta acacttgtgg ggtttctggt atatgtgacg ctggtggtgg ctcttttgat 28920
taaggctttt ccttccatgt ctgaactatc cctcttcttt tatgaacaac tcgactagtg 28980
ctaacgggac cctacccaac gaatcgggat tgaatatcgg taaccaggtt gcagtttcac 29040
ttttgattac cttcatagtc ctcttcctgc tagtgctgtc gcttctgtgc ctgcggatcg 29100
ggggctgctg catccacgtt tatatctggt gctggctgtt tagaaggttc ggagaccacc 29160
gcaggtagaa taatgctgct taccctcttt gtcctggcgc tggctgccag ctgccaagcc 29220
ttttccgagg ctgacttcat agagccccag tgcaatatca cttataaatc tgaacgtgcc 29280
atctgtacta ttctaatcaa atgtgttact caacacgata aggtgactgt taaatacaaa 29340
gatcaattaa aaaaagacgc actttacagc agctggcaac caggagatga tcaaaaatac 29400
aatgtaaccg tcttccaggg caaactctcc aaaacttaca attacaattt cccatttgag 29460
cagatgtgtg actttgtcat gtacatggaa aagcagtaca agctgtggcc tccaactccc 29520
cagggctgtg tggaaaatcc aggctctttc tgtatgatct ctctctgtgt aactgtgctg 29580
gcactaatac tcacgcttct gtatctcaga tttaaatcaa ggcaaagctt cattgatgaa 29640
aagaaaatgc cataatcgct caacgcttga ttgctaacac cgggttttta tccgcagaat 29700
gattggaatc accctactaa tcacctccct ccttgcgatt gcccatgggt tggaacgaat 29760
cgaagtccct gtgggggcca atgttaccct ggtggggcct gtcggcaatg ctacattaat 29820
gtgggaaaaa tatactaaaa atcaatgggt ttcttactgc actaacaaaa acagccacaa 29880
gcccagagcc atctgcgatg ggcaaaatct aaccttgatt gatgttcaat tgctggatgc 29940
gggctactat tatgggcagc tgggtacaat gattaattac tggagacccc acagagatta 30000
catgcttcac gtagtaaagg gtcccattag cagcccaacc accacctcta ccacacccac 30060
taccaccact actcccacca ccagcactgc cgcccagcct cctcatagca gaacaaccac 30120
ttttatcaat tccaagtccc actcccccca cattgccggc gggccctccg cctcagactc 30180
cgagaccacc gagatctgct tctgcaaatg ctctgacgcc attgcccagg atttggaaga 30240
tcacgaggaa gatgagcatg actacgcaga tgcatgccag gcatcagagg cagaagcgct 30300
accggtggcc ctaaaacagt atgcagactc ccacaccacc cccaaccttc ctccaccttc 30360
ccagaagcca agtttcctgg gggaaaatga aactctgcct ctttccatac tagctctgac 30420
atctgttgct attttggccg ctctgctggt gcttctatgc tctatatgct acctgatctg 30480
ctgcagaaag aaaaaatctc acggccatgc tcaccagccc ctcatgcact tcccttaccc 30540
tccagagctg ggcgaccaca aactttaagt ctgcagtagc tatctgccca tcccttgtca 30600
gtcgacagcg atgagcccca ctaatctaac agcctctgga cttacaacat tgtctcttaa 30660
tgagaccacc gctcctcaag acctgtacga tggtgtctcc gcgctggtta accagtggga 30720
tcacctgggc atatggtggc tcctcatagg agcagtgacc ctgtgcctaa tcctggtctg 30780
gatcatctgc tgcatcaaaa gcagaagacc caggcggcgg cccatctaca ggcccttcgt 30840
catcacacct gaagataatg atgatgatga caccacctcc aggctgcaga gcctaaagca 30900
gctactcttc tcttttacag catggtaaat tgaatcatgc cccgcatttt catctacttg 30960
cttctccttc cactttttct gggctcctct acattggcca ctgtgtccca catcgaggta 31020
gactgcctca cgcccttcac agtctacctg cttttcggct ttgtcatctg cacctttgtc 31080
tgcagcgtta tcactgtagt gatctgcttc atacagtgca tcgactacat ctgtgtgcgg 31140
gtggcctact ttagacacca cccccagtat cgcaacaggg acatagcggc tctcctaaga 31200
cttgtttaaa tcatggccaa attacctgtg attggtcttc tgattatctg ctgcgtccta 31260
gccgcgattg ggactcaacc taataccacc accagcgctc ccagaaagag acatgtatcc 31320
tgcagcttca agcgtccctg gaatataccc caatgcttta ctgatgaacc tgaaatctct 31380
ttggcttggt acttcagcgt caccgccctt ctcatcttct gcagtacggt tattgctctt 31440
gccatctacc cttcccttaa cctgggctgg aatgctgtca actctatgga atatcccacc 31500
ttcccagaac cagacctgcc agacctggtt gttctaaacg cgtttcctcc tcctccagtt 31560
caaaatcagt ttcgccctcc gtcccctacg cccactgagg tcagctactt taatctaaca 31620
ggcggagatg actgaaaacc tagacctaga aatggacggt ctctgcagcg agcaacgcac 31680
actagagagg cgccggcaaa aagcagagct cgagcgtctt aaacaagagc tccaagacgc 31740
cgtggccata caccagtgca aaaaagggct cttctgtctg gtaaaacagg ccacgctcac 31800
ctatgaaaaa acaggtgaca cccaccgcct aggatacaag ctgcccacac agcgccaaaa 31860
gtttgccctt atgataggtg aacaacccat caccgtcacc cagcactccg tggagacaga 31920
aggctgcatt catgctccct gcaggggcgc tgactgcctc tacaccttga tcaaaaccct 31980
ctgcggtctc agagacctta tccctttcaa ttgatcataa ctgtaatcaa taaaaaatca 32040
cttacttgaa atctgatagc aagactctgt ccaatttttt cagcaacact tccttcccct 32100
cctcccaact ctggtactct aggcgcctcc tagctgcaaa cttcctccac agtctgaagg 32160
gaatgtcaga ttcctcctcc tgtccctccg cacccacgat cttcatgttg ttacagatga 32220
aacgcgcgag atcgtctgac gagaccttca accccgtgta cccctacgat accgagatcg 32280
ctccgacttc tgtccctttc cttacccctc cctttgtatc atccgcagga atgcaagaaa 32340
atccagctgg ggtgctgtcc ctgcacctgt cagagcccct taccacccac aatggggccc 32400
tgactctaaa aatggggggc ggcctgaccc tggacaagga agggaatctc acttcccaaa 32460
acatcaccag tgtcgatccc cctctcaaaa aaagcaagaa caacatcagc cttcagaccg 32520
ccgcacccct cgccgtcagc tccggggccc taaccctttt tgccactccc cccctagcgg 32580
tcagtggcga caaccttact gtgcagtctc aggcccctct tactttggaa gactcaaaac 32640
taactctggc caccaaagga cccctaactg tgtccgaagg caaacttgtc ctagaaacag 32700
agcctcccct gcatgcaagt gacagcagta gcctgggcct tagcgtcacg gccccactta 32760
gcattaacaa tgacagccta ggactagaca tgcaagcgcc catcagctct cgagatggaa 32820
aactggctct aacagtggcg gcccccctaa ctgtggccga gggtatcaat gctttggcag 32880
tagccacagg taatggtatt ggactaaatg aaaccaacac acacctgcag gcaaaactgg 32940
tcgcgcccct aggctttgat accaacggca acattaagct aagcgtcgca ggaggcatga 33000
ggctaaacaa taacacactg atactagatg taaactaccc atttgaggct caaggccaac 33060
tgagcctaag agtgggctcg ggcccactat atgtagattc tagtagtcat aacctaacca 33120
ttagatgcct taggggattg tatgtaacat cttctaacaa ccaaaacggt ctagaggcca 33180
acattaaact aacaaaaggc cttgtgtatg acggaaatgc catagcagtt aatgttggca 33240
aagggctgga atacagccct actggcacaa cagaaaaacc tatacagact aaaataggtc 33300
taggcatgga gtatgacact gagggagcca tgatgacaaa actaggctct ggactaagct 33360
ttgacaattc aggagccatt gtggtgggaa acaaaaatga tgacaggctt actttgtgga 33420
ccacaccgga cccatcgccc aactgtcaga tttactctga aaaagatgct aaactaacct 33480
tggtactgac taaatgtggc agtcaggttg taggcacagt atctattgcc gctcttaaag 33540
gtagccttgt gccaatcact agtgcaatca gtgtggttca gatataccta aggtttgatg 33600
aaaatggggt gctgatgagt aactcttcac ttaatggcga atactggaat tttagaaacg 33660
gagactcaac taatggcaca ccatatacaa acgcagtggg ttttatgcct aatctactgg 33720
cctatcctaa aggtcaaact acaactgcaa aaagtaacat tgtcagccag gtctacatga 33780
acggggacga tactaaaccc atgacattta caatcaactt caatggcctt agtgaaacag 33840
gggatacccc tgtcagtaaa tattccatga cattctcatg gaggtggcca aatggaagct 33900
acatagggca caattttgta acaaactcct ttactttctc ctacatcgcc caagaataaa 33960
gaaagcacag agatgcttgt ttttgatttc aaaattgtgt gcttttattt attttcaagc 34020
ttacagtatt tccagtagtc attagaatag agcttaatta aactgcatga gaacccttcc 34080
acatagctta aattatcacc agtgcaaatg gaaaaaaatc aacatacctt tttatccaga 34140
tatcaaagaa ctctagtggt cagttttccc ccaccctccc agctcacaga atacacagtc 34200
ctttcccccc ggctggcttt aaacaacact atctcattgg taacagacat atttttaggt 34260
gtaataatcc acacggtctc ttggcgggcc aaacgctggt ctgtgatgtt aataaactcc 34320
ccaggcagct ctttcaagtt cacgtcgctg tccaactgct gaagcgctcg cggctccgac 34380
tgcgcctcta gcggaggcaa cggcagcacc cgatccttga tctataaagg agtagagtca 34440
taatccccca taagaatagg gcggtgatgc agcaacaagg cgcgcagcaa ctcctgccgc 34500
cgcctctccg tacgacagga atgcaacggg gtggtggtct cctccgcgat aatccgcacc 34560
gctcgcagca tcagcatcct cgtcctccgg gcacagcagc gcatcctgat ctcactgaga 34620
tcggcgcagt aagtgcagca caacaccaag atgttattta agatcccaca gtgcaaagca 34680
ctgtacccaa agctcatggc gggaaggaca gcccccacgt gaccatcgta ccagatcctc 34740
aggtaaatca aatgacgacc tctcataaac acgctggaca tatacatcac ctccttgggc 34800
atgagctgat tcaccacctc tcgataccac aggcatcgct gattaattaa agacccctcg 34860
agcaccatcc tgaaccagga agccagcacc tgaccccccg ccaggcactg cagggacccc 34920
ggtgaatcgc agtggcagtg aagactccag cgctcgtagc cgtgaaccat agagctggtc 34980
attatatcca cattggcaca acacagacac actttcatac actttttcat gattagcagc 35040
tcctctctag tcaagaccat atcccaagga atcacccact cttgaatcaa ggtaaatccc 35100
acacagcagg gcaggcctct cacataactc acgttatgca tagtgagcgt gtcgcaatct 35160
ggaaataccg gatgatcttc catcaccgaa gcccgggtct ccgtctcaaa gggaggtaaa 35220
cggtccctcg tgtagggaca gtggcgggat aatcgagatc gtgttgaacg tagagtcatg 35280
ccaaagggaa cagcggacgt actcatattt cctccagcag aaccaagtgc gcgcgtggca 35340
gctatccctg cgtcttctgt ctcgccgcct gccccgctcg gtgtagtagt tgtaatacag 35400
ccactccctc agaccgtcaa ggcgctccct ggcgtccgga tctataacaa caccgtcctg 35460
cagcgccgcc ctgatgacat ccaccaccgt agagtatgcc aagcccagcc acgaaatgca 35520
ctcactttga cagcgagaga taggaggagc gggaagagat ggaagaacca tgatagtaaa 35580
agaactttta ttccaatcga tcctctacaa tgtcaaagtg tagatctatc agatggcact 35640
ggtctcctcc gctgagtcga tcaaaaataa cagctaaacc acaaacaaca cgattggtca 35700
aatgctgcac aagggcttgc agcataaaat cgcctcgaaa gtccaccgca agcataacat 35760
caaagccacc gcccctatca tgatctatga taaaaacccc acagctatcc accagaccca 35820
tatagttttc atctctccat cgtgaaaaaa tatttacaag ctcctccttt aaatcacctc 35880
caaccaattc aaaaagttga gccagaccgc cctccacctt cattttcagc atgcgcatca 35940
tgattgcaaa aattcaggct cctcagacac ctgtataaga ttgagaagcg gaacgttaac 36000
atcaatgttt cgctcgcgaa gatcgcgcct cagtgcaagc atgatataat cccacaggtc 36060
ggagcggatc agcgaggaca tctccccgcc aggaaccaac tcaacggagc ctatgctgat 36120
tataatacgc atattcgggg ctatgctaac cagcacggcc cccaaatagg cgtactgcat 36180
aggcggcgac aaaaagtgaa cagtttgggt taaaaaatca ggcaaacact cgcgcaaaaa 36240
agcaagaaca tcataaccat gctcatgcaa atagatgcaa gtaagctcag gaacgaccac 36300
agaaaaatgc acaatttttc tctcaaacat gactgcgagc cctgcaaaaa ataaaaaaga 36360
aacattacac aagagtagcc tgtcttacaa tgggatagac tactctaacc aacataagac 36420
gggccacgac atcgcccgcg tggccataaa aaaaattatc cgtgtgatta aaaagaagca 36480
cagatagctg gccagtcata tccggagtca tcacgtgcga acccgtgtag acccccgggt 36540
tggacacatc ggccaaacaa agaaagcggc caatgtatcc cggaggaatg ataacactaa 36600
gacgaagata caacagaata accccatggg ggggaataac aaagttagta ggtgaataaa 36660
aacgataaac acccgaaact ccctcctgcg taggcaaaat agcgccctcc ccttccaaaa 36720
caacatacag cgcttccaca gcagccatga caaaagactc aaaacactca aaagactcag 36780
tcttaccagg aaaataaaag cactctcaca gcaccagcac taatcagagt gtgaagaggg 36840
ccaagtgccg aacgagtata tataggaatt aaaaatgacg taaatgtgta aaggtcaaaa 36900
aacgcccaga aaaatacaca gaccaacgcc cgaaacgaaa acccgcgaaa aaatacccag 36960
aagttcctca acaaccgcca cttccgcttt cccacgatac gtcacttcct caaaaatagc 37020
aaactacatt tcccacatgt acaaaaccaa aacccctccc cttgtcaccg cccacaactt 37080
acataatcac aaacgtcaaa gcctacgtca cccgccccgc ctcgccccgc ccacctcatt 37140
atcatattgg cctcaatcca aaataaggta tattattgat gatg 37184
<210> 6
<211> 37172
<212> DNA
<213> Great Ape Adenovirus
<400> 6
catcatcaat aatatacctt attttggatt gtggccaata tgataatgag gtgggcgggg 60
cgggtgacgt aggacgcgcg agtagggttg ggaggtgtgc ggaagtgtgg catttgcaag 120
tgggaggagc tcacatgtaa gcttccgtcg cggaaaatgt gacgttttta atgagcgccg 180
cctacctccg gaagtgccaa ttttcgcgcg cttttcaccg gatatcgtag taattttggg 240
cgggaccatg taagatttgg ccattttcgc gcgaaaagtg aaacggggaa gtgaaaactg 300
aataataggg cgttagtcat agcgcgtaat atttaccgag ggccgaggga ctttgaccga 360
ttacgtggag gactcgccca ggtgtttttt acgtgaattt ccgcgttccg ggtcaaagtc 420
tccgttttta ttgtcaccgt catctgacgc ggagggtatt taaacccgct gcgctcctaa 480
agaggccact cttgagtgcc agcgagaaga gttttctcct ccgctccgtt tcggcgatcg 540
aaaaatgaga cacttagcct gcactccggg tcttttgtcc ggccgggcgg cgtccgagct 600
tttggacgct ttgctcaatg aggttctgag cgatgatttt ccgtctacta cccactttag 660
cccacctact cttcacgaac tgtacgatct ggatgtactg gtggatgtga acgatcccaa 720
cgaggaggcg gtttctacgt tttttcccga gtctgcgctt ttggccgccc aggagggatt 780
tgacctacac actccgccgc tgcctatttt agagtctccg ctgccggagc ccagtggtat 840
accttatatg cctgaactgc ttcccgaagt ggtagacctg acctgccacg agccgggctt 900
tccgcccagc gacgatgagg gtgagccttt tgctttagac tatgctgaga tacctgggct 960
cggttgcagg tcttgtgcat atcatcagag ggttaccgga gaccccgagg ttaagtgttc 1020
gctgtgctat atgaggctga cctcttcctt tatctacagt aagttttttg tgtaggtggg 1080
ctttttgggt aggtgggttt tgtggcagga caggtgtaaa tgttgcttgt gttttttgta 1140
cctgcaggtc cggtgtccga gccagacccg gagcccgacc gcgatcccga gccggatccc 1200
gagcctcctc gcagggcaag gaaattacct tccattttgt gcaagcctaa gacacctgtg 1260
aggaccagcg aggcggacag cactgactct ggcacttcta cctctcctcc tgaaattcac 1320
ccagtggttc ctttgggtat acataaacct gttgctatta gagtttgcgg gcgacgccct 1380
gcagtagagt gcattgagga cttgcttaac gatcccgagg gacctttgga cttgagcatt 1440
aaacgcccta ggcaataaac cccacctaag taataaaccc cacctaagta ataaacttta 1500
ccgcccttgg ttattgagat gacgcccaat gtttgctttt gaatgacttc atgtgtataa 1560
taaaagtgag tgtggtcata ggtctcttgt ttgtctgggc ggggcttaag ggtatataag 1620
tttctcgggg ctaaacttgg ttacacttga ccccaatgga ggcgtggggg tgcttggagg 1680
agtttgcgga cgtgcgccgt ttgctggacg agagctctag caatacctat agtatttgga 1740
ggtatctgtg gggctctact caggccaagt tggtctccag aattaagcag gattacaagt 1800
gcgattttga agagcttttt agttcctgtg gtgagctttt gcaatccttg aatctgggcc 1860
accaggctat cttccaggaa aaggttctct cgactttgga tttttccact cccgggcgca 1920
ccgccgcttg tgtggctttt gtgtcttttg tgcaagataa atggagcggg gagacccacc 1980
tgagtcacgg ctacgtgctg gatttcatgg cgatggctct ttggagggct tacaacaaat 2040
ggaagattca gaaggaactg tacggttccg ccctacgtcg tccacttctg cagcggcagg 2100
ggctgatgtt tcccgaccat cgccagcatc agaatctgga agacgagtcg gaggagcgag 2160
cggagaagat cagcttgaga gccggcctgg accctcctca ggaggaatga atctcccgca 2220
ggtggttgac ctgtttcccg aactgagacg ggtcctgact atcagggaag atggtcagtt 2280
tgtgaagaag ctgaagaggg atcggggtga gggagatgat gaggcggcta gcaatttagc 2340
ttttagtctg ataacccgcc accgaccgga atgtattacc tatcagcaga ttaaggagag 2400
ttgtgccaac gagctggatc ttttgggtca gaagtatagc atagaacagc ttaccactta 2460
ctggcttcag cccggggatg attgggaaga ggcgatcagg gtgtatgcaa aggtggccct 2520
gcggcccgat tgcaagtata agattactaa gttggttaat attagaaact gctgctatat 2580
ttctgggaac ggggccgaag tggagataga tactgaggac agggtggcta ttaggtgttg 2640
catgataaac atgtggcccg ggatactggg gatggatggg gtgatattta tgaatgtaag 2700
gttcacgggc cccaacttta atggtacggt gttcatgggc aacaccaact tgctcctgca 2760
tggtgcgagt ttctatgggt ttaacaacac ctgtatagag gcctggaccg atgtaaaggt 2820
tcgaggttgt tccttttata gctgttggaa ggcggtggtg tgtcgcccta aaagcagggg 2880
ttctgtgaag aaatgcttgt ttgaaaggtg caccctaggt atcctttctg agggcaactc 2940
cagggtgcgc cataatgtgg cttcgaactg cggttgcttc atgcaagtga agggggtgag 3000
cgttatcaag cataactcgg tctgtggaaa ctgcgaggat cgcgcctctc agatgctgac 3060
ctgctttgat ggcaactgtc acctgttgaa gaccattcat ataagcagtc accccagaaa 3120
ggcctggccc gtgtttgagc ataacattct gacccgctgt tccttgcatc tgggggtcag 3180
gaggggtatg ttcctgcctt accagtgtaa cttcagccac actaaaatcc tgctggaacc 3240
cgagtgcatg actaaggtca gcctgaatgg tgtgtttgat gtgagtctga agatttggaa 3300
ggtgctgagg tatgatgaga ccaggaccag gtgccgaccc tgcgagtgcg gcggcaagca 3360
catgagaaat cagcctgtga tgttggatgt gaccgaggag cttaggcctg accatctggt 3420
gctggcctgc accagggccg agtttgggtc tagcgatgag gataccgatt gaggtgggta 3480
aggtgggcgt ggctagcagg gtgggcgtgt ataaattggg ggtctaaggg gtctctctgt 3540
ttgtcttgca acagccgccg ccatgagcga caccggcaac agctttgatg gaagcatctt 3600
tagcccctat ctgacagtgc gcatgcctca ctgggccgga gtgcgtcaga atgtgatggg 3660
ttccaacgtg gatggacgtc ccgttctgcc ttcaaattcg tctacgatgg cctacgcgac 3720
cgtgggagga actccgttgg acgccgcgac ctccgccgcc gcctccgccg ccgccgcgac 3780
cgcgcgcagc atggctacgg acctttacag ctctttggtg gcgagcagcg cggcctctcg 3840
cgcgtctgct cgggatgaga aactgactgc tctgctgctt aaactggaag acttgacccg 3900
ggagctgggt caactgaccc agcaggtctc cagcttgcgt gagagcagcc ttgcctcccc 3960
ctaatggccc ataatataaa taaaagccag tctgtttgga ttaagcaagt gtatgttctt 4020
tatttaactc tccgcgcgcg gtaagcccgg gaccagcggt ctcggtcgtt tagggtgcgg 4080
tggattcttt ccaacacgtg gtacaggtgg ctctggatgt ttagatacat gggcatgagt 4140
ccatccctgg ggtggaggta gcaccactgc agagcttcgt gctcgggggt ggtgttgtat 4200
atgatccagt cgtagcagga gcgctgggcg tggtgctgaa aaatgtcctt aagcaagagg 4260
cttatagcta gggggaggcc cttggtgtaa gtgtttacaa atctgctcag ctgggagggg 4320
tgcatccggg gggatatgat gtgcatcttg gactggattt ttaggttggc tatgttccca 4380
cccagatccc ttctgggatt catgttgtgc aggaccacca gcacggtata tccagtgcac 4440
ttgggaaatt tatcgtggag cttagacggg aatgcatgga agaacttgga gacgcccttg 4500
tggcctccca gattttccat acattcgtcc atgatgatgg caatgggccc gtgggaagct 4560
gcctgagcaa aaacgtttct gggatcgctc acatcgtagt tatgttccag ggtgaggtca 4620
tcataggaca tctttacgaa tcgggggcgg agggtcccgg actgggggat gatggtaccc 4680
tcgggccccg gggcgtagtt cccctcacag atctgcatct cccaggcttt catttcagag 4740
ggagggatca tatccacctg cggggcgatg aaaaagacag tttctggcgc aggggagatt 4800
aactgggatg agagcaggtt tctgagcagc tgtgactttc cacagccggt gggcccatat 4860
atcacgccta tcaccggctg cagctggtag ttaagagagc tgcagctgcc gtcctcccgg 4920
agcagggggg ccacctcgtt gagcatatcc ctgacgtgga tgttttccct gaccagttcc 4980
gccagaaggc gctcgccgcc cagcgaaagc agctcttgca aggaagcaaa atttttcagc 5040
ggtttcaggc catcggccgt gggcatgttt ttcagcgtct gggtcagcag ctccagcctg 5100
tcccagagct cggtgatgtg ctctacggca tctcgatcca gcagatctcc tcgtttcgcg 5160
ggttggggcg gctttcgctg tagggcacca gccgatgggc gtccagcggg gccagagtca 5220
tgtccttcca tgggcgcaga gtcctcgtca gggtggtctg ggtcacggtg aaggggtgcg 5280
ctccgggttg ggcgctggcc agggtgcgct tgaggctggt tctgctggtg ctgaatcgct 5340
gccgctcttc gccctgcgcg tcggccaggt agcatttgac catggtctcg tagtcgagac 5400
cctcggcggc gtgccccttg gcgcggagct ttcccttgga ggtggcgccg cacgaggggc 5460
actgcaggct cttcagggcg tagagcttgg gagcgagaaa cacggactct ggggagtagg 5520
cgtccgcgcc gcaggccgag cagaccgtct cgcattccac cagccaagtg agttccgggc 5580
ggtcagggtc aaaaaccagg ctgcccccat gctttttgat gcgtttctta cctcggctct 5640
ccatgaggcg gtgtcccttc tcggtgacga agaggctgtc cgtgtccccg tagaccgatt 5700
tcaggggcct gtcttccagc ggagtgcctc tgtcctcctc gtagagaaac tctgaccact 5760
ctgagacaaa ggcccgtgtc caggccagga cgaaggaggc cacgtgggag gggtagcggt 5820
cgttgtccac tagcgggtcc accttctcca gggtgtgcag gcacatgtcc ccctcctccg 5880
cgtccagaaa agtgattggc ttgtaggtgt aggacacgtg accgggggtt cccgacgggg 5940
gggtataaaa gggggtgggt gccctttcat cttcactctc ttccgcatcg ctgtctgcga 6000
gagccagctg ctggggtaag tattcccttt cgaaggcggg catgacctca gcgctcaggt 6060
tgtcagtttc taaaaatgag gaagatttga tgttcacctg tccggaggtg atacctttga 6120
gggtacctgg gtctatctgg tcagaaaaca ctattttttt gttatcaagc ttggtggcga 6180
acgacccgta gagggcgttg gagagcagct tggcgatgga gcgcagggtc tggtttttgt 6240
cgcggtcggc tcgctccttg gccgcgatgt tgagttgcac gtactcgcgg gccacgcact 6300
tccactcggg gaagacggtg gtgcgctcgt ctgggatcag gcgcaccctc cagccgcggt 6360
tgtgcagggt gaccatgtcg acgctggtgg cgacctcacc gcgcaggcgc tcgttggtcc 6420
agcagaggcg gccgcccttg cgcgagcaga aggggggtag ggggtccagc tggtcctcgt 6480
tcggggggtc cgcgtcgatg gtaaagaccc cggggagcag acgcgggtca aagtagtcga 6540
tcttgcaagc ttgcatgtcc agagcccgct gccattcgcg ggcggcgagc gcgcgctcgt 6600
aggggttgag gggcgggccc cagggcatgg ggtgggtgag cgcagaggcg tacatgccgc 6660
agatgtcata cacgtacagg ggttccctga ggatgccgag gtaggtgggg tagcagcgcc 6720
ccccgcggat gctggcgcgc acgtagtcat agagttcgtg ggagggggcc agcatgttgg 6780
gcccgaggtt ggtgcgctgg gggcgctcgg cgcggaagac gatctgcctg aagatggcgt 6840
gggagttgga ggagatggtg ggccgctgga agacgttgaa gcttgcttct tgcaagccca 6900
cggagtccct gacgaaggag gcgtaggact cgcgcagctt gtgcaccagc tcggcggtga 6960
cctggacgtc gagcgcacag tagtcgaggg tctcacggat gatgtcatac ttatcctccc 7020
ccttcttttt ccacagctcg cggttgagga cgaactcttc gcggtctttc cagtactctt 7080
ggaggggaaa cccgtccgtg tccgaacggt aagagcctag catgtagaac tggttgacgg 7140
cctggtaggg gcagcagccc ttctccacgg gcagcgcgta ggcctgcgcc gccttgcgga 7200
gggaggtgtg ggtgagggcg aaagtgtccc tgaccatgac tttgaggtat tgatgtctga 7260
agtctgtgtc atcgcagccg ccctgttccc acagggtgta gtccgtgcgc tttttggagc 7320
gcgggttggg cagggagaag gtgaggtcat tgaagaggat cttccccgct cgaggcatga 7380
agtttctggt gatgcgaaag ggccctggga ccgaggagcg gttgttgatg acctgggcgg 7440
ccaggacgat ctcgtcaaag ccgtttatgt tgtggcccac gatgtagagc tccaggaagc 7500
ggggctggcc cttgatggag gggagctttt taagttcctc gtaggtgagc tcctcgggcg 7560
attccaggcc gtgctcctcc agggcccagt cttgcaagtg agggttggcc gccaggaagg 7620
atcgccagag gtcgcgggcc atgagggtct gcaggcggtc gcggaaggtt ctgaactgtc 7680
gccccacggc catcttttcg ggggtgatgc aatagaaggt gagggggtct ttctcccagg 7740
ggtcccatct gagctctcgg gcgaggtcgc gtgcggcggc gaccagagcc tcgtcgcccc 7800
ccagtttcat gaccagcatg aagggcacga gctgcttgcc aaaggctccc atccaagtgt 7860
aggtctctac atcgtaggtg acaaagaggc gctccgtgcg aggatgagag ccgatcggga 7920
agaactggat ctcccgccac cagttggagg attggctgtt gatgtggtga aagtagaagt 7980
cccgtctgcg ggccgagcac tcgtgctggc ttttgtaaaa gcgaccgcag tactggcagc 8040
gctgcacggg ttgtatatct tgcacgaggt gaacctggcg acctctgacg aggaagcgca 8100
gcgggaatct aagtcccccg cctggggtcc cgtgtggctg gtggtcttct actttggttg 8160
tctggccgcc agcatctgtc tcctggaggg cgatggtgga acagaccacc acgccgcgag 8220
agccgcaggt ccagatctcg gcgctcggcg ggcggagttt gatgacgaca tcgcgcacat 8280
tggagctgtc catggtctcc agctcccgcg gcggcaggtc agccgggagt tcctggaggt 8340
ttacctcgca gagacgggtc aacgcacggg cagtgttaag atggtatctg atttcaaggg 8400
gcgtgttggc ggcggagtcg atggcttgca ggaggccgca gccccggggg gccacgatgg 8460
ttccccgtgg ggcgcgaggg gaggcggaag ctgggggtgt gttcagaagc ggtgacgcgg 8520
gcgggccccc ggaggtaggg ggggttccgg ccccacaggc atgggcggca ggggcacgtc 8580
ttcgccgcgc gcgggcaggg gctggtgctg gctccgaaga gcgcttgcgt gcgcgacgac 8640
gcgacggttg gtgtcctgta tctggcgcct ctgagtgaag accacgggtc ccgtgacctt 8700
gaacctgaaa gagagttcga cagaatcaat ctcggcatcg ttgacagcgg cctggcgcag 8760
gatctcctgc acgtcgcccg agttgtcctg gtaggcgatc tctgccatga actgctcgat 8820
ctcttcctcc tggagatctc ctcgtccggc gcgctccacg gtggccgcca ggtcgttgga 8880
gatgcgaccc atgagctgcg agaaggcgtt gagtccgccc tcgttccaga cccggctgta 8940
gaccacgccc ccctcggcgt cgcgggcgcg catgaccacc tgggccaggt tgagctccac 9000
gtgtcgcgtg aagacggcgt agttgcgcag gcgctggaaa aggtagttca gggtggtggc 9060
ggtgtgctcg gcgacaaaga agtacatgac ccagcgccgc aacgtggatt cattgatgtc 9120
ccccaaggcc tccaggcgct ccatggcctc gtagaagtcc acggcgaagt tgaaaaactg 9180
ggagttgcga gcggacacgg tcaactcctc ctccagaaga cggatgagct cggcgacagt 9240
gtcgcgcacc tcgcgctcga aggccacggg gggcgcttct tcctcttcca cctcttcttc 9300
catgattgct tcttcttctt cctcagccgg gacgggaggg ggcggcggcg ggggaggggc 9360
gcggcggcgg cggcggcgca ccgggaggcg gtcgatgaag cgctcgatca tctccccccg 9420
catgcggcgc atggtctcgg tgacggcgcg gccgttctcc cgggggcgca gctcgaagac 9480
gccgcctttc atctcgccgc ggggcgggcg gccgtgaggt agcgagacgg cgctgactat 9540
gcatcttaac aattgctgtg taggtacgcc gccaagggac ctgattgagt ccagatccac 9600
cggatccgaa aacctttgga ggaaagcgtc tatccagtcg cagtcgcaag gtaggctgag 9660
caccgtggcg ggcgggggcg ggtcgggaga gttcctggcg gagatgctgc tgatgatgta 9720
attaaagtag gcggtcttga gaaggcggat ggtggacagg agcaccatgt ctttgggtcc 9780
ggcctgttgg atgcggaggc ggtcggccat gccccaggcc tcgttctgac accggcgcag 9840
gtctttgtag tagtcttgca tgagtctttc caccggcacc tcttctcctt cctcttctcc 9900
atctcgccgg tggtttctcg cgccgcccat gcgcgtgacc ccaaagcccc tgagcggctg 9960
cagcagggcc aggtcggcga ccacgcgctc ggccaagatg gcctgctgta cctgagtgag 10020
ggtcctctcg aagtcatcca tgtccacgaa gcggtggtag gcgcccgtgt tgatggtgta 10080
ggtgcagttg gccatgacgg accagttgac ggtctggtgt cccggctgcg agagctccgt 10140
gtaccgcagg cgcgagaagg cgcgggaatc gaacacgtag tcgttgcaag tccgcaccag 10200
atactggtag cccaccagga agtgcggcgg aggttggcga tagaggggcc agcgctgggt 10260
ggcgggggcg ccgggcgcca ggtcttccag catgaggcgg tggtatccgt agatgtacct 10320
ggacatccag gtgatgccgg cggcggtggt ggtggcgcgc gcgtagtcgc ggacccggtt 10380
ccagatgttt cgcaggggcg agaagtgttc catggtcggc acgctctggc cggtgaggcg 10440
cgcgcagtcg ttgacgctct atacacacac aaaaacgaaa gcgtttacag ggctttcgtt 10500
ctgtagcctg gaggaaagta aatgggttgg gttgcggtgt gccccggttc gagaccaagc 10560
tgagctcggc cggctgaagc cgcagctaac gtggtattgg cagtcccgtc tcgacccagg 10620
ccctgtatcc tccaggatac ggtcgagagc ccttttgctt tcttggccaa gcgcccgtgg 10680
cgcgatctgg gatagatggt cgcgatgaga ggacaaaagc ggctcgcttc cgtagtctgg 10740
agaaacaatc gccagggttg cgttgcggcg taccccggtt cgagccccta tggcggcttg 10800
gatcggccgg aaccgcggct aacgtgggct gtggcagccc cgtcctcagg accccgccag 10860
ccgacttctc cagttacggg agcgagcccc ttttgttttt tattttttag atgcatcccg 10920
tgctgcggca gatgcgcccc tcgccccggc ccgatcagca gcagcaacag caggcatgca 10980
gacccccctc tcctctcccc gccccggtca ccacggccgc ggcggccgtg tccggcgcgg 11040
ggggtgcgct ggagtcagat gagccaccgc ggcggcgacc taggcagtat ctggacttgg 11100
aagagggcga gggactggcg cggctggggg cgagctcccc agagcgtcac ccgcgggtgc 11160
agttgaaaag ggacgcgcgc gaggcgtacc tgccgcggca aaacctgttt cgcgaccgcg 11220
ggggcgagga gcccgaggag atgcgagact gcaggttcca agcagggcgc gagctgcgcc 11280
gcggcttgga cagagagcgc ttgctgcgcg aggaggactt tgagcccgac acgcagacgg 11340
gcatcagccc cgcgcgcgcg cacgtggccg cggccgacct ggtgaccgcc tacgagcaga 11400
cggtgaacca ggagcgcaac ttccaaaaaa gcttcaacaa ccacgtgcgc acgctggtgg 11460
cgcgcgagga ggtgaccctg ggtctcatgc atctgtggga cctggtggag gcgatcgtgc 11520
agaaccccag cagcaagccc ctgaccgcgc agctgttcct ggtggtgcag cacagcaggg 11580
acaacgatgc cttcagggag gcgctgctga acatcaccga gccggagggg cgctggctcc 11640
tggacctgat aaacatcctg cagagcatag tggtgcagga gcgcagcctg agcctggccg 11700
agaaggtggc ggccattaac tattctatgc tgagcctggg caagttctac gcccgcaaga 11760
tctacaagac cccctacgtg cccatagaca aggaggtgaa gatagacagc ttctacatgc 11820
gcatggcgct aaaggtgctg accctgagcg acgacctggg agtgtaccgc aacgagcgca 11880
tccacaaggc cgtgagcgcc agccggcggc gcgagctgag cgaccgcgag ctgatgcaca 11940
gtctgcaacg cgcgctgacc ggcgcgggcg agggcgacag ggaggtcgag tcctacttcg 12000
acatgggggc cgacctgcac tggcagccga gccgccgcgc cctggaggcg gcgggggcgt 12060
atggcggccc cctggcggcc gatggcgagg aagaggagga ctatgagcta gaggagggcg 12120
agtacctgga ggactgacct ggctggtggt gttttggtat agatgcaaga tccgaacgtg 12180
gcggacccgg cggtccgggc ggcgctgcag agccagccgt ccggcattaa ctcctctgac 12240
gactgggccg cggccatggg tcgcatcatg gccctgaccg cgcgcaaccc cgaggccttc 12300
aggcagcagc ctcaggctaa ccggctggcg gccatcttgg aagcggtagt gcccgcgcgc 12360
tccaacccca cccacgagaa ggtgctggcc atagtcaacg cgctggcgga gagcagggcc 12420
atccgggcgg acgaggccgg actggtgtac gatgcgctgc tgcagcgggt ggcgcggtac 12480
aacagcggca acgtgcaaac caacctggac cgcctggtga cggacgtgcg cgaggccgtg 12540
gcgcagcgcg agcgcttgca tcaggacggt aacctgggct cgctggtggc gctaaacgcc 12600
ttcctcagca cccagccggc caacgtaccg cgggggcagg aggactacac caacttcttg 12660
agcgcgctgc ggctgatggt gaccgaggtc cctcagagcg aagtgtacca gtcggggccc 12720
gactacttct tccagaccag cagacagggc ttgcaaaccg tgaacctgag ccaggctttc 12780
aagaacctgc gggggctgtg gggagtgaag gcgcccaccg gcgaccgggc tacggtgtcc 12840
agcctgctaa cccccaactc gcgcctgctg ctgctgctga tcgcgccctt cacggacagc 12900
gggagcgtct cgcgggagac ctatctgggc cacctgctga cgctgtaccg cgaggccatc 12960
gggcaggcgc aggtggacga gcacaccttc caggagatca ccagcgtgag ccacgcgctg 13020
gggcaggagg acacgggcag cctgcaggcg accctgaact acctgctgac caacaggcgg 13080
cagaagattc ccacgctgca cagcctgacc caggaggagg agcgcatctt gcgctacgtg 13140
cagcagagcg tgagcctgaa cctgatgcgc gacggcgtga cgcccagcgt ggcgctggac 13200
atgaccgcgc gcaacatgga accgggcatg tacgcttccc agcggccgtt catcaaccgc 13260
ctgatggact acttgcatcg ggcggcggcc gtgaaccccg agtacttcac caatgccatt 13320
ctgaatcccc actggatgcc ccctccgggt ttctacaacg gggactttga ggtgcccgag 13380
gtcaacgacg ggttcctctg ggatgacatg gatgacagtg tgttctcccc caacccgctg 13440
cgcgccgcgt ctctgcgatt gaaggagggc tctgacaggg aaggaccgag gagtttggcc 13500
tcctccctgg ctctgggggc ggtgggcgcc acgggcgcgg cggcgcgggg cagcagcccc 13560
ttccccagcc tggcggactc tctgaatagc gggcgggtga gcaggccccg cttgctaggc 13620
gaggaggagt atctgaacaa ctccctgcta cagcccgtga gggacaaaaa cgctcagcgg 13680
cagcagtttc ccaacaacgg gatagagagc ctggtggaca agatgtccag atggaagacg 13740
tatgcgcagg agtacaagga gtgggaggac cgacagccgc ggcccctgcc gccccctaga 13800
cagcgctggc agcggcgtgc gtccaaccgc cgctggaggc aggggcccga ggacgatgat 13860
gactctgcag atgacagcag cgtgttggat ctgggcggga gcgggaaccc cttttcgcac 13920
ctgcgcccac gcctgggcaa gatgttttaa aagagaaaaa taaaaactca ccaaggccat 13980
ggcgacgagc gttggttttt ttgttccctt ccttagtatg cggcgcgcgg cgatgttcga 14040
ggaggggcct cccccctctt acgagagcgc gatgggaatt tctcctgcgg cgcccctgca 14100
gcctccctac gtgcctcctc ggtacctgca acctacaggg gggagaaata gcatctgtta 14160
ctctgagctg cagcccctgt acgataccac cagactgtac ctggtggaca acaagtccgc 14220
ggacgtggcc tccctgaact accagaacga ccacagcgat tttttgacca cggtgatcca 14280
aaacaacgac ttcaccccaa ccgaggccag tacccagacc ataaacctgg acaacaggtc 14340
gaactggggc ggcgacctga agactatcct gcacaccaat atgcccaacg tgaacgagtt 14400
catgttcacc aactctttta aggcgcgggt gatggtggcg cgcgagcagg gggaggcgaa 14460
gtacgagtgg gtggacttca cgctgcccga gggcaactat tcagagacca tgactctcga 14520
cctgatgaac aatgcgatcg tggaacacta tctgaaagtg ggcaggcaga acggggtgaa 14580
ggagagcgat atcggggtca agtttgacac cagaaacttt cgtctgggct gggaccccgt 14640
gaccgggctg gtcatgccgg gggtctacac caacgaggcc tttcatcccg atatagtgct 14700
cctgcccggc tgtggggtgg actttaccca gagccggctg agcaacctgc tgggcgttcg 14760
caagcggcaa cctttccagg agggtttcaa gatcacctat gaggatctgg aggggggcaa 14820
cattcccgcg ctccttgatc tggacgccta cgaggagagc ttgaaacccg aggagagcgc 14880
tggcgacagc ggcgagagtg gcgaggagca agccggcggc ggtggcagcg cgtcggtaga 14940
aaacgaaagt actcccgcag tggcggcgga cgctgcggag gtcgagccgg aggccatgca 15000
gcaggacgca gaggagggcg cgcaggagga catgaacaat ggggagatca ggggcgacac 15060
tttcgccacc cggggcgaag aaaaagaggc agaggcggcg gcggcgacgg cggaagccga 15120
aaccgaggca gaggcagagc ccgagaccga agttatggaa gacatgaatg atggagaacg 15180
taggggtgac acgtttgcca cccggggcga agagaaggcg gcggaggcag aagccgcggc 15240
tgaggaggcg gctgcggctg cggccaaggc tgaggctgcg gctgaggcta aggtcgaagc 15300
cgatgttgcg gttgaggctc aggctgagga ggaggcggcg actgaagcag ttaaggaaaa 15360
ggcccaggca gagcaggaag agaaaaaacc tgtcattcaa cctctaaaag aagatagcaa 15420
aaagcgcagt tacaacgtca tcgagggcag cacctttacc caataccgca gctggtacct 15480
ggcttacaac tacggcgacc cggtcaaggg ggtgcgctcg tggaccctgc tctgcacgcc 15540
ggacgtcacc tgcggctccg agcagatgta ctggtcgctg ccaaacatga tgcaagaccc 15600
ggtgaccttc cgttccacgc ggcaggttag caactttccg gtggtgggcg ccgaactgct 15660
gccagtgcac tccaagagtt tttacaacga gcaggccgtc tactcccagc tgatccgcca 15720
ggccacctct ctgacccacg tgttcaatcg ctttcccgag aaccagattt tggcgcgccc 15780
gccggccccc accatcacca ccgtcagtga aaacgttcct gccctcacag atcacgggac 15840
gctaccgctg cgcaacagca tctcaggagt ccagcgagtg accattactg acgccagacg 15900
ccggacctgc ccctacgttt acaaggcctt gggcatagtc tcgccgcgcg tcctctccag 15960
tcgcactttt taaaacacat ctaccctcac gctccaaaat catgtccgta ctcatctcgc 16020
ccagcaacaa caccggctgg gggctgcgcg cgcccagcaa gatgtttgga ggggcgagga 16080
aacgctccga acagcaccca gtgcgcgtgc gcggccacta ccgcgcgccc tggggtgcgc 16140
acaagcgcgg gcgcacaggg cgcaccactg tggatgatgt cattgactcc gtagtggagc 16200
aggcgcgcca ctacacaccc ggcgcgccga ccgcctccgc cgtgtccacc gtggaccagg 16260
cgatcgaaag cgtggtacag ggggcgcggc actatgccaa ccttaaaagt cgccgccgcc 16320
gcgtggcgcg ccgccatcgc cggagacccc gggctactgc cgccgcgcgc cttaccaagg 16380
ctctgctcaa gcgcgccagg cgaactggcc accgggccgc catgagggcc gcacggcggg 16440
ctgccgctgc cgcgagcgcc gtggccccgc gggcacgaag gcgcgcggcc gctgccgccg 16500
ccgccgccat ttccagcttg gcctcgacgc ggcgcggtaa catatactgg gtgcgcgact 16560
cggtaagcgg cacacgggtg cccgtgcgct ttcgcccccc acggaattag cacaaaacaa 16620
catacacact gagtctcctg ctgttgtgta tcccagcggc gaccgtcagc agcggcgaca 16680
tgtccaagcg caaaattaaa gaagagatgc tccaggtcat cgcgccggag atctatgggc 16740
ccccgaagaa ggaggaggat gattacaagc cccgcaagct aaagcgggtc aaaaagaaaa 16800
agaaagatga tgacgttgac gaggcggtgg agtttgtccg ccgcatggcg cccaggcgcc 16860
ccgtgcagtg gaagggtcgg cgcgtgcagc gagtcctgcg ccccggcacc gcggtggtct 16920
ttacgcccgg cgagcgttcc acgcgcactt tcaagcgggt gtacgatgag gtgtacggcg 16980
acgaggatct gttggagcag gccaaccatc gctttgggga gtttgcatat gggaaacggc 17040
cccgcgagag cctaaaagag gacctgctgg cgctaccgct ggacgagggc aatcccaccc 17100
cgagtctgaa gccggtaacc ctgcaacagg tgctgccttt gagcgcgccc agcgagcaga 17160
agcgagggtt gaagcgcgag ggcggggacc tggcacccac cgtgcagttg atggtgccca 17220
agcggcagaa gctggaggac gtgctggaga aaatgaaagt agagcccggg atccagcccg 17280
aaatcaaggt ccgccccatc aagcaggtgg cgcccggcgt gggagtccag accgtggacg 17340
ttaggattcc cacggaggag atggaaaccc aaaccgccac tccctcttcg gcggctagcg 17400
ccaccaccgg cgccgcttcg gtagaggtgc agacggaccc ctggctacct gccgccactg 17460
tcgccgccgc cgccgccgcc ccccgttcgc gcgggcgcaa gagaaattat ccagcggcca 17520
gcgcgctcat gccccagtac gcactgcatc catccatcgc gcccaccccc ggctaccgcg 17580
ggtactcgta ccgcccgcgc agatcagccg gcacccgcgg ccgccgccgc cgtgcgacca 17640
caaccagccg ccgccgtcgc cgccgccgcc agccagtgct gacccccgtg tctgtaagga 17700
aggtggctcg ctcggggagc acgctggtgg tgcccagagc gcgctaccac cccagcattg 17760
tttaaagccg gtctctgtat ggttcttgca gatatggccc tcacttgtcg cctccgcttc 17820
ccggtgccgg gataccgagg aagaactcac cgccgcagag gcatggcggg cagcggtctc 17880
cgcggcggcc gtcgccatcg ccggcgcgca aagagcaggc gcatgcgcgg cggtgtgctg 17940
cccttcctaa tcccgctaat cgccgcggcg atcggtgccg tgcccgggat cgcctccgtg 18000
gccctgcagg cgtcccagaa acattgactc ttgcaacctt gcaagcttgc attttttgga 18060
ggaaaaaata aaaagtctag actctcacgc tcgcttggtc ctgtgactat tttgtagaaa 18120
aaagatggaa gacatcaact ttgcgtcgct ggccccgcgt cacggctcgc gcccgttcat 18180
gggagactgg acagatatcg gcaccagcaa tatgagcggt ggcgccttca gctggggcag 18240
tctgtggagt ggccttaaaa attttggttc caccattaag aactatggca acaaagcgtg 18300
gaacagcagc acgggccaga tgctgagaga caagttgaaa gagcagaact tccaggaaaa 18360
ggtggcgcag ggcctggcct ctggcatcag cggggtggtg gacatagcta accaggccgt 18420
gcagaaaaag ataaacagtc atctggaccc ccggcctcag gtggaggaaa cgcctccagc 18480
aatggagacg gtgtctcccg agggcaaagg cgaaaagcgc ccgcggcccg acagggaaga 18540
gaccctggtg tcacacaccg aggagccgcc ctcttacgag gaggcagtca aggccggcct 18600
gcctaccact cgccccatag cccccatggc caccggtgtg gtgggacaca ggcaacacac 18660
ccccgcaaca ctagatctgc ccccgccgtc cgatccgcct cgccagccaa aggcggcgac 18720
ggtgtccgct ccctccactt ccgccgccaa cagagtgccc ctgcgccgcg ctgcaagcgg 18780
cccccgggcc tcgcgagtca gcggcaactg gcagagcaca ctgaacagca tcgtgggcct 18840
gggagtgagg agtgtgaagc gccgccgttg ctactgaatg agcaagctag ctaacgtgtt 18900
gtatgtgtgt atgcgtccta tgtcgccgcc agaggagctg ttgagccgcc ggcgccgtct 18960
gcactccagc gaatttcaag atggcgaccc catcgatgat gcctcagtgg tcgtacatgc 19020
acatctcggg ccaggacgct tcggagtacc tgagccccgg gctggtgcag ttcgcccgcg 19080
ccacagacac ctacttcaac atgagtaaca agttcaggaa ccccactgtg gcgcccaccc 19140
acgatgtgac cacggaccgg tcgcagcgcc tgacgctgcg gtttatcccc gtggatcggg 19200
aggacaccgc ctactcttac aaggcgcggt ttacgctggc cgtgggcgac aatcgcgtgc 19260
tggacatggc ctccacttac tttgacatcc ggggggtgct ggacaggggc cccactttta 19320
agccctactc gggcactgcc tacaaccccc tggcccccaa gggcgccccc aattcttgtg 19380
agtgggaaca agaggaaaat caggtggtcg ctgcagatga ggacctggaa gaagatgaag 19440
aagcgcaagc agaagagcaa gcccctgtta aaaaaattca tgtatatgct caggcgcctc 19500
ttgctggcga aaagattacc aaggatggtt tgcaaatagg tactgaagtc gtaggagata 19560
catctaagga cacttttgca gataaaacat tccaacccga acctcagata ggcgagtctc 19620
agtggaacga ggctgatgcc acagcagcag gaggtagagt tttgaaaaag actaccccta 19680
tgagaccttg ctatggatcc tatgccaggc ctaccaatgc caacgggggt caaggaatta 19740
tggttgccaa tgaacaagga gtgttgcagt ctaaagtaga aatgcaattt ttctctaaca 19800
cctcaaccct taatgcgcgg gatggaaccg gcaatcccga accaaaggtg gtgttgtaca 19860
gcgaagatgt ccacttggaa tctcccgata ctcatctgtc ttacaagccc aaaaaggatg 19920
atgttaatgc caaagtcatg ttgggtcagc aagccatgcc caacagaccc aacctcattg 19980
gatttagaga taatttcatt gggcttatgt tttacaacag caccggtaac atgggagtgc 20040
tggcgggtca ggcctctcag ttgaatgctg tggtggactt gcaggataga aacacagaac 20100
tgtcatatca gcttatgctt gattcaattg gggatagaac cagatacttc tccatgtgga 20160
accaggcagt ggatagctat gatccagatg tcagaattat tgaaaaccat ggggttgagg 20220
atgaactgcc caactactgc ttccctttgg gcggcatagg aattactgat acttatcaag 20280
gggtgaaaaa tagcaatggc aatggtcagt ggaccaaaga tgatcagttc gcggaccgca 20340
acgaaatagg ggtgggaaac aacttcgcca tggagatcaa catccaggcc aacctttgga 20400
gaaacttcct ctatgcaaac gtggggctct acctgccaga caagctcaag tacaacccca 20460
ccaacgtgga catctctgac aaccccaaca cctatgacta catgaacaag cgggtggtgg 20520
cccctggcct ggtggactgc tttgtcaatg tgggagccag gtggtccctg gactacatgg 20580
acaacgtcaa ccccttcaac caccaccgca atgcgggtct gcgctaccgc tccatgatcc 20640
tgggcaacgg gcgctatgtg ccctttcaca tccaggtacc ccagaagttc tttgccatca 20700
agaacctcct gctcctgccc ggctcctaca cctacgagtg gaacttcagg aaggatgtga 20760
acatggtcct acagagctct ctgggcaatg accttagggt ggatggggcc agcatcaagt 20820
ttgacagcat caccctctat gctacatttt tccccatggc ccacaacacc gcctccacgc 20880
ttgaggccat gctgagaaac gacaccaacg accagtcctt taatgactac ctctctgggg 20940
ccaacatgct ctacccaatc ccagccaagg ccaccaacgt gcccatctcc atcccctctc 21000
gcaactgggc cgcctttaga ggctgggcct ttacccgcct taagaccaag gagaccccct 21060
ccctgggctc gggttttgat ccctactttg tttactcggg atccatcccc tacctggatg 21120
gcaccttcta cctcaaccac actttcaaga agatatccat catgtatgac tcctccgtca 21180
gctggccggg caacgaccgc ttgctcaccc ccaatgagtt cgaggtcaag cgcgccgtgg 21240
acggcgaggg ctacaacgtg gcccagtgca acatgaccaa ggactggttc ctggtgcaga 21300
tgctggccaa ctacaacata ggctaccagg gcttttacat cccagagagc tacaaggaca 21360
ggatgtactc cttcttcaga aatttccaac ccatgagccg acaggtggtg gacgagacca 21420
attacaagga ctatcaagcc attggcatca cccaccagca caacaactcg ggtttcgtgg 21480
gctacctggc gcccaccatg cgcgagggtc aggcctaccc cgccaacttc ccctacccct 21540
tgataggcaa gaccgcggtc gacagcgtca cccagaaaaa gttcctctgc gaccgcaccc 21600
tctggcgcat ccccttctct agcaacttca tgtccatggg tgcgctcacg gacctgggcc 21660
aaaacctgct ttatgccaac tctgcccatg cgctggacat gactttcgag gtggacccca 21720
tggacgagcc cacccttctc tatattgtgt ttgaagtgtt cgacgtggtc agagtgcacc 21780
agccgcaccg cggtgtcatc gagaccgtgt acctgcgtac gcccttctca gccggcaacg 21840
ccaccaccta aggagacagc gccgccgcct gcatgactgg ttccaccgag caagagctca 21900
gggccatcgc cagagacctg ggatgcggac cctacttttt gggcacctat gacaaacgct 21960
tcccgggttt catctcccga gacaagctcg cctgcgccat cgtcaacacg gccgcgcgcg 22020
agaccggggg cgtgcactgg ctggcctttg gctgggaccc gcgctctaaa acttgctacc 22080
tctttgaccc ctttggcttc tctgatcagc gcctcaggca gatttatgag tttgagtacg 22140
aggggctgct gcgccgcagc gcgcttgcct cctcgcccga ccgctgcatc acccttgaga 22200
agtccaccga gaccgtgcag gggccccact cggccgcctg cggtctcttc tgttgcatgt 22260
ttttgcacgc ctttgtacac tggcctcaga gtcccatgga tcgcaacccc accatgaact 22320
tgctaaaggg agtgcccaac gccatgctcc agagccccca ggtcctgccc accctgcgcc 22380
gcaaccagga acagctctac cgcttcctgg agcgccactc cccctacttc cgcagccaca 22440
gcgcgcgcat ccggggggcc acctcttttt gccacttgca agaaaacatg caagacggaa 22500
aatgatgtac agcatgcttt taataaatgt aaagactgtg cactttattt atacacgggc 22560
tctttctggt tatttattca acaccgccgt cgccatctag aaatcgaaag ggttctgccg 22620
cgcgtcgccg tgcgccacgg gcagagacac gttgcgatac tggaagcggc tcgcccactt 22680
gaactcgggc accaccatgc ggggcagtgg ttcctcgggg aaattctcgc tccacagggt 22740
gcgggtcagc tgcagcgcgc tcaggaggtc gggagccgag atcttgaagt cgcagttggg 22800
gccggaaccc tgcgcgcgcg agttgcggta cacggggttg cagcactgga acaccagcag 22860
ggccggatta ttcacgctgg ccagcaggct ctcgtcgctg atcatgtcgc tgtccagatc 22920
ctccgcgttg ctcagggcga atggggtcat cttgcagacc tgcctgccca ggaaaggcgg 22980
gagcccaggc ttgccgttgc agtcgcagcg caggggcatt agcaggtgcc cacggcccga 23040
ctgcgcctgc gggtacaacg cgcgcatgaa ggcttcgatc tgcctaaaag ccacctgggt 23100
cttggctccc tccgaaaaga acatcccaca ggacttgctg gagaactgat tcgcgggaca 23160
gctggcatcg tgcaggcagc agcgcgcgtc agtgttggcg atctgcacca cgttgcgacc 23220
ccaccggttt ttcactatct tggccttgga agcctgctcc tttagcgcgc gctggccgtt 23280
ctcgctggtc acatccatct ctatcacctg ttccttgttg atcatgtttg tcccgtgcag 23340
acactttagg tcgccctccg tctgggtgca gcggtgctcc cacagcgcgc aaccggtggg 23400
ctcccaattc ttgtgggtca cccccgcgta ggcctgcagg taggcctgca ggaagcgccc 23460
catcatggtc ataaaggtct tctggctcgt aaaggtcagc tgcaggccgc gatgctcttc 23520
gttcagccag gtcttgcaga tggcggccag cgcctcggtc tgctcgggca gcatcttaaa 23580
atttgtcttc aggtcgttat ccacgtggta cttgtccatc atggcacgcg ccgcctccat 23640
gcccttctcc caggcggaca ccatgggcag gcttaggggg tttatcactt ccagcggcga 23700
ggacaccgta ctttcgattt cttcttcctc cccctcttcc cggcgcgcgc ccccgctgtt 23760
gcgcgctctt accgcctgca ccaaggggtc gtcttcaggc aagcgccgca ccgagcgctt 23820
gccgcccttg acctgcttga tcagtaccgg cgggttgctg aagcccacca tagtcagcgc 23880
cgcctgctct tcttcgtctt cgctgtctac cactatttct ggggaggggc ttctccgctc 23940
tgcggcaaag gcggcggatc gcttcttttt tttcttggga gccgccgcga tggagtccgc 24000
cacggcgacc gaggtcgagg gcgtggggct gggggtgcgc ggcaccaggg cctcgtcgcc 24060
ctcggactct tcctctgact ccaggcggcg gcggagtcgc ttctttgggg gcgcgcgcgt 24120
cagcggcggc ggagacgggg acggggacgg ggacgggacg ccctccacag ggggcggtct 24180
tcgcgcagac ccgcggccgc gctcgggggt cttctcgcgc tggtcttggt cccgactggc 24240
cattgtatcc tcctcctcct aggcagagag acataaggag tctatcatgc aagtcgagaa 24300
ggaggagagc ttaaccaccc cctctgagac cgccgtcgcc gtcgcccccg ctaccgccga 24360
cgcgcccgcc acaccgagcg acacccccgc ggaccccccc gccgacgcac ccctgttcga 24420
ggaagcggcc gtggagcagg acccgggctt tgtctcggca gaggaggatt tgcaagagga 24480
ggaggataag gaggagaagc cctcagtgcc aaaagatcat aaagagcaag acgagcacga 24540
cgcagacgca caccagggtg aagtcgggcg gggggacgga gggcatggcg gcgccgacta 24600
cctagacgaa ggaaacgacg tgctcttgaa gcacctgcat cgtcagtgcg ccatcgtctg 24660
cgacgctctg caggagcgca gcgaggtgcc cctcagcgtg gcggaggtca gccgcgccta 24720
cgagctcagc ctcttttccc cccgggtgcc cccccgccgc cgcgaaaacg gcacatgcga 24780
gcccaacccg cgcctcaact tctaccccgc ctttgtggtg cccgaggtcc tggccaccta 24840
tcacatcttc tttcaaaatt gcaagatccc catctcgtgc cgcgccaacc gtagccgcgc 24900
cgataagatg ctggccctgc gccagggcga ccacatacct gatatcgccg ctttggaaga 24960
tgtaccaaag atcttcgagg gtctgggtcg caacgaaaag cgggcagcaa actctctgca 25020
acaggaaaac agcgaaaatg agagtcacac cggggtgctg gtggagctcg agggcgacaa 25080
cgcccgcctg gcggtgctca agcgcagcat cgaggtcacc cactttgcct accccgcgct 25140
caacctgccc cccaaagtca tgaacgcggt catggacggg ctgatcatgc gccgcggcca 25200
gccccttgct ccagatgcaa acttgcatga ggagaccgag gacggccagc ccgtggtcag 25260
cgacgagcag ctggcgcgct ggctggaaac cgcggacccc gccgaactgg aggagcggcg 25320
caagatgatg atggccgcgg tgctggtcac cgtagagctg gagtgtctgc agcgcttctt 25380
cggtgacccc gagatgcaga gaaaggtcga ggagacccta cactacacct tccgccaggg 25440
ctacgtgcgc caggcttgca agatctccaa cgtggagctc agcaacctgg tgtcctacct 25500
gggcatcttg catgagaacc gccttgggca gagcgtgctg cactccaccc tgcgcgggga 25560
agcgcgccgc gactacgtgc gcgactgcgt ttaccttttc ctctgctaca cctggcagac 25620
ggccatgggg gtctggcagc agtgcctgga ggagcgcaac ctcaaggagc tggagaagct 25680
cctgcagcgc gcgctcaaag acctctggac gggcttcaac gagcgctcgg tggccgccgc 25740
gctggccgac ctcatcttcc ccgagcgcct gctcaaaact ctccagcagg ggctgcccga 25800
cttcaccagc caaagcatgt tgcaaaactt taggaacttt atcctggagc gttctggcat 25860
cctacccgcc acctgctgcg ccctgcccag tgactttgtt cccctcgtgt accgcgagtg 25920
ccccccgccg ctgtggggcc actgctacct gttccaactg gccaactacc tgtcctacca 25980
cgcggacctc atggaggact ccagcggcga ggggctcatg gagtgccact gccgctgcaa 26040
cctctgcacg ccccaccgct ccctggtctg caacacccaa ctgctcagcg agagtcagat 26100
tatcggtacc ttcgagctac agggtccgtc ctcctcagac gagaagtccg cggctccggg 26160
gctaaaactc actccggggc tgtggacttc cgcctacctg cgcaaatttg tacctgaaga 26220
ctaccacgcc cacgagatca ggttttacga ggaccaatcc cgcccgccca aggcggagct 26280
gaccgcctgc gtcatcaccc agggcgagat cctaggccaa ttgcaagcca tccaaaaagc 26340
ccgccaagag tttttgctga gaaagggtcg gggggtgtat ctggaccccc agtcgggtga 26400
ggagctcaac ccggttcccc cgctgccgcc gccgcgggac cttgcttccc aggataagca 26460
tcgccatggc tcccagaaag aagcagcagc ggccgccact gccgccaccc cacatgctgg 26520
aggaagagga gtactgggac agtcaggcag aggaggtttc ggacgaggag gagccggaga 26580
cggagatgga agagtgggag gaggacagct tagacgagga ggcttccgaa gccgaagagg 26640
caggcgcaac accgtcaccc tcggccgcag ccccctcgca ggcgcccccg aagtccgctc 26700
ccagcatcag cagcaacagc agcgctataa cctccgctcc tccaccgccg cgacccacgg 26760
ccgaccgcag acccaaccgt agatgggaca ccaccggaac cggggccggt aagtcctccg 26820
ggaaaggcaa gcaagcgcag cgccaaggct accgctcgtg gcgcgctcac aagaacgcca 26880
tagtcgcttg cttgcaagac tgcgggggga acatctcctt cgcccgccgc ttcctgctct 26940
tccaccacgg tgtggccttc ccccgtaacg tcctgcatta ctaccgtcat ctctacagcc 27000
cctactgcgg cggcagtgag ccagaggcgg ccggcggcag cggcgcccgt ttcggtgcct 27060
aggaagaccc agggcaagac ttcagccaag aaactcgcgg cggccgcggc gaacgcggtc 27120
gcgggggccc tgcgcctgac ggtgaacgaa cccctgtcga cccgcgaact gaggaaccga 27180
atcttcccca ctctctatgc catcttccag cagagcagag ggcaggatca ggaactgaaa 27240
gtaaaaaaca ggtctctgcg ctccctcacc cgcagctgtc tgtatcacaa gagcgaagac 27300
cagcttcggc gcacgctgga ggacgctgag gcactcttca gcaaatactg cgcgctcact 27360
cttaaggact agctccgcgc ccttctcgaa tttaggcggg aacgcctacg tcatcgcagc 27420
gccgccgtca tgagcaagga cattcccacg ccatacatgt ggagctatca gccgcagatg 27480
ggactcgcgg cgggcgcctc ccaagactac tccacccgca tgaactggct cagtgccggc 27540
ccacacatga tctcacaggt taatgacatc cgcacccatc gaaaccaaat attggtggag 27600
caggcggcaa ttaccaccac gccccgcaat aatcccaacc ccagggagtg gcccgcgtcc 27660
ctggtgtatc aggaaattcc cggccccacc accgtactac ttccgcgtga ttcccaggcc 27720
gaagtccaaa tgactaactc aggggcacag ctcgcgggcg gctgtcgtca cagggtgcgg 27780
cctcctcgcc agggtataac tcacctggag atccgaggca gaggtattca gctcaacgac 27840
gagtcggtga gctcctcgct cggtctcaga cctgacggga ccttccagat agccggagcc 27900
ggccgatctt ccttcacgcc ccgccaggcg tacctgactc tgcagagctc gtcctcggcg 27960
ccgcgctcgg gcggcatcgg gactctccag ttcgtgcagg agtttgtgcc ctcggtctac 28020
ttcaacccct tctcgggctc tcccggtcgc tacccggacc agttcatccc gaactttgac 28080
gccgcgaggg actcggtgga cggctacgac tgaatgtcgg gtggacccgg tgcagagcaa 28140
cttcgcctga agcaccttga ccactgccgc cgccctcagt gctttgcccg ctgtcagacc 28200
ggtgagttcc agtacttttc cctgcccgac tcgcacccgg acggcccggc acacggggtg 28260
cgctttttca tcccgagtca ggtccgctct accctaatca gggagtttac agcccgtccc 28320
ctactggcgg agttggaaaa ggggccttct atcctaacca ttgcctgcat ctgctctaac 28380
cctggattac accaagatct ttgctgtcat ttgtgtgctg agtataataa aggctgagat 28440
cagaatctac tcgggctcct gtcgccatcc tgtcaacgcc accgtccaag cccggcccga 28500
tcagcccgag gtgaacctca cctgcggtct gcaccggcgc ctgaggaaat acctagcttg 28560
gtactacaac agcactccct ttgtggttta caacagcttt gaccaggacg gggtctcact 28620
gagggataac ctctcgaacc tgagctactc catcaggaag aacaacaccc tcgagctact 28680
tcctccttac ctgcccggga cttaccagtg tgtcaccggt ccctgcaccc acacccacct 28740
gttgatcgta aacgactctc ttccgagaac agacctcaat aactcctctc cgcagttccc 28800
cagaacagga ggtgagctca ggaaaccccg ggtaaagaag ggtggacaag agttaacact 28860
tgtggggttt ctggtgtatg tgacgctggt ggtggctctt ttgattaagg cttttccttc 28920
catgtctgaa ctctccctct tcttttatga acaactcgac tagtgctaac gagaccctac 28980
ccaacgaatc gggattgaat atcggtaacc aggttgcagt ttcacttttg attaccttta 29040
tagtcctctt cctgctagtg ctgtcgcttc tgtgcctgcg gatcgggggc tgctgcatcc 29100
acgtttatat ctggtgctgg ctgtttagaa ggttcggaga ccaccgcagg tagaataatg 29160
ctgcttaccc tctttgtcct ggcgctggct gccagctgcc aagccttttc cgaggctgac 29220
ttcatagagc cccagtgcaa tatcacttat aaatctgaac gtgccatctg tactatccta 29280
atcaaatgtg ttactcaaca cgataaggta actgttaaat acaaagatca attaaaaaaa 29340
gacgcacttt acagcagctg gcaaccagga gatgaacaaa aatacaatgt aaccgtcttc 29400
cagggcaaac tctccaaaac ttacaattac actttcccat ttgagcagat gtgtgacttt 29460
gtcatgtaca tggaaaagca gtacaagctg tggcctccaa ctccccaggg ctgtgtggaa 29520
aatccaggct ctttctgtat gatctctctc tgtgtaactg tgctggcact aatactcacg 29580
cttctgtata tcagatttaa atcaaggcaa agctttattg atgaaaagaa aatgccttaa 29640
tcgctttcac gcttgattgc taacaccggg tttttatccg cagaatgatt ggaatcaccc 29700
tactaatcac ctccctcctt gcgattgccc atgggttgga acgaatcgaa gtccctgtgg 29760
gggccaatgt taccctggtg gggcctgtcg gcaatgctac attaatgtgg gaaaaatata 29820
ctaaaaatca atgggtctct tactgcacta acaaaaacag ccacaagccc agagccatct 29880
gcgatgggca aaatttaacc ttgattgatg ttcaattgct ggatgcgggc tactattatg 29940
ggcagctggg tacaatgatt aattactgga gaccccacag agattacatg cttcacgtag 30000
taaagggtcc cattagcagc ccaaccacca cctctaccac ccccactacc accactactc 30060
ccaccaccag cactgccgcc cagcctcctc atagcagaac aaccactttt atcaattcca 30120
agtcccactc cccccacatt gccggcgggc cctccgcctc agactccgag accaccgaga 30180
tctgcttctg caaatgctct gacgccattg cccaggattt ggaagatcac gaggaagatg 30240
agcatgacta cgcagatgca tgccaggcat cagagtcaga agcgctgccg gtggccctaa 30300
aacagtatgc agacccccac accacccccg accttcctcc accttcccag aagccaagtt 30360
tcctggggga aaatgaaact ctgcctctct ccatactagc tctgacatct gttgctattt 30420
tggccgctct gctggtgctt ctatgctcta tatgctacct gatctgctgc agaaagaaaa 30480
aatctcacgg ccatgctcac cagcccctca tgcacttccc ttaccctcca gagctgggcg 30540
accacaaact ttaagtctgc agtagctatc tgcccatccc ttgtcagtcg acagcgatga 30600
gccccactaa tctaacagcc tctggactta caacattgtc tcttaatgag accaccgctc 30660
ctcaagacct gtacgatggt gtctccgcgc tggttaacca gtgggatcac ctgggcatat 30720
ggtggctcct cataggagca gtgaccctgt gcctaatcct ggtctggatc atctgctgca 30780
tcaaaagcag aagacccagg cggcggccca tctacaggcc cttcgtcatc acacctgaag 30840
ataatgatga tgatgacacc acctccaggc tgcagagcct aaagcagcta ctcttctctt 30900
ttacagcatg gtaaattgaa tcatgccccg cattttcatc tacttgcttc tccttccact 30960
ttttctgggc tcctctacat tggccgctgt gtcccacatc gaggtagact gcctcacgcc 31020
cttcacagtc tacctgcttt tcggctttgt catctgcacc tttgtctgca gcgttatcac 31080
tgtagtgatc tgcttcatac agtgcatcga ctacatctgt gtgcgggtgg cctactttag 31140
acaccacccc cagtatcgca acagggacat agcggctctc ctaagacttg tttaaatcat 31200
ggccaaatta cctgtgattg gtcttctgat tatctgctgc gtcctagccg cgattgggac 31260
tcaacctaat accaccacca gcgctcccag aaagagacat gtatcctgca gcttcaagcg 31320
tccctggaat ataccccaat gctttactga tgaacctgaa atctctttgg cttggtactt 31380
cagcgtcacc gcccttctca tcttctgcag tacggttatt gctcttgcca tctacccttc 31440
ccttaacctg ggctggaatg ctgtcaactc tatggaatat cccaccttcc cagaaccaga 31500
cctgccagac ctggttgttc taaacgcgtt tcctcctcct ccagttcaaa atcagtttcg 31560
ccctccgtcc cctacgccca ctgaggtcag ctactttaat ctaacaggcg gagatgactg 31620
aaaacctaga cctagaaatg gacggtctct gcagcgagca acgcacacta gagaggcgcc 31680
ggcaaaaagc agagctcgag cgtcttaaac aagagctcca agacgccgtg gccatacacc 31740
agtgcaaaaa agggctcttc tgtctggtaa aacaggccac gctcacctat gaaaaaacag 31800
gtgacaccca ccgcctagga tacaagctgc ccacacagcg ccaaaagttt gcccttatga 31860
taggtgaaca acccatcacc gtcacccagc actccgtgga gacagaaggc tgcattcatg 31920
ctccctgcag gggcgctgac tgcctctaca ccttgatcaa aaccctctgc ggtctcagag 31980
accttatccc tttcaattga tcataactgt aatcaataaa aaatcactta cttgaaatct 32040
gatagcaagc ctctgtccaa ttttttcagc aacacttcct tcccctcttc ccaactctgg 32100
tactctaggc gcctcctagc tgcaaacttc ctccacagtc tgaagggaat gtcagattcc 32160
tcctcctcct gtccctccgc acccacaatc ttcatgttgt tgcagatgaa acgcgcgaga 32220
tcgtctgacg agaccttcaa ccccgtgtac ccctacgata ccgagatcgc tccgacttct 32280
gtccctttcc ttacccctcc ctttgtgtca cccgcaggaa tgcaagaaaa tccagctggg 32340
gtgctgtccc tgcacctgtc agagcccctt accacccaca atggggccct gactctaaaa 32400
atggggggcg gcctgaccct ggacaaggaa gggaatctca cttcccaaaa catcaccagt 32460
gtcgatcccc ctctcaaaaa aagcaagaac aacatcagcc ttcagaccgc cgcacccctc 32520
gccgtcagct ccggggccct aacccttttt gccactcccc ccctagcggt cagtggcgac 32580
aaccttactg tgcagtctca ggcccctctt actttggaag actcaaaact aactctggcc 32640
accaaaggac ccctaactgt gtccgaaggc aaacttgtcc tagaaacaga ggctcccctg 32700
catgcaagtg acagcagtag cctgggcctt agcgtcacgg ccccacttag cattaacaat 32760
gacagcctag gactagatct gcaggcaccc attgtctctc aaaatggaaa actggctcta 32820
aatatagcag gccccctagc tgtagccgat agcattaatg ctttgacagt aggcactggc 32880
aaaggtattg gactaaatga aaccagcact cacttgcaag caaaattggt tgccccccta 32940
ggctttgata ccaatggcaa tattaagcta agcgttgcag gaggcatgag gctaaacaat 33000
gacacactga tactagatgt aaactaccca tttgaagctc aaggtcaact aagcctaaga 33060
gtgggcacag gtccactgta tgtagattct agcagtcata atctaaccat tagatgcctt 33120
aggggattgt atataacatc atctaacaac caaaacggtc tagaggccaa cattaaacta 33180
acaaaaggcc ttgtgtatga aggaaatgcc atagcagtta atgttggtca aggattgcaa 33240
tacagcacta ctgccacatc ggaaggtgtg tatcctatac agtctaagat aggtttggga 33300
atggaatatg ataccaacgg agccatgatg gcaaaactag gctccggtct aagctttgat 33360
aattcaggag ccattgtggt gggaaacaaa aatgatgaca aacttaccct atggaccaca 33420
cctgacccgt ctcctaactg tagaatttat tctgaaaaag atactaaact aaccttggtg 33480
ctgactaagt gtggcagtca aatcctaggc acagtatctg cccttgctgt cagaggcagc 33540
cttgcgccca tcactaacgc atccagcata gtccaaatat ttctacgatt tgatgaaaat 33600
ggactattga tgagcaactc atcgctagac ggtgattact ggaattacag aaatggggac 33660
tccactaatg gcacaccata tacaaatgca gtaggcttta tgcctaatct agctgcctat 33720
cctaaaggtc aggctacaac tgcaaaaagc agtattgtaa gccaggtata catggatggt 33780
gatactacta aacctataac actaaaaata aactttaatg gcattgatga aacaacagaa 33840
aatacccctg ttagtaaata ttccatgaca ttctcatgga gctggcccac cgcaagctac 33900
ataggccaca cttttgcaac aaactctttt actttctcct acatcgccca agaataaaga 33960
aagcacagag atgcttgttt ttgatttcaa aattgtgtgc ttttatttat tttcaagctt 34020
acagtatttc cagtagtcat tcaaatagag cttaatgaaa ctgcatgaga acccttccac 34080
atagcttaaa ttatcaccag tgcaaatgga gaaaaaatca acataccttt ttatccagat 34140
atcatagaac tctagtggtc agttttcccc caccctccca gctcacagaa tacacagtcc 34200
tttccccccg gctggcttta aacaacacta tctcattggt aacagacata ttcttaggtg 34260
taataatcca cacggtctct tggcgggcca aacgctggtc agtgatgtta ataaactccc 34320
caggcagctc tttcaagttc acgtcgctgt ccaactgctg aagcgctcgc ggctccgact 34380
gcgcctctag cggaggcaac ggcaacaccc gatccttgat ctataaagga gtagagtcat 34440
aatcccccat aagaataggg cggtgatgct gcaacaaggc gcgcagcaac tcctgccgcc 34500
gcctttccgt acgacaggaa tgcaacgggg tggtggtctc ctccgcgata atccgcaccg 34560
ctcgcaacat cagcgtcctc gtcctccggg cacagcagcg catcctgatc tcactgagat 34620
cggcgcagta agtgcagcac aacaccaaga tgttatttaa gatcccacag tgcaaagcac 34680
tgtacccaaa gctcatggcg ggaaggacag cccccacgtg accatcatac cagatcctca 34740
ggtaaatcaa atgacgacct ctcatgaaca cgctggacat gtacatcacc tccttaggca 34800
tgtgctgatt caccacctct cgataccaca ggcatcgctg attaattaaa gacccctcga 34860
gcaccatcct gaaccaggaa gccagcacct gaccccccgc caggcactgc agggaccccg 34920
gtgaatcgca gtggcagtga agactccagc gctcgtagcc gtgaaccata gagctggtca 34980
ttatatccac attggcacaa cacagacaca ctttcataca ctttttcatg attagcagct 35040
cctctctagt caggaccata tcccaaggaa tcacccactc ttgaatcaag gtaaatccca 35100
cacagcaggg caggcctctc acataactca cgttatgcat agtgagcgtg tcgcaatctg 35160
gaaataccgg atgatcttcc atcaccgaag cccgggtctc cgtctcaaag ggaggtaaac 35220
ggtccctcgt gtagggacag tggcgggata atcgagatcg tgttgaacgt agagtcatgc 35280
caaagggaac agcggacgta ctcatatttc ctccagcaga accaagtgcg cgcgtggcag 35340
ctatccttgc gtcttctgtc tcgccgcctg ccccgctcgg tgtagtagtt gtaatacagc 35400
cactccctca gaccgtcaag gcgctccctg gcgtccggat ctataacaac accatcctgc 35460
agcgccgccc tgatgacatc caccaccgta gagtatgcca agcccagcca ggaaatgcac 35520
tcactttgac agcgagagat aggaggagcg ggaagagatg gaagaaccat gatagtaaaa 35580
gaacttttat tccaatcgat cctctacaat gtcaaagtgt agatctatca gatggcactg 35640
gtctcctccg ctgagtcgat caaaaataac agctaaacca caaacaacac gattggtcaa 35700
atgctgcaca agggcttgca gcataaaatc gcctcgaaag tccaccgcaa gcataacatc 35760
aaagccaccg cccctatcat gatctatgat aaaaacccca cagctatcca ccagacccat 35820
atagttttca tctctccatc gtgaaaaaat atttacaagc tcctccttta aatcacctcc 35880
aaccaattca aaaagttgag ccagaccgcc ctccaccttc attttcagca tgcgcatcat 35940
gattgcaaaa attcaggctc ctcagacacc tgtataagat tgagaagcgg aacgttaaca 36000
tcaatgtttc gctcgcgaag atcgcgcctc agtgcaagca tgatataatc ccacaggtcg 36060
gagcggatca gcgaggacat ctccccgcca ggaaccaact caacggagcc tatgctgatt 36120
ataatacgca tattcggggc tatgctaacc agcacggccc ccaaataggc gtactgcata 36180
ggcggcgaca aaaagtgaac agtttgggtt aaaaaatcag gcaaacactc gcgcaaaaaa 36240
gcaagaacat cataaccatg ctcatgcaaa tagatgcaag taagctcagg aacgaccaca 36300
gaaaaatgca caatttttct ctcaaacatg actgcgagcc ctgcaaaaat aaaaaagaaa 36360
cattacacaa gagtagcctg tcttacaatg ggatagacta ctctaaccaa cataagacgg 36420
gccacaacat cgcccgcgtg gccataaaaa aaattatccg tgtgattaaa aagaagcaca 36480
gatagctggc cagtcatatc cggagtcatc acgtgcgaac ccgtgtagac ccccgggttg 36540
gacacatcgg ccaaacaaag aaagcggcca atgtatcccg gaggaatgat aacactaaga 36600
cgaagataca acagaataac cccatggggg ggaataacaa agttagtagg tgaataaaaa 36660
cgataaacac ccgaaactcc ctcctgcgta ggcaaaatag cgccctcccc ttccaaaaca 36720
acatatagcg cttccacagc agccatgaca aaagactcaa aacactcaaa agactcagtc 36780
ttaccaggaa aataaaagca ctctcacagc accagcacta atcagagtgt gaaaaaggcc 36840
aagtgccgaa cgagtatata taggaattaa aaatgacgta aatgtgtaaa ggtcagaaaa 36900
cgcccagaaa aatacacaga ccaacgcccg aaacgaaaac ccgcgaaaaa atacccagaa 36960
gttcctcaac aaccgccact tccgctttcc cacgagacgt cacttcctca aaaatagcaa 37020
actacatttc ccacatatac aaaaccaaaa cccctcccct tgtcaccgcc cacaacttac 37080
atcatcacaa acgtcaaagc ctacgtcacc cgccccgccc acctcattat catattggcc 37140
acaatccaaa ataaggtata ttattgatga tg 37172
<210> 7
<211> 37187
<212> DNA
<213> Great Ape Adenovirus
<400> 7
catcatcaat aatatacctt attttggatt gaggccaata tgataatgag gtgggcgggg 60
cgaggcgggg cgggtgacgt aggacgcgcg agtagggttg ggaggtgtgg cggaagtgtg 120
gcatttgcaa gtgggaggag ctgacatgca atcttccgtc gcggaaaatg tgacgttttt 180
gatgagcgcc gcctacctcc ggaagtgcca attttcgcgc gcttttcacc ggatatcgta 240
gtaattttgg gcgggaccat gtaagatttg gccattttcg cgcgaaaagt gaaacgggga 300
agtgaaaact gaataatagg gcgttagtca tagcgcgtaa tatttaccga gggccgaggg 360
actttgaccg attacgtgga ggactcgccc aggtgttttt tacgtgaatt tccgcgttcc 420
gggtcaaagt ctccgttttt attgtcgccg tcatctgacg cggagggtat ttaaacccgc 480
tgcgctccta aagaggccac tcttgagtgc cagcgagaag agttttctcc tccgctccgt 540
ttcggcgatc gaaaaatgag acatttagcc tgcactccgg gtcttttgtc cggccgggcg 600
gcgtccgagc ttttggacgc tttgctcaat gaggttctga gcgatgattt tccgtctact 660
acccacttta gcccacctac tcttcacgaa ctgtacgatc tggatgtact ggtggatgtg 720
aacgatccca acgaggaggc ggtttctacg ttttttcccg agtctgcgct tttggctgcc 780
caggagggat ttgacctaca cactccgccg ctgcctattt tagagtctcc gctgccggag 840
cccagtggta taccttatat gcctgaactg cttcccgaag tggtagacct gacctgccac 900
gagccgggct ttccgcccag cgacgatgag ggtgagcctt ttgctttaga ctatgctgag 960
atacctgggc tcggttgcag gtcttgtgca tatcatcaga gggttaccgg agaccccgag 1020
gttaagtgtt cgctgtgcta tatgaggctg acctcttcct ttatctacag taagtttttt 1080
tgtgtaggtg ggctttttgg gtaggtgggt tttgtggcag gacaggtgta aatgttgctt 1140
gtgttttttg tacctgcagg tccggtgtcc gagccagacc cggagcccga ccgcgatccc 1200
gagccggatc ccgagcctcc tcgcaggcca aggaaattac cttccatttt gtgcaagcct 1260
aagacacctg tgaggaccag cgaggcggac agcactgact ctggcacttc tacctctcct 1320
cctgaaattc acccagtggt tcctctgggt atacatagac ctgttgctgt tagagtttgc 1380
gggcgacgcc ctgcagtaga gtgcattgag gacttgctta acgatcccga gggacctttg 1440
gacttgagca ttaaacgccc taggcaataa accccaccta agtaataaac cccacctaag 1500
taataaactt taccgccctt ggttattgag atgacgccca atgtttgctt ttgaatgact 1560
tcatgtgtat aataaaagtg agtgtggtca taggtctctt gtttgtctgg gcggggttta 1620
agggtatata agtttctcgg ggctaaactt ggttacactt gaccccaatg gaggcgtggg 1680
ggtgcttgga ggagtttgcg gacgtgcgcc gtttgctgga cgagagctct agcaatacct 1740
atagtatttg gaggtatctg tggggctcta ctcaggccaa gttggtcttc agaattaagc 1800
aggattacaa gtgcgatttt gaagagcttt ttagttcctg tggtgagctt ttgcaatcct 1860
tgaatctggg ccaccaggct atcttccagg aaaaggttct ctcgactttg gatttttcca 1920
ctcccgggcg caccgccgct tgtgtggctt ttgtgtcttt tgtgcaagat aaatggagcg 1980
gggagaccca cctgagtcac ggctacgtgc tggatttcat ggcgatggct ctttggaggg 2040
cttacaacaa atggaagatt cagaaggaac tgtacggttc cgccctacgt cgtccacttc 2100
tgcagcggca ggggctgatg tttcccgacc atcgccagca tcagaatctg gaagacgagc 2160
gagcggagaa gatcagcttg agagccggcc tggaccctcc tcaggaggaa tgaatctccc 2220
gcaggtggtt gagctgtttc ccgaactgag acgggtcctg actatcaggg aggatggtca 2280
gtttgtgaag aagctgaaga gggatcgggg tgagggagat gatgaggcgg ctagcaattt 2340
agcttttagt ctgataactc gccaccgacc ggaatgtatt acctatcagc agattaagga 2400
gagttgtgcc aacgagctgg atcttttggg tcagaagtat agcatagaac agcttaccac 2460
ttactggctt cagcccgggg atgattggga agaggcgatt agggtgtatg caaaggtggc 2520
cctgcggccc gattgcaagt ataagattac taagttggtt aatattagaa actgctgcta 2580
tatttctgga aacggggccg aagtggagat agatactgag gacagggtgg ctattaggtg 2640
ttgcatgata aacatgtggc ccgggatact ggggatggat ggggtgatat ttatgaatgt 2700
gaggttcacg ggccccaact ttaatggtac ggtgttcatg ggcaacacca acttgctcct 2760
gcatggtgcg agtttctatg ggtttaacaa cacctgtata gaggcctgga ccgatgtaaa 2820
ggttcgaggt tgttcctttt atagctgttg gaaggcggtg gtgtgtcgcc ctaaaagcag 2880
gggttctgtg aagaaatgct tgtttgaaag gtgcacccta ggtatccttt ctgagggcaa 2940
ctccagggtg cgccataatg tggcttcgaa ctgcggttgc ttcatgcaag tgaagggggt 3000
gagcgttatc aagcataact cggtctgtgg aaactgcgag gatcgcgcct ctcagatgct 3060
gacctgcttt gatggcaact gtcacctgtt gaagaccatt catataagca gtcaccccag 3120
aaaggcctgg cccgtgtttg agcataacat tctgacccgc tgttccttgc atctgggggt 3180
caggaggggt atgttcctgc cttaccagtg taactttagc cacactaaaa tcctgctgga 3240
acccgagtgc atgactaagg tcagcctgaa tggtgtgttt gatgtgagtc tgaagatttg 3300
gaaggtgctg aggtatgatg agaccaggac caggtgccga ccctgcgagt gcggcggcaa 3360
gcacatgaga aatcagcctg tgatgttgga tgtgaccgag gagcttaggc ctgaccatct 3420
ggtgctggcc tgcaccaggg ccgagtttgg gtctagcgat gaggataccg attgaggtgg 3480
gtaaggtggg cgtggctagc agggtgggcg tgtataaatt gggggtctaa ggggtctctc 3540
tgtttgtctt gcaacagccg ccgccatgag cgacaccggc aacagctttg atggaagcat 3600
ctttagtccc tatctgacag tgcgcatgcc tcactgggcc ggagtgcgtc agaatgtgat 3660
gggttccaac gtggatggac gtcccgttct gccttcaaat tcgtctacta tggcctacgc 3720
gaccgtggga ggaactccgc tggacgccgc gacctccgcc gccgcctccg ccgccgccgc 3780
gaccgcgcgc agcatggcta cggaccttta cagctctttg gtggcgagca gcgcggcctc 3840
tcgcgcgtct gctcgggatg agaaactgac tgctctgctg cttaaactgg aagacttgac 3900
ccgggagctg ggtcaactga cccagcaggt ttccagcttg cgtgagagca gccttgcctc 3960
cccctaatgg cccataatat aaataaaagc cagtctgttt ggattaagca agtgtatgtt 4020
ctttatttaa ctctccgcgc gcggtaagcc cgggaccagc ggtctcggtc gtttagggtg 4080
cggtggattt tttccaacac gtggtacagg tggctctgga tgtttagata catgggcatg 4140
agtccatccc tggggtggag gtagcaccac tgcagagctt cgtgctcggg ggtggtgttg 4200
tatatgatcc agtcgtagca ggagcgctgg gcgtggtgct gaaaaatgtc cttaagcaag 4260
aggcttatag ctagggggag gcccttggtg taagtgttta caaatctgct tagctgggag 4320
gggtgcatcc ggggggatat gatgtgcatc ttggactgga tttttaggtt ggctatgttc 4380
ccgcccagat cccttctggg attcatgttg tgcaggacca ccagcacggt atatccagtg 4440
cacttgggaa atttatcgtg gagcttagac gggaatgcat ggaagaactt ggagacgccc 4500
ttgtggcctc ccagattttc catacattcg tccatgatga tggcaatggg cccgtgggaa 4560
gctgcctgag caaaaacgtt tctggcatcg ctcacatcgt agttatgttc cagggtgagg 4620
tcatcatagg acatctttac gaatcggggg cgaagggtcc cggactgggg gatgatggta 4680
ccctcgggcc ccggggcgta gttcccctca cagatctgca tctcccaggc tttcatttca 4740
gagggaggga tcatatccac ctgcggggcg atgaaaaaga cagtttctgg cgcaggggag 4800
attaactggg atgagagcag gtttctgagc agctgtgact ttccacagcc ggtgggccca 4860
tatatcacgc ctatcaccgg ctgcagctgg tagttaagag agctgcagct gccgtcctcc 4920
cggagcaggg gggccacctc gttgagcata tccctgacgt ggatgttctc cctgaccagt 4980
tccgccagaa ggcgctcgcc gcccagcgaa agcagctctt gcaaggaagc aaaatttttc 5040
agcggtttca ggccatcggc cgtgggcatg tttttcagcg tctgggtcag cagctccagc 5100
ctgtcccaga gctcggtgat gtgctctacg gcatctcgat ccagcagatc tcctcgtttc 5160
gcgggttggg gcggctttcg ctgtagggca ccagccgatg ggcgtccagc ggggccagag 5220
tcatgtcctt ccatgggcgc agggtcctcg tcagggtggt ctgggtcacg gtgaaggggt 5280
gcgctccggg ttgggcactg gccagggtgc gcttgaggct ggttctgctg gtgctgaatc 5340
gctgccgctc ttcgccctgc gcgtcggcca ggtagcattt gaccatggtc tcgtagtcga 5400
gaccctcggc ggcgtgcccc ttggcgcgga gctttccctt ggaggtggcg ccgcacgagg 5460
ggcactgcag gctcttcagg gcgtagagct tgggagcgag aaacacggac tctggggagt 5520
aggcgtccgc gccgcaggcc gagcagaccg tctcgcattc caccagccaa gtgagttccg 5580
ggcggtcagg gtcaaaaacc aggttgcccc catgcttttt gatgcgtttc ttaccttggc 5640
tctccatgag gcggtgtccc ttctcggtga cgaagaggct gtccgtgtcc ccgtagaccg 5700
acttcagggg cctgtcttcc agcggagtgc ctctgtcctc ctcgtagaga aactctgacc 5760
actctgagac gaaggcccgc gtccaggcca ggacgaagga ggccacgtgg gaggggtagc 5820
ggtcgttgtc cactagcggg tccaccttct ccagggtgtg caggcacatg tccccctcct 5880
ccgcgtccag aaaagtgatt ggcttgtagg tgtaggacac gtgaccgggg gttcccaacg 5940
ggggggtata aaagggggtg ggtgcccttt catcttcact ctcttccgca tcgctgtctg 6000
cgagagccag ctgctggggt aagtattccc tctcgaaggc gggcatgacc tcagcgctca 6060
ggttgtcagt ttctaaaaat gaggaggatt tgatgttcac ctgtccggag gtgatacctt 6120
tgagggtacc tgggtccatc tggtcagaaa acactatttt tttgttatca agcttggtgg 6180
cgaatgaccc gtagagggcg ttggagagca gcttggcgat ggagcgcagg gtctggtttt 6240
tgtcgcggtc ggctcgctcc ttggccgcga tgttgagttg cacgtactcg cgggccacgc 6300
acttccactc ggggaacacg gtggtgcgct cgtctgggat caggcgcacc ctccagccgc 6360
ggttgtgcag ggtgaccatg tcgacgctgg tggcgacctc accgcgcaga cgctcgttgg 6420
tccagcagag gcggccgccc ttgcgcgagc agaagggggg tagggggtcc agctggtcct 6480
cgtttggggg gtccgcgtcg atggtaaaga ccccggggag caggcgcggg tcaaagtagt 6540
cgatcttgca agcttgcatg tccagagccc gctgccattc gcgggcggcg agcgcgcgct 6600
cgtaggggtt gaggggcggg ccccagggca tggggtgggt gagcgcggag gcgtacatgc 6660
cgcagatgtc atacacgtac aggggttccc tgaggatacc gaggtaggtg gggtagcagc 6720
gccccccgcg gatgctggcg cgcacgtagt catagagctc gtgggagggg gccagcatgt 6780
tgggcccgag gttggtgcgc tgggggcgct cggcgcggaa gacgatctgc ctgaagatgg 6840
cgtgggagtt ggaggagatg gtgggccgct ggaagacgtt gaagcttgct tcttgcaagc 6900
ccacggagtc cctgacgaag gaggcgtagg actcgcgcag cttgtgcacc agctcggcgg 6960
tgacctggac gtcgagcgca cagtagtcga gggtctcgcg gatgatgtca tacctatcct 7020
cccccttctt tttccacagc tcgcggttga ggacgaactc ttcgcggtct ttccagtact 7080
cttggagggg aaacccgtcc gtgtccgaac ggtaagagcc tagcatgtag aactggttga 7140
cggcctggta ggggcagcag cccttctcca cgggcagcgc gtaggcctgc gccgccttgc 7200
ggagggaggt gtgggtgagg gcgaaagtgt ccctgaccat gactttgagg tattgatgtc 7260
tgaagtctgt gtcatcgcag ccgccctgtt cccacagggt gtagtccgtg cgctttttgg 7320
agcgcgggtt gggcagggag aaggtgaggt cattgaagag gatcttcccc gctcgaggca 7380
tgaagtttct ggtgatgcga aagggccctg ggaccgagga gcggttgttg atgacctggg 7440
cggccaggac gatctcgtca aagccgttta tgttgtgtcc cacgatgtag agctccagga 7500
agcggggctg gcccttgatg gaggggagct ttttaagttc ctcgtaggta agctcctcgg 7560
gcgattccag gccgtgctcc tccagggccc agtcttgcaa gtgagggttg gccgccagga 7620
aggatcgcca gaggtcgcgg gccatgaggg tctgcaggcg gtcgcggaag gttctgaact 7680
gccgccccac ggccattttt tcgggggtga tgcagtagaa ggtgaggggg tctttctccc 7740
aggggtccca tctgagctct cgggcgaggt cgcgcgcggc agcgaccaga gcctcgtcgc 7800
cccccagttt catgaccagc atgaagggca cgagttgctt gccaaaggct cccatccaag 7860
tgtaggtttc tacatcgtag gtgacaaaga ggcgctccgt gcgaggatga gagccgattg 7920
ggaagaactg gatctcccgc caccagttgg aggattggct gttgatgtgg tgaaagtaga 7980
agtcccgtct gcgggccgag cactcgtgct ggcttttgta aaagcgaccg cagtactggc 8040
agcgctgcac gggttgtata tcttgcacga ggtgaacctg gcgacctctg acgaggaagc 8100
gcagcgggaa tctaagtccc ccgcctgggg tcccgtgtgg ctggtggtct tttactttgg 8160
ttgtctggcc gccagcatct gtctcctgga gggcgatggt ggaacagacc accacgccgc 8220
gagagccgca ggtccagatc tcggcgctcg gcgggcggag tttgatgacg acatcgcgca 8280
cattggagct gtccatggtc tccagctccc gcggcggcag gtcagccggg agttcctgga 8340
ggttcacctc gcagagacgg gtcaaggcgc ggacagtgtt gagatggtat ctgatttcaa 8400
ggggcatgtt ggaggcggag tcgatggctt gcaggaggcc gcagccccgg ggggccacga 8460
tggttccccg cggggcgcga ggggaggcgg aagctggggg tgtgttcaga agcggtgacg 8520
cgggcgggcc cccggaggta gggggggttc cggccccaca ggcatgggcg gcaggggcac 8580
gtcttcgccg cgcgcgggca ggggctggtg ctggctccga agagcgcttg cgtgcgcgac 8640
gacgcgacgg ttggtgtcct gtatctggcg cctctgagtg aagaccacgg gtcccgtgac 8700
cttgaacctg aaagagagtt cgacagaatc aatctcggca tcgttgacag cggcctggcg 8760
caggatctcc tgcacgtcgc ccgagttgtc ctggtaggcg atttctgcca tgaactgctc 8820
gatctcttcc tcctggagat ctcctcgtcc ggcgcgctcc acggtggccg ccaggtcgtt 8880
ggagatgcga cccatgagct gcgagaaggc gttgagtccg ccctcgttcc agacccggct 8940
gtagaccacg cccccctcgg cgtcgcgggc gcgcatgacc acctgggcca ggttgagctc 9000
cacgtgtcgc gtgaagacgg cgtagttgcg caggcgctgg aaaaggtagt tcagggtggt 9060
ggcggtgtgc tcggcgacga agaagtacat gacccagcgc cgcaacgtgg attcattgat 9120
gtcccccaag gcctccaggc gctccatggc ctcgtagaag tccacggcga agttgaaaaa 9180
ctgggagttg cgagcggaca cggtcaactc ctcctccaga agacggatga gctcggcgac 9240
agtgtcgcgc acctcgcgct cgaaggccac ggggggcgct tcttcctctt ccacctcttc 9300
ttccatgatt gcttcttctt cttcctcagc cgggacggga gggggcggcg gcgggggagg 9360
ggcgcggcgg cggcggcggc gcaccgggag gcggtcgatg aagcgctcga tcatctcccc 9420
ccgcatgcgg cgcatggtct cggtgacggc gcggccgttc tcccgggggc gcagctcgaa 9480
gacgccgcct ctcatttcgc cgcggggcgg gcggccgtga ggtagcgaga cggcgctgac 9540
tatgcatctt aacaattgct gtgtaggtac gccgccaagg gacctgattg agtccagatc 9600
caccggatcc gaaaaccttt ggaggaaagc gtctatccag tcgcagtcgc aaggtaggct 9660
gagcaccgtg gcgggcgggg gcgggtcggg agagttcctg gcggagatgc tgctgatgat 9720
gtaattaaag taggcggtct tgagaaggcg gatggtggac aggagcacca tgtctttggg 9780
tccggcctgt tggatgcgga ggcggtcggc catgccccag gcctcgttct gacaccggcg 9840
caggtctttg tagtaatctt gcatgagtct ttccaccggc acttcttctc cttcctcttc 9900
ttcatctcgc cggtggtttc tcgcgccgcc catgcgcgtg accccaaagc ccctgagcgg 9960
ctgcagcagg gccaggtcgg cgaccacgcg ctcggccaag atggcctgct gtacctgagt 10020
gagggtcctc tcgaagtcat ccatgtccac gaagcggtgg taggcacccg tgttgatggt 10080
gtaggtgcag ttggccatga cggaccagtt gacggtctgg tgtcccggct gcgagagctc 10140
cgtgtaccgc aggcgcgaga aggcgcggga atcgaacacg tagtcgttgc aagtccgcac 10200
cagatactgg tagcccacca ggaagtgcgg cggaggttgg cgatagaggg gccagcgctg 10260
ggtggcgggg gcgccgggcg ccaggtcttc cagcatgagg cggtggtatc cgtagatgta 10320
cctggacatc caggtgatgc ctgcggcggt ggtggtggcg cgcgcgtagt cgcggacccg 10380
gttccagatg tttcgcaggg gcgagaagtg ttccatggtc ggcacgctct ggccggtgag 10440
gcgcgcgcag tcgttgacgc tctatacaca cacaaaaacg aaagcgttta cagggctttc 10500
gttctgtagc ctggaggaaa gtaaatgggt tgggttgcgg tgtgccccgg ttcgagacca 10560
agctgagctc agccggctga agccgcagct aacgtggtat tggcagtccc gtctcgaccc 10620
aggccctgta tcctccagga tacggtcgag agcccttttg ctttcttggc caagcgcccg 10680
tggcgcgatc tgggatagat ggtcgcgatg agaggacaaa agcggctcgc ttccgtagtc 10740
tggagaaaca atcgccaggg ttgcgttgcg gcgtaccccg gttcgagccc ctatggcggc 10800
ttggatcggc cggaaccgcg gctaacgtgg gctgtggcag ccccgtcctc aggaccccgc 10860
cagccgactt ctccagttac gggagcgagc cccttttgtt tttttatttt ttagatgcat 10920
cccgtgctgc ggcagatgcg cccctcgccc cggcccgatc agcagcagca acagcaggca 10980
tgcagacccc cctctcctct ccccgccccg gtcaccacgg ccgcggcggc cgtgtccggt 11040
gcggggggcg cgctggagtc agatgagcca ccgcggcggc gacctaggca gtatctggac 11100
ttggaagagg gcgagggact ggcgcggctg ggggcgagct ctccagagcg ccacccgcgg 11160
gtgcagttga aaagggacgc gcgtgaggcg tacctgccgc ggcaaaacct gtttcgcgac 11220
cgcgggggcg aggagcccga ggagatgcgg gactgcaggt tccaagcggg gcgcgagctg 11280
cgccgcggct tggacagaca gcgcctgctg cgcgaggagg actttgagcc cgacacgcag 11340
acgggcatca gccccgcgcg cgcgcacgtg gccgcggccg acctggtgac cgcctacgag 11400
cagacggtga accaggagcg caacttccaa aaaagcttca acaaccacgt gcgcacgctg 11460
gtggcgcgcg aggaggtgac cctgggtctc atgcatctgt gggacctggt ggaggcgatc 11520
gtgcagaacc ccagcagcaa gcccctgacc gcgcagctgt tcctggtggt gcagcacagc 11580
agggacaacg aggccttcag ggaggcgctg ctgaacatca ccgagccgga ggggcgctgg 11640
ctcctggacc tgataaacat cctgcagagc atagtggtgc aggagcgcag cctgagcctg 11700
gccgagaagg tggcggccat taactattct atgctgagcc tgggcaagtt ctacgctcgc 11760
aagatctaca agacccccta cgtgcccata gacaaggagg tgaagataga cagcttctac 11820
atgcgcatgg cgctgaaggt gctaaccctg agcgacgacc tgggagtgta ccgcaacgag 11880
cgcatccaca aggccgtgag cgccagccgg cggcgcgagc tgagcgaccg cgaactgatg 11940
cacagtctgc agcgcgcgct gaccggcgcg ggcgagggcg acagggaggt cgagtcctac 12000
tttgacatgg gggccgacct gcactggcag ccgagccgcc gcgccctgga agcggcgggg 12060
gcgtacggcg gccccctggc ggccgatgac gaggaagagg aggactatga gctagaggag 12120
ggcgagtacc tggaggactg acctggctgg tggtgttttg gtatagatgc aagatccgaa 12180
cgtggcggac ccggcggtcc gggcggcgct gcagagccag ccgtccggca ttaactcctc 12240
tgacgactgg gccgcggcca tgggtcgcat catggccctg accgcgcgca accccgaggc 12300
cttcaggcag cagcctcagg ctaaccggct ggcggccatc ttggaagcgg tagtgcccgc 12360
gcgctccaac cccacccacg agaaggtgct ggccatagtc aacgcgctgg cggagagcag 12420
ggccatccgg gcagacgagg ccggactggt gtacgatgcg ctgctgcagc gggtggcgcg 12480
gtacaacagc ggcaacgtgc agaccaacct ggaccgcctg gtgacggacg tgcgcgaggc 12540
cgtggcgcag cgcgagcgct tgcatcagga cggcaacctg ggctcgctgg tggcgctaaa 12600
cgccttcctt agcacccagc cggccaacgt accgcggggg caggaggact acaccaactt 12660
cttgagcgcg ctgcggctga tggtgaccga ggtccctcag agcgaggtgt accagtcggg 12720
gcccgactac ttcttccaga ccagcagaca gggcttgcaa accgtgaacc tgagccaggc 12780
tttcaagaac ctgcgggggc tgtggggagt gaaggcgccc accggcgacc gggctacggt 12840
gtccagcctg ctaaccccca actcgcgcct gctgctgctg ctgatcgcgc ccttcacgga 12900
cagcgggagc gtctcgcggg agacctatct gggccacctg ctgacgctgt accgcgaggc 12960
catcgggcag gcgcaggtgg acgagcacac cttccaggag atcaccagcg tgagccacgc 13020
gctggggcag gaggacacgg gcagcctgca ggcgaccctg aactacctgc tgaccaacag 13080
gcggcagaag attcccacgc tgcacagcct gacccaggag gaggagcgca tcttgcgcta 13140
cgtgcagcag agcgtgagcc tgaacctgat gcgcgacggc gtgacgccca gcgtggcgct 13200
ggacatgacc gcgcgcaaca tggaaccggg catgtacgct tcccagcggc cgttcatcaa 13260
ccgcctgatg gactacttgc atcgggcggc ggccgtgaac cccgagtact tcaccaatgc 13320
cattctgaat ccccactgga tgccccctcc gggtttctac aacggggact tcgaggtgcc 13380
tgaggtcaac gatgggttcc tctgggatga catggatgac agtgtgttct cccccaaccc 13440
gctgcgcgcc gcgtctctgc gattgaagga gggctctgac agggaaggac caaggagtct 13500
ggcctcctcc ctggctctgg gggcggtggg cgccacgggc gcggcggcgc ggggcagcag 13560
ccccttcccc agcctggcgg actctctgaa tagcgggcgg gtgagcaggc cccgcttgct 13620
aggcgaggag gagtatctga acaactccct gctgcagccc gtgagggaca aaaacgctca 13680
gcggcagcag tttcccaaca atgggataga gagcctggtg gacaagatgt ccagatggaa 13740
gacgtatgcg caggagtaca aggagtggga ggaccgccag ccgcggcccc tgccgccccc 13800
tagacagcgc tggcagcggc gcgcgtccaa ccgccgctgg aggcaggggc ccgaggacga 13860
tgatgactct gcagatgaca gcagcgtgtt ggacctgggc gggagcggga accccttttc 13920
gcacctgcgc ccacgcctgg gcaagatgtt ttaaaagaga aaaataaaaa ctcaccaagg 13980
ccatggcgac gagcgttggt tttttgttcc cttccttagt atgcggcgcg cggcgatgtt 14040
cgaggagggg cctcccccct cttacgagag cgcgatggga atttctcctg cggcgcccct 14100
gcagcctccc tacgtgcctc ctcggtacct gcaacctaca ggggggagaa atagcatctg 14160
ttactctgag ctgcagcccc tgtacgatac caccagactg tacctggtgg acaacaagtc 14220
cgcggacgtg gcctccctga actaccagaa cgaccacagc gattttttga ccacggtgat 14280
ccaaaacaac gacttcaccc caaccgaggc cagtacccag accataaacc tggacaacag 14340
gtcgaactgg ggcggcgacc tgaagactat cctgcacacc aatatgccca acgtgaacga 14400
gttcatgttc accaactctt ttaaggcgcg ggtgatggtg gcgcgcgagc agggggaggc 14460
gaagtacgag tgggtggact tcacgctgcc cgagggcaac tactcagaga ccatgactct 14520
cgacctgatg aacaatgcga tcgtggaaca ctatctgaaa gtgggcaggc agaacggggt 14580
gaaggagagc gatatcgggg tcaagtttga caccagaaac ttccgtctgg gctgggaccc 14640
tgtgaccggg ctggtcatgc cgggggtcta caccaacgag gcctttcatc ccgatatagt 14700
gctcctgccc ggctgtgggg tggacttcac ccagagccgg ctgagcaacc tgctgggcgt 14760
tcgcaagcgg caacctttcc aggagggttt caagatcacc tatgaggatc tggagggggg 14820
caacattccc gcgctccttg atctggacgc ctacgaggag agcttgaaac ccgaggagag 14880
cgctggcgac agcggcgaga gtggcgagga gcaagccggc ggcggcggca gcgcgtcggt 14940
agaaaacgaa agtactcccg cagtggcggc ggacgctgcg gaggtcgagc cggaggccat 15000
gcagcaggac gcagaggagg gcgcgcagga ggacatgaac aatggggaga tcaggggcga 15060
cactttcgcc acccggggcg aagaaaaaga ggcagaggcg gcggcggcga cggcggaagc 15120
cgaaaccgag gcagaggcag agcccgagac cgaagttatg gaagacatga atgatggaga 15180
acgtaggggt gacacgtttg ccacccgggg cgaagagaag gcggcggagg cagaagccgc 15240
ggctgaggag gcggctgcgg ctgcggccaa ggctgaggct gcggctgagg ctaaggtcga 15300
agccgatgtt gcggttgagg ctcaggctga ggaggaggcg gcggctgaag cagttaagga 15360
aaaggcccag gcagagcagg aagagaaaaa acctgtcatt caacctctaa aagaagatag 15420
caaaaagcgc agttacaacg tcattgaggg cagcaccttt acccaatacc gcagctggta 15480
cctggcttac aactacggcg acccggtcaa gggggtgcgc tcgtggaccc tgctctgcac 15540
gccggacgtc acctgcggct ccgagcagat gtactggtcg ctgccaaaca tgatgcaaga 15600
cccggtgacc ttccgttcca cgcggcaggt tagcaacttt ccggtggtgg gcgccgaact 15660
gctgccagta cactccaaga gtttttacaa cgagcaggcc gtctactccc agctgatccg 15720
ccaggccacc tctctgaccc acgtgttcaa tcgctttccc gagaaccaga ttttggcgcg 15780
cccgccggcc cccaccatca ccaccgtcag tgaaaacgtt cctgccctca cagatcacgg 15840
gacgctaccg ctgcgcaaca gcatctcagg agtccagcga gtgaccatta ctgacgccag 15900
acgccggacc tgcccctacg tttacaaggc cttgggcata gtctcgccgc gcgtcctctc 15960
cagtcgcact ttttaaaaca catccaccca cacgctccaa aatcatgtcc gtactcatct 16020
cgcccagcaa caacaccggc tgggggctgc gcgcacccag caagatgttt ggaggggcaa 16080
ggaagcgctc cgaccagcac cccgtgcgcg tgcgcggcca ctaccgcgcg ccctggggtg 16140
cgcacaagcg cgggcgcaca gggcgcacca ctgtggatga tgtcattgac tccgtagtgg 16200
agcaggcgcg ccactacaca cccggcgcgc cgaccgcctc cgccgtgtcc accgtggacc 16260
aggcgatcga aagcgtggta cagggggcgc ggcactatgc caaccttaaa agtcgccgcc 16320
gccgcgtggc gcgccgccat cgccggagac cccgggctac tgccgccgcg cgccttacca 16380
aggctctgct caagcgcgcc aggcgaactg gccaccgggc cgccatgagg gccgcacggc 16440
gggctgccgc tgccgcgagc gccgtggccc cgcgggcacg aaggcgcgcg gccgctgccg 16500
ccgccgccgc catttccagc ttggcctcga cgcggcgcgg taacatatac tgggtgcgcg 16560
actcggtgag cggcacacgt gtgcccgtgc gctttcgccc cccacggaat tagcacaaga 16620
caacatacac actgagtctc ctgctgttgt gtatcccagc ggcgaccgtc agcagcggcg 16680
acatgtccaa gcgcaaaatt aaagaagaga tgctccaggt catcgcgccg gagatctatg 16740
ggcccccgaa gaaggaggag gaggattaca agccccgcaa gctaaagcgg gtcaaaaaga 16800
aaaagaaaga tgatgacgtt gacgaggcgg tggagtttgt ccgccgcatg gcgcccaggc 16860
gccctgtgca gtggaagggt cggcgcgtgc agcgagtcct gcgccccggc accgcggtgg 16920
tctttacgcc cggcgagcgt tccacgcgca ctttcaagcg ggtgtacgat gaggtgtacg 16980
gcgacgagga tctgttggag caggccaacc atcgatttgg ggagtttgca tatgggaaac 17040
ggcctcgcga gagtctaaaa gaggacctgc tggcgctacc gctggacgag ggcaatccca 17100
ccccgagtct gaagccggtg accctgcaac aggtgctgcc tttgagcgcg cccagcgagc 17160
agaagcgagg gttaaagcgc gagggcgggg acctggcacc caccgtgcag ttgatggtgc 17220
ccaagcggca gaagctggag gacgtgctgg agaaaatgaa agtagagccc gggatccagc 17280
ccgagatcaa ggtccgccct atcaagcagg tggcgcccgg cgtgggagtc cagaccgtgg 17340
acgttaggat tcccacggag gagatggaaa cccaaaccgc cactccctct tcggcagcaa 17400
gcgccaccac cggcgccgct tcggtagagg tgcagacgga cccctggcta cccgccgcca 17460
ctatcgccgt cgccgccgcc ccccgttcgc gcggacgcaa gagaaattat ccagcggcca 17520
gcgcgcttat gccccagtat gcgctgcatc catccatcgc gcccaccccc ggctaccgcg 17580
ggtactcgta ccgcccgcgc agatcagccg gcactcgcgg ccgccgccgc cgtgcgacca 17640
caaccagccg ccgccgtcgc cgccgccgcc agccagtgct gacccccgtg tctgtaagga 17700
aggtggctcg ctcggggagc acgctggtgg tgcccagagc gcgctaccac cccagcatcg 17760
tttaaagccg gtctctgtat ggttcttgca gatatggccc tcacttgtcg ccttcgcttc 17820
ccggtgccgg gataccgagg aagaactcac cgccgcaggg gcatggcggg cagcggtctc 17880
cgcggcggcc gtcgccatcg ccggcgcgca aagagcaggc gcatgcgcgg cggtgtgttg 17940
cccctgctgg tcccgctact cgccgcggcg atcggcgccg tgcccgggat cgcctccgtg 18000
gccctgcagg cgtcccagaa acattgactc ttgcaacctt gcaagcttgc atttttggag 18060
gaaaaaataa aaaagtctag actctcacgc tcgcttggtc ctgtgactat tttgtagaaa 18120
aaagatggaa gacatcaact ttgcgtcgct ggccccgcgt cacggctcgc gcccgttcat 18180
gggagactgg acagatatcg gcaccagcaa tatgagcggt ggcgccttca gctggggcag 18240
tctgtggagc ggccttaaaa attttggttc caccattaag aactatggca acaaagcgtg 18300
gaacagcagc acgggtcaga tgctgagaga caagttgaaa gagcagaact tccaggagaa 18360
ggtggcgcag ggcctggcct ctggcatcag cggggtggtg gacatagcta accaggccgt 18420
gcagaaaaag ataaacagtc atctggaccc ccgccctcag gtggaggaaa cgcctccagc 18480
catggagacg gtgtctcccg agggcaaagg cgaaaagcgc ccgcggcccg acagggaaga 18540
gaccctggtg tcacacaccg aggagccgcc ctcttacgag gaggcagtca aggccggcct 18600
gcccaccact cgccccatag ctcccatggc caccggtgtg gtgggtcaca ggcaacacac 18660
ccccgcaaca ctagatctgc ccccgccgtc cgagccgact cgccagccaa aggcggtgac 18720
ggtgtccgct ccctccactt ccgccgccaa cagagtgcct ctgcgccgcg ctgcgagcgg 18780
cccccgggcc tcgcgagtca gcggcaactg gcagagcaca ctgaacagca tcgtgggcct 18840
gggagtgagg agtgtgaagc gccgccgttg ctactgaatg agcaagctag ctaacgtgtt 18900
gtatgtgtgt atgcgtccta tgtcgccgcc agaggagctg ttgagccgcc ggcgccgtct 18960
gcactccagc gaatttcaag atggcgaccc catcgatgat gcctcagtgg tcgtacatgc 19020
acatctcggg ccaggacgct tcggagtacc tgagccccgg gctggtgcag ttcgcccgcg 19080
ccacagacac ctacttcaac atgagtaaca agttcaggaa ccccactgtg gcgcccaccc 19140
acgatgtgac cacggaccgg tcgcagcgcc tgacgctgcg gttcatcccc gtggatcggg 19200
aggacaccgc ttactcttac aaggcgcggt tcacgctggc cgtgggcgac aaccgcgtgc 19260
tggacatggc ctccacttac tttgacatcc ggggggtgct ggacaggggc cccactttta 19320
agccctactc gggcactgcc tacaaccccc tggcccccaa gggcgccccc aattcttgtg 19380
agtgggaaca agaggaaaat caggtggtcg ctgcagatga ggacctggaa gaagatgaag 19440
aagcgcaagc agaagagcaa gcccctgtta aaaaaattca tgtatatgct caggcgcctc 19500
ttgctggcga aaagattacc aaggatggtt tgcaaatagg tactgaagtc gtaggagata 19560
catctaagga cacttttgca gataaaacat tccaacccga acctcagata ggcgagtctc 19620
agtggaacga ggctgatgcc acagcagcag gaggtagagt tttgaaaaag actaccccta 19680
tgagaccttg ctatggatcc tatgccaggc ctaccaatgc caacgggggt caaggaatta 19740
tggttgccaa tgaacaagga gtgttgcagt ctaaagtaga aatgcaattt ttctctaaca 19800
cctcaaccct taatgcgcgg gatggaaccg gcaatcccga accaaaggtg gtgttgtaca 19860
gcgaagatgt ccacttggaa tctcccgata ctcatctgtc ttacaagccc aaaaaggatg 19920
atgttaatgc caaagtcatg ttgggtcagc aagccatgcc caacagaccc aacctcattg 19980
gatttagaga taatttcatt gggcttatgt tttacaacag caccggtaac atgggagtgc 20040
tggcgggtca ggcctctcag ttgaatgctg tggtggactt gcaggataga aacacagaac 20100
tgtcatatca gcttatgctt gattcaattg gggatagaac cagatacttc tccatgtgga 20160
accaggcagt ggatagctat gatccagatg tcagaattat tgaaaaccat ggggttgagg 20220
atgaactgcc caactactgc ttccctttgg gcggcatagg aattactgat acttatcaag 20280
gggtgaaaaa tagcaatggc aatggtcagt ggaccaaaga tgatcagttc gcggaccgca 20340
acgaaatagg ggtgggaaac aacttcgcca tggagatcaa catccaggcc aacctttgga 20400
gaaacttcct ctatgcaaac gtggggctct acctgccaga caagctcaag tacaacccca 20460
ccaacgtgga catctctgac aaccccaaca cctatgacta catgaacaag cgggtggtgg 20520
cccctggcct ggtggactgc tttgtcaatg tgggagccag gtggtccctg gactacatgg 20580
acaacgtcaa ccccttcaac caccaccgca atgcgggtct gcgctaccgc tccatgatcc 20640
tgggcaacgg gcgctatgtg ccctttcaca tccaggtacc ccagaagttc tttgccatca 20700
agaacctcct gctcctgccc ggctcctaca cctacgagtg gaacttcagg aaggatgtga 20760
acatggtcct acagagctct ctgggcaatg accttagggt ggatggggcc agcatcaagt 20820
ttgacagcat caccctctat gctacatttt tccccatggc ccacaacacc gcctccacgc 20880
ttgaggccat gctgagaaac gacaccaacg accagtcctt taatgactac ctctctgggg 20940
ccaacatgct ctacccaatc ccagccaagg ccaccaacgt gcccatctcc atcccctctc 21000
gcaactgggc cgcctttaga ggctgggcct ttacccgcct taagaccaag gagaccccct 21060
ccctgggctc gggttttgat ccctactttg tttactcggg atccatcccc tacctggatg 21120
gcaccttcta cctcaaccac actttcaaga agatatccat catgtatgac tcctccgtca 21180
gctggccggg caacgaccgc ttgctcaccc ccaatgagtt cgaggtcaag cgcgccgtgg 21240
acggcgaggg ctacaacgtg gcccagtgca acatgaccaa ggactggttc ctggtgcaga 21300
tgctggccaa ctacaacata ggctaccagg gcttttacat cccagagagc tacaaggaca 21360
ggatgtactc cttcttcaga aatttccaac ccatgagccg acaggtggtg gacgagacca 21420
attacaagga ctatcaagcc attggcatca cccaccagca caacaactcg ggtttcgtgg 21480
gctacctggc gcccaccatg cgcgagggtc aggcctaccc cgccaacttc ccctacccct 21540
tgataggcaa gaccgcggtc gacagcgtca cccagaaaaa gttcctctgc gaccgcaccc 21600
tctggcgcat ccccttctct agcaacttca tgtccatggg tgcgctcacg gacctgggcc 21660
aaaacctgct ttatgccaac tctgcccatg cgctggacat gacttttgag gtggacccca 21720
tggacgagcc cacccttctc tatattgtgt ttgaagtgtt cgacgtggtc agagtgcacc 21780
agccgcaccg cggtgtcatc gagaccgtgt acctgcgtac gcccttctca gccggcaacg 21840
ccaccaccta aggagacagc gccgccgccg cctgcatgac gggttccacc gagcaagagc 21900
tcagggccat tgccagagac ctgggatgcg gaccctattt tttgggcacc tatgacaaac 21960
gcttcccggg ctttatctcc cgagacaagc tcgcctgcgc cattgtcaac acggccgcgc 22020
gcgagaccgg gggcgtgcac tggctggcct ttggctggga cccgcgctcc aaaacttgct 22080
acctctttga cccctttggc ttctccgatc agcgcctcag gcagatttat gagtttgagt 22140
acgaggggct gctgcgccgc agcgcgctcg cctcctcgcc cgaccgctgc atcacccttg 22200
agaagtccac cgaaaccgtg caggggcccc actcggccgc ctgcggtctc ttctgttgca 22260
tgtttttgca cgcctttgtg cactggcctc agagtcccat ggattgcaac cccaccatga 22320
acttgctaaa gggagtgccc aacgccatgc tccagagccc ccaggtccag cccaccctgc 22380
gccgcaacca ggaacagctt taccgcttcc tggagcgcca ctccccctac ttccgcagcc 22440
acagcgcgcg catccggggg gccacctctt tttgccactt gcaagaaaac atgcaagacg 22500
gaaaatgatg tacagcatgc ttttaataaa tgtaaagact gtgcacttta attatacacg 22560
ggctctttct ggttatttat tcaacaccgc cgtcgccatt tagaaatcga aagggttctg 22620
ccgtgcgtcg ccgtgcgcca cgggcagaga cacgttgcga tactggaagc ggctcgccca 22680
cttgaactcg ggcaccacca tgcggggcag tggttcctcg gggaagttct cgctccacag 22740
ggtgcgggtc agctgcagcg cgctcaggag gtcgggagcc gagatcttga agtcgcagtt 22800
ggggccggaa ccctgcgcgc gcgagttgcg gtacacgggg ttgcagcact ggaacaccag 22860
cagggccgga ttattcacgc tggccagcag gctctcgtcg ctgatcatgt cgctgtccag 22920
atcctccgcg ttgctcaggg cgaatggggt catcttgcag acctgcctgc ccaggaaagg 22980
cgggagccca ggcttgccgt tgcagtcgca gcgcaggggc attagcaggt gcccacggcc 23040
cgactgcgcc tgcgggtaca acgcgcgcat gaaggcttcg atctgcctaa aagccacctg 23100
ggtcttggct ccctccgaaa agaacatccc acaggacttg ctggagaact ggttcgcggg 23160
acagctggca tcgtgcaggc agcagcgcgc gtcagtgttg gcaatctgca ccacgttgcg 23220
accccaccgg tttttcacta tcttggcctt ggaagcctgc tcctttagcg cgcgctggcc 23280
gttctcgctg gtcacatcca tctctatcac ctgttccttg ttgatcatgt ttgtcccgtg 23340
cagacacttt aggtcgccct ccgtctgggt gcagcggtgc tcccacagcg cgcaaccggt 23400
gggctcccaa ttcttgtggg tcacccccgc gtaggcctgc aggtaggcct gcaggaagcg 23460
ccccatcatg gtcataaagg tcttctggct cgtaaaggtc agctgcaggc cgcgatgctc 23520
ttcgttcagc caggtcttgc agatggcggc cagcgcctcg gtctgctcgg gcagcatctt 23580
aaaatttgtc ttcaggtcgt tatccacgtg gtacttgtcc atcatggcac gcgccgcctc 23640
catgcccttc tcccaggcgg acaccatggg caggcttagg gggtttatca cttccagcgg 23700
cgaggacacc gtactttcga tttcttcttc ctccccctct tcccggcgcg cgcccccgct 23760
gttgcgcgct cttaccgcct gcaccaaggg gtcgtcttca ggcaagcgcc gcaccgagcg 23820
cttgccgccc ttgacctgct tgatcagtac cggcgggttg ctgaagccca ccatggtcag 23880
cgccgcctgc tcttcttcgt cttcgctgtc taccactatt tctggggagg ggcttctccg 23940
ctctgcggca aaggcggcgg atcgcttctt ttttttcttg ggagccgccg cgatggagtc 24000
cgccacggcg accgaggtcg agggcgtggg gctgggggtg cgcggtacca gggcctcgtc 24060
gccctcggac tcttcctctg actccaggcg gcggcggagt cgcttctttg ggggcgcgcg 24120
cgtcagcggc ggcggagacg gggacgggga cggggacggg acgccctcca cagggggtgg 24180
tcttcgcgca gacccgcggc cgcgctcggg ggtcttctcg cgctggtctt ggtcccgact 24240
ggccattgta tcctcctcct cctaggcaga gagacataag gagtctatca tgcaagtcga 24300
gaaggaggag agcttaacca ccccctcaga gaccgccgat gcgcccgccg tcgccgtcgc 24360
ccccgctacc gccgacgcgc ccgccacacc gagcgacacc cccacggacc cccccgccga 24420
cgcacccctg ttcgaggaag cggccgtgga gcaggacccg ggctttgtct cggcagagga 24480
ggatttgcaa gaggaggaga ataaggagga gaagccctca gtgccaaaag atcataaaga 24540
gcaagacgag cacgacgcag acgcacacca gggtgaagtc gggcgggggg acggagggca 24600
tggcggcgcc gactacctag acgaaggaaa cgacgtgctc ttgaagcacc tgcatcgtca 24660
gtgcgccatc gtctgcgacg ctctgcagga gcgcagcgag gtgcccctca gcgtggcgga 24720
ggtcagccgc gcctacgagc tcagcctctt ttccccccgg gtgccccccc gccgccgcga 24780
aaacggcaca tgcgagccca acccgcgcct caacttctac cccgcctttg tggtgcccga 24840
ggtcctggcc acctatcaca tcttctttca aaattgcaag atccccatct cgtgccgcgc 24900
caaccgtagc cgcgccgata agatgctggc cctgcgccag ggcgaccaca tacctgatat 24960
cgccgctttg gaagatgtgc caaagatctt cgagggtctg gggcgcaacg agaagcgggc 25020
agcaaactct ctgcaacagg aaaacagcga aaatgagagt cacactggag cgctggtgga 25080
gctggagggc gacaacgccc gcctggcggt gctcaagcgc agcatcgagg tcacccactt 25140
tgcctacccc gcgctcaacc tgccccccaa agtcatgaac gcggtcatgg acgggctgat 25200
catgcgccgc ggccggcccc tcgctccaga tgcaaacttg catgaggaga ccgaggacgg 25260
tcagcccgtg gtcagcgacg agcagctgac gcgctggctg gagagcgcgg accccgccga 25320
actggaggag cggcgcaaga tgatgatggc cgcggtgctg gtcaccgtag agctggagtg 25380
tctgcagcgc ttcttcggtg accccgagat gcagagaaag gtcgaggaga ccctacacta 25440
caccttccgc cagggctacg tgcgccaggc ttgcaagatc tccaacgtgg agctcagcaa 25500
cctggtgtcc tacctgggca tcttgcatga aaaccgcctt gggcagagcg tgctacactc 25560
caccctgcgc ggggaggcgc gccgcgacta cgtgcgcgac tgcgtttacc tcttcctctg 25620
ctacacctgg cagacggcca tgggggtctg gcagcagtgc ctggaggagc gcaacctcaa 25680
ggagctggag aagcttctgc agcgcgcgct caaagacctc tggacgggct tcaacgagcg 25740
ctcggtggcc gccgcgctag ccgacctcat cttccccgag cgcctgctca aaaccctcca 25800
gcaggggctg cccgacttca ccagccaaag catgttgcaa aattttagga actttatcct 25860
ggagcgttct ggcatcctac ccgccacctg ctgcgccctg cccagcgact ttgtccccct 25920
cgtgtaccgc gagtgccccc cgccgctgtg gggccactgc tacctgttcc aactggccaa 25980
ctacctgtcc taccacgcgg acctcatgga ggactccagc ggcgaggggc tcatggagtg 26040
ccactgccgc tgcaacctct gcacgcccca ccgctccctg gtctgcaaca cccaactgct 26100
cagcgagagt cagattatcg gtaccttcga gctacagggt ccgtcctcct cagacgagaa 26160
gtccgcggct ccggggctaa aactcactcc ggggctgtgg acttccgcct acctgcgcaa 26220
atttgtacct gaagactacc acgcccacga aatcaggttt tacgaggacc aatcccgccc 26280
gcccaaggcg gagctgaccg cctgcgtcat cacccagggc gagatcctag gccaattgca 26340
agccatccaa aaagcccgcc aagagttttt gctgaagagg ggtcgggggg tgtatctgga 26400
cccccagtcg ggtgaggagc tcaacccggt tcccccgctg ccaccgccgc gggaccttgc 26460
ttcccaggat aagcatcgcc atggctccca gaaagaagca gcagcggccg ccgctgccgc 26520
cgccccacat gctggaggaa gaggaggaat actgggacag tcaggcagag gaggtttcgg 26580
acgaggagga gccggagacg gagatggaag agtgggagga ggacagctta gacgaggagg 26640
cttccgaagc cgaagaggca ggcgcaacac cgtcaccctc ggccgcagcc ccctcgcagg 26700
cgcccccgaa gtccgctccc agcatcagca gcaacagcag cgctataacc tccgctcctc 26760
caccgccgcg acccacggcc gaccgcagac ccaaccgtag atgggacacc accggaaccg 26820
gggccggtaa gtcctccggg agaggcaagc aagcgcagcg ccaaggctac cgctcgtggc 26880
gcgctcacaa gaacgccata gtcgcttgct tgcaagactg cggggggaac atctccttcg 26940
cccgccgctt cctgctcttc caccacggtg tggccttccc ccgtaacgtc ctgcattact 27000
accgtcatct ctacagcccc tactgcggcg gcagtgagcc agaggcggcc agcggcggcg 27060
gcgcccgttt cggtgcctag gaagacccag ggcaagactt cagccaagaa actcgcggcg 27120
accgcggcga acgcggtcgc gggggccctg cgcctgacgg tgaacgaacc cctgtcgacc 27180
cgcgaactga ggaaccgaat cttccccact ctctatgcca tcttccagca gagcagaggg 27240
caggatcagg aactgaaagt aaaaaacagg tctctgcgct ccctcacccg cagctgtctg 27300
tatcacaaga gcgaagacca gcttcggcgc acgctggagg acgctgaggc actcttcagc 27360
aaatactgcg cgctcactct taaggactag ctccgcgccc ttctcgaatt taggcgggaa 27420
cgcctacgtc atcgcagcgc cgccgtcatg agcaaggaca ttcccacgcc atacatgtgg 27480
agctatcagc cgcagatggg actcgcggcg ggcgcctccc aagactactc cacccgcatg 27540
aactggctca gtgccggccc acacatgatc tcacaggtta atgacatccg cacccatcga 27600
aaccaaatat tggtgaagca ggcggcaatt accaccacgc cccgcaataa tcccaacccc 27660
agggagtggc ccgcgtccct ggtgtatcag gaaattcccg gccccaccac cgtactactt 27720
ccgcgtgatt cccaggccga agtccaaatg actaactcag gggcacagct cgcgggcggc 27780
tgtcgtcaca gggtgcggcc tcctcgccag ggtataactc acctggagat ccgaggcaga 27840
ggtattcagc tcaacgacga gtcggtgagc tcctcgctcg gtctcagacc tgacgggacc 27900
ttccagatag ccggagccgg ccgatcttcc ttcacgcccc gccaggcgta cctgactctg 27960
cagagctcgt cctcggcgcc gcgctcgggc ggcatcggga ctctccagtt cgtgcaggag 28020
tttgtgccct cggtctactt caaccccttc tcgggctctc ccggtcgcta cccggaccag 28080
tttatcccga actttgacgc cgcgagggac tcggtggacg gctacgactg aatgtcgggt 28140
ggacccggtg cagagcaact tcgcctgaag caccttgacc actgccgccg ccctcagtgc 28200
tttgcccgct gtcagaccgg tgagttccag tacttttccc tgcccgactc gcacccggac 28260
ggcccggcgc acggggtgcg ctttttcatc ccgagtcagg tccgctctac cctaatcagg 28320
gagttcaccg cccgtcccct actggcggag ttggaaaagg ggccttctat cctaaccatt 28380
gcctgcattt gctctaaccc tggattacac caagatcttt gctgtcattt gtgtgctgag 28440
tataataaag gctgagatca gaatctactc gggctcctgt cgccatcctg tcaacgccac 28500
cgtccaagcc cggcccgatc agcccgaggt gaacctcacc tgtggtctgc accggcgcct 28560
gaggaaatac ctagcttggt actacaacag cactcccttt gtggtttaca acagctttga 28620
ccaggacggg gtctcactga gggataacct ctcgaacctg agctactcca tcaggaagaa 28680
caacaccctc gagctacttc ctccttacct gcccgggact taccagtgtg tcaccggccc 28740
ctgcacccac acccacctgt tgatcgtaaa cgactctctt ccgagaacag acctcaataa 28800
ctcctctccg cagttcccca gaacaggagg tgagctcagg aaaccccggg taaagaaggg 28860
tggacaagag ttaacacttg tggggtttct ggtatatgtg acgctggtgg tggctctttt 28920
gattaaggct tttccttcca tgtctgaact atccctcttc ttttatgaac aactcgacta 28980
gtgctaacgg gaccctaccc aacgaatcgg gattgaatat cggtaaccag gttgcagttt 29040
cacttttgat taccttcata gtcctcttcc tgctagtgct gtcgcttctg tgcctgcgga 29100
tcgggggctg ctgcatccac gtttatatct ggtgctggct gtttagaagg ttcggagacc 29160
accgcaggta gaataatgct gcttaccctc tttgtcctgg cgctggctgc cagctgccaa 29220
gccttttccg aggctgactt catagagccc cagtgcaata tcacttataa atctgaacgt 29280
gccatctgta ctattctaat caaatgtgtt actcaacacg ataaggtgac tgttaaatac 29340
aaagatcaat taaaaaaaga cgcactttac agcagctggc aaccaggaga tgatcaaaaa 29400
tacaatgtaa ccgtcttcca gggcaaactc tccaaaactt acaattacaa tttcccattt 29460
gagcagatgt gtgactttgt catgtacatg gaaaagcagt acaagctgtg gcctccaact 29520
ccccagggct gtgtggaaaa tccaggctct ttctgtatga tctctctctg tgtaactgtg 29580
ctggcactaa tactcacgct tctgtatctc agatttaaat caaggcaaag cttcattgat 29640
gaaaagaaaa tgccataatc gctcaacgct tgattgctaa caccgggttt ttatccgcag 29700
aatgattgga atcaccctac taatcacctc cctccttgcg attgcccatg ggttggaacg 29760
aatcgaagtc cctgtggggg ccaatgttac cctggtgggg cctgtcggca atgctacatt 29820
aatgtgggaa aaatatacta aaaatcaatg ggtttcttac tgcactaaca aaaacagcca 29880
caagcccaga gccatctgcg atgggcaaaa tctaaccttg attgatgttc aattgctgga 29940
tgcgggctac tattatgggc agctgggtac aatgattaat tactggagac cccacagaga 30000
ttacatgctt cacgtagtaa agggtcccat tagcagccca accaccacct ctaccacacc 30060
cactaccacc actactccca ccaccagcac tgccgcccag cctcctcata gcagaacaac 30120
cacttttatc aattccaagt cccactcccc ccacattgcc ggcgggccct ccgcctcaga 30180
ctccgagacc accgagatct gcttctgcaa atgctctgac gccattgccc aggatttgga 30240
agatcacgag gaagatgagc atgactacgc agatgcatgc caggcatcag aggcagaagc 30300
gctaccggtg gccctaaaac agtatgcaga ctcccacacc acccccaacc ttcctccacc 30360
ttcccagaag ccaagtttcc tgggggaaaa tgaaactctg cctctttcca tactagctct 30420
gacatctgtt gctattttgg ccgctctgct ggtgcttcta tgctctatat gctacctgat 30480
ctgctgcaga aagaaaaaat ctcacggcca tgctcaccag cccctcatgc acttccctta 30540
ccctccagag ctgggcgacc acaaacttta agtctgcagt agctatctgc ccatcccttg 30600
tcagtcgaca gcgatgagcc ccactaatct aacagcctct ggacttacaa cattgtctct 30660
taatgagacc accgctcctc aagacctgta cgatggtgtc tccgcgctgg ttaaccagtg 30720
ggatcacctg ggcatatggt ggctcctcat aggagcagtg accctgtgcc taatcctggt 30780
ctggatcatc tgctgcatca aaagcagaag acccaggcgg cggcccatct acaggccctt 30840
cgtcatcaca cctgaagata atgatgatga tgacaccacc tccaggctgc agagcctaaa 30900
gcagctactc ttctctttta cagcatggta aattgaatca tgccccgcat tttcatctac 30960
ttgcttctcc ttccactttt tctgggctcc tctacattgg ccactgtgtc ccacatcgag 31020
gtagactgcc tcacgccctt cacagtctac ctgcttttcg gctttgtcat ctgcaccttt 31080
gtctgcagcg ttatcactgt agtgatctgc ttcatacagt gcatcgacta catctgtgtg 31140
cgggtggcct actttagaca ccacccccag tatcgcaaca gggacatagc ggctctccta 31200
agacttgttt aaatcatggc caaattacct gtgattggtc ttctgattat ctgctgcgtc 31260
ctagccgcga ttgggactca acctaatacc accaccagcg ctcccagaaa gagacatgta 31320
tcctgcagct tcaagcgtcc ctggaatata ccccaatgct ttactgatga acctgaaatc 31380
tctttggctt ggtacttcag cgtcaccgcc cttctcatct tctgcagtac ggttattgct 31440
cttgccatct acccttccct taacctgggc tggaatgctg tcaactctat ggaatatccc 31500
accttcccag aaccagacct gccagacctg gttgttctaa acgcgtttcc tcctcctcca 31560
gttcaaaatc agtttcgccc tccgtcccct acgcccactg aggtcagcta ctttaatcta 31620
acaggcggag atgactgaaa acctagacct agaaatggac ggtctctgca gcgagcaacg 31680
cacactagag aggcgccggc aaaaagcaga gctcgagcgt cttaaacaag agctccaaga 31740
cgccgtggcc atacaccagt gcaaaaaagg gctcttctgt ctggtaaaac aggccacgct 31800
cacctatgaa aaaacaggtg acacccaccg cctaggatac aagctgccca cacagcgcca 31860
aaagtttgcc cttatgatag gtgaacaacc catcaccgtc acccagcact ccgtggagac 31920
agaaggctgc attcatgctc cctgcagggg cgctgactgc ctctacacct tgatcaaaac 31980
cctctgcggt ctcagagacc ttatcccttt caattgatca taactgtaat caataaaaaa 32040
tcacttactt gaaatctgat agcaagactc tgtccaattt tttcagcaac acttccttcc 32100
cctcctccca actctggtac tctaggcgcc tcctagctgc aaacttcctc cacagtctga 32160
agggaatgtc agattcctcc tcctgtccct ccgcacccac gatcttcatg ttgttacaga 32220
tgaaacgcgc gagatcgtct gacgagacct tcaaccccgt gtacccctac gataccgaga 32280
tcgctccgac ttctgtccct ttccttaccc ctccctttgt atcatccgca ggaatgcaag 32340
aaaatccagc tggggtgctg tccctgcacc tgtcagagcc ccttaccacc cacaatgggg 32400
ccctgactct aaaaatgggg ggcggcctga ccctggacaa ggaagggaat ctcacttccc 32460
aaaacatcac cagtgtcgat ccccctctca aaaaaagcaa gaacaacatc agccttcaga 32520
ccgccgcacc cctcgccgtc agctccgggg ccctaaccct ttttgccact ccccccctag 32580
cggtcagtgg cgacaacctt actgtgcagt ctcaggcccc tcttactttg gaagactcaa 32640
aactaactct ggccaccaaa ggacccctaa ctgtgtccga aggcaaactt gtcctagaaa 32700
cagagcctcc cctgcatgca agtgacagca gtagcctggg ccttagcgtc acggccccac 32760
ttagcattaa caatgacagc ctaggactag acatgcaagc gcccatcagc tctcgagatg 32820
gaaaactggc tctaacagtg gcggcccccc taactgtggc cgagggtatc aatgctttgg 32880
cagtagccac aggtaatggt attggactaa atgaaaccaa cacacacctg caggcaaaac 32940
tggtcgcgcc cctaggcttt gataccaacg gcaacattaa gctaagcgtc gcaggaggca 33000
tgaggctaaa caataacaca ctgatactag atgtaaacta cccatttgag gctcaaggcc 33060
aactgagcct aagagtgggc tcgggcccac tatatgtaga ttctagtagt cataacctaa 33120
ccattagatg ccttagggga ttgtatgtaa catcttctaa caaccaaaac ggtctagagg 33180
ccaacattaa actaacaaaa ggccttgtgt atgacggaaa tgccatagca gttaatgttg 33240
gcaaagggct ggaatacagc cctactggca caacagaaaa acctatacag actaaaatag 33300
gtctaggcat ggagtatgac actgagggag ccatgatgac aaaactaggc tctggactaa 33360
gctttgacaa ttcaggagcc attgtggtgg gaaacaaaaa tgatgacagg cttactttgt 33420
ggaccacacc ggacccatcg cccaactgtc agatttactc tgaaaaagat gctaaactaa 33480
ccttggtact gactaaatgt ggcagtcagg ttgtaggcac agtatctatt gccgctctta 33540
aaggtagcct tgtgccaatc actagtgcaa tcagtgtggt tcagatatac ctaaggtttg 33600
atgaaaatgg ggtgctgatg agtaactctt cacttaatgg cgaatactgg aattttagaa 33660
acggagactc aactaatggc acaccatata caaacgcagt gggttttatg cctaatctac 33720
tggcctatcc taaaggtcaa actacaactg caaaaagtaa cattgtcagc caggtctaca 33780
tgaacgggga cgatactaaa cccatgacat ttacaatcaa cttcaatggc cttagtgaaa 33840
caggggatac ccctgtcagt aaatattcca tgacattctc atggaggtgg ccaaatggaa 33900
gctacatagg gcacaatttt gtaacaaact cctttacttt ctcctacatc gcccaagaat 33960
aaagaaagca cagagatgct tgtttttgat ttcaaaattg tgtgctttta tttattttca 34020
agcttacagt atttccagta gtcattagaa tagagcttaa ttaaactgca tgagaaccct 34080
tccacatagc ttaaattatc accagtgcaa atggaaaaaa atcaacatac ctttttatcc 34140
agatatcaaa gaactctagt ggtcagtttt cccccaccct cccagctcac agaatacaca 34200
gtcctttccc cccggctggc tttaaacaac actatctcat tggtaacaga catattttta 34260
ggtgtaataa tccacacggt ctcttggcgg gccaaacgct ggtctgtgat gttaataaac 34320
tccccaggca gctctttcaa gttcacgtcg ctgtccaact gctgaagcgc tcgcggctcc 34380
gactgcgcct ctagcggagg caacggcagc acccgatcct tgatctataa aggagtagag 34440
tcataatccc ccataagaat agggcggtga tgcagcaaca aggcgcgcag caactcctgc 34500
cgccgcctct ccgtacgaca ggaatgcaac ggggtggtgg tctcctccgc gataatccgc 34560
accgctcgca gcatcagcat cctcgtcctc cgggcacagc agcgcatcct gatctcactg 34620
agatcggcgc agtaagtgca gcacaacacc aagatgttat ttaagatccc acagtgcaaa 34680
gcactgtacc caaagctcat ggcgggaagg acagccccca cgtgaccatc gtaccagatc 34740
ctcaggtaaa tcaaatgacg acctctcata aacacgctgg acatatacat cacctccttg 34800
ggcatgagct gattcaccac ctctcgatac cacaggcatc gctgattaat taaagacccc 34860
tcgagcacca tcctgaacca ggaagccagc acctgacccc ccgccaggca ctgcagggac 34920
cccggtgaat cgcagtggca gtgaagactc cagcgctcgt agccgtgaac catagagctg 34980
gtcattatat ccacattggc acaacacaga cacactttca tacacttttt catgattagc 35040
agctcctctc tagtcaagac catatcccaa ggaatcaccc actcttgaat caaggtaaat 35100
cccacacagc agggcaggcc tctcacataa ctcacgttat gcatagtgag cgtgtcgcaa 35160
tctggaaata ccggatgatc ttccatcacc gaagcccggg tctccgtctc aaagggaggt 35220
aaacggtccc tcgtgtaggg acagtggcgg gataatcgag atcgtgttga acgtagagtc 35280
atgccaaagg gaacagcgga cgtactcata tttcctccag cagaaccaag tgcgcgcgtg 35340
gcagctatcc ctgcgtcttc tgtctcgccg cctgccccgc tcggtgtagt agttgtaata 35400
cagccactcc ctcagaccgt caaggcgctc cctggcgtcc ggatctataa caacaccgtc 35460
ctgcagcgcc gccctgatga catccaccac cgtagagtat gccaagccca gccacgaaat 35520
gcactcactt tgacagcgag agataggagg agcgggaaga gatggaagaa ccatgatagt 35580
aaaagaactt ttattccaat cgatcctcta caatgtcaaa gtgtagatct atcagatggc 35640
actggtctcc tccgctgagt cgatcaaaaa taacagctaa accacaaaca acacgattgg 35700
tcaaatgctg cacaagggct tgcagcataa aatcgcctcg aaagtccacc gcaagcataa 35760
catcaaagcc accgccccta tcatgatcta tgataaaaac cccacagcta tccaccagac 35820
ccatatagtt ttcatctctc catcgtgaaa aaatatttac aagctcctcc tttaaatcac 35880
ctccaaccaa ttcaaaaagt tgagccagac cgccctccac cttcattttc agcatgcgca 35940
tcatgattgc aaaaattcag gctcctcaga cacctgtata agattgagaa gcggaacgtt 36000
aacatcaatg tttcgctcgc gaagatcgcg cctcagtgca agcatgatat aatcccacag 36060
gtcggagcgg atcagcgagg acatctcccc gccaggaacc aactcaacgg agcctatgct 36120
gattataata cgcatattcg gggctatgct aaccagcacg gcccccaaat aggcgtactg 36180
cataggcggc gacaaaaagt gaacagtttg ggttaaaaaa tcaggcaaac actcgcgcaa 36240
aaaagcaaga acatcataac catgctcatg caaatagatg caagtaagct caggaacgac 36300
cacagaaaaa tgcacaattt ttctctcaaa catgactgcg agccctgcaa aaaataaaaa 36360
agaaacatta cacaagagta gcctgtctta caatgggata gactactcta accaacataa 36420
gacgggccac gacatcgccc gcgtggccat aaaaaaaatt atccgtgtga ttaaaaagaa 36480
gcacagatag ctggccagtc atatccggag tcatcacgtg cgaacccgtg tagacccccg 36540
ggttggacac atcggccaaa caaagaaagc ggccaatgta tcccggagga atgataacac 36600
taagacgaag atacaacaga ataaccccat gggggggaat aacaaagtta gtaggtgaat 36660
aaaaacgata aacacccgaa actccctcct gcgtaggcaa aatagcgccc tccccttcca 36720
aaacaacata cagcgcttcc acagcagcca tgacaaaaga ctcaaaacac tcaaaagact 36780
cagtcttacc aggaaaataa aagcactctc acagcaccag cactaatcag agtgtgaaga 36840
gggccaagtg ccgaacgagt atatatagga attaaaaatg acgtaaatgt gtaaaggtca 36900
aaaaacgccc agaaaaatac acagaccaac gcccgaaacg aaaacccgcg aaaaaatacc 36960
cagaagttcc tcaacaaccg ccacttccgc tttcccacga tacgtcactt cctcaaaaat 37020
agcaaactac atttcccaca tgtacaaaac caaaacccct ccccttgtca ccgcccacaa 37080
cttacataat cacaaacgtc aaagcctacg tcacccgccc cgcctcgccc cgcccacctc 37140
attatcatat tggcctcaat ccaaaataag gtatattatt gatgatg 37187
<210> 8
<211> 37175
<212> DNA
<213> Great Ape Adenovirus
<400> 8
catcatcaat aatatacctt attttggatt gtggccaata tgataatgag gtgggcgggg 60
cgggtgacgt aggacgcgcg agtagggttg ggaggtgtgc ggaagtgtgg catttgcaag 120
tgggaggagc tcacatgtaa gcttccgtcg cggaaaatgt gacgttttta atgagcgccg 180
cctacctccg gaagtgccaa ttttcgcgcg cttttcaccg gatatcgtag taattttggg 240
cgggaccatg taagatttgg ccattttcgc gcgaaaagtg aaacggggaa gtgaaaactg 300
aataataggg cgttagtcat agcgcgtaat atttaccgag ggccgaggga ctttgaccga 360
ttacgtggag gactcgccca ggtgtttttt acgtgaattt ccgcgttccg ggtcaaagtc 420
tccgttttta ttgtcaccgt catctgacgc ggagggtatt taaacccgct gcgctcctaa 480
agaggccact cttgagtgcc agcgagaaga gttttctcct ccgctccgtt tcggcgatcg 540
aaaaatgaga cacttagcct gcactccggg tcttttgtcc ggccgggcgg cgtccgagct 600
tttggacgct ttgctcaatg aggttctgag cgatgatttt ccgtctacta cccactttag 660
cccacctact cttcacgaac tgtacgatct ggatgtactg gtggatgtga acgatcccaa 720
cgaggaggcg gtttctacgt tttttcccga gtctgcgctt ttggccgccc aggagggatt 780
tgacctacac actccgccgc tgcctatttt agagtctccg ctgccggagc ccagtggtat 840
accttatatg cctgaactgc ttcccgaagt ggtagacctg acctgccacg agccgggctt 900
tccgcccagc gacgatgagg gtgagccttt tgctttagac tatgctgaga tacctgggct 960
cggttgcagg tcttgtgcat atcatcagag ggttaccgga gaccccgagg ttaagtgttc 1020
gctgtgctat atgaggctga cctcttcctt tatctacagt aagttttttg tgtaggtggg 1080
ctttttgggt aggtgggttt tgtggcagga caggtgtaaa tgttgcttgt gttttttgta 1140
cctgcaggtc cggtgtccga gccagacccg gagcccgacc gcgatcccga gccggatccc 1200
gagcctcctc gcagggcaag gaaattacct tccattttgt gcaagcctaa gacacctgtg 1260
aggaccagcg aggcggacag cactgactct ggcacttcta cctctcctcc tgaaattcac 1320
ccagtggttc ctttgggtat acataaacct gttgctatta gagtttgcgg gcgacgccct 1380
gcagtagagt gcattgagga cttgcttaac gatcccgagg gacctttgga cttgagcatt 1440
aaacgcccta ggcaataaac cccacctaag taataaaccc cacctaagta ataaacttta 1500
ccgcccttgg ttattgagat gacgcccaat gtttgctttt gaatgacttc atgtgtataa 1560
taaaagtgag tgtggtcata ggtctcttgt ttgtctgggc ggggcttaag ggtatataag 1620
tttctcgggg ctaaacttgg ttacacttga ccccaatgga ggcgtggggg tgcttggagg 1680
agtttgcgga cgtgcgccgt ttgctggacg agagctctag caatacctat agtatttgga 1740
ggtatctgtg gggctctact caggccaagt tggtctccag aattaagcag gattacaagt 1800
gcgattttga agagcttttt agttcctgtg gtgagctttt gcaatccttg aatctgggcc 1860
accaggctat cttccaggaa aaggttctct cgactttgga tttttccact cccgggcgca 1920
ccgccgcttg tgtggctttt gtgtcttttg tgcaagataa atggagcggg gagacccacc 1980
tgagtcacgg ctacgtgctg gatttcatgg cgatggctct ttggagggct tacaacaaat 2040
ggaagattca gaaggaactg tacggttccg ccctacgtcg tccacttctg cagcggcagg 2100
ggctgatgtt tcccgaccat cgccagcatc agaatctgga agacgagtcg gaggagcgag 2160
cggagaagat cagcttgaga gccggcctgg accctcctca ggaggaatga atctcccgca 2220
ggtggttgac ctgtttcccg aactgagacg ggtcctgact atcagggaag atggtcagtt 2280
tgtgaagaag ctgaagaggg atcggggtga gggagatgat gaggcggcta gcaatttagc 2340
ttttagtctg ataacccgcc accgaccgga atgtattacc tatcagcaga ttaaggagag 2400
ttgtgccaac gagctggatc ttttgggtca gaagtatagc atagaacagc ttaccactta 2460
ctggcttcag cccggggatg attgggaaga ggcgatcagg gtgtatgcaa aggtggccct 2520
gcggcccgat tgcaagtata agattactaa gttggttaat attagaaact gctgctatat 2580
ttctgggaac ggggccgaag tggagataga tactgaggac agggtggcta ttaggtgttg 2640
catgataaac atgtggcccg ggatactggg gatggatggg gtgatattta tgaatgtaag 2700
gttcacgggc cccaacttta atggtacggt gttcatgggc aacaccaact tgctcctgca 2760
tggtgcgagt ttctatgggt ttaacaacac ctgtatagag gcctggaccg atgtaaaggt 2820
tcgaggttgt tccttttata gctgttggaa ggcggtggtg tgtcgcccta aaagcagggg 2880
ttctgtgaag aaatgcttgt ttgaaaggtg caccctaggt atcctttctg agggcaactc 2940
cagggtgcgc cataatgtgg cttcgaactg cggttgcttc atgcaagtga agggggtgag 3000
cgttatcaag cataactcgg tctgtggaaa ctgcgaggat cgcgcctctc agatgctgac 3060
ctgctttgat ggcaactgtc acctgttgaa gaccattcat ataagcagtc accccagaaa 3120
ggcctggccc gtgtttgagc ataacattct gacccgctgt tccttgcatc tgggggtcag 3180
gaggggtatg ttcctgcctt accagtgtaa cttcagccac actaaaatcc tgctggaacc 3240
cgagtgcatg actaaggtca gcctgaatgg tgtgtttgat gtgagtctga agatttggaa 3300
ggtgctgagg tatgatgaga ccaggaccag gtgccgaccc tgcgagtgcg gcggcaagca 3360
catgagaaat cagcctgtga tgttggatgt gaccgaggag cttaggcctg accatctggt 3420
gctggcctgc accagggccg agtttgggtc tagcgatgag gataccgatt gaggtgggta 3480
aggtgggcgt ggctagcagg gtgggcgtgt ataaattggg ggtctaaggg gtctctctgt 3540
ttgtcttgca acagccgccg ccatgagcga caccggcaac agctttgatg gaagcatctt 3600
tagcccctat ctgacagtgc gcatgcctca ctgggccgga gtgcgtcaga atgtgatggg 3660
ttccaacgtg gatggacgtc ccgttctgcc ttcaaattcg tctacgatgg cctacgcgac 3720
cgtgggagga actccgttgg acgccgcgac ctccgccgcc gcctccgccg ccgccgcgac 3780
cgcgcgcagc atggctacgg acctttacag ctctttggtg gcgagcagcg cggcctctcg 3840
cgcgtctgct cgggatgaga aactgactgc tctgctgctt aaactggaag acttgacccg 3900
ggagctgggt caactgaccc agcaggtctc cagcttgcgt gagagcagcc ttgcctcccc 3960
ctaatggccc ataatataaa taaaagccag tctgtttgga ttaagcaagt gtatgttctt 4020
tatttaactc tccgcgcgcg gtaagcccgg gaccagcggt ctcggtcgtt tagggtgcgg 4080
tggattcttt ccaacacgtg gtacaggtgg ctctggatgt ttagatacat gggcatgagt 4140
ccatccctgg ggtggaggta gcaccactgc agagcttcgt gctcgggggt ggtgttgtat 4200
atgatccagt cgtagcagga gcgctgggcg tggtgctgaa aaatgtcctt aagcaagagg 4260
cttatagcta gggggaggcc cttggtgtaa gtgtttacaa atctgctcag ctgggagggg 4320
tgcatccggg gggatatgat gtgcatcttg gactggattt ttaggttggc tatgttccca 4380
cccagatccc ttctgggatt catgttgtgc aggaccacca gcacggtata tccagtgcac 4440
ttgggaaatt tatcgtggag cttagacggg aatgcatgga agaacttgga gacgcccttg 4500
tggcctccca gattttccat acattcgtcc atgatgatgg caatgggccc gtgggaagct 4560
gcctgagcaa aaacgtttct gggatcgctc acatcgtagt tatgttccag ggtgaggtca 4620
tcataggaca tctttacgaa tcgggggcgg agggtcccgg actgggggat gatggtaccc 4680
tcgggccccg gggcgtagtt cccctcacag atctgcatct cccaggcttt catttcagag 4740
ggagggatca tatccacctg cggggcgatg aaaaagacag tttctggcgc aggggagatt 4800
aactgggatg agagcaggtt tctgagcagc tgtgactttc cacagccggt gggcccatat 4860
atcacgccta tcaccggctg cagctggtag ttaagagagc tgcagctgcc gtcctcccgg 4920
agcagggggg ccacctcgtt gagcatatcc ctgacgtgga tgttttccct gaccagttcc 4980
gccagaaggc gctcgccgcc cagcgaaagc agctcttgca aggaagcaaa atttttcagc 5040
ggtttcaggc catcggccgt gggcatgttt ttcagcgtct gggtcagcag ctccagcctg 5100
tcccagagct cggtgatgtg ctctacggca tctcgatcca gcagatctcc tcgtttcgcg 5160
ggttggggcg gctttcgctg tagggcacca gccgatgggc gtccagcggg gccagagtca 5220
tgtccttcca tgggcgcaga gtcctcgtca gggtggtctg ggtcacggtg aaggggtgcg 5280
ctccgggttg ggcgctggcc agggtgcgct tgaggctggt tctgctggtg ctgaatcgct 5340
gccgctcttc gccctgcgcg tcggccaggt agcatttgac catggtctcg tagtcgagac 5400
cctcggcggc gtgccccttg gcgcggagct ttcccttgga ggtggcgccg cacgaggggc 5460
actgcaggct cttcagggcg tagagcttgg gagcgagaaa cacggactct ggggagtagg 5520
cgtccgcgcc gcaggccgag cagaccgtct cgcattccac cagccaagtg agttccgggc 5580
ggtcagggtc aaaaaccagg ctgcccccat gctttttgat gcgtttctta cctcggctct 5640
ccatgaggcg gtgtcccttc tcggtgacga agaggctgtc cgtgtccccg tagaccgatt 5700
tcaggggcct gtcttccagc ggagtgcctc tgtcctcctc gtagagaaac tctgaccact 5760
ctgagacaaa ggcccgtgtc caggccagga cgaaggaggc cacgtgggag gggtagcggt 5820
cgttgtccac tagcgggtcc accttctcca gggtgtgcag gcacatgtcc ccctcctccg 5880
cgtccagaaa agtgattggc ttgtaggtgt aggacacgtg accgggggtt cccgacgggg 5940
gggtataaaa gggggtgggt gccctttcat cttcactctc ttccgcatcg ctgtctgcga 6000
gagccagctg ctggggtaag tattcccttt cgaaggcggg catgacctca gcgctcaggt 6060
tgtcagtttc taaaaatgag gaagatttga tgttcacctg tccggaggtg atacctttga 6120
gggtacctgg gtctatctgg tcagaaaaca ctattttttt gttatcaagc ttggtggcga 6180
acgacccgta gagggcgttg gagagcagct tggcgatgga gcgcagggtc tggtttttgt 6240
cgcggtcggc tcgctccttg gccgcgatgt tgagttgcac gtactcgcgg gccacgcact 6300
tccactcggg gaagacggtg gtgcgctcgt ctgggatcag gcgcaccctc cagccgcggt 6360
tgtgcagggt gaccatgtcg acgctggtgg cgacctcacc gcgcaggcgc tcgttggtcc 6420
agcagaggcg gccgcccttg cgcgagcaga aggggggtag ggggtccagc tggtcctcgt 6480
tcggggggtc cgcgtcgatg gtaaagaccc cggggagcag acgcgggtca aagtagtcga 6540
tcttgcaagc ttgcatgtcc agagcccgct gccattcgcg ggcggcgagc gcgcgctcgt 6600
aggggttgag gggcgggccc cagggcatgg ggtgggtgag cgcagaggcg tacatgccgc 6660
agatgtcata cacgtacagg ggttccctga ggatgccgag gtaggtgggg tagcagcgcc 6720
ccccgcggat gctggcgcgc acgtagtcat agagttcgtg ggagggggcc agcatgttgg 6780
gcccgaggtt ggtgcgctgg gggcgctcgg cgcggaagac gatctgcctg aagatggcgt 6840
gggagttgga ggagatggtg ggccgctgga agacgttgaa gcttgcttct tgcaagccca 6900
cggagtccct gacgaaggag gcgtaggact cgcgcagctt gtgcaccagc tcggcggtga 6960
cctggacgtc gagcgcacag tagtcgaggg tctcacggat gatgtcatac ttatcctccc 7020
ccttcttttt ccacagctcg cggttgagga cgaactcttc gcggtctttc cagtactctt 7080
ggaggggaaa cccgtccgtg tccgaacggt aagagcctag catgtagaac tggttgacgg 7140
cctggtaggg gcagcagccc ttctccacgg gcagcgcgta ggcctgcgcc gccttgcgga 7200
gggaggtgtg ggtgagggcg aaagtgtccc tgaccatgac tttgaggtat tgatgtctga 7260
agtctgtgtc atcgcagccg ccctgttccc acagggtgta gtccgtgcgc tttttggagc 7320
gcgggttggg cagggagaag gtgaggtcat tgaagaggat cttccccgct cgaggcatga 7380
agtttctggt gatgcgaaag ggccctggga ccgaggagcg gttgttgatg acctgggcgg 7440
ccaggacgat ctcgtcaaag ccgtttatgt tgtggcccac gatgtagagc tccaggaagc 7500
ggggctggcc cttgatggag gggagctttt taagttcctc gtaggtgagc tcctcgggcg 7560
attccaggcc gtgctcctcc agggcccagt cttgcaagtg agggttggcc gccaggaagg 7620
atcgccagag gtcgcgggcc atgagggtct gcaggcggtc gcggaaggtt ctgaactgtc 7680
gccccacggc catcttttcg ggggtgatgc aatagaaggt gagggggtct ttctcccagg 7740
ggtcccatct gagctctcgg gcgaggtcgc gtgcggcggc gaccagagcc tcgtcgcccc 7800
ccagtttcat gaccagcatg aagggcacga gctgcttgcc aaaggctccc atccaagtgt 7860
aggtctctac atcgtaggtg acaaagaggc gctccgtgcg aggatgagag ccgatcggga 7920
agaactggat ctcccgccac cagttggagg attggctgtt gatgtggtga aagtagaagt 7980
cccgtctgcg ggccgagcac tcgtgctggc ttttgtaaaa gcgaccgcag tactggcagc 8040
gctgcacggg ttgtatatct tgcacgaggt gaacctggcg acctctgacg aggaagcgca 8100
gcgggaatct aagtcccccg cctggggtcc cgtgtggctg gtggtcttct actttggttg 8160
tctggccgcc agcatctgtc tcctggaggg cgatggtgga acagaccacc acgccgcgag 8220
agccgcaggt ccagatctcg gcgctcggcg ggcggagttt gatgacgaca tcgcgcacat 8280
tggagctgtc catggtctcc agctcccgcg gcggcaggtc agccgggagt tcctggaggt 8340
ttacctcgca gagacgggtc aacgcacggg cagtgttaag atggtatctg atttcaaggg 8400
gcgtgttggc ggcggagtcg atggcttgca ggaggccgca gccccggggg gccacgatgg 8460
ttccccgtgg ggcgcgaggg gaggcggaag ctgggggtgt gttcagaagc ggtgacgcgg 8520
gcgggccccc ggaggtaggg ggggttccgg ccccacaggc atgggcggca ggggcacgtc 8580
ttcgccgcgc gcgggcaggg gctggtgctg gctccgaaga gcgcttgcgt gcgcgacgac 8640
gcgacggttg gtgtcctgta tctggcgcct ctgagtgaag accacgggtc ccgtgacctt 8700
gaacctgaaa gagagttcga cagaatcaat ctcggcatcg ttgacagcgg cctggcgcag 8760
gatctcctgc acgtcgcccg agttgtcctg gtaggcgatc tctgccatga actgctcgat 8820
ctcttcctcc tggagatctc ctcgtccggc gcgctccacg gtggccgcca ggtcgttgga 8880
gatgcgaccc atgagctgcg agaaggcgtt gagtccgccc tcgttccaga cccggctgta 8940
gaccacgccc ccctcggcgt cgcgggcgcg catgaccacc tgggccaggt tgagctccac 9000
gtgtcgcgtg aagacggcgt agttgcgcag gcgctggaaa aggtagttca gggtggtggc 9060
ggtgtgctcg gcgacaaaga agtacatgac ccagcgccgc aacgtggatt cattgatgtc 9120
ccccaaggcc tccaggcgct ccatggcctc gtagaagtcc acggcgaagt tgaaaaactg 9180
ggagttgcga gcggacacgg tcaactcctc ctccagaaga cggatgagct cggcgacagt 9240
gtcgcgcacc tcgcgctcga aggccacggg gggcgcttct tcctcttcca cctcttcttc 9300
catgattgct tcttcttctt cctcagccgg gacgggaggg ggcggcggcg ggggaggggc 9360
gcggcggcgg cggcggcgca ccgggaggcg gtcgatgaag cgctcgatca tctccccccg 9420
catgcggcgc atggtctcgg tgacggcgcg gccgttctcc cgggggcgca gctcgaagac 9480
gccgcctttc atctcgccgc ggggcgggcg gccgtgaggt agcgagacgg cgctgactat 9540
gcatcttaac aattgctgtg taggtacgcc gccaagggac ctgattgagt ccagatccac 9600
cggatccgaa aacctttgga ggaaagcgtc tatccagtcg cagtcgcaag gtaggctgag 9660
caccgtggcg ggcgggggcg ggtcgggaga gttcctggcg gagatgctgc tgatgatgta 9720
attaaagtag gcggtcttga gaaggcggat ggtggacagg agcaccatgt ctttgggtcc 9780
ggcctgttgg atgcggaggc ggtcggccat gccccaggcc tcgttctgac accggcgcag 9840
gtctttgtag tagtcttgca tgagtctttc caccggcacc tcttctcctt cctcttctcc 9900
atctcgccgg tggtttctcg cgccgcccat gcgcgtgacc ccaaagcccc tgagcggctg 9960
cagcagggcc aggtcggcga ccacgcgctc ggccaagatg gcctgctgta cctgagtgag 10020
ggtcctctcg aagtcatcca tgtccacgaa gcggtggtag gcgcccgtgt tgatggtgta 10080
ggtgcagttg gccatgacgg accagttgac ggtctggtgt cccggctgcg agagctccgt 10140
gtaccgcagg cgcgagaagg cgcgggaatc gaacacgtag tcgttgcaag tccgcaccag 10200
atactggtag cccaccagga agtgcggcgg aggttggcga tagaggggcc agcgctgggt 10260
ggcgggggcg ccgggcgcca ggtcttccag catgaggcgg tggtatccgt agatgtacct 10320
ggacatccag gtgatgccgg cggcggtggt ggtggcgcgc gcgtagtcgc ggacccggtt 10380
ccagatgttt cgcaggggcg agaagtgttc catggtcggc acgctctggc cggtgaggcg 10440
cgcgcagtcg ttgacgctct atacacacac aaaaacgaaa gcgtttacag ggctttcgtt 10500
ctgtagcctg gaggaaagta aatgggttgg gttgcggtgt gccccggttc gagaccaagc 10560
tgagctcggc cggctgaagc cgcagctaac gtggtattgg cagtcccgtc tcgacccagg 10620
ccctgtatcc tccaggatac ggtcgagagc ccttttgctt tcttggccaa gcgcccgtgg 10680
cgcgatctgg gatagatggt cgcgatgaga ggacaaaagc ggctcgcttc cgtagtctgg 10740
agaaacaatc gccagggttg cgttgcggcg taccccggtt cgagccccta tggcggcttg 10800
gatcggccgg aaccgcggct aacgtgggct gtggcagccc cgtcctcagg accccgccag 10860
ccgacttctc cagttacggg agcgagcccc ttttgttttt tattttttag atgcatcccg 10920
tgctgcggca gatgcgcccc tcgccccggc ccgatcagca gcagcaacag caggcatgca 10980
gacccccctc tcctctcccc gccccggtca ccacggccgc ggcggccgtg tccggcgcgg 11040
ggggtgcgct ggagtcagat gagccaccgc ggcggcgacc taggcagtat ctggacttgg 11100
aagagggcga gggactggcg cggctggggg cgagctcccc agagcgtcac ccgcgggtgc 11160
agttgaaaag ggacgcgcgc gaggcgtacc tgccgcggca aaacctgttt cgcgaccgcg 11220
ggggcgagga gcccgaggag atgcgagact gcaggttcca agcagggcgc gagctgcgcc 11280
gcggcttgga cagagagcgc ttgctgcgcg aggaggactt tgagcccgac acgcagacgg 11340
gcatcagccc cgcgcgcgcg cacgtggccg cggccgacct ggtgaccgcc tacgagcaga 11400
cggtgaacca ggagcgcaac ttccaaaaaa gcttcaacaa ccacgtgcgc acgctggtgg 11460
cgcgcgagga ggtgaccctg ggtctcatgc atctgtggga cctggtggag gcgatcgtgc 11520
agaaccccag cagcaagccc ctgaccgcgc agctgttcct ggtggtgcag cacagcaggg 11580
acaacgatgc cttcagggag gcgctgctga acatcaccga gccggagggg cgctggctcc 11640
tggacctgat aaacatcctg cagagcatag tggtgcagga gcgcagcctg agcctggccg 11700
agaaggtggc ggccattaac tattctatgc tgagcctggg caagttctac gcccgcaaga 11760
tctacaagac cccctacgtg cccatagaca aggaggtgaa gatagacagc ttctacatgc 11820
gcatggcgct aaaggtgctg accctgagcg acgacctggg agtgtaccgc aacgagcgca 11880
tccacaaggc cgtgagcgcc agccggcggc gcgagctgag cgaccgcgag ctgatgcaca 11940
gtctgcaacg cgcgctgacc ggcgcgggcg agggcgacag ggaggtcgag tcctacttcg 12000
acatgggggc cgacctgcac tggcagccga gccgccgcgc cctggaggcg gcgggggcgt 12060
atggcggccc cctggcggcc gatggcgagg aagaggagga ctatgagcta gaggagggcg 12120
agtacctgga ggactgacct ggctggtggt gttttggtat agatgcaaga tccgaacgtg 12180
gcggacccgg cggtccgggc ggcgctgcag agccagccgt ccggcattaa ctcctctgac 12240
gactgggccg cggccatggg tcgcatcatg gccctgaccg cgcgcaaccc cgaggccttc 12300
aggcagcagc ctcaggctaa ccggctggcg gccatcttgg aagcggtagt gcccgcgcgc 12360
tccaacccca cccacgagaa ggtgctggcc atagtcaacg cgctggcgga gagcagggcc 12420
atccgggcgg acgaggccgg actggtgtac gatgcgctgc tgcagcgggt ggcgcggtac 12480
aacagcggca acgtgcaaac caacctggac cgcctggtga cggacgtgcg cgaggccgtg 12540
gcgcagcgcg agcgcttgca tcaggacggt aacctgggct cgctggtggc gctaaacgcc 12600
ttcctcagca cccagccggc caacgtaccg cgggggcagg aggactacac caacttcttg 12660
agcgcgctgc ggctgatggt gaccgaggtc cctcagagcg aagtgtacca gtcggggccc 12720
gactacttct tccagaccag cagacagggc ttgcaaaccg tgaacctgag ccaggctttc 12780
aagaacctgc gggggctgtg gggagtgaag gcgcccaccg gcgaccgggc tacggtgtcc 12840
agcctgctaa cccccaactc gcgcctgctg ctgctgctga tcgcgccctt cacggacagc 12900
gggagcgtct cgcgggagac ctatctgggc cacctgctga cgctgtaccg cgaggccatc 12960
gggcaggcgc aggtggacga gcacaccttc caggagatca ccagcgtgag ccacgcgctg 13020
gggcaggagg acacgggcag cctgcaggcg accctgaact acctgctgac caacaggcgg 13080
cagaagattc ccacgctgca cagcctgacc caggaggagg agcgcatctt gcgctacgtg 13140
cagcagagcg tgagcctgaa cctgatgcgc gacggcgtga cgcccagcgt ggcgctggac 13200
atgaccgcgc gcaacatgga accgggcatg tacgcttccc agcggccgtt catcaaccgc 13260
ctgatggact acttgcatcg ggcggcggcc gtgaaccccg agtacttcac caatgccatt 13320
ctgaatcccc actggatgcc ccctccgggt ttctacaacg gggactttga ggtgcccgag 13380
gtcaacgacg ggttcctctg ggatgacatg gatgacagtg tgttctcccc caacccgctg 13440
cgcgccgcgt ctctgcgatt gaaggagggc tctgacaggg aaggaccgag gagtttggcc 13500
tcctccctgg ctctgggggc ggtgggcgcc acgggcgcgg cggcgcgggg cagcagcccc 13560
ttccccagcc tggcggactc tctgaatagc gggcgggtga gcaggccccg cttgctaggc 13620
gaggaggagt atctgaacaa ctccctgcta cagcccgtga gggacaaaaa cgctcagcgg 13680
cagcagtttc ccaacaacgg gatagagagc ctggtggaca agatgtccag atggaagacg 13740
tatgcgcagg agtacaagga gtgggaggac cgacagccgc ggcccctgcc gccccctaga 13800
cagcgctggc agcggcgtgc gtccaaccgc cgctggaggc aggggcccga ggacgatgat 13860
gactctgcag atgacagcag cgtgttggat ctgggcggga gcgggaaccc cttttcgcac 13920
ctgcgcccac gcctgggcaa gatgttttaa aagagaaaaa taaaaactca ccaaggccat 13980
ggcgacgagc gttggttttt ttgttccctt ccttagtatg cggcgcgcgg cgatgttcga 14040
ggaggggcct cccccctctt acgagagcgc gatgggaatt tctcctgcgg cgcccctgca 14100
gcctccctac gtgcctcctc ggtacctgca acctacaggg gggagaaata gcatctgtta 14160
ctctgagctg cagcccctgt acgataccac cagactgtac ctggtggaca acaagtccgc 14220
ggacgtggcc tccctgaact accagaacga ccacagcgat tttttgacca cggtgatcca 14280
aaacaacgac ttcaccccaa ccgaggccag tacccagacc ataaacctgg acaacaggtc 14340
gaactggggc ggcgacctga agactatcct gcacaccaat atgcccaacg tgaacgagtt 14400
catgttcacc aactctttta aggcgcgggt gatggtggcg cgcgagcagg gggaggcgaa 14460
gtacgagtgg gtggacttca cgctgcccga gggcaactat tcagagacca tgactctcga 14520
cctgatgaac aatgcgatcg tggaacacta tctgaaagtg ggcaggcaga acggggtgaa 14580
ggagagcgat atcggggtca agtttgacac cagaaacttt cgtctgggct gggaccccgt 14640
gaccgggctg gtcatgccgg gggtctacac caacgaggcc tttcatcccg atatagtgct 14700
cctgcccggc tgtggggtgg actttaccca gagccggctg agcaacctgc tgggcgttcg 14760
caagcggcaa cctttccagg agggtttcaa gatcacctat gaggatctgg aggggggcaa 14820
cattcccgcg ctccttgatc tggacgccta cgaggagagc ttgaaacccg aggagagcgc 14880
tggcgacagc ggcgagagtg gcgaggagca agccggcggc ggtggcagcg cgtcggtaga 14940
aaacgaaagt actcccgcag tggcggcgga cgctgcggag gtcgagccgg aggccatgca 15000
gcaggacgca gaggagggcg cgcaggagga catgaacaat ggggagatca ggggcgacac 15060
tttcgccacc cggggcgaag aaaaagaggc agaggcggcg gcggcgacgg cggaagccga 15120
aaccgaggca gaggcagagc ccgagaccga agttatggaa gacatgaatg atggagaacg 15180
taggggtgac acgtttgcca cccggggcga agagaaggcg gcggaggcag aagccgcggc 15240
tgaggaggcg gctgcggctg cggccaaggc tgaggctgcg gctgaggcta aggtcgaagc 15300
cgatgttgcg gttgaggctc aggctgagga ggaggcggcg actgaagcag ttaaggaaaa 15360
ggcccaggca gagcaggaag agaaaaaacc tgtcattcaa cctctaaaag aagatagcaa 15420
aaagcgcagt tacaacgtca tcgagggcag cacctttacc caataccgca gctggtacct 15480
ggcttacaac tacggcgacc cggtcaaggg ggtgcgctcg tggaccctgc tctgcacgcc 15540
ggacgtcacc tgcggctccg agcagatgta ctggtcgctg ccaaacatga tgcaagaccc 15600
ggtgaccttc cgttccacgc ggcaggttag caactttccg gtggtgggcg ccgaactgct 15660
gccagtgcac tccaagagtt tttacaacga gcaggccgtc tactcccagc tgatccgcca 15720
ggccacctct ctgacccacg tgttcaatcg ctttcccgag aaccagattt tggcgcgccc 15780
gccggccccc accatcacca ccgtcagtga aaacgttcct gccctcacag atcacgggac 15840
gctaccgctg cgcaacagca tctcaggagt ccagcgagtg accattactg acgccagacg 15900
ccggacctgc ccctacgttt acaaggcctt gggcatagtc tcgccgcgcg tcctctccag 15960
tcgcactttt taaaacacat ctaccctcac gctccaaaat catgtccgta ctcatctcgc 16020
ccagcaacaa caccggctgg gggctgcgcg cgcccagcaa gatgtttgga ggggcgagga 16080
aacgctccga acagcaccca gtgcgcgtgc gcggccacta ccgcgcgccc tggggtgcgc 16140
acaagcgcgg gcgcacaggg cgcaccactg tggatgatgt cattgactcc gtagtggagc 16200
aggcgcgcca ctacacaccc ggcgcgccga ccgcctccgc cgtgtccacc gtggaccagg 16260
cgatcgaaag cgtggtacag ggggcgcggc actatgccaa ccttaaaagt cgccgccgcc 16320
gcgtggcgcg ccgccatcgc cggagacccc gggctactgc cgccgcgcgc cttaccaagg 16380
ctctgctcaa gcgcgccagg cgaactggcc accgggccgc catgagggcc gcacggcggg 16440
ctgccgctgc cgcgagcgcc gtggccccgc gggcacgaag gcgcgcggcc gctgccgccg 16500
ccgccgccat ttccagcttg gcctcgacgc ggcgcggtaa catatactgg gtgcgcgact 16560
cggtaagcgg cacacgggtg cccgtgcgct ttcgcccccc acggaattag cacaaaacaa 16620
catacacact gagtctcctg ctgttgtgta tcccagcggc gaccgtcagc agcggcgaca 16680
tgtccaagcg caaaattaaa gaagagatgc tccaggtcat cgcgccggag atctatgggc 16740
ccccgaagaa ggaggaggat gattacaagc cccgcaagct aaagcgggtc aaaaagaaaa 16800
agaaagatga tgacgttgac gaggcggtgg agtttgtccg ccgcatggcg cccaggcgcc 16860
ccgtgcagtg gaagggtcgg cgcgtgcagc gagtcctgcg ccccggcacc gcggtggtct 16920
ttacgcccgg cgagcgttcc acgcgcactt tcaagcgggt gtacgatgag gtgtacggcg 16980
acgaggatct gttggagcag gccaaccatc gctttgggga gtttgcatat gggaaacggc 17040
cccgcgagag cctaaaagag gacctgctgg cgctaccgct ggacgagggc aatcccaccc 17100
cgagtctgaa gccggtaacc ctgcaacagg tgctgccttt gagcgcgccc agcgagcaga 17160
agcgagggtt gaagcgcgag ggcggggacc tggcacccac cgtgcagttg atggtgccca 17220
agcggcagaa gctggaggac gtgctggaga aaatgaaagt agagcccggg atccagcccg 17280
aaatcaaggt ccgccccatc aagcaggtgg cgcccggcgt gggagtccag accgtggacg 17340
ttaggattcc cacggaggag atggaaaccc aaaccgccac tccctcttcg gcggctagcg 17400
ccaccaccgg cgccgcttcg gtagaggtgc agacggaccc ctggctacct gccgccactg 17460
tcgccgccgc cgccgccgcc ccccgttcgc gcgggcgcaa gagaaattat ccagcggcca 17520
gcgcgctcat gccccagtac gcactgcatc catccatcgc gcccaccccc ggctaccgcg 17580
ggtactcgta ccgcccgcgc agatcagccg gcacccgcgg ccgccgccgc cgtgcgacca 17640
caaccagccg ccgccgtcgc cgccgccgcc agccagtgct gacccccgtg tctgtaagga 17700
aggtggctcg ctcggggagc acgctggtgg tgcccagagc gcgctaccac cccagcattg 17760
tttaaagccg gtctctgtat ggttcttgca gatatggccc tcacttgtcg cctccgcttc 17820
ccggtgccgg gataccgagg aagaactcac cgccgcagag gcatggcggg cagcggtctc 17880
cgcggcggcc gtcgccatcg ccggcgcgca aagagcaggc gcatgcgcgg cggtgtgctg 17940
cccttcctaa tcccgctaat cgccgcggcg atcggtgccg tgcccgggat cgcctccgtg 18000
gccctgcagg cgtcccagaa acattgactc ttgcaacctt gcaagcttgc attttttgga 18060
ggaaaaaata aaaagtctag actctcacgc tcgcttggtc ctgtgactat tttgtagaaa 18120
aaagatggaa gacatcaact ttgcgtcgct ggccccgcgt cacggctcgc gcccgttcat 18180
gggagactgg acagatatcg gcaccagcaa tatgagcggt ggcgccttca gctggggcag 18240
tctgtggagt ggccttaaaa attttggttc caccattaag aactatggca acaaagcgtg 18300
gaacagcagc acgggccaga tgctgagaga caagttgaaa gagcagaact tccaggaaaa 18360
ggtggcgcag ggcctggcct ctggcatcag cggggtggtg gacatagcta accaggccgt 18420
gcagaaaaag ataaacagtc atctggaccc ccggcctcag gtggaggaaa cgcctccagc 18480
aatggagacg gtgtctcccg agggcaaagg cgaaaagcgc ccgcggcccg acagggaaga 18540
gaccctggtg tcacacaccg aggagccgcc ctcttacgag gaggcagtca aggccggcct 18600
gcctaccact cgccccatag cccccatggc caccggtgtg gtgggacaca ggcaacacac 18660
ccccgcaaca ctagatctgc ccccgccgtc cgatccgcct cgccagccaa aggcggcgac 18720
ggtgtccgct ccctccactt ccgccgccaa cagagtgccc ctgcgccgcg ctgcaagcgg 18780
cccccgggcc tcgcgagtca gcggcaactg gcagagcaca ctgaacagca tcgtgggcct 18840
gggagtgagg agtgtgaagc gccgccgttg ctactgaatg agcaagctag ctaacgtgtt 18900
gtatgtgtgt atgcgtccta tgtcgccgcc agaggagctg ttgagccgcc ggcgccgtct 18960
gcactccagc gaatttcaag atggcgaccc catcgatgat gcctcagtgg tcgtacatgc 19020
acatctcggg ccaggacgct tcggagtacc tgagccccgg gctggtgcag ttcgcccgcg 19080
ccacagacac ctacttcaac atgagtaaca agttcaggaa ccccactgtg gcgcccaccc 19140
acgatgtgac cacggaccgg tcgcagcgcc tgacgctgcg gtttatcccc gtggatcggg 19200
aggacaccgc ctactcttac aaggcgcggt ttacgctggc cgtgggcgac aatcgcgtgc 19260
tggacatggc ctccacttac tttgacatcc ggggggtgct ggacaggggc cccactttta 19320
agccctactc gggcactgcc tacaaccccc tggcccccaa gggcgccccc aattcttgtg 19380
agtgggaaca agaggaaact caggcggccg aggaagctgt tgacgaggaa gatgcagaag 19440
atgaagcgca accacaagag gaagcccctg ttaaaaaaat tcatgtatat gctcaggcgc 19500
ctcttgctgg cgaaaagatt accaaggatg gtttgcaaat aggtactgaa gtcgtaggag 19560
atacatctaa ggacactttt gcagataaaa cattccaacc cgaacctcag ataggcgagt 19620
ctcagtggaa cgaggctgat gccgcagtag caggaggtag agttttgaaa aagactaccc 19680
ctatgagacc ttgctatgga tcctatgcca ggcctaccaa tgccaacggg ggtcaaggaa 19740
ttatggttgc caatgaaaaa ggagtgttgc agtctaaagt agaaatgcaa tttttctcta 19800
acacctcaac ccttaatgcg cgggatggaa ccggcaatcc cgaaccaaag gtggtgttgt 19860
acagcgaaga tgtccacttg gaatctcccg atactcatct gtcttacaag cccacaaagg 19920
atgatgttaa tgccaaagtc atgttgggtc agcaagccat gcccaacaga cccaacctca 19980
ttggatttag agataatttc attgggctta tgttttacaa cagcaccggt aacatgggag 20040
tgctggcggg tcaggcctct cagttgaatg ctgtggtgga cttgcaggat agaaacacag 20100
aactgtcata tcagcttatg cttgattcaa ttggggatag aaccagatac ttctccatgt 20160
ggaaccaggc agtggatagc tatgatccag atgtcagaat tattgaaaac catggggttg 20220
aggatgaact gcccaactac tgcttccctt tgggcggcat aggaattact gatacttatc 20280
aaggggtgaa aaataccaat ggcaatggtc agtggaccaa agatgatcag ttcgcggacc 20340
gcaacgaaat aggggtggga aacaacttcg ccatggagat caacatccag gccaaccttt 20400
ggagaaactt cctctatgca aacgtggggc tctacctgcc agacaagctc aagtacaacc 20460
ccaccaacgt ggacatctct gacaacccca acacctatga ctacatgaac aagcgggtgg 20520
tggcccctgg cctggtggac tgctttgtca atgtgggagc caggtggtcc ctggactaca 20580
tggacaacgt caaccccttc aaccaccacc gcaatgcggg tctgcgctac cgctccatga 20640
tcctgggcaa cgggcgctat gtgccctttc acatccaggt accccagaag ttctttgcca 20700
tcaagaacct cctgctcctg cccggctcct acacctacga gtggaacttc aggaaggatg 20760
tgaacatggt cctacagagc tctctgggca atgaccttag ggtggatggg gccagcatca 20820
agtttgacag catcaccctc tatgctacat ttttccccat ggcccacaac accgcctcca 20880
cgcttgaggc catgctgaga aacgacacca acgaccagtc ctttaatgac tacctctctg 20940
gggccaacat gctctaccca atcccagcca aggccaccaa cgtgcccatc tccatcccct 21000
ctcgcaactg ggccgccttt agaggctggg cctttacccg ccttaagacc aaggagaccc 21060
cctccctggg ctcgggtttt gatccctact ttgtttactc gggatccatc ccctacctgg 21120
atggcacctt ctacctcaac cacactttca agaagatatc catcatgtat gactcctccg 21180
tcagctggcc gggcaacgac cgcttgctca cccccaatga gttcgaggtc aagcgcgccg 21240
tggacggcga gggctacaac gtggcccagt gcaacatgac caaggactgg ttcctggtgc 21300
agatgctggc caactacaac ataggctacc agggctttta catcccagag agctacaagg 21360
acaggatgta ctccttcttc agaaatttcc aacccatgag ccgacaggtg gtggacgaga 21420
ccaattacaa ggactatcaa gccattggca tcacccacca gcacaacaac tcgggtttcg 21480
tgggctacct ggcgcccacc atgcgcgagg gtcaggccta ccccgccaac ttcccctacc 21540
ccttgatagg caagaccgcg gtcgacagcg tcacccagaa aaagttcctc tgcgaccgca 21600
ccctctggcg catccccttc tctagcaact tcatgtccat gggtgcgctc acggacctgg 21660
gccaaaacct gctttatgcc aactctgccc atgcgctgga catgactttc gaggtggacc 21720
ccatggacga gcccaccctt ctctatattg tgtttgaagt gttcgacgtg gtcagagtgc 21780
accagccgca ccgcggtgtc atcgagaccg tgtacctgcg tacgcccttc tcagccggca 21840
acgccaccac ctaaggagac agcgccgccg cctgcatgac tggttccacc gagcaagagc 21900
tcagggccat cgccagagac ctgggatgcg gaccctactt tttgggcacc tatgacaaac 21960
gcttcccggg tttcatctcc cgagacaagc tcgcctgcgc catcgtcaac acggccgcgc 22020
gcgagaccgg gggcgtgcac tggctggcct ttggctggga cccgcgctct aaaacttgct 22080
acctctttga cccctttggc ttctctgatc agcgcctcag gcagatttat gagtttgagt 22140
acgaggggct gctgcgccgc agcgcgcttg cctcctcgcc cgaccgctgc atcacccttg 22200
agaagtccac cgagaccgtg caggggcccc actcggccgc ctgcggtctc ttctgttgca 22260
tgtttttgca cgcctttgta cactggcctc agagtcccat ggatcgcaac cccaccatga 22320
acttgctaaa gggagtgccc aacgccatgc tccagagccc ccaggtcctg cccaccctgc 22380
gccgcaacca ggaacagctc taccgcttcc tggagcgcca ctccccctac ttccgcagcc 22440
acagcgcgcg catccggggg gccacctctt tttgccactt gcaagaaaac atgcaagacg 22500
gaaaatgatg tacagcatgc ttttaataaa tgtaaagact gtgcacttta tttatacacg 22560
ggctctttct ggttatttat tcaacaccgc cgtcgccatc tagaaatcga aagggttctg 22620
ccgcgcgtcg ccgtgcgcca cgggcagaga cacgttgcga tactggaagc ggctcgccca 22680
cttgaactcg ggcaccacca tgcggggcag tggttcctcg gggaaattct cgctccacag 22740
ggtgcgggtc agctgcagcg cgctcaggag gtcgggagcc gagatcttga agtcgcagtt 22800
ggggccggaa ccctgcgcgc gcgagttgcg gtacacgggg ttgcagcact ggaacaccag 22860
cagggccgga ttattcacgc tggccagcag gctctcgtcg ctgatcatgt cgctgtccag 22920
atcctccgcg ttgctcaggg cgaatggggt catcttgcag acctgcctgc ccaggaaagg 22980
cgggagccca ggcttgccgt tgcagtcgca gcgcaggggc attagcaggt gcccacggcc 23040
cgactgcgcc tgcgggtaca acgcgcgcat gaaggcttcg atctgcctaa aagccacctg 23100
ggtcttggct ccctccgaaa agaacatccc acaggacttg ctggagaact gattcgcggg 23160
acagctggca tcgtgcaggc agcagcgcgc gtcagtgttg gcgatctgca ccacgttgcg 23220
accccaccgg tttttcacta tcttggcctt ggaagcctgc tcctttagcg cgcgctggcc 23280
gttctcgctg gtcacatcca tctctatcac ctgttccttg ttgatcatgt ttgtcccgtg 23340
cagacacttt aggtcgccct ccgtctgggt gcagcggtgc tcccacagcg cgcaaccggt 23400
gggctcccaa ttcttgtggg tcacccccgc gtaggcctgc aggtaggcct gcaggaagcg 23460
ccccatcatg gtcataaagg tcttctggct cgtaaaggtc agctgcaggc cgcgatgctc 23520
ttcgttcagc caggtcttgc agatggcggc cagcgcctcg gtctgctcgg gcagcatctt 23580
aaaatttgtc ttcaggtcgt tatccacgtg gtacttgtcc atcatggcac gcgccgcctc 23640
catgcccttc tcccaggcgg acaccatggg caggcttagg gggtttatca cttccagcgg 23700
cgaggacacc gtactttcga tttcttcttc ctccccctct tcccggcgcg cgcccccgct 23760
gttgcgcgct cttaccgcct gcaccaaggg gtcgtcttca ggcaagcgcc gcaccgagcg 23820
cttgccgccc ttgacctgct tgatcagtac cggcgggttg ctgaagccca ccatagtcag 23880
cgccgcctgc tcttcttcgt cttcgctgtc taccactatt tctggggagg ggcttctccg 23940
ctctgcggca aaggcggcgg atcgcttctt ttttttcttg ggagccgccg cgatggagtc 24000
cgccacggcg accgaggtcg agggcgtggg gctgggggtg cgcggcacca gggcctcgtc 24060
gccctcggac tcttcctctg actccaggcg gcggcggagt cgcttctttg ggggcgcgcg 24120
cgtcagcggc ggcggagacg gggacgggga cggggacggg acgccctcca cagggggcgg 24180
tcttcgcgca gacccgcggc cgcgctcggg ggtcttctcg cgctggtctt ggtcccgact 24240
ggccattgta tcctcctcct cctaggcaga gagacataag gagtctatca tgcaagtcga 24300
gaaggaggag agcttaacca ccccctctga gaccgccgtc gccgtcgccc ccgctaccgc 24360
cgacgcgccc gccacaccga gcgacacccc cgcggacccc cccgccgacg cacccctgtt 24420
cgaggaagcg gccgtggagc aggacccggg ctttgtctcg gcagaggagg atttgcaaga 24480
ggaggaggat aaggaggaga agccctcagt gccaaaagat cataaagagc aagacgagca 24540
cgacgcagac gcacaccagg gtgaagtcgg gcggggggac ggagggcatg gcggcgccga 24600
ctacctagac gaaggaaacg acgtgctctt gaagcacctg catcgtcagt gcgccatcgt 24660
ctgcgacgct ctgcaggagc gcagcgaggt gcccctcagc gtggcggagg tcagccgcgc 24720
ctacgagctc agcctctttt ccccccgggt gcccccccgc cgccgcgaaa acggcacatg 24780
cgagcccaac ccgcgcctca acttctaccc cgcctttgtg gtgcccgagg tcctggccac 24840
ctatcacatc ttctttcaaa attgcaagat ccccatctcg tgccgcgcca accgtagccg 24900
cgccgataag atgctggccc tgcgccaggg cgaccacata cctgatatcg ccgctttgga 24960
agatgtacca aagatcttcg agggtctggg tcgcaacgaa aagcgggcag caaactctct 25020
gcaacaggaa aacagcgaaa atgagagtca caccggggtg ctggtggagc tcgagggcga 25080
caacgcccgc ctggcggtgc tcaagcgcag catcgaggtc acccactttg cctaccccgc 25140
gctcaacctg ccccccaaag tcatgaacgc ggtcatggac gggctgatca tgcgccgcgg 25200
ccagcccctt gctccagatg caaacttgca tgaggagacc gaggacggcc agcccgtggt 25260
cagcgacgag cagctggcgc gctggctgga aaccgcggac cccgccgaac tggaggagcg 25320
gcgcaagatg atgatggccg cggtgctggt caccgtagag ctggagtgtc tgcagcgctt 25380
cttcggtgac cccgagatgc agagaaaggt cgaggagacc ctacactaca ccttccgcca 25440
gggctacgtg cgccaggctt gcaagatctc caacgtggag ctcagcaacc tggtgtccta 25500
cctgggcatc ttgcatgaga accgccttgg gcagagcgtg ctgcactcca ccctgcgcgg 25560
ggaagcgcgc cgcgactacg tgcgcgactg cgtttacctt ttcctctgct acacctggca 25620
gacggccatg ggggtctggc agcagtgcct ggaggagcgc aacctcaagg agctggagaa 25680
gctcctgcag cgcgcgctca aagacctctg gacgggcttc aacgagcgct cggtggccgc 25740
cgcgctggcc gacctcatct tccccgagcg cctgctcaaa actctccagc aggggctgcc 25800
cgacttcacc agccaaagca tgttgcaaaa ctttaggaac tttatcctgg agcgttctgg 25860
catcctaccc gccacctgct gcgccctgcc cagtgacttt gttcccctcg tgtaccgcga 25920
gtgccccccg ccgctgtggg gccactgcta cctgttccaa ctggccaact acctgtccta 25980
ccacgcggac ctcatggagg actccagcgg cgaggggctc atggagtgcc actgccgctg 26040
caacctctgc acgccccacc gctccctggt ctgcaacacc caactgctca gcgagagtca 26100
gattatcggt accttcgagc tacagggtcc gtcctcctca gacgagaagt ccgcggctcc 26160
ggggctaaaa ctcactccgg ggctgtggac ttccgcctac ctgcgcaaat ttgtacctga 26220
agactaccac gcccacgaga tcaggtttta cgaggaccaa tcccgcccgc ccaaggcgga 26280
gctgaccgcc tgcgtcatca cccagggcga gatcctaggc caattgcaag ccatccaaaa 26340
agcccgccaa gagtttttgc tgagaaaggg tcggggggtg tatctggacc cccagtcggg 26400
tgaggagctc aacccggttc ccccgctgcc gccgccgcgg gaccttgctt cccaggataa 26460
gcatcgccat ggctcccaga aagaagcagc agcggccgcc actgccgcca ccccacatgc 26520
tggaggaaga ggagtactgg gacagtcagg cagaggaggt ttcggacgag gaggagccgg 26580
agacggagat ggaagagtgg gaggaggaca gcttagacga ggaggcttcc gaagccgaag 26640
aggcaggcgc aacaccgtca ccctcggccg cagccccctc gcaggcgccc ccgaagtccg 26700
ctcccagcat cagcagcaac agcagcgcta taacctccgc tcctccaccg ccgcgaccca 26760
cggccgaccg cagacccaac cgtagatggg acaccaccgg aaccggggcc ggtaagtcct 26820
ccgggaaagg caagcaagcg cagcgccaag gctaccgctc gtggcgcgct cacaagaacg 26880
ccatagtcgc ttgcttgcaa gactgcgggg ggaacatctc cttcgcccgc cgcttcctgc 26940
tcttccacca cggtgtggcc ttcccccgta acgtcctgca ttactaccgt catctctaca 27000
gcccctactg cggcggcagt gagccagagg cggccggcgg cagcggcgcc cgtttcggtg 27060
cctaggaaga cccagggcaa gacttcagcc aagaaactcg cggcggccgc ggcgaacgcg 27120
gtcgcggggg ccctgcgcct gacggtgaac gaacccctgt cgacccgcga actgaggaac 27180
cgaatcttcc ccactctcta tgccatcttc cagcagagca gagggcagga tcaggaactg 27240
aaagtaaaaa acaggtctct gcgctccctc acccgcagct gtctgtatca caagagcgaa 27300
gaccagcttc ggcgcacgct ggaggacgct gaggcactct tcagcaaata ctgcgcgctc 27360
actcttaagg actagctccg cgcccttctc gaatttaggc gggaacgcct acgtcatcgc 27420
agcgccgccg tcatgagcaa ggacattccc acgccataca tgtggagcta tcagccgcag 27480
atgggactcg cggcgggcgc ctcccaagac tactccaccc gcatgaactg gctcagtgcc 27540
ggcccacaca tgatctcaca ggttaatgac atccgcaccc atcgaaacca aatattggtg 27600
gagcaggcgg caattaccac cacgccccgc aataatccca accccaggga gtggcccgcg 27660
tccctggtgt atcaggaaat tcccggcccc accaccgtac tacttccgcg tgattcccag 27720
gccgaagtcc aaatgactaa ctcaggggca cagctcgcgg gcggctgtcg tcacagggtg 27780
cggcctcctc gccagggtat aactcacctg gagatccgag gcagaggtat tcagctcaac 27840
gacgagtcgg tgagctcctc gctcggtctc agacctgacg ggaccttcca gatagccgga 27900
gccggccgat cttccttcac gccccgccag gcgtacctga ctctgcagag ctcgtcctcg 27960
gcgccgcgct cgggcggcat cgggactctc cagttcgtgc aggagtttgt gccctcggtc 28020
tacttcaacc ccttctcggg ctctcccggt cgctacccgg accagttcat cccgaacttt 28080
gacgccgcga gggactcggt ggacggctac gactgaatgt cgggtggacc cggtgcagag 28140
caacttcgcc tgaagcacct tgaccactgc cgccgccctc agtgctttgc ccgctgtcag 28200
accggtgagt tccagtactt ttccctgccc gactcgcacc cggacggccc ggcacacggg 28260
gtgcgctttt tcatcccgag tcaggtccgc tctaccctaa tcagggagtt tacagcccgt 28320
cccctactgg cggagttgga aaaggggcct tctatcctaa ccattgcctg catctgctct 28380
aaccctggat tacaccaaga tctttgctgt catttgtgtg ctgagtataa taaaggctga 28440
gatcagaatc tactcgggct cctgtcgcca tcctgtcaac gccaccgtcc aagcccggcc 28500
cgatcagccc gaggtgaacc tcacctgcgg tctgcaccgg cgcctgagga aatacctagc 28560
ttggtactac aacagcactc cctttgtggt ttacaacagc tttgaccagg acggggtctc 28620
actgagggat aacctctcga acctgagcta ctccatcagg aagaacaaca ccctcgagct 28680
acttcctcct tacctgcccg ggacttacca gtgtgtcacc ggtccctgca cccacaccca 28740
cctgttgatc gtaaacgact ctcttccgag aacagacctc aataactcct ctccgcagtt 28800
ccccagaaca ggaggtgagc tcaggaaacc ccgggtaaag aagggtggac aagagttaac 28860
acttgtgggg tttctggtgt atgtgacgct ggtggtggct cttttgatta aggcttttcc 28920
ttccatgtct gaactctccc tcttctttta tgaacaactc gactagtgct aacgagaccc 28980
tacccaacga atcgggattg aatatcggta accaggttgc agtttcactt ttgattacct 29040
ttatagtcct cttcctgcta gtgctgtcgc ttctgtgcct gcggatcggg ggctgctgca 29100
tccacgttta tatctggtgc tggctgttta gaaggttcgg agaccaccgc aggtagaata 29160
atgctgctta ccctctttgt cctggcgctg gctgccagct gccaagcctt ttccgaggct 29220
gacttcatag agccccagtg caatatcact tataaatctg aacgtgccat ctgtactatc 29280
ctaatcaaat gtgttactca acacgataag gtaactgtta aatacaaaga tcaattaaaa 29340
aaagacgcac tttacagcag ctggcaacca ggagatgaac aaaaatacaa tgtaaccgtc 29400
ttccagggca aactctccaa aacttacaat tacactttcc catttgagca gatgtgtgac 29460
tttgtcatgt acatggaaaa gcagtacaag ctgtggcctc caactcccca gggctgtgtg 29520
gaaaatccag gctctttctg tatgatctct ctctgtgtaa ctgtgctggc actaatactc 29580
acgcttctgt atatcagatt taaatcaagg caaagcttta ttgatgaaaa gaaaatgcct 29640
taatcgcttt cacgcttgat tgctaacacc gggtttttat ccgcagaatg attggaatca 29700
ccctactaat cacctccctc cttgcgattg cccatgggtt ggaacgaatc gaagtccctg 29760
tgggggccaa tgttaccctg gtggggcctg tcggcaatgc tacattaatg tgggaaaaat 29820
atactaaaaa tcaatgggtc tcttactgca ctaacaaaaa cagccacaag cccagagcca 29880
tctgcgatgg gcaaaattta accttgattg atgttcaatt gctggatgcg ggctactatt 29940
atgggcagct gggtacaatg attaattact ggagacccca cagagattac atgcttcacg 30000
tagtaaaggg tcccattagc agcccaacca ccacctctac cacccccact accaccacta 30060
ctcccaccac cagcactgcc gcccagcctc ctcatagcag aacaaccact tttatcaatt 30120
ccaagtccca ctccccccac attgccggcg ggccctccgc ctcagactcc gagaccaccg 30180
agatctgctt ctgcaaatgc tctgacgcca ttgcccagga tttggaagat cacgaggaag 30240
atgagcatga ctacgcagat gcatgccagg catcagagtc agaagcgctg ccggtggccc 30300
taaaacagta tgcagacccc cacaccaccc ccgaccttcc tccaccttcc cagaagccaa 30360
gtttcctggg ggaaaatgaa actctgcctc tctccatact agctctgaca tctgttgcta 30420
ttttggccgc tctgctggtg cttctatgct ctatatgcta cctgatctgc tgcagaaaga 30480
aaaaatctca cggccatgct caccagcccc tcatgcactt cccttaccct ccagagctgg 30540
gcgaccacaa actttaagtc tgcagtagct atctgcccat cccttgtcag tcgacagcga 30600
tgagccccac taatctaaca gcctctggac ttacaacatt gtctcttaat gagaccaccg 30660
ctcctcaaga cctgtacgat ggtgtctccg cgctggttaa ccagtgggat cacctgggca 30720
tatggtggct cctcatagga gcagtgaccc tgtgcctaat cctggtctgg atcatctgct 30780
gcatcaaaag cagaagaccc aggcggcggc ccatctacag gcccttcgtc atcacacctg 30840
aagataatga tgatgatgac accacctcca ggctgcagag cctaaagcag ctactcttct 30900
cttttacagc atggtaaatt gaatcatgcc ccgcattttc atctacttgc ttctccttcc 30960
actttttctg ggctcctcta cattggccgc tgtgtcccac atcgaggtag actgcctcac 31020
gcccttcaca gtctacctgc ttttcggctt tgtcatctgc acctttgtct gcagcgttat 31080
cactgtagtg atctgcttca tacagtgcat cgactacatc tgtgtgcggg tggcctactt 31140
tagacaccac ccccagtatc gcaacaggga catagcggct ctcctaagac ttgtttaaat 31200
catggccaaa ttacctgtga ttggtcttct gattatctgc tgcgtcctag ccgcgattgg 31260
gactcaacct aataccacca ccagcgctcc cagaaagaga catgtatcct gcagcttcaa 31320
gcgtccctgg aatatacccc aatgctttac tgatgaacct gaaatctctt tggcttggta 31380
cttcagcgtc accgcccttc tcatcttctg cagtacggtt attgctcttg ccatctaccc 31440
ttcccttaac ctgggctgga atgctgtcaa ctctatggaa tatcccacct tcccagaacc 31500
agacctgcca gacctggttg ttctaaacgc gtttcctcct cctccagttc aaaatcagtt 31560
tcgccctccg tcccctacgc ccactgaggt cagctacttt aatctaacag gcggagatga 31620
ctgaaaacct agacctagaa atggacggtc tctgcagcga gcaacgcaca ctagagaggc 31680
gccggcaaaa agcagagctc gagcgtctta aacaagagct ccaagacgcc gtggccatac 31740
accagtgcaa aaaagggctc ttctgtctgg taaaacaggc cacgctcacc tatgaaaaaa 31800
caggtgacac ccaccgccta ggatacaagc tgcccacaca gcgccaaaag tttgccctta 31860
tgataggtga acaacccatc accgtcaccc agcactccgt ggagacagaa ggctgcattc 31920
atgctccctg caggggcgct gactgcctct acaccttgat caaaaccctc tgcggtctca 31980
gagaccttat ccctttcaat tgatcataac tgtaatcaat aaaaaatcac ttacttgaaa 32040
tctgatagca agcctctgtc caattttttc agcaacactt ccttcccctc ttcccaactc 32100
tggtactcta ggcgcctcct agctgcaaac ttcctccaca gtctgaaggg aatgtcagat 32160
tcctcctcct cctgtccctc cgcacccaca atcttcatgt tgttgcagat gaaacgcgcg 32220
agatcgtctg acgagacctt caaccccgtg tacccctacg ataccgagat cgctccgact 32280
tctgtccctt tccttacccc tccctttgtg tcacccgcag gaatgcaaga aaatccagct 32340
ggggtgctgt ccctgcacct gtcagagccc cttaccaccc acaatggggc cctgactcta 32400
aaaatggggg gcggcctgac cctggacaag gaagggaatc tcacttccca aaacatcacc 32460
agtgtcgatc cccctctcaa aaaaagcaag aacaacatca gccttcagac cgccgcaccc 32520
ctcgccgtca gctccggggc cctaaccctt tttgccactc cccccctagc ggtcagtggc 32580
gacaacctta ctgtgcagtc tcaggcccct cttactttgg aagactcaaa actaactctg 32640
gccaccaaag gacccctaac tgtgtccgaa ggcaaacttg tcctagaaac agaggctccc 32700
ctgcatgcaa gtgacagcag tagcctgggc cttagcgtca cggccccact tagcattaac 32760
aatgacagcc taggactaga tctgcaggca cccattgtct ctcaaaatgg aaaactggct 32820
ctaaatatag caggccccct agctgtagcc gatagcatta atgctttgac agtaggcact 32880
ggcaaaggta ttggactaaa tgaaaccagc actcacttgc aagcaaaatt ggttgccccc 32940
ctaggctttg ataccaatgg caatattaag ctaagcgttg caggaggcat gaggctaaac 33000
aatgacacac tgatactaga tgtaaactac ccatttgaag ctcaaggtca actaagccta 33060
agagtgggca caggtccact gtatgtagat tctagcagtc ataatctaac cattagatgc 33120
cttaggggat tgtatataac atcatctaac aaccaaaacg gtctagaggc caacattaaa 33180
ctaacaaaag gccttgtgta tgaaggaaat gccatagcag ttaatgttgg tcaaggattg 33240
caatacagca ctactgccac atcggaaggt gtgtatccta tacagtctaa gataggtttg 33300
ggaatggaat atgataccaa cggagccatg atggcaaaac taggctccgg tctaagcttt 33360
gataattcag gagccattgt ggtgggaaac aaaaatgatg acaaacttac cctatggacc 33420
acacctgacc cgtctcctaa ctgtagaatt tattctgaaa aagatactaa actaaccttg 33480
gtgctgacta agtgtggcag tcaaatccta ggcacagtat ctgcccttgc tgtcagaggc 33540
agccttgcgc ccatcactaa cgcatccagc atagtccaaa tatttctacg atttgatgaa 33600
aatggactat tgatgagcaa ctcatcgcta gacggtgatt actggaatta cagaaatggg 33660
gactccacta atggcacacc atatacaaat gcagtaggct ttatgcctaa tctagctgcc 33720
tatcctaaag gtcaggctac aactgcaaaa agcagtattg taagccaggt atacatggat 33780
ggtgatacta ctaaacctat aacactaaaa ataaacttta atggcattga tgaaacaaca 33840
gaaaataccc ctgttagtaa atattccatg acattctcat ggagctggcc caccgcaagc 33900
tacataggcc acacttttgc aacaaactct tttactttct cctacatcgc ccaagaataa 33960
agaaagcaca gagatgcttg tttttgattt caaaattgtg tgcttttatt tattttcaag 34020
cttacagtat ttccagtagt cattcaaata gagcttaatg aaactgcatg agaacccttc 34080
cacatagctt aaattatcac cagtgcaaat ggagaaaaaa tcaacatacc tttttatcca 34140
gatatcatag aactctagtg gtcagttttc ccccaccctc ccagctcaca gaatacacag 34200
tcctttcccc ccggctggct ttaaacaaca ctatctcatt ggtaacagac atattcttag 34260
gtgtaataat ccacacggtc tcttggcggg ccaaacgctg gtcagtgatg ttaataaact 34320
ccccaggcag ctctttcaag ttcacgtcgc tgtccaactg ctgaagcgct cgcggctccg 34380
actgcgcctc tagcggaggc aacggcaaca cccgatcctt gatctataaa ggagtagagt 34440
cataatcccc cataagaata gggcggtgat gctgcaacaa ggcgcgcagc aactcctgcc 34500
gccgcctttc cgtacgacag gaatgcaacg gggtggtggt ctcctccgcg ataatccgca 34560
ccgctcgcaa catcagcgtc ctcgtcctcc gggcacagca gcgcatcctg atctcactga 34620
gatcggcgca gtaagtgcag cacaacacca agatgttatt taagatccca cagtgcaaag 34680
cactgtaccc aaagctcatg gcgggaagga cagcccccac gtgaccatca taccagatcc 34740
tcaggtaaat caaatgacga cctctcatga acacgctgga catgtacatc acctccttag 34800
gcatgtgctg attcaccacc tctcgatacc acaggcatcg ctgattaatt aaagacccct 34860
cgagcaccat cctgaaccag gaagccagca cctgaccccc cgccaggcac tgcagggacc 34920
ccggtgaatc gcagtggcag tgaagactcc agcgctcgta gccgtgaacc atagagctgg 34980
tcattatatc cacattggca caacacagac acactttcat acactttttc atgattagca 35040
gctcctctct agtcaggacc atatcccaag gaatcaccca ctcttgaatc aaggtaaatc 35100
ccacacagca gggcaggcct ctcacataac tcacgttatg catagtgagc gtgtcgcaat 35160
ctggaaatac cggatgatct tccatcaccg aagcccgggt ctccgtctca aagggaggta 35220
aacggtccct cgtgtaggga cagtggcggg ataatcgaga tcgtgttgaa cgtagagtca 35280
tgccaaaggg aacagcggac gtactcatat ttcctccagc agaaccaagt gcgcgcgtgg 35340
cagctatcct tgcgtcttct gtctcgccgc ctgccccgct cggtgtagta gttgtaatac 35400
agccactccc tcagaccgtc aaggcgctcc ctggcgtccg gatctataac aacaccatcc 35460
tgcagcgccg ccctgatgac atccaccacc gtagagtatg ccaagcccag ccaggaaatg 35520
cactcacttt gacagcgaga gataggagga gcgggaagag atggaagaac catgatagta 35580
aaagaacttt tattccaatc gatcctctac aatgtcaaag tgtagatcta tcagatggca 35640
ctggtctcct ccgctgagtc gatcaaaaat aacagctaaa ccacaaacaa cacgattggt 35700
caaatgctgc acaagggctt gcagcataaa atcgcctcga aagtccaccg caagcataac 35760
atcaaagcca ccgcccctat catgatctat gataaaaacc ccacagctat ccaccagacc 35820
catatagttt tcatctctcc atcgtgaaaa aatatttaca agctcctcct ttaaatcacc 35880
tccaaccaat tcaaaaagtt gagccagacc gccctccacc ttcattttca gcatgcgcat 35940
catgattgca aaaattcagg ctcctcagac acctgtataa gattgagaag cggaacgtta 36000
acatcaatgt ttcgctcgcg aagatcgcgc ctcagtgcaa gcatgatata atcccacagg 36060
tcggagcgga tcagcgagga catctccccg ccaggaacca actcaacgga gcctatgctg 36120
attataatac gcatattcgg ggctatgcta accagcacgg cccccaaata ggcgtactgc 36180
ataggcggcg acaaaaagtg aacagtttgg gttaaaaaat caggcaaaca ctcgcgcaaa 36240
aaagcaagaa catcataacc atgctcatgc aaatagatgc aagtaagctc aggaacgacc 36300
acagaaaaat gcacaatttt tctctcaaac atgactgcga gccctgcaaa aataaaaaag 36360
aaacattaca caagagtagc ctgtcttaca atgggataga ctactctaac caacataaga 36420
cgggccacaa catcgcccgc gtggccataa aaaaaattat ccgtgtgatt aaaaagaagc 36480
acagatagct ggccagtcat atccggagtc atcacgtgcg aacccgtgta gacccccggg 36540
ttggacacat cggccaaaca aagaaagcgg ccaatgtatc ccggaggaat gataacacta 36600
agacgaagat acaacagaat aaccccatgg gggggaataa caaagttagt aggtgaataa 36660
aaacgataaa cacccgaaac tccctcctgc gtaggcaaaa tagcgccctc cccttccaaa 36720
acaacatata gcgcttccac agcagccatg acaaaagact caaaacactc aaaagactca 36780
gtcttaccag gaaaataaaa gcactctcac agcaccagca ctaatcagag tgtgaaaaag 36840
gccaagtgcc gaacgagtat atataggaat taaaaatgac gtaaatgtgt aaaggtcaga 36900
aaacgcccag aaaaatacac agaccaacgc ccgaaacgaa aacccgcgaa aaaataccca 36960
gaagttcctc aacaaccgcc acttccgctt tcccacgaga cgtcacttcc tcaaaaatag 37020
caaactacat ttcccacata tacaaaacca aaacccctcc ccttgtcacc gcccacaact 37080
tacatcatca caaacgtcaa agcctacgtc acccgccccg cccacctcat tatcatattg 37140
gccacaatcc aaaataaggt atattattga tgatg 37175
<210> 9
<211> 37190
<212> DNA
<213> Great Ape Adenovirus
<400> 9
catcatcaat aatatacctt attttggatt gaggccaata tgataatgag gtgggcgggg 60
cgaggcgggg cgggtgacgt aggacgcgcg agtagggttg ggaggtgtgg cggaagtgtg 120
gcatttgcaa gtgggaggag ctgacatgca atcttccgtc gcggaaaatg tgacgttttt 180
gatgagcgcc gcctacctcc ggaagtgcca attttcgcgc gcttttcacc ggatatcgta 240
gtaattttgg gcgggaccat gtaagatttg gccattttcg cgcgaaaagt gaaacgggga 300
agtgaaaact gaataatagg gcgttagtca tagcgcgtaa tatttaccga gggccgaggg 360
actttgaccg attacgtgga ggactcgccc aggtgttttt tacgtgaatt tccgcgttcc 420
gggtcaaagt ctccgttttt attgtcgccg tcatctgacg cggagggtat ttaaacccgc 480
tgcgctccta aagaggccac tcttgagtgc cagcgagaag agttttctcc tccgctccgt 540
ttcggcgatc gaaaaatgag acatttagcc tgcactccgg gtcttttgtc cggccgggcg 600
gcgtccgagc ttttggacgc tttgctcaat gaggttctga gcgatgattt tccgtctact 660
acccacttta gcccacctac tcttcacgaa ctgtacgatc tggatgtact ggtggatgtg 720
aacgatccca acgaggaggc ggtttctacg ttttttcccg agtctgcgct tttggctgcc 780
caggagggat ttgacctaca cactccgccg ctgcctattt tagagtctcc gctgccggag 840
cccagtggta taccttatat gcctgaactg cttcccgaag tggtagacct gacctgccac 900
gagccgggct ttccgcccag cgacgatgag ggtgagcctt ttgctttaga ctatgctgag 960
atacctgggc tcggttgcag gtcttgtgca tatcatcaga gggttaccgg agaccccgag 1020
gttaagtgtt cgctgtgcta tatgaggctg acctcttcct ttatctacag taagtttttt 1080
tgtgtaggtg ggctttttgg gtaggtgggt tttgtggcag gacaggtgta aatgttgctt 1140
gtgttttttg tacctgcagg tccggtgtcc gagccagacc cggagcccga ccgcgatccc 1200
gagccggatc ccgagcctcc tcgcaggcca aggaaattac cttccatttt gtgcaagcct 1260
aagacacctg tgaggaccag cgaggcggac agcactgact ctggcacttc tacctctcct 1320
cctgaaattc acccagtggt tcctctgggt atacatagac ctgttgctgt tagagtttgc 1380
gggcgacgcc ctgcagtaga gtgcattgag gacttgctta acgatcccga gggacctttg 1440
gacttgagca ttaaacgccc taggcaataa accccaccta agtaataaac cccacctaag 1500
taataaactt taccgccctt ggttattgag atgacgccca atgtttgctt ttgaatgact 1560
tcatgtgtat aataaaagtg agtgtggtca taggtctctt gtttgtctgg gcggggttta 1620
agggtatata agtttctcgg ggctaaactt ggttacactt gaccccaatg gaggcgtggg 1680
ggtgcttgga ggagtttgcg gacgtgcgcc gtttgctgga cgagagctct agcaatacct 1740
atagtatttg gaggtatctg tggggctcta ctcaggccaa gttggtcttc agaattaagc 1800
aggattacaa gtgcgatttt gaagagcttt ttagttcctg tggtgagctt ttgcaatcct 1860
tgaatctggg ccaccaggct atcttccagg aaaaggttct ctcgactttg gatttttcca 1920
ctcccgggcg caccgccgct tgtgtggctt ttgtgtcttt tgtgcaagat aaatggagcg 1980
gggagaccca cctgagtcac ggctacgtgc tggatttcat ggcgatggct ctttggaggg 2040
cttacaacaa atggaagatt cagaaggaac tgtacggttc cgccctacgt cgtccacttc 2100
tgcagcggca ggggctgatg tttcccgacc atcgccagca tcagaatctg gaagacgagc 2160
gagcggagaa gatcagcttg agagccggcc tggaccctcc tcaggaggaa tgaatctccc 2220
gcaggtggtt gagctgtttc ccgaactgag acgggtcctg actatcaggg aggatggtca 2280
gtttgtgaag aagctgaaga gggatcgggg tgagggagat gatgaggcgg ctagcaattt 2340
agcttttagt ctgataactc gccaccgacc ggaatgtatt acctatcagc agattaagga 2400
gagttgtgcc aacgagctgg atcttttggg tcagaagtat agcatagaac agcttaccac 2460
ttactggctt cagcccgggg atgattggga agaggcgatt agggtgtatg caaaggtggc 2520
cctgcggccc gattgcaagt ataagattac taagttggtt aatattagaa actgctgcta 2580
tatttctgga aacggggccg aagtggagat agatactgag gacagggtgg ctattaggtg 2640
ttgcatgata aacatgtggc ccgggatact ggggatggat ggggtgatat ttatgaatgt 2700
gaggttcacg ggccccaact ttaatggtac ggtgttcatg ggcaacacca acttgctcct 2760
gcatggtgcg agtttctatg ggtttaacaa cacctgtata gaggcctgga ccgatgtaaa 2820
ggttcgaggt tgttcctttt atagctgttg gaaggcggtg gtgtgtcgcc ctaaaagcag 2880
gggttctgtg aagaaatgct tgtttgaaag gtgcacccta ggtatccttt ctgagggcaa 2940
ctccagggtg cgccataatg tggcttcgaa ctgcggttgc ttcatgcaag tgaagggggt 3000
gagcgttatc aagcataact cggtctgtgg aaactgcgag gatcgcgcct ctcagatgct 3060
gacctgcttt gatggcaact gtcacctgtt gaagaccatt catataagca gtcaccccag 3120
aaaggcctgg cccgtgtttg agcataacat tctgacccgc tgttccttgc atctgggggt 3180
caggaggggt atgttcctgc cttaccagtg taactttagc cacactaaaa tcctgctgga 3240
acccgagtgc atgactaagg tcagcctgaa tggtgtgttt gatgtgagtc tgaagatttg 3300
gaaggtgctg aggtatgatg agaccaggac caggtgccga ccctgcgagt gcggcggcaa 3360
gcacatgaga aatcagcctg tgatgttgga tgtgaccgag gagcttaggc ctgaccatct 3420
ggtgctggcc tgcaccaggg ccgagtttgg gtctagcgat gaggataccg attgaggtgg 3480
gtaaggtggg cgtggctagc agggtgggcg tgtataaatt gggggtctaa ggggtctctc 3540
tgtttgtctt gcaacagccg ccgccatgag cgacaccggc aacagctttg atggaagcat 3600
ctttagtccc tatctgacag tgcgcatgcc tcactgggcc ggagtgcgtc agaatgtgat 3660
gggttccaac gtggatggac gtcccgttct gccttcaaat tcgtctacta tggcctacgc 3720
gaccgtggga ggaactccgc tggacgccgc gacctccgcc gccgcctccg ccgccgccgc 3780
gaccgcgcgc agcatggcta cggaccttta cagctctttg gtggcgagca gcgcggcctc 3840
tcgcgcgtct gctcgggatg agaaactgac tgctctgctg cttaaactgg aagacttgac 3900
ccgggagctg ggtcaactga cccagcaggt ttccagcttg cgtgagagca gccttgcctc 3960
cccctaatgg cccataatat aaataaaagc cagtctgttt ggattaagca agtgtatgtt 4020
ctttatttaa ctctccgcgc gcggtaagcc cgggaccagc ggtctcggtc gtttagggtg 4080
cggtggattt tttccaacac gtggtacagg tggctctgga tgtttagata catgggcatg 4140
agtccatccc tggggtggag gtagcaccac tgcagagctt cgtgctcggg ggtggtgttg 4200
tatatgatcc agtcgtagca ggagcgctgg gcgtggtgct gaaaaatgtc cttaagcaag 4260
aggcttatag ctagggggag gcccttggtg taagtgttta caaatctgct tagctgggag 4320
gggtgcatcc ggggggatat gatgtgcatc ttggactgga tttttaggtt ggctatgttc 4380
ccgcccagat cccttctggg attcatgttg tgcaggacca ccagcacggt atatccagtg 4440
cacttgggaa atttatcgtg gagcttagac gggaatgcat ggaagaactt ggagacgccc 4500
ttgtggcctc ccagattttc catacattcg tccatgatga tggcaatggg cccgtgggaa 4560
gctgcctgag caaaaacgtt tctggcatcg ctcacatcgt agttatgttc cagggtgagg 4620
tcatcatagg acatctttac gaatcggggg cgaagggtcc cggactgggg gatgatggta 4680
ccctcgggcc ccggggcgta gttcccctca cagatctgca tctcccaggc tttcatttca 4740
gagggaggga tcatatccac ctgcggggcg atgaaaaaga cagtttctgg cgcaggggag 4800
attaactggg atgagagcag gtttctgagc agctgtgact ttccacagcc ggtgggccca 4860
tatatcacgc ctatcaccgg ctgcagctgg tagttaagag agctgcagct gccgtcctcc 4920
cggagcaggg gggccacctc gttgagcata tccctgacgt ggatgttctc cctgaccagt 4980
tccgccagaa ggcgctcgcc gcccagcgaa agcagctctt gcaaggaagc aaaatttttc 5040
agcggtttca ggccatcggc cgtgggcatg tttttcagcg tctgggtcag cagctccagc 5100
ctgtcccaga gctcggtgat gtgctctacg gcatctcgat ccagcagatc tcctcgtttc 5160
gcgggttggg gcggctttcg ctgtagggca ccagccgatg ggcgtccagc ggggccagag 5220
tcatgtcctt ccatgggcgc agggtcctcg tcagggtggt ctgggtcacg gtgaaggggt 5280
gcgctccggg ttgggcactg gccagggtgc gcttgaggct ggttctgctg gtgctgaatc 5340
gctgccgctc ttcgccctgc gcgtcggcca ggtagcattt gaccatggtc tcgtagtcga 5400
gaccctcggc ggcgtgcccc ttggcgcgga gctttccctt ggaggtggcg ccgcacgagg 5460
ggcactgcag gctcttcagg gcgtagagct tgggagcgag aaacacggac tctggggagt 5520
aggcgtccgc gccgcaggcc gagcagaccg tctcgcattc caccagccaa gtgagttccg 5580
ggcggtcagg gtcaaaaacc aggttgcccc catgcttttt gatgcgtttc ttaccttggc 5640
tctccatgag gcggtgtccc ttctcggtga cgaagaggct gtccgtgtcc ccgtagaccg 5700
acttcagggg cctgtcttcc agcggagtgc ctctgtcctc ctcgtagaga aactctgacc 5760
actctgagac gaaggcccgc gtccaggcca ggacgaagga ggccacgtgg gaggggtagc 5820
ggtcgttgtc cactagcggg tccaccttct ccagggtgtg caggcacatg tccccctcct 5880
ccgcgtccag aaaagtgatt ggcttgtagg tgtaggacac gtgaccgggg gttcccaacg 5940
ggggggtata aaagggggtg ggtgcccttt catcttcact ctcttccgca tcgctgtctg 6000
cgagagccag ctgctggggt aagtattccc tctcgaaggc gggcatgacc tcagcgctca 6060
ggttgtcagt ttctaaaaat gaggaggatt tgatgttcac ctgtccggag gtgatacctt 6120
tgagggtacc tgggtccatc tggtcagaaa acactatttt tttgttatca agcttggtgg 6180
cgaatgaccc gtagagggcg ttggagagca gcttggcgat ggagcgcagg gtctggtttt 6240
tgtcgcggtc ggctcgctcc ttggccgcga tgttgagttg cacgtactcg cgggccacgc 6300
acttccactc ggggaacacg gtggtgcgct cgtctgggat caggcgcacc ctccagccgc 6360
ggttgtgcag ggtgaccatg tcgacgctgg tggcgacctc accgcgcaga cgctcgttgg 6420
tccagcagag gcggccgccc ttgcgcgagc agaagggggg tagggggtcc agctggtcct 6480
cgtttggggg gtccgcgtcg atggtaaaga ccccggggag caggcgcggg tcaaagtagt 6540
cgatcttgca agcttgcatg tccagagccc gctgccattc gcgggcggcg agcgcgcgct 6600
cgtaggggtt gaggggcggg ccccagggca tggggtgggt gagcgcggag gcgtacatgc 6660
cgcagatgtc atacacgtac aggggttccc tgaggatacc gaggtaggtg gggtagcagc 6720
gccccccgcg gatgctggcg cgcacgtagt catagagctc gtgggagggg gccagcatgt 6780
tgggcccgag gttggtgcgc tgggggcgct cggcgcggaa gacgatctgc ctgaagatgg 6840
cgtgggagtt ggaggagatg gtgggccgct ggaagacgtt gaagcttgct tcttgcaagc 6900
ccacggagtc cctgacgaag gaggcgtagg actcgcgcag cttgtgcacc agctcggcgg 6960
tgacctggac gtcgagcgca cagtagtcga gggtctcgcg gatgatgtca tacctatcct 7020
cccccttctt tttccacagc tcgcggttga ggacgaactc ttcgcggtct ttccagtact 7080
cttggagggg aaacccgtcc gtgtccgaac ggtaagagcc tagcatgtag aactggttga 7140
cggcctggta ggggcagcag cccttctcca cgggcagcgc gtaggcctgc gccgccttgc 7200
ggagggaggt gtgggtgagg gcgaaagtgt ccctgaccat gactttgagg tattgatgtc 7260
tgaagtctgt gtcatcgcag ccgccctgtt cccacagggt gtagtccgtg cgctttttgg 7320
agcgcgggtt gggcagggag aaggtgaggt cattgaagag gatcttcccc gctcgaggca 7380
tgaagtttct ggtgatgcga aagggccctg ggaccgagga gcggttgttg atgacctggg 7440
cggccaggac gatctcgtca aagccgttta tgttgtgtcc cacgatgtag agctccagga 7500
agcggggctg gcccttgatg gaggggagct ttttaagttc ctcgtaggta agctcctcgg 7560
gcgattccag gccgtgctcc tccagggccc agtcttgcaa gtgagggttg gccgccagga 7620
aggatcgcca gaggtcgcgg gccatgaggg tctgcaggcg gtcgcggaag gttctgaact 7680
gccgccccac ggccattttt tcgggggtga tgcagtagaa ggtgaggggg tctttctccc 7740
aggggtccca tctgagctct cgggcgaggt cgcgcgcggc agcgaccaga gcctcgtcgc 7800
cccccagttt catgaccagc atgaagggca cgagttgctt gccaaaggct cccatccaag 7860
tgtaggtttc tacatcgtag gtgacaaaga ggcgctccgt gcgaggatga gagccgattg 7920
ggaagaactg gatctcccgc caccagttgg aggattggct gttgatgtgg tgaaagtaga 7980
agtcccgtct gcgggccgag cactcgtgct ggcttttgta aaagcgaccg cagtactggc 8040
agcgctgcac gggttgtata tcttgcacga ggtgaacctg gcgacctctg acgaggaagc 8100
gcagcgggaa tctaagtccc ccgcctgggg tcccgtgtgg ctggtggtct tttactttgg 8160
ttgtctggcc gccagcatct gtctcctgga gggcgatggt ggaacagacc accacgccgc 8220
gagagccgca ggtccagatc tcggcgctcg gcgggcggag tttgatgacg acatcgcgca 8280
cattggagct gtccatggtc tccagctccc gcggcggcag gtcagccggg agttcctgga 8340
ggttcacctc gcagagacgg gtcaaggcgc ggacagtgtt gagatggtat ctgatttcaa 8400
ggggcatgtt ggaggcggag tcgatggctt gcaggaggcc gcagccccgg ggggccacga 8460
tggttccccg cggggcgcga ggggaggcgg aagctggggg tgtgttcaga agcggtgacg 8520
cgggcgggcc cccggaggta gggggggttc cggccccaca ggcatgggcg gcaggggcac 8580
gtcttcgccg cgcgcgggca ggggctggtg ctggctccga agagcgcttg cgtgcgcgac 8640
gacgcgacgg ttggtgtcct gtatctggcg cctctgagtg aagaccacgg gtcccgtgac 8700
cttgaacctg aaagagagtt cgacagaatc aatctcggca tcgttgacag cggcctggcg 8760
caggatctcc tgcacgtcgc ccgagttgtc ctggtaggcg atttctgcca tgaactgctc 8820
gatctcttcc tcctggagat ctcctcgtcc ggcgcgctcc acggtggccg ccaggtcgtt 8880
ggagatgcga cccatgagct gcgagaaggc gttgagtccg ccctcgttcc agacccggct 8940
gtagaccacg cccccctcgg cgtcgcgggc gcgcatgacc acctgggcca ggttgagctc 9000
cacgtgtcgc gtgaagacgg cgtagttgcg caggcgctgg aaaaggtagt tcagggtggt 9060
ggcggtgtgc tcggcgacga agaagtacat gacccagcgc cgcaacgtgg attcattgat 9120
gtcccccaag gcctccaggc gctccatggc ctcgtagaag tccacggcga agttgaaaaa 9180
ctgggagttg cgagcggaca cggtcaactc ctcctccaga agacggatga gctcggcgac 9240
agtgtcgcgc acctcgcgct cgaaggccac ggggggcgct tcttcctctt ccacctcttc 9300
ttccatgatt gcttcttctt cttcctcagc cgggacggga gggggcggcg gcgggggagg 9360
ggcgcggcgg cggcggcggc gcaccgggag gcggtcgatg aagcgctcga tcatctcccc 9420
ccgcatgcgg cgcatggtct cggtgacggc gcggccgttc tcccgggggc gcagctcgaa 9480
gacgccgcct ctcatttcgc cgcggggcgg gcggccgtga ggtagcgaga cggcgctgac 9540
tatgcatctt aacaattgct gtgtaggtac gccgccaagg gacctgattg agtccagatc 9600
caccggatcc gaaaaccttt ggaggaaagc gtctatccag tcgcagtcgc aaggtaggct 9660
gagcaccgtg gcgggcgggg gcgggtcggg agagttcctg gcggagatgc tgctgatgat 9720
gtaattaaag taggcggtct tgagaaggcg gatggtggac aggagcacca tgtctttggg 9780
tccggcctgt tggatgcgga ggcggtcggc catgccccag gcctcgttct gacaccggcg 9840
caggtctttg tagtaatctt gcatgagtct ttccaccggc acttcttctc cttcctcttc 9900
ttcatctcgc cggtggtttc tcgcgccgcc catgcgcgtg accccaaagc ccctgagcgg 9960
ctgcagcagg gccaggtcgg cgaccacgcg ctcggccaag atggcctgct gtacctgagt 10020
gagggtcctc tcgaagtcat ccatgtccac gaagcggtgg taggcacccg tgttgatggt 10080
gtaggtgcag ttggccatga cggaccagtt gacggtctgg tgtcccggct gcgagagctc 10140
cgtgtaccgc aggcgcgaga aggcgcggga atcgaacacg tagtcgttgc aagtccgcac 10200
cagatactgg tagcccacca ggaagtgcgg cggaggttgg cgatagaggg gccagcgctg 10260
ggtggcgggg gcgccgggcg ccaggtcttc cagcatgagg cggtggtatc cgtagatgta 10320
cctggacatc caggtgatgc ctgcggcggt ggtggtggcg cgcgcgtagt cgcggacccg 10380
gttccagatg tttcgcaggg gcgagaagtg ttccatggtc ggcacgctct ggccggtgag 10440
gcgcgcgcag tcgttgacgc tctatacaca cacaaaaacg aaagcgttta cagggctttc 10500
gttctgtagc ctggaggaaa gtaaatgggt tgggttgcgg tgtgccccgg ttcgagacca 10560
agctgagctc agccggctga agccgcagct aacgtggtat tggcagtccc gtctcgaccc 10620
aggccctgta tcctccagga tacggtcgag agcccttttg ctttcttggc caagcgcccg 10680
tggcgcgatc tgggatagat ggtcgcgatg agaggacaaa agcggctcgc ttccgtagtc 10740
tggagaaaca atcgccaggg ttgcgttgcg gcgtaccccg gttcgagccc ctatggcggc 10800
ttggatcggc cggaaccgcg gctaacgtgg gctgtggcag ccccgtcctc aggaccccgc 10860
cagccgactt ctccagttac gggagcgagc cccttttgtt tttttatttt ttagatgcat 10920
cccgtgctgc ggcagatgcg cccctcgccc cggcccgatc agcagcagca acagcaggca 10980
tgcagacccc cctctcctct ccccgccccg gtcaccacgg ccgcggcggc cgtgtccggt 11040
gcggggggcg cgctggagtc agatgagcca ccgcggcggc gacctaggca gtatctggac 11100
ttggaagagg gcgagggact ggcgcggctg ggggcgagct ctccagagcg ccacccgcgg 11160
gtgcagttga aaagggacgc gcgtgaggcg tacctgccgc ggcaaaacct gtttcgcgac 11220
cgcgggggcg aggagcccga ggagatgcgg gactgcaggt tccaagcggg gcgcgagctg 11280
cgccgcggct tggacagaca gcgcctgctg cgcgaggagg actttgagcc cgacacgcag 11340
acgggcatca gccccgcgcg cgcgcacgtg gccgcggccg acctggtgac cgcctacgag 11400
cagacggtga accaggagcg caacttccaa aaaagcttca acaaccacgt gcgcacgctg 11460
gtggcgcgcg aggaggtgac cctgggtctc atgcatctgt gggacctggt ggaggcgatc 11520
gtgcagaacc ccagcagcaa gcccctgacc gcgcagctgt tcctggtggt gcagcacagc 11580
agggacaacg aggccttcag ggaggcgctg ctgaacatca ccgagccgga ggggcgctgg 11640
ctcctggacc tgataaacat cctgcagagc atagtggtgc aggagcgcag cctgagcctg 11700
gccgagaagg tggcggccat taactattct atgctgagcc tgggcaagtt ctacgctcgc 11760
aagatctaca agacccccta cgtgcccata gacaaggagg tgaagataga cagcttctac 11820
atgcgcatgg cgctgaaggt gctaaccctg agcgacgacc tgggagtgta ccgcaacgag 11880
cgcatccaca aggccgtgag cgccagccgg cggcgcgagc tgagcgaccg cgaactgatg 11940
cacagtctgc agcgcgcgct gaccggcgcg ggcgagggcg acagggaggt cgagtcctac 12000
tttgacatgg gggccgacct gcactggcag ccgagccgcc gcgccctgga agcggcgggg 12060
gcgtacggcg gccccctggc ggccgatgac gaggaagagg aggactatga gctagaggag 12120
ggcgagtacc tggaggactg acctggctgg tggtgttttg gtatagatgc aagatccgaa 12180
cgtggcggac ccggcggtcc gggcggcgct gcagagccag ccgtccggca ttaactcctc 12240
tgacgactgg gccgcggcca tgggtcgcat catggccctg accgcgcgca accccgaggc 12300
cttcaggcag cagcctcagg ctaaccggct ggcggccatc ttggaagcgg tagtgcccgc 12360
gcgctccaac cccacccacg agaaggtgct ggccatagtc aacgcgctgg cggagagcag 12420
ggccatccgg gcagacgagg ccggactggt gtacgatgcg ctgctgcagc gggtggcgcg 12480
gtacaacagc ggcaacgtgc agaccaacct ggaccgcctg gtgacggacg tgcgcgaggc 12540
cgtggcgcag cgcgagcgct tgcatcagga cggcaacctg ggctcgctgg tggcgctaaa 12600
cgccttcctt agcacccagc cggccaacgt accgcggggg caggaggact acaccaactt 12660
cttgagcgcg ctgcggctga tggtgaccga ggtccctcag agcgaggtgt accagtcggg 12720
gcccgactac ttcttccaga ccagcagaca gggcttgcaa accgtgaacc tgagccaggc 12780
tttcaagaac ctgcgggggc tgtggggagt gaaggcgccc accggcgacc gggctacggt 12840
gtccagcctg ctaaccccca actcgcgcct gctgctgctg ctgatcgcgc ccttcacgga 12900
cagcgggagc gtctcgcggg agacctatct gggccacctg ctgacgctgt accgcgaggc 12960
catcgggcag gcgcaggtgg acgagcacac cttccaggag atcaccagcg tgagccacgc 13020
gctggggcag gaggacacgg gcagcctgca ggcgaccctg aactacctgc tgaccaacag 13080
gcggcagaag attcccacgc tgcacagcct gacccaggag gaggagcgca tcttgcgcta 13140
cgtgcagcag agcgtgagcc tgaacctgat gcgcgacggc gtgacgccca gcgtggcgct 13200
ggacatgacc gcgcgcaaca tggaaccggg catgtacgct tcccagcggc cgttcatcaa 13260
ccgcctgatg gactacttgc atcgggcggc ggccgtgaac cccgagtact tcaccaatgc 13320
cattctgaat ccccactgga tgccccctcc gggtttctac aacggggact tcgaggtgcc 13380
tgaggtcaac gatgggttcc tctgggatga catggatgac agtgtgttct cccccaaccc 13440
gctgcgcgcc gcgtctctgc gattgaagga gggctctgac agggaaggac caaggagtct 13500
ggcctcctcc ctggctctgg gggcggtggg cgccacgggc gcggcggcgc ggggcagcag 13560
ccccttcccc agcctggcgg actctctgaa tagcgggcgg gtgagcaggc cccgcttgct 13620
aggcgaggag gagtatctga acaactccct gctgcagccc gtgagggaca aaaacgctca 13680
gcggcagcag tttcccaaca atgggataga gagcctggtg gacaagatgt ccagatggaa 13740
gacgtatgcg caggagtaca aggagtggga ggaccgccag ccgcggcccc tgccgccccc 13800
tagacagcgc tggcagcggc gcgcgtccaa ccgccgctgg aggcaggggc ccgaggacga 13860
tgatgactct gcagatgaca gcagcgtgtt ggacctgggc gggagcggga accccttttc 13920
gcacctgcgc ccacgcctgg gcaagatgtt ttaaaagaga aaaataaaaa ctcaccaagg 13980
ccatggcgac gagcgttggt tttttgttcc cttccttagt atgcggcgcg cggcgatgtt 14040
cgaggagggg cctcccccct cttacgagag cgcgatggga atttctcctg cggcgcccct 14100
gcagcctccc tacgtgcctc ctcggtacct gcaacctaca ggggggagaa atagcatctg 14160
ttactctgag ctgcagcccc tgtacgatac caccagactg tacctggtgg acaacaagtc 14220
cgcggacgtg gcctccctga actaccagaa cgaccacagc gattttttga ccacggtgat 14280
ccaaaacaac gacttcaccc caaccgaggc cagtacccag accataaacc tggacaacag 14340
gtcgaactgg ggcggcgacc tgaagactat cctgcacacc aatatgccca acgtgaacga 14400
gttcatgttc accaactctt ttaaggcgcg ggtgatggtg gcgcgcgagc agggggaggc 14460
gaagtacgag tgggtggact tcacgctgcc cgagggcaac tactcagaga ccatgactct 14520
cgacctgatg aacaatgcga tcgtggaaca ctatctgaaa gtgggcaggc agaacggggt 14580
gaaggagagc gatatcgggg tcaagtttga caccagaaac ttccgtctgg gctgggaccc 14640
tgtgaccggg ctggtcatgc cgggggtcta caccaacgag gcctttcatc ccgatatagt 14700
gctcctgccc ggctgtgggg tggacttcac ccagagccgg ctgagcaacc tgctgggcgt 14760
tcgcaagcgg caacctttcc aggagggttt caagatcacc tatgaggatc tggagggggg 14820
caacattccc gcgctccttg atctggacgc ctacgaggag agcttgaaac ccgaggagag 14880
cgctggcgac agcggcgaga gtggcgagga gcaagccggc ggcggcggca gcgcgtcggt 14940
agaaaacgaa agtactcccg cagtggcggc ggacgctgcg gaggtcgagc cggaggccat 15000
gcagcaggac gcagaggagg gcgcgcagga ggacatgaac aatggggaga tcaggggcga 15060
cactttcgcc acccggggcg aagaaaaaga ggcagaggcg gcggcggcga cggcggaagc 15120
cgaaaccgag gcagaggcag agcccgagac cgaagttatg gaagacatga atgatggaga 15180
acgtaggggt gacacgtttg ccacccgggg cgaagagaag gcggcggagg cagaagccgc 15240
ggctgaggag gcggctgcgg ctgcggccaa ggctgaggct gcggctgagg ctaaggtcga 15300
agccgatgtt gcggttgagg ctcaggctga ggaggaggcg gcggctgaag cagttaagga 15360
aaaggcccag gcagagcagg aagagaaaaa acctgtcatt caacctctaa aagaagatag 15420
caaaaagcgc agttacaacg tcattgaggg cagcaccttt acccaatacc gcagctggta 15480
cctggcttac aactacggcg acccggtcaa gggggtgcgc tcgtggaccc tgctctgcac 15540
gccggacgtc acctgcggct ccgagcagat gtactggtcg ctgccaaaca tgatgcaaga 15600
cccggtgacc ttccgttcca cgcggcaggt tagcaacttt ccggtggtgg gcgccgaact 15660
gctgccagta cactccaaga gtttttacaa cgagcaggcc gtctactccc agctgatccg 15720
ccaggccacc tctctgaccc acgtgttcaa tcgctttccc gagaaccaga ttttggcgcg 15780
cccgccggcc cccaccatca ccaccgtcag tgaaaacgtt cctgccctca cagatcacgg 15840
gacgctaccg ctgcgcaaca gcatctcagg agtccagcga gtgaccatta ctgacgccag 15900
acgccggacc tgcccctacg tttacaaggc cttgggcata gtctcgccgc gcgtcctctc 15960
cagtcgcact ttttaaaaca catccaccca cacgctccaa aatcatgtcc gtactcatct 16020
cgcccagcaa caacaccggc tgggggctgc gcgcacccag caagatgttt ggaggggcaa 16080
ggaagcgctc cgaccagcac cccgtgcgcg tgcgcggcca ctaccgcgcg ccctggggtg 16140
cgcacaagcg cgggcgcaca gggcgcacca ctgtggatga tgtcattgac tccgtagtgg 16200
agcaggcgcg ccactacaca cccggcgcgc cgaccgcctc cgccgtgtcc accgtggacc 16260
aggcgatcga aagcgtggta cagggggcgc ggcactatgc caaccttaaa agtcgccgcc 16320
gccgcgtggc gcgccgccat cgccggagac cccgggctac tgccgccgcg cgccttacca 16380
aggctctgct caagcgcgcc aggcgaactg gccaccgggc cgccatgagg gccgcacggc 16440
gggctgccgc tgccgcgagc gccgtggccc cgcgggcacg aaggcgcgcg gccgctgccg 16500
ccgccgccgc catttccagc ttggcctcga cgcggcgcgg taacatatac tgggtgcgcg 16560
actcggtgag cggcacacgt gtgcccgtgc gctttcgccc cccacggaat tagcacaaga 16620
caacatacac actgagtctc ctgctgttgt gtatcccagc ggcgaccgtc agcagcggcg 16680
acatgtccaa gcgcaaaatt aaagaagaga tgctccaggt catcgcgccg gagatctatg 16740
ggcccccgaa gaaggaggag gaggattaca agccccgcaa gctaaagcgg gtcaaaaaga 16800
aaaagaaaga tgatgacgtt gacgaggcgg tggagtttgt ccgccgcatg gcgcccaggc 16860
gccctgtgca gtggaagggt cggcgcgtgc agcgagtcct gcgccccggc accgcggtgg 16920
tctttacgcc cggcgagcgt tccacgcgca ctttcaagcg ggtgtacgat gaggtgtacg 16980
gcgacgagga tctgttggag caggccaacc atcgatttgg ggagtttgca tatgggaaac 17040
ggcctcgcga gagtctaaaa gaggacctgc tggcgctacc gctggacgag ggcaatccca 17100
ccccgagtct gaagccggtg accctgcaac aggtgctgcc tttgagcgcg cccagcgagc 17160
agaagcgagg gttaaagcgc gagggcgggg acctggcacc caccgtgcag ttgatggtgc 17220
ccaagcggca gaagctggag gacgtgctgg agaaaatgaa agtagagccc gggatccagc 17280
ccgagatcaa ggtccgccct atcaagcagg tggcgcccgg cgtgggagtc cagaccgtgg 17340
acgttaggat tcccacggag gagatggaaa cccaaaccgc cactccctct tcggcagcaa 17400
gcgccaccac cggcgccgct tcggtagagg tgcagacgga cccctggcta cccgccgcca 17460
ctatcgccgt cgccgccgcc ccccgttcgc gcggacgcaa gagaaattat ccagcggcca 17520
gcgcgcttat gccccagtat gcgctgcatc catccatcgc gcccaccccc ggctaccgcg 17580
ggtactcgta ccgcccgcgc agatcagccg gcactcgcgg ccgccgccgc cgtgcgacca 17640
caaccagccg ccgccgtcgc cgccgccgcc agccagtgct gacccccgtg tctgtaagga 17700
aggtggctcg ctcggggagc acgctggtgg tgcccagagc gcgctaccac cccagcatcg 17760
tttaaagccg gtctctgtat ggttcttgca gatatggccc tcacttgtcg ccttcgcttc 17820
ccggtgccgg gataccgagg aagaactcac cgccgcaggg gcatggcggg cagcggtctc 17880
cgcggcggcc gtcgccatcg ccggcgcgca aagagcaggc gcatgcgcgg cggtgtgttg 17940
cccctgctgg tcccgctact cgccgcggcg atcggcgccg tgcccgggat cgcctccgtg 18000
gccctgcagg cgtcccagaa acattgactc ttgcaacctt gcaagcttgc atttttggag 18060
gaaaaaataa aaaagtctag actctcacgc tcgcttggtc ctgtgactat tttgtagaaa 18120
aaagatggaa gacatcaact ttgcgtcgct ggccccgcgt cacggctcgc gcccgttcat 18180
gggagactgg acagatatcg gcaccagcaa tatgagcggt ggcgccttca gctggggcag 18240
tctgtggagc ggccttaaaa attttggttc caccattaag aactatggca acaaagcgtg 18300
gaacagcagc acgggtcaga tgctgagaga caagttgaaa gagcagaact tccaggagaa 18360
ggtggcgcag ggcctggcct ctggcatcag cggggtggtg gacatagcta accaggccgt 18420
gcagaaaaag ataaacagtc atctggaccc ccgccctcag gtggaggaaa cgcctccagc 18480
catggagacg gtgtctcccg agggcaaagg cgaaaagcgc ccgcggcccg acagggaaga 18540
gaccctggtg tcacacaccg aggagccgcc ctcttacgag gaggcagtca aggccggcct 18600
gcccaccact cgccccatag ctcccatggc caccggtgtg gtgggtcaca ggcaacacac 18660
ccccgcaaca ctagatctgc ccccgccgtc cgagccgact cgccagccaa aggcggtgac 18720
ggtgtccgct ccctccactt ccgccgccaa cagagtgcct ctgcgccgcg ctgcgagcgg 18780
cccccgggcc tcgcgagtca gcggcaactg gcagagcaca ctgaacagca tcgtgggcct 18840
gggagtgagg agtgtgaagc gccgccgttg ctactgaatg agcaagctag ctaacgtgtt 18900
gtatgtgtgt atgcgtccta tgtcgccgcc agaggagctg ttgagccgcc ggcgccgtct 18960
gcactccagc gaatttcaag atggcgaccc catcgatgat gcctcagtgg tcgtacatgc 19020
acatctcggg ccaggacgct tcggagtacc tgagccccgg gctggtgcag ttcgcccgcg 19080
ccacagacac ctacttcaac atgagtaaca agttcaggaa ccccactgtg gcgcccaccc 19140
acgatgtgac cacggaccgg tcgcagcgcc tgacgctgcg gttcatcccc gtggatcggg 19200
aggacaccgc ttactcttac aaggcgcggt tcacgctggc cgtgggcgac aaccgcgtgc 19260
tggacatggc ctccacttac tttgacatcc ggggggtgct ggacaggggc cccactttta 19320
agccctactc gggcactgcc tacaaccccc tggcccccaa gggcgccccc aattcttgtg 19380
agtgggaaca agaggaaact caggcggccg aggaagctgt tgacgaggaa gatgcagaag 19440
atgaagcgca accacaagag gaagcccctg ttaaaaaaat tcatgtatat gctcaggcgc 19500
ctcttgctgg cgaaaagatt accaaggatg gtttgcaaat aggtactgaa gtcgtaggag 19560
atacatctaa ggacactttt gcagataaaa cattccaacc cgaacctcag ataggcgagt 19620
ctcagtggaa cgaggctgat gccgcagtag caggaggtag agttttgaaa aagactaccc 19680
ctatgagacc ttgctatgga tcctatgcca ggcctaccaa tgccaacggg ggtcaaggaa 19740
ttatggttgc caatgaaaaa ggagtgttgc agtctaaagt agaaatgcaa tttttctcta 19800
acacctcaac ccttaatgcg cgggatggaa ccggcaatcc cgaaccaaag gtggtgttgt 19860
acagcgaaga tgtccacttg gaatctcccg atactcatct gtcttacaag cccacaaagg 19920
atgatgttaa tgccaaagtc atgttgggtc agcaagccat gcccaacaga cccaacctca 19980
ttggatttag agataatttc attgggctta tgttttacaa cagcaccggt aacatgggag 20040
tgctggcggg tcaggcctct cagttgaatg ctgtggtgga cttgcaggat agaaacacag 20100
aactgtcata tcagcttatg cttgattcaa ttggggatag aaccagatac ttctccatgt 20160
ggaaccaggc agtggatagc tatgatccag atgtcagaat tattgaaaac catggggttg 20220
aggatgaact gcccaactac tgcttccctt tgggcggcat aggaattact gatacttatc 20280
aaggggtgaa aaataccaat ggcaatggtc agtggaccaa agatgatcag ttcgcggacc 20340
gcaacgaaat aggggtggga aacaacttcg ccatggagat caacatccag gccaaccttt 20400
ggagaaactt cctctatgca aacgtggggc tctacctgcc agacaagctc aagtacaacc 20460
ccaccaacgt ggacatctct gacaacccca acacctatga ctacatgaac aagcgggtgg 20520
tggcccctgg cctggtggac tgctttgtca atgtgggagc caggtggtcc ctggactaca 20580
tggacaacgt caaccccttc aaccaccacc gcaatgcggg tctgcgctac cgctccatga 20640
tcctgggcaa cgggcgctat gtgccctttc acatccaggt accccagaag ttctttgcca 20700
tcaagaacct cctgctcctg cccggctcct acacctacga gtggaacttc aggaaggatg 20760
tgaacatggt cctacagagc tctctgggca atgaccttag ggtggatggg gccagcatca 20820
agtttgacag catcaccctc tatgctacat ttttccccat ggcccacaac accgcctcca 20880
cgcttgaggc catgctgaga aacgacacca acgaccagtc ctttaatgac tacctctctg 20940
gggccaacat gctctaccca atcccagcca aggccaccaa cgtgcccatc tccatcccct 21000
ctcgcaactg ggccgccttt agaggctggg cctttacccg ccttaagacc aaggagaccc 21060
cctccctggg ctcgggtttt gatccctact ttgtttactc gggatccatc ccctacctgg 21120
atggcacctt ctacctcaac cacactttca agaagatatc catcatgtat gactcctccg 21180
tcagctggcc gggcaacgac cgcttgctca cccccaatga gttcgaggtc aagcgcgccg 21240
tggacggcga gggctacaac gtggcccagt gcaacatgac caaggactgg ttcctggtgc 21300
agatgctggc caactacaac ataggctacc agggctttta catcccagag agctacaagg 21360
acaggatgta ctccttcttc agaaatttcc aacccatgag ccgacaggtg gtggacgaga 21420
ccaattacaa ggactatcaa gccattggca tcacccacca gcacaacaac tcgggtttcg 21480
tgggctacct ggcgcccacc atgcgcgagg gtcaggccta ccccgccaac ttcccctacc 21540
ccttgatagg caagaccgcg gtcgacagcg tcacccagaa aaagttcctc tgcgaccgca 21600
ccctctggcg catccccttc tctagcaact tcatgtccat gggtgcgctc acggacctgg 21660
gccaaaacct gctttatgcc aactctgccc atgcgctgga catgactttt gaggtggacc 21720
ccatggacga gcccaccctt ctctatattg tgtttgaagt gttcgacgtg gtcagagtgc 21780
accagccgca ccgcggtgtc atcgagaccg tgtacctgcg tacgcccttc tcagccggca 21840
acgccaccac ctaaggagac agcgccgccg ccgcctgcat gacgggttcc accgagcaag 21900
agctcagggc cattgccaga gacctgggat gcggacccta ttttttgggc acctatgaca 21960
aacgcttccc gggctttatc tcccgagaca agctcgcctg cgccattgtc aacacggccg 22020
cgcgcgagac cgggggcgtg cactggctgg cctttggctg ggacccgcgc tccaaaactt 22080
gctacctctt tgaccccttt ggcttctccg atcagcgcct caggcagatt tatgagtttg 22140
agtacgaggg gctgctgcgc cgcagcgcgc tcgcctcctc gcccgaccgc tgcatcaccc 22200
ttgagaagtc caccgaaacc gtgcaggggc cccactcggc cgcctgcggt ctcttctgtt 22260
gcatgttttt gcacgccttt gtgcactggc ctcagagtcc catggattgc aaccccacca 22320
tgaacttgct aaagggagtg cccaacgcca tgctccagag cccccaggtc cagcccaccc 22380
tgcgccgcaa ccaggaacag ctttaccgct tcctggagcg ccactccccc tacttccgca 22440
gccacagcgc gcgcatccgg ggggccacct ctttttgcca cttgcaagaa aacatgcaag 22500
acggaaaatg atgtacagca tgcttttaat aaatgtaaag actgtgcact ttaattatac 22560
acgggctctt tctggttatt tattcaacac cgccgtcgcc atttagaaat cgaaagggtt 22620
ctgccgtgcg tcgccgtgcg ccacgggcag agacacgttg cgatactgga agcggctcgc 22680
ccacttgaac tcgggcacca ccatgcgggg cagtggttcc tcggggaagt tctcgctcca 22740
cagggtgcgg gtcagctgca gcgcgctcag gaggtcggga gccgagatct tgaagtcgca 22800
gttggggccg gaaccctgcg cgcgcgagtt gcggtacacg gggttgcagc actggaacac 22860
cagcagggcc ggattattca cgctggccag caggctctcg tcgctgatca tgtcgctgtc 22920
cagatcctcc gcgttgctca gggcgaatgg ggtcatcttg cagacctgcc tgcccaggaa 22980
aggcgggagc ccaggcttgc cgttgcagtc gcagcgcagg ggcattagca ggtgcccacg 23040
gcccgactgc gcctgcgggt acaacgcgcg catgaaggct tcgatctgcc taaaagccac 23100
ctgggtcttg gctccctccg aaaagaacat cccacaggac ttgctggaga actggttcgc 23160
gggacagctg gcatcgtgca ggcagcagcg cgcgtcagtg ttggcaatct gcaccacgtt 23220
gcgaccccac cggtttttca ctatcttggc cttggaagcc tgctccttta gcgcgcgctg 23280
gccgttctcg ctggtcacat ccatctctat cacctgttcc ttgttgatca tgtttgtccc 23340
gtgcagacac tttaggtcgc cctccgtctg ggtgcagcgg tgctcccaca gcgcgcaacc 23400
ggtgggctcc caattcttgt gggtcacccc cgcgtaggcc tgcaggtagg cctgcaggaa 23460
gcgccccatc atggtcataa aggtcttctg gctcgtaaag gtcagctgca ggccgcgatg 23520
ctcttcgttc agccaggtct tgcagatggc ggccagcgcc tcggtctgct cgggcagcat 23580
cttaaaattt gtcttcaggt cgttatccac gtggtacttg tccatcatgg cacgcgccgc 23640
ctccatgccc ttctcccagg cggacaccat gggcaggctt agggggttta tcacttccag 23700
cggcgaggac accgtacttt cgatttcttc ttcctccccc tcttcccggc gcgcgccccc 23760
gctgttgcgc gctcttaccg cctgcaccaa ggggtcgtct tcaggcaagc gccgcaccga 23820
gcgcttgccg cccttgacct gcttgatcag taccggcggg ttgctgaagc ccaccatggt 23880
cagcgccgcc tgctcttctt cgtcttcgct gtctaccact atttctgggg aggggcttct 23940
ccgctctgcg gcaaaggcgg cggatcgctt cttttttttc ttgggagccg ccgcgatgga 24000
gtccgccacg gcgaccgagg tcgagggcgt ggggctgggg gtgcgcggta ccagggcctc 24060
gtcgccctcg gactcttcct ctgactccag gcggcggcgg agtcgcttct ttgggggcgc 24120
gcgcgtcagc ggcggcggag acggggacgg ggacggggac gggacgccct ccacaggggg 24180
tggtcttcgc gcagacccgc ggccgcgctc gggggtcttc tcgcgctggt cttggtcccg 24240
actggccatt gtatcctcct cctcctaggc agagagacat aaggagtcta tcatgcaagt 24300
cgagaaggag gagagcttaa ccaccccctc agagaccgcc gatgcgcccg ccgtcgccgt 24360
cgcccccgct accgccgacg cgcccgccac accgagcgac acccccacgg acccccccgc 24420
cgacgcaccc ctgttcgagg aagcggccgt ggagcaggac ccgggctttg tctcggcaga 24480
ggaggatttg caagaggagg agaataagga ggagaagccc tcagtgccaa aagatcataa 24540
agagcaagac gagcacgacg cagacgcaca ccagggtgaa gtcgggcggg gggacggagg 24600
gcatggcggc gccgactacc tagacgaagg aaacgacgtg ctcttgaagc acctgcatcg 24660
tcagtgcgcc atcgtctgcg acgctctgca ggagcgcagc gaggtgcccc tcagcgtggc 24720
ggaggtcagc cgcgcctacg agctcagcct cttttccccc cgggtgcccc cccgccgccg 24780
cgaaaacggc acatgcgagc ccaacccgcg cctcaacttc taccccgcct ttgtggtgcc 24840
cgaggtcctg gccacctatc acatcttctt tcaaaattgc aagatcccca tctcgtgccg 24900
cgccaaccgt agccgcgccg ataagatgct ggccctgcgc cagggcgacc acatacctga 24960
tatcgccgct ttggaagatg tgccaaagat cttcgagggt ctggggcgca acgagaagcg 25020
ggcagcaaac tctctgcaac aggaaaacag cgaaaatgag agtcacactg gagcgctggt 25080
ggagctggag ggcgacaacg cccgcctggc ggtgctcaag cgcagcatcg aggtcaccca 25140
ctttgcctac cccgcgctca acctgccccc caaagtcatg aacgcggtca tggacgggct 25200
gatcatgcgc cgcggccggc ccctcgctcc agatgcaaac ttgcatgagg agaccgagga 25260
cggtcagccc gtggtcagcg acgagcagct gacgcgctgg ctggagagcg cggaccccgc 25320
cgaactggag gagcggcgca agatgatgat ggccgcggtg ctggtcaccg tagagctgga 25380
gtgtctgcag cgcttcttcg gtgaccccga gatgcagaga aaggtcgagg agaccctaca 25440
ctacaccttc cgccagggct acgtgcgcca ggcttgcaag atctccaacg tggagctcag 25500
caacctggtg tcctacctgg gcatcttgca tgaaaaccgc cttgggcaga gcgtgctaca 25560
ctccaccctg cgcggggagg cgcgccgcga ctacgtgcgc gactgcgttt acctcttcct 25620
ctgctacacc tggcagacgg ccatgggggt ctggcagcag tgcctggagg agcgcaacct 25680
caaggagctg gagaagcttc tgcagcgcgc gctcaaagac ctctggacgg gcttcaacga 25740
gcgctcggtg gccgccgcgc tagccgacct catcttcccc gagcgcctgc tcaaaaccct 25800
ccagcagggg ctgcccgact tcaccagcca aagcatgttg caaaatttta ggaactttat 25860
cctggagcgt tctggcatcc tacccgccac ctgctgcgcc ctgcccagcg actttgtccc 25920
cctcgtgtac cgcgagtgcc ccccgccgct gtggggccac tgctacctgt tccaactggc 25980
caactacctg tcctaccacg cggacctcat ggaggactcc agcggcgagg ggctcatgga 26040
gtgccactgc cgctgcaacc tctgcacgcc ccaccgctcc ctggtctgca acacccaact 26100
gctcagcgag agtcagatta tcggtacctt cgagctacag ggtccgtcct cctcagacga 26160
gaagtccgcg gctccggggc taaaactcac tccggggctg tggacttccg cctacctgcg 26220
caaatttgta cctgaagact accacgccca cgaaatcagg ttttacgagg accaatcccg 26280
cccgcccaag gcggagctga ccgcctgcgt catcacccag ggcgagatcc taggccaatt 26340
gcaagccatc caaaaagccc gccaagagtt tttgctgaag aggggtcggg gggtgtatct 26400
ggacccccag tcgggtgagg agctcaaccc ggttcccccg ctgccaccgc cgcgggacct 26460
tgcttcccag gataagcatc gccatggctc ccagaaagaa gcagcagcgg ccgccgctgc 26520
cgccgcccca catgctggag gaagaggagg aatactggga cagtcaggca gaggaggttt 26580
cggacgagga ggagccggag acggagatgg aagagtggga ggaggacagc ttagacgagg 26640
aggcttccga agccgaagag gcaggcgcaa caccgtcacc ctcggccgca gccccctcgc 26700
aggcgccccc gaagtccgct cccagcatca gcagcaacag cagcgctata acctccgctc 26760
ctccaccgcc gcgacccacg gccgaccgca gacccaaccg tagatgggac accaccggaa 26820
ccggggccgg taagtcctcc gggagaggca agcaagcgca gcgccaaggc taccgctcgt 26880
ggcgcgctca caagaacgcc atagtcgctt gcttgcaaga ctgcgggggg aacatctcct 26940
tcgcccgccg cttcctgctc ttccaccacg gtgtggcctt cccccgtaac gtcctgcatt 27000
actaccgtca tctctacagc ccctactgcg gcggcagtga gccagaggcg gccagcggcg 27060
gcggcgcccg tttcggtgcc taggaagacc cagggcaaga cttcagccaa gaaactcgcg 27120
gcgaccgcgg cgaacgcggt cgcgggggcc ctgcgcctga cggtgaacga acccctgtcg 27180
acccgcgaac tgaggaaccg aatcttcccc actctctatg ccatcttcca gcagagcaga 27240
gggcaggatc aggaactgaa agtaaaaaac aggtctctgc gctccctcac ccgcagctgt 27300
ctgtatcaca agagcgaaga ccagcttcgg cgcacgctgg aggacgctga ggcactcttc 27360
agcaaatact gcgcgctcac tcttaaggac tagctccgcg cccttctcga atttaggcgg 27420
gaacgcctac gtcatcgcag cgccgccgtc atgagcaagg acattcccac gccatacatg 27480
tggagctatc agccgcagat gggactcgcg gcgggcgcct cccaagacta ctccacccgc 27540
atgaactggc tcagtgccgg cccacacatg atctcacagg ttaatgacat ccgcacccat 27600
cgaaaccaaa tattggtgaa gcaggcggca attaccacca cgccccgcaa taatcccaac 27660
cccagggagt ggcccgcgtc cctggtgtat caggaaattc ccggccccac caccgtacta 27720
cttccgcgtg attcccaggc cgaagtccaa atgactaact caggggcaca gctcgcgggc 27780
ggctgtcgtc acagggtgcg gcctcctcgc cagggtataa ctcacctgga gatccgaggc 27840
agaggtattc agctcaacga cgagtcggtg agctcctcgc tcggtctcag acctgacggg 27900
accttccaga tagccggagc cggccgatct tccttcacgc cccgccaggc gtacctgact 27960
ctgcagagct cgtcctcggc gccgcgctcg ggcggcatcg ggactctcca gttcgtgcag 28020
gagtttgtgc cctcggtcta cttcaacccc ttctcgggct ctcccggtcg ctacccggac 28080
cagtttatcc cgaactttga cgccgcgagg gactcggtgg acggctacga ctgaatgtcg 28140
ggtggacccg gtgcagagca acttcgcctg aagcaccttg accactgccg ccgccctcag 28200
tgctttgccc gctgtcagac cggtgagttc cagtactttt ccctgcccga ctcgcacccg 28260
gacggcccgg cgcacggggt gcgctttttc atcccgagtc aggtccgctc taccctaatc 28320
agggagttca ccgcccgtcc cctactggcg gagttggaaa aggggccttc tatcctaacc 28380
attgcctgca tttgctctaa ccctggatta caccaagatc tttgctgtca tttgtgtgct 28440
gagtataata aaggctgaga tcagaatcta ctcgggctcc tgtcgccatc ctgtcaacgc 28500
caccgtccaa gcccggcccg atcagcccga ggtgaacctc acctgtggtc tgcaccggcg 28560
cctgaggaaa tacctagctt ggtactacaa cagcactccc tttgtggttt acaacagctt 28620
tgaccaggac ggggtctcac tgagggataa cctctcgaac ctgagctact ccatcaggaa 28680
gaacaacacc ctcgagctac ttcctcctta cctgcccggg acttaccagt gtgtcaccgg 28740
cccctgcacc cacacccacc tgttgatcgt aaacgactct cttccgagaa cagacctcaa 28800
taactcctct ccgcagttcc ccagaacagg aggtgagctc aggaaacccc gggtaaagaa 28860
gggtggacaa gagttaacac ttgtggggtt tctggtatat gtgacgctgg tggtggctct 28920
tttgattaag gcttttcctt ccatgtctga actatccctc ttcttttatg aacaactcga 28980
ctagtgctaa cgggacccta cccaacgaat cgggattgaa tatcggtaac caggttgcag 29040
tttcactttt gattaccttc atagtcctct tcctgctagt gctgtcgctt ctgtgcctgc 29100
ggatcggggg ctgctgcatc cacgtttata tctggtgctg gctgtttaga aggttcggag 29160
accaccgcag gtagaataat gctgcttacc ctctttgtcc tggcgctggc tgccagctgc 29220
caagcctttt ccgaggctga cttcatagag ccccagtgca atatcactta taaatctgaa 29280
cgtgccatct gtactattct aatcaaatgt gttactcaac acgataaggt gactgttaaa 29340
tacaaagatc aattaaaaaa agacgcactt tacagcagct ggcaaccagg agatgatcaa 29400
aaatacaatg taaccgtctt ccagggcaaa ctctccaaaa cttacaatta caatttccca 29460
tttgagcaga tgtgtgactt tgtcatgtac atggaaaagc agtacaagct gtggcctcca 29520
actccccagg gctgtgtgga aaatccaggc tctttctgta tgatctctct ctgtgtaact 29580
gtgctggcac taatactcac gcttctgtat ctcagattta aatcaaggca aagcttcatt 29640
gatgaaaaga aaatgccata atcgctcaac gcttgattgc taacaccggg tttttatccg 29700
cagaatgatt ggaatcaccc tactaatcac ctccctcctt gcgattgccc atgggttgga 29760
acgaatcgaa gtccctgtgg gggccaatgt taccctggtg gggcctgtcg gcaatgctac 29820
attaatgtgg gaaaaatata ctaaaaatca atgggtttct tactgcacta acaaaaacag 29880
ccacaagccc agagccatct gcgatgggca aaatctaacc ttgattgatg ttcaattgct 29940
ggatgcgggc tactattatg ggcagctggg tacaatgatt aattactgga gaccccacag 30000
agattacatg cttcacgtag taaagggtcc cattagcagc ccaaccacca cctctaccac 30060
acccactacc accactactc ccaccaccag cactgccgcc cagcctcctc atagcagaac 30120
aaccactttt atcaattcca agtcccactc cccccacatt gccggcgggc cctccgcctc 30180
agactccgag accaccgaga tctgcttctg caaatgctct gacgccattg cccaggattt 30240
ggaagatcac gaggaagatg agcatgacta cgcagatgca tgccaggcat cagaggcaga 30300
agcgctaccg gtggccctaa aacagtatgc agactcccac accaccccca accttcctcc 30360
accttcccag aagccaagtt tcctggggga aaatgaaact ctgcctcttt ccatactagc 30420
tctgacatct gttgctattt tggccgctct gctggtgctt ctatgctcta tatgctacct 30480
gatctgctgc agaaagaaaa aatctcacgg ccatgctcac cagcccctca tgcacttccc 30540
ttaccctcca gagctgggcg accacaaact ttaagtctgc agtagctatc tgcccatccc 30600
ttgtcagtcg acagcgatga gccccactaa tctaacagcc tctggactta caacattgtc 30660
tcttaatgag accaccgctc ctcaagacct gtacgatggt gtctccgcgc tggttaacca 30720
gtgggatcac ctgggcatat ggtggctcct cataggagca gtgaccctgt gcctaatcct 30780
ggtctggatc atctgctgca tcaaaagcag aagacccagg cggcggccca tctacaggcc 30840
cttcgtcatc acacctgaag ataatgatga tgatgacacc acctccaggc tgcagagcct 30900
aaagcagcta ctcttctctt ttacagcatg gtaaattgaa tcatgccccg cattttcatc 30960
tacttgcttc tccttccact ttttctgggc tcctctacat tggccactgt gtcccacatc 31020
gaggtagact gcctcacgcc cttcacagtc tacctgcttt tcggctttgt catctgcacc 31080
tttgtctgca gcgttatcac tgtagtgatc tgcttcatac agtgcatcga ctacatctgt 31140
gtgcgggtgg cctactttag acaccacccc cagtatcgca acagggacat agcggctctc 31200
ctaagacttg tttaaatcat ggccaaatta cctgtgattg gtcttctgat tatctgctgc 31260
gtcctagccg cgattgggac tcaacctaat accaccacca gcgctcccag aaagagacat 31320
gtatcctgca gcttcaagcg tccctggaat ataccccaat gctttactga tgaacctgaa 31380
atctctttgg cttggtactt cagcgtcacc gcccttctca tcttctgcag tacggttatt 31440
gctcttgcca tctacccttc ccttaacctg ggctggaatg ctgtcaactc tatggaatat 31500
cccaccttcc cagaaccaga cctgccagac ctggttgttc taaacgcgtt tcctcctcct 31560
ccagttcaaa atcagtttcg ccctccgtcc cctacgccca ctgaggtcag ctactttaat 31620
ctaacaggcg gagatgactg aaaacctaga cctagaaatg gacggtctct gcagcgagca 31680
acgcacacta gagaggcgcc ggcaaaaagc agagctcgag cgtcttaaac aagagctcca 31740
agacgccgtg gccatacacc agtgcaaaaa agggctcttc tgtctggtaa aacaggccac 31800
gctcacctat gaaaaaacag gtgacaccca ccgcctagga tacaagctgc ccacacagcg 31860
ccaaaagttt gcccttatga taggtgaaca acccatcacc gtcacccagc actccgtgga 31920
gacagaaggc tgcattcatg ctccctgcag gggcgctgac tgcctctaca ccttgatcaa 31980
aaccctctgc ggtctcagag accttatccc tttcaattga tcataactgt aatcaataaa 32040
aaatcactta cttgaaatct gatagcaaga ctctgtccaa ttttttcagc aacacttcct 32100
tcccctcctc ccaactctgg tactctaggc gcctcctagc tgcaaacttc ctccacagtc 32160
tgaagggaat gtcagattcc tcctcctgtc cctccgcacc cacgatcttc atgttgttac 32220
agatgaaacg cgcgagatcg tctgacgaga ccttcaaccc cgtgtacccc tacgataccg 32280
agatcgctcc gacttctgtc cctttcctta cccctccctt tgtatcatcc gcaggaatgc 32340
aagaaaatcc agctggggtg ctgtccctgc acctgtcaga gccccttacc acccacaatg 32400
gggccctgac tctaaaaatg gggggcggcc tgaccctgga caaggaaggg aatctcactt 32460
cccaaaacat caccagtgtc gatccccctc tcaaaaaaag caagaacaac atcagccttc 32520
agaccgccgc acccctcgcc gtcagctccg gggccctaac cctttttgcc actccccccc 32580
tagcggtcag tggcgacaac cttactgtgc agtctcaggc ccctcttact ttggaagact 32640
caaaactaac tctggccacc aaaggacccc taactgtgtc cgaaggcaaa cttgtcctag 32700
aaacagagcc tcccctgcat gcaagtgaca gcagtagcct gggccttagc gtcacggccc 32760
cacttagcat taacaatgac agcctaggac tagacatgca agcgcccatc agctctcgag 32820
atggaaaact ggctctaaca gtggcggccc ccctaactgt ggccgagggt atcaatgctt 32880
tggcagtagc cacaggtaat ggtattggac taaatgaaac caacacacac ctgcaggcaa 32940
aactggtcgc gcccctaggc tttgatacca acggcaacat taagctaagc gtcgcaggag 33000
gcatgaggct aaacaataac acactgatac tagatgtaaa ctacccattt gaggctcaag 33060
gccaactgag cctaagagtg ggctcgggcc cactatatgt agattctagt agtcataacc 33120
taaccattag atgccttagg ggattgtatg taacatcttc taacaaccaa aacggtctag 33180
aggccaacat taaactaaca aaaggccttg tgtatgacgg aaatgccata gcagttaatg 33240
ttggcaaagg gctggaatac agccctactg gcacaacaga aaaacctata cagactaaaa 33300
taggtctagg catggagtat gacactgagg gagccatgat gacaaaacta ggctctggac 33360
taagctttga caattcagga gccattgtgg tgggaaacaa aaatgatgac aggcttactt 33420
tgtggaccac accggaccca tcgcccaact gtcagattta ctctgaaaaa gatgctaaac 33480
taaccttggt actgactaaa tgtggcagtc aggttgtagg cacagtatct attgccgctc 33540
ttaaaggtag ccttgtgcca atcactagtg caatcagtgt ggttcagata tacctaaggt 33600
ttgatgaaaa tggggtgctg atgagtaact cttcacttaa tggcgaatac tggaatttta 33660
gaaacggaga ctcaactaat ggcacaccat atacaaacgc agtgggtttt atgcctaatc 33720
tactggccta tcctaaaggt caaactacaa ctgcaaaaag taacattgtc agccaggtct 33780
acatgaacgg ggacgatact aaacccatga catttacaat caacttcaat ggccttagtg 33840
aaacagggga tacccctgtc agtaaatatt ccatgacatt ctcatggagg tggccaaatg 33900
gaagctacat agggcacaat tttgtaacaa actcctttac tttctcctac atcgcccaag 33960
aataaagaaa gcacagagat gcttgttttt gatttcaaaa ttgtgtgctt ttatttattt 34020
tcaagcttac agtatttcca gtagtcatta gaatagagct taattaaact gcatgagaac 34080
ccttccacat agcttaaatt atcaccagtg caaatggaaa aaaatcaaca taccttttta 34140
tccagatatc aaagaactct agtggtcagt tttcccccac cctcccagct cacagaatac 34200
acagtccttt ccccccggct ggctttaaac aacactatct cattggtaac agacatattt 34260
ttaggtgtaa taatccacac ggtctcttgg cgggccaaac gctggtctgt gatgttaata 34320
aactccccag gcagctcttt caagttcacg tcgctgtcca actgctgaag cgctcgcggc 34380
tccgactgcg cctctagcgg aggcaacggc agcacccgat ccttgatcta taaaggagta 34440
gagtcataat cccccataag aatagggcgg tgatgcagca acaaggcgcg cagcaactcc 34500
tgccgccgcc tctccgtacg acaggaatgc aacggggtgg tggtctcctc cgcgataatc 34560
cgcaccgctc gcagcatcag catcctcgtc ctccgggcac agcagcgcat cctgatctca 34620
ctgagatcgg cgcagtaagt gcagcacaac accaagatgt tatttaagat cccacagtgc 34680
aaagcactgt acccaaagct catggcggga aggacagccc ccacgtgacc atcgtaccag 34740
atcctcaggt aaatcaaatg acgacctctc ataaacacgc tggacatata catcacctcc 34800
ttgggcatga gctgattcac cacctctcga taccacaggc atcgctgatt aattaaagac 34860
ccctcgagca ccatcctgaa ccaggaagcc agcacctgac cccccgccag gcactgcagg 34920
gaccccggtg aatcgcagtg gcagtgaaga ctccagcgct cgtagccgtg aaccatagag 34980
ctggtcatta tatccacatt ggcacaacac agacacactt tcatacactt tttcatgatt 35040
agcagctcct ctctagtcaa gaccatatcc caaggaatca cccactcttg aatcaaggta 35100
aatcccacac agcagggcag gcctctcaca taactcacgt tatgcatagt gagcgtgtcg 35160
caatctggaa ataccggatg atcttccatc accgaagccc gggtctccgt ctcaaaggga 35220
ggtaaacggt ccctcgtgta gggacagtgg cgggataatc gagatcgtgt tgaacgtaga 35280
gtcatgccaa agggaacagc ggacgtactc atatttcctc cagcagaacc aagtgcgcgc 35340
gtggcagcta tccctgcgtc ttctgtctcg ccgcctgccc cgctcggtgt agtagttgta 35400
atacagccac tccctcagac cgtcaaggcg ctccctggcg tccggatcta taacaacacc 35460
gtcctgcagc gccgccctga tgacatccac caccgtagag tatgccaagc ccagccacga 35520
aatgcactca ctttgacagc gagagatagg aggagcggga agagatggaa gaaccatgat 35580
agtaaaagaa cttttattcc aatcgatcct ctacaatgtc aaagtgtaga tctatcagat 35640
ggcactggtc tcctccgctg agtcgatcaa aaataacagc taaaccacaa acaacacgat 35700
tggtcaaatg ctgcacaagg gcttgcagca taaaatcgcc tcgaaagtcc accgcaagca 35760
taacatcaaa gccaccgccc ctatcatgat ctatgataaa aaccccacag ctatccacca 35820
gacccatata gttttcatct ctccatcgtg aaaaaatatt tacaagctcc tcctttaaat 35880
cacctccaac caattcaaaa agttgagcca gaccgccctc caccttcatt ttcagcatgc 35940
gcatcatgat tgcaaaaatt caggctcctc agacacctgt ataagattga gaagcggaac 36000
gttaacatca atgtttcgct cgcgaagatc gcgcctcagt gcaagcatga tataatccca 36060
caggtcggag cggatcagcg aggacatctc cccgccagga accaactcaa cggagcctat 36120
gctgattata atacgcatat tcggggctat gctaaccagc acggccccca aataggcgta 36180
ctgcataggc ggcgacaaaa agtgaacagt ttgggttaaa aaatcaggca aacactcgcg 36240
caaaaaagca agaacatcat aaccatgctc atgcaaatag atgcaagtaa gctcaggaac 36300
gaccacagaa aaatgcacaa tttttctctc aaacatgact gcgagccctg caaaaaataa 36360
aaaagaaaca ttacacaaga gtagcctgtc ttacaatggg atagactact ctaaccaaca 36420
taagacgggc cacgacatcg cccgcgtggc cataaaaaaa attatccgtg tgattaaaaa 36480
gaagcacaga tagctggcca gtcatatccg gagtcatcac gtgcgaaccc gtgtagaccc 36540
ccgggttgga cacatcggcc aaacaaagaa agcggccaat gtatcccgga ggaatgataa 36600
cactaagacg aagatacaac agaataaccc catggggggg aataacaaag ttagtaggtg 36660
aataaaaacg ataaacaccc gaaactccct cctgcgtagg caaaatagcg ccctcccctt 36720
ccaaaacaac atacagcgct tccacagcag ccatgacaaa agactcaaaa cactcaaaag 36780
actcagtctt accaggaaaa taaaagcact ctcacagcac cagcactaat cagagtgtga 36840
agagggccaa gtgccgaacg agtatatata ggaattaaaa atgacgtaaa tgtgtaaagg 36900
tcaaaaaacg cccagaaaaa tacacagacc aacgcccgaa acgaaaaccc gcgaaaaaat 36960
acccagaagt tcctcaacaa ccgccacttc cgctttccca cgatacgtca cttcctcaaa 37020
aatagcaaac tacatttccc acatgtacaa aaccaaaacc cctccccttg tcaccgccca 37080
caacttacat aatcacaaac gtcaaagcct acgtcacccg ccccgcctcg ccccgcccac 37140
ctcattatca tattggcctc aatccaaaat aaggtatatt attgatgatg 37190
<210> 10
<211> 37184
<212> DNA
<213> Great Ape Adenovirus
<400> 10
catcatcaat aatatacctt attttggatt gaggccaata tgataatgag gtgggcgggg 60
cgaggcgggg cgggtgacgt aggacgcgcg agtagggttg ggaggtgtgg cggaagtgtg 120
gcatttgcaa gtgggaggag ctgacatgca atcttccgtc gcggaaaatg tgacgttttt 180
gatgagcgcc gcctacctcc ggaagtgcca attttcgcgc gcttttcacc ggatatcgta 240
gtaattttgg gcgggaccat gtaagatttg gccattttcg cgcgaaaagt gaaacgggga 300
agtgaaaact gaataatagg gcgttagtca tagcgcgtaa tatttaccga gggccgaggg 360
actttgaccg attacgtgga ggactcgccc aggtgttttt tacgtgaatt tccgcgttcc 420
gggtcaaagt ctccgttttt attgtcgccg tcatctgacg cggagggtat ttaaacccgc 480
tgcgctccta aagaggccac tcttgagtgc cagcgagaag agttttctcc tccgctccgt 540
ttcggcgatc gaaaaatgag acatttagcc tgcactccgg gtcttttgtc cggccgggcg 600
gcgtccgagc ttttggacgc tttgctcaat gaggttctga gcgatgattt tccgtctact 660
acccacttta gcccacctac tcttcacgaa ctgtacgatc tggatgtact ggtggatgtg 720
aacgatccca acgaggaggc ggtttctacg ttttttcccg agtctgcgct tttggctgcc 780
caggagggat ttgacctaca cactccgccg ctgcctattt tagagtctcc gctgccggag 840
cccagtggta taccttatat gcctgaactg cttcccgaag tggtagacct gacctgccac 900
gagccgggct ttccgcccag cgacgatgag ggtgagcctt ttgctttaga ctatgctgag 960
atacctgggc tcggttgcag gtcttgtgca tatcatcaga gggttaccgg agaccccgag 1020
gttaagtgtt cgctgtgcta tatgaggctg acctcttcct ttatctacag taagtttttt 1080
tgtgtaggtg ggctttttgg gtaggtgggt tttgtggcag gacaggtgta aatgttgctt 1140
gtgttttttg tacctgcagg tccggtgtcc gagccagacc cggagcccga ccgcgatccc 1200
gagccggatc ccgagcctcc tcgcaggcca aggaaattac cttccatttt gtgcaagcct 1260
aagacacctg tgaggaccag cgaggcggac agcactgact ctggcacttc tacctctcct 1320
cctgaaattc acccagtggt tcctctgggt atacatagac ctgttgctgt tagagtttgc 1380
gggcgacgcc ctgcagtaga gtgcattgag gacttgctta acgatcccga gggacctttg 1440
gacttgagca ttaaacgccc taggcaataa accccaccta agtaataaac cccacctaag 1500
taataaactt taccgccctt ggttattgag atgacgccca atgtttgctt ttgaatgact 1560
tcatgtgtat aataaaagtg agtgtggtca taggtctctt gtttgtctgg gcggggttta 1620
agggtatata agtttctcgg ggctaaactt ggttacactt gaccccaatg gaggcgtggg 1680
ggtgcttgga ggagtttgcg gacgtgcgcc gtttgctgga cgagagctct agcaatacct 1740
atagtatttg gaggtatctg tggggctcta ctcaggccaa gttggtcttc agaattaagc 1800
aggattacaa gtgcgatttt gaagagcttt ttagttcctg tggtgagctt ttgcaatcct 1860
tgaatctggg ccaccaggct atcttccagg aaaaggttct ctcgactttg gatttttcca 1920
ctcccgggcg caccgccgct tgtgtggctt ttgtgtcttt tgtgcaagat aaatggagcg 1980
gggagaccca cctgagtcac ggctacgtgc tggatttcat ggcgatggct ctttggaggg 2040
cttacaacaa atggaagatt cagaaggaac tgtacggttc cgccctacgt cgtccacttc 2100
tgcagcggca ggggctgatg tttcccgacc atcgccagca tcagaatctg gaagacgagc 2160
gagcggagaa gatcagcttg agagccggcc tggaccctcc tcaggaggaa tgaatctccc 2220
gcaggtggtt gagctgtttc ccgaactgag acgggtcctg actatcaggg aggatggtca 2280
gtttgtgaag aagctgaaga gggatcgggg tgagggagat gatgaggcgg ctagcaattt 2340
agcttttagt ctgataactc gccaccgacc ggaatgtatt acctatcagc agattaagga 2400
gagttgtgcc aacgagctgg atcttttggg tcagaagtat agcatagaac agcttaccac 2460
ttactggctt cagcccgggg atgattggga agaggcgatt agggtgtatg caaaggtggc 2520
cctgcggccc gattgcaagt ataagattac taagttggtt aatattagaa actgctgcta 2580
tatttctgga aacggggccg aagtggagat agatactgag gacagggtgg ctattaggtg 2640
ttgcatgata aacatgtggc ccgggatact ggggatggat ggggtgatat ttatgaatgt 2700
gaggttcacg ggccccaact ttaatggtac ggtgttcatg ggcaacacca acttgctcct 2760
gcatggtgcg agtttctatg ggtttaacaa cacctgtata gaggcctgga ccgatgtaaa 2820
ggttcgaggt tgttcctttt atagctgttg gaaggcggtg gtgtgtcgcc ctaaaagcag 2880
gggttctgtg aagaaatgct tgtttgaaag gtgcacccta ggtatccttt ctgagggcaa 2940
ctccagggtg cgccataatg tggcttcgaa ctgcggttgc ttcatgcaag tgaagggggt 3000
gagcgttatc aagcataact cggtctgtgg aaactgcgag gatcgcgcct ctcagatgct 3060
gacctgcttt gatggcaact gtcacctgtt gaagaccatt catataagca gtcaccccag 3120
aaaggcctgg cccgtgtttg agcataacat tctgacccgc tgttccttgc atctgggggt 3180
caggaggggt atgttcctgc cttaccagtg taactttagc cacactaaaa tcctgctgga 3240
acccgagtgc atgactaagg tcagcctgaa tggtgtgttt gatgtgagtc tgaagatttg 3300
gaaggtgctg aggtatgatg agaccaggac caggtgccga ccctgcgagt gcggcggcaa 3360
gcacatgaga aatcagcctg tgatgttgga tgtgaccgag gagcttaggc ctgaccatct 3420
ggtgctggcc tgcaccaggg ccgagtttgg gtctagcgat gaggataccg attgaggtgg 3480
gtaaggtggg cgtggctagc agggtgggcg tgtataaatt gggggtctaa ggggtctctc 3540
tgtttgtctt gcaacagccg ccgccatgag cgacaccggc aacagctttg atggaagcat 3600
ctttagtccc tatctgacag tgcgcatgcc tcactgggcc ggagtgcgtc agaatgtgat 3660
gggttccaac gtggatggac gtcccgttct gccttcaaat tcgtctacta tggcctacgc 3720
gaccgtggga ggaactccgc tggacgccgc gacctccgcc gccgcctccg ccgccgccgc 3780
gaccgcgcgc agcatggcta cggaccttta cagctctttg gtggcgagca gcgcggcctc 3840
tcgcgcgtct gctcgggatg agaaactgac tgctctgctg cttaaactgg aagacttgac 3900
ccgggagctg ggtcaactga cccagcaggt ttccagcttg cgtgagagca gccttgcctc 3960
cccctaatgg cccataatat aaataaaagc cagtctgttt ggattaagca agtgtatgtt 4020
ctttatttaa ctctccgcgc gcggtaagcc cgggaccagc ggtctcggtc gtttagggtg 4080
cggtggattt tttccaacac gtggtacagg tggctctgga tgtttagata catgggcatg 4140
agtccatccc tggggtggag gtagcaccac tgcagagctt cgtgctcggg ggtggtgttg 4200
tatatgatcc agtcgtagca ggagcgctgg gcgtggtgct gaaaaatgtc cttaagcaag 4260
aggcttatag ctagggggag gcccttggtg taagtgttta caaatctgct tagctgggag 4320
gggtgcatcc ggggggatat gatgtgcatc ttggactgga tttttaggtt ggctatgttc 4380
ccgcccagat cccttctggg attcatgttg tgcaggacca ccagcacggt atatccagtg 4440
cacttgggaa atttatcgtg gagcttagac gggaatgcat ggaagaactt ggagacgccc 4500
ttgtggcctc ccagattttc catacattcg tccatgatga tggcaatggg cccgtgggaa 4560
gctgcctgag caaaaacgtt tctggcatcg ctcacatcgt agttatgttc cagggtgagg 4620
tcatcatagg acatctttac gaatcggggg cgaagggtcc cggactgggg gatgatggta 4680
ccctcgggcc ccggggcgta gttcccctca cagatctgca tctcccaggc tttcatttca 4740
gagggaggga tcatatccac ctgcggggcg atgaaaaaga cagtttctgg cgcaggggag 4800
attaactggg atgagagcag gtttctgagc agctgtgact ttccacagcc ggtgggccca 4860
tatatcacgc ctatcaccgg ctgcagctgg tagttaagag agctgcagct gccgtcctcc 4920
cggagcaggg gggccacctc gttgagcata tccctgacgt ggatgttctc cctgaccagt 4980
tccgccagaa ggcgctcgcc gcccagcgaa agcagctctt gcaaggaagc aaaatttttc 5040
agcggtttca ggccatcggc cgtgggcatg tttttcagcg tctgggtcag cagctccagc 5100
ctgtcccaga gctcggtgat gtgctctacg gcatctcgat ccagcagatc tcctcgtttc 5160
gcgggttggg gcggctttcg ctgtagggca ccagccgatg ggcgtccagc ggggccagag 5220
tcatgtcctt ccatgggcgc agggtcctcg tcagggtggt ctgggtcacg gtgaaggggt 5280
gcgctccggg ttgggcactg gccagggtgc gcttgaggct ggttctgctg gtgctgaatc 5340
gctgccgctc ttcgccctgc gcgtcggcca ggtagcattt gaccatggtc tcgtagtcga 5400
gaccctcggc ggcgtgcccc ttggcgcgga gctttccctt ggaggtggcg ccgcacgagg 5460
ggcactgcag gctcttcagg gcgtagagct tgggagcgag aaacacggac tctggggagt 5520
aggcgtccgc gccgcaggcc gagcagaccg tctcgcattc caccagccaa gtgagttccg 5580
ggcggtcagg gtcaaaaacc aggttgcccc catgcttttt gatgcgtttc ttaccttggc 5640
tctccatgag gcggtgtccc ttctcggtga cgaagaggct gtccgtgtcc ccgtagaccg 5700
acttcagggg cctgtcttcc agcggagtgc ctctgtcctc ctcgtagaga aactctgacc 5760
actctgagac gaaggcccgc gtccaggcca ggacgaagga ggccacgtgg gaggggtagc 5820
ggtcgttgtc cactagcggg tccaccttct ccagggtgtg caggcacatg tccccctcct 5880
ccgcgtccag aaaagtgatt ggcttgtagg tgtaggacac gtgaccgggg gttcccaacg 5940
ggggggtata aaagggggtg ggtgcccttt catcttcact ctcttccgca tcgctgtctg 6000
cgagagccag ctgctggggt aagtattccc tctcgaaggc gggcatgacc tcagcgctca 6060
ggttgtcagt ttctaaaaat gaggaggatt tgatgttcac ctgtccggag gtgatacctt 6120
tgagggtacc tgggtccatc tggtcagaaa acactatttt tttgttatca agcttggtgg 6180
cgaatgaccc gtagagggcg ttggagagca gcttggcgat ggagcgcagg gtctggtttt 6240
tgtcgcggtc ggctcgctcc ttggccgcga tgttgagttg cacgtactcg cgggccacgc 6300
acttccactc ggggaacacg gtggtgcgct cgtctgggat caggcgcacc ctccagccgc 6360
ggttgtgcag ggtgaccatg tcgacgctgg tggcgacctc accgcgcaga cgctcgttgg 6420
tccagcagag gcggccgccc ttgcgcgagc agaagggggg tagggggtcc agctggtcct 6480
cgtttggggg gtccgcgtcg atggtaaaga ccccggggag caggcgcggg tcaaagtagt 6540
cgatcttgca agcttgcatg tccagagccc gctgccattc gcgggcggcg agcgcgcgct 6600
cgtaggggtt gaggggcggg ccccagggca tggggtgggt gagcgcggag gcgtacatgc 6660
cgcagatgtc atacacgtac aggggttccc tgaggatacc gaggtaggtg gggtagcagc 6720
gccccccgcg gatgctggcg cgcacgtagt catagagctc gtgggagggg gccagcatgt 6780
tgggcccgag gttggtgcgc tgggggcgct cggcgcggaa gacgatctgc ctgaagatgg 6840
cgtgggagtt ggaggagatg gtgggccgct ggaagacgtt gaagcttgct tcttgcaagc 6900
ccacggagtc cctgacgaag gaggcgtagg actcgcgcag cttgtgcacc agctcggcgg 6960
tgacctggac gtcgagcgca cagtagtcga gggtctcgcg gatgatgtca tacctatcct 7020
cccccttctt tttccacagc tcgcggttga ggacgaactc ttcgcggtct ttccagtact 7080
cttggagggg aaacccgtcc gtgtccgaac ggtaagagcc tagcatgtag aactggttga 7140
cggcctggta ggggcagcag cccttctcca cgggcagcgc gtaggcctgc gccgccttgc 7200
ggagggaggt gtgggtgagg gcgaaagtgt ccctgaccat gactttgagg tattgatgtc 7260
tgaagtctgt gtcatcgcag ccgccctgtt cccacagggt gtagtccgtg cgctttttgg 7320
agcgcgggtt gggcagggag aaggtgaggt cattgaagag gatcttcccc gctcgaggca 7380
tgaagtttct ggtgatgcga aagggccctg ggaccgagga gcggttgttg atgacctggg 7440
cggccaggac gatctcgtca aagccgttta tgttgtgtcc cacgatgtag agctccagga 7500
agcggggctg gcccttgatg gaggggagct ttttaagttc ctcgtaggta agctcctcgg 7560
gcgattccag gccgtgctcc tccagggccc agtcttgcaa gtgagggttg gccgccagga 7620
aggatcgcca gaggtcgcgg gccatgaggg tctgcaggcg gtcgcggaag gttctgaact 7680
gccgccccac ggccattttt tcgggggtga tgcagtagaa ggtgaggggg tctttctccc 7740
aggggtccca tctgagctct cgggcgaggt cgcgcgcggc agcgaccaga gcctcgtcgc 7800
cccccagttt catgaccagc atgaagggca cgagttgctt gccaaaggct cccatccaag 7860
tgtaggtttc tacatcgtag gtgacaaaga ggcgctccgt gcgaggatga gagccgattg 7920
ggaagaactg gatctcccgc caccagttgg aggattggct gttgatgtgg tgaaagtaga 7980
agtcccgtct gcgggccgag cactcgtgct ggcttttgta aaagcgaccg cagtactggc 8040
agcgctgcac gggttgtata tcttgcacga ggtgaacctg gcgacctctg acgaggaagc 8100
gcagcgggaa tctaagtccc ccgcctgggg tcccgtgtgg ctggtggtct tttactttgg 8160
ttgtctggcc gccagcatct gtctcctgga gggcgatggt ggaacagacc accacgccgc 8220
gagagccgca ggtccagatc tcggcgctcg gcgggcggag tttgatgacg acatcgcgca 8280
cattggagct gtccatggtc tccagctccc gcggcggcag gtcagccggg agttcctgga 8340
ggttcacctc gcagagacgg gtcaaggcgc ggacagtgtt gagatggtat ctgatttcaa 8400
ggggcatgtt ggaggcggag tcgatggctt gcaggaggcc gcagccccgg ggggccacga 8460
tggttccccg cggggcgcga ggggaggcgg aagctggggg tgtgttcaga agcggtgacg 8520
cgggcgggcc cccggaggta gggggggttc cggccccaca ggcatgggcg gcaggggcac 8580
gtcttcgccg cgcgcgggca ggggctggtg ctggctccga agagcgcttg cgtgcgcgac 8640
gacgcgacgg ttggtgtcct gtatctggcg cctctgagtg aagaccacgg gtcccgtgac 8700
cttgaacctg aaagagagtt cgacagaatc aatctcggca tcgttgacag cggcctggcg 8760
caggatctcc tgcacgtcgc ccgagttgtc ctggtaggcg atttctgcca tgaactgctc 8820
gatctcttcc tcctggagat ctcctcgtcc ggcgcgctcc acggtggccg ccaggtcgtt 8880
ggagatgcga cccatgagct gcgagaaggc gttgagtccg ccctcgttcc agacccggct 8940
gtagaccacg cccccctcgg cgtcgcgggc gcgcatgacc acctgggcca ggttgagctc 9000
cacgtgtcgc gtgaagacgg cgtagttgcg caggcgctgg aaaaggtagt tcagggtggt 9060
ggcggtgtgc tcggcgacga agaagtacat gacccagcgc cgcaacgtgg attcattgat 9120
gtcccccaag gcctccaggc gctccatggc ctcgtagaag tccacggcga agttgaaaaa 9180
ctgggagttg cgagcggaca cggtcaactc ctcctccaga agacggatga gctcggcgac 9240
agtgtcgcgc acctcgcgct cgaaggccac ggggggcgct tcttcctctt ccacctcttc 9300
ttccatgatt gcttcttctt cttcctcagc cgggacggga gggggcggcg gcgggggagg 9360
ggcgcggcgg cggcggcggc gcaccgggag gcggtcgatg aagcgctcga tcatctcccc 9420
ccgcatgcgg cgcatggtct cggtgacggc gcggccgttc tcccgggggc gcagctcgaa 9480
gacgccgcct ctcatttcgc cgcggggcgg gcggccgtga ggtagcgaga cggcgctgac 9540
tatgcatctt aacaattgct gtgtaggtac gccgccaagg gacctgattg agtccagatc 9600
caccggatcc gaaaaccttt ggaggaaagc gtctatccag tcgcagtcgc aaggtaggct 9660
gagcaccgtg gcgggcgggg gcgggtcggg agagttcctg gcggagatgc tgctgatgat 9720
gtaattaaag taggcggtct tgagaaggcg gatggtggac aggagcacca tgtctttggg 9780
tccggcctgt tggatgcgga ggcggtcggc catgccccag gcctcgttct gacaccggcg 9840
caggtctttg tagtaatctt gcatgagtct ttccaccggc acttcttctc cttcctcttc 9900
ttcatctcgc cggtggtttc tcgcgccgcc catgcgcgtg accccaaagc ccctgagcgg 9960
ctgcagcagg gccaggtcgg cgaccacgcg ctcggccaag atggcctgct gtacctgagt 10020
gagggtcctc tcgaagtcat ccatgtccac gaagcggtgg taggcacccg tgttgatggt 10080
gtaggtgcag ttggccatga cggaccagtt gacggtctgg tgtcccggct gcgagagctc 10140
cgtgtaccgc aggcgcgaga aggcgcggga atcgaacacg tagtcgttgc aagtccgcac 10200
cagatactgg tagcccacca ggaagtgcgg cggaggttgg cgatagaggg gccagcgctg 10260
ggtggcgggg gcgccgggcg ccaggtcttc cagcatgagg cggtggtatc cgtagatgta 10320
cctggacatc caggtgatgc ctgcggcggt ggtggtggcg cgcgcgtagt cgcggacccg 10380
gttccagatg tttcgcaggg gcgagaagtg ttccatggtc ggcacgctct ggccggtgag 10440
gcgcgcgcag tcgttgacgc tctatacaca cacaaaaacg aaagcgttta cagggctttc 10500
gttctgtagc ctggaggaaa gtaaatgggt tgggttgcgg tgtgccccgg ttcgagacca 10560
agctgagctc agccggctga agccgcagct aacgtggtat tggcagtccc gtctcgaccc 10620
aggccctgta tcctccagga tacggtcgag agcccttttg ctttcttggc caagcgcccg 10680
tggcgcgatc tgggatagat ggtcgcgatg agaggacaaa agcggctcgc ttccgtagtc 10740
tggagaaaca atcgccaggg ttgcgttgcg gcgtaccccg gttcgagccc ctatggcggc 10800
ttggatcggc cggaaccgcg gctaacgtgg gctgtggcag ccccgtcctc aggaccccgc 10860
cagccgactt ctccagttac gggagcgagc cccttttgtt tttttatttt ttagatgcat 10920
cccgtgctgc ggcagatgcg cccctcgccc cggcccgatc agcagcagca acagcaggca 10980
tgcagacccc cctctcctct ccccgccccg gtcaccacgg ccgcggcggc cgtgtccggt 11040
gcggggggcg cgctggagtc agatgagcca ccgcggcggc gacctaggca gtatctggac 11100
ttggaagagg gcgagggact ggcgcggctg ggggcgagct ctccagagcg ccacccgcgg 11160
gtgcagttga aaagggacgc gcgtgaggcg tacctgccgc ggcaaaacct gtttcgcgac 11220
cgcgggggcg aggagcccga ggagatgcgg gactgcaggt tccaagcggg gcgcgagctg 11280
cgccgcggct tggacagaca gcgcctgctg cgcgaggagg actttgagcc cgacacgcag 11340
acgggcatca gccccgcgcg cgcgcacgtg gccgcggccg acctggtgac cgcctacgag 11400
cagacggtga accaggagcg caacttccaa aaaagcttca acaaccacgt gcgcacgctg 11460
gtggcgcgcg aggaggtgac cctgggtctc atgcatctgt gggacctggt ggaggcgatc 11520
gtgcagaacc ccagcagcaa gcccctgacc gcgcagctgt tcctggtggt gcagcacagc 11580
agggacaacg aggccttcag ggaggcgctg ctgaacatca ccgagccgga ggggcgctgg 11640
ctcctggacc tgataaacat cctgcagagc atagtggtgc aggagcgcag cctgagcctg 11700
gccgagaagg tggcggccat taactattct atgctgagcc tgggcaagtt ctacgctcgc 11760
aagatctaca agacccccta cgtgcccata gacaaggagg tgaagataga cagcttctac 11820
atgcgcatgg cgctgaaggt gctaaccctg agcgacgacc tgggagtgta ccgcaacgag 11880
cgcatccaca aggccgtgag cgccagccgg cggcgcgagc tgagcgaccg cgaactgatg 11940
cacagtctgc agcgcgcgct gaccggcgcg ggcgagggcg acagggaggt cgagtcctac 12000
tttgacatgg gggccgacct gcactggcag ccgagccgcc gcgccctgga agcggcgggg 12060
gcgtacggcg gccccctggc ggccgatgac gaggaagagg aggactatga gctagaggag 12120
ggcgagtacc tggaggactg acctggctgg tggtgttttg gtatagatgc aagatccgaa 12180
cgtggcggac ccggcggtcc gggcggcgct gcagagccag ccgtccggca ttaactcctc 12240
tgacgactgg gccgcggcca tgggtcgcat catggccctg accgcgcgca accccgaggc 12300
cttcaggcag cagcctcagg ctaaccggct ggcggccatc ttggaagcgg tagtgcccgc 12360
gcgctccaac cccacccacg agaaggtgct ggccatagtc aacgcgctgg cggagagcag 12420
ggccatccgg gcagacgagg ccggactggt gtacgatgcg ctgctgcagc gggtggcgcg 12480
gtacaacagc ggcaacgtgc agaccaacct ggaccgcctg gtgacggacg tgcgcgaggc 12540
cgtggcgcag cgcgagcgct tgcatcagga cggcaacctg ggctcgctgg tggcgctaaa 12600
cgccttcctt agcacccagc cggccaacgt accgcggggg caggaggact acaccaactt 12660
cttgagcgcg ctgcggctga tggtgaccga ggtccctcag agcgaggtgt accagtcggg 12720
gcccgactac ttcttccaga ccagcagaca gggcttgcaa accgtgaacc tgagccaggc 12780
tttcaagaac ctgcgggggc tgtggggagt gaaggcgccc accggcgacc gggctacggt 12840
gtccagcctg ctaaccccca actcgcgcct gctgctgctg ctgatcgcgc ccttcacgga 12900
cagcgggagc gtctcgcggg agacctatct gggccacctg ctgacgctgt accgcgaggc 12960
catcgggcag gcgcaggtgg acgagcacac cttccaggag atcaccagcg tgagccacgc 13020
gctggggcag gaggacacgg gcagcctgca ggcgaccctg aactacctgc tgaccaacag 13080
gcggcagaag attcccacgc tgcacagcct gacccaggag gaggagcgca tcttgcgcta 13140
cgtgcagcag agcgtgagcc tgaacctgat gcgcgacggc gtgacgccca gcgtggcgct 13200
ggacatgacc gcgcgcaaca tggaaccggg catgtacgct tcccagcggc cgttcatcaa 13260
ccgcctgatg gactacttgc atcgggcggc ggccgtgaac cccgagtact tcaccaatgc 13320
cattctgaat ccccactgga tgccccctcc gggtttctac aacggggact tcgaggtgcc 13380
tgaggtcaac gatgggttcc tctgggatga catggatgac agtgtgttct cccccaaccc 13440
gctgcgcgcc gcgtctctgc gattgaagga gggctctgac agggaaggac caaggagtct 13500
ggcctcctcc ctggctctgg gggcggtggg cgccacgggc gcggcggcgc ggggcagcag 13560
ccccttcccc agcctggcgg actctctgaa tagcgggcgg gtgagcaggc cccgcttgct 13620
aggcgaggag gagtatctga acaactccct gctgcagccc gtgagggaca aaaacgctca 13680
gcggcagcag tttcccaaca atgggataga gagcctggtg gacaagatgt ccagatggaa 13740
gacgtatgcg caggagtaca aggagtggga ggaccgccag ccgcggcccc tgccgccccc 13800
tagacagcgc tggcagcggc gcgcgtccaa ccgccgctgg aggcaggggc ccgaggacga 13860
tgatgactct gcagatgaca gcagcgtgtt ggacctgggc gggagcggga accccttttc 13920
gcacctgcgc ccacgcctgg gcaagatgtt ttaaaagaga aaaataaaaa ctcaccaagg 13980
ccatggcgac gagcgttggt tttttgttcc cttccttagt atgcggcgcg cggcgatgtt 14040
cgaggagggg cctcccccct cttacgagag cgcgatggga atttctcctg cggcgcccct 14100
gcagcctccc tacgtgcctc ctcggtacct gcaacctaca ggggggagaa atagcatctg 14160
ttactctgag ctgcagcccc tgtacgatac caccagactg tacctggtgg acaacaagtc 14220
cgcggacgtg gcctccctga actaccagaa cgaccacagc gattttttga ccacggtgat 14280
ccaaaacaac gacttcaccc caaccgaggc cagtacccag accataaacc tggacaacag 14340
gtcgaactgg ggcggcgacc tgaagactat cctgcacacc aatatgccca acgtgaacga 14400
gttcatgttc accaactctt ttaaggcgcg ggtgatggtg gcgcgcgagc agggggaggc 14460
gaagtacgag tgggtggact tcacgctgcc cgagggcaac tactcagaga ccatgactct 14520
cgacctgatg aacaatgcga tcgtggaaca ctatctgaaa gtgggcaggc agaacggggt 14580
gaaggagagc gatatcgggg tcaagtttga caccagaaac ttccgtctgg gctgggaccc 14640
tgtgaccggg ctggtcatgc cgggggtcta caccaacgag gcctttcatc ccgatatagt 14700
gctcctgccc ggctgtgggg tggacttcac ccagagccgg ctgagcaacc tgctgggcgt 14760
tcgcaagcgg caacctttcc aggagggttt caagatcacc tatgaggatc tggagggggg 14820
caacattccc gcgctccttg atctggacgc ctacgaggag agcttgaaac ccgaggagag 14880
cgctggcgac agcggcgaga gtggcgagga gcaagccggc ggcggcggca gcgcgtcggt 14940
agaaaacgaa agtactcccg cagtggcggc ggacgctgcg gaggtcgagc cggaggccat 15000
gcagcaggac gcagaggagg gcgcgcagga ggacatgaac aatggggaga tcaggggcga 15060
cactttcgcc acccggggcg aagaaaaaga ggcagaggcg gcggcggcga cggcggaagc 15120
cgaaaccgag gcagaggcag agcccgagac cgaagttatg gaagacatga atgatggaga 15180
acgtaggggt gacacgtttg ccacccgggg cgaagagaag gcggcggagg cagaagccgc 15240
ggctgaggag gcggctgcgg ctgcggccaa ggctgaggct gcggctgagg ctaaggtcga 15300
agccgatgtt gcggttgagg ctcaggctga ggaggaggcg gcggctgaag cagttaagga 15360
aaaggcccag gcagagcagg aagagaaaaa acctgtcatt caacctctaa aagaagatag 15420
caaaaagcgc agttacaacg tcattgaggg cagcaccttt acccaatacc gcagctggta 15480
cctggcttac aactacggcg acccggtcaa gggggtgcgc tcgtggaccc tgctctgcac 15540
gccggacgtc acctgcggct ccgagcagat gtactggtcg ctgccaaaca tgatgcaaga 15600
cccggtgacc ttccgttcca cgcggcaggt tagcaacttt ccggtggtgg gcgccgaact 15660
gctgccagta cactccaaga gtttttacaa cgagcaggcc gtctactccc agctgatccg 15720
ccaggccacc tctctgaccc acgtgttcaa tcgctttccc gagaaccaga ttttggcgcg 15780
cccgccggcc cccaccatca ccaccgtcag tgaaaacgtt cctgccctca cagatcacgg 15840
gacgctaccg ctgcgcaaca gcatctcagg agtccagcga gtgaccatta ctgacgccag 15900
acgccggacc tgcccctacg tttacaaggc cttgggcata gtctcgccgc gcgtcctctc 15960
cagtcgcact ttttaaaaca catccaccca cacgctccaa aatcatgtcc gtactcatct 16020
cgcccagcaa caacaccggc tgggggctgc gcgcacccag caagatgttt ggaggggcaa 16080
ggaagcgctc cgaccagcac cccgtgcgcg tgcgcggcca ctaccgcgcg ccctggggtg 16140
cgcacaagcg cgggcgcaca gggcgcacca ctgtggatga tgtcattgac tccgtagtgg 16200
agcaggcgcg ccactacaca cccggcgcgc cgaccgcctc cgccgtgtcc accgtggacc 16260
aggcgatcga aagcgtggta cagggggcgc ggcactatgc caaccttaaa agtcgccgcc 16320
gccgcgtggc gcgccgccat cgccggagac cccgggctac tgccgccgcg cgccttacca 16380
aggctctgct caagcgcgcc aggcgaactg gccaccgggc cgccatgagg gccgcacggc 16440
gggctgccgc tgccgcgagc gccgtggccc cgcgggcacg aaggcgcgcg gccgctgccg 16500
ccgccgccgc catttccagc ttggcctcga cgcggcgcgg taacatatac tgggtgcgcg 16560
actcggtgag cggcacacgt gtgcccgtgc gctttcgccc cccacggaat tagcacaaga 16620
caacatacac actgagtctc ctgctgttgt gtatcccagc ggcgaccgtc agcagcggcg 16680
acatgtccaa gcgcaaaatt aaagaagaga tgctccaggt catcgcgccg gagatctatg 16740
ggcccccgaa gaaggaggag gaggattaca agccccgcaa gctaaagcgg gtcaaaaaga 16800
aaaagaaaga tgatgacgtt gacgaggcgg tggagtttgt ccgccgcatg gcgcccaggc 16860
gccctgtgca gtggaagggt cggcgcgtgc agcgagtcct gcgccccggc accgcggtgg 16920
tctttacgcc cggcgagcgt tccacgcgca ctttcaagcg ggtgtacgat gaggtgtacg 16980
gcgacgagga tctgttggag caggccaacc atcgatttgg ggagtttgca tatgggaaac 17040
ggcctcgcga gagtctaaaa gaggacctgc tggcgctacc gctggacgag ggcaatccca 17100
ccccgagtct gaagccggtg accctgcaac aggtgctgcc tttgagcgcg cccagcgagc 17160
agaagcgagg gttaaagcgc gagggcgggg acctggcacc caccgtgcag ttgatggtgc 17220
ccaagcggca gaagctggag gacgtgctgg agaaaatgaa agtagagccc gggatccagc 17280
ccgagatcaa ggtccgccct atcaagcagg tggcgcccgg cgtgggagtc cagaccgtgg 17340
acgttaggat tcccacggag gagatggaaa cccaaaccgc cactccctct tcggcagcaa 17400
gcgccaccac cggcgccgct tcggtagagg tgcagacgga cccctggcta cccgccgcca 17460
ctatcgccgt cgccgccgcc ccccgttcgc gcggacgcaa gagaaattat ccagcggcca 17520
gcgcgcttat gccccagtat gcgctgcatc catccatcgc gcccaccccc ggctaccgcg 17580
ggtactcgta ccgcccgcgc agatcagccg gcactcgcgg ccgccgccgc cgtgcgacca 17640
caaccagccg ccgccgtcgc cgccgccgcc agccagtgct gacccccgtg tctgtaagga 17700
aggtggctcg ctcggggagc acgctggtgg tgcccagagc gcgctaccac cccagcatcg 17760
tttaaagccg gtctctgtat ggttcttgca gatatggccc tcacttgtcg ccttcgcttc 17820
ccggtgccgg gataccgagg aagaactcac cgccgcaggg gcatggcggg cagcggtctc 17880
cgcggcggcc gtcgccatcg ccggcgcgca aagagcaggc gcatgcgcgg cggtgtgttg 17940
cccctgctgg tcccgctact cgccgcggcg atcggcgccg tgcccgggat cgcctccgtg 18000
gccctgcagg cgtcccagaa acattgactc ttgcaacctt gcaagcttgc atttttggag 18060
gaaaaaataa aaaagtctag actctcacgc tcgcttggtc ctgtgactat tttgtagaaa 18120
aaagatggaa gacatcaact ttgcgtcgct ggccccgcgt cacggctcgc gcccgttcat 18180
gggagactgg acagatatcg gcaccagcaa tatgagcggt ggcgccttca gctggggcag 18240
tctgtggagc ggccttaaaa attttggttc caccattaag aactatggca acaaagcgtg 18300
gaacagcagc acgggtcaga tgctgagaga caagttgaaa gagcagaact tccaggagaa 18360
ggtggcgcag ggcctggcct ctggcatcag cggggtggtg gacatagcta accaggccgt 18420
gcagaaaaag ataaacagtc atctggaccc ccgccctcag gtggaggaaa cgcctccagc 18480
catggagacg gtgtctcccg agggcaaagg cgaaaagcgc ccgcggcccg acagggaaga 18540
gaccctggtg tcacacaccg aggagccgcc ctcttacgag gaggcagtca aggccggcct 18600
gcccaccact cgccccatag ctcccatggc caccggtgtg gtgggtcaca ggcaacacac 18660
ccccgcaaca ctagatctgc ccccgccgtc cgagccgact cgccagccaa aggcggtgac 18720
ggtgtccgct ccctccactt ccgccgccaa cagagtgcct ctgcgccgcg ctgcgagcgg 18780
cccccgggcc tcgcgagtca gcggcaactg gcagagcaca ctgaacagca tcgtgggcct 18840
gggagtgagg agtgtgaagc gccgccgttg ctactgaatg agcaagctag ctaacgtgtt 18900
gtatgtgtgt atgcgtccta tgtcgccgcc agaggagctg ttgagccgcc ggcgccgtct 18960
gcactccagc gaatttcaag atggcgaccc catcgatgat gcctcagtgg tcgtacatgc 19020
acatctcggg ccaggacgct tcggagtacc tgagccccgg gctggtgcag ttcgcccgcg 19080
ccacagacac ctacttcaac atgagtaaca agttcaggaa ccccactgtg gcgcccaccc 19140
acgatgtgac cacggaccgg tcgcagcgcc tgacgctgcg gttcatcccc gtggatcggg 19200
aggacaccgc ttactcttac aaggcgcggt tcacgctggc cgtgggcgac aaccgcgtgc 19260
tggacatggc ctccacttac tttgacatcc ggggggtgct ggacaggggc cccactttta 19320
agccctactc gggcactgcc tacaaccccc tggcccccaa gggcgccccc aattcttgtg 19380
agtgggaaca agaggaaaat caggtggtcg ctgcagatga tgaacttgaa gatgaagaag 19440
cgcaagcaca agaggaagcc cctgtgaaaa aaattcatgt atatgctcag gcgcctcttt 19500
ctggcgaaaa gattaccaag gatggtttgc aaataggtac tgaagtcgta ggagatacat 19560
ctaaggacac ttttgcagat aaaacattcc aacccgaacc tcagataggc gagtctcagt 19620
ggaacgaggc tgatgccaca gtagcaggag gtagagtttt gaaaaagact acccctatga 19680
gaccttgcta tggatcctat gccaggccta ccaatgccaa cgggggtcaa ggaattatgg 19740
ttgccaatga acaaggagtg ttggagtcta aagtagaaat gcaatttttc tctaacacca 19800
caacccttaa tgcgcgggat ggaaccggca atcccgaacc aaaggtggtg ttgtacagcg 19860
aagatgtcca cttggaatct cccgatactc atctgtctta caagcccaaa aaggatgatg 19920
ttaatgccaa aatcatgttg ggtcagcaag ccatgcccaa cagacccaac ctcattggat 19980
ttagagataa tttcattggg cttatgtttt acaacagcac cggtaacatg ggagtgctgg 20040
cgggtcaggc ctctcagttg aatgctgtgg tggacttgca ggatagaaac acagaactgt 20100
catatcagct tatgcttgat tcaattgggg atagaaccag atacttctcc atgtggaacc 20160
aggcagtgga tagctatgat ccagatgtca gaattattga aaaccatggg actgaggatg 20220
aactgcccaa ctactgcttc cctttgggcg gcataggagt tactgatact tatcaaggga 20280
taaaaaatac caatggcaat ggtcagtgga ccaaagatga tcagttcgcg gaccgcaacg 20340
aaataggggt gggaaacaac ttcgccatgg agatcaacat ccaggccaac ctttggagaa 20400
acttcctcta tgcaaacgtg gggctctacc tgccagacaa gctcaagtac aaccccacca 20460
acgtggacat ctctgacaac cccaacacct atgactacat gaacaagcgg gtggtggccc 20520
ctggcctggt ggactgcttt gtcaatgtgg gagccaggtg gtccctggac tacatggaca 20580
acgtcaaccc cttcaaccac caccgcaatg cgggtctgcg ctaccgctcc atgatcctgg 20640
gcaacgggcg ctatgtgccc tttcacatcc aggtacccca gaagttcttt gccatcaaga 20700
acctcctgct cctgcccggc tcctacacct acgagtggaa cttcaggaag gatgtgaaca 20760
tggtcctaca gagctctctg ggcaatgacc ttagggtgga tggggccagc atcaagtttg 20820
acagcatcac cctctatgct acatttttcc ccatggccca caacaccgcc tccacgcttg 20880
aggccatgct gagaaacgac accaacgacc agtcctttaa tgactacctc tctggggcca 20940
acatgctcta cccaatccca gccaaggcca ccaacgtgcc catctccatc ccctctcgca 21000
actgggccgc ctttagaggc tgggccttta cccgccttaa gaccaaggag accccctccc 21060
tgggctcggg ttttgatccc tactttgttt actcgggatc catcccctac ctggatggca 21120
ccttctacct caaccacact ttcaagaaga tatccatcat gtatgactcc tccgtcagct 21180
ggccgggcaa cgaccgcttg ctcaccccca atgagttcga ggtcaagcgc gccgtggacg 21240
gcgagggcta caacgtggcc cagtgcaaca tgaccaagga ctggttcctg gtgcagatgc 21300
tggccaacta caacataggc taccagggct tttacatccc agagagctac aaggacagga 21360
tgtactcctt cttcagaaat ttccaaccca tgagccgaca ggtggtggac gagaccaatt 21420
acaaggacta tcaagccatt ggcatcaccc accagcacaa caactcgggt ttcgtgggct 21480
acctggcgcc caccatgcgc gagggtcagg cctaccccgc caacttcccc taccccttga 21540
taggcaagac cgcggtcgac agcgtcaccc agaaaaagtt cctctgcgac cgcaccctct 21600
ggcgcatccc cttctctagc aacttcatgt ccatgggtgc gctcacggac ctgggccaaa 21660
acctgcttta tgccaactct gcccatgcgc tggacatgac ttttgaggtg gaccccatgg 21720
acgagcccac ccttctctat attgtgtttg aagtgttcga cgtggtcaga gtgcaccagc 21780
cgcaccgcgg tgtcatcgag accgtgtacc tgcgtacgcc cttctcagcc ggcaacgcca 21840
ccacctaagg agacagcgcc gccgccgcct gcatgacggg ttccaccgag caagagctca 21900
gggccattgc cagagacctg ggatgcggac cctatttttt gggcacctat gacaaacgct 21960
tcccgggctt tatctcccga gacaagctcg cctgcgccat tgtcaacacg gccgcgcgcg 22020
agaccggggg cgtgcactgg ctggcctttg gctgggaccc gcgctccaaa acttgctacc 22080
tctttgaccc ctttggcttc tccgatcagc gcctcaggca gatttatgag tttgagtacg 22140
aggggctgct gcgccgcagc gcgctcgcct cctcgcccga ccgctgcatc acccttgaga 22200
agtccaccga aaccgtgcag gggccccact cggccgcctg cggtctcttc tgttgcatgt 22260
ttttgcacgc ctttgtgcac tggcctcaga gtcccatgga ttgcaacccc accatgaact 22320
tgctaaaggg agtgcccaac gccatgctcc agagccccca ggtccagccc accctgcgcc 22380
gcaaccagga acagctttac cgcttcctgg agcgccactc cccctacttc cgcagccaca 22440
gcgcgcgcat ccggggggcc acctcttttt gccacttgca agaaaacatg caagacggaa 22500
aatgatgtac agcatgcttt taataaatgt aaagactgtg cactttaatt atacacgggc 22560
tctttctggt tatttattca acaccgccgt cgccatttag aaatcgaaag ggttctgccg 22620
tgcgtcgccg tgcgccacgg gcagagacac gttgcgatac tggaagcggc tcgcccactt 22680
gaactcgggc accaccatgc ggggcagtgg ttcctcgggg aagttctcgc tccacagggt 22740
gcgggtcagc tgcagcgcgc tcaggaggtc gggagccgag atcttgaagt cgcagttggg 22800
gccggaaccc tgcgcgcgcg agttgcggta cacggggttg cagcactgga acaccagcag 22860
ggccggatta ttcacgctgg ccagcaggct ctcgtcgctg atcatgtcgc tgtccagatc 22920
ctccgcgttg ctcagggcga atggggtcat cttgcagacc tgcctgccca ggaaaggcgg 22980
gagcccaggc ttgccgttgc agtcgcagcg caggggcatt agcaggtgcc cacggcccga 23040
ctgcgcctgc gggtacaacg cgcgcatgaa ggcttcgatc tgcctaaaag ccacctgggt 23100
cttggctccc tccgaaaaga acatcccaca ggacttgctg gagaactggt tcgcgggaca 23160
gctggcatcg tgcaggcagc agcgcgcgtc agtgttggca atctgcacca cgttgcgacc 23220
ccaccggttt ttcactatct tggccttgga agcctgctcc tttagcgcgc gctggccgtt 23280
ctcgctggtc acatccatct ctatcacctg ttccttgttg atcatgtttg tcccgtgcag 23340
acactttagg tcgccctccg tctgggtgca gcggtgctcc cacagcgcgc aaccggtggg 23400
ctcccaattc ttgtgggtca cccccgcgta ggcctgcagg taggcctgca ggaagcgccc 23460
catcatggtc ataaaggtct tctggctcgt aaaggtcagc tgcaggccgc gatgctcttc 23520
gttcagccag gtcttgcaga tggcggccag cgcctcggtc tgctcgggca gcatcttaaa 23580
atttgtcttc aggtcgttat ccacgtggta cttgtccatc atggcacgcg ccgcctccat 23640
gcccttctcc caggcggaca ccatgggcag gcttaggggg tttatcactt ccagcggcga 23700
ggacaccgta ctttcgattt cttcttcctc cccctcttcc cggcgcgcgc ccccgctgtt 23760
gcgcgctctt accgcctgca ccaaggggtc gtcttcaggc aagcgccgca ccgagcgctt 23820
gccgcccttg acctgcttga tcagtaccgg cgggttgctg aagcccacca tggtcagcgc 23880
cgcctgctct tcttcgtctt cgctgtctac cactatttct ggggaggggc ttctccgctc 23940
tgcggcaaag gcggcggatc gcttcttttt tttcttggga gccgccgcga tggagtccgc 24000
cacggcgacc gaggtcgagg gcgtggggct gggggtgcgc ggtaccaggg cctcgtcgcc 24060
ctcggactct tcctctgact ccaggcggcg gcggagtcgc ttctttgggg gcgcgcgcgt 24120
cagcggcggc ggagacgggg acggggacgg ggacgggacg ccctccacag ggggtggtct 24180
tcgcgcagac ccgcggccgc gctcgggggt cttctcgcgc tggtcttggt cccgactggc 24240
cattgtatcc tcctcctcct aggcagagag acataaggag tctatcatgc aagtcgagaa 24300
ggaggagagc ttaaccaccc cctcagagac cgccgatgcg cccgccgtcg ccgtcgcccc 24360
cgctaccgcc gacgcgcccg ccacaccgag cgacaccccc acggaccccc ccgccgacgc 24420
acccctgttc gaggaagcgg ccgtggagca ggacccgggc tttgtctcgg cagaggagga 24480
tttgcaagag gaggagaata aggaggagaa gccctcagtg ccaaaagatc ataaagagca 24540
agacgagcac gacgcagacg cacaccaggg tgaagtcggg cggggggacg gagggcatgg 24600
cggcgccgac tacctagacg aaggaaacga cgtgctcttg aagcacctgc atcgtcagtg 24660
cgccatcgtc tgcgacgctc tgcaggagcg cagcgaggtg cccctcagcg tggcggaggt 24720
cagccgcgcc tacgagctca gcctcttttc cccccgggtg cccccccgcc gccgcgaaaa 24780
cggcacatgc gagcccaacc cgcgcctcaa cttctacccc gcctttgtgg tgcccgaggt 24840
cctggccacc tatcacatct tctttcaaaa ttgcaagatc cccatctcgt gccgcgccaa 24900
ccgtagccgc gccgataaga tgctggccct gcgccagggc gaccacatac ctgatatcgc 24960
cgctttggaa gatgtgccaa agatcttcga gggtctgggg cgcaacgaga agcgggcagc 25020
aaactctctg caacaggaaa acagcgaaaa tgagagtcac actggagcgc tggtggagct 25080
ggagggcgac aacgcccgcc tggcggtgct caagcgcagc atcgaggtca cccactttgc 25140
ctaccccgcg ctcaacctgc cccccaaagt catgaacgcg gtcatggacg ggctgatcat 25200
gcgccgcggc cggcccctcg ctccagatgc aaacttgcat gaggagaccg aggacggtca 25260
gcccgtggtc agcgacgagc agctgacgcg ctggctggag agcgcggacc ccgccgaact 25320
ggaggagcgg cgcaagatga tgatggccgc ggtgctggtc accgtagagc tggagtgtct 25380
gcagcgcttc ttcggtgacc ccgagatgca gagaaaggtc gaggagaccc tacactacac 25440
cttccgccag ggctacgtgc gccaggcttg caagatctcc aacgtggagc tcagcaacct 25500
ggtgtcctac ctgggcatct tgcatgaaaa ccgccttggg cagagcgtgc tacactccac 25560
cctgcgcggg gaggcgcgcc gcgactacgt gcgcgactgc gtttacctct tcctctgcta 25620
cacctggcag acggccatgg gggtctggca gcagtgcctg gaggagcgca acctcaagga 25680
gctggagaag cttctgcagc gcgcgctcaa agacctctgg acgggcttca acgagcgctc 25740
ggtggccgcc gcgctagccg acctcatctt ccccgagcgc ctgctcaaaa ccctccagca 25800
ggggctgccc gacttcacca gccaaagcat gttgcaaaat tttaggaact ttatcctgga 25860
gcgttctggc atcctacccg ccacctgctg cgccctgccc agcgactttg tccccctcgt 25920
gtaccgcgag tgccccccgc cgctgtgggg ccactgctac ctgttccaac tggccaacta 25980
cctgtcctac cacgcggacc tcatggagga ctccagcggc gaggggctca tggagtgcca 26040
ctgccgctgc aacctctgca cgccccaccg ctccctggtc tgcaacaccc aactgctcag 26100
cgagagtcag attatcggta ccttcgagct acagggtccg tcctcctcag acgagaagtc 26160
cgcggctccg gggctaaaac tcactccggg gctgtggact tccgcctacc tgcgcaaatt 26220
tgtacctgaa gactaccacg cccacgaaat caggttttac gaggaccaat cccgcccgcc 26280
caaggcggag ctgaccgcct gcgtcatcac ccagggcgag atcctaggcc aattgcaagc 26340
catccaaaaa gcccgccaag agtttttgct gaagaggggt cggggggtgt atctggaccc 26400
ccagtcgggt gaggagctca acccggttcc cccgctgcca ccgccgcggg accttgcttc 26460
ccaggataag catcgccatg gctcccagaa agaagcagca gcggccgccg ctgccgccgc 26520
cccacatgct ggaggaagag gaggaatact gggacagtca ggcagaggag gtttcggacg 26580
aggaggagcc ggagacggag atggaagagt gggaggagga cagcttagac gaggaggctt 26640
ccgaagccga agaggcaggc gcaacaccgt caccctcggc cgcagccccc tcgcaggcgc 26700
ccccgaagtc cgctcccagc atcagcagca acagcagcgc tataacctcc gctcctccac 26760
cgccgcgacc cacggccgac cgcagaccca accgtagatg ggacaccacc ggaaccgggg 26820
ccggtaagtc ctccgggaga ggcaagcaag cgcagcgcca aggctaccgc tcgtggcgcg 26880
ctcacaagaa cgccatagtc gcttgcttgc aagactgcgg ggggaacatc tccttcgccc 26940
gccgcttcct gctcttccac cacggtgtgg ccttcccccg taacgtcctg cattactacc 27000
gtcatctcta cagcccctac tgcggcggca gtgagccaga ggcggccagc ggcggcggcg 27060
cccgtttcgg tgcctaggaa gacccagggc aagacttcag ccaagaaact cgcggcgacc 27120
gcggcgaacg cggtcgcggg ggccctgcgc ctgacggtga acgaacccct gtcgacccgc 27180
gaactgagga accgaatctt ccccactctc tatgccatct tccagcagag cagagggcag 27240
gatcaggaac tgaaagtaaa aaacaggtct ctgcgctccc tcacccgcag ctgtctgtat 27300
cacaagagcg aagaccagct tcggcgcacg ctggaggacg ctgaggcact cttcagcaaa 27360
tactgcgcgc tcactcttaa ggactagctc cgcgcccttc tcgaatttag gcgggaacgc 27420
ctacgtcatc gcagcgccgc cgtcatgagc aaggacattc ccacgccata catgtggagc 27480
tatcagccgc agatgggact cgcggcgggc gcctcccaag actactccac ccgcatgaac 27540
tggctcagtg ccggcccaca catgatctca caggttaatg acatccgcac ccatcgaaac 27600
caaatattgg tgaagcaggc ggcaattacc accacgcccc gcaataatcc caaccccagg 27660
gagtggcccg cgtccctggt gtatcaggaa attcccggcc ccaccaccgt actacttccg 27720
cgtgattccc aggccgaagt ccaaatgact aactcagggg cacagctcgc gggcggctgt 27780
cgtcacaggg tgcggcctcc tcgccagggt ataactcacc tggagatccg aggcagaggt 27840
attcagctca acgacgagtc ggtgagctcc tcgctcggtc tcagacctga cgggaccttc 27900
cagatagccg gagccggccg atcttccttc acgccccgcc aggcgtacct gactctgcag 27960
agctcgtcct cggcgccgcg ctcgggcggc atcgggactc tccagttcgt gcaggagttt 28020
gtgccctcgg tctacttcaa ccccttctcg ggctctcccg gtcgctaccc ggaccagttt 28080
atcccgaact ttgacgccgc gagggactcg gtggacggct acgactgaat gtcgggtgga 28140
cccggtgcag agcaacttcg cctgaagcac cttgaccact gccgccgccc tcagtgcttt 28200
gcccgctgtc agaccggtga gttccagtac ttttccctgc ccgactcgca cccggacggc 28260
ccggcgcacg gggtgcgctt tttcatcccg agtcaggtcc gctctaccct aatcagggag 28320
ttcaccgccc gtcccctact ggcggagttg gaaaaggggc cttctatcct aaccattgcc 28380
tgcatttgct ctaaccctgg attacaccaa gatctttgct gtcatttgtg tgctgagtat 28440
aataaaggct gagatcagaa tctactcggg ctcctgtcgc catcctgtca acgccaccgt 28500
ccaagcccgg cccgatcagc ccgaggtgaa cctcacctgt ggtctgcacc ggcgcctgag 28560
gaaataccta gcttggtact acaacagcac tccctttgtg gtttacaaca gctttgacca 28620
ggacggggtc tcactgaggg ataacctctc gaacctgagc tactccatca ggaagaacaa 28680
caccctcgag ctacttcctc cttacctgcc cgggacttac cagtgtgtca ccggcccctg 28740
cacccacacc cacctgttga tcgtaaacga ctctcttccg agaacagacc tcaataactc 28800
ctctccgcag ttccccagaa caggaggtga gctcaggaaa ccccgggtaa agaagggtgg 28860
acaagagtta acacttgtgg ggtttctggt atatgtgacg ctggtggtgg ctcttttgat 28920
taaggctttt ccttccatgt ctgaactatc cctcttcttt tatgaacaac tcgactagtg 28980
ctaacgggac cctacccaac gaatcgggat tgaatatcgg taaccaggtt gcagtttcac 29040
ttttgattac cttcatagtc ctcttcctgc tagtgctgtc gcttctgtgc ctgcggatcg 29100
ggggctgctg catccacgtt tatatctggt gctggctgtt tagaaggttc ggagaccacc 29160
gcaggtagaa taatgctgct taccctcttt gtcctggcgc tggctgccag ctgccaagcc 29220
ttttccgagg ctgacttcat agagccccag tgcaatatca cttataaatc tgaacgtgcc 29280
atctgtacta ttctaatcaa atgtgttact caacacgata aggtgactgt taaatacaaa 29340
gatcaattaa aaaaagacgc actttacagc agctggcaac caggagatga tcaaaaatac 29400
aatgtaaccg tcttccaggg caaactctcc aaaacttaca attacaattt cccatttgag 29460
cagatgtgtg actttgtcat gtacatggaa aagcagtaca agctgtggcc tccaactccc 29520
cagggctgtg tggaaaatcc aggctctttc tgtatgatct ctctctgtgt aactgtgctg 29580
gcactaatac tcacgcttct gtatctcaga tttaaatcaa ggcaaagctt cattgatgaa 29640
aagaaaatgc cataatcgct caacgcttga ttgctaacac cgggttttta tccgcagaat 29700
gattggaatc accctactaa tcacctccct ccttgcgatt gcccatgggt tggaacgaat 29760
cgaagtccct gtgggggcca atgttaccct ggtggggcct gtcggcaatg ctacattaat 29820
gtgggaaaaa tatactaaaa atcaatgggt ttcttactgc actaacaaaa acagccacaa 29880
gcccagagcc atctgcgatg ggcaaaatct aaccttgatt gatgttcaat tgctggatgc 29940
gggctactat tatgggcagc tgggtacaat gattaattac tggagacccc acagagatta 30000
catgcttcac gtagtaaagg gtcccattag cagcccaacc accacctcta ccacacccac 30060
taccaccact actcccacca ccagcactgc cgcccagcct cctcatagca gaacaaccac 30120
ttttatcaat tccaagtccc actcccccca cattgccggc gggccctccg cctcagactc 30180
cgagaccacc gagatctgct tctgcaaatg ctctgacgcc attgcccagg atttggaaga 30240
tcacgaggaa gatgagcatg actacgcaga tgcatgccag gcatcagagg cagaagcgct 30300
accggtggcc ctaaaacagt atgcagactc ccacaccacc cccaaccttc ctccaccttc 30360
ccagaagcca agtttcctgg gggaaaatga aactctgcct ctttccatac tagctctgac 30420
atctgttgct attttggccg ctctgctggt gcttctatgc tctatatgct acctgatctg 30480
ctgcagaaag aaaaaatctc acggccatgc tcaccagccc ctcatgcact tcccttaccc 30540
tccagagctg ggcgaccaca aactttaagt ctgcagtagc tatctgccca tcccttgtca 30600
gtcgacagcg atgagcccca ctaatctaac agcctctgga cttacaacat tgtctcttaa 30660
tgagaccacc gctcctcaag acctgtacga tggtgtctcc gcgctggtta accagtggga 30720
tcacctgggc atatggtggc tcctcatagg agcagtgacc ctgtgcctaa tcctggtctg 30780
gatcatctgc tgcatcaaaa gcagaagacc caggcggcgg cccatctaca ggcccttcgt 30840
catcacacct gaagataatg atgatgatga caccacctcc aggctgcaga gcctaaagca 30900
gctactcttc tcttttacag catggtaaat tgaatcatgc cccgcatttt catctacttg 30960
cttctccttc cactttttct gggctcctct acattggcca ctgtgtccca catcgaggta 31020
gactgcctca cgcccttcac agtctacctg cttttcggct ttgtcatctg cacctttgtc 31080
tgcagcgtta tcactgtagt gatctgcttc atacagtgca tcgactacat ctgtgtgcgg 31140
gtggcctact ttagacacca cccccagtat cgcaacaggg acatagcggc tctcctaaga 31200
cttgtttaaa tcatggccaa attacctgtg attggtcttc tgattatctg ctgcgtccta 31260
gccgcgattg ggactcaacc taataccacc accagcgctc ccagaaagag acatgtatcc 31320
tgcagcttca agcgtccctg gaatataccc caatgcttta ctgatgaacc tgaaatctct 31380
ttggcttggt acttcagcgt caccgccctt ctcatcttct gcagtacggt tattgctctt 31440
gccatctacc cttcccttaa cctgggctgg aatgctgtca actctatgga atatcccacc 31500
ttcccagaac cagacctgcc agacctggtt gttctaaacg cgtttcctcc tcctccagtt 31560
caaaatcagt ttcgccctcc gtcccctacg cccactgagg tcagctactt taatctaaca 31620
ggcggagatg actgaaaacc tagacctaga aatggacggt ctctgcagcg agcaacgcac 31680
actagagagg cgccggcaaa aagcagagct cgagcgtctt aaacaagagc tccaagacgc 31740
cgtggccata caccagtgca aaaaagggct cttctgtctg gtaaaacagg ccacgctcac 31800
ctatgaaaaa acaggtgaca cccaccgcct aggatacaag ctgcccacac agcgccaaaa 31860
gtttgccctt atgataggtg aacaacccat caccgtcacc cagcactccg tggagacaga 31920
aggctgcatt catgctccct gcaggggcgc tgactgcctc tacaccttga tcaaaaccct 31980
ctgcggtctc agagacctta tccctttcaa ttgatcataa ctgtaatcaa taaaaaatca 32040
cttacttgaa atctgatagc aagactctgt ccaatttttt cagcaacact tccttcccct 32100
cctcccaact ctggtactct aggcgcctcc tagctgcaaa cttcctccac agtctgaagg 32160
gaatgtcaga ttcctcctcc tgtccctccg cacccacgat cttcatgttg ttacagatga 32220
aacgcgcgag atcgtctgac gagaccttca accccgtgta cccctacgat accgagatcg 32280
ctccgacttc tgtccctttc cttacccctc cctttgtatc atccgcagga atgcaagaaa 32340
atccagctgg ggtgctgtcc ctgcacctgt cagagcccct taccacccac aatggggccc 32400
tgactctaaa aatggggggc ggcctgaccc tggacaagga agggaatctc acttcccaaa 32460
acatcaccag tgtcgatccc cctctcaaaa aaagcaagaa caacatcagc cttcagaccg 32520
ccgcacccct cgccgtcagc tccggggccc taaccctttt tgccactccc cccctagcgg 32580
tcagtggcga caaccttact gtgcagtctc aggcccctct tactttggaa gactcaaaac 32640
taactctggc caccaaagga cccctaactg tgtccgaagg caaacttgtc ctagaaacag 32700
agcctcccct gcatgcaagt gacagcagta gcctgggcct tagcgtcacg gccccactta 32760
gcattaacaa tgacagccta ggactagaca tgcaagcgcc catcagctct cgagatggaa 32820
aactggctct aacagtggcg gcccccctaa ctgtggccga gggtatcaat gctttggcag 32880
tagccacagg taatggtatt ggactaaatg aaaccaacac acacctgcag gcaaaactgg 32940
tcgcgcccct aggctttgat accaacggca acattaagct aagcgtcgca ggaggcatga 33000
ggctaaacaa taacacactg atactagatg taaactaccc atttgaggct caaggccaac 33060
tgagcctaag agtgggctcg ggcccactat atgtagattc tagtagtcat aacctaacca 33120
ttagatgcct taggggattg tatgtaacat cttctaacaa ccaaaacggt ctagaggcca 33180
acattaaact aacaaaaggc cttgtgtatg acggaaatgc catagcagtt aatgttggca 33240
aagggctgga atacagccct actggcacaa cagaaaaacc tatacagact aaaataggtc 33300
taggcatgga gtatgacact gagggagcca tgatgacaaa actaggctct ggactaagct 33360
ttgacaattc aggagccatt gtggtgggaa acaaaaatga tgacaggctt actttgtgga 33420
ccacaccgga cccatcgccc aactgtcaga tttactctga aaaagatgct aaactaacct 33480
tggtactgac taaatgtggc agtcaggttg taggcacagt atctattgcc gctcttaaag 33540
gtagccttgt gccaatcact agtgcaatca gtgtggttca gatataccta aggtttgatg 33600
aaaatggggt gctgatgagt aactcttcac ttaatggcga atactggaat tttagaaacg 33660
gagactcaac taatggcaca ccatatacaa acgcagtggg ttttatgcct aatctactgg 33720
cctatcctaa aggtcaaact acaactgcaa aaagtaacat tgtcagccag gtctacatga 33780
acggggacga tactaaaccc atgacattta caatcaactt caatggcctt agtgaaacag 33840
gggatacccc tgtcagtaaa tattccatga cattctcatg gaggtggcca aatggaagct 33900
acatagggca caattttgta acaaactcct ttactttctc ctacatcgcc caagaataaa 33960
gaaagcacag agatgcttgt ttttgatttc aaaattgtgt gcttttattt attttcaagc 34020
ttacagtatt tccagtagtc attagaatag agcttaatta aactgcatga gaacccttcc 34080
acatagctta aattatcacc agtgcaaatg gaaaaaaatc aacatacctt tttatccaga 34140
tatcaaagaa ctctagtggt cagttttccc ccaccctccc agctcacaga atacacagtc 34200
ctttcccccc ggctggcttt aaacaacact atctcattgg taacagacat atttttaggt 34260
gtaataatcc acacggtctc ttggcgggcc aaacgctggt ctgtgatgtt aataaactcc 34320
ccaggcagct ctttcaagtt cacgtcgctg tccaactgct gaagcgctcg cggctccgac 34380
tgcgcctcta gcggaggcaa cggcagcacc cgatccttga tctataaagg agtagagtca 34440
taatccccca taagaatagg gcggtgatgc agcaacaagg cgcgcagcaa ctcctgccgc 34500
cgcctctccg tacgacagga atgcaacggg gtggtggtct cctccgcgat aatccgcacc 34560
gctcgcagca tcagcatcct cgtcctccgg gcacagcagc gcatcctgat ctcactgaga 34620
tcggcgcagt aagtgcagca caacaccaag atgttattta agatcccaca gtgcaaagca 34680
ctgtacccaa agctcatggc gggaaggaca gcccccacgt gaccatcgta ccagatcctc 34740
aggtaaatca aatgacgacc tctcataaac acgctggaca tatacatcac ctccttgggc 34800
atgagctgat tcaccacctc tcgataccac aggcatcgct gattaattaa agacccctcg 34860
agcaccatcc tgaaccagga agccagcacc tgaccccccg ccaggcactg cagggacccc 34920
ggtgaatcgc agtggcagtg aagactccag cgctcgtagc cgtgaaccat agagctggtc 34980
attatatcca cattggcaca acacagacac actttcatac actttttcat gattagcagc 35040
tcctctctag tcaagaccat atcccaagga atcacccact cttgaatcaa ggtaaatccc 35100
acacagcagg gcaggcctct cacataactc acgttatgca tagtgagcgt gtcgcaatct 35160
ggaaataccg gatgatcttc catcaccgaa gcccgggtct ccgtctcaaa gggaggtaaa 35220
cggtccctcg tgtagggaca gtggcgggat aatcgagatc gtgttgaacg tagagtcatg 35280
ccaaagggaa cagcggacgt actcatattt cctccagcag aaccaagtgc gcgcgtggca 35340
gctatccctg cgtcttctgt ctcgccgcct gccccgctcg gtgtagtagt tgtaatacag 35400
ccactccctc agaccgtcaa ggcgctccct ggcgtccgga tctataacaa caccgtcctg 35460
cagcgccgcc ctgatgacat ccaccaccgt agagtatgcc aagcccagcc acgaaatgca 35520
ctcactttga cagcgagaga taggaggagc gggaagagat ggaagaacca tgatagtaaa 35580
agaactttta ttccaatcga tcctctacaa tgtcaaagtg tagatctatc agatggcact 35640
ggtctcctcc gctgagtcga tcaaaaataa cagctaaacc acaaacaaca cgattggtca 35700
aatgctgcac aagggcttgc agcataaaat cgcctcgaaa gtccaccgca agcataacat 35760
caaagccacc gcccctatca tgatctatga taaaaacccc acagctatcc accagaccca 35820
tatagttttc atctctccat cgtgaaaaaa tatttacaag ctcctccttt aaatcacctc 35880
caaccaattc aaaaagttga gccagaccgc cctccacctt cattttcagc atgcgcatca 35940
tgattgcaaa aattcaggct cctcagacac ctgtataaga ttgagaagcg gaacgttaac 36000
atcaatgttt cgctcgcgaa gatcgcgcct cagtgcaagc atgatataat cccacaggtc 36060
ggagcggatc agcgaggaca tctccccgcc aggaaccaac tcaacggagc ctatgctgat 36120
tataatacgc atattcgggg ctatgctaac cagcacggcc cccaaatagg cgtactgcat 36180
aggcggcgac aaaaagtgaa cagtttgggt taaaaaatca ggcaaacact cgcgcaaaaa 36240
agcaagaaca tcataaccat gctcatgcaa atagatgcaa gtaagctcag gaacgaccac 36300
agaaaaatgc acaatttttc tctcaaacat gactgcgagc cctgcaaaaa ataaaaaaga 36360
aacattacac aagagtagcc tgtcttacaa tgggatagac tactctaacc aacataagac 36420
gggccacgac atcgcccgcg tggccataaa aaaaattatc cgtgtgatta aaaagaagca 36480
cagatagctg gccagtcata tccggagtca tcacgtgcga acccgtgtag acccccgggt 36540
tggacacatc ggccaaacaa agaaagcggc caatgtatcc cggaggaatg ataacactaa 36600
gacgaagata caacagaata accccatggg ggggaataac aaagttagta ggtgaataaa 36660
aacgataaac acccgaaact ccctcctgcg taggcaaaat agcgccctcc ccttccaaaa 36720
caacatacag cgcttccaca gcagccatga caaaagactc aaaacactca aaagactcag 36780
tcttaccagg aaaataaaag cactctcaca gcaccagcac taatcagagt gtgaagaggg 36840
ccaagtgccg aacgagtata tataggaatt aaaaatgacg taaatgtgta aaggtcaaaa 36900
aacgcccaga aaaatacaca gaccaacgcc cgaaacgaaa acccgcgaaa aaatacccag 36960
aagttcctca acaaccgcca cttccgcttt cccacgatac gtcacttcct caaaaatagc 37020
aaactacatt tcccacatgt acaaaaccaa aacccctccc cttgtcaccg cccacaactt 37080
acataatcac aaacgtcaaa gcctacgtca cccgccccgc ctcgccccgc ccacctcatt 37140
atcatattgg cctcaatcca aaataaggta tattattgat gatg 37184
<210> 11
<211> 29
<212> PRT
<213> Great Ape Adenovirus
<400> 11
Glu Gln Glu Glu Asn Gln Val Val Ala Ala Asp Asp Glu Leu Glu Asp
1 5 10 15
Glu Glu Ala Gln Ala Gln Glu Glu Ala Pro Val Lys Lys
20 25
<210> 12
<211> 15
<212> PRT
<213> Great Ape Adenovirus
<400> 12
Ile Gln Ile Gly Thr Glu Val Val Gly Asp Thr Ser Lys Asp Thr
1 5 10 15
<210> 13
<211> 7
<212> PRT
<213> Great Ape Adenovirus
<400> 13
Asn Glu Ala Asp Ala Thr Ala
1 5
<210> 14
<211> 12
<212> PRT
<213> Great Ape Adenovirus
<400> 14
Met Val Ala Asn Glu Gln Gly Val Leu Glu Ser Lys
1 5 10
<210> 15
<211> 15
<212> PRT
<213> Great Ape Adenovirus
<400> 15
Asn Thr Thr Thr Leu Asn Ala Arg Asp Gly Thr Gly Asn Pro Glu
1 5 10 15
<210> 16
<211> 9
<212> PRT
<213> Great Ape Adenovirus
<400> 16
Lys Lys Asp Asp Val Asn Ala Lys Ile
1 5
<210> 17
<211> 26
<212> PRT
<213> Great Ape Adenovirus
<400> 17
Val Thr Asp Thr Tyr Gln Gly Ile Lys Asn Thr Asn Gly Asn Gly Gln
1 5 10 15
Trp Thr Lys Asp Asp Gln Phe Ala Asp Arg
20 25
<210> 18
<211> 29
<212> PRT
<213> Great Ape Adenovirus
<400> 18
Glu Gln Glu Glu Asn Gln Val Glu Ala Ala Asp Glu Asp Val Glu Asp
1 5 10 15
Glu Glu Ala Gln Ala Gln Glu Glu Ala Pro Ala Lys Lys
20 25
<210> 19
<211> 15
<212> PRT
<213> Great Ape Adenovirus
<400> 19
Leu Gln Ile Gly Thr Glu Val Val Gly Glu Thr Ser Lys Asp Thr
1 5 10 15
<210> 20
<211> 7
<212> PRT
<213> Great Ape Adenovirus
<400> 20
Asn Glu Ala Asp Ala Ala Val
1 5
<210> 21
<211> 12
<212> PRT
<213> Great Ape Adenovirus
<400> 21
Leu Val Ala Asn Glu Gln Gly Val Met Glu Ser Lys
1 5 10
<210> 22
<211> 15
<212> PRT
<213> Great Ape Adenovirus
<400> 22
Asn Thr Ser Thr Leu Asn Ala Arg Asp Gly Thr Gly Asn Pro Glu
1 5 10 15
<210> 23
<211> 9
<212> PRT
<213> Great Ape Adenovirus
<400> 23
Lys Lys Asp Asp Val Asn Ala Lys Val
1 5
<210> 24
<211> 26
<212> PRT
<213> Great Ape Adenovirus
<400> 24
Ile Thr Asp Thr Tyr Gln Gly Val Lys Asn Thr Asn Gly Asn Gly Gln
1 5 10 15
Trp Thr Lys Asp Asp Gln Phe Ala Asp Arg
20 25
<210> 25
<211> 30
<212> PRT
<213> Great Ape Adenovirus
<400> 25
Glu Gln Glu Glu Asn Gln Val Val Ala Ala Asp Glu Asp Leu Glu Glu
1 5 10 15
Asp Glu Glu Ala Gln Ala Glu Glu Gln Ala Pro Ala Lys Lys
20 25 30
<210> 26
<211> 15
<212> PRT
<213> Great Ape Adenovirus
<400> 26
Leu Gln Ile Gly Thr Glu Val Val Gly Asp Thr Ser Lys Asp Thr
1 5 10 15
<210> 27
<211> 7
<212> PRT
<213> Great Ape Adenovirus
<400> 27
Asn Glu Ala Asp Ala Thr Ala
1 5
<210> 28
<211> 12
<212> PRT
<213> Great Ape Adenovirus
<400> 28
Met Val Ala Asn Glu Gln Gly Val Leu Gln Ser Lys
1 5 10
<210> 29
<211> 15
<212> PRT
<213> Great Ape Adenovirus
<400> 29
Asn Thr Ser Thr Leu Asn Ala Arg Asp Gly Thr Gly Asn Pro Glu
1 5 10 15
<210> 30
<211> 9
<212> PRT
<213> Great Ape Adenovirus
<400> 30
Lys Lys Asp Asp Val Asn Ala Lys Val
1 5
<210> 31
<211> 26
<212> PRT
<213> Great Ape Adenovirus
<400> 31
Ile Thr Asp Thr Tyr Gln Gly Val Lys Asn Ser Asn Gly Asn Gly Gln
1 5 10 15
Trp Thr Lys Asp Asp Gln Phe Ala Asp Arg
20 25
<210> 32
<211> 31
<212> PRT
<213> Great Ape Adenovirus
<400> 32
Glu Gln Glu Glu Thr Gln Ala Ala Glu Glu Ala Val Asp Glu Glu Asp
1 5 10 15
Ala Glu Asp Glu Ala Gln Pro Gln Glu Glu Ala Pro Ala Lys Lys
20 25 30
<210> 33
<211> 15
<212> PRT
<213> Great Ape Adenovirus
<400> 33
Leu Gln Ile Gly Thr Glu Val Val Gly Asp Thr Ser Lys Asp Thr
1 5 10 15
<210> 34
<211> 7
<212> PRT
<213> Great Ape Adenovirus
<400> 34
Asn Glu Ala Asp Ala Ala Val
1 5
<210> 35
<211> 12
<212> PRT
<213> Great Ape Adenovirus
<400> 35
Met Val Ala Asn Glu Lys Gly Val Leu Gln Ser Lys
1 5 10
<210> 36
<211> 15
<212> PRT
<213> Great Ape Adenovirus
<400> 36
Asn Thr Ser Thr Leu Asn Ala Arg Asp Gly Thr Gly Asn Pro Glu
1 5 10 15
<210> 37
<211> 9
<212> PRT
<213> Great Ape Adenovirus
<400> 37
Thr Lys Asp Asp Val Asn Ala Lys Val
1 5
<210> 38
<211> 26
<212> PRT
<213> Great Ape Adenovirus
<400> 38
Ile Thr Asp Thr Tyr Gln Gly Val Lys Asn Thr Asn Gly Asn Gly Gln
1 5 10 15
Trp Thr Lys Asp Asp Gln Phe Ala Asp Arg
20 25
<210> 39
<211> 29
<212> PRT
<213> Great Ape Adenovirus
<400> 39
Glu Gln Glu Glu Asn Gln Val Val Ala Ala Asp Asp Glu Leu Glu Asp
1 5 10 15
Glu Glu Ala Gln Ala Gln Glu Glu Ala Pro Val Lys Lys
20 25
<210> 40
<211> 15
<212> PRT
<213> Great Ape Adenovirus
<400> 40
Leu Gln Ile Gly Thr Glu Val Val Gly Asp Thr Ser Lys Asp Thr
1 5 10 15
<210> 41
<211> 7
<212> PRT
<213> Great Ape Adenovirus
<400> 41
Asn Glu Ala Asp Ala Thr Val
1 5
<210> 42
<211> 12
<212> PRT
<213> Great Ape Adenovirus
<400> 42
Met Val Ala Asn Glu Gln Gly Val Leu Glu Ser Lys
1 5 10
<210> 43
<211> 15
<212> PRT
<213> Great Ape Adenovirus
<400> 43
Asn Thr Thr Thr Leu Asn Ala Arg Asp Gly Thr Gly Asn Pro Glu
1 5 10 15
<210> 44
<211> 9
<212> PRT
<213> Great Ape Adenovirus
<400> 44
Lys Lys Asp Asp Val Asn Ala Lys Ile
1 5
<210> 45
<211> 26
<212> PRT
<213> Great Ape Adenovirus
<400> 45
Val Thr Asp Thr Tyr Gln Gly Ile Lys Asn Thr Asn Gly Asn Gly Gln
1 5 10 15
Trp Thr Lys Asp Asp Gln Phe Ala Asp Arg
20 25
<210> 46
<211> 955
<212> PRT
<213> Great Ape Adenovirus
<400> 46
Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ser
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Asn Met Ser Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Ile Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr
65 70 75 80
Lys Ala Arg Phe Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Thr
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Pro Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Ser Cys Glu Trp Glu Gln Glu Glu Asn Gln Val Val Ala
130 135 140
Ala Asp Asp Glu Leu Glu Asp Glu Glu Ala Gln Ala Gln Glu Glu Ala
145 150 155 160
Pro Val Lys Lys Ile His Val Tyr Ala Gln Ala Pro Leu Ser Gly Glu
165 170 175
Lys Ile Ser Lys Asp Gly Ile Gln Ile Gly Thr Glu Val Val Gly Asp
180 185 190
Thr Ser Lys Asp Thr Phe Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln
195 200 205
Ile Gly Glu Ser Gln Trp Asn Glu Ala Asp Ala Thr Ala Ala Gly Gly
210 215 220
Arg Val Leu Lys Lys Thr Thr Pro Met Arg Pro Cys Tyr Gly Ser Tyr
225 230 235 240
Ala Arg Pro Thr Asn Ala Asn Gly Gly Gln Gly Ile Met Val Ala Asn
245 250 255
Glu Gln Gly Val Leu Glu Ser Lys Val Glu Met Gln Phe Phe Ser Asn
260 265 270
Thr Thr Thr Leu Asn Ala Arg Asp Gly Thr Gly Asn Pro Glu Pro Lys
275 280 285
Val Val Leu Tyr Ser Glu Asp Val His Leu Glu Ser Pro Asp Thr His
290 295 300
Leu Ser Tyr Lys Pro Lys Lys Asp Asp Val Asn Ala Lys Ile Met Leu
305 310 315 320
Gly Gln Gln Ala Met Pro Asn Arg Pro Asn Leu Ile Gly Phe Arg Asp
325 330 335
Asn Phe Ile Gly Leu Met Phe Tyr Asn Ser Thr Gly Asn Met Gly Val
340 345 350
Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp
355 360 365
Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Ile Gly Asp
370 375 380
Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp
385 390 395 400
Pro Asp Val Arg Ile Ile Glu Asn His Gly Thr Glu Asp Glu Leu Pro
405 410 415
Asn Tyr Cys Phe Pro Leu Gly Gly Ile Gly Val Thr Asp Thr Tyr Gln
420 425 430
Gly Ile Lys Asn Thr Asn Gly Asn Gly Gln Trp Thr Lys Asp Asp Gln
435 440 445
Phe Ala Asp Arg Asn Glu Ile Gly Val Gly Asn Asn Phe Ala Met Glu
450 455 460
Ile Asn Ile Gln Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val
465 470 475 480
Gly Leu Tyr Leu Pro Asp Lys Leu Lys Tyr Asn Pro Thr Asn Val Asp
485 490 495
Ile Ser Asp Asn Pro Asn Thr Tyr Asp Tyr Met Asn Lys Arg Val Val
500 505 510
Ala Pro Gly Leu Val Asp Cys Phe Val Asn Val Gly Ala Arg Trp Ser
515 520 525
Leu Asp Tyr Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala
530 535 540
Gly Leu Arg Tyr Arg Ser Met Ile Leu Gly Asn Gly Arg Tyr Val Pro
545 550 555 560
Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Asn Leu Leu
565 570 575
Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val
580 585 590
Asn Met Val Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Val Asp Gly
595 600 605
Ala Ser Ile Lys Phe Asp Ser Ile Thr Leu Tyr Ala Thr Phe Phe Pro
610 615 620
Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp
625 630 635 640
Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Gly Ala Asn Met Leu
645 650 655
Tyr Pro Ile Pro Ala Lys Ala Thr Asn Val Pro Ile Ser Ile Pro Ser
660 665 670
Arg Asn Trp Ala Ala Phe Arg Gly Trp Ala Phe Thr Arg Leu Lys Thr
675 680 685
Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr
690 695 700
Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr
705 710 715 720
Phe Lys Lys Ile Ser Ile Met Tyr Asp Ser Ser Val Ser Trp Pro Gly
725 730 735
Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Val Lys Arg Ala Val
740 745 750
Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp
755 760 765
Phe Leu Val Gln Met Leu Ala Asn Tyr Asn Ile Gly Tyr Gln Gly Phe
770 775 780
Tyr Ile Pro Glu Ser Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn
785 790 795 800
Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Thr Asn Tyr Lys Asp
805 810 815
Tyr Gln Ala Ile Gly Ile Thr His Gln His Asn Asn Ser Gly Phe Val
820 825 830
Gly Tyr Leu Ala Pro Thr Met Arg Glu Gly Gln Ala Tyr Pro Ala Asn
835 840 845
Phe Pro Tyr Pro Leu Ile Gly Lys Thr Ala Val Asp Ser Val Thr Gln
850 855 860
Lys Lys Phe Leu Cys Asp Arg Thr Leu Trp Arg Ile Pro Phe Ser Ser
865 870 875 880
Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Leu Leu
885 890 895
Tyr Ala Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro
900 905 910
Met Asp Glu Pro Thr Leu Leu Tyr Ile Val Phe Glu Val Phe Asp Val
915 920 925
Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Thr Val Tyr Leu
930 935 940
Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
945 950 955
<210> 47
<211> 955
<212> PRT
<213> Great Ape Adenovirus
<400> 47
Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ser
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Asn Met Ser Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Ile Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr
65 70 75 80
Lys Ala Arg Phe Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Thr
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Pro Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Ser Cys Glu Trp Glu Gln Glu Glu Asn Gln Val Glu Ala
130 135 140
Ala Asp Glu Asp Val Glu Asp Glu Glu Ala Gln Ala Gln Glu Glu Ala
145 150 155 160
Pro Val Lys Lys Ile His Val Tyr Ala Gln Ala Pro Leu Ala Gly Glu
165 170 175
Lys Ile Thr Lys Asp Gly Leu Gln Ile Gly Thr Glu Val Val Gly Glu
180 185 190
Thr Ser Lys Asp Thr Phe Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln
195 200 205
Ile Gly Glu Ser Gln Trp Asn Glu Ala Asp Ala Ala Val Ala Gly Gly
210 215 220
Arg Val Leu Lys Lys Thr Thr Pro Met Arg Pro Cys Tyr Gly Ser Tyr
225 230 235 240
Ala Arg Pro Thr Asn Ala Asn Gly Gly Gln Gly Ile Leu Val Ala Asn
245 250 255
Glu Gln Gly Val Met Glu Ser Lys Val Glu Met Gln Phe Phe Ser Asn
260 265 270
Thr Ser Thr Leu Asn Ala Arg Asp Gly Thr Gly Asn Pro Glu Pro Lys
275 280 285
Val Val Leu Tyr Ser Glu Asp Val His Leu Glu Ser Pro Asp Thr His
290 295 300
Leu Ser Tyr Lys Pro Lys Lys Asp Asp Val Asn Ala Lys Val Met Leu
305 310 315 320
Gly Gln Gln Ala Met Pro Asn Arg Pro Asn Leu Ile Gly Phe Arg Asp
325 330 335
Asn Phe Ile Gly Leu Met Phe Tyr Asn Ser Thr Gly Asn Met Gly Val
340 345 350
Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp
355 360 365
Arg Asn Thr Glu Leu Ser Tyr Gln Leu Met Leu Asp Ser Ile Gly Asp
370 375 380
Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp
385 390 395 400
Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro
405 410 415
Asn Tyr Cys Phe Pro Leu Gly Gly Ile Gly Ile Thr Asp Thr Tyr Gln
420 425 430
Gly Val Lys Asn Thr Asn Gly Asn Gly Gln Trp Thr Lys Asp Asp Gln
435 440 445
Phe Ala Asp Arg Asn Glu Ile Gly Val Gly Asn Asn Phe Ala Met Glu
450 455 460
Ile Asn Ile Gln Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val
465 470 475 480
Gly Leu Tyr Leu Pro Asp Lys Leu Lys Tyr Asn Pro Thr Asn Val Asp
485 490 495
Ile Ser Asp Asn Pro Asn Thr Tyr Asp Tyr Met Asn Lys Arg Val Val
500 505 510
Ala Pro Gly Leu Val Asp Cys Phe Val Asn Val Gly Ala Arg Trp Ser
515 520 525
Leu Asp Tyr Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala
530 535 540
Gly Leu Arg Tyr Arg Ser Met Ile Leu Gly Asn Gly Arg Tyr Val Pro
545 550 555 560
Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Asn Leu Leu
565 570 575
Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val
580 585 590
Asn Met Val Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Val Asp Gly
595 600 605
Ala Ser Ile Lys Phe Asp Ser Ile Thr Leu Tyr Ala Thr Phe Phe Pro
610 615 620
Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp
625 630 635 640
Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Gly Ala Asn Met Leu
645 650 655
Tyr Pro Ile Pro Ala Lys Ala Thr Asn Val Pro Ile Ser Ile Pro Ser
660 665 670
Arg Asn Trp Ala Ala Phe Arg Gly Trp Ala Phe Thr Arg Leu Lys Thr
675 680 685
Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr
690 695 700
Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr
705 710 715 720
Phe Lys Lys Ile Ser Ile Met Tyr Asp Ser Ser Val Ser Trp Pro Gly
725 730 735
Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Val Lys Arg Ala Val
740 745 750
Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp
755 760 765
Phe Leu Val Gln Met Leu Ala Asn Tyr Asn Ile Gly Tyr Gln Gly Phe
770 775 780
Tyr Ile Pro Glu Ser Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn
785 790 795 800
Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Thr Asn Tyr Lys Asp
805 810 815
Tyr Gln Ala Ile Gly Ile Thr His Gln His Asn Asn Ser Gly Phe Val
820 825 830
Gly Tyr Leu Ala Pro Thr Met Arg Glu Gly Gln Ala Tyr Pro Ala Asn
835 840 845
Phe Pro Tyr Pro Leu Ile Gly Lys Thr Ala Val Asp Ser Val Thr Gln
850 855 860
Lys Lys Phe Leu Cys Asp Arg Thr Leu Trp Arg Ile Pro Phe Ser Ser
865 870 875 880
Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Leu Leu
885 890 895
Tyr Ala Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro
900 905 910
Met Asp Glu Pro Thr Leu Leu Tyr Ile Val Phe Glu Val Phe Asp Val
915 920 925
Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Thr Val Tyr Leu
930 935 940
Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
945 950 955
<210> 48
<211> 956
<212> PRT
<213> Great Ape Adenovirus
<400> 48
Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ser
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Asn Met Ser Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Ile Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr
65 70 75 80
Lys Ala Arg Phe Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Thr
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Pro Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Ser Cys Glu Trp Glu Gln Glu Glu Asn Gln Val Val Ala
130 135 140
Ala Asp Glu Asp Leu Glu Glu Asp Glu Glu Ala Gln Ala Glu Glu Gln
145 150 155 160
Ala Pro Val Lys Lys Ile His Val Tyr Ala Gln Ala Pro Leu Ala Gly
165 170 175
Glu Lys Ile Thr Lys Asp Gly Leu Gln Ile Gly Thr Glu Val Val Gly
180 185 190
Asp Thr Ser Lys Asp Thr Phe Ala Asp Lys Thr Phe Gln Pro Glu Pro
195 200 205
Gln Ile Gly Glu Ser Gln Trp Asn Glu Ala Asp Ala Thr Ala Ala Gly
210 215 220
Gly Arg Val Leu Lys Lys Thr Thr Pro Met Arg Pro Cys Tyr Gly Ser
225 230 235 240
Tyr Ala Arg Pro Thr Asn Ala Asn Gly Gly Gln Gly Ile Met Val Ala
245 250 255
Asn Glu Gln Gly Val Leu Gln Ser Lys Val Glu Met Gln Phe Phe Ser
260 265 270
Asn Thr Ser Thr Leu Asn Ala Arg Asp Gly Thr Gly Asn Pro Glu Pro
275 280 285
Lys Val Val Leu Tyr Ser Glu Asp Val His Leu Glu Ser Pro Asp Thr
290 295 300
His Leu Ser Tyr Lys Pro Lys Lys Asp Asp Val Asn Ala Lys Val Met
305 310 315 320
Leu Gly Gln Gln Ala Met Pro Asn Arg Pro Asn Leu Ile Gly Phe Arg
325 330 335
Asp Asn Phe Ile Gly Leu Met Phe Tyr Asn Ser Thr Gly Asn Met Gly
340 345 350
Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln
355 360 365
Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Met Leu Asp Ser Ile Gly
370 375 380
Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr
385 390 395 400
Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu
405 410 415
Pro Asn Tyr Cys Phe Pro Leu Gly Gly Ile Gly Ile Thr Asp Thr Tyr
420 425 430
Gln Gly Val Lys Asn Ser Asn Gly Asn Gly Gln Trp Thr Lys Asp Asp
435 440 445
Gln Phe Ala Asp Arg Asn Glu Ile Gly Val Gly Asn Asn Phe Ala Met
450 455 460
Glu Ile Asn Ile Gln Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn
465 470 475 480
Val Gly Leu Tyr Leu Pro Asp Lys Leu Lys Tyr Asn Pro Thr Asn Val
485 490 495
Asp Ile Ser Asp Asn Pro Asn Thr Tyr Asp Tyr Met Asn Lys Arg Val
500 505 510
Val Ala Pro Gly Leu Val Asp Cys Phe Val Asn Val Gly Ala Arg Trp
515 520 525
Ser Leu Asp Tyr Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn
530 535 540
Ala Gly Leu Arg Tyr Arg Ser Met Ile Leu Gly Asn Gly Arg Tyr Val
545 550 555 560
Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Asn Leu
565 570 575
Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp
580 585 590
Val Asn Met Val Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Val Asp
595 600 605
Gly Ala Ser Ile Lys Phe Asp Ser Ile Thr Leu Tyr Ala Thr Phe Phe
610 615 620
Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn
625 630 635 640
Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Gly Ala Asn Met
645 650 655
Leu Tyr Pro Ile Pro Ala Lys Ala Thr Asn Val Pro Ile Ser Ile Pro
660 665 670
Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ala Phe Thr Arg Leu Lys
675 680 685
Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val
690 695 700
Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His
705 710 715 720
Thr Phe Lys Lys Ile Ser Ile Met Tyr Asp Ser Ser Val Ser Trp Pro
725 730 735
Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Val Lys Arg Ala
740 745 750
Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp
755 760 765
Trp Phe Leu Val Gln Met Leu Ala Asn Tyr Asn Ile Gly Tyr Gln Gly
770 775 780
Phe Tyr Ile Pro Glu Ser Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg
785 790 795 800
Asn Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Thr Asn Tyr Lys
805 810 815
Asp Tyr Gln Ala Ile Gly Ile Thr His Gln His Asn Asn Ser Gly Phe
820 825 830
Val Gly Tyr Leu Ala Pro Thr Met Arg Glu Gly Gln Ala Tyr Pro Ala
835 840 845
Asn Phe Pro Tyr Pro Leu Ile Gly Lys Thr Ala Val Asp Ser Val Thr
850 855 860
Gln Lys Lys Phe Leu Cys Asp Arg Thr Leu Trp Arg Ile Pro Phe Ser
865 870 875 880
Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Leu
885 890 895
Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp
900 905 910
Pro Met Asp Glu Pro Thr Leu Leu Tyr Ile Val Phe Glu Val Phe Asp
915 920 925
Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Thr Val Tyr
930 935 940
Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
945 950 955
<210> 49
<211> 957
<212> PRT
<213> Great Ape Adenovirus
<400> 49
Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ser
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Asn Met Ser Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Ile Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr
65 70 75 80
Lys Ala Arg Phe Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Thr
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Pro Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Ser Cys Glu Trp Glu Gln Glu Glu Thr Gln Ala Ala Glu
130 135 140
Glu Ala Val Asp Glu Glu Asp Ala Glu Asp Glu Ala Gln Pro Gln Glu
145 150 155 160
Glu Ala Pro Val Lys Lys Ile His Val Tyr Ala Gln Ala Pro Leu Ala
165 170 175
Gly Glu Lys Ile Thr Lys Asp Gly Leu Gln Ile Gly Thr Glu Val Val
180 185 190
Gly Asp Thr Ser Lys Asp Thr Phe Ala Asp Lys Thr Phe Gln Pro Glu
195 200 205
Pro Gln Ile Gly Glu Ser Gln Trp Asn Glu Ala Asp Ala Ala Val Ala
210 215 220
Gly Gly Arg Val Leu Lys Lys Thr Thr Pro Met Arg Pro Cys Tyr Gly
225 230 235 240
Ser Tyr Ala Arg Pro Thr Asn Ala Asn Gly Gly Gln Gly Ile Met Val
245 250 255
Ala Asn Glu Lys Gly Val Leu Gln Ser Lys Val Glu Met Gln Phe Phe
260 265 270
Ser Asn Thr Ser Thr Leu Asn Ala Arg Asp Gly Thr Gly Asn Pro Glu
275 280 285
Pro Lys Val Val Leu Tyr Ser Glu Asp Val His Leu Glu Ser Pro Asp
290 295 300
Thr His Leu Ser Tyr Lys Pro Thr Lys Asp Asp Val Asn Ala Lys Val
305 310 315 320
Met Leu Gly Gln Gln Ala Met Pro Asn Arg Pro Asn Leu Ile Gly Phe
325 330 335
Arg Asp Asn Phe Ile Gly Leu Met Phe Tyr Asn Ser Thr Gly Asn Met
340 345 350
Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu
355 360 365
Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Met Leu Asp Ser Ile
370 375 380
Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser
385 390 395 400
Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu
405 410 415
Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Ile Gly Ile Thr Asp Thr
420 425 430
Tyr Gln Gly Val Lys Asn Thr Asn Gly Asn Gly Gln Trp Thr Lys Asp
435 440 445
Asp Gln Phe Ala Asp Arg Asn Glu Ile Gly Val Gly Asn Asn Phe Ala
450 455 460
Met Glu Ile Asn Ile Gln Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala
465 470 475 480
Asn Val Gly Leu Tyr Leu Pro Asp Lys Leu Lys Tyr Asn Pro Thr Asn
485 490 495
Val Asp Ile Ser Asp Asn Pro Asn Thr Tyr Asp Tyr Met Asn Lys Arg
500 505 510
Val Val Ala Pro Gly Leu Val Asp Cys Phe Val Asn Val Gly Ala Arg
515 520 525
Trp Ser Leu Asp Tyr Met Asp Asn Val Asn Pro Phe Asn His His Arg
530 535 540
Asn Ala Gly Leu Arg Tyr Arg Ser Met Ile Leu Gly Asn Gly Arg Tyr
545 550 555 560
Val Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Asn
565 570 575
Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys
580 585 590
Asp Val Asn Met Val Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Val
595 600 605
Asp Gly Ala Ser Ile Lys Phe Asp Ser Ile Thr Leu Tyr Ala Thr Phe
610 615 620
Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg
625 630 635 640
Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Gly Ala Asn
645 650 655
Met Leu Tyr Pro Ile Pro Ala Lys Ala Thr Asn Val Pro Ile Ser Ile
660 665 670
Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ala Phe Thr Arg Leu
675 680 685
Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe
690 695 700
Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn
705 710 715 720
His Thr Phe Lys Lys Ile Ser Ile Met Tyr Asp Ser Ser Val Ser Trp
725 730 735
Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Val Lys Arg
740 745 750
Ala Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr Lys
755 760 765
Asp Trp Phe Leu Val Gln Met Leu Ala Asn Tyr Asn Ile Gly Tyr Gln
770 775 780
Gly Phe Tyr Ile Pro Glu Ser Tyr Lys Asp Arg Met Tyr Ser Phe Phe
785 790 795 800
Arg Asn Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Thr Asn Tyr
805 810 815
Lys Asp Tyr Gln Ala Ile Gly Ile Thr His Gln His Asn Asn Ser Gly
820 825 830
Phe Val Gly Tyr Leu Ala Pro Thr Met Arg Glu Gly Gln Ala Tyr Pro
835 840 845
Ala Asn Phe Pro Tyr Pro Leu Ile Gly Lys Thr Ala Val Asp Ser Val
850 855 860
Thr Gln Lys Lys Phe Leu Cys Asp Arg Thr Leu Trp Arg Ile Pro Phe
865 870 875 880
Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn
885 890 895
Leu Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val
900 905 910
Asp Pro Met Asp Glu Pro Thr Leu Leu Tyr Ile Val Phe Glu Val Phe
915 920 925
Asp Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Thr Val
930 935 940
Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
945 950 955
<210> 50
<211> 955
<212> PRT
<213> Great Ape Adenovirus
<400> 50
Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ser
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Asn Met Ser Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Ile Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr
65 70 75 80
Lys Ala Arg Phe Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Thr
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Pro Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Ser Cys Glu Trp Glu Gln Glu Glu Asn Gln Val Val Ala
130 135 140
Ala Asp Asp Glu Leu Glu Asp Glu Glu Ala Gln Ala Gln Glu Glu Ala
145 150 155 160
Pro Val Lys Lys Ile His Val Tyr Ala Gln Ala Pro Leu Ser Gly Glu
165 170 175
Lys Ile Thr Lys Asp Gly Leu Gln Ile Gly Thr Glu Val Val Gly Asp
180 185 190
Thr Ser Lys Asp Thr Phe Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln
195 200 205
Ile Gly Glu Ser Gln Trp Asn Glu Ala Asp Ala Thr Val Ala Gly Gly
210 215 220
Arg Val Leu Lys Lys Thr Thr Pro Met Arg Pro Cys Tyr Gly Ser Tyr
225 230 235 240
Ala Arg Pro Thr Asn Ala Asn Gly Gly Gln Gly Ile Met Val Ala Asn
245 250 255
Glu Gln Gly Val Leu Glu Ser Lys Val Glu Met Gln Phe Phe Ser Asn
260 265 270
Thr Thr Thr Leu Asn Ala Arg Asp Gly Thr Gly Asn Pro Glu Pro Lys
275 280 285
Val Val Leu Tyr Ser Glu Asp Val His Leu Glu Ser Pro Asp Thr His
290 295 300
Leu Ser Tyr Lys Pro Lys Lys Asp Asp Val Asn Ala Lys Ile Met Leu
305 310 315 320
Gly Gln Gln Ala Met Pro Asn Arg Pro Asn Leu Ile Gly Phe Arg Asp
325 330 335
Asn Phe Ile Gly Leu Met Phe Tyr Asn Ser Thr Gly Asn Met Gly Val
340 345 350
Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp
355 360 365
Arg Asn Thr Glu Leu Ser Tyr Gln Leu Met Leu Asp Ser Ile Gly Asp
370 375 380
Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp
385 390 395 400
Pro Asp Val Arg Ile Ile Glu Asn His Gly Thr Glu Asp Glu Leu Pro
405 410 415
Asn Tyr Cys Phe Pro Leu Gly Gly Ile Gly Val Thr Asp Thr Tyr Gln
420 425 430
Gly Ile Lys Asn Thr Asn Gly Asn Gly Gln Trp Thr Lys Asp Asp Gln
435 440 445
Phe Ala Asp Arg Asn Glu Ile Gly Val Gly Asn Asn Phe Ala Met Glu
450 455 460
Ile Asn Ile Gln Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val
465 470 475 480
Gly Leu Tyr Leu Pro Asp Lys Leu Lys Tyr Asn Pro Thr Asn Val Asp
485 490 495
Ile Ser Asp Asn Pro Asn Thr Tyr Asp Tyr Met Asn Lys Arg Val Val
500 505 510
Ala Pro Gly Leu Val Asp Cys Phe Val Asn Val Gly Ala Arg Trp Ser
515 520 525
Leu Asp Tyr Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala
530 535 540
Gly Leu Arg Tyr Arg Ser Met Ile Leu Gly Asn Gly Arg Tyr Val Pro
545 550 555 560
Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Asn Leu Leu
565 570 575
Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val
580 585 590
Asn Met Val Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Val Asp Gly
595 600 605
Ala Ser Ile Lys Phe Asp Ser Ile Thr Leu Tyr Ala Thr Phe Phe Pro
610 615 620
Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp
625 630 635 640
Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Gly Ala Asn Met Leu
645 650 655
Tyr Pro Ile Pro Ala Lys Ala Thr Asn Val Pro Ile Ser Ile Pro Ser
660 665 670
Arg Asn Trp Ala Ala Phe Arg Gly Trp Ala Phe Thr Arg Leu Lys Thr
675 680 685
Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr
690 695 700
Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr
705 710 715 720
Phe Lys Lys Ile Ser Ile Met Tyr Asp Ser Ser Val Ser Trp Pro Gly
725 730 735
Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Val Lys Arg Ala Val
740 745 750
Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp
755 760 765
Phe Leu Val Gln Met Leu Ala Asn Tyr Asn Ile Gly Tyr Gln Gly Phe
770 775 780
Tyr Ile Pro Glu Ser Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn
785 790 795 800
Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Thr Asn Tyr Lys Asp
805 810 815
Tyr Gln Ala Ile Gly Ile Thr His Gln His Asn Asn Ser Gly Phe Val
820 825 830
Gly Tyr Leu Ala Pro Thr Met Arg Glu Gly Gln Ala Tyr Pro Ala Asn
835 840 845
Phe Pro Tyr Pro Leu Ile Gly Lys Thr Ala Val Asp Ser Val Thr Gln
850 855 860
Lys Lys Phe Leu Cys Asp Arg Thr Leu Trp Arg Ile Pro Phe Ser Ser
865 870 875 880
Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Leu Leu
885 890 895
Tyr Ala Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro
900 905 910
Met Asp Glu Pro Thr Leu Leu Tyr Ile Val Phe Glu Val Phe Asp Val
915 920 925
Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Thr Val Tyr Leu
930 935 940
Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
945 950 955
<210> 51
<211> 651
<212> PRT
<213> Great Ape Adenovirus
<400> 51
Met Arg Arg Ala Ala Met Phe Glu Glu Gly Pro Pro Pro Ser Tyr Glu
1 5 10 15
Ser Ala Met Gly Ile Ser Pro Ala Ala Pro Leu Gln Pro Pro Tyr Val
20 25 30
Pro Pro Arg Tyr Leu Gln Pro Thr Gly Gly Arg Asn Ser Ile Cys Tyr
35 40 45
Ser Glu Leu Gln Pro Leu Tyr Asp Thr Thr Arg Leu Tyr Leu Val Asp
50 55 60
Asn Lys Ser Ala Asp Val Ala Ser Leu Asn Tyr Gln Asn Asp His Ser
65 70 75 80
Asp Phe Leu Thr Thr Val Ile Gln Asn Asn Asp Phe Thr Pro Thr Glu
85 90 95
Ala Ser Thr Gln Thr Ile Asn Leu Asp Asn Arg Ser Asn Trp Gly Gly
100 105 110
Asp Leu Lys Thr Ile Leu His Thr Asn Met Pro Asn Val Asn Glu Phe
115 120 125
Met Phe Thr Asn Ser Phe Lys Ala Arg Val Met Val Ala Arg Glu Gln
130 135 140
Gly Glu Ala Lys Tyr Glu Trp Val Asp Phe Thr Leu Pro Glu Gly Asn
145 150 155 160
Tyr Ser Glu Thr Met Thr Leu Asp Leu Met Asn Asn Ala Ile Val Glu
165 170 175
His Tyr Leu Lys Val Gly Arg Gln Asn Gly Val Lys Glu Ser Asp Ile
180 185 190
Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val
195 200 205
Thr Gly Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro
210 215 220
Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Gln Ser Arg
225 230 235 240
Leu Ser Asn Leu Leu Gly Val Arg Lys Arg Gln Pro Phe Gln Glu Gly
245 250 255
Phe Lys Ile Thr Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu
260 265 270
Leu Asp Leu Asp Ala Tyr Glu Glu Ser Leu Lys Pro Glu Glu Ser Ala
275 280 285
Gly Asp Ser Gly Glu Ser Gly Glu Glu Gln Ala Gly Gly Gly Gly Ser
290 295 300
Ala Ser Val Glu Asn Glu Ser Thr Pro Ala Val Ala Ala Asp Ala Ala
305 310 315 320
Glu Val Glu Pro Glu Ala Met Gln Gln Asp Ala Glu Glu Gly Ala Gln
325 330 335
Glu Asp Met Asn Asn Gly Glu Ile Arg Gly Asp Thr Phe Ala Thr Arg
340 345 350
Gly Glu Glu Lys Glu Ala Glu Ala Ala Ala Ala Thr Ala Glu Ala Glu
355 360 365
Thr Glu Ala Glu Ala Glu Pro Glu Thr Glu Val Met Glu Asp Met Asn
370 375 380
Asp Gly Glu Arg Arg Gly Asp Thr Phe Ala Thr Arg Gly Glu Glu Lys
385 390 395 400
Ala Ala Glu Ala Glu Ala Ala Ala Glu Glu Ala Ala Ala Ala Ala Ala
405 410 415
Lys Ala Glu Ala Ala Ala Glu Ala Lys Val Glu Ala Asp Val Ala Val
420 425 430
Glu Ala Gln Ala Glu Glu Glu Ala Ala Ala Glu Ala Val Lys Glu Lys
435 440 445
Ala Gln Ala Glu Gln Glu Glu Lys Lys Pro Val Ile Gln Pro Leu Lys
450 455 460
Glu Asp Ser Lys Lys Arg Ser Tyr Asn Val Ile Glu Gly Ser Thr Phe
465 470 475 480
Thr Gln Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Val
485 490 495
Lys Gly Val Arg Ser Trp Thr Leu Leu Cys Thr Pro Asp Val Thr Cys
500 505 510
Gly Ser Glu Gln Met Tyr Trp Ser Leu Pro Asn Met Met Gln Asp Pro
515 520 525
Val Thr Phe Arg Ser Thr Arg Gln Val Ser Asn Phe Pro Val Val Gly
530 535 540
Ala Glu Leu Leu Pro Val His Ser Lys Ser Phe Tyr Asn Glu Gln Ala
545 550 555 560
Val Tyr Ser Gln Leu Ile Arg Gln Ala Thr Ser Leu Thr His Val Phe
565 570 575
Asn Arg Phe Pro Glu Asn Gln Ile Leu Ala Arg Pro Pro Ala Pro Thr
580 585 590
Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr
595 600 605
Leu Pro Leu Arg Asn Ser Ile Ser Gly Val Gln Arg Val Thr Ile Thr
610 615 620
Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile
625 630 635 640
Val Ser Pro Arg Val Leu Ser Ser Arg Thr Phe
645 650
<210> 52
<211> 651
<212> PRT
<213> Great Ape Adenovirus
<400> 52
Met Arg Arg Ala Ala Met Phe Glu Glu Gly Pro Pro Pro Ser Tyr Glu
1 5 10 15
Ser Ala Met Gly Ile Ser Pro Ala Ala Pro Leu Gln Pro Pro Tyr Val
20 25 30
Pro Pro Arg Tyr Leu Gln Pro Thr Gly Gly Arg Asn Ser Ile Cys Tyr
35 40 45
Ser Glu Leu Gln Pro Leu Tyr Asp Thr Thr Arg Leu Tyr Leu Val Asp
50 55 60
Asn Lys Ser Ala Asp Val Ala Ser Leu Asn Tyr Gln Asn Asp His Ser
65 70 75 80
Asp Phe Leu Thr Thr Val Ile Gln Asn Asn Asp Phe Thr Pro Thr Glu
85 90 95
Ala Ser Thr Gln Thr Ile Asn Leu Asp Asn Arg Ser Asn Trp Gly Gly
100 105 110
Asp Leu Lys Thr Ile Leu His Thr Asn Met Pro Asn Val Asn Glu Phe
115 120 125
Met Phe Thr Asn Ser Phe Lys Ala Arg Val Met Val Ala Arg Glu Gln
130 135 140
Gly Glu Ala Lys Tyr Glu Trp Val Asp Phe Thr Leu Pro Glu Gly Asn
145 150 155 160
Tyr Ser Glu Thr Met Thr Leu Asp Leu Met Asn Asn Ala Ile Val Glu
165 170 175
His Tyr Leu Lys Val Gly Arg Gln Asn Gly Val Lys Glu Ser Asp Ile
180 185 190
Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val
195 200 205
Thr Gly Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro
210 215 220
Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Gln Ser Arg
225 230 235 240
Leu Ser Asn Leu Leu Gly Val Arg Lys Arg Gln Pro Phe Gln Glu Gly
245 250 255
Phe Lys Ile Thr Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu
260 265 270
Leu Asp Leu Asp Ala Tyr Glu Glu Ser Leu Lys Pro Glu Glu Ser Ala
275 280 285
Gly Asp Ser Gly Glu Ser Gly Glu Glu Gln Ala Gly Gly Gly Gly Ser
290 295 300
Ala Ser Val Glu Asn Glu Ser Thr Pro Ala Val Ala Ala Asp Ala Ala
305 310 315 320
Glu Val Glu Pro Glu Ala Met Gln Gln Asp Ala Glu Glu Gly Ala Gln
325 330 335
Glu Asp Met Asn Asn Gly Glu Ile Arg Gly Asp Thr Phe Ala Thr Arg
340 345 350
Gly Glu Glu Lys Glu Ala Glu Ala Ala Ala Ala Thr Ala Glu Ala Glu
355 360 365
Thr Glu Ala Glu Ala Glu Pro Glu Thr Glu Val Met Glu Asp Met Asn
370 375 380
Asp Gly Glu Arg Arg Gly Asp Thr Phe Ala Thr Arg Gly Glu Glu Lys
385 390 395 400
Ala Ala Glu Ala Glu Ala Ala Ala Glu Glu Ala Ala Ala Ala Ala Ala
405 410 415
Lys Ala Glu Ala Ala Ala Glu Ala Lys Val Glu Ala Asp Val Ala Val
420 425 430
Glu Ala Gln Ala Glu Glu Glu Ala Ala Thr Glu Ala Val Lys Glu Lys
435 440 445
Ala Gln Ala Glu Gln Glu Glu Lys Lys Pro Val Ile Gln Pro Leu Lys
450 455 460
Glu Asp Ser Lys Lys Arg Ser Tyr Asn Val Ile Glu Gly Ser Thr Phe
465 470 475 480
Thr Gln Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Val
485 490 495
Lys Gly Val Arg Ser Trp Thr Leu Leu Cys Thr Pro Asp Val Thr Cys
500 505 510
Gly Ser Glu Gln Met Tyr Trp Ser Leu Pro Asn Met Met Gln Asp Pro
515 520 525
Val Thr Phe Arg Ser Thr Arg Gln Val Ser Asn Phe Pro Val Val Gly
530 535 540
Ala Glu Leu Leu Pro Val His Ser Lys Ser Phe Tyr Asn Glu Gln Ala
545 550 555 560
Val Tyr Ser Gln Leu Ile Arg Gln Ala Thr Ser Leu Thr His Val Phe
565 570 575
Asn Arg Phe Pro Glu Asn Gln Ile Leu Ala Arg Pro Pro Ala Pro Thr
580 585 590
Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr
595 600 605
Leu Pro Leu Arg Asn Ser Ile Ser Gly Val Gln Arg Val Thr Ile Thr
610 615 620
Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile
625 630 635 640
Val Ser Pro Arg Val Leu Ser Ser Arg Thr Phe
645 650
<210> 53
<211> 598
<212> PRT
<213> Great Ape Adenovirus
<400> 53
Met Ser Asp Ser Ser Ser Cys Pro Ser Ala Pro Thr Ile Phe Met Leu
1 5 10 15
Leu Gln Met Lys Arg Ala Arg Ser Ser Asp Glu Thr Phe Asn Pro Val
20 25 30
Tyr Pro Tyr Asp Thr Glu Ile Ala Pro Thr Ser Val Pro Phe Leu Thr
35 40 45
Pro Pro Phe Val Ser Ser Ala Gly Met Gln Glu Asn Pro Ala Gly Val
50 55 60
Leu Ser Leu His Leu Ser Glu Pro Leu Thr Thr His Asn Gly Ala Leu
65 70 75 80
Thr Leu Lys Met Gly Gly Gly Leu Thr Leu Asp Lys Glu Gly Asn Leu
85 90 95
Thr Ser Gln Asn Ile Thr Ser Val Asp Pro Pro Leu Lys Lys Ser Lys
100 105 110
Asn Asn Ile Ser Leu Gln Thr Ala Ala Pro Leu Ala Val Ser Ser Gly
115 120 125
Ala Leu Thr Leu Phe Ala Thr Pro Pro Leu Ala Val Ser Gly Asp Asn
130 135 140
Leu Thr Val Gln Ser Gln Ala Pro Leu Thr Leu Glu Asp Ser Lys Leu
145 150 155 160
Thr Leu Ala Thr Lys Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Val
165 170 175
Leu Glu Thr Glu Pro Pro Leu His Ala Ser Asp Ser Ser Ser Leu Gly
180 185 190
Leu Ser Val Thr Ala Pro Leu Ser Ile Asn Asn Asp Ser Leu Gly Leu
195 200 205
Asp Met Gln Ala Pro Ile Ser Ser Arg Asp Gly Lys Leu Ala Leu Thr
210 215 220
Val Ala Ala Pro Leu Thr Val Ala Glu Gly Ile Asn Ala Leu Ala Val
225 230 235 240
Ala Thr Gly Asn Gly Ile Gly Leu Asn Glu Thr Asn Thr His Leu Gln
245 250 255
Ala Lys Leu Val Ala Pro Leu Gly Phe Asp Thr Asn Gly Asn Ile Lys
260 265 270
Leu Ser Val Ala Gly Gly Met Arg Leu Asn Asn Asn Thr Leu Ile Leu
275 280 285
Asp Val Asn Tyr Pro Phe Glu Ala Gln Gly Gln Leu Ser Leu Arg Val
290 295 300
Gly Ser Gly Pro Leu Tyr Val Asp Ser Ser Ser His Asn Leu Thr Ile
305 310 315 320
Arg Cys Leu Arg Gly Leu Tyr Val Thr Ser Ser Asn Asn Gln Asn Gly
325 330 335
Leu Glu Ala Asn Ile Lys Leu Thr Lys Gly Leu Val Tyr Asp Gly Asn
340 345 350
Ala Ile Ala Val Asn Val Gly Lys Gly Leu Glu Tyr Ser Pro Thr Gly
355 360 365
Thr Thr Glu Lys Pro Ile Gln Thr Lys Ile Gly Leu Gly Met Glu Tyr
370 375 380
Asp Thr Glu Gly Ala Met Met Thr Lys Leu Gly Ser Gly Leu Ser Phe
385 390 395 400
Asp Asn Ser Gly Ala Ile Val Val Gly Asn Lys Asn Asp Asp Arg Leu
405 410 415
Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gln Ile Tyr Ser
420 425 430
Glu Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gln
435 440 445
Val Val Gly Thr Val Ser Ile Ala Ala Leu Lys Gly Ser Leu Val Pro
450 455 460
Ile Thr Ser Ala Ile Ser Val Val Gln Ile Tyr Leu Arg Phe Asp Glu
465 470 475 480
Asn Gly Val Leu Met Ser Asn Ser Ser Leu Asn Gly Glu Tyr Trp Asn
485 490 495
Phe Arg Asn Gly Asp Ser Thr Asn Gly Thr Pro Tyr Thr Asn Ala Val
500 505 510
Gly Phe Met Pro Asn Leu Leu Ala Tyr Pro Lys Gly Gln Thr Thr Thr
515 520 525
Ala Lys Ser Asn Ile Val Ser Gln Val Tyr Met Asn Gly Asp Asp Thr
530 535 540
Lys Pro Met Thr Phe Thr Ile Asn Phe Asn Gly Leu Ser Glu Thr Gly
545 550 555 560
Asp Thr Pro Val Ser Lys Tyr Ser Met Thr Phe Ser Trp Arg Trp Pro
565 570 575
Asn Gly Ser Tyr Ile Gly His Asn Phe Val Thr Asn Ser Phe Thr Phe
580 585 590
Ser Tyr Ile Ala Gln Glu
595
<210> 54
<211> 602
<212> PRT
<213> Great Ape Adenovirus
<400> 54
Met Ser Asp Ser Ser Ser Ser Cys Pro Ser Ala Pro Thr Ile Phe Met
1 5 10 15
Leu Leu Gln Met Lys Arg Ala Arg Ser Ser Asp Glu Thr Phe Asn Pro
20 25 30
Val Tyr Pro Tyr Asp Thr Glu Ile Ala Pro Thr Ser Val Pro Phe Leu
35 40 45
Thr Pro Pro Phe Val Ser Pro Ala Gly Met Gln Glu Asn Pro Ala Gly
50 55 60
Val Leu Ser Leu His Leu Ser Glu Pro Leu Thr Thr His Asn Gly Ala
65 70 75 80
Leu Thr Leu Lys Met Gly Gly Gly Leu Ile Leu Asp Lys Glu Gly Asn
85 90 95
Leu Thr Ser Gln Asn Ile Thr Ser Val Asp Pro Pro Leu Lys Lys Ser
100 105 110
Lys Asn Asn Ile Ser Leu Gln Thr Ala Ala Pro Leu Ala Val Ser Ser
115 120 125
Gly Ala Leu Thr Leu Phe Ala Thr Pro Pro Leu Ala Val Ser Gly Asp
130 135 140
Asn Leu Thr Val Gln Ser Gln Ala Pro Leu Thr Leu Glu Asp Ser Lys
145 150 155 160
Leu Thr Leu Ala Thr Lys Gly Pro Leu Thr Val Ser Glu Gly Lys Leu
165 170 175
Val Leu Glu Thr Glu Ala Pro Leu His Ala Ser Asp Ser Ser Ser Leu
180 185 190
Gly Leu Ser Val Thr Ala Pro Leu Ser Ile Asn Asn Asp Ser Leu Gly
195 200 205
Leu Asp Leu Gln Ala Pro Ile Val Ser Gln Asn Gly Lys Leu Ala Leu
210 215 220
Asn Ile Ala Gly Pro Leu Ala Val Ala Asp Ser Ile Asn Ala Leu Thr
225 230 235 240
Val Gly Thr Gly Lys Gly Ile Gly Leu Asn Glu Thr Ser Thr His Leu
245 250 255
Gln Ala Lys Leu Val Ala Pro Leu Gly Phe Asp Thr Asn Gly Asn Ile
260 265 270
Lys Leu Ser Val Ala Gly Gly Met Arg Leu Asn Asn Asp Thr Leu Ile
275 280 285
Leu Asp Val Asn Tyr Pro Phe Glu Ala Gln Gly Gln Leu Ser Leu Arg
290 295 300
Val Gly Thr Gly Pro Leu Tyr Val Asp Ser Ser Ser His Asn Leu Thr
305 310 315 320
Ile Arg Cys Leu Arg Gly Leu Tyr Ile Thr Ser Ser Asn Asn Gln Asn
325 330 335
Gly Leu Glu Ala Asn Ile Lys Leu Thr Lys Gly Leu Val Tyr Glu Gly
340 345 350
Asn Ala Ile Ala Val Asn Val Gly Gln Gly Leu Gln Tyr Ser Thr Thr
355 360 365
Ala Thr Ser Glu Gly Val Tyr Pro Ile Gln Ser Lys Ile Gly Leu Gly
370 375 380
Met Glu Tyr Asp Thr Asn Gly Ala Met Met Ala Lys Leu Gly Ser Gly
385 390 395 400
Leu Ser Phe Asp Asn Ser Gly Ala Ile Val Val Gly Asn Lys Asn Asp
405 410 415
Asp Lys Leu Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Arg
420 425 430
Ile Tyr Ser Glu Lys Asp Thr Lys Leu Thr Leu Val Leu Thr Lys Cys
435 440 445
Gly Ser Gln Ile Leu Gly Thr Val Ser Ala Leu Ala Val Arg Gly Ser
450 455 460
Leu Ala Pro Ile Thr Asn Ala Ser Ser Ile Val Gln Ile Phe Leu Arg
465 470 475 480
Phe Asp Glu Asn Gly Leu Leu Met Ser Asn Ser Ser Leu Asp Gly Asp
485 490 495
Tyr Trp Asn Tyr Arg Asn Gly Asp Ser Thr Asn Gly Thr Pro Tyr Thr
500 505 510
Asn Ala Val Gly Phe Met Pro Asn Leu Ala Ala Tyr Pro Lys Gly Gln
515 520 525
Ala Thr Thr Ala Lys Ser Ser Ile Val Ser Gln Val Tyr Met Asp Gly
530 535 540
Asp Thr Thr Lys Pro Ile Thr Leu Lys Ile Asn Phe Asn Gly Ile Asp
545 550 555 560
Glu Thr Thr Glu Asn Thr Pro Val Ser Lys Tyr Ser Met Thr Phe Ser
565 570 575
Trp Ser Trp Pro Thr Ala Ser Tyr Ile Gly His Thr Phe Ala Thr Asn
580 585 590
Ser Phe Thr Phe Ser Tyr Ile Ala Gln Glu
595 600
<210> 55
<211> 168
<212> DNA
<213> Great Ape Adenovirus
<400> 55
agggctttcg ttctgtagcc tggaggaaag taaatgggtt gggttgcggt gtgccccggt 60
tcgagaccaa gctgagctca gccggctgaa gccgcagcta acgtggtatt ggcagtcccg 120
tctcgaccca ggccctgtat cctccaggat acggtcgaga gccctttt 168
<210> 56
<211> 168
<212> DNA
<213> Great Ape Adenovirus
<400> 56
agggctttcg ttctgtagcc tggaggaaag taaatgggtt gggttgcggt gtgccccggt 60
tcgagaccaa gctgagctcg gccggctgaa gccgcagcta acgtggtatt ggcagtcccg 120
tctcgaccca ggccctgtat cctccaggat acggtcgaga gccctttt 168
<210> 57
<211> 174
<212> DNA
<213> Great Ape Adenovirus
<400> 57
ggctcgcttc cgtagtctgg agaaacaatc gccagggttg cgttgcggcg taccccggtt 60
cgagccccta tggcggcttg gatcggccgg aaccgcggct aacgtgggct gtggcagccc 120
cgtcctcagg accccgccag ccgacttctc cagttacggg agcgagcccc tttt 174
<210> 58
<211> 104
<212> DNA
<213> Artificial Sequence
<220>
<223> FW primer GAd-GAG left end
<400> 58
gaactccgaa ttcgtttaaa ccatcatcaa taatatacct tattttggat tgaggccaat 60
atgataatga ggtgggcggg gcgaggcggg gcgggtgacg tagg 104
<210> 59
<211> 43
<212> DNA
<213> Artificial Sequence
<220>
<223> RV primer GAd-GAG left end
<400> 59
cataatcggc cgcagcggcc cgtcagatga cggcgacaat aaa 43
<210> 60
<211> 37
<212> DNA
<213> Artificial Sequence
<220>
<223> FW primer GAd right end
<400> 60
cataatcgac ccgagtcgca ctctcacagc accagca 37
<210> 61
<211> 47
<212> DNA
<213> Artificial Sequence
<220>
<223> RV primer GAd right end
<400> 61
gaactccgga tccgtttaaa ccatcatcaa taatatacct tattttg 47
<210> 62
<211> 35
<212> DNA
<213> Artificial Sequence
<220>
<223> FW primer pIX
<400> 62
cataatcgcg atcgcgctta ggcctgacca tctgg 35
<210> 63
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> RV primer pIX
<400> 63
gaactccggc gcgccttagg gggaggcaag gctg 34
<210> 64
<211> 55
<212> DNA
<213> Artificial Sequence
<220>
<223> FW primer Amp-LacZ-SacB Ex. 2
<400> 64
gaactccggc gcgcctaggg ataacagggt aataccccta tttgtttatt tttct 55
<210> 65
<211> 56
<212> DNA
<213> Artificial Sequence
<220>
<223> RV primer Amp-LacZ-SacB Ex. 2
<400> 65
cataatcggc gcgccattac cctgttatcc ctattatttg ttaactgtta attgtc 56
<210> 66
<211> 92
<212> DNA
<213> Artificial Sequence
<220>
<223> FW primer Amp-LacZ-SacB Ex. 4
<400> 66
ggattacacc aagatctttg ctgtcatttg tgtgctgagt ataataaagg ctgagatcag 60
aatctactcg acccctattt gtttattttt ct 92
<210> 67
<211> 93
<212> DNA
<213> Artificial Sequence
<220>
<223> RV primer Amp-LacZ-SacB Ex. 4
<400> 67
cttgctatca gatttcaagt aagtgatttt ttattgatta cagttatgat caattgaaag 60
ggataaggtc ttatttgtta actgttaatt gtc 93
<210> 68
<211> 100
<212> DNA
<213> Artificial Sequence
<220>
<223> SS oligo Amp-LacZ-SacB
<400> 68
ctgtcatttg tgtgctgagt ataataaagg ctgagatcag aatctactcg gaccttatcc 60
ctttcaattg atcataactg taatcaataa aaaatcactt 100
<210> 69
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> CMVfw
<400> 69
catctacgta ttagtcatcg ctattacca 29
<210> 70
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> CMVrv
<400> 70
gacttggaaa tccccgtgag t 21
<210> 71
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> CMVFAM-TAMRA probe
<400> 71
acatcaatgg gcgtggatag cggtt 25
<210> 72
<211> 33360
<212> DNA
<213> Artificial Sequence
<220>
<223> GADNOU19 GAG (DE1DE3)
<400> 72
catcatcaat aatatacctt attttggatt gaggccaata tgataatgag gtgggcgggg 60
cgaggcgggg cgggtgacgt aggacgcgcg agtagggttg ggaggtgtgg cggaagtgtg 120
gcatttgcaa gtgggaggag ctgacatgca atcttccgtc gcggaaaatg tgacgttttt 180
gatgagcgcc gcctacctcc ggaagtgcca attttcgcgc gcttttcacc ggatatcgta 240
gtaattttgg gcgggaccat gtaagatttg gccattttcg cgcgaaaagt gaaacgggga 300
agtgaaaact gaataatagg gcgttagtca tagcgcgtaa tatttaccga gggccgaggg 360
actttgaccg attacgtgga ggactcgccc aggtgttttt tacgtgaatt tccgcgttcc 420
gggtcaaagt ctccgttttt attgtcgccg tcatctgacg ggccgccatt gcatacgttg 480
tatccatatc ataatatgta catttatatt ggctcatgtc caacattacc gccatgttga 540
cattgattat tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca 600
tatatggagt tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac 660
gacccccgcc cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact 720
ttccattgac gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa 780
gtgtatcata tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg 840
cattatgccc agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta 900
gtcatcgcta ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg 960
tttgactcac ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg 1020
caccaaaatc aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg 1080
ggcggtaggc gtgtacggtg ggaggtctat ataagcagag ctctccctat cagtgataga 1140
gatctcccta tcagtgatag agatcgtcga cgagctcgtt tagtgaaccg tcagatcgcc 1200
tggagacgcc atccacgctg ttttgacctc catagaagac accgggaccg atccagcctc 1260
cgcggccggg aacggtgcat tggaacgcgg attccccgtg ccaagagtga gatctaccat 1320
gggtgctagg gcttctgtgc tgtctggtgg tgagctggac aagtgggaga agatcaggct 1380
gaggcctggt ggcaagaaga agtacaagct aaagcacatt gtgtgggcct ccagggagct 1440
ggagaggttt gctgtgaacc ctggcctgct ggagacctct gaggggtgca ggcagatcct 1500
gggccagctc cagccctccc tgcaaacagg ctctgaggag ctgaggtccc tgtacaacac 1560
agtggctacc ctgtactgtg tgcaccagaa gattgatgtg aaggacacca aggaggccct 1620
ggagaagatt gaggaggagc agaacaagtc caagaagaag gcccagcagg ctgctgctgg 1680
cacaggcaac tccagccagg tgtcccagaa ctaccccatt gtgcagaacc tccagggcca 1740
gatggtgcac caggccatct ccccccggac cctgaatgcc tgggtgaagg tggtggagga 1800
gaaggccttc tcccctgagg tgatccccat gttctctgcc ctgtctgagg gtgccacccc 1860
ccaggacctg aacaccatgc tgaacacagt ggggggccat caggctgcca tgcagatgct 1920
gaaggagacc atcaatgagg aggctgctga gtgggacagg ctgcatcctg tgcacgctgg 1980
ccccattgcc cccggccaga tgagggagcc caggggctct gacattgctg gcaccacctc 2040
caccctccag gagcagattg gctggatgac caacaacccc cccatccctg tgggggaaat 2100
ctacaagagg tggatcatcc tgggcctgaa caagattgtg aggatgtact cccccacctc 2160
catcctggac atcaggcagg gccccaagga gcccttcagg gactatgtgg acaggttcta 2220
caagaccctg agggctgagc aggcctccca ggaggtgaag aactggatga cagagaccct 2280
gctggtgcag aatgccaacc ctgactgcaa gaccatcctg aaggccctgg gccctgctgc 2340
caccctggag gagatgatga cagcctgcca gggggtgggg ggccctggtc acaaggccag 2400
ggtgctggct gaggccatgt cccaggtgac caactccgcc accatcatga tgcagagggg 2460
caacttcagg aaccagagga agacagtgaa gtgcttcaac tgtggcaagg tgggccacat 2520
tgccaagaac tgtagggccc ccaggaagaa gggctgctgg aagtgtggca aggagggcca 2580
ccagatgaag gactgcaatg agaggcaggc caacttcctg ggcaaaatct ggccctccca 2640
caagggcagg cctggcaact tcctccagtc caggcctgag cccacagccc ctcccgagga 2700
gtccttcagg tttggggagg agaagaccac ccccagccag aagcaggagc ccattgacaa 2760
ggagctgtac cccctggcct ccctgaggtc cctgtttggc aacgacccct cctcccagta 2820
aaataaagcc catcagaatt cagtcgacag cggccgcgat ctgctgtgcc ttctagttgc 2880
cagccatctg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc 2940
actgtccttt cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct 3000
attctggggg gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg 3060
catgctgggg atgcggtggg ctctatggcc gggccgcgat cgcgcttagg cctgaccatc 3120
tggtgctggc ctgcaccagg gccgagtttg ggtctagcga tgaggatacc gattgaggtg 3180
ggtaaggtgg gcgtggctag cagggtgggc gtgtataaat tgggggtcta aggggtctct 3240
ctgtttgtct tgcaacagcc gccgccatga gcgacaccgg caacagcttt gatggaagca 3300
tctttagtcc ctatctgaca gtgcgcatgc ctcactgggc cggagtgcgt cagaatgtga 3360
tgggttccaa cgtggatgga cgtcccgttc tgccttcaaa ttcgtctact atggcctacg 3420
cgaccgtggg aggaactccg ctggacgccg cgacctccgc cgccgcctcc gccgccgccg 3480
cgaccgcgcg cagcatggct acggaccttt acagctcttt ggtggcgagc agcgcggcct 3540
ctcgcgcgtc tgctcgggat gagaaactga ctgctctgct gcttaaactg gaagacttga 3600
cccgggagct gggtcaactg acccagcagg tttccagctt gcgtgagagc agccttgcct 3660
ccccctaatg gcccataata taaataaaag ccagtctgtt tggattaagc aagtgtatgt 3720
tctttattta actctccgcg cgcggtaagc ccgggaccag cggtctcggt cgtttagggt 3780
gcggtggatt ttttccaaca cgtggtacag gtggctctgg atgtttagat acatgggcat 3840
gagtccatcc ctggggtgga ggtagcacca ctgcagagct tcgtgctcgg gggtggtgtt 3900
gtatatgatc cagtcgtagc aggagcgctg ggcgtggtgc tgaaaaatgt ccttaagcaa 3960
gaggcttata gctaggggga ggcccttggt gtaagtgttt acaaatctgc ttagctggga 4020
ggggtgcatc cggggggata tgatgtgcat cttggactgg atttttaggt tggctatgtt 4080
cccgcccaga tcccttctgg gattcatgtt gtgcaggacc accagcacgg tatatccagt 4140
gcacttggga aatttatcgt ggagcttaga cgggaatgca tggaagaact tggagacgcc 4200
cttgtggcct cccagatttt ccatacattc gtccatgatg atggcaatgg gcccgtggga 4260
agctgcctga gcaaaaacgt ttctggcatc gctcacatcg tagttatgtt ccagggtgag 4320
gtcatcatag gacatcttta cgaatcgggg gcgaagggtc ccggactggg ggatgatggt 4380
accctcgggc cccggggcgt agttcccctc acagatctgc atctcccagg ctttcatttc 4440
agagggaggg atcatatcca cctgcggggc gatgaaaaag acagtttctg gcgcagggga 4500
gattaactgg gatgagagca ggtttctgag cagctgtgac tttccacagc cggtgggccc 4560
atatatcacg cctatcaccg gctgcagctg gtagttaaga gagctgcagc tgccgtcctc 4620
ccggagcagg ggggccacct cgttgagcat atccctgacg tggatgttct ccctgaccag 4680
ttccgccaga aggcgctcgc cgcccagcga aagcagctct tgcaaggaag caaaattttt 4740
cagcggtttc aggccatcgg ccgtgggcat gtttttcagc gtctgggtca gcagctccag 4800
cctgtcccag agctcggtga tgtgctctac ggcatctcga tccagcagat ctcctcgttt 4860
cgcgggttgg ggcggctttc gctgtagggc accagccgat gggcgtccag cggggccaga 4920
gtcatgtcct tccatgggcg cagggtcctc gtcagggtgg tctgggtcac ggtgaagggg 4980
tgcgctccgg gttgggcact ggccagggtg cgcttgaggc tggttctgct ggtgctgaat 5040
cgctgccgct cttcgccctg cgcgtcggcc aggtagcatt tgaccatggt ctcgtagtcg 5100
agaccctcgg cggcgtgccc cttggcgcgg agctttccct tggaggtggc gccgcacgag 5160
gggcactgca ggctcttcag ggcgtagagc ttgggagcga gaaacacgga ctctggggag 5220
taggcgtccg cgccgcaggc cgagcagacc gtctcgcatt ccaccagcca agtgagttcc 5280
gggcggtcag ggtcaaaaac caggttgccc ccatgctttt tgatgcgttt cttaccttgg 5340
ctctccatga ggcggtgtcc cttctcggtg acgaagaggc tgtccgtgtc cccgtagacc 5400
gacttcaggg gcctgtcttc cagcggagtg cctctgtcct cctcgtagag aaactctgac 5460
cactctgaga cgaaggcccg cgtccaggcc aggacgaagg aggccacgtg ggaggggtag 5520
cggtcgttgt ccactagcgg gtccaccttc tccagggtgt gcaggcacat gtccccctcc 5580
tccgcgtcca gaaaagtgat tggcttgtag gtgtaggaca cgtgaccggg ggttcccaac 5640
gggggggtat aaaagggggt gggtgccctt tcatcttcac tctcttccgc atcgctgtct 5700
gcgagagcca gctgctgggg taagtattcc ctctcgaagg cgggcatgac ctcagcgctc 5760
aggttgtcag tttctaaaaa tgaggaggat ttgatgttca cctgtccgga ggtgatacct 5820
ttgagggtac ctgggtccat ctggtcagaa aacactattt ttttgttatc aagcttggtg 5880
gcgaatgacc cgtagagggc gttggagagc agcttggcga tggagcgcag ggtctggttt 5940
ttgtcgcggt cggctcgctc cttggccgcg atgttgagtt gcacgtactc gcgggccacg 6000
cacttccact cggggaacac ggtggtgcgc tcgtctggga tcaggcgcac cctccagccg 6060
cggttgtgca gggtgaccat gtcgacgctg gtggcgacct caccgcgcag acgctcgttg 6120
gtccagcaga ggcggccgcc cttgcgcgag cagaaggggg gtagggggtc cagctggtcc 6180
tcgtttgggg ggtccgcgtc gatggtaaag accccgggga gcaggcgcgg gtcaaagtag 6240
tcgatcttgc aagcttgcat gtccagagcc cgctgccatt cgcgggcggc gagcgcgcgc 6300
tcgtaggggt tgaggggcgg gccccagggc atggggtggg tgagcgcgga ggcgtacatg 6360
ccgcagatgt catacacgta caggggttcc ctgaggatac cgaggtaggt ggggtagcag 6420
cgccccccgc ggatgctggc gcgcacgtag tcatagagct cgtgggaggg ggccagcatg 6480
ttgggcccga ggttggtgcg ctgggggcgc tcggcgcgga agacgatctg cctgaagatg 6540
gcgtgggagt tggaggagat ggtgggccgc tggaagacgt tgaagcttgc ttcttgcaag 6600
cccacggagt ccctgacgaa ggaggcgtag gactcgcgca gcttgtgcac cagctcggcg 6660
gtgacctgga cgtcgagcgc acagtagtcg agggtctcgc ggatgatgtc atacctatcc 6720
tcccccttct ttttccacag ctcgcggttg aggacgaact cttcgcggtc tttccagtac 6780
tcttggaggg gaaacccgtc cgtgtccgaa cggtaagagc ctagcatgta gaactggttg 6840
acggcctggt aggggcagca gcccttctcc acgggcagcg cgtaggcctg cgccgccttg 6900
cggagggagg tgtgggtgag ggcgaaagtg tccctgacca tgactttgag gtattgatgt 6960
ctgaagtctg tgtcatcgca gccgccctgt tcccacaggg tgtagtccgt gcgctttttg 7020
gagcgcgggt tgggcaggga gaaggtgagg tcattgaaga ggatcttccc cgctcgaggc 7080
atgaagtttc tggtgatgcg aaagggccct gggaccgagg agcggttgtt gatgacctgg 7140
gcggccagga cgatctcgtc aaagccgttt atgttgtgtc ccacgatgta gagctccagg 7200
aagcggggct ggcccttgat ggaggggagc tttttaagtt cctcgtaggt aagctcctcg 7260
ggcgattcca ggccgtgctc ctccagggcc cagtcttgca agtgagggtt ggccgccagg 7320
aaggatcgcc agaggtcgcg ggccatgagg gtctgcaggc ggtcgcggaa ggttctgaac 7380
tgccgcccca cggccatttt ttcgggggtg atgcagtaga aggtgagggg gtctttctcc 7440
caggggtccc atctgagctc tcgggcgagg tcgcgcgcgg cagcgaccag agcctcgtcg 7500
ccccccagtt tcatgaccag catgaagggc acgagttgct tgccaaaggc tcccatccaa 7560
gtgtaggttt ctacatcgta ggtgacaaag aggcgctccg tgcgaggatg agagccgatt 7620
gggaagaact ggatctcccg ccaccagttg gaggattggc tgttgatgtg gtgaaagtag 7680
aagtcccgtc tgcgggccga gcactcgtgc tggcttttgt aaaagcgacc gcagtactgg 7740
cagcgctgca cgggttgtat atcttgcacg aggtgaacct ggcgacctct gacgaggaag 7800
cgcagcggga atctaagtcc cccgcctggg gtcccgtgtg gctggtggtc ttttactttg 7860
gttgtctggc cgccagcatc tgtctcctgg agggcgatgg tggaacagac caccacgccg 7920
cgagagccgc aggtccagat ctcggcgctc ggcgggcgga gtttgatgac gacatcgcgc 7980
acattggagc tgtccatggt ctccagctcc cgcggcggca ggtcagccgg gagttcctgg 8040
aggttcacct cgcagagacg ggtcaaggcg cggacagtgt tgagatggta tctgatttca 8100
aggggcatgt tggaggcgga gtcgatggct tgcaggaggc cgcagccccg gggggccacg 8160
atggttcccc gcggggcgcg aggggaggcg gaagctgggg gtgtgttcag aagcggtgac 8220
gcgggcgggc ccccggaggt agggggggtt ccggccccac aggcatgggc ggcaggggca 8280
cgtcttcgcc gcgcgcgggc aggggctggt gctggctccg aagagcgctt gcgtgcgcga 8340
cgacgcgacg gttggtgtcc tgtatctggc gcctctgagt gaagaccacg ggtcccgtga 8400
ccttgaacct gaaagagagt tcgacagaat caatctcggc atcgttgaca gcggcctggc 8460
gcaggatctc ctgcacgtcg cccgagttgt cctggtaggc gatttctgcc atgaactgct 8520
cgatctcttc ctcctggaga tctcctcgtc cggcgcgctc cacggtggcc gccaggtcgt 8580
tggagatgcg acccatgagc tgcgagaagg cgttgagtcc gccctcgttc cagacccggc 8640
tgtagaccac gcccccctcg gcgtcgcggg cgcgcatgac cacctgggcc aggttgagct 8700
ccacgtgtcg cgtgaagacg gcgtagttgc gcaggcgctg gaaaaggtag ttcagggtgg 8760
tggcggtgtg ctcggcgacg aagaagtaca tgacccagcg ccgcaacgtg gattcattga 8820
tgtcccccaa ggcctccagg cgctccatgg cctcgtagaa gtccacggcg aagttgaaaa 8880
actgggagtt gcgagcggac acggtcaact cctcctccag aagacggatg agctcggcga 8940
cagtgtcgcg cacctcgcgc tcgaaggcca cggggggcgc ttcttcctct tccacctctt 9000
cttccatgat tgcttcttct tcttcctcag ccgggacggg agggggcggc ggcgggggag 9060
gggcgcggcg gcggcggcgg cgcaccggga ggcggtcgat gaagcgctcg atcatctccc 9120
cccgcatgcg gcgcatggtc tcggtgacgg cgcggccgtt ctcccggggg cgcagctcga 9180
agacgccgcc tctcatttcg ccgcggggcg ggcggccgtg aggtagcgag acggcgctga 9240
ctatgcatct taacaattgc tgtgtaggta cgccgccaag ggacctgatt gagtccagat 9300
ccaccggatc cgaaaacctt tggaggaaag cgtctatcca gtcgcagtcg caaggtaggc 9360
tgagcaccgt ggcgggcggg ggcgggtcgg gagagttcct ggcggagatg ctgctgatga 9420
tgtaattaaa gtaggcggtc ttgagaaggc ggatggtgga caggagcacc atgtctttgg 9480
gtccggcctg ttggatgcgg aggcggtcgg ccatgcccca ggcctcgttc tgacaccggc 9540
gcaggtcttt gtagtaatct tgcatgagtc tttccaccgg cacttcttct ccttcctctt 9600
cttcatctcg ccggtggttt ctcgcgccgc ccatgcgcgt gaccccaaag cccctgagcg 9660
gctgcagcag ggccaggtcg gcgaccacgc gctcggccaa gatggcctgc tgtacctgag 9720
tgagggtcct ctcgaagtca tccatgtcca cgaagcggtg gtaggcaccc gtgttgatgg 9780
tgtaggtgca gttggccatg acggaccagt tgacggtctg gtgtcccggc tgcgagagct 9840
ccgtgtaccg caggcgcgag aaggcgcggg aatcgaacac gtagtcgttg caagtccgca 9900
ccagatactg gtagcccacc aggaagtgcg gcggaggttg gcgatagagg ggccagcgct 9960
gggtggcggg ggcgccgggc gccaggtctt ccagcatgag gcggtggtat ccgtagatgt 10020
acctggacat ccaggtgatg cctgcggcgg tggtggtggc gcgcgcgtag tcgcggaccc 10080
ggttccagat gtttcgcagg ggcgagaagt gttccatggt cggcacgctc tggccggtga 10140
ggcgcgcgca gtcgttgacg ctctatacac acacaaaaac gaaagcgttt acagggcttt 10200
cgttctgtag cctggaggaa agtaaatggg ttgggttgcg gtgtgccccg gttcgagacc 10260
aagctgagct cagccggctg aagccgcagc taacgtggta ttggcagtcc cgtctcgacc 10320
caggccctgt atcctccagg atacggtcga gagccctttt gctttcttgg ccaagcgccc 10380
gtggcgcgat ctgggataga tggtcgcgat gagaggacaa aagcggctcg cttccgtagt 10440
ctggagaaac aatcgccagg gttgcgttgc ggcgtacccc ggttcgagcc cctatggcgg 10500
cttggatcgg ccggaaccgc ggctaacgtg ggctgtggca gccccgtcct caggaccccg 10560
ccagccgact tctccagtta cgggagcgag ccccttttgt ttttttattt tttagatgca 10620
tcccgtgctg cggcagatgc gcccctcgcc ccggcccgat cagcagcagc aacagcaggc 10680
atgcagaccc ccctctcctc tccccgcccc ggtcaccacg gccgcggcgg ccgtgtccgg 10740
tgcggggggc gcgctggagt cagatgagcc accgcggcgg cgacctaggc agtatctgga 10800
cttggaagag ggcgagggac tggcgcggct gggggcgagc tctccagagc gccacccgcg 10860
ggtgcagttg aaaagggacg cgcgtgaggc gtacctgccg cggcaaaacc tgtttcgcga 10920
ccgcgggggc gaggagcccg aggagatgcg ggactgcagg ttccaagcgg ggcgcgagct 10980
gcgccgcggc ttggacagac agcgcctgct gcgcgaggag gactttgagc ccgacacgca 11040
gacgggcatc agccccgcgc gcgcgcacgt ggccgcggcc gacctggtga ccgcctacga 11100
gcagacggtg aaccaggagc gcaacttcca aaaaagcttc aacaaccacg tgcgcacgct 11160
ggtggcgcgc gaggaggtga ccctgggtct catgcatctg tgggacctgg tggaggcgat 11220
cgtgcagaac cccagcagca agcccctgac cgcgcagctg ttcctggtgg tgcagcacag 11280
cagggacaac gaggccttca gggaggcgct gctgaacatc accgagccgg aggggcgctg 11340
gctcctggac ctgataaaca tcctgcagag catagtggtg caggagcgca gcctgagcct 11400
ggccgagaag gtggcggcca ttaactattc tatgctgagc ctgggcaagt tctacgctcg 11460
caagatctac aagaccccct acgtgcccat agacaaggag gtgaagatag acagcttcta 11520
catgcgcatg gcgctgaagg tgctaaccct gagcgacgac ctgggagtgt accgcaacga 11580
gcgcatccac aaggccgtga gcgccagccg gcggcgcgag ctgagcgacc gcgaactgat 11640
gcacagtctg cagcgcgcgc tgaccggcgc gggcgagggc gacagggagg tcgagtccta 11700
ctttgacatg ggggccgacc tgcactggca gccgagccgc cgcgccctgg aagcggcggg 11760
ggcgtacggc ggccccctgg cggccgatga cgaggaagag gaggactatg agctagagga 11820
gggcgagtac ctggaggact gacctggctg gtggtgtttt ggtatagatg caagatccga 11880
acgtggcgga cccggcggtc cgggcggcgc tgcagagcca gccgtccggc attaactcct 11940
ctgacgactg ggccgcggcc atgggtcgca tcatggccct gaccgcgcgc aaccccgagg 12000
ccttcaggca gcagcctcag gctaaccggc tggcggccat cttggaagcg gtagtgcccg 12060
cgcgctccaa ccccacccac gagaaggtgc tggccatagt caacgcgctg gcggagagca 12120
gggccatccg ggcagacgag gccggactgg tgtacgatgc gctgctgcag cgggtggcgc 12180
ggtacaacag cggcaacgtg cagaccaacc tggaccgcct ggtgacggac gtgcgcgagg 12240
ccgtggcgca gcgcgagcgc ttgcatcagg acggcaacct gggctcgctg gtggcgctaa 12300
acgccttcct tagcacccag ccggccaacg taccgcgggg gcaggaggac tacaccaact 12360
tcttgagcgc gctgcggctg atggtgaccg aggtccctca gagcgaggtg taccagtcgg 12420
ggcccgacta cttcttccag accagcagac agggcttgca aaccgtgaac ctgagccagg 12480
ctttcaagaa cctgcggggg ctgtggggag tgaaggcgcc caccggcgac cgggctacgg 12540
tgtccagcct gctaaccccc aactcgcgcc tgctgctgct gctgatcgcg cccttcacgg 12600
acagcgggag cgtctcgcgg gagacctatc tgggccacct gctgacgctg taccgcgagg 12660
ccatcgggca ggcgcaggtg gacgagcaca ccttccagga gatcaccagc gtgagccacg 12720
cgctggggca ggaggacacg ggcagcctgc aggcgaccct gaactacctg ctgaccaaca 12780
ggcggcagaa gattcccacg ctgcacagcc tgacccagga ggaggagcgc atcttgcgct 12840
acgtgcagca gagcgtgagc ctgaacctga tgcgcgacgg cgtgacgccc agcgtggcgc 12900
tggacatgac cgcgcgcaac atggaaccgg gcatgtacgc ttcccagcgg ccgttcatca 12960
accgcctgat ggactacttg catcgggcgg cggccgtgaa ccccgagtac ttcaccaatg 13020
ccattctgaa tccccactgg atgccccctc cgggtttcta caacggggac ttcgaggtgc 13080
ctgaggtcaa cgatgggttc ctctgggatg acatggatga cagtgtgttc tcccccaacc 13140
cgctgcgcgc cgcgtctctg cgattgaagg agggctctga cagggaagga ccaaggagtc 13200
tggcctcctc cctggctctg ggggcggtgg gcgccacggg cgcggcggcg cggggcagca 13260
gccccttccc cagcctggcg gactctctga atagcgggcg ggtgagcagg ccccgcttgc 13320
taggcgagga ggagtatctg aacaactccc tgctgcagcc cgtgagggac aaaaacgctc 13380
agcggcagca gtttcccaac aatgggatag agagcctggt ggacaagatg tccagatgga 13440
agacgtatgc gcaggagtac aaggagtggg aggaccgcca gccgcggccc ctgccgcccc 13500
ctagacagcg ctggcagcgg cgcgcgtcca accgccgctg gaggcagggg cccgaggacg 13560
atgatgactc tgcagatgac agcagcgtgt tggacctggg cgggagcggg aacccctttt 13620
cgcacctgcg cccacgcctg ggcaagatgt tttaaaagag aaaaataaaa actcaccaag 13680
gccatggcga cgagcgttgg ttttttgttc ccttccttag tatgcggcgc gcggcgatgt 13740
tcgaggaggg gcctcccccc tcttacgaga gcgcgatggg aatttctcct gcggcgcccc 13800
tgcagcctcc ctacgtgcct cctcggtacc tgcaacctac aggggggaga aatagcatct 13860
gttactctga gctgcagccc ctgtacgata ccaccagact gtacctggtg gacaacaagt 13920
ccgcggacgt ggcctccctg aactaccaga acgaccacag cgattttttg accacggtga 13980
tccaaaacaa cgacttcacc ccaaccgagg ccagtaccca gaccataaac ctggacaaca 14040
ggtcgaactg gggcggcgac ctgaagacta tcctgcacac caatatgccc aacgtgaacg 14100
agttcatgtt caccaactct tttaaggcgc gggtgatggt ggcgcgcgag cagggggagg 14160
cgaagtacga gtgggtggac ttcacgctgc ccgagggcaa ctactcagag accatgactc 14220
tcgacctgat gaacaatgcg atcgtggaac actatctgaa agtgggcagg cagaacgggg 14280
tgaaggagag cgatatcggg gtcaagtttg acaccagaaa cttccgtctg ggctgggacc 14340
ctgtgaccgg gctggtcatg ccgggggtct acaccaacga ggcctttcat cccgatatag 14400
tgctcctgcc cggctgtggg gtggacttca cccagagccg gctgagcaac ctgctgggcg 14460
ttcgcaagcg gcaacctttc caggagggtt tcaagatcac ctatgaggat ctggaggggg 14520
gcaacattcc cgcgctcctt gatctggacg cctacgagga gagcttgaaa cccgaggaga 14580
gcgctggcga cagcggcgag agtggcgagg agcaagccgg cggcggcggc agcgcgtcgg 14640
tagaaaacga aagtactccc gcagtggcgg cggacgctgc ggaggtcgag ccggaggcca 14700
tgcagcagga cgcagaggag ggcgcgcagg aggacatgaa caatggggag atcaggggcg 14760
acactttcgc cacccggggc gaagaaaaag aggcagaggc ggcggcggcg acggcggaag 14820
ccgaaaccga ggcagaggca gagcccgaga ccgaagttat ggaagacatg aatgatggag 14880
aacgtagggg tgacacgttt gccacccggg gcgaagagaa ggcggcggag gcagaagccg 14940
cggctgagga ggcggctgcg gctgcggcca aggctgaggc tgcggctgag gctaaggtcg 15000
aagccgatgt tgcggttgag gctcaggctg aggaggaggc ggcggctgaa gcagttaagg 15060
aaaaggccca ggcagagcag gaagagaaaa aacctgtcat tcaacctcta aaagaagata 15120
gcaaaaagcg cagttacaac gtcattgagg gcagcacctt tacccaatac cgcagctggt 15180
acctggctta caactacggc gacccggtca agggggtgcg ctcgtggacc ctgctctgca 15240
cgccggacgt cacctgcggc tccgagcaga tgtactggtc gctgccaaac atgatgcaag 15300
acccggtgac cttccgttcc acgcggcagg ttagcaactt tccggtggtg ggcgccgaac 15360
tgctgccagt acactccaag agtttttaca acgagcaggc cgtctactcc cagctgatcc 15420
gccaggccac ctctctgacc cacgtgttca atcgctttcc cgagaaccag attttggcgc 15480
gcccgccggc ccccaccatc accaccgtca gtgaaaacgt tcctgccctc acagatcacg 15540
ggacgctacc gctgcgcaac agcatctcag gagtccagcg agtgaccatt actgacgcca 15600
gacgccggac ctgcccctac gtttacaagg ccttgggcat agtctcgccg cgcgtcctct 15660
ccagtcgcac tttttaaaac acatccaccc acacgctcca aaatcatgtc cgtactcatc 15720
tcgcccagca acaacaccgg ctgggggctg cgcgcaccca gcaagatgtt tggaggggca 15780
aggaagcgct ccgaccagca ccccgtgcgc gtgcgcggcc actaccgcgc gccctggggt 15840
gcgcacaagc gcgggcgcac agggcgcacc actgtggatg atgtcattga ctccgtagtg 15900
gagcaggcgc gccactacac acccggcgcg ccgaccgcct ccgccgtgtc caccgtggac 15960
caggcgatcg aaagcgtggt acagggggcg cggcactatg ccaaccttaa aagtcgccgc 16020
cgccgcgtgg cgcgccgcca tcgccggaga ccccgggcta ctgccgccgc gcgccttacc 16080
aaggctctgc tcaagcgcgc caggcgaact ggccaccggg ccgccatgag ggccgcacgg 16140
cgggctgccg ctgccgcgag cgccgtggcc ccgcgggcac gaaggcgcgc ggccgctgcc 16200
gccgccgccg ccatttccag cttggcctcg acgcggcgcg gtaacatata ctgggtgcgc 16260
gactcggtga gcggcacacg tgtgcccgtg cgctttcgcc ccccacggaa ttagcacaag 16320
acaacataca cactgagtct cctgctgttg tgtatcccag cggcgaccgt cagcagcggc 16380
gacatgtcca agcgcaaaat taaagaagag atgctccagg tcatcgcgcc ggagatctat 16440
gggcccccga agaaggagga ggaggattac aagccccgca agctaaagcg ggtcaaaaag 16500
aaaaagaaag atgatgacgt tgacgaggcg gtggagtttg tccgccgcat ggcgcccagg 16560
cgccctgtgc agtggaaggg tcggcgcgtg cagcgagtcc tgcgccccgg caccgcggtg 16620
gtctttacgc ccggcgagcg ttccacgcgc actttcaagc gggtgtacga tgaggtgtac 16680
ggcgacgagg atctgttgga gcaggccaac catcgatttg gggagtttgc atatgggaaa 16740
cggcctcgcg agagtctaaa agaggacctg ctggcgctac cgctggacga gggcaatccc 16800
accccgagtc tgaagccggt gaccctgcaa caggtgctgc ctttgagcgc gcccagcgag 16860
cagaagcgag ggttaaagcg cgagggcggg gacctggcac ccaccgtgca gttgatggtg 16920
cccaagcggc agaagctgga ggacgtgctg gagaaaatga aagtagagcc cgggatccag 16980
cccgagatca aggtccgccc tatcaagcag gtggcgcccg gcgtgggagt ccagaccgtg 17040
gacgttagga ttcccacgga ggagatggaa acccaaaccg ccactccctc ttcggcagca 17100
agcgccacca ccggcgccgc ttcggtagag gtgcagacgg acccctggct acccgccgcc 17160
actatcgccg tcgccgccgc cccccgttcg cgcggacgca agagaaatta tccagcggcc 17220
agcgcgctta tgccccagta tgcgctgcat ccatccatcg cgcccacccc cggctaccgc 17280
gggtactcgt accgcccgcg cagatcagcc ggcactcgcg gccgccgccg ccgtgcgacc 17340
acaaccagcc gccgccgtcg ccgccgccgc cagccagtgc tgacccccgt gtctgtaagg 17400
aaggtggctc gctcggggag cacgctggtg gtgcccagag cgcgctacca ccccagcatc 17460
gtttaaagcc ggtctctgta tggttcttgc agatatggcc ctcacttgtc gccttcgctt 17520
cccggtgccg ggataccgag gaagaactca ccgccgcagg ggcatggcgg gcagcggtct 17580
ccgcggcggc cgtcgccatc gccggcgcgc aaagagcagg cgcatgcgcg gcggtgtgtt 17640
gcccctgctg gtcccgctac tcgccgcggc gatcggcgcc gtgcccggga tcgcctccgt 17700
ggccctgcag gcgtcccaga aacattgact cttgcaacct tgcaagcttg catttttgga 17760
ggaaaaaata aaaaagtcta gactctcacg ctcgcttggt cctgtgacta ttttgtagaa 17820
aaaagatgga agacatcaac tttgcgtcgc tggccccgcg tcacggctcg cgcccgttca 17880
tgggagactg gacagatatc ggcaccagca atatgagcgg tggcgccttc agctggggca 17940
gtctgtggag cggccttaaa aattttggtt ccaccattaa gaactatggc aacaaagcgt 18000
ggaacagcag cacgggtcag atgctgagag acaagttgaa agagcagaac ttccaggaga 18060
aggtggcgca gggcctggcc tctggcatca gcggggtggt ggacatagct aaccaggccg 18120
tgcagaaaaa gataaacagt catctggacc cccgccctca ggtggaggaa acgcctccag 18180
ccatggagac ggtgtctccc gagggcaaag gcgaaaagcg cccgcggccc gacagggaag 18240
agaccctggt gtcacacacc gaggagccgc cctcttacga ggaggcagtc aaggccggcc 18300
tgcccaccac tcgccccata gctcccatgg ccaccggtgt ggtgggtcac aggcaacaca 18360
cccccgcaac actagatctg cccccgccgt ccgagccgac tcgccagcca aaggcggtga 18420
cggtgtccgc tccctccact tccgccgcca acagagtgcc tctgcgccgc gctgcgagcg 18480
gcccccgggc ctcgcgagtc agcggcaact ggcagagcac actgaacagc atcgtgggcc 18540
tgggagtgag gagtgtgaag cgccgccgtt gctactgaat gagcaagcta gctaacgtgt 18600
tgtatgtgtg tatgcgtcct atgtcgccgc cagaggagct gttgagccgc cggcgccgtc 18660
tgcactccag cgaatttcaa gatggcgacc ccatcgatga tgcctcagtg gtcgtacatg 18720
cacatctcgg gccaggacgc ttcggagtac ctgagccccg ggctggtgca gttcgcccgc 18780
gccacagaca cctacttcaa catgagtaac aagttcagga accccactgt ggcgcccacc 18840
cacgatgtga ccacggaccg gtcgcagcgc ctgacgctgc ggttcatccc cgtggatcgg 18900
gaggacaccg cttactctta caaggcgcgg ttcacgctgg ccgtgggcga caaccgcgtg 18960
ctggacatgg cctccactta ctttgacatc cggggggtgc tggacagggg ccccactttt 19020
aagccctact cgggcactgc ctacaacccc ctggccccca agggcgcccc caattcttgt 19080
gagtgggaac aagaggaaaa tcaggtggtc gctgcagatg atgaacttga agatgaagaa 19140
gcgcaagcac aagaggaagc ccctgtgaaa aaaattcatg tatatgctca ggcgcctctt 19200
tctggcgaaa agattaccaa ggatggtttg caaataggta ctgaagtcgt aggagataca 19260
tctaaggaca cttttgcaga taaaacattc caacccgaac ctcagatagg cgagtctcag 19320
tggaacgagg ctgatgccac agtagcagga ggtagagttt tgaaaaagac tacccctatg 19380
agaccttgct atggatccta tgccaggcct accaatgcca acgggggtca aggaattatg 19440
gttgccaatg aacaaggagt gttggagtct aaagtagaaa tgcaattttt ctctaacacc 19500
acaaccctta atgcgcggga tggaaccggc aatcccgaac caaaggtggt gttgtacagc 19560
gaagatgtcc acttggaatc tcccgatact catctgtctt acaagcccaa aaaggatgat 19620
gttaatgcca aaatcatgtt gggtcagcaa gccatgccca acagacccaa cctcattgga 19680
tttagagata atttcattgg gcttatgttt tacaacagca ccggtaacat gggagtgctg 19740
gcgggtcagg cctctcagtt gaatgctgtg gtggacttgc aggatagaaa cacagaactg 19800
tcatatcagc ttatgcttga ttcaattggg gatagaacca gatacttctc catgtggaac 19860
caggcagtgg atagctatga tccagatgtc agaattattg aaaaccatgg gactgaggat 19920
gaactgccca actactgctt ccctttgggc ggcataggag ttactgatac ttatcaaggg 19980
ataaaaaata ccaatggcaa tggtcagtgg accaaagatg atcagttcgc ggaccgcaac 20040
gaaatagggg tgggaaacaa cttcgccatg gagatcaaca tccaggccaa cctttggaga 20100
aacttcctct atgcaaacgt ggggctctac ctgccagaca agctcaagta caaccccacc 20160
aacgtggaca tctctgacaa ccccaacacc tatgactaca tgaacaagcg ggtggtggcc 20220
cctggcctgg tggactgctt tgtcaatgtg ggagccaggt ggtccctgga ctacatggac 20280
aacgtcaacc ccttcaacca ccaccgcaat gcgggtctgc gctaccgctc catgatcctg 20340
ggcaacgggc gctatgtgcc ctttcacatc caggtacccc agaagttctt tgccatcaag 20400
aacctcctgc tcctgcccgg ctcctacacc tacgagtgga acttcaggaa ggatgtgaac 20460
atggtcctac agagctctct gggcaatgac cttagggtgg atggggccag catcaagttt 20520
gacagcatca ccctctatgc tacatttttc cccatggccc acaacaccgc ctccacgctt 20580
gaggccatgc tgagaaacga caccaacgac cagtccttta atgactacct ctctggggcc 20640
aacatgctct acccaatccc agccaaggcc accaacgtgc ccatctccat cccctctcgc 20700
aactgggccg cctttagagg ctgggccttt acccgcctta agaccaagga gaccccctcc 20760
ctgggctcgg gttttgatcc ctactttgtt tactcgggat ccatccccta cctggatggc 20820
accttctacc tcaaccacac tttcaagaag atatccatca tgtatgactc ctccgtcagc 20880
tggccgggca acgaccgctt gctcaccccc aatgagttcg aggtcaagcg cgccgtggac 20940
ggcgagggct acaacgtggc ccagtgcaac atgaccaagg actggttcct ggtgcagatg 21000
ctggccaact acaacatagg ctaccagggc ttttacatcc cagagagcta caaggacagg 21060
atgtactcct tcttcagaaa tttccaaccc atgagccgac aggtggtgga cgagaccaat 21120
tacaaggact atcaagccat tggcatcacc caccagcaca acaactcggg tttcgtgggc 21180
tacctggcgc ccaccatgcg cgagggtcag gcctaccccg ccaacttccc ctaccccttg 21240
ataggcaaga ccgcggtcga cagcgtcacc cagaaaaagt tcctctgcga ccgcaccctc 21300
tggcgcatcc ccttctctag caacttcatg tccatgggtg cgctcacgga cctgggccaa 21360
aacctgcttt atgccaactc tgcccatgcg ctggacatga cttttgaggt ggaccccatg 21420
gacgagccca cccttctcta tattgtgttt gaagtgttcg acgtggtcag agtgcaccag 21480
ccgcaccgcg gtgtcatcga gaccgtgtac ctgcgtacgc ccttctcagc cggcaacgcc 21540
accacctaag gagacagcgc cgccgccgcc tgcatgacgg gttccaccga gcaagagctc 21600
agggccattg ccagagacct gggatgcgga ccctattttt tgggcaccta tgacaaacgc 21660
ttcccgggct ttatctcccg agacaagctc gcctgcgcca ttgtcaacac ggccgcgcgc 21720
gagaccgggg gcgtgcactg gctggccttt ggctgggacc cgcgctccaa aacttgctac 21780
ctctttgacc cctttggctt ctccgatcag cgcctcaggc agatttatga gtttgagtac 21840
gaggggctgc tgcgccgcag cgcgctcgcc tcctcgcccg accgctgcat cacccttgag 21900
aagtccaccg aaaccgtgca ggggccccac tcggccgcct gcggtctctt ctgttgcatg 21960
tttttgcacg cctttgtgca ctggcctcag agtcccatgg attgcaaccc caccatgaac 22020
ttgctaaagg gagtgcccaa cgccatgctc cagagccccc aggtccagcc caccctgcgc 22080
cgcaaccagg aacagcttta ccgcttcctg gagcgccact ccccctactt ccgcagccac 22140
agcgcgcgca tccggggggc cacctctttt tgccacttgc aagaaaacat gcaagacgga 22200
aaatgatgta cagcatgctt ttaataaatg taaagactgt gcactttaat tatacacggg 22260
ctctttctgg ttatttattc aacaccgccg tcgccattta gaaatcgaaa gggttctgcc 22320
gtgcgtcgcc gtgcgccacg ggcagagaca cgttgcgata ctggaagcgg ctcgcccact 22380
tgaactcggg caccaccatg cggggcagtg gttcctcggg gaagttctcg ctccacaggg 22440
tgcgggtcag ctgcagcgcg ctcaggaggt cgggagccga gatcttgaag tcgcagttgg 22500
ggccggaacc ctgcgcgcgc gagttgcggt acacggggtt gcagcactgg aacaccagca 22560
gggccggatt attcacgctg gccagcaggc tctcgtcgct gatcatgtcg ctgtccagat 22620
cctccgcgtt gctcagggcg aatggggtca tcttgcagac ctgcctgccc aggaaaggcg 22680
ggagcccagg cttgccgttg cagtcgcagc gcaggggcat tagcaggtgc ccacggcccg 22740
actgcgcctg cgggtacaac gcgcgcatga aggcttcgat ctgcctaaaa gccacctggg 22800
tcttggctcc ctccgaaaag aacatcccac aggacttgct ggagaactgg ttcgcgggac 22860
agctggcatc gtgcaggcag cagcgcgcgt cagtgttggc aatctgcacc acgttgcgac 22920
cccaccggtt tttcactatc ttggccttgg aagcctgctc ctttagcgcg cgctggccgt 22980
tctcgctggt cacatccatc tctatcacct gttccttgtt gatcatgttt gtcccgtgca 23040
gacactttag gtcgccctcc gtctgggtgc agcggtgctc ccacagcgcg caaccggtgg 23100
gctcccaatt cttgtgggtc acccccgcgt aggcctgcag gtaggcctgc aggaagcgcc 23160
ccatcatggt cataaaggtc ttctggctcg taaaggtcag ctgcaggccg cgatgctctt 23220
cgttcagcca ggtcttgcag atggcggcca gcgcctcggt ctgctcgggc agcatcttaa 23280
aatttgtctt caggtcgtta tccacgtggt acttgtccat catggcacgc gccgcctcca 23340
tgcccttctc ccaggcggac accatgggca ggcttagggg gtttatcact tccagcggcg 23400
aggacaccgt actttcgatt tcttcttcct ccccctcttc ccggcgcgcg cccccgctgt 23460
tgcgcgctct taccgcctgc accaaggggt cgtcttcagg caagcgccgc accgagcgct 23520
tgccgccctt gacctgcttg atcagtaccg gcgggttgct gaagcccacc atggtcagcg 23580
ccgcctgctc ttcttcgtct tcgctgtcta ccactatttc tggggagggg cttctccgct 23640
ctgcggcaaa ggcggcggat cgcttctttt ttttcttggg agccgccgcg atggagtccg 23700
ccacggcgac cgaggtcgag ggcgtggggc tgggggtgcg cggtaccagg gcctcgtcgc 23760
cctcggactc ttcctctgac tccaggcggc ggcggagtcg cttctttggg ggcgcgcgcg 23820
tcagcggcgg cggagacggg gacggggacg gggacgggac gccctccaca gggggtggtc 23880
ttcgcgcaga cccgcggccg cgctcggggg tcttctcgcg ctggtcttgg tcccgactgg 23940
ccattgtatc ctcctcctcc taggcagaga gacataagga gtctatcatg caagtcgaga 24000
aggaggagag cttaaccacc ccctcagaga ccgccgatgc gcccgccgtc gccgtcgccc 24060
ccgctaccgc cgacgcgccc gccacaccga gcgacacccc cacggacccc cccgccgacg 24120
cacccctgtt cgaggaagcg gccgtggagc aggacccggg ctttgtctcg gcagaggagg 24180
atttgcaaga ggaggagaat aaggaggaga agccctcagt gccaaaagat cataaagagc 24240
aagacgagca cgacgcagac gcacaccagg gtgaagtcgg gcggggggac ggagggcatg 24300
gcggcgccga ctacctagac gaaggaaacg acgtgctctt gaagcacctg catcgtcagt 24360
gcgccatcgt ctgcgacgct ctgcaggagc gcagcgaggt gcccctcagc gtggcggagg 24420
tcagccgcgc ctacgagctc agcctctttt ccccccgggt gcccccccgc cgccgcgaaa 24480
acggcacatg cgagcccaac ccgcgcctca acttctaccc cgcctttgtg gtgcccgagg 24540
tcctggccac ctatcacatc ttctttcaaa attgcaagat ccccatctcg tgccgcgcca 24600
accgtagccg cgccgataag atgctggccc tgcgccaggg cgaccacata cctgatatcg 24660
ccgctttgga agatgtgcca aagatcttcg agggtctggg gcgcaacgag aagcgggcag 24720
caaactctct gcaacaggaa aacagcgaaa atgagagtca cactggagcg ctggtggagc 24780
tggagggcga caacgcccgc ctggcggtgc tcaagcgcag catcgaggtc acccactttg 24840
cctaccccgc gctcaacctg ccccccaaag tcatgaacgc ggtcatggac gggctgatca 24900
tgcgccgcgg ccggcccctc gctccagatg caaacttgca tgaggagacc gaggacggtc 24960
agcccgtggt cagcgacgag cagctgacgc gctggctgga gagcgcggac cccgccgaac 25020
tggaggagcg gcgcaagatg atgatggccg cggtgctggt caccgtagag ctggagtgtc 25080
tgcagcgctt cttcggtgac cccgagatgc agagaaaggt cgaggagacc ctacactaca 25140
ccttccgcca gggctacgtg cgccaggctt gcaagatctc caacgtggag ctcagcaacc 25200
tggtgtccta cctgggcatc ttgcatgaaa accgccttgg gcagagcgtg ctacactcca 25260
ccctgcgcgg ggaggcgcgc cgcgactacg tgcgcgactg cgtttacctc ttcctctgct 25320
acacctggca gacggccatg ggggtctggc agcagtgcct ggaggagcgc aacctcaagg 25380
agctggagaa gcttctgcag cgcgcgctca aagacctctg gacgggcttc aacgagcgct 25440
cggtggccgc cgcgctagcc gacctcatct tccccgagcg cctgctcaaa accctccagc 25500
aggggctgcc cgacttcacc agccaaagca tgttgcaaaa ttttaggaac tttatcctgg 25560
agcgttctgg catcctaccc gccacctgct gcgccctgcc cagcgacttt gtccccctcg 25620
tgtaccgcga gtgccccccg ccgctgtggg gccactgcta cctgttccaa ctggccaact 25680
acctgtccta ccacgcggac ctcatggagg actccagcgg cgaggggctc atggagtgcc 25740
actgccgctg caacctctgc acgccccacc gctccctggt ctgcaacacc caactgctca 25800
gcgagagtca gattatcggt accttcgagc tacagggtcc gtcctcctca gacgagaagt 25860
ccgcggctcc ggggctaaaa ctcactccgg ggctgtggac ttccgcctac ctgcgcaaat 25920
ttgtacctga agactaccac gcccacgaaa tcaggtttta cgaggaccaa tcccgcccgc 25980
ccaaggcgga gctgaccgcc tgcgtcatca cccagggcga gatcctaggc caattgcaag 26040
ccatccaaaa agcccgccaa gagtttttgc tgaagagggg tcggggggtg tatctggacc 26100
cccagtcggg tgaggagctc aacccggttc ccccgctgcc accgccgcgg gaccttgctt 26160
cccaggataa gcatcgccat ggctcccaga aagaagcagc agcggccgcc gctgccgccg 26220
ccccacatgc tggaggaaga ggaggaatac tgggacagtc aggcagagga ggtttcggac 26280
gaggaggagc cggagacgga gatggaagag tgggaggagg acagcttaga cgaggaggct 26340
tccgaagccg aagaggcagg cgcaacaccg tcaccctcgg ccgcagcccc ctcgcaggcg 26400
cccccgaagt ccgctcccag catcagcagc aacagcagcg ctataacctc cgctcctcca 26460
ccgccgcgac ccacggccga ccgcagaccc aaccgtagat gggacaccac cggaaccggg 26520
gccggtaagt cctccgggag aggcaagcaa gcgcagcgcc aaggctaccg ctcgtggcgc 26580
gctcacaaga acgccatagt cgcttgcttg caagactgcg gggggaacat ctccttcgcc 26640
cgccgcttcc tgctcttcca ccacggtgtg gccttccccc gtaacgtcct gcattactac 26700
cgtcatctct acagccccta ctgcggcggc agtgagccag aggcggccag cggcggcggc 26760
gcccgtttcg gtgcctagga agacccaggg caagacttca gccaagaaac tcgcggcgac 26820
cgcggcgaac gcggtcgcgg gggccctgcg cctgacggtg aacgaacccc tgtcgacccg 26880
cgaactgagg aaccgaatct tccccactct ctatgccatc ttccagcaga gcagagggca 26940
ggatcaggaa ctgaaagtaa aaaacaggtc tctgcgctcc ctcacccgca gctgtctgta 27000
tcacaagagc gaagaccagc ttcggcgcac gctggaggac gctgaggcac tcttcagcaa 27060
atactgcgcg ctcactctta aggactagct ccgcgccctt ctcgaattta ggcgggaacg 27120
cctacgtcat cgcagcgccg ccgtcatgag caaggacatt cccacgccat acatgtggag 27180
ctatcagccg cagatgggac tcgcggcggg cgcctcccaa gactactcca cccgcatgaa 27240
ctggctcagt gccggcccac acatgatctc acaggttaat gacatccgca cccatcgaaa 27300
ccaaatattg gtgaagcagg cggcaattac caccacgccc cgcaataatc ccaaccccag 27360
ggagtggccc gcgtccctgg tgtatcagga aattcccggc cccaccaccg tactacttcc 27420
gcgtgattcc caggccgaag tccaaatgac taactcaggg gcacagctcg cgggcggctg 27480
tcgtcacagg gtgcggcctc ctcgccaggg tataactcac ctggagatcc gaggcagagg 27540
tattcagctc aacgacgagt cggtgagctc ctcgctcggt ctcagacctg acgggacctt 27600
ccagatagcc ggagccggcc gatcttcctt cacgccccgc caggcgtacc tgactctgca 27660
gagctcgtcc tcggcgccgc gctcgggcgg catcgggact ctccagttcg tgcaggagtt 27720
tgtgccctcg gtctacttca accccttctc gggctctccc ggtcgctacc cggaccagtt 27780
tatcccgaac tttgacgccg cgagggactc ggtggacggc tacgactgaa tgtcgggtgg 27840
acccggtgca gagcaacttc gcctgaagca ccttgaccac tgccgccgcc ctcagtgctt 27900
tgcccgctgt cagaccggtg agttccagta cttttccctg cccgactcgc acccggacgg 27960
cccggcgcac ggggtgcgct ttttcatccc gagtcaggtc cgctctaccc taatcaggga 28020
gttcaccgcc cgtcccctac tggcggagtt ggaaaagggg ccttctatcc taaccattgc 28080
ctgcatttgc tctaaccctg gattacacca agatctttgc tgtcatttgt gtgctgagta 28140
taataaaggc tgagatcaga atctactcgg accttatccc tttcaattga tcataactgt 28200
aatcaataaa aaatcactta cttgaaatct gatagcaaga ctctgtccaa ttttttcagc 28260
aacacttcct tcccctcctc ccaactctgg tactctaggc gcctcctagc tgcaaacttc 28320
ctccacagtc tgaagggaat gtcagattcc tcctcctgtc cctccgcacc cacgatcttc 28380
atgttgttac agatgaaacg cgcgagatcg tctgacgaga ccttcaaccc cgtgtacccc 28440
tacgataccg agatcgctcc gacttctgtc cctttcctta cccctccctt tgtatcatcc 28500
gcaggaatgc aagaaaatcc agctggggtg ctgtccctgc acctgtcaga gccccttacc 28560
acccacaatg gggccctgac tctaaaaatg gggggcggcc tgaccctgga caaggaaggg 28620
aatctcactt cccaaaacat caccagtgtc gatccccctc tcaaaaaaag caagaacaac 28680
atcagccttc agaccgccgc acccctcgcc gtcagctccg gggccctaac cctttttgcc 28740
actccccccc tagcggtcag tggcgacaac cttactgtgc agtctcaggc ccctcttact 28800
ttggaagact caaaactaac tctggccacc aaaggacccc taactgtgtc cgaaggcaaa 28860
cttgtcctag aaacagagcc tcccctgcat gcaagtgaca gcagtagcct gggccttagc 28920
gtcacggccc cacttagcat taacaatgac agcctaggac tagacatgca agcgcccatc 28980
agctctcgag atggaaaact ggctctaaca gtggcggccc ccctaactgt ggccgagggt 29040
atcaatgctt tggcagtagc cacaggtaat ggtattggac taaatgaaac caacacacac 29100
ctgcaggcaa aactggtcgc gcccctaggc tttgatacca acggcaacat taagctaagc 29160
gtcgcaggag gcatgaggct aaacaataac acactgatac tagatgtaaa ctacccattt 29220
gaggctcaag gccaactgag cctaagagtg ggctcgggcc cactatatgt agattctagt 29280
agtcataacc taaccattag atgccttagg ggattgtatg taacatcttc taacaaccaa 29340
aacggtctag aggccaacat taaactaaca aaaggccttg tgtatgacgg aaatgccata 29400
gcagttaatg ttggcaaagg gctggaatac agccctactg gcacaacaga aaaacctata 29460
cagactaaaa taggtctagg catggagtat gacactgagg gagccatgat gacaaaacta 29520
ggctctggac taagctttga caattcagga gccattgtgg tgggaaacaa aaatgatgac 29580
aggcttactt tgtggaccac accggaccca tcgcccaact gtcagattta ctctgaaaaa 29640
gatgctaaac taaccttggt actgactaaa tgtggcagtc aggttgtagg cacagtatct 29700
attgccgctc ttaaaggtag ccttgtgcca atcactagtg caatcagtgt ggttcagata 29760
tacctaaggt ttgatgaaaa tggggtgctg atgagtaact cttcacttaa tggcgaatac 29820
tggaatttta gaaacggaga ctcaactaat ggcacaccat atacaaacgc agtgggtttt 29880
atgcctaatc tactggccta tcctaaaggt caaactacaa ctgcaaaaag taacattgtc 29940
agccaggtct acatgaacgg ggacgatact aaacccatga catttacaat caacttcaat 30000
ggccttagtg aaacagggga tacccctgtc agtaaatatt ccatgacatt ctcatggagg 30060
tggccaaatg gaagctacat agggcacaat tttgtaacaa actcctttac tttctcctac 30120
atcgcccaag aataaagaaa gcacagagat gcttgttttt gatttcaaaa ttgtgtgctt 30180
ttatttattt tcaagcttac agtatttcca gtagtcatta gaatagagct taattaaact 30240
gcatgagaac ccttccacat agcttaaatt atcaccagtg caaatggaaa aaaatcaaca 30300
taccttttta tccagatatc aaagaactct agtggtcagt tttcccccac cctcccagct 30360
cacagaatac acagtccttt ccccccggct ggctttaaac aacactatct cattggtaac 30420
agacatattt ttaggtgtaa taatccacac ggtctcttgg cgggccaaac gctggtctgt 30480
gatgttaata aactccccag gcagctcttt caagttcacg tcgctgtcca actgctgaag 30540
cgctcgcggc tccgactgcg cctctagcgg aggcaacggc agcacccgat ccttgatcta 30600
taaaggagta gagtcataat cccccataag aatagggcgg tgatgcagca acaaggcgcg 30660
cagcaactcc tgccgccgcc tctccgtacg acaggaatgc aacggggtgg tggtctcctc 30720
cgcgataatc cgcaccgctc gcagcatcag catcctcgtc ctccgggcac agcagcgcat 30780
cctgatctca ctgagatcgg cgcagtaagt gcagcacaac accaagatgt tatttaagat 30840
cccacagtgc aaagcactgt acccaaagct catggcggga aggacagccc ccacgtgacc 30900
atcgtaccag atcctcaggt aaatcaaatg acgacctctc ataaacacgc tggacatata 30960
catcacctcc ttgggcatga gctgattcac cacctctcga taccacaggc atcgctgatt 31020
aattaaagac ccctcgagca ccatcctgaa ccaggaagcc agcacctgac cccccgccag 31080
gcactgcagg gaccccggtg aatcgcagtg gcagtgaaga ctccagcgct cgtagccgtg 31140
aaccatagag ctggtcatta tatccacatt ggcacaacac agacacactt tcatacactt 31200
tttcatgatt agcagctcct ctctagtcaa gaccatatcc caaggaatca cccactcttg 31260
aatcaaggta aatcccacac agcagggcag gcctctcaca taactcacgt tatgcatagt 31320
gagcgtgtcg caatctggaa ataccggatg atcttccatc accgaagccc gggtctccgt 31380
ctcaaaggga ggtaaacggt ccctcgtgta gggacagtgg cgggataatc gagatcgtgt 31440
tgaacgtaga gtcatgccaa agggaacagc ggacgtactc atatttcctc cagcagaacc 31500
aagtgcgcgc gtggcagcta tccctgcgtc ttctgtctcg ccgcctgccc cgctcggtgt 31560
agtagttgta atacagccac tccctcagac cgtcaaggcg ctccctggcg tccggatcta 31620
taacaacacc gtcctgcagc gccgccctga tgacatccac caccgtagag tatgccaagc 31680
ccagccacga aatgcactca ctttgacagc gagagatagg aggagcggga agagatggaa 31740
gaaccatgat agtaaaagaa cttttattcc aatcgatcct ctacaatgtc aaagtgtaga 31800
tctatcagat ggcactggtc tcctccgctg agtcgatcaa aaataacagc taaaccacaa 31860
acaacacgat tggtcaaatg ctgcacaagg gcttgcagca taaaatcgcc tcgaaagtcc 31920
accgcaagca taacatcaaa gccaccgccc ctatcatgat ctatgataaa aaccccacag 31980
ctatccacca gacccatata gttttcatct ctccatcgtg aaaaaatatt tacaagctcc 32040
tcctttaaat cacctccaac caattcaaaa agttgagcca gaccgccctc caccttcatt 32100
ttcagcatgc gcatcatgat tgcaaaaatt caggctcctc agacacctgt ataagattga 32160
gaagcggaac gttaacatca atgtttcgct cgcgaagatc gcgcctcagt gcaagcatga 32220
tataatccca caggtcggag cggatcagcg aggacatctc cccgccagga accaactcaa 32280
cggagcctat gctgattata atacgcatat tcggggctat gctaaccagc acggccccca 32340
aataggcgta ctgcataggc ggcgacaaaa agtgaacagt ttgggttaaa aaatcaggca 32400
aacactcgcg caaaaaagca agaacatcat aaccatgctc atgcaaatag atgcaagtaa 32460
gctcaggaac gaccacagaa aaatgcacaa tttttctctc aaacatgact gcgagccctg 32520
caaaaaataa aaaagaaaca ttacacaaga gtagcctgtc ttacaatggg atagactact 32580
ctaaccaaca taagacgggc cacgacatcg cccgcgtggc cataaaaaaa attatccgtg 32640
tgattaaaaa gaagcacaga tagctggcca gtcatatccg gagtcatcac gtgcgaaccc 32700
gtgtagaccc ccgggttgga cacatcggcc aaacaaagaa agcggccaat gtatcccgga 32760
ggaatgataa cactaagacg aagatacaac agaataaccc catggggggg aataacaaag 32820
ttagtaggtg aataaaaacg ataaacaccc gaaactccct cctgcgtagg caaaatagcg 32880
ccctcccctt ccaaaacaac atacagcgct tccacagcag ccatgacaaa agactcaaaa 32940
cactcaaaag actcagtctt accaggaaaa taaaagcact ctcacagcac cagcactaat 33000
cagagtgtga agagggccaa gtgccgaacg agtatatata ggaattaaaa atgacgtaaa 33060
tgtgtaaagg tcaaaaaacg cccagaaaaa tacacagacc aacgcccgaa acgaaaaccc 33120
gcgaaaaaat acccagaagt tcctcaacaa ccgccacttc cgctttccca cgatacgtca 33180
cttcctcaaa aatagcaaac tacatttccc acatgtacaa aaccaaaacc cctccccttg 33240
tcaccgccca caacttacat aatcacaaac gtcaaagcct acgtcacccg ccccgcctcg 33300
ccccgcccac ctcattatca tattggcctc aatccaaaat aaggtatatt attgatgatg 33360
<210> 73
<211> 33360
<212> DNA
<213> Artificial Sequence
<220>
<223> GADNOU20 GAG (DE1DE3)
<400> 73
catcatcaat aatatacctt attttggatt gaggccaata tgataatgag gtgggcgggg 60
cgaggcgggg cgggtgacgt aggacgcgcg agtagggttg ggaggtgtgg cggaagtgtg 120
gcatttgcaa gtgggaggag ctgacatgca atcttccgtc gcggaaaatg tgacgttttt 180
gatgagcgcc gcctacctcc ggaagtgcca attttcgcgc gcttttcacc ggatatcgta 240
gtaattttgg gcgggaccat gtaagatttg gccattttcg cgcgaaaagt gaaacgggga 300
agtgaaaact gaataatagg gcgttagtca tagcgcgtaa tatttaccga gggccgaggg 360
actttgaccg attacgtgga ggactcgccc aggtgttttt tacgtgaatt tccgcgttcc 420
gggtcaaagt ctccgttttt attgtcgccg tcatctgacg ggccgccatt gcatacgttg 480
tatccatatc ataatatgta catttatatt ggctcatgtc caacattacc gccatgttga 540
cattgattat tgactagtta ttaatagtaa tcaattacgg ggtcattagt tcatagccca 600
tatatggagt tccgcgttac ataacttacg gtaaatggcc cgcctggctg accgcccaac 660
gacccccgcc cattgacgtc aataatgacg tatgttccca tagtaacgcc aatagggact 720
ttccattgac gtcaatgggt ggagtattta cggtaaactg cccacttggc agtacatcaa 780
gtgtatcata tgccaagtac gccccctatt gacgtcaatg acggtaaatg gcccgcctgg 840
cattatgccc agtacatgac cttatgggac tttcctactt ggcagtacat ctacgtatta 900
gtcatcgcta ttaccatggt gatgcggttt tggcagtaca tcaatgggcg tggatagcgg 960
tttgactcac ggggatttcc aagtctccac cccattgacg tcaatgggag tttgttttgg 1020
caccaaaatc aacgggactt tccaaaatgt cgtaacaact ccgccccatt gacgcaaatg 1080
ggcggtaggc gtgtacggtg ggaggtctat ataagcagag ctctccctat cagtgataga 1140
gatctcccta tcagtgatag agatcgtcga cgagctcgtt tagtgaaccg tcagatcgcc 1200
tggagacgcc atccacgctg ttttgacctc catagaagac accgggaccg atccagcctc 1260
cgcggccggg aacggtgcat tggaacgcgg attccccgtg ccaagagtga gatctaccat 1320
gggtgctagg gcttctgtgc tgtctggtgg tgagctggac aagtgggaga agatcaggct 1380
gaggcctggt ggcaagaaga agtacaagct aaagcacatt gtgtgggcct ccagggagct 1440
ggagaggttt gctgtgaacc ctggcctgct ggagacctct gaggggtgca ggcagatcct 1500
gggccagctc cagccctccc tgcaaacagg ctctgaggag ctgaggtccc tgtacaacac 1560
agtggctacc ctgtactgtg tgcaccagaa gattgatgtg aaggacacca aggaggccct 1620
ggagaagatt gaggaggagc agaacaagtc caagaagaag gcccagcagg ctgctgctgg 1680
cacaggcaac tccagccagg tgtcccagaa ctaccccatt gtgcagaacc tccagggcca 1740
gatggtgcac caggccatct ccccccggac cctgaatgcc tgggtgaagg tggtggagga 1800
gaaggccttc tcccctgagg tgatccccat gttctctgcc ctgtctgagg gtgccacccc 1860
ccaggacctg aacaccatgc tgaacacagt ggggggccat caggctgcca tgcagatgct 1920
gaaggagacc atcaatgagg aggctgctga gtgggacagg ctgcatcctg tgcacgctgg 1980
ccccattgcc cccggccaga tgagggagcc caggggctct gacattgctg gcaccacctc 2040
caccctccag gagcagattg gctggatgac caacaacccc cccatccctg tgggggaaat 2100
ctacaagagg tggatcatcc tgggcctgaa caagattgtg aggatgtact cccccacctc 2160
catcctggac atcaggcagg gccccaagga gcccttcagg gactatgtgg acaggttcta 2220
caagaccctg agggctgagc aggcctccca ggaggtgaag aactggatga cagagaccct 2280
gctggtgcag aatgccaacc ctgactgcaa gaccatcctg aaggccctgg gccctgctgc 2340
caccctggag gagatgatga cagcctgcca gggggtgggg ggccctggtc acaaggccag 2400
ggtgctggct gaggccatgt cccaggtgac caactccgcc accatcatga tgcagagggg 2460
caacttcagg aaccagagga agacagtgaa gtgcttcaac tgtggcaagg tgggccacat 2520
tgccaagaac tgtagggccc ccaggaagaa gggctgctgg aagtgtggca aggagggcca 2580
ccagatgaag gactgcaatg agaggcaggc caacttcctg ggcaaaatct ggccctccca 2640
caagggcagg cctggcaact tcctccagtc caggcctgag cccacagccc ctcccgagga 2700
gtccttcagg tttggggagg agaagaccac ccccagccag aagcaggagc ccattgacaa 2760
ggagctgtac cccctggcct ccctgaggtc cctgtttggc aacgacccct cctcccagta 2820
aaataaagcc catcagaatt cagtcgacag cggccgcgat ctgctgtgcc ttctagttgc 2880
cagccatctg ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc 2940
actgtccttt cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct 3000
attctggggg gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg 3060
catgctgggg atgcggtggg ctctatggcc gggccgcgat cgcgcttagg cctgaccatc 3120
tggtgctggc ctgcaccagg gccgagtttg ggtctagcga tgaggatacc gattgaggtg 3180
ggtaaggtgg gcgtggctag cagggtgggc gtgtataaat tgggggtcta aggggtctct 3240
ctgtttgtct tgcaacagcc gccgccatga gcgacaccgg caacagcttt gatggaagca 3300
tctttagtcc ctatctgaca gtgcgcatgc ctcactgggc cggagtgcgt cagaatgtga 3360
tgggttccaa cgtggatgga cgtcccgttc tgccttcaaa ttcgtctact atggcctacg 3420
cgaccgtggg aggaactccg ctggacgccg cgacctccgc cgccgcctcc gccgccgccg 3480
cgaccgcgcg cagcatggct acggaccttt acagctcttt ggtggcgagc agcgcggcct 3540
ctcgcgcgtc tgctcgggat gagaaactga ctgctctgct gcttaaactg gaagacttga 3600
cccgggagct gggtcaactg acccagcagg tttccagctt gcgtgagagc agccttgcct 3660
ccccctaatg gcccataata taaataaaag ccagtctgtt tggattaagc aagtgtatgt 3720
tctttattta actctccgcg cgcggtaagc ccgggaccag cggtctcggt cgtttagggt 3780
gcggtggatt ttttccaaca cgtggtacag gtggctctgg atgtttagat acatgggcat 3840
gagtccatcc ctggggtgga ggtagcacca ctgcagagct tcgtgctcgg gggtggtgtt 3900
gtatatgatc cagtcgtagc aggagcgctg ggcgtggtgc tgaaaaatgt ccttaagcaa 3960
gaggcttata gctaggggga ggcccttggt gtaagtgttt acaaatctgc ttagctggga 4020
ggggtgcatc cggggggata tgatgtgcat cttggactgg atttttaggt tggctatgtt 4080
cccgcccaga tcccttctgg gattcatgtt gtgcaggacc accagcacgg tatatccagt 4140
gcacttggga aatttatcgt ggagcttaga cgggaatgca tggaagaact tggagacgcc 4200
cttgtggcct cccagatttt ccatacattc gtccatgatg atggcaatgg gcccgtggga 4260
agctgcctga gcaaaaacgt ttctggcatc gctcacatcg tagttatgtt ccagggtgag 4320
gtcatcatag gacatcttta cgaatcgggg gcgaagggtc ccggactggg ggatgatggt 4380
accctcgggc cccggggcgt agttcccctc acagatctgc atctcccagg ctttcatttc 4440
agagggaggg atcatatcca cctgcggggc gatgaaaaag acagtttctg gcgcagggga 4500
gattaactgg gatgagagca ggtttctgag cagctgtgac tttccacagc cggtgggccc 4560
atatatcacg cctatcaccg gctgcagctg gtagttaaga gagctgcagc tgccgtcctc 4620
ccggagcagg ggggccacct cgttgagcat atccctgacg tggatgttct ccctgaccag 4680
ttccgccaga aggcgctcgc cgcccagcga aagcagctct tgcaaggaag caaaattttt 4740
cagcggtttc aggccatcgg ccgtgggcat gtttttcagc gtctgggtca gcagctccag 4800
cctgtcccag agctcggtga tgtgctctac ggcatctcga tccagcagat ctcctcgttt 4860
cgcgggttgg ggcggctttc gctgtagggc accagccgat gggcgtccag cggggccaga 4920
gtcatgtcct tccatgggcg cagggtcctc gtcagggtgg tctgggtcac ggtgaagggg 4980
tgcgctccgg gttgggcact ggccagggtg cgcttgaggc tggttctgct ggtgctgaat 5040
cgctgccgct cttcgccctg cgcgtcggcc aggtagcatt tgaccatggt ctcgtagtcg 5100
agaccctcgg cggcgtgccc cttggcgcgg agctttccct tggaggtggc gccgcacgag 5160
gggcactgca ggctcttcag ggcgtagagc ttgggagcga gaaacacgga ctctggggag 5220
taggcgtccg cgccgcaggc cgagcagacc gtctcgcatt ccaccagcca agtgagttcc 5280
gggcggtcag ggtcaaaaac caggttgccc ccatgctttt tgatgcgttt cttaccttgg 5340
ctctccatga ggcggtgtcc cttctcggtg acgaagaggc tgtccgtgtc cccgtagacc 5400
gacttcaggg gcctgtcttc cagcggagtg cctctgtcct cctcgtagag aaactctgac 5460
cactctgaga cgaaggcccg cgtccaggcc aggacgaagg aggccacgtg ggaggggtag 5520
cggtcgttgt ccactagcgg gtccaccttc tccagggtgt gcaggcacat gtccccctcc 5580
tccgcgtcca gaaaagtgat tggcttgtag gtgtaggaca cgtgaccggg ggttcccaac 5640
gggggggtat aaaagggggt gggtgccctt tcatcttcac tctcttccgc atcgctgtct 5700
gcgagagcca gctgctgggg taagtattcc ctctcgaagg cgggcatgac ctcagcgctc 5760
aggttgtcag tttctaaaaa tgaggaggat ttgatgttca cctgtccgga ggtgatacct 5820
ttgagggtac ctgggtccat ctggtcagaa aacactattt ttttgttatc aagcttggtg 5880
gcgaatgacc cgtagagggc gttggagagc agcttggcga tggagcgcag ggtctggttt 5940
ttgtcgcggt cggctcgctc cttggccgcg atgttgagtt gcacgtactc gcgggccacg 6000
cacttccact cggggaacac ggtggtgcgc tcgtctggga tcaggcgcac cctccagccg 6060
cggttgtgca gggtgaccat gtcgacgctg gtggcgacct caccgcgcag acgctcgttg 6120
gtccagcaga ggcggccgcc cttgcgcgag cagaaggggg gtagggggtc cagctggtcc 6180
tcgtttgggg ggtccgcgtc gatggtaaag accccgggga gcaggcgcgg gtcaaagtag 6240
tcgatcttgc aagcttgcat gtccagagcc cgctgccatt cgcgggcggc gagcgcgcgc 6300
tcgtaggggt tgaggggcgg gccccagggc atggggtggg tgagcgcgga ggcgtacatg 6360
ccgcagatgt catacacgta caggggttcc ctgaggatac cgaggtaggt ggggtagcag 6420
cgccccccgc ggatgctggc gcgcacgtag tcatagagct cgtgggaggg ggccagcatg 6480
ttgggcccga ggttggtgcg ctgggggcgc tcggcgcgga agacgatctg cctgaagatg 6540
gcgtgggagt tggaggagat ggtgggccgc tggaagacgt tgaagcttgc ttcttgcaag 6600
cccacggagt ccctgacgaa ggaggcgtag gactcgcgca gcttgtgcac cagctcggcg 6660
gtgacctgga cgtcgagcgc acagtagtcg agggtctcgc ggatgatgtc atacctatcc 6720
tcccccttct ttttccacag ctcgcggttg aggacgaact cttcgcggtc tttccagtac 6780
tcttggaggg gaaacccgtc cgtgtccgaa cggtaagagc ctagcatgta gaactggttg 6840
acggcctggt aggggcagca gcccttctcc acgggcagcg cgtaggcctg cgccgccttg 6900
cggagggagg tgtgggtgag ggcgaaagtg tccctgacca tgactttgag gtattgatgt 6960
ctgaagtctg tgtcatcgca gccgccctgt tcccacaggg tgtagtccgt gcgctttttg 7020
gagcgcgggt tgggcaggga gaaggtgagg tcattgaaga ggatcttccc cgctcgaggc 7080
atgaagtttc tggtgatgcg aaagggccct gggaccgagg agcggttgtt gatgacctgg 7140
gcggccagga cgatctcgtc aaagccgttt atgttgtgtc ccacgatgta gagctccagg 7200
aagcggggct ggcccttgat ggaggggagc tttttaagtt cctcgtaggt aagctcctcg 7260
ggcgattcca ggccgtgctc ctccagggcc cagtcttgca agtgagggtt ggccgccagg 7320
aaggatcgcc agaggtcgcg ggccatgagg gtctgcaggc ggtcgcggaa ggttctgaac 7380
tgccgcccca cggccatttt ttcgggggtg atgcagtaga aggtgagggg gtctttctcc 7440
caggggtccc atctgagctc tcgggcgagg tcgcgcgcgg cagcgaccag agcctcgtcg 7500
ccccccagtt tcatgaccag catgaagggc acgagttgct tgccaaaggc tcccatccaa 7560
gtgtaggttt ctacatcgta ggtgacaaag aggcgctccg tgcgaggatg agagccgatt 7620
gggaagaact ggatctcccg ccaccagttg gaggattggc tgttgatgtg gtgaaagtag 7680
aagtcccgtc tgcgggccga gcactcgtgc tggcttttgt aaaagcgacc gcagtactgg 7740
cagcgctgca cgggttgtat atcttgcacg aggtgaacct ggcgacctct gacgaggaag 7800
cgcagcggga atctaagtcc cccgcctggg gtcccgtgtg gctggtggtc ttttactttg 7860
gttgtctggc cgccagcatc tgtctcctgg agggcgatgg tggaacagac caccacgccg 7920
cgagagccgc aggtccagat ctcggcgctc ggcgggcgga gtttgatgac gacatcgcgc 7980
acattggagc tgtccatggt ctccagctcc cgcggcggca ggtcagccgg gagttcctgg 8040
aggttcacct cgcagagacg ggtcaaggcg cggacagtgt tgagatggta tctgatttca 8100
aggggcatgt tggaggcgga gtcgatggct tgcaggaggc cgcagccccg gggggccacg 8160
atggttcccc gcggggcgcg aggggaggcg gaagctgggg gtgtgttcag aagcggtgac 8220
gcgggcgggc ccccggaggt agggggggtt ccggccccac aggcatgggc ggcaggggca 8280
cgtcttcgcc gcgcgcgggc aggggctggt gctggctccg aagagcgctt gcgtgcgcga 8340
cgacgcgacg gttggtgtcc tgtatctggc gcctctgagt gaagaccacg ggtcccgtga 8400
ccttgaacct gaaagagagt tcgacagaat caatctcggc atcgttgaca gcggcctggc 8460
gcaggatctc ctgcacgtcg cccgagttgt cctggtaggc gatttctgcc atgaactgct 8520
cgatctcttc ctcctggaga tctcctcgtc cggcgcgctc cacggtggcc gccaggtcgt 8580
tggagatgcg acccatgagc tgcgagaagg cgttgagtcc gccctcgttc cagacccggc 8640
tgtagaccac gcccccctcg gcgtcgcggg cgcgcatgac cacctgggcc aggttgagct 8700
ccacgtgtcg cgtgaagacg gcgtagttgc gcaggcgctg gaaaaggtag ttcagggtgg 8760
tggcggtgtg ctcggcgacg aagaagtaca tgacccagcg ccgcaacgtg gattcattga 8820
tgtcccccaa ggcctccagg cgctccatgg cctcgtagaa gtccacggcg aagttgaaaa 8880
actgggagtt gcgagcggac acggtcaact cctcctccag aagacggatg agctcggcga 8940
cagtgtcgcg cacctcgcgc tcgaaggcca cggggggcgc ttcttcctct tccacctctt 9000
cttccatgat tgcttcttct tcttcctcag ccgggacggg agggggcggc ggcgggggag 9060
gggcgcggcg gcggcggcgg cgcaccggga ggcggtcgat gaagcgctcg atcatctccc 9120
cccgcatgcg gcgcatggtc tcggtgacgg cgcggccgtt ctcccggggg cgcagctcga 9180
agacgccgcc tctcatttcg ccgcggggcg ggcggccgtg aggtagcgag acggcgctga 9240
ctatgcatct taacaattgc tgtgtaggta cgccgccaag ggacctgatt gagtccagat 9300
ccaccggatc cgaaaacctt tggaggaaag cgtctatcca gtcgcagtcg caaggtaggc 9360
tgagcaccgt ggcgggcggg ggcgggtcgg gagagttcct ggcggagatg ctgctgatga 9420
tgtaattaaa gtaggcggtc ttgagaaggc ggatggtgga caggagcacc atgtctttgg 9480
gtccggcctg ttggatgcgg aggcggtcgg ccatgcccca ggcctcgttc tgacaccggc 9540
gcaggtcttt gtagtaatct tgcatgagtc tttccaccgg cacttcttct ccttcctctt 9600
cttcatctcg ccggtggttt ctcgcgccgc ccatgcgcgt gaccccaaag cccctgagcg 9660
gctgcagcag ggccaggtcg gcgaccacgc gctcggccaa gatggcctgc tgtacctgag 9720
tgagggtcct ctcgaagtca tccatgtcca cgaagcggtg gtaggcaccc gtgttgatgg 9780
tgtaggtgca gttggccatg acggaccagt tgacggtctg gtgtcccggc tgcgagagct 9840
ccgtgtaccg caggcgcgag aaggcgcggg aatcgaacac gtagtcgttg caagtccgca 9900
ccagatactg gtagcccacc aggaagtgcg gcggaggttg gcgatagagg ggccagcgct 9960
gggtggcggg ggcgccgggc gccaggtctt ccagcatgag gcggtggtat ccgtagatgt 10020
acctggacat ccaggtgatg cctgcggcgg tggtggtggc gcgcgcgtag tcgcggaccc 10080
ggttccagat gtttcgcagg ggcgagaagt gttccatggt cggcacgctc tggccggtga 10140
ggcgcgcgca gtcgttgacg ctctatacac acacaaaaac gaaagcgttt acagggcttt 10200
cgttctgtag cctggaggaa agtaaatggg ttgggttgcg gtgtgccccg gttcgagacc 10260
aagctgagct cagccggctg aagccgcagc taacgtggta ttggcagtcc cgtctcgacc 10320
caggccctgt atcctccagg atacggtcga gagccctttt gctttcttgg ccaagcgccc 10380
gtggcgcgat ctgggataga tggtcgcgat gagaggacaa aagcggctcg cttccgtagt 10440
ctggagaaac aatcgccagg gttgcgttgc ggcgtacccc ggttcgagcc cctatggcgg 10500
cttggatcgg ccggaaccgc ggctaacgtg ggctgtggca gccccgtcct caggaccccg 10560
ccagccgact tctccagtta cgggagcgag ccccttttgt ttttttattt tttagatgca 10620
tcccgtgctg cggcagatgc gcccctcgcc ccggcccgat cagcagcagc aacagcaggc 10680
atgcagaccc ccctctcctc tccccgcccc ggtcaccacg gccgcggcgg ccgtgtccgg 10740
tgcggggggc gcgctggagt cagatgagcc accgcggcgg cgacctaggc agtatctgga 10800
cttggaagag ggcgagggac tggcgcggct gggggcgagc tctccagagc gccacccgcg 10860
ggtgcagttg aaaagggacg cgcgtgaggc gtacctgccg cggcaaaacc tgtttcgcga 10920
ccgcgggggc gaggagcccg aggagatgcg ggactgcagg ttccaagcgg ggcgcgagct 10980
gcgccgcggc ttggacagac agcgcctgct gcgcgaggag gactttgagc ccgacacgca 11040
gacgggcatc agccccgcgc gcgcgcacgt ggccgcggcc gacctggtga ccgcctacga 11100
gcagacggtg aaccaggagc gcaacttcca aaaaagcttc aacaaccacg tgcgcacgct 11160
ggtggcgcgc gaggaggtga ccctgggtct catgcatctg tgggacctgg tggaggcgat 11220
cgtgcagaac cccagcagca agcccctgac cgcgcagctg ttcctggtgg tgcagcacag 11280
cagggacaac gaggccttca gggaggcgct gctgaacatc accgagccgg aggggcgctg 11340
gctcctggac ctgataaaca tcctgcagag catagtggtg caggagcgca gcctgagcct 11400
ggccgagaag gtggcggcca ttaactattc tatgctgagc ctgggcaagt tctacgctcg 11460
caagatctac aagaccccct acgtgcccat agacaaggag gtgaagatag acagcttcta 11520
catgcgcatg gcgctgaagg tgctaaccct gagcgacgac ctgggagtgt accgcaacga 11580
gcgcatccac aaggccgtga gcgccagccg gcggcgcgag ctgagcgacc gcgaactgat 11640
gcacagtctg cagcgcgcgc tgaccggcgc gggcgagggc gacagggagg tcgagtccta 11700
ctttgacatg ggggccgacc tgcactggca gccgagccgc cgcgccctgg aagcggcggg 11760
ggcgtacggc ggccccctgg cggccgatga cgaggaagag gaggactatg agctagagga 11820
gggcgagtac ctggaggact gacctggctg gtggtgtttt ggtatagatg caagatccga 11880
acgtggcgga cccggcggtc cgggcggcgc tgcagagcca gccgtccggc attaactcct 11940
ctgacgactg ggccgcggcc atgggtcgca tcatggccct gaccgcgcgc aaccccgagg 12000
ccttcaggca gcagcctcag gctaaccggc tggcggccat cttggaagcg gtagtgcccg 12060
cgcgctccaa ccccacccac gagaaggtgc tggccatagt caacgcgctg gcggagagca 12120
gggccatccg ggcagacgag gccggactgg tgtacgatgc gctgctgcag cgggtggcgc 12180
ggtacaacag cggcaacgtg cagaccaacc tggaccgcct ggtgacggac gtgcgcgagg 12240
ccgtggcgca gcgcgagcgc ttgcatcagg acggcaacct gggctcgctg gtggcgctaa 12300
acgccttcct tagcacccag ccggccaacg taccgcgggg gcaggaggac tacaccaact 12360
tcttgagcgc gctgcggctg atggtgaccg aggtccctca gagcgaggtg taccagtcgg 12420
ggcccgacta cttcttccag accagcagac agggcttgca aaccgtgaac ctgagccagg 12480
ctttcaagaa cctgcggggg ctgtggggag tgaaggcgcc caccggcgac cgggctacgg 12540
tgtccagcct gctaaccccc aactcgcgcc tgctgctgct gctgatcgcg cccttcacgg 12600
acagcgggag cgtctcgcgg gagacctatc tgggccacct gctgacgctg taccgcgagg 12660
ccatcgggca ggcgcaggtg gacgagcaca ccttccagga gatcaccagc gtgagccacg 12720
cgctggggca ggaggacacg ggcagcctgc aggcgaccct gaactacctg ctgaccaaca 12780
ggcggcagaa gattcccacg ctgcacagcc tgacccagga ggaggagcgc atcttgcgct 12840
acgtgcagca gagcgtgagc ctgaacctga tgcgcgacgg cgtgacgccc agcgtggcgc 12900
tggacatgac cgcgcgcaac atggaaccgg gcatgtacgc ttcccagcgg ccgttcatca 12960
accgcctgat ggactacttg catcgggcgg cggccgtgaa ccccgagtac ttcaccaatg 13020
ccattctgaa tccccactgg atgccccctc cgggtttcta caacggggac ttcgaggtgc 13080
ctgaggtcaa cgatgggttc ctctgggatg acatggatga cagtgtgttc tcccccaacc 13140
cgctgcgcgc cgcgtctctg cgattgaagg agggctctga cagggaagga ccaaggagtc 13200
tggcctcctc cctggctctg ggggcggtgg gcgccacggg cgcggcggcg cggggcagca 13260
gccccttccc cagcctggcg gactctctga atagcgggcg ggtgagcagg ccccgcttgc 13320
taggcgagga ggagtatctg aacaactccc tgctgcagcc cgtgagggac aaaaacgctc 13380
agcggcagca gtttcccaac aatgggatag agagcctggt ggacaagatg tccagatgga 13440
agacgtatgc gcaggagtac aaggagtggg aggaccgcca gccgcggccc ctgccgcccc 13500
ctagacagcg ctggcagcgg cgcgcgtcca accgccgctg gaggcagggg cccgaggacg 13560
atgatgactc tgcagatgac agcagcgtgt tggacctggg cgggagcggg aacccctttt 13620
cgcacctgcg cccacgcctg ggcaagatgt tttaaaagag aaaaataaaa actcaccaag 13680
gccatggcga cgagcgttgg ttttttgttc ccttccttag tatgcggcgc gcggcgatgt 13740
tcgaggaggg gcctcccccc tcttacgaga gcgcgatggg aatttctcct gcggcgcccc 13800
tgcagcctcc ctacgtgcct cctcggtacc tgcaacctac aggggggaga aatagcatct 13860
gttactctga gctgcagccc ctgtacgata ccaccagact gtacctggtg gacaacaagt 13920
ccgcggacgt ggcctccctg aactaccaga acgaccacag cgattttttg accacggtga 13980
tccaaaacaa cgacttcacc ccaaccgagg ccagtaccca gaccataaac ctggacaaca 14040
ggtcgaactg gggcggcgac ctgaagacta tcctgcacac caatatgccc aacgtgaacg 14100
agttcatgtt caccaactct tttaaggcgc gggtgatggt ggcgcgcgag cagggggagg 14160
cgaagtacga gtgggtggac ttcacgctgc ccgagggcaa ctactcagag accatgactc 14220
tcgacctgat gaacaatgcg atcgtggaac actatctgaa agtgggcagg cagaacgggg 14280
tgaaggagag cgatatcggg gtcaagtttg acaccagaaa cttccgtctg ggctgggacc 14340
ctgtgaccgg gctggtcatg ccgggggtct acaccaacga ggcctttcat cccgatatag 14400
tgctcctgcc cggctgtggg gtggacttca cccagagccg gctgagcaac ctgctgggcg 14460
ttcgcaagcg gcaacctttc caggagggtt tcaagatcac ctatgaggat ctggaggggg 14520
gcaacattcc cgcgctcctt gatctggacg cctacgagga gagcttgaaa cccgaggaga 14580
gcgctggcga cagcggcgag agtggcgagg agcaagccgg cggcggcggc agcgcgtcgg 14640
tagaaaacga aagtactccc gcagtggcgg cggacgctgc ggaggtcgag ccggaggcca 14700
tgcagcagga cgcagaggag ggcgcgcagg aggacatgaa caatggggag atcaggggcg 14760
acactttcgc cacccggggc gaagaaaaag aggcagaggc ggcggcggcg acggcggaag 14820
ccgaaaccga ggcagaggca gagcccgaga ccgaagttat ggaagacatg aatgatggag 14880
aacgtagggg tgacacgttt gccacccggg gcgaagagaa ggcggcggag gcagaagccg 14940
cggctgagga ggcggctgcg gctgcggcca aggctgaggc tgcggctgag gctaaggtcg 15000
aagccgatgt tgcggttgag gctcaggctg aggaggaggc ggcggctgaa gcagttaagg 15060
aaaaggccca ggcagagcag gaagagaaaa aacctgtcat tcaacctcta aaagaagata 15120
gcaaaaagcg cagttacaac gtcattgagg gcagcacctt tacccaatac cgcagctggt 15180
acctggctta caactacggc gacccggtca agggggtgcg ctcgtggacc ctgctctgca 15240
cgccggacgt cacctgcggc tccgagcaga tgtactggtc gctgccaaac atgatgcaag 15300
acccggtgac cttccgttcc acgcggcagg ttagcaactt tccggtggtg ggcgccgaac 15360
tgctgccagt acactccaag agtttttaca acgagcaggc cgtctactcc cagctgatcc 15420
gccaggccac ctctctgacc cacgtgttca atcgctttcc cgagaaccag attttggcgc 15480
gcccgccggc ccccaccatc accaccgtca gtgaaaacgt tcctgccctc acagatcacg 15540
ggacgctacc gctgcgcaac agcatctcag gagtccagcg agtgaccatt actgacgcca 15600
gacgccggac ctgcccctac gtttacaagg ccttgggcat agtctcgccg cgcgtcctct 15660
ccagtcgcac tttttaaaac acatccaccc acacgctcca aaatcatgtc cgtactcatc 15720
tcgcccagca acaacaccgg ctgggggctg cgcgcaccca gcaagatgtt tggaggggca 15780
aggaagcgct ccgaccagca ccccgtgcgc gtgcgcggcc actaccgcgc gccctggggt 15840
gcgcacaagc gcgggcgcac agggcgcacc actgtggatg atgtcattga ctccgtagtg 15900
gagcaggcgc gccactacac acccggcgcg ccgaccgcct ccgccgtgtc caccgtggac 15960
caggcgatcg aaagcgtggt acagggggcg cggcactatg ccaaccttaa aagtcgccgc 16020
cgccgcgtgg cgcgccgcca tcgccggaga ccccgggcta ctgccgccgc gcgccttacc 16080
aaggctctgc tcaagcgcgc caggcgaact ggccaccggg ccgccatgag ggccgcacgg 16140
cgggctgccg ctgccgcgag cgccgtggcc ccgcgggcac gaaggcgcgc ggccgctgcc 16200
gccgccgccg ccatttccag cttggcctcg acgcggcgcg gtaacatata ctgggtgcgc 16260
gactcggtga gcggcacacg tgtgcccgtg cgctttcgcc ccccacggaa ttagcacaag 16320
acaacataca cactgagtct cctgctgttg tgtatcccag cggcgaccgt cagcagcggc 16380
gacatgtcca agcgcaaaat taaagaagag atgctccagg tcatcgcgcc ggagatctat 16440
gggcccccga agaaggagga ggaggattac aagccccgca agctaaagcg ggtcaaaaag 16500
aaaaagaaag atgatgacgt tgacgaggcg gtggagtttg tccgccgcat ggcgcccagg 16560
cgccctgtgc agtggaaggg tcggcgcgtg cagcgagtcc tgcgccccgg caccgcggtg 16620
gtctttacgc ccggcgagcg ttccacgcgc actttcaagc gggtgtacga tgaggtgtac 16680
ggcgacgagg atctgttgga gcaggccaac catcgatttg gggagtttgc atatgggaaa 16740
cggcctcgcg agagtctaaa agaggacctg ctggcgctac cgctggacga gggcaatccc 16800
accccgagtc tgaagccggt gaccctgcaa caggtgctgc ctttgagcgc gcccagcgag 16860
cagaagcgag ggttaaagcg cgagggcggg gacctggcac ccaccgtgca gttgatggtg 16920
cccaagcggc agaagctgga ggacgtgctg gagaaaatga aagtagagcc cgggatccag 16980
cccgagatca aggtccgccc tatcaagcag gtggcgcccg gcgtgggagt ccagaccgtg 17040
gacgttagga ttcccacgga ggagatggaa acccaaaccg ccactccctc ttcggcagca 17100
agcgccacca ccggcgccgc ttcggtagag gtgcagacgg acccctggct acccgccgcc 17160
actatcgccg tcgccgccgc cccccgttcg cgcggacgca agagaaatta tccagcggcc 17220
agcgcgctta tgccccagta tgcgctgcat ccatccatcg cgcccacccc cggctaccgc 17280
gggtactcgt accgcccgcg cagatcagcc ggcactcgcg gccgccgccg ccgtgcgacc 17340
acaaccagcc gccgccgtcg ccgccgccgc cagccagtgc tgacccccgt gtctgtaagg 17400
aaggtggctc gctcggggag cacgctggtg gtgcccagag cgcgctacca ccccagcatc 17460
gtttaaagcc ggtctctgta tggttcttgc agatatggcc ctcacttgtc gccttcgctt 17520
cccggtgccg ggataccgag gaagaactca ccgccgcagg ggcatggcgg gcagcggtct 17580
ccgcggcggc cgtcgccatc gccggcgcgc aaagagcagg cgcatgcgcg gcggtgtgtt 17640
gcccctgctg gtcccgctac tcgccgcggc gatcggcgcc gtgcccggga tcgcctccgt 17700
ggccctgcag gcgtcccaga aacattgact cttgcaacct tgcaagcttg catttttgga 17760
ggaaaaaata aaaaagtcta gactctcacg ctcgcttggt cctgtgacta ttttgtagaa 17820
aaaagatgga agacatcaac tttgcgtcgc tggccccgcg tcacggctcg cgcccgttca 17880
tgggagactg gacagatatc ggcaccagca atatgagcgg tggcgccttc agctggggca 17940
gtctgtggag cggccttaaa aattttggtt ccaccattaa gaactatggc aacaaagcgt 18000
ggaacagcag cacgggtcag atgctgagag acaagttgaa agagcagaac ttccaggaga 18060
aggtggcgca gggcctggcc tctggcatca gcggggtggt ggacatagct aaccaggccg 18120
tgcagaaaaa gataaacagt catctggacc cccgccctca ggtggaggaa acgcctccag 18180
ccatggagac ggtgtctccc gagggcaaag gcgaaaagcg cccgcggccc gacagggaag 18240
agaccctggt gtcacacacc gaggagccgc cctcttacga ggaggcagtc aaggccggcc 18300
tgcccaccac tcgccccata gctcccatgg ccaccggtgt ggtgggtcac aggcaacaca 18360
cccccgcaac actagatctg cccccgccgt ccgagccgac tcgccagcca aaggcggtga 18420
cggtgtccgc tccctccact tccgccgcca acagagtgcc tctgcgccgc gctgcgagcg 18480
gcccccgggc ctcgcgagtc agcggcaact ggcagagcac actgaacagc atcgtgggcc 18540
tgggagtgag gagtgtgaag cgccgccgtt gctactgaat gagcaagcta gctaacgtgt 18600
tgtatgtgtg tatgcgtcct atgtcgccgc cagaggagct gttgagccgc cggcgccgtc 18660
tgcactccag cgaatttcaa gatggcgacc ccatcgatga tgcctcagtg gtcgtacatg 18720
cacatctcgg gccaggacgc ttcggagtac ctgagccccg ggctggtgca gttcgcccgc 18780
gccacagaca cctacttcaa catgagtaac aagttcagga accccactgt ggcgcccacc 18840
cacgatgtga ccacggaccg gtcgcagcgc ctgacgctgc ggttcatccc cgtggatcgg 18900
gaggacaccg cttactctta caaggcgcgg ttcacgctgg ccgtgggcga caaccgcgtg 18960
ctggacatgg cctccactta ctttgacatc cggggggtgc tggacagggg ccccactttt 19020
aagccctact cgggcactgc ctacaacccc ctggccccca agggcgcccc caattcttgt 19080
gagtgggaac aagaggaaaa tcaggtggtc gctgcagatg atgaacttga agatgaagaa 19140
gcgcaagcac aagaggaagc ccctgtgaaa aaaattcatg tatatgctca ggcgcctctt 19200
tctggcgaaa agatttccaa ggatggtatc caaataggta ctgaagtcgt aggagataca 19260
tctaaggaca cttttgcaga taaaacattc caacccgaac ctcagatagg cgagtctcag 19320
tggaacgagg ctgatgccac agcagcagga ggtagagttt tgaaaaagac tacccctatg 19380
agaccttgct atggatccta tgccaggcct accaatgcca acgggggtca aggaattatg 19440
gttgccaatg aacaaggagt gttggagtct aaagtagaaa tgcaattttt ctctaacacc 19500
acaaccctta atgcgcggga tggaaccggc aatcccgaac caaaggtggt gttgtacagc 19560
gaagatgtcc acttggaatc tcccgatact catctgtctt acaagcccaa aaaggatgat 19620
gttaatgcca aaatcatgtt gggtcagcaa gccatgccca acagacccaa cctcattgga 19680
tttagagata atttcattgg gcttatgttt tacaacagca ccggtaacat gggagtgctg 19740
gcgggtcagg cctctcagtt gaatgctgtg gtggacttgc aggatagaaa cacagaactg 19800
tcatatcagc ttctgcttga ttcaattggg gatagaacca gatacttctc catgtggaac 19860
caggcagtgg atagctatga tccagatgtc agaattattg aaaaccatgg gactgaggat 19920
gaactgccca actactgctt ccctttgggc ggcataggag ttactgatac ttatcaaggg 19980
ataaaaaata ccaatggcaa tggtcagtgg accaaagatg atcagttcgc ggaccgcaac 20040
gaaatagggg tgggaaacaa cttcgccatg gagatcaaca tccaggccaa cctttggaga 20100
aacttcctct atgcaaacgt ggggctctac ctgccagaca agctcaagta caaccccacc 20160
aacgtggaca tctctgacaa ccccaacacc tatgactaca tgaacaagcg ggtggtggcc 20220
cctggcctgg tggactgctt tgtcaatgtg ggagccaggt ggtccctgga ctacatggac 20280
aacgtcaacc ccttcaacca ccaccgcaat gcgggtctgc gctaccgctc catgatcctg 20340
ggcaacgggc gctatgtgcc ctttcacatc caggtacccc agaagttctt tgccatcaag 20400
aacctcctgc tcctgcccgg ctcctacacc tacgagtgga acttcaggaa ggatgtgaac 20460
atggtcctac agagctctct gggcaatgac cttagggtgg atggggccag catcaagttt 20520
gacagcatca ccctctatgc tacatttttc cccatggccc acaacaccgc ctccacgctt 20580
gaggccatgc tgagaaacga caccaacgac cagtccttta atgactacct ctctggggcc 20640
aacatgctct acccaatccc agccaaggcc accaacgtgc ccatctccat cccctctcgc 20700
aactgggccg cctttagagg ctgggccttt acccgcctta agaccaagga gaccccctcc 20760
ctgggctcgg gttttgatcc ctactttgtt tactcgggat ccatccccta cctggatggc 20820
accttctacc tcaaccacac tttcaagaag atatccatca tgtatgactc ctccgtcagc 20880
tggccgggca acgaccgctt gctcaccccc aatgagttcg aggtcaagcg cgccgtggac 20940
ggcgagggct acaacgtggc ccagtgcaac atgaccaagg actggttcct ggtgcagatg 21000
ctggccaact acaacatagg ctaccagggc ttttacatcc cagagagcta caaggacagg 21060
atgtactcct tcttcagaaa tttccaaccc atgagccgac aggtggtgga cgagaccaat 21120
tacaaggact atcaagccat tggcatcacc caccagcaca acaactcggg tttcgtgggc 21180
tacctggcgc ccaccatgcg cgagggtcag gcctaccccg ccaacttccc ctaccccttg 21240
ataggcaaga ccgcggtcga cagcgtcacc cagaaaaagt tcctctgcga ccgcaccctc 21300
tggcgcatcc ccttctctag caacttcatg tccatgggtg cgctcacgga cctgggccaa 21360
aacctgcttt atgccaactc tgcccatgcg ctggacatga cttttgaggt ggaccccatg 21420
gacgagccca cccttctcta tattgtgttt gaagtgttcg acgtggtcag agtgcaccag 21480
ccgcaccgcg gtgtcatcga gaccgtgtac ctgcgtacgc ccttctcagc cggcaacgcc 21540
accacctaag gagacagcgc cgccgccgcc tgcatgacgg gttccaccga gcaagagctc 21600
agggccattg ccagagacct gggatgcgga ccctattttt tgggcaccta tgacaaacgc 21660
ttcccgggct ttatctcccg agacaagctc gcctgcgcca ttgtcaacac ggccgcgcgc 21720
gagaccgggg gcgtgcactg gctggccttt ggctgggacc cgcgctccaa aacttgctac 21780
ctctttgacc cctttggctt ctccgatcag cgcctcaggc agatttatga gtttgagtac 21840
gaggggctgc tgcgccgcag cgcgctcgcc tcctcgcccg accgctgcat cacccttgag 21900
aagtccaccg aaaccgtgca ggggccccac tcggccgcct gcggtctctt ctgttgcatg 21960
tttttgcacg cctttgtgca ctggcctcag agtcccatgg attgcaaccc caccatgaac 22020
ttgctaaagg gagtgcccaa cgccatgctc cagagccccc aggtccagcc caccctgcgc 22080
cgcaaccagg aacagcttta ccgcttcctg gagcgccact ccccctactt ccgcagccac 22140
agcgcgcgca tccggggggc cacctctttt tgccacttgc aagaaaacat gcaagacgga 22200
aaatgatgta cagcatgctt ttaataaatg taaagactgt gcactttaat tatacacggg 22260
ctctttctgg ttatttattc aacaccgccg tcgccattta gaaatcgaaa gggttctgcc 22320
gtgcgtcgcc gtgcgccacg ggcagagaca cgttgcgata ctggaagcgg ctcgcccact 22380
tgaactcggg caccaccatg cggggcagtg gttcctcggg gaagttctcg ctccacaggg 22440
tgcgggtcag ctgcagcgcg ctcaggaggt cgggagccga gatcttgaag tcgcagttgg 22500
ggccggaacc ctgcgcgcgc gagttgcggt acacggggtt gcagcactgg aacaccagca 22560
gggccggatt attcacgctg gccagcaggc tctcgtcgct gatcatgtcg ctgtccagat 22620
cctccgcgtt gctcagggcg aatggggtca tcttgcagac ctgcctgccc aggaaaggcg 22680
ggagcccagg cttgccgttg cagtcgcagc gcaggggcat tagcaggtgc ccacggcccg 22740
actgcgcctg cgggtacaac gcgcgcatga aggcttcgat ctgcctaaaa gccacctggg 22800
tcttggctcc ctccgaaaag aacatcccac aggacttgct ggagaactgg ttcgcgggac 22860
agctggcatc gtgcaggcag cagcgcgcgt cagtgttggc aatctgcacc acgttgcgac 22920
cccaccggtt tttcactatc ttggccttgg aagcctgctc ctttagcgcg cgctggccgt 22980
tctcgctggt cacatccatc tctatcacct gttccttgtt gatcatgttt gtcccgtgca 23040
gacactttag gtcgccctcc gtctgggtgc agcggtgctc ccacagcgcg caaccggtgg 23100
gctcccaatt cttgtgggtc acccccgcgt aggcctgcag gtaggcctgc aggaagcgcc 23160
ccatcatggt cataaaggtc ttctggctcg taaaggtcag ctgcaggccg cgatgctctt 23220
cgttcagcca ggtcttgcag atggcggcca gcgcctcggt ctgctcgggc agcatcttaa 23280
aatttgtctt caggtcgtta tccacgtggt acttgtccat catggcacgc gccgcctcca 23340
tgcccttctc ccaggcggac accatgggca ggcttagggg gtttatcact tccagcggcg 23400
aggacaccgt actttcgatt tcttcttcct ccccctcttc ccggcgcgcg cccccgctgt 23460
tgcgcgctct taccgcctgc accaaggggt cgtcttcagg caagcgccgc accgagcgct 23520
tgccgccctt gacctgcttg atcagtaccg gcgggttgct gaagcccacc atggtcagcg 23580
ccgcctgctc ttcttcgtct tcgctgtcta ccactatttc tggggagggg cttctccgct 23640
ctgcggcaaa ggcggcggat cgcttctttt ttttcttggg agccgccgcg atggagtccg 23700
ccacggcgac cgaggtcgag ggcgtggggc tgggggtgcg cggtaccagg gcctcgtcgc 23760
cctcggactc ttcctctgac tccaggcggc ggcggagtcg cttctttggg ggcgcgcgcg 23820
tcagcggcgg cggagacggg gacggggacg gggacgggac gccctccaca gggggtggtc 23880
ttcgcgcaga cccgcggccg cgctcggggg tcttctcgcg ctggtcttgg tcccgactgg 23940
ccattgtatc ctcctcctcc taggcagaga gacataagga gtctatcatg caagtcgaga 24000
aggaggagag cttaaccacc ccctcagaga ccgccgatgc gcccgccgtc gccgtcgccc 24060
ccgctaccgc cgacgcgccc gccacaccga gcgacacccc cacggacccc cccgccgacg 24120
cacccctgtt cgaggaagcg gccgtggagc aggacccggg ctttgtctcg gcagaggagg 24180
atttgcaaga ggaggagaat aaggaggaga agccctcagt gccaaaagat cataaagagc 24240
aagacgagca cgacgcagac gcacaccagg gtgaagtcgg gcggggggac ggagggcatg 24300
gcggcgccga ctacctagac gaaggaaacg acgtgctctt gaagcacctg catcgtcagt 24360
gcgccatcgt ctgcgacgct ctgcaggagc gcagcgaggt gcccctcagc gtggcggagg 24420
tcagccgcgc ctacgagctc agcctctttt ccccccgggt gcccccccgc cgccgcgaaa 24480
acggcacatg cgagcccaac ccgcgcctca acttctaccc cgcctttgtg gtgcccgagg 24540
tcctggccac ctatcacatc ttctttcaaa attgcaagat ccccatctcg tgccgcgcca 24600
accgtagccg cgccgataag atgctggccc tgcgccaggg cgaccacata cctgatatcg 24660
ccgctttgga agatgtgcca aagatcttcg agggtctggg gcgcaacgag aagcgggcag 24720
caaactctct gcaacaggaa aacagcgaaa atgagagtca cactggagcg ctggtggagc 24780
tggagggcga caacgcccgc ctggcggtgc tcaagcgcag catcgaggtc acccactttg 24840
cctaccccgc gctcaacctg ccccccaaag tcatgaacgc ggtcatggac gggctgatca 24900
tgcgccgcgg ccggcccctc gctccagatg caaacttgca tgaggagacc gaggacggtc 24960
agcccgtggt cagcgacgag cagctgacgc gctggctgga gagcgcggac cccgccgaac 25020
tggaggagcg gcgcaagatg atgatggccg cggtgctggt caccgtagag ctggagtgtc 25080
tgcagcgctt cttcggtgac cccgagatgc agagaaaggt cgaggagacc ctacactaca 25140
ccttccgcca gggctacgtg cgccaggctt gcaagatctc caacgtggag ctcagcaacc 25200
tggtgtccta cctgggcatc ttgcatgaaa accgccttgg gcagagcgtg ctacactcca 25260
ccctgcgcgg ggaggcgcgc cgcgactacg tgcgcgactg cgtttacctc ttcctctgct 25320
acacctggca gacggccatg ggggtctggc agcagtgcct ggaggagcgc aacctcaagg 25380
agctggagaa gcttctgcag cgcgcgctca aagacctctg gacgggcttc aacgagcgct 25440
cggtggccgc cgcgctagcc gacctcatct tccccgagcg cctgctcaaa accctccagc 25500
aggggctgcc cgacttcacc agccaaagca tgttgcaaaa ttttaggaac tttatcctgg 25560
agcgttctgg catcctaccc gccacctgct gcgccctgcc cagcgacttt gtccccctcg 25620
tgtaccgcga gtgccccccg ccgctgtggg gccactgcta cctgttccaa ctggccaact 25680
acctgtccta ccacgcggac ctcatggagg actccagcgg cgaggggctc atggagtgcc 25740
actgccgctg caacctctgc acgccccacc gctccctggt ctgcaacacc caactgctca 25800
gcgagagtca gattatcggt accttcgagc tacagggtcc gtcctcctca gacgagaagt 25860
ccgcggctcc ggggctaaaa ctcactccgg ggctgtggac ttccgcctac ctgcgcaaat 25920
ttgtacctga agactaccac gcccacgaaa tcaggtttta cgaggaccaa tcccgcccgc 25980
ccaaggcgga gctgaccgcc tgcgtcatca cccagggcga gatcctaggc caattgcaag 26040
ccatccaaaa agcccgccaa gagtttttgc tgaagagggg tcggggggtg tatctggacc 26100
cccagtcggg tgaggagctc aacccggttc ccccgctgcc accgccgcgg gaccttgctt 26160
cccaggataa gcatcgccat ggctcccaga aagaagcagc agcggccgcc gctgccgccg 26220
ccccacatgc tggaggaaga ggaggaatac tgggacagtc aggcagagga ggtttcggac 26280
gaggaggagc cggagacgga gatggaagag tgggaggagg acagcttaga cgaggaggct 26340
tccgaagccg aagaggcagg cgcaacaccg tcaccctcgg ccgcagcccc ctcgcaggcg 26400
cccccgaagt ccgctcccag catcagcagc aacagcagcg ctataacctc cgctcctcca 26460
ccgccgcgac ccacggccga ccgcagaccc aaccgtagat gggacaccac cggaaccggg 26520
gccggtaagt cctccgggag aggcaagcaa gcgcagcgcc aaggctaccg ctcgtggcgc 26580
gctcacaaga acgccatagt cgcttgcttg caagactgcg gggggaacat ctccttcgcc 26640
cgccgcttcc tgctcttcca ccacggtgtg gccttccccc gtaacgtcct gcattactac 26700
cgtcatctct acagccccta ctgcggcggc agtgagccag aggcggccag cggcggcggc 26760
gcccgtttcg gtgcctagga agacccaggg caagacttca gccaagaaac tcgcggcgac 26820
cgcggcgaac gcggtcgcgg gggccctgcg cctgacggtg aacgaacccc tgtcgacccg 26880
cgaactgagg aaccgaatct tccccactct ctatgccatc ttccagcaga gcagagggca 26940
ggatcaggaa ctgaaagtaa aaaacaggtc tctgcgctcc ctcacccgca gctgtctgta 27000
tcacaagagc gaagaccagc ttcggcgcac gctggaggac gctgaggcac tcttcagcaa 27060
atactgcgcg ctcactctta aggactagct ccgcgccctt ctcgaattta ggcgggaacg 27120
cctacgtcat cgcagcgccg ccgtcatgag caaggacatt cccacgccat acatgtggag 27180
ctatcagccg cagatgggac tcgcggcggg cgcctcccaa gactactcca cccgcatgaa 27240
ctggctcagt gccggcccac acatgatctc acaggttaat gacatccgca cccatcgaaa 27300
ccaaatattg gtgaagcagg cggcaattac caccacgccc cgcaataatc ccaaccccag 27360
ggagtggccc gcgtccctgg tgtatcagga aattcccggc cccaccaccg tactacttcc 27420
gcgtgattcc caggccgaag tccaaatgac taactcaggg gcacagctcg cgggcggctg 27480
tcgtcacagg gtgcggcctc ctcgccaggg tataactcac ctggagatcc gaggcagagg 27540
tattcagctc aacgacgagt cggtgagctc ctcgctcggt ctcagacctg acgggacctt 27600
ccagatagcc ggagccggcc gatcttcctt cacgccccgc caggcgtacc tgactctgca 27660
gagctcgtcc tcggcgccgc gctcgggcgg catcgggact ctccagttcg tgcaggagtt 27720
tgtgccctcg gtctacttca accccttctc gggctctccc ggtcgctacc cggaccagtt 27780
tatcccgaac tttgacgccg cgagggactc ggtggacggc tacgactgaa tgtcgggtgg 27840
acccggtgca gagcaacttc gcctgaagca ccttgaccac tgccgccgcc ctcagtgctt 27900
tgcccgctgt cagaccggtg agttccagta cttttccctg cccgactcgc acccggacgg 27960
cccggcgcac ggggtgcgct ttttcatccc gagtcaggtc cgctctaccc taatcaggga 28020
gttcaccgcc cgtcccctac tggcggagtt ggaaaagggg ccttctatcc taaccattgc 28080
ctgcatttgc tctaaccctg gattacacca agatctttgc tgtcatttgt gtgctgagta 28140
taataaaggc tgagatcaga atctactcgg accttatccc tttcaattga tcataactgt 28200
aatcaataaa aaatcactta cttgaaatct gatagcaaga ctctgtccaa ttttttcagc 28260
aacacttcct tcccctcctc ccaactctgg tactctaggc gcctcctagc tgcaaacttc 28320
ctccacagtc tgaagggaat gtcagattcc tcctcctgtc cctccgcacc cacgatcttc 28380
atgttgttac agatgaaacg cgcgagatcg tctgacgaga ccttcaaccc cgtgtacccc 28440
tacgataccg agatcgctcc gacttctgtc cctttcctta cccctccctt tgtatcatcc 28500
gcaggaatgc aagaaaatcc agctggggtg ctgtccctgc acctgtcaga gccccttacc 28560
acccacaatg gggccctgac tctaaaaatg gggggcggcc tgaccctgga caaggaaggg 28620
aatctcactt cccaaaacat caccagtgtc gatccccctc tcaaaaaaag caagaacaac 28680
atcagccttc agaccgccgc acccctcgcc gtcagctccg gggccctaac cctttttgcc 28740
actccccccc tagcggtcag tggcgacaac cttactgtgc agtctcaggc ccctcttact 28800
ttggaagact caaaactaac tctggccacc aaaggacccc taactgtgtc cgaaggcaaa 28860
cttgtcctag aaacagagcc tcccctgcat gcaagtgaca gcagtagcct gggccttagc 28920
gtcacggccc cacttagcat taacaatgac agcctaggac tagacatgca agcgcccatc 28980
agctctcgag atggaaaact ggctctaaca gtggcggccc ccctaactgt ggccgagggt 29040
atcaatgctt tggcagtagc cacaggtaat ggtattggac taaatgaaac caacacacac 29100
ctgcaggcaa aactggtcgc gcccctaggc tttgatacca acggcaacat taagctaagc 29160
gtcgcaggag gcatgaggct aaacaataac acactgatac tagatgtaaa ctacccattt 29220
gaggctcaag gccaactgag cctaagagtg ggctcgggcc cactatatgt agattctagt 29280
agtcataacc taaccattag atgccttagg ggattgtatg taacatcttc taacaaccaa 29340
aacggtctag aggccaacat taaactaaca aaaggccttg tgtatgacgg aaatgccata 29400
gcagttaatg ttggcaaagg gctggaatac agccctactg gcacaacaga aaaacctata 29460
cagactaaaa taggtctagg catggagtat gacactgagg gagccatgat gacaaaacta 29520
ggctctggac taagctttga caattcagga gccattgtgg tgggaaacaa aaatgatgac 29580
aggcttactt tgtggaccac accggaccca tcgcccaact gtcagattta ctctgaaaaa 29640
gatgctaaac taaccttggt actgactaaa tgtggcagtc aggttgtagg cacagtatct 29700
attgccgctc ttaaaggtag ccttgtgcca atcactagtg caatcagtgt ggttcagata 29760
tacctaaggt ttgatgaaaa tggggtgctg atgagtaact cttcacttaa tggcgaatac 29820
tggaatttta gaaacggaga ctcaactaat ggcacaccat atacaaacgc agtgggtttt 29880
atgcctaatc tactggccta tcctaaaggt caaactacaa ctgcaaaaag taacattgtc 29940
agccaggtct acatgaacgg ggacgatact aaacccatga catttacaat caacttcaat 30000
ggccttagtg aaacagggga tacccctgtc agtaaatatt ccatgacatt ctcatggagg 30060
tggccaaatg gaagctacat agggcacaat tttgtaacaa actcctttac tttctcctac 30120
atcgcccaag aataaagaaa gcacagagat gcttgttttt gatttcaaaa ttgtgtgctt 30180
ttatttattt tcaagcttac agtatttcca gtagtcatta gaatagagct taattaaact 30240
gcatgagaac ccttccacat agcttaaatt atcaccagtg caaatggaaa aaaatcaaca 30300
taccttttta tccagatatc aaagaactct agtggtcagt tttcccccac cctcccagct 30360
cacagaatac acagtccttt ccccccggct ggctttaaac aacactatct cattggtaac 30420
agacatattt ttaggtgtaa taatccacac ggtctcttgg cgggccaaac gctggtctgt 30480
gatgttaata aactccccag gcagctcttt caagttcacg tcgctgtcca actgctgaag 30540
cgctcgcggc tccgactgcg cctctagcgg aggcaacggc agcacccgat ccttgatcta 30600
taaaggagta gagtcataat cccccataag aatagggcgg tgatgcagca acaaggcgcg 30660
cagcaactcc tgccgccgcc tctccgtacg acaggaatgc aacggggtgg tggtctcctc 30720
cgcgataatc cgcaccgctc gcagcatcag catcctcgtc ctccgggcac agcagcgcat 30780
cctgatctca ctgagatcgg cgcagtaagt gcagcacaac accaagatgt tatttaagat 30840
cccacagtgc aaagcactgt acccaaagct catggcggga aggacagccc ccacgtgacc 30900
atcgtaccag atcctcaggt aaatcaaatg acgacctctc ataaacacgc tggacatata 30960
catcacctcc ttgggcatga gctgattcac cacctctcga taccacaggc atcgctgatt 31020
aattaaagac ccctcgagca ccatcctgaa ccaggaagcc agcacctgac cccccgccag 31080
gcactgcagg gaccccggtg aatcgcagtg gcagtgaaga ctccagcgct cgtagccgtg 31140
aaccatagag ctggtcatta tatccacatt ggcacaacac agacacactt tcatacactt 31200
tttcatgatt agcagctcct ctctagtcaa gaccatatcc caaggaatca cccactcttg 31260
aatcaaggta aatcccacac agcagggcag gcctctcaca taactcacgt tatgcatagt 31320
gagcgtgtcg caatctggaa ataccggatg atcttccatc accgaagccc gggtctccgt 31380
ctcaaaggga ggtaaacggt ccctcgtgta gggacagtgg cgggataatc gagatcgtgt 31440
tgaacgtaga gtcatgccaa agggaacagc ggacgtactc atatttcctc cagcagaacc 31500
aagtgcgcgc gtggcagcta tccctgcgtc ttctgtctcg ccgcctgccc cgctcggtgt 31560
agtagttgta atacagccac tccctcagac cgtcaaggcg ctccctggcg tccggatcta 31620
taacaacacc gtcctgcagc gccgccctga tgacatccac caccgtagag tatgccaagc 31680
ccagccacga aatgcactca ctttgacagc gagagatagg aggagcggga agagatggaa 31740
gaaccatgat agtaaaagaa cttttattcc aatcgatcct ctacaatgtc aaagtgtaga 31800
tctatcagat ggcactggtc tcctccgctg agtcgatcaa aaataacagc taaaccacaa 31860
acaacacgat tggtcaaatg ctgcacaagg gcttgcagca taaaatcgcc tcgaaagtcc 31920
accgcaagca taacatcaaa gccaccgccc ctatcatgat ctatgataaa aaccccacag 31980
ctatccacca gacccatata gttttcatct ctccatcgtg aaaaaatatt tacaagctcc 32040
tcctttaaat cacctccaac caattcaaaa agttgagcca gaccgccctc caccttcatt 32100
ttcagcatgc gcatcatgat tgcaaaaatt caggctcctc agacacctgt ataagattga 32160
gaagcggaac gttaacatca atgtttcgct cgcgaagatc gcgcctcagt gcaagcatga 32220
tataatccca caggtcggag cggatcagcg aggacatctc cccgccagga accaactcaa 32280
cggagcctat gctgattata atacgcatat tcggggctat gctaaccagc acggccccca 32340
aataggcgta ctgcataggc ggcgacaaaa agtgaacagt ttgggttaaa aaatcaggca 32400
aacactcgcg caaaaaagca agaacatcat aaccatgctc atgcaaatag atgcaagtaa 32460
gctcaggaac gaccacagaa aaatgcacaa tttttctctc aaacatgact gcgagccctg 32520
caaaaaataa aaaagaaaca ttacacaaga gtagcctgtc ttacaatggg atagactact 32580
ctaaccaaca taagacgggc cacgacatcg cccgcgtggc cataaaaaaa attatccgtg 32640
tgattaaaaa gaagcacaga tagctggcca gtcatatccg gagtcatcac gtgcgaaccc 32700
gtgtagaccc ccgggttgga cacatcggcc aaacaaagaa agcggccaat gtatcccgga 32760
ggaatgataa cactaagacg aagatacaac agaataaccc catggggggg aataacaaag 32820
ttagtaggtg aataaaaacg ataaacaccc gaaactccct cctgcgtagg caaaatagcg 32880
ccctcccctt ccaaaacaac atacagcgct tccacagcag ccatgacaaa agactcaaaa 32940
cactcaaaag actcagtctt accaggaaaa taaaagcact ctcacagcac cagcactaat 33000
cagagtgtga agagggccaa gtgccgaacg agtatatata ggaattaaaa atgacgtaaa 33060
tgtgtaaagg tcaaaaaacg cccagaaaaa tacacagacc aacgcccgaa acgaaaaccc 33120
gcgaaaaaat acccagaagt tcctcaacaa ccgccacttc cgctttccca cgatacgtca 33180
cttcctcaaa aatagcaaac tacatttccc acatgtacaa aaccaaaacc cctccccttg 33240
tcaccgccca caacttacat aatcacaaac gtcaaagcct acgtcacccg ccccgcctcg 33300
ccccgcccac ctcattatca tattggcctc aatccaaaat aaggtatatt attgatgatg 33360
<210> 74
<211> 500
<212> PRT
<213> Human immunodeficiency virus
<400> 74
Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Lys Trp
1 5 10 15
Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys
20 25 30
His Ile Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro
35 40 45
Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gln Ile Leu Gly Gln Leu
50 55 60
Gln Pro Ser Leu Gln Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn
65 70 75 80
Thr Val Ala Thr Leu Tyr Cys Val His Gln Lys Ile Asp Val Lys Asp
85 90 95
Thr Lys Glu Ala Leu Glu Lys Ile Glu Glu Glu Gln Asn Lys Ser Lys
100 105 110
Lys Lys Ala Gln Gln Ala Ala Ala Gly Thr Gly Asn Ser Ser Gln Val
115 120 125
Ser Gln Asn Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His
130 135 140
Gln Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu
145 150 155 160
Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Ser Ala Leu Ser
165 170 175
Glu Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly
180 185 190
Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Thr Ile Asn Glu Glu
195 200 205
Ala Ala Glu Trp Asp Arg Leu His Pro Val His Ala Gly Pro Ile Ala
210 215 220
Pro Gly Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr
225 230 235 240
Ser Thr Leu Gln Glu Gln Ile Gly Trp Met Thr Asn Asn Pro Pro Ile
245 250 255
Pro Val Gly Glu Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys
260 265 270
Ile Val Arg Met Tyr Ser Pro Thr Ser Ile Leu Asp Ile Arg Gln Gly
275 280 285
Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu
290 295 300
Arg Ala Glu Gln Ala Ser Gln Glu Val Lys Asn Trp Met Thr Glu Thr
305 310 315 320
Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Lys Ala
325 330 335
Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly
340 345 350
Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser
355 360 365
Gln Val Thr Asn Ser Ala Thr Ile Met Met Gln Arg Gly Asn Phe Arg
370 375 380
Asn Gln Arg Lys Thr Val Lys Cys Phe Asn Cys Gly Lys Val Gly His
385 390 395 400
Ile Ala Lys Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys
405 410 415
Gly Lys Glu Gly His Gln Met Lys Asp Cys Asn Glu Arg Gln Ala Asn
420 425 430
Phe Leu Gly Lys Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe
435 440 445
Leu Gln Ser Arg Pro Glu Pro Thr Ala Pro Pro Glu Glu Ser Phe Arg
450 455 460
Phe Gly Glu Glu Lys Thr Thr Pro Ser Gln Lys Gln Glu Pro Ile Asp
465 470 475 480
Lys Glu Leu Tyr Pro Leu Ala Ser Leu Arg Ser Leu Phe Gly Asn Asp
485 490 495
Pro Ser Ser Gln
500
Claims (15)
- 다음을 포함하는 아데노바이러스 헥손 단백질을 인코딩하는 단리된 폴리뉴클레오타이드:
A) (i) 서열번호 11에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 27 번이 A가 아닌 그의 변이체를 포함하는 HVR1,
(ii) 서열번호 12에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 1 번이 L이 아닌 그의 변이체를 포함하는 HVR2,
(iii) 서열번호 13에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 7 번이 V가 아닌 그의 변이체를 포함하는 HVR3,
(iv) 서열번호 14에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 HVR4,
(v) 서열번호 15에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 HVR5,
(vi) 서열번호 16에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 HVR6, 및
(vii) 서열번호 17에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 1 번이 I가 아닌 그의 변이체를 포함하는 HVR7; 또는
B) (i) 서열번호 18에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 8 번이 V가 아니고/아니거나, 12 번이 D가 아니고/아니거나, 13 번이 E가 아니고/아니거나, 14 번이 L이 아닌 그의 변이체를 포함하는 HVR1,
(ii) 서열번호 19에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 10 번이 D가 아닌 그의 변이체를 포함하는 HVR2,
(iii) 서열번호 20에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 6 번이 T가 아닌 그의 변이체를 포함하는 HVR3,
(iv) 서열번호 21에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 9 번이 L이 아닌 그의 변이체를 포함하는 HVR4,
(v) 서열번호 22에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 3 번이 T가 아닌 그의 변이체를 포함하는 HVR5,
(vi) 서열번호 23에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 9 번이 I가 아닌 그의 변이체를 포함하는 HVR6, 및
(vii) 서열번호 24에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 8 번이 I가 아닌 그의 변이체를 포함하는 HVR7; 또는
C) (i) 서열번호 25에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 HVR1,
(ii) 서열번호 26에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 HVR2,
(iii) 서열번호 27에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 7 번이 V가 아닌 그의 변이체를 포함하는 HVR3,
(iv) 서열번호 28에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 10 번이 E가 아닌 그의 변이체를 포함하는 HVR4,
(v) 서열번호 29에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 3 번이 T가 아닌 그의 변이체를 포함하는 HVR5,
(vi) 서열번호 30에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 9 번이 I가 아닌 그의 변이체를 포함하는 HVR6, 및
(vii) 서열번호 31에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 8 번이 I가 아니고/아니거나, 11 번이 T가 아닌 그의 변이체를 포함하는 HVR7; 또는
D) (i) 서열번호 32에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 HVR1,
(ii) 서열번호 33에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 HVR2,
(iii) 서열번호 34에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 6 번이 T가 아닌 그의 변이체를 포함하는 HVR3,
(iv) 서열번호 35에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 6 번이 Q가 아니고/아니거나, 10 번이 E가 아닌 그의 변이체를 포함하는 HVR4,
(v) 서열번호 36에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 3 번이 T가 아닌 그의 변이체를 포함하는 HVR5,
(vi) 서열번호 37에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 1 번이 K가 아닌 그의 변이체를 포함하는 HVR6, 및
(vii) 서열번호 38에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 8 번이 I가 아닌 그의 변이체를 포함하는 HVR7; 또는
E) (i) 서열번호 39에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 27 번이 A가 아닌 그의 변이체를 포함하는 HVR1,
(ii) 서열번호 40에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 HVR2,
(iii) 서열번호 41에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 HVR3,
(iv) 서열번호 42에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 HVR4,
(v) 서열번호 43에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 HVR5,
(vi) 서열번호 44에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 HVR6, 및
(vii) 서열번호 45에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖고 1 번이 I가 아닌 그의 변이체를 포함하는 HVR7. - 제1항에 있어서, 헥손 단백질은 다음을 포함하는, 단리된 폴리뉴클레오타이드:
A) 서열번호 46에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체,
B) 서열번호 47에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체,
C) 서열번호 48에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체,
D) 서열번호 49에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체, 및/또는
E) 서열번호 50에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체. - 제1항 또는 제2항에 있어서, 서열번호 51 또는 52에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 아데노바이러스 펜톤 단백질을 추가로 인코딩하는, 단리된 폴리뉴클레오타이드.
- 제1항 내지 제3항 중 어느 한 항에 있어서, 서열번호 53 또는 54에 따른 아미노산 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 아데노바이러스 섬유 단백질을 추가로 인코딩하는, 단리된 폴리뉴클레오타이드.
- 제1항 내지 제4항 중 어느 한 항에 있어서, 서열번호 57에 따른 뉴클레오타이드 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 VA RNA II 비코딩 RNA; 및/또는 서열번호 55 또는 56에 따른 뉴클레오타이드 서열, 또는 적어도 85% 서열 동일성을 갖는 그의 변이체를 포함하는 VA RNA I 비코딩 RNA를 추가로 인코딩하는, 단리된 폴리뉴클레오타이드.
- 제1항 내지 제5항 중 어느 한 항의 폴리뉴클레오타이드를 포함하는 아데노바이러스, 바람직하게는 복제-불능 아데노바이러스를 인코딩하는, 단리된 폴리뉴클레오타이드.
- 제6항에 있어서, 아데노바이러스는 키메라 아데노바이러스이고/이거나, 비-아데노바이러스 유전자, 단백질 또는 그의 단편을 보유하는, 단리된 폴리뉴클레오타이드.
- 제1항 내지 제4항 중 어느 한 항의 단리된 폴리뉴클레오타이드에 의해 인코딩되는 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드.
- 제1항 내지 제7항 중 어느 한 항에 따른 단리된 폴리뉴클레오타이드 및/또는 제8항에 따른 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드를 포함하는 단리된 아데노바이러스, 바람직하게는 복제-불능 아데노바이러스.
- 제1항 내지 제7항 중 어느 한 항의 단리된 폴리뉴클레오타이드에 의해 인코딩되는 바이러스-유사 입자.
- 제1항 내지 제7항 중 어느 한 항의 단리된 폴리뉴클레오타이드를 포함하는 벡터.
- (i) 애주번트; (ii) 제1항 내지 제7항 중 어느 한 항의 단리된 폴리뉴클레오타이드, 제8항의 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드, 제9항의 아데노바이러스, 제10항의 바이러스-유사 입자, 또는 제11항의 벡터; 및 선택적으로 (iii) 약제학적으로 허용 가능한 부형제를 포함하는 조성물.
- 제1항 내지 제7항 중 어느 한 항의 폴리뉴클레오타이드, 제8항의 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드, 제9항의 아데노바이러스, 제10항의 바이러스-유사 입자, 또는 제11항의 벡터를 포함하는 세포.
- 질병의 치료 또는 예방에 사용하기 위한 제1항 내지 제7항 중 어느 한 항의 폴리뉴클레오타이드, 제8항의 적어도 하나의 단리된 아데노바이러스 캡시드 폴리펩타이드, 제9항의 아데노바이러스, 제10항의 바이러스-유사 입자, 또는 제11항의 벡터 및/또는 제12항의 조성물.
- (i) 제1항 내지 제7항 중 어느 한 항의 단리된 폴리뉴클레오타이드를 세포에서 발현시켜, 아데노바이러스 또는 아데노바이러스-유사 입자가 세포내에서 조립되는 단계; 및
(ii) 세포 또는 세포 주위의 배지로부터 아데노바이러스 또는 아데노바이러스-유사 입자를 단리하는 단계
를 포함하는 아데노바이러스 또는 아데노바이러스-유사 입자를 생성하는 시험관내(in vitro) 방법.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP17179825 | 2017-07-05 | ||
EP17179825.9 | 2017-07-05 | ||
PCT/EP2018/068291 WO2019008111A1 (en) | 2017-07-05 | 2018-07-05 | AMINO ACID AND NUCLEIC ACID SEQUENCES OF ADENOVIRUS OF NON-HUMAN GREEN APES, VECTORS CONTAINING SAME, AND USES THEREOF |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20200024296A true KR20200024296A (ko) | 2020-03-06 |
KR102582561B1 KR102582561B1 (ko) | 2023-09-26 |
Family
ID=59295008
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020207003223A KR102582561B1 (ko) | 2017-07-05 | 2018-07-05 | 비인간 대형 유인원 아데노바이러스 핵산-서열 및 아미노산-서열, 이를 포함하는 벡터 및 그의 용도 |
Country Status (18)
Country | Link |
---|---|
US (1) | US11098324B2 (ko) |
EP (1) | EP3649237B1 (ko) |
JP (1) | JP7274222B2 (ko) |
KR (1) | KR102582561B1 (ko) |
CN (1) | CN111108192B (ko) |
AU (1) | AU2018295421B2 (ko) |
BR (1) | BR112020000145A2 (ko) |
CA (1) | CA3066962A1 (ko) |
DK (1) | DK3649237T3 (ko) |
ES (1) | ES2906441T3 (ko) |
HU (1) | HUE058778T2 (ko) |
IL (1) | IL271835B2 (ko) |
MX (1) | MX2020000221A (ko) |
PL (1) | PL3649237T3 (ko) |
PT (1) | PT3649237T (ko) |
RU (1) | RU2762854C2 (ko) |
SG (1) | SG11201913178WA (ko) |
WO (1) | WO2019008111A1 (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021172908A1 (ko) | 2020-02-27 | 2021-09-02 | ㈜아모레퍼시픽 | 사용감이 개선된 조성물 |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3587581A1 (en) | 2018-06-26 | 2020-01-01 | GlaxoSmithKline Biologicals S.A. | Formulations for simian adenoviral vectors having enhanced storage stability |
TW202043256A (zh) | 2019-01-10 | 2020-12-01 | 美商健生生物科技公司 | 前列腺新抗原及其用途 |
AU2020385683A1 (en) | 2019-11-18 | 2022-06-30 | Janssen Biotech, Inc. | Vaccines based on mutant CALR and JAK2 and their uses |
TW202144389A (zh) | 2020-02-14 | 2021-12-01 | 美商健生生物科技公司 | 在多發性骨髓瘤中表現之新抗原及其用途 |
TW202144388A (zh) | 2020-02-14 | 2021-12-01 | 美商健生生物科技公司 | 在卵巢癌中表現之新抗原及其用途 |
WO2021209897A1 (en) | 2020-04-13 | 2021-10-21 | Janssen Biotech, Inc. | Psma and steap1 vaccines and their uses |
EP4150344A1 (en) | 2020-05-14 | 2023-03-22 | GlaxoSmithKline Biologicals SA | Viral biosensors |
TW202208398A (zh) * | 2020-07-01 | 2022-03-01 | 義大利商萊伊錫拉有限責任公司 | 大猩猩腺病毒核酸及胺基酸序列,包含彼之載體,及其用途 |
EP4177347A1 (en) * | 2020-07-06 | 2023-05-10 | Jiaxing Anyu Biotechnology Co., Ltd | Novel chimpanzee adenovirus vector, construction method therefor, and application thereof |
EP4176087A1 (en) | 2020-07-06 | 2023-05-10 | Janssen Biotech, Inc. | A method for determining responsiveness to prostate cancer treatment |
WO2022009049A1 (en) | 2020-07-06 | 2022-01-13 | Janssen Biotech, Inc. | Prostate neoantigens and their uses |
WO2022009052A2 (en) | 2020-07-06 | 2022-01-13 | Janssen Biotech, Inc. | Prostate neoantigens and their uses |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2006040330A2 (en) * | 2004-10-13 | 2006-04-20 | Crucell Holland B.V. | Improved adenoviral vectors and uses thereof |
WO2010051367A1 (en) * | 2008-10-31 | 2010-05-06 | The Trustees Of The University Of Pennsylvania | Simian adenoviruses sadv-43, -45,-48,-49, and -50 and uses thereof |
WO2013116591A1 (en) * | 2012-02-02 | 2013-08-08 | Genvec, Inc. | Adenoviral vector-based malaria vaccine |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE69534166T2 (de) | 1994-10-28 | 2006-03-09 | Trustees Of The University Of Pennsylvania | Rekombinanter adenovirus und methoden zu dessen verwendung |
US5922315A (en) | 1997-01-24 | 1999-07-13 | Genetic Therapy, Inc. | Adenoviruses having altered hexon proteins |
US20030224372A1 (en) | 2002-05-31 | 2003-12-04 | Denise Syndercombe-Court | Method for determining ethnic origin by means of STR profile |
CA2553541C (en) | 2004-01-23 | 2015-04-21 | Istituto Di Ricerche Di Biologia Molecolare P. Angeletti S.P.A. | Chimpanzee adenovirus vaccine carriers |
JP4843613B2 (ja) | 2004-10-13 | 2011-12-21 | クルセル ホランド ベー ヴェー | 改良されたアデノウイルスベクターおよびその使用方法 |
JP2008538894A (ja) | 2005-02-11 | 2008-11-13 | メルク エンド カムパニー インコーポレーテッド | アデノウイルス血清型26ベクター、核酸およびそれにより製造されたウイルス |
JP5882741B2 (ja) * | 2009-02-02 | 2016-03-09 | グラクソスミスクライン バイオロジカルズ ソシエテ アノニム | サルアデノウイルスの核酸配列及びアミノ酸配列、それを含有するベクター、並びにその使用 |
WO2010085984A1 (en) * | 2009-02-02 | 2010-08-05 | Okairos Ag | Simian adenovirus nucleic acid- and amino acid-sequences, vectors containing same, and uses thereof |
WO2012023995A1 (en) * | 2010-08-18 | 2012-02-23 | Takayuki Shiratsuchi | Modification of recombinant adenovirus capsid protein with immunogenic plasmodium circumsporozoite protein epitopes |
US20140348791A1 (en) * | 2011-09-09 | 2014-11-27 | Beth Israel Deaconess Medical Center, Inc. | Modified adenoviral vectors and methods of treatment using same |
JP6757119B2 (ja) | 2011-10-05 | 2020-09-16 | ジェンヴェック エルエルシー | アーフェンアデノウイルス(ゴリラ)又はアデノウイルスベクター、及び使用方法 |
CN103966263A (zh) * | 2013-02-04 | 2014-08-06 | 广州医学院第一附属医院 | 一种重组人3型腺病毒及其制备方法和应用 |
CN104419717B (zh) * | 2013-08-23 | 2018-04-27 | 长春百克生物科技股份公司 | 逃避预存免疫的重组腺病毒及其构建方法和用途 |
WO2019118480A1 (en) * | 2017-12-11 | 2019-06-20 | Beth Israel Deaconess Medical Center, Inc. | Recombinant adenoviruses and uses thereof |
TW202208398A (zh) * | 2020-07-01 | 2022-03-01 | 義大利商萊伊錫拉有限責任公司 | 大猩猩腺病毒核酸及胺基酸序列,包含彼之載體,及其用途 |
-
2018
- 2018-07-05 US US16/626,438 patent/US11098324B2/en active Active
- 2018-07-05 BR BR112020000145-7A patent/BR112020000145A2/pt unknown
- 2018-07-05 HU HUE18737898A patent/HUE058778T2/hu unknown
- 2018-07-05 SG SG11201913178WA patent/SG11201913178WA/en unknown
- 2018-07-05 CN CN201880045366.7A patent/CN111108192B/zh active Active
- 2018-07-05 JP JP2020500165A patent/JP7274222B2/ja active Active
- 2018-07-05 RU RU2019144161A patent/RU2762854C2/ru active
- 2018-07-05 DK DK18737898.9T patent/DK3649237T3/da active
- 2018-07-05 AU AU2018295421A patent/AU2018295421B2/en active Active
- 2018-07-05 EP EP18737898.9A patent/EP3649237B1/en active Active
- 2018-07-05 PT PT187378989T patent/PT3649237T/pt unknown
- 2018-07-05 MX MX2020000221A patent/MX2020000221A/es unknown
- 2018-07-05 ES ES18737898T patent/ES2906441T3/es active Active
- 2018-07-05 WO PCT/EP2018/068291 patent/WO2019008111A1/en unknown
- 2018-07-05 KR KR1020207003223A patent/KR102582561B1/ko active IP Right Grant
- 2018-07-05 PL PL18737898T patent/PL3649237T3/pl unknown
- 2018-07-05 CA CA3066962A patent/CA3066962A1/en active Pending
-
2020
- 2020-01-05 IL IL271835A patent/IL271835B2/en unknown
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2006040330A2 (en) * | 2004-10-13 | 2006-04-20 | Crucell Holland B.V. | Improved adenoviral vectors and uses thereof |
WO2010051367A1 (en) * | 2008-10-31 | 2010-05-06 | The Trustees Of The University Of Pennsylvania | Simian adenoviruses sadv-43, -45,-48,-49, and -50 and uses thereof |
WO2013116591A1 (en) * | 2012-02-02 | 2013-08-08 | Genvec, Inc. | Adenoviral vector-based malaria vaccine |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021172908A1 (ko) | 2020-02-27 | 2021-09-02 | ㈜아모레퍼시픽 | 사용감이 개선된 조성물 |
Also Published As
Publication number | Publication date |
---|---|
NZ759892A (en) | 2023-09-29 |
RU2019144161A (ru) | 2021-08-05 |
RU2019144161A3 (ko) | 2021-08-05 |
SG11201913178WA (en) | 2020-01-30 |
US20210130848A1 (en) | 2021-05-06 |
JP2020530761A (ja) | 2020-10-29 |
IL271835A (en) | 2020-02-27 |
RU2762854C2 (ru) | 2021-12-23 |
KR102582561B1 (ko) | 2023-09-26 |
US11098324B2 (en) | 2021-08-24 |
EP3649237A1 (en) | 2020-05-13 |
BR112020000145A2 (pt) | 2020-07-14 |
CA3066962A1 (en) | 2019-01-10 |
AU2018295421A1 (en) | 2020-01-02 |
ES2906441T3 (es) | 2022-04-18 |
CN111108192B (zh) | 2023-12-15 |
JP7274222B2 (ja) | 2023-05-16 |
IL271835B2 (en) | 2023-08-01 |
PT3649237T (pt) | 2022-02-14 |
DK3649237T3 (da) | 2022-02-21 |
EP3649237B1 (en) | 2022-01-19 |
PL3649237T3 (pl) | 2022-03-28 |
WO2019008111A1 (en) | 2019-01-10 |
IL271835B1 (en) | 2023-04-01 |
HUE058778T2 (hu) | 2022-09-28 |
AU2018295421B2 (en) | 2024-01-25 |
CN111108192A (zh) | 2020-05-05 |
MX2020000221A (es) | 2020-08-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
RU2762854C2 (ru) | Последовательности нуклеиновых кислот и аминокислотные последовательности аденовирусов человекообразных обезьян, исключая человека, содержащие их векторы, и их применения | |
KR102535670B1 (ko) | 아데노바이러스 폴리뉴클레오티드 및 폴리펩티드 | |
KR101761425B1 (ko) | 시미안 아데노바이러스 핵산- 및 아미노산-서열, 이를 포함하는 벡터 및 이의 용도 | |
AU2019204982B2 (en) | Recombinant HCMV and RhCMV Vectors and Uses Thereof | |
KR102205908B1 (ko) | 아데노바이러스 벡터 | |
BE1023916B1 (fr) | Nouvel adenovirus | |
KR101957616B1 (ko) | 아데노바이러스 조립 방법 | |
US20040136963A1 (en) | Simian adenovirus vectors and methods of use | |
AU2022203504A1 (en) | Oncolytic tumor viruses and methods of use | |
AU2017258857A1 (en) | Subfamily E simian adenovirus A1309, A1321, A1325, A1295 and A1322 and uses thereof | |
KR102471633B1 (ko) | 바이러스 동역학에 미치는 영향 최소화를 위한 치료용 아데노바이러스의 외인성 유전자 발현 | |
KR102403547B1 (ko) | 외인성 항원을 포함하는 인간 시토메갈로바이러스 | |
CN107574154A (zh) | 猴(大猩猩)腺病毒或腺病毒载体及其使用方法 | |
CN107937440A (zh) | 猴腺病毒(大猩猩)或腺病毒载体及其使用方法 | |
KR20200066349A (ko) | 복제 가능 아데노바이러스 벡터 | |
JP2023145678A (ja) | エプスタインバールウイルス抗原構築物 | |
KR20200083510A (ko) | 아데노바이러스 및 이의 용도 | |
KR20230031929A (ko) | 고릴라 아데노바이러스 핵산 서열 및 아미노산 서열, 이들을 함유하는 벡터, 및 이의 용도 | |
CN113897388A (zh) | 一种新型黑猩猩腺病毒载体及其构建方法和应用 | |
CN116940589A (zh) | 重组sars-cov-2疫苗 | |
CN116323955A (zh) | 通过crispr/cas介导的体内末端解析拯救重组腺病毒 | |
NL2023464B1 (en) | Oncolytic Non-human adenoviruses and uses thereof | |
RU2800361C2 (ru) | Стабильные составы цитомегаловируса | |
DK2391638T3 (en) | Abeadenovirus nucleic acid and amino acid sequences, vectors containing them, and uses thereof. | |
KR20230146436A (ko) | 집단의 재접종을 위한 중증 급성 호흡기 증후군 바이러스 sars-cov-2에 대한 특이 면역의 유도를 위한 제제(변형체)의 용도 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right | ||
GRNT | Written decision to grant |