CN102300872A - Simian adenovirus nucleic acid and amino acid sequences, vectors containing the same, and uses thereof - Google Patents
- Publication number
- CN102300872A; CN201080006197XA
- Authority
- CN
- China
- Prior art keywords
- adenovirus
- leu
- thr
- ser
- asn
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 239000013598 vector Substances 0.000 title abstract description 21
- 241000990167 unclassified Simian adenoviruses Species 0.000 title description 4
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 title 1
- 241000701161 unidentified adenovirus Species 0.000 claims abstract description 330
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 275
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 275
- 239000002157 polynucleotide Substances 0.000 claims abstract description 275
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 81
- 229920001184 polypeptide Polymers 0.000 claims abstract description 66
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 66
- 239000012634 fragment Substances 0.000 claims abstract description 56
- CXURGFRDGROIKG-UHFFFAOYSA-N 3,3-bis(chloromethyl)oxetane Chemical compound ClCC1(CCl)COC1 CXURGFRDGROIKG-UHFFFAOYSA-N 0.000 claims abstract description 45
- NTIZESTWPVYFNL-UHFFFAOYSA-N Methyl isobutyl ketone Chemical compound CC(C)CC(C)=O NTIZESTWPVYFNL-UHFFFAOYSA-N 0.000 claims abstract description 21
- 201000010099 disease Diseases 0.000 claims abstract description 10
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims abstract description 10
- 108090000623 proteins and genes Proteins 0.000 claims description 174
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 107
- 235000001014 amino acid Nutrition 0.000 claims description 82
- 230000008034 disappearance Effects 0.000 claims description 81
- 150000001413 amino acids Chemical class 0.000 claims description 76
- 241000700605 Viruses Species 0.000 claims description 71
- 102000004169 proteins and genes Human genes 0.000 claims description 63
- 235000018102 proteins Nutrition 0.000 claims description 60
- 241001135569 Human adenovirus 5 Species 0.000 claims description 37
- 108091007433 antigens Proteins 0.000 claims description 35
- 102000036639 antigens Human genes 0.000 claims description 35
- 239000000427 antigen Substances 0.000 claims description 32
- 210000000234 capsid Anatomy 0.000 claims description 26
- 210000004907 gland Anatomy 0.000 claims description 19
- 239000000203 mixture Substances 0.000 claims description 18
- 101710087110 ORF6 protein Proteins 0.000 claims description 17
- 101710095001 Uncharacterized protein in nifU 5'region Proteins 0.000 claims description 17
- 108700026758 Adenovirus hexon capsid Proteins 0.000 claims description 16
- 239000002671 adjuvant Substances 0.000 claims description 16
- 108700025771 adenovirus penton Proteins 0.000 claims description 15
- -1 TLR-6 Proteins 0.000 claims description 14
- 238000006073 displacement reaction Methods 0.000 claims description 14
- 230000008859 change Effects 0.000 claims description 12
- 238000001415 gene therapy Methods 0.000 claims description 12
- 230000037431 insertion Effects 0.000 claims description 11
- 238000003780 insertion Methods 0.000 claims description 11
- 101000621943 Acholeplasma phage L2 Probable integrase/recombinase Proteins 0.000 claims description 10
- 101000768957 Acholeplasma phage L2 Uncharacterized 37.2 kDa protein Proteins 0.000 claims description 10
- 101000823746 Acidianus ambivalens Uncharacterized 17.7 kDa protein in bps2 3'region Proteins 0.000 claims description 10
- 101000916369 Acidianus ambivalens Uncharacterized protein in sor 5'region Proteins 0.000 claims description 10
- 101000769342 Acinetobacter guillouiae Uncharacterized protein in rpoN-murA intergenic region Proteins 0.000 claims description 10
- 101000823696 Actinobacillus pleuropneumoniae Uncharacterized glycosyltransferase in aroQ 3'region Proteins 0.000 claims description 10
- 101000786513 Agrobacterium tumefaciens (strain 15955) Uncharacterized protein outside the virF region Proteins 0.000 claims description 10
- 101000618005 Alkalihalobacillus pseudofirmus (strain ATCC BAA-2126 / JCM 17055 / OF4) Uncharacterized protein BpOF4_00885 Proteins 0.000 claims description 10
- 101000618348 Allochromatium vinosum (strain ATCC 17899 / DSM 180 / NBRC 103801 / NCIMB 10441 / D) Uncharacterized protein Alvin_0065 Proteins 0.000 claims description 10
- 102100020724 Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Human genes 0.000 claims description 10
- 101000781117 Autographa californica nuclear polyhedrosis virus Uncharacterized 12.4 kDa protein in CTL-LEF2 intergenic region Proteins 0.000 claims description 10
- 101000967489 Azorhizobium caulinodans (strain ATCC 43989 / DSM 5975 / JCM 20966 / LMG 6465 / NBRC 14845 / NCIMB 13405 / ORS 571) Uncharacterized protein AZC_3924 Proteins 0.000 claims description 10
- 101000708323 Azospirillum brasilense Uncharacterized 28.8 kDa protein in nifR3-like 5'region Proteins 0.000 claims description 10
- 101000770311 Azotobacter chroococcum mcd 1 Uncharacterized 19.8 kDa protein in nifW 5'region Proteins 0.000 claims description 10
- 101000823761 Bacillus licheniformis Uncharacterized 9.4 kDa protein in flaL 3'region Proteins 0.000 claims description 10
- 101000819719 Bacillus methanolicus Uncharacterized N-acetyltransferase in lysA 3'region Proteins 0.000 claims description 10
- 101000789586 Bacillus subtilis (strain 168) UPF0702 transmembrane protein YkjA Proteins 0.000 claims description 10
- 101000748761 Bacillus subtilis (strain 168) Uncharacterized MFS-type transporter YcxA Proteins 0.000 claims description 10
- 101000792624 Bacillus subtilis (strain 168) Uncharacterized protein YbxH Proteins 0.000 claims description 10
- 101000790792 Bacillus subtilis (strain 168) Uncharacterized protein YckC Proteins 0.000 claims description 10
- 101000765620 Bacillus subtilis (strain 168) Uncharacterized protein YlxP Proteins 0.000 claims description 10
- 101000819705 Bacillus subtilis (strain 168) Uncharacterized protein YlxR Proteins 0.000 claims description 10
- 101000916134 Bacillus subtilis (strain 168) Uncharacterized protein YqxJ Proteins 0.000 claims description 10
- 101000948218 Bacillus subtilis (strain 168) Uncharacterized protein YtxJ Proteins 0.000 claims description 10
- 101000718627 Bacillus thuringiensis subsp. kurstaki Putative RNA polymerase sigma-G factor Proteins 0.000 claims description 10
- 101000641200 Bombyx mori densovirus Putative non-structural protein Proteins 0.000 claims description 10
- 101000754349 Bordetella pertussis (strain Tohama I / ATCC BAA-589 / NCTC 13251) UPF0065 protein BP0148 Proteins 0.000 claims description 10
- 101000827633 Caldicellulosiruptor sp. (strain Rt8B.4) Uncharacterized 23.9 kDa protein in xynA 3'region Proteins 0.000 claims description 10
- 101000947628 Claviceps purpurea Uncharacterized 11.8 kDa protein Proteins 0.000 claims description 10
- 101000947633 Claviceps purpurea Uncharacterized 13.8 kDa protein Proteins 0.000 claims description 10
- 101000686796 Clostridium perfringens Replication protein Proteins 0.000 claims description 10
- 101000948901 Enterobacteria phage T4 Uncharacterized 16.0 kDa protein in segB-ipI intergenic region Proteins 0.000 claims description 10
- 101000805958 Equine herpesvirus 4 (strain 1942) Virion protein US10 homolog Proteins 0.000 claims description 10
- 101000790442 Escherichia coli Insertion element IS2 uncharacterized 11.1 kDa protein Proteins 0.000 claims description 10
- 101000788129 Escherichia coli Uncharacterized protein in sul1 3'region Proteins 0.000 claims description 10
- 101000788370 Escherichia phage P2 Uncharacterized 12.9 kDa protein in GpA 3'region Proteins 0.000 claims description 10
- 101000788354 Escherichia phage P2 Uncharacterized 8.2 kDa protein in gpA 5'region Proteins 0.000 claims description 10
- 101000770304 Frankia alni UPF0460 protein in nifX-nifW intergenic region Proteins 0.000 claims description 10
- 101000797344 Geobacillus stearothermophilus Putative tRNA (cytidine(34)-2'-O)-methyltransferase Proteins 0.000 claims description 10
- 101000748410 Geobacillus stearothermophilus Uncharacterized protein in fumA 3'region Proteins 0.000 claims description 10
- 101000787096 Geobacillus stearothermophilus Uncharacterized protein in gldA 3'region Proteins 0.000 claims description 10
- 101000772675 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) UPF0438 protein HI_0847 Proteins 0.000 claims description 10
- 101000631019 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) Uncharacterized protein HI_0350 Proteins 0.000 claims description 10
- 101000976889 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 19.2 kDa protein in cox-rep intergenic region Proteins 0.000 claims description 10
- 101000768938 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 8.9 kDa protein in int-C1 intergenic region Proteins 0.000 claims description 10
- 101000785414 Homo sapiens Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Proteins 0.000 claims description 10
- 101000782488 Junonia coenia densovirus (isolate pBRJ/1990) Putative non-structural protein NS2 Proteins 0.000 claims description 10
- 101000827627 Klebsiella pneumoniae Putative low molecular weight protein-tyrosine-phosphatase Proteins 0.000 claims description 10
- 101000811523 Klebsiella pneumoniae Uncharacterized 55.8 kDa protein in cps region Proteins 0.000 claims description 10
- 101000818409 Lactococcus lactis subsp. lactis Uncharacterized HTH-type transcriptional regulator in lacX 3'region Proteins 0.000 claims description 10
- 101000878851 Leptolyngbya boryana Putative Fe(2+) transport protein A Proteins 0.000 claims description 10
- 101000758828 Methanosarcina barkeri (strain Fusaro / DSM 804) Uncharacterized protein Mbar_A1602 Proteins 0.000 claims description 10
- 101001122401 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF3 Proteins 0.000 claims description 10
- 101001130841 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF5 Proteins 0.000 claims description 10
- 101001055788 Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) Pentapeptide repeat protein MfpA Proteins 0.000 claims description 10
- 101000740670 Orgyia pseudotsugata multicapsid polyhedrosis virus Protein C42 Proteins 0.000 claims description 10
- 101000769182 Photorhabdus luminescens Uncharacterized protein in pnp 3'region Proteins 0.000 claims description 10
- 101710159752 Poly(3-hydroxyalkanoate) polymerase subunit PhaE Proteins 0.000 claims description 10
- 101710130262 Probable Vpr-like protein Proteins 0.000 claims description 10
- 101000961392 Pseudescherichia vulneris Uncharacterized 29.9 kDa protein in crtE 3'region Proteins 0.000 claims description 10
- 101000731030 Pseudomonas oleovorans Poly(3-hydroxyalkanoate) polymerase 2 Proteins 0.000 claims description 10
- 101001065485 Pseudomonas putida Probable fatty acid methyltransferase Proteins 0.000 claims description 10
- 101000711023 Rhizobium leguminosarum bv. trifolii Uncharacterized protein in tfuA 3'region Proteins 0.000 claims description 10
- 101000974028 Rhizobium leguminosarum bv. viciae (strain 3841) Putative cystathionine beta-lyase Proteins 0.000 claims description 10
- 101000756519 Rhodobacter capsulatus (strain ATCC BAA-309 / NBRC 16581 / SB1003) Uncharacterized protein RCAP_rcc00048 Proteins 0.000 claims description 10
- 101000948219 Rhodococcus erythropolis Uncharacterized 11.5 kDa protein in thcD 3'region Proteins 0.000 claims description 10
- 101000948156 Rhodococcus erythropolis Uncharacterized 47.3 kDa protein in thcA 5'region Proteins 0.000 claims description 10
- 101000917565 Rhodococcus fascians Uncharacterized 33.6 kDa protein in fasciation locus Proteins 0.000 claims description 10
- 101000790284 Saimiriine herpesvirus 2 (strain 488) Uncharacterized 9.5 kDa protein in DHFR 3'region Proteins 0.000 claims description 10
- 101000936719 Streptococcus gordonii Accessory Sec system protein Asp3 Proteins 0.000 claims description 10
- 101000936711 Streptococcus gordonii Accessory secretory protein Asp4 Proteins 0.000 claims description 10
- 101000929863 Streptomyces cinnamonensis Monensin polyketide synthase putative ketoacyl reductase Proteins 0.000 claims description 10
- 101000788499 Streptomyces coelicolor Uncharacterized oxidoreductase in mprA 5'region Proteins 0.000 claims description 10
- 101000788468 Streptomyces coelicolor Uncharacterized protein in mprR 3'region Proteins 0.000 claims description 10
- 101001102841 Streptomyces griseus Purine nucleoside phosphorylase ORF3 Proteins 0.000 claims description 10
- 101000708557 Streptomyces lincolnensis Uncharacterized 17.2 kDa protein in melC2-rnhH intergenic region Proteins 0.000 claims description 10
- 101000845085 Streptomyces violaceoruber Granaticin polyketide synthase putative ketoacyl reductase 1 Proteins 0.000 claims description 10
- 101000649826 Thermotoga neapolitana Putative anti-sigma factor antagonist TM1081 homolog Proteins 0.000 claims description 10
- 101000711771 Thiocystis violacea Uncharacterized 76.5 kDa protein in phbC 3'region Proteins 0.000 claims description 10
- 101710110895 Uncharacterized 7.3 kDa protein in cox-rep intergenic region Proteins 0.000 claims description 10
- 101000711318 Vibrio alginolyticus Uncharacterized 11.6 kDa protein in scrR 3'region Proteins 0.000 claims description 10
- 101000827562 Vibrio alginolyticus Uncharacterized protein in proC 3'region Proteins 0.000 claims description 10
- 101000778915 Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633) Uncharacterized membrane protein VP2115 Proteins 0.000 claims description 10
- 238000011081 inoculation Methods 0.000 claims description 10
- 238000004321 preservation Methods 0.000 claims description 10
- 108090000695 Cytokines Proteins 0.000 claims description 9
- 102000004127 Cytokines Human genes 0.000 claims description 9
- 101000833492 Homo sapiens Jouberin Proteins 0.000 claims description 9
- 101000651236 Homo sapiens NCK-interacting protein with SH3 domain Proteins 0.000 claims description 9
- 102100024407 Jouberin Human genes 0.000 claims description 9
- 125000000539 amino acid group Chemical group 0.000 claims description 9
- 230000002265 prevention Effects 0.000 claims description 9
- 102100031725 Cortactin-binding protein 2 Human genes 0.000 claims description 5
- 101000908757 Human adenovirus C serotype 2 Early 4 ORF4 protein Proteins 0.000 claims description 5
- 101150032643 IVa2 gene Proteins 0.000 claims description 5
- 101710197985 Probable protein Rev Proteins 0.000 claims description 5
- 102100040307 Protein FAM3B Human genes 0.000 claims description 5
- 101710198378 Uncharacterized 10.8 kDa protein in cox-rep intergenic region Proteins 0.000 claims description 5
- 101710134973 Uncharacterized 9.7 kDa protein in cox-rep intergenic region Proteins 0.000 claims description 5
- 239000000556 agonist Substances 0.000 claims description 5
- 101001082397 Human adenovirus B serotype 3 Hexon-associated protein Proteins 0.000 claims description 4
- 101001120093 Pseudoalteromonas phage PM2 Protein P8 Proteins 0.000 claims description 4
- 108010024878 Adenovirus E1A Proteins Proteins 0.000 claims description 3
- 108010087905 Adenovirus E1B Proteins Proteins 0.000 claims description 3
- 108010057856 Adenovirus E2 Proteins Proteins 0.000 claims description 3
- 108010027410 Adenovirus E3 Proteins Proteins 0.000 claims description 3
- 108010056962 Adenovirus E4 Proteins Proteins 0.000 claims description 3
- 101710134784 Agnoprotein Proteins 0.000 claims description 3
- 241000193096 Human adenovirus B3 Species 0.000 claims description 3
- 101710124584 Probable DNA-binding protein Proteins 0.000 claims description 3
- 101710118538 Protease Proteins 0.000 claims description 3
- 101000669447 Homo sapiens Toll-like receptor 4 Proteins 0.000 claims description 2
- 101000669460 Homo sapiens Toll-like receptor 5 Proteins 0.000 claims description 2
- 101000669402 Homo sapiens Toll-like receptor 7 Proteins 0.000 claims description 2
- 108010060818 Toll-Like Receptor 9 Proteins 0.000 claims description 2
- 102000002689 Toll-like receptor Human genes 0.000 claims description 2
- 108020000411 Toll-like receptor Proteins 0.000 claims description 2
- 102100039360 Toll-like receptor 4 Human genes 0.000 claims description 2
- 102100039357 Toll-like receptor 5 Human genes 0.000 claims description 2
- 102100039390 Toll-like receptor 7 Human genes 0.000 claims description 2
- 102100033117 Toll-like receptor 9 Human genes 0.000 claims description 2
- 108091023040 Transcription factor Proteins 0.000 claims description 2
- 102000040945 Transcription factor Human genes 0.000 claims description 2
- 102000009310 vitamin D receptors Human genes 0.000 claims description 2
- 108050000156 vitamin D receptors Proteins 0.000 claims description 2
- 101710145505 Fiber protein Proteins 0.000 abstract description 12
- 108090000565 Capsid Proteins Proteins 0.000 abstract description 11
- 102100023321 Ceruloplasmin Human genes 0.000 abstract description 11
- 239000008194 pharmaceutical composition Substances 0.000 abstract description 10
- 101710094396 Hexon protein Proteins 0.000 abstract 1
- 101710173835 Penton protein Proteins 0.000 abstract 1
- 238000011321 prophylaxis Methods 0.000 abstract 1
- 238000002560 therapeutic procedure Methods 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 116
- 229940024606 amino acid Drugs 0.000 description 79
- 108020004414 DNA Proteins 0.000 description 59
- 108010050848 glycylleucine Proteins 0.000 description 51
- 229960005486 vaccine Drugs 0.000 description 27
- 239000002245 particle Substances 0.000 description 23
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 21
- 239000002773 nucleotide Substances 0.000 description 21
- 125000003729 nucleotide group Chemical group 0.000 description 21
- 241001217856 Chimpanzee adenovirus Species 0.000 description 20
- 108010093581 aspartyl-proline Proteins 0.000 description 20
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 19
- 241000725303 Human immunodeficiency virus Species 0.000 description 19
- 230000006870 function Effects 0.000 description 19
- 239000000835 fiber Substances 0.000 description 18
- 238000000034 method Methods 0.000 description 18
- 210000002966 serum Anatomy 0.000 description 18
- 241000699666 Mus <mouse, genus> Species 0.000 description 17
- 230000004044 response Effects 0.000 description 17
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 16
- 241001465754 Metazoa Species 0.000 description 16
- 230000003053 immunization Effects 0.000 description 16
- 108010054155 lysyllysine Proteins 0.000 description 16
- 239000013612 plasmid Substances 0.000 description 16
- 108010020532 tyrosyl-proline Proteins 0.000 description 15
- 241000880493 Leptailurus serval Species 0.000 description 14
- 108010077245 asparaginyl-proline Proteins 0.000 description 14
- 108010038320 lysylphenylalanine Proteins 0.000 description 14
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 13
- 241000282577 Pan troglodytes Species 0.000 description 13
- 108010044940 alanylglutamine Proteins 0.000 description 13
- 108010038633 aspartylglutamate Proteins 0.000 description 13
- 239000003814 drug Substances 0.000 description 13
- 108010049041 glutamylalanine Proteins 0.000 description 13
- 108010037850 glycylvaline Proteins 0.000 description 13
- 108010057821 leucylproline Proteins 0.000 description 13
- 108010003700 lysyl aspartic acid Proteins 0.000 description 13
- 230000004048 modification Effects 0.000 description 13
- 238000012986 modification Methods 0.000 description 13
- 239000000047 product Substances 0.000 description 13
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 12
- 241000894006 Bacteria Species 0.000 description 12
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 12
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 12
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 12
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 12
- 108010008355 arginyl-glutamine Proteins 0.000 description 12
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 12
- 108010064235 lysylglycine Proteins 0.000 description 12
- 230000003472 neutralizing effect Effects 0.000 description 12
- 108010051242 phenylalanylserine Proteins 0.000 description 12
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 11
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 11
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 11
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 11
- 108010047562 NGR peptide Proteins 0.000 description 11
- 206010028980 Neoplasm Diseases 0.000 description 11
- 241000282576 Pan paniscus Species 0.000 description 11
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 11
- 108010013835 arginine glutamate Proteins 0.000 description 11
- 108010047857 aspartylglycine Proteins 0.000 description 11
- 108010092854 aspartyllysine Proteins 0.000 description 11
- 108010089804 glycyl-threonine Proteins 0.000 description 11
- 230000036039 immunity Effects 0.000 description 11
- 108010061238 threonyl-glycine Proteins 0.000 description 11
- 108010051110 tyrosyl-lysine Proteins 0.000 description 11
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 10
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 10
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 10
- YLBUMXYVQCHBPR-ULQDDVLXSA-N Met-Leu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YLBUMXYVQCHBPR-ULQDDVLXSA-N 0.000 description 10
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 10
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 10
- 239000013604 expression vector Substances 0.000 description 10
- 238000002649 immunization Methods 0.000 description 10
- 108010034529 leucyl-lysine Proteins 0.000 description 10
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 9
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 9
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 9
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 9
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 9
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 9
- 241000124008 Mammalia Species 0.000 description 9
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 9
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 9
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 9
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 9
- 230000000890 antigenic effect Effects 0.000 description 9
- 108010068265 aspartyltyrosine Proteins 0.000 description 9
- 239000003795 chemical substances by application Substances 0.000 description 9
- 230000002950 deficient Effects 0.000 description 9
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 9
- 108010078144 glutaminyl-glycine Proteins 0.000 description 9
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 9
- 230000006801 homologous recombination Effects 0.000 description 9
- 238000002744 homologous recombination Methods 0.000 description 9
- 208000015181 infectious disease Diseases 0.000 description 9
- 108010079317 prolyl-tyrosine Proteins 0.000 description 9
- 108010070643 prolylglutamic acid Proteins 0.000 description 9
- 239000000523 sample Substances 0.000 description 9
- 108010071207 serylmethionine Proteins 0.000 description 9
- 239000000243 solution Substances 0.000 description 9
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 description 9
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 9
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 8
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 8
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 8
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 8
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 8
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 8
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 8
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 8
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 8
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 8
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 8
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 8
- 108010015792 glycyllysine Proteins 0.000 description 8
- 108010029020 prolylglycine Proteins 0.000 description 8
- 230000010076 replication Effects 0.000 description 8
- 238000006467 substitution reaction Methods 0.000 description 8
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 7
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 7
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 7
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 7
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 7
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 7
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 7
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 7
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 7
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 7
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 7
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 7
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 7
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 7
- 108010065920 Insulin Lispro Proteins 0.000 description 7
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 7
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 7
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 7
- 108091007491 NSP3 Papain-like protease domains Proteins 0.000 description 7
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 7
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 7
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 7
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 7
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 7
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 7
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 7
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 7
- 210000001744 T-lymphocyte Anatomy 0.000 description 7
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 7
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 7
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 7
- 239000000370 acceptor Substances 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 7
- 238000002474 experimental method Methods 0.000 description 7
- 230000002163 immunogen Effects 0.000 description 7
- 150000007523 nucleic acids Chemical class 0.000 description 7
- 239000003981 vehicle Substances 0.000 description 7
- 230000003612 virological effect Effects 0.000 description 7
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 6
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 6
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 6
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 6
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 6
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 6
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 6
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 6
- XUTOXNRSAGLAKO-UHFFFAOYSA-N Asn Val Asn Pro Chemical compound NC(=O)CC(N)C(=O)NC(C(C)C)C(=O)NC(CC(N)=O)C(=O)N1CCCC1C(O)=O XUTOXNRSAGLAKO-UHFFFAOYSA-N 0.000 description 6
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 6
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 6
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 6
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 6
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 6
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 6
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 6
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 6
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 6
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 6
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 6
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 6
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 6
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 6
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 6
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 6
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 6
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 6
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 6
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 6
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 description 6
- AVQOSMRPITVTRB-CIUDSAMLSA-N His-Asn-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AVQOSMRPITVTRB-CIUDSAMLSA-N 0.000 description 6
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 6
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 6
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 6
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 6
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 6
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 6
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 6
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 6
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 6
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 6
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 6
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 6
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 6
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 6
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 6
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 6
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 6
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 6
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 6
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 6
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 6
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 6
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 6
- NHDMNXBBSGVYGP-PYJNHQTQSA-N Met-His-Ile Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)CC1=CN=CN1 NHDMNXBBSGVYGP-PYJNHQTQSA-N 0.000 description 6
- IRVONVRHHJXWTK-RWMBFGLXSA-N Met-Lys-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N IRVONVRHHJXWTK-RWMBFGLXSA-N 0.000 description 6
- FDGAMQVRGORBDV-GUBZILKMSA-N Met-Ser-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCSC FDGAMQVRGORBDV-GUBZILKMSA-N 0.000 description 6
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 6
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 6
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 6
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 6
- OXUMFAOVGFODPN-KKUMJFAQSA-N Phe-Asn-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OXUMFAOVGFODPN-KKUMJFAQSA-N 0.000 description 6
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 6
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 6
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 6
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 6
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 6
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 6
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 6
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 6
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 6
- WLJYLAQSUSIQNH-GUBZILKMSA-N Pro-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@@H]1CCCN1 WLJYLAQSUSIQNH-GUBZILKMSA-N 0.000 description 6
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 6
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 6
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 6
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 6
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 6
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 6
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 6
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 6
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 6
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 6
- OHDXOXIZXSFCDN-RCWTZXSCSA-N Thr-Met-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OHDXOXIZXSFCDN-RCWTZXSCSA-N 0.000 description 6
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 6
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 6
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 6
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 description 6
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 6
- LMKKMCGTDANZTR-BZSNNMDCSA-N Tyr-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LMKKMCGTDANZTR-BZSNNMDCSA-N 0.000 description 6
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 6
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 6
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 6
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 6
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 6
- SDSCOOZQQGUQFC-GVXVVHGQSA-N Val-His-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SDSCOOZQQGUQFC-GVXVVHGQSA-N 0.000 description 6
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 6
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 6
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- 108010070944 alanylhistidine Proteins 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 6
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 6
- 239000000969 carrier Substances 0.000 description 6
- 238000010276 construction Methods 0.000 description 6
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 6
- 230000013595 glycosylation Effects 0.000 description 6
- 238000006206 glycosylation reaction Methods 0.000 description 6
- 108010084389 glycyltryptophan Proteins 0.000 description 6
- 108010028295 histidylhistidine Proteins 0.000 description 6
- 230000000977 initiatory effect Effects 0.000 description 6
- 238000002347 injection Methods 0.000 description 6
- 239000007924 injection Substances 0.000 description 6
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 6
- 210000004962 mammalian cell Anatomy 0.000 description 6
- 108010022588 methionyl-lysyl-proline Proteins 0.000 description 6
- 108010034507 methionyltryptophan Proteins 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 102000039446 nucleic acids Human genes 0.000 description 6
- 108020004707 nucleic acids Proteins 0.000 description 6
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 6
- 239000013600 plasmid vector Substances 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 108010001055 thymocartin Proteins 0.000 description 6
- 238000001890 transfection Methods 0.000 description 6
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 6
- 108010003137 tyrosyltyrosine Proteins 0.000 description 6
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 6
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 5
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 5
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 5
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 5
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 5
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 5
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 5
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 5
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 5
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 5
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 5
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 5
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 5
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 5
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 5
- YLVGUOGAFAJMKP-JYJNAYRXSA-N Arg-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YLVGUOGAFAJMKP-JYJNAYRXSA-N 0.000 description 5
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 5
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 5
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 5
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 5
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 5
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 5
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 5
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 5
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 5
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 5
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 5
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 5
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 5
- VMVUDJUXJKDGNR-FXQIFTODSA-N Asp-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N VMVUDJUXJKDGNR-FXQIFTODSA-N 0.000 description 5
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 5
- LGGHQRZIJSYRHA-GUBZILKMSA-N Asp-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N LGGHQRZIJSYRHA-GUBZILKMSA-N 0.000 description 5
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 5
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 5
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 5
- CXEFNHOVIIDHFU-IHPCNDPISA-N Asp-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N CXEFNHOVIIDHFU-IHPCNDPISA-N 0.000 description 5
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 5
- 238000011725 BALB/c mouse Methods 0.000 description 5
- 229940046168 CpG oligodeoxynucleotide Drugs 0.000 description 5
- BOMGEMDZTNZESV-QWRGUYRKSA-N Cys-Tyr-Gly Chemical compound SC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 BOMGEMDZTNZESV-QWRGUYRKSA-N 0.000 description 5
- 101150066038 E4 gene Proteins 0.000 description 5
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 5
- OIIIRRTWYLCQNW-ACZMJKKPSA-N Gln-Cys-Asn Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O OIIIRRTWYLCQNW-ACZMJKKPSA-N 0.000 description 5
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 5
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 5
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 5
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 5
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 5
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 5
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 5
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 5
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 5
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 5
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 5
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 5
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 5
- RZMXBFUSQNLEQF-QEJZJMRPSA-N Glu-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RZMXBFUSQNLEQF-QEJZJMRPSA-N 0.000 description 5
- STDOKNKEXOLSII-SZMVWBNQSA-N Glu-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CCC(=O)O)N STDOKNKEXOLSII-SZMVWBNQSA-N 0.000 description 5
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 5
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 5
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 5
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 5
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 5
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 5
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 5
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 5
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 5
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 5
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 5
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 5
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 5
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 5
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 5
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 5
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 5
- CCHSQWLCOOZREA-GMOBBJLQSA-N Ile-Asp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N CCHSQWLCOOZREA-GMOBBJLQSA-N 0.000 description 5
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 5
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 5
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 5
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 5
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 5
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 5
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 5
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 5
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 5
- 102100037850 Interferon gamma Human genes 0.000 description 5
- 108010074328 Interferon-gamma Proteins 0.000 description 5
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 5
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 5
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 5
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 5
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 5
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 5
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 5
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 5
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 5
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 5
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 5
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 5
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 5
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 5
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 5
- OXIWIYOJVNOKOV-SRVKXCTJSA-N Met-Met-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCNC(N)=N OXIWIYOJVNOKOV-SRVKXCTJSA-N 0.000 description 5
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 5
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 5
- UZBQXELAFPCGRV-SZMVWBNQSA-N Met-Trp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZBQXELAFPCGRV-SZMVWBNQSA-N 0.000 description 5
- 108010079364 N-glycylalanine Proteins 0.000 description 5
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 5
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 5
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 5
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 5
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 5
- FKFCKDROTNIVSO-JYJNAYRXSA-N Phe-Pro-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O FKFCKDROTNIVSO-JYJNAYRXSA-N 0.000 description 5
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 5
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 5
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 5
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 5
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 5
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 5
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 5
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 5
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 5
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 5
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 5
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 5
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 5
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 5
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 5
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 5
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 5
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 5
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 5
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 5
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 5
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 5
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 5
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 5
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 5
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 5
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 5
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 5
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 5
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 5
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 5
- NIWAGRRZHCMPOY-GMVOTWDCSA-N Trp-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NIWAGRRZHCMPOY-GMVOTWDCSA-N 0.000 description 5
- GBEAUNVBIMLWIB-IHPCNDPISA-N Trp-Ser-Phe Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 GBEAUNVBIMLWIB-IHPCNDPISA-N 0.000 description 5
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 5
- CKHQKYHIZCRTAP-SOUVJXGZSA-N Tyr-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CKHQKYHIZCRTAP-SOUVJXGZSA-N 0.000 description 5
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 5
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 5
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 5
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 5
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 5
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 5
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 5
- GAKBTSMAPGLQFA-JNPHEJMOSA-N Tyr-Thr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 GAKBTSMAPGLQFA-JNPHEJMOSA-N 0.000 description 5
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 5
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 5
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 5
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 5
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 5
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 5
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 5
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 5
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 5
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 5
- 108010041407 alanylaspartic acid Proteins 0.000 description 5
- 108010087924 alanylproline Proteins 0.000 description 5
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 5
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 5
- 108010016616 cysteinylglycine Proteins 0.000 description 5
- 108010033011 des-Arg-enterostatin Proteins 0.000 description 5
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 5
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 5
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 5
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 5
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 5
- 230000002458 infectious effect Effects 0.000 description 5
- 108010078274 isoleucylvaline Proteins 0.000 description 5
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 5
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 108010005942 methionylglycine Proteins 0.000 description 5
- 238000006386 neutralization reaction Methods 0.000 description 5
- 230000001717 pathogenic effect Effects 0.000 description 5
- 108010084572 phenylalanyl-valine Proteins 0.000 description 5
- 230000008488 polyadenylation Effects 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 108010077112 prolyl-proline Proteins 0.000 description 5
- 108010015796 prolylisoleucine Proteins 0.000 description 5
- 230000008521 reorganization Effects 0.000 description 5
- 239000013605 shuttle vector Substances 0.000 description 5
- 108010084932 tryptophyl-proline Proteins 0.000 description 5
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 5
- 108010009962 valyltyrosine Proteins 0.000 description 5
- 210000002845 virion Anatomy 0.000 description 5
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 4
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 4
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 4
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 4
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 4
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 4
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 4
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 4
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 4
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 4
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 4
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 4
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 4
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 4
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 4
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 4
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 description 4
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 4
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 4
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 4
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 4
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 4
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 4
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 4
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 4
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 4
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 4
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 4
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 4
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 4
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 4
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 4
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 4
- FBVHRDXSCYELMI-PBCZWWQYSA-N His-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O FBVHRDXSCYELMI-PBCZWWQYSA-N 0.000 description 4
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 4
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 4
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 4
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 4
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 4
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 4
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 4
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 4
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 4
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 4
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 4
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 4
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 4
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 4
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 4
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 description 4
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 4
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 4
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 4
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 4
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 4
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 4
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 4
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 4
- 241000699670 Mus sp. Species 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 108700026244 Open Reading Frames Proteins 0.000 description 4
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 4
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 4
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 4
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 4
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 4
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 4
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 4
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 4
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 4
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 4
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 4
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 4
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 4
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 4
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 4
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 4
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 4
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 4
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 4
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 description 4
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 4
- YDTKYBHPRULROG-LTHWPDAASA-N Trp-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YDTKYBHPRULROG-LTHWPDAASA-N 0.000 description 4
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 4
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 4
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 4
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 4
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 4
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 4
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 4
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 4
- LMVWCLDJNSBOEA-FKBYEOEOSA-N Val-Tyr-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N LMVWCLDJNSBOEA-FKBYEOEOSA-N 0.000 description 4
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 230000000968 intestinal effect Effects 0.000 description 4
- 108010012058 leucyltyrosine Proteins 0.000 description 4
- 239000002502 liposome Substances 0.000 description 4
- 238000012856 packing Methods 0.000 description 4
- 102000005962 receptors Human genes 0.000 description 4
- 108020003175 receptors Proteins 0.000 description 4
- 230000002787 reinforcement Effects 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 4
- 238000010361 transduction Methods 0.000 description 4
- 230000026683 transduction Effects 0.000 description 4
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 4
- 230000029812 viral genome replication Effects 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 108010027345 wheylin-1 peptide Proteins 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- 241000701242 Adenoviridae Species 0.000 description 3
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 3
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 3
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 3
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 3
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 3
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 3
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 3
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 3
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 3
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 3
- TWVTVZUGEDBAJF-ACZMJKKPSA-N Asn-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N TWVTVZUGEDBAJF-ACZMJKKPSA-N 0.000 description 3
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 3
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 3
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 3
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 3
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 3
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 3
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 3
- WDMNFNXKGSLIOB-GUBZILKMSA-N Asp-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N WDMNFNXKGSLIOB-GUBZILKMSA-N 0.000 description 3
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 3
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 3
- 241000282693 Cercopithecidae Species 0.000 description 3
- 206010011831 Cytomegalovirus infection Diseases 0.000 description 3
- 241000702421 Dependoparvovirus Species 0.000 description 3
- 206010059866 Drug resistance Diseases 0.000 description 3
- 101150005585 E3 gene Proteins 0.000 description 3
- 241000700662 Fowlpox virus Species 0.000 description 3
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 3
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 3
- YKLNMGJYMNPBCP-ACZMJKKPSA-N Glu-Asn-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YKLNMGJYMNPBCP-ACZMJKKPSA-N 0.000 description 3
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 3
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 3
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 3
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 3
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 3
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 3
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 3
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 3
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 3
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 3
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 3
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 3
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 3
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 3
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 3
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 3
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 3
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 3
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 3
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 3
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 3
- 241000282553 Macaca Species 0.000 description 3
- 241000282567 Macaca fascicularis Species 0.000 description 3
- 241000701244 Mastadenovirus Species 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 3
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 3
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 3
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 3
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 3
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 3
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 3
- 108700008625 Reporter Genes Proteins 0.000 description 3
- 241000283984 Rodentia Species 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 3
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 3
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 3
- STIAINRLUUKYKM-WFBYXXMGSA-N Ser-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 STIAINRLUUKYKM-WFBYXXMGSA-N 0.000 description 3
- RTXKJFWHEBTABY-IHPCNDPISA-N Ser-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CO)N RTXKJFWHEBTABY-IHPCNDPISA-N 0.000 description 3
- 101710172711 Structural protein Proteins 0.000 description 3
- 230000005867 T cell response Effects 0.000 description 3
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 3
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 3
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 3
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 3
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 3
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 3
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 3
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 3
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 3
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 3
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 3
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 3
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 3
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 3
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 3
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 3
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 3
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 3
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 3
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 3
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 3
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 3
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 3
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 3
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 3
- 230000002378 acidificating effect Effects 0.000 description 3
- 108700010877 adenoviridae proteins Proteins 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 3
- 238000010171 animal model Methods 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 239000002775 capsule Substances 0.000 description 3
- 239000001768 carboxy methyl cellulose Substances 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 238000013016 damping Methods 0.000 description 3
- 230000007812 deficiency Effects 0.000 description 3
- 238000000151 deposition Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 239000006185 dispersion Substances 0.000 description 3
- 239000012530 fluid Substances 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 239000000568 immunological adjuvant Substances 0.000 description 3
- 238000001802 infusion Methods 0.000 description 3
- 201000007270 liver cancer Diseases 0.000 description 3
- 208000014018 liver neoplasm Diseases 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 239000003921 oil Substances 0.000 description 3
- 235000019198 oils Nutrition 0.000 description 3
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 3
- 101150088856 pix gene Proteins 0.000 description 3
- 108010053725 prolylvaline Proteins 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 210000004988 splenocyte Anatomy 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 230000009261 transgenic effect Effects 0.000 description 3
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 2
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 2
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 2
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 2
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 2
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 2
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 2
- DDPKBJZLAXLQGZ-KBIXCLLPSA-N Ala-Val-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DDPKBJZLAXLQGZ-KBIXCLLPSA-N 0.000 description 2
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 2
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 2
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 2
- ORXCYAFUCSTQGY-FXQIFTODSA-N Asn-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N ORXCYAFUCSTQGY-FXQIFTODSA-N 0.000 description 2
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 2
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 2
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 2
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 2
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 2
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 2
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 2
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 2
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 2
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 2
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 2
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 2
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 2
- RYEWQKQXRJCHIO-SRVKXCTJSA-N Asp-Asn-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RYEWQKQXRJCHIO-SRVKXCTJSA-N 0.000 description 2
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 2
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 2
- 241000271566 Aves Species 0.000 description 2
- 229920002134 Carboxymethyl cellulose Polymers 0.000 description 2
- 108010078791 Carrier Proteins Proteins 0.000 description 2
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- 206010011732 Cyst Diseases 0.000 description 2
- 230000004544 DNA amplification Effects 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 2
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 2
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 2
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 2
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 2
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 2
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 2
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 2
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 2
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 2
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 2
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 2
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 2
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 2
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 description 2
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 2
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 2
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 2
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 2
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 2
- 241000598171 Human adenovirus sp. Species 0.000 description 2
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 2
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 2
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 2
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 2
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 2
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 2
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 2
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 2
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 2
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 2
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 2
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 2
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 2
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 2
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 2
- USPJSTBDIGJPFK-PMVMPFDFSA-N Lys-Tyr-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O USPJSTBDIGJPFK-PMVMPFDFSA-N 0.000 description 2
- 206010027336 Menstruation delayed Diseases 0.000 description 2
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 2
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 2
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 2
- WXUUEPIDLLQBLJ-DCAQKATOSA-N Met-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N WXUUEPIDLLQBLJ-DCAQKATOSA-N 0.000 description 2
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 2
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 2
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 2
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 2
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 2
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 2
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 2
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 2
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 2
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 2
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 2
- 241000710961 Semliki Forest virus Species 0.000 description 2
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 2
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 2
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 2
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 2
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 2
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 2
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 2
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 2
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 2
- 241000710960 Sindbis virus Species 0.000 description 2
- 240000006474 Theobroma bicolor Species 0.000 description 2
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 2
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 2
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 2
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 2
- NOWXWJLVGTVJKM-PBCZWWQYSA-N Thr-Asp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O NOWXWJLVGTVJKM-PBCZWWQYSA-N 0.000 description 2
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 2
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 2
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 2
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 2
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 2
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 2
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 2
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 2
- 108700019146 Transgenes Proteins 0.000 description 2
- KZTLJLFVOIMRAQ-IHPCNDPISA-N Trp-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZTLJLFVOIMRAQ-IHPCNDPISA-N 0.000 description 2
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 2
- 102100040247 Tumor necrosis factor Human genes 0.000 description 2
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 2
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 2
- KYPMKDGKAYQCHO-RYUDHWBXSA-N Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KYPMKDGKAYQCHO-RYUDHWBXSA-N 0.000 description 2
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 2
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 2
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 2
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 2
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 2
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 2
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 2
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 2
- 241000700618 Vaccinia virus Species 0.000 description 2
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 2
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 2
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- 108020005202 Viral DNA Proteins 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 125000003277 amino group Chemical group 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 239000011230 binding agent Substances 0.000 description 2
- 229920001400 block copolymer Polymers 0.000 description 2
- 230000023555 blood coagulation Effects 0.000 description 2
- 229960005084 calcitriol Drugs 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 235000010948 carboxy methyl cellulose Nutrition 0.000 description 2
- 239000008112 carboxymethyl-cellulose Substances 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 238000005336 cracking Methods 0.000 description 2
- 230000037029 cross reaction Effects 0.000 description 2
- 208000031513 cyst Diseases 0.000 description 2
- 230000003111 delayed effect Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 239000002158 endotoxin Substances 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 238000003114 enzyme-linked immunosorbent spot assay Methods 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 230000002349 favourable effect Effects 0.000 description 2
- MURGITYSBWUQTI-UHFFFAOYSA-N fluorescin Chemical compound OC(=O)C1=CC=CC=C1C1C2=CC=C(O)C=C2OC2=CC(O)=CC=C21 MURGITYSBWUQTI-UHFFFAOYSA-N 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 230000008348 humoral response Effects 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 229960003130 interferon gamma Drugs 0.000 description 2
- 238000007918 intramuscular administration Methods 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 230000021633 leukocyte mediated immunity Effects 0.000 description 2
- 210000000088 lip Anatomy 0.000 description 2
- 229920006008 lipopolysaccharide Polymers 0.000 description 2
- 239000000314 lubricant Substances 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 229940035032 monophosphoryl lipid a Drugs 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 150000004713 phosphodiesters Chemical class 0.000 description 2
- 231100000614 poison Toxicity 0.000 description 2
- 230000007096 poisonous effect Effects 0.000 description 2
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 2
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 2
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000012882 sequential analysis Methods 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 239000012096 transfection reagent Substances 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 235000013311 vegetables Nutrition 0.000 description 2
- 230000009385 viral infection Effects 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-amino-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- MZOFCQQQCNRIBI-VMXHOPILSA-N (3s)-4-[[(2s)-1-[[(2s)-1-[[(1s)-1-carboxy-2-hydroxyethyl]amino]-4-methyl-1-oxopentan-2-yl]amino]-5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-3-[[2-[[(2s)-2,6-diaminohexanoyl]amino]acetyl]amino]-4-oxobutanoic acid Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN MZOFCQQQCNRIBI-VMXHOPILSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- 101150084750 1 gene Proteins 0.000 description 1
- PIDRBUDUWHBYSR-UHFFFAOYSA-N 1-[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O PIDRBUDUWHBYSR-UHFFFAOYSA-N 0.000 description 1
- VNBFUGOVQMFIRN-UHFFFAOYSA-N 1-chlorobutan-2-ol Chemical compound CCC(O)CCl VNBFUGOVQMFIRN-UHFFFAOYSA-N 0.000 description 1
- 108010042708 Acetylmuramyl-Alanyl-Isoglutamine Proteins 0.000 description 1
- 208000010370 Adenoviridae Infections Diseases 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 229910017119 AlPO Inorganic materials 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- CUOMGDPDITUMIJ-HZZBMVKVSA-N Ala-Phe-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 CUOMGDPDITUMIJ-HZZBMVKVSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 1
- 241000710929 Alphavirus Species 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- LLZXKVAAEWBUPB-KKUMJFAQSA-N Arg-Gln-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLZXKVAAEWBUPB-KKUMJFAQSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 1
- HRCIIMCTUIAKQB-XGEHTFHBSA-N Arg-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O HRCIIMCTUIAKQB-XGEHTFHBSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 1
- HUAOKVVEVHACHR-CIUDSAMLSA-N Asn-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N HUAOKVVEVHACHR-CIUDSAMLSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- QDXQWFBLUVTOFL-FXQIFTODSA-N Asn-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)N)N QDXQWFBLUVTOFL-FXQIFTODSA-N 0.000 description 1
- WCRQQIPFSXFIRN-LPEHRKFASA-N Asn-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N WCRQQIPFSXFIRN-LPEHRKFASA-N 0.000 description 1
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- BLQBMRNMBAYREH-UWJYBYFXSA-N Asp-Ala-Tyr Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O BLQBMRNMBAYREH-UWJYBYFXSA-N 0.000 description 1
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- UXIPUCUHQBIQOS-SRVKXCTJSA-N Asp-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UXIPUCUHQBIQOS-SRVKXCTJSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- HTSSXFASOUSJQG-IHPCNDPISA-N Asp-Tyr-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HTSSXFASOUSJQG-IHPCNDPISA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 101710163595 Chaperone protein DnaK Proteins 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 241000557626 Corvus corax Species 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- LHLSSZYQFUNWRZ-NAKRPEOUSA-N Cys-Arg-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LHLSSZYQFUNWRZ-NAKRPEOUSA-N 0.000 description 1
- SQJSYLDKQBZQTG-FXQIFTODSA-N Cys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N SQJSYLDKQBZQTG-FXQIFTODSA-N 0.000 description 1
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 1
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 1
- WZZGXXNRSZIQFC-VGDYDELISA-N Cys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N WZZGXXNRSZIQFC-VGDYDELISA-N 0.000 description 1
- LKUCSUGWHYVYLP-GHCJXIJMSA-N Cys-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N LKUCSUGWHYVYLP-GHCJXIJMSA-N 0.000 description 1
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 1
- MKMKILWCRQLDFJ-DCAQKATOSA-N Cys-Lys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MKMKILWCRQLDFJ-DCAQKATOSA-N 0.000 description 1
- YXPNKXFOBHRUBL-BJDJZHNGSA-N Cys-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N YXPNKXFOBHRUBL-BJDJZHNGSA-N 0.000 description 1
- JUNZLDGUJZIUCO-IHRRRGAJSA-N Cys-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O JUNZLDGUJZIUCO-IHRRRGAJSA-N 0.000 description 1
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 1
- DRXOWZZHCSBUOI-YJRXYDGGSA-N Cys-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N)O DRXOWZZHCSBUOI-YJRXYDGGSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 101150029662 E1 gene Proteins 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 101001091269 Escherichia coli Hygromycin-B 4-O-kinase Proteins 0.000 description 1
- 102000016359 Fibronectins Human genes 0.000 description 1
- 108010067306 Fibronectins Proteins 0.000 description 1
- 108010040721 Flagellin Proteins 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- RRYLMJWPWBJFPZ-ACZMJKKPSA-N Gln-Asn-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RRYLMJWPWBJFPZ-ACZMJKKPSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 1
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- VCUNGPMMPNJSGS-JYJNAYRXSA-N Gln-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VCUNGPMMPNJSGS-JYJNAYRXSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- CHDWDBPJOZVZSE-KKUMJFAQSA-N Glu-Phe-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CHDWDBPJOZVZSE-KKUMJFAQSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- LWYUQLZOIORFFJ-XKBZYTNZSA-N Glu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O LWYUQLZOIORFFJ-XKBZYTNZSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- QEJKKJNDDDPSMU-KKUMJFAQSA-N Glu-Tyr-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O QEJKKJNDDDPSMU-KKUMJFAQSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- MXIULRKNFSCJHT-STQMWFEESA-N Gly-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 MXIULRKNFSCJHT-STQMWFEESA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Natural products NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 1
- 208000031886 HIV Infections Diseases 0.000 description 1
- 208000037357 HIV infectious disease Diseases 0.000 description 1
- 229940033330 HIV vaccine Drugs 0.000 description 1
- 101710178376 Heat shock 70 kDa protein Proteins 0.000 description 1
- 101710152018 Heat shock cognate 70 kDa protein Proteins 0.000 description 1
- 102000002812 Heat-Shock Proteins Human genes 0.000 description 1
- 108010004889 Heat-Shock Proteins Proteins 0.000 description 1
- 101710155188 Hexon-interlacing protein Proteins 0.000 description 1
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 1
- SLFSYFJKSIVSON-SRVKXCTJSA-N His-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SLFSYFJKSIVSON-SRVKXCTJSA-N 0.000 description 1
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 1
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 1
- JUCZDDVZBMPKRT-IXOXFDKPSA-N His-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O JUCZDDVZBMPKRT-IXOXFDKPSA-N 0.000 description 1
- JVEKQAYXFGIISZ-HOCLYGCPSA-N His-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JVEKQAYXFGIISZ-HOCLYGCPSA-N 0.000 description 1
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 241000701149 Human adenovirus 1 Species 0.000 description 1
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- WNQKUUQIVDDAFA-ZPFDUUQYSA-N Ile-Gln-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N WNQKUUQIVDDAFA-ZPFDUUQYSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 1
- NNVXABCGXOLIEB-PYJNHQTQSA-N Ile-Met-His Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NNVXABCGXOLIEB-PYJNHQTQSA-N 0.000 description 1
- UOPBQSJRBONRON-STECZYCISA-N Ile-Met-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOPBQSJRBONRON-STECZYCISA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- 229940125581 ImmunityBio COVID-19 vaccine Drugs 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108090000171 Interleukin-18 Proteins 0.000 description 1
- 108090001005 Interleukin-6 Proteins 0.000 description 1
- 108010002586 Interleukin-7 Proteins 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- 101150027802 L2 gene Proteins 0.000 description 1
- 101150084684 L3 gene Proteins 0.000 description 1
- 101150007425 L4 gene Proteins 0.000 description 1
- 101150034230 LI gene Proteins 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- 108010028921 Lipopeptides Proteins 0.000 description 1
- 208000008771 Lymphadenopathy Diseases 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 1
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 241000712079 Measles morbillivirus Species 0.000 description 1
- FRWZTWWOORIIBA-FXQIFTODSA-N Met-Asn-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FRWZTWWOORIIBA-FXQIFTODSA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- HLYIDXAXQIJYIG-CIUDSAMLSA-N Met-Gln-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HLYIDXAXQIJYIG-CIUDSAMLSA-N 0.000 description 1
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 1
- QEDGNYFHLXXIDC-DCAQKATOSA-N Met-Pro-Gln Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O QEDGNYFHLXXIDC-DCAQKATOSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 1
- XTSBLBXAUIBMLW-KKUMJFAQSA-N Met-Tyr-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N XTSBLBXAUIBMLW-KKUMJFAQSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 229920000168 Microcrystalline cellulose Polymers 0.000 description 1
- 108020005196 Mitochondrial DNA Proteins 0.000 description 1
- MSFSPUZXLOGKHJ-UHFFFAOYSA-N Muraminsaeure Natural products OC(=O)C(C)OC1C(N)C(O)OC(CO)C1O MSFSPUZXLOGKHJ-UHFFFAOYSA-N 0.000 description 1
- XUYPXLNMDZIRQH-LURJTMIESA-N N-acetyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC(C)=O XUYPXLNMDZIRQH-LURJTMIESA-N 0.000 description 1
- 101150076514 NS gene Proteins 0.000 description 1
- 241000337007 Oceania Species 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 229930012538 Paclitaxel Natural products 0.000 description 1
- 241000282579 Pan Species 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108010013639 Peptidoglycan Proteins 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 1
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 1
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 1
- NKLDZIPTGKBDBB-HTUGSXCWSA-N Phe-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O NKLDZIPTGKBDBB-HTUGSXCWSA-N 0.000 description 1
- RLUMIJXNHJVUCO-JBACZVJFSA-N Phe-Gln-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 RLUMIJXNHJVUCO-JBACZVJFSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- OHIYMVFLQXTZAW-UFYCRDLUSA-N Phe-Met-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OHIYMVFLQXTZAW-UFYCRDLUSA-N 0.000 description 1
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 1
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 101710182846 Polyhedrin Proteins 0.000 description 1
- 101710101995 Pre-hexon-linking protein IIIa Proteins 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 1
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 1
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 1
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 1
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 101000584831 Pseudoalteromonas phage PM2 Protein P6 Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000714474 Rous sarcoma virus Species 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 1
- YXEYTHXDRDAIOJ-CWRNSKLLSA-N Ser-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N)C(=O)O YXEYTHXDRDAIOJ-CWRNSKLLSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000287219 Serinus canaria Species 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 101001091268 Streptomyces hygroscopicus Hygromycin-B 7''-O-kinase Proteins 0.000 description 1
- 108700007696 Tetrahydrofolate Dehydrogenase Proteins 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical compound OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 1
- PUEWAXRPXOEQOW-HJGDQZAQSA-N Thr-Met-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O PUEWAXRPXOEQOW-HJGDQZAQSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 1
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- SOUPNXUJAJENFU-SWRJLBSHSA-N Thr-Trp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O SOUPNXUJAJENFU-SWRJLBSHSA-N 0.000 description 1
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- HYVLNORXQGKONN-NUTKFTJISA-N Trp-Ala-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 HYVLNORXQGKONN-NUTKFTJISA-N 0.000 description 1
- NXAPHBHZCMQORW-FDARSICLSA-N Trp-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NXAPHBHZCMQORW-FDARSICLSA-N 0.000 description 1
- NAQBQJOGGYGCOT-QEJZJMRPSA-N Trp-Asn-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NAQBQJOGGYGCOT-QEJZJMRPSA-N 0.000 description 1
- IXEGQBJZDIRRIV-QEJZJMRPSA-N Trp-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IXEGQBJZDIRRIV-QEJZJMRPSA-N 0.000 description 1
- MHNHRNHJMXAVHZ-AAEUAGOBSA-N Trp-Asn-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N MHNHRNHJMXAVHZ-AAEUAGOBSA-N 0.000 description 1
- IUFQHOCOKQIOMC-XIRDDKMYSA-N Trp-Asn-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N IUFQHOCOKQIOMC-XIRDDKMYSA-N 0.000 description 1
- UTQBQJNSNXJNIH-IHPCNDPISA-N Trp-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N UTQBQJNSNXJNIH-IHPCNDPISA-N 0.000 description 1
- LJCLHMPCYYXVPR-VJBMBRPKSA-N Trp-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N LJCLHMPCYYXVPR-VJBMBRPKSA-N 0.000 description 1
- PHNBFZBKLWEBJN-BPUTZDHNSA-N Trp-Glu-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PHNBFZBKLWEBJN-BPUTZDHNSA-N 0.000 description 1
- OGXQLUCMJZSJPW-LYSGOOTNSA-N Trp-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O OGXQLUCMJZSJPW-LYSGOOTNSA-N 0.000 description 1
- LFGHEUIUSIRJAE-TUSQITKMSA-N Trp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N LFGHEUIUSIRJAE-TUSQITKMSA-N 0.000 description 1
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 1
- ABRICLFKFRFDKS-IHPCNDPISA-N Trp-Ser-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ABRICLFKFRFDKS-IHPCNDPISA-N 0.000 description 1
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 1
- RWTFCAMQLFNPTK-UMPQAUOISA-N Trp-Val-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 RWTFCAMQLFNPTK-UMPQAUOISA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 239000006035 Tryptophane Substances 0.000 description 1
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- GFJXBLSZOFWHAW-JYJNAYRXSA-N Tyr-His-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GFJXBLSZOFWHAW-JYJNAYRXSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 1
- SBLZVFCEOCWRLS-BPNCWPANSA-N Tyr-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SBLZVFCEOCWRLS-BPNCWPANSA-N 0.000 description 1
- YSGAPESOXHFTQY-IHRRRGAJSA-N Tyr-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N YSGAPESOXHFTQY-IHRRRGAJSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 1
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 1
- VSYROIRKNBCULO-BWAGICSOSA-N Tyr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O VSYROIRKNBCULO-BWAGICSOSA-N 0.000 description 1
- YMZYSCDRTXEOKD-IHPCNDPISA-N Tyr-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N YMZYSCDRTXEOKD-IHPCNDPISA-N 0.000 description 1
- KSGKJSFPWSMJHK-JNPHEJMOSA-N Tyr-Tyr-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSGKJSFPWSMJHK-JNPHEJMOSA-N 0.000 description 1
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 1
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 1
- 241000587120 Vaccinia virus Ankara Species 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- MHAHQDBEIDPFQS-NHCYSSNCSA-N Val-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C MHAHQDBEIDPFQS-NHCYSSNCSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- 241000711975 Vesicular stomatitis virus Species 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 229920000392 Zymosan Polymers 0.000 description 1
- DPDMMXDBJGCCQC-UHFFFAOYSA-N [Na].[Cl] Chemical compound [Na].[Cl] DPDMMXDBJGCCQC-UHFFFAOYSA-N 0.000 description 1
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 208000013228 adenopathy Diseases 0.000 description 1
- 239000000853 adhesive Substances 0.000 description 1
- 230000001070 adhesive effect Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 108010047506 alanyl-glutaminyl-glycyl-valine Proteins 0.000 description 1
- 239000000783 alginic acid Substances 0.000 description 1
- 235000010443 alginic acid Nutrition 0.000 description 1
- 229920000615 alginic acid Polymers 0.000 description 1
- 229960001126 alginic acid Drugs 0.000 description 1
- 150000004781 alginic acids Chemical class 0.000 description 1
- 229940037003 alum Drugs 0.000 description 1
- AZDRQVAHHNSJOQ-UHFFFAOYSA-N alumane Chemical compound [AlH3] AZDRQVAHHNSJOQ-UHFFFAOYSA-N 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- 239000004411 aluminium Substances 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 229910021502 aluminium hydroxide Inorganic materials 0.000 description 1
- 238000005267 amalgamation Methods 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 230000000259 anti-tumor effect Effects 0.000 description 1
- 230000000840 anti-viral effect Effects 0.000 description 1
- 230000005875 antibody response Effects 0.000 description 1
- 239000003429 antifungal agent Substances 0.000 description 1
- 230000030741 antigen processing and presentation Effects 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010031045 aspartyl-glycyl-aspartyl-alanine Proteins 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- SVPXDRXYRYOSEX-UHFFFAOYSA-N bentoquatam Chemical compound O.O=[Si]=O.O=[Al]O[Al]=O SVPXDRXYRYOSEX-UHFFFAOYSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000000975 bioactive effect Effects 0.000 description 1
- 238000011325 biochemical measurement Methods 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 239000000337 buffer salt Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- GMRQFYUYWCNGIN-NKMMMXOESA-N calcitriol Chemical compound C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@@H](CCCC(C)(C)O)C)=C\C=C1\C[C@@H](O)C[C@H](O)C1=C GMRQFYUYWCNGIN-NKMMMXOESA-N 0.000 description 1
- 235000020964 calcitriol Nutrition 0.000 description 1
- 239000011612 calcitriol Substances 0.000 description 1
- CJZGTCYPCWQAJB-UHFFFAOYSA-L calcium stearate Chemical compound [Ca+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O CJZGTCYPCWQAJB-UHFFFAOYSA-L 0.000 description 1
- 235000013539 calcium stearate Nutrition 0.000 description 1
- 239000008116 calcium stearate Substances 0.000 description 1
- 229940023860 canarypox virus HIV vaccine Drugs 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- SQQXRXKYTKFFSM-UHFFFAOYSA-N chembl1992147 Chemical compound OC1=C(OC)C(OC)=CC=C1C1=C(C)C(C(O)=O)=NC(C=2N=C3C4=NC(C)(C)N=C4C(OC)=C(O)C3=CC=2)=C1N SQQXRXKYTKFFSM-UHFFFAOYSA-N 0.000 description 1
- YTRQFSDWAXHJCC-UHFFFAOYSA-N chloroform;phenol Chemical compound ClC(Cl)Cl.OC1=CC=CC=C1 YTRQFSDWAXHJCC-UHFFFAOYSA-N 0.000 description 1
- 238000011210 chromatographic step Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 210000002808 connective tissue Anatomy 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 239000012050 conventional carrier Substances 0.000 description 1
- 229920001577 copolymer Polymers 0.000 description 1
- 239000013601 cosmid vector Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 102000003675 cytokine receptors Human genes 0.000 description 1
- 108010057085 cytokine receptors Proteins 0.000 description 1
- 239000003405 delayed action preparation Substances 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 210000000852 deltoid muscle Anatomy 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- KAKKHKRHCKCAGH-UHFFFAOYSA-L disodium;(4-nitrophenyl) phosphate;hexahydrate Chemical compound O.O.O.O.O.O.[Na+].[Na+].[O-][N+](=O)C1=CC=C(OP([O-])([O-])=O)C=C1 KAKKHKRHCKCAGH-UHFFFAOYSA-L 0.000 description 1
- 239000002612 dispersion medium Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 241001493065 dsRNA viruses Species 0.000 description 1
- BEFDCLMNVWHSGT-UHFFFAOYSA-N ethenylcyclopentane Chemical compound C=CC1CCCC1 BEFDCLMNVWHSGT-UHFFFAOYSA-N 0.000 description 1
- MVPICKVDHDWCJQ-UHFFFAOYSA-N ethyl 3-pyrrolidin-1-ylpropanoate Chemical compound CCOC(=O)CCN1CCCC1 MVPICKVDHDWCJQ-UHFFFAOYSA-N 0.000 description 1
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 239000003292 glue Substances 0.000 description 1
- 235000011187 glycerol Nutrition 0.000 description 1
- 125000005908 glyceryl ester group Chemical group 0.000 description 1
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 230000012447 hatching Effects 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 208000033519 human immunodeficiency virus infectious disease Diseases 0.000 description 1
- 239000008172 hydrogenated vegetable oil Substances 0.000 description 1
- 229940124669 imidazoquinoline Drugs 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000003308 immunostimulating effect Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 229940102223 injectable solution Drugs 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 229940079322 interferon Drugs 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 238000010255 intramuscular injection Methods 0.000 description 1
- 239000007927 intramuscular injection Substances 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 239000007951 isotonicity adjuster Substances 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 101150063421 l5 gene Proteins 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 239000007937 lozenge Substances 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 230000002934 lysing effect Effects 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- LXCFILQKKLGQFO-UHFFFAOYSA-N methylparaben Chemical compound COC(=O)C1=CC=C(O)C=C1 LXCFILQKKLGQFO-UHFFFAOYSA-N 0.000 description 1
- 239000008108 microcrystalline cellulose Substances 0.000 description 1
- 235000019813 microcrystalline cellulose Nutrition 0.000 description 1
- 229940016286 microcrystalline cellulose Drugs 0.000 description 1
- 108091005573 modified proteins Proteins 0.000 description 1
- 102000035118 modified proteins Human genes 0.000 description 1
- BSOQXXWZTUDTEL-ZUYCGGNHSA-N muramyl dipeptide Chemical compound OC(=O)CC[C@H](C(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](C)O[C@H]1[C@H](O)[C@@H](CO)O[C@@H](O)[C@@H]1NC(C)=O BSOQXXWZTUDTEL-ZUYCGGNHSA-N 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 238000011330 nucleic acid test Methods 0.000 description 1
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 1
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 229960001592 paclitaxel Drugs 0.000 description 1
- 208000003154 papilloma Diseases 0.000 description 1
- 208000029211 papillomatosis Diseases 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 229940124531 pharmaceutical excipient Drugs 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N phenylalanine group Chemical group N[C@@H](CC1=CC=CC=C1)C(=O)O COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 235000021317 phosphate Nutrition 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 239000011505 plaster Substances 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 229920000724 poly(L-arginine) polymer Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 238000012207 quantitative assay Methods 0.000 description 1
- 210000000664 rectum Anatomy 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000004062 sedimentation Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000013207 serial dilution Methods 0.000 description 1
- 230000000405 serological effect Effects 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 235000019812 sodium carboxymethyl cellulose Nutrition 0.000 description 1
- 229920001027 sodium carboxymethylcellulose Polymers 0.000 description 1
- 229940045902 sodium stearyl fumarate Drugs 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 239000002594 sorbent Substances 0.000 description 1
- 229940075582 sorbic acid Drugs 0.000 description 1
- 235000010199 sorbic acid Nutrition 0.000 description 1
- 239000004334 sorbic acid Substances 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 239000000021 stimulant Substances 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000000829 suppository Substances 0.000 description 1
- 239000006188 syrup Substances 0.000 description 1
- 235000020357 syrup Nutrition 0.000 description 1
- 239000003826 tablet Substances 0.000 description 1
- 239000000454 talc Substances 0.000 description 1
- 235000012222 talc Nutrition 0.000 description 1
- 229910052623 talc Inorganic materials 0.000 description 1
- 230000029305 taxis Effects 0.000 description 1
- RCINICONZNJXQF-MZXODVADSA-N taxol Chemical compound O([C@@H]1[C@@]2(C[C@@H](C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-MZXODVADSA-N 0.000 description 1
- RTKIYNMVFMVABJ-UHFFFAOYSA-L thimerosal Chemical compound [Na+].CC[Hg]SC1=CC=CC=C1C([O-])=O RTKIYNMVFMVABJ-UHFFFAOYSA-L 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 229940044655 toll-like receptor 9 agonist Drugs 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 229960004799 tryptophan Drugs 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 244000052613 viral pathogen Species 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 238000001086 yeast two-hybrid system Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
- C07K14/01—DNA viruses
- C07K14/075—Adenoviridae
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/0005—Vertebrate antigens
- A61K39/0011—Cancer antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
- A61K39/235—Adenoviridae
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/20—Antivirals for DNA viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
- A61P37/02—Immunomodulators
- A61P37/04—Immunostimulants
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
- C12N15/861—Adenoviral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
- C12N15/864—Parvoviral vectors, e.g. parvovirus, densovirus
- C12N15/8645—Adeno-associated virus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/525—Virus
- A61K2039/5256—Virus expressing foreign proteins
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/545—Medicinal preparations containing antigens or antibodies characterised by the dose, timing or administration schedule
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10111—Atadenovirus, e.g. ovine adenovirus D
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10321—Viruses as such, e.g. new isolates, mutants or their genomic sequences
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10322—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10334—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10341—Use of virus, viral particle or viral elements as a vector
- C12N2710/10343—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/16011—Herpesviridae
- C12N2710/16611—Simplexvirus, e.g. human herpesvirus 1, 2
- C12N2710/16634—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16111—Human Immunodeficiency Virus, HIV concerning HIV env
- C12N2740/16134—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/24011—Flaviviridae
- C12N2770/24211—Hepacivirus, e.g. hepatitis C virus, hepatitis G virus
- C12N2770/24234—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2799/00—Uses of viruses
- C12N2799/02—Uses of viruses as vector
- C12N2799/021—Uses of viruses as vector for the expression of a heterologous nucleic acid
- C12N2799/022—Uses of viruses as vector for the expression of a heterologous nucleic acid where the vector is derived from an adenovirus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2810/00—Vectors comprising a targeting moiety
- C12N2810/50—Vectors comprising as targeting moiety peptide derived from defined protein
- C12N2810/60—Vectors comprising as targeting moiety peptide derived from defined protein from viruses
- C12N2810/6009—Vectors comprising as targeting moiety peptide derived from defined protein from viruses dsDNA viruses
- C12N2810/6018—Adenoviridae
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Virology (AREA)
- Medicinal Chemistry (AREA)
- Microbiology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Immunology (AREA)
- Biotechnology (AREA)
- Animal Behavior & Ethology (AREA)
- Veterinary Medicine (AREA)
- Pharmacology & Pharmacy (AREA)
- Public Health (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Mycology (AREA)
- Epidemiology (AREA)
- Biophysics (AREA)
- Oncology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Communicable Diseases (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
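The present invention relates to novel adenovirus strains with an improved seroprevalence. In one aspect, the invention relates to isolated polypeptides of adenoviral capsid proteins such as hexon, penton and fiber protein, to fragments thereof, and to polynucleotides encoding them. Also provided are a vector comprising the isolated polynucleotide according to the invention, adenoviruses comprising the isolated polynucleotides or polypeptides according to the invention, and a pharmaceutical composition comprising said vector, adenovirus, polypeptide and/or polynucleotide. The invention further relates to the use of the isolated polynucleotides, the isolated polypeptides, the vector, the adenoviruses and/or the pharmaceutical composition for the therapy or prophylaxis of a disease.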
Description
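The present invention relates to novel adenovirus strains with an improved seroprevalence. In one aspect, the invention relates to isolated polypeptides of adenoviral capsid proteins such as hexon, penton and fiber protein, to fragments thereof, and to polynucleotides encoding them. Also provided are a vector comprising the isolated polynucleotide according to the invention, adenoviruses comprising the isolated polynucleotides or polypeptides according to the invention, and a pharmaceutical composition comprising said vector, adenovirus, polypeptide and/or polynucleotide. The invention further relates to the use of the isolated polynucleotides, the isolated polypeptides, the vector, the adenoviruses and/or the pharmaceutical composition for the therapy or prophylaxis of a disease.
Background of the Invention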
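Adenoviruses (Ads) comprise a large family of double-stranded DNA viruses with a non-enveloped icosahedral capsid, found in amphibians, birds and mammals (Straus, Adenovirus infections in humans; The Adenoviruses, 451-498, 1984; Hierholzer et al., J. Infect. Dis., 158: 804-813, 1988; Schnurr and Dondero, Intervirology, 36: 79-83, 1993; Jong et al., J. Clin. Microbiol., 37: 3940-3945, 1999). In contrast to retroviruses, adenoviruses can transduce numerous cell types of several mammalian species, including both dividing and non-dividing cells, and do not integrate into the genome of the host cell.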
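Generally speaking, adenoviral DNA is typically very stable and remains episomal (e.g. extrachromosomal) unless transformation or tumorigenesis has occurred. In addition, adenoviral vectors can be propagated to high yields in well-defined production systems that are readily amenable to pharmaceutical-scale production of clinical-grade compositions. These characteristics, together with their well-characterized molecular genetics, make recombinant adenoviral vectors good candidates for use as vaccine carriers. The production of recombinant adenoviral vectors may rely on a packaging cell line capable of complementing the functions of adenoviral gene products that have been deleted or engineered to be non-functional.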
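Presently, two well-characterized human subgroup C adenovirus serotypes (i.e. hAd2 and hAd5) are widely used as the source of the viral backbone for most adenoviral vectors used in gene therapy. Replication-defective human adenoviral vectors have also been tested as vaccine carriers for the delivery of a variety of immunogens derived from a variety of infectious agents. Studies in experimental animals (e.g. rodents, canines and non-human primates) indicate that recombinant replication-defective human adenoviral vectors carrying transgenes encoding immunogens and other antigens elicit both humoral and cell-mediated immune responses against the transgene product. Generally speaking, investigators have reported success with human adenoviral vectors as vaccine carriers in non-human experimental systems, either by using immunization protocols that rely on high doses of recombinant adenoviral vectors predicted to elicit immune responses, or by using protocols in which adenoviral vectors derived from different serotypes but carrying the same transgene product are administered sequentially as boosting immunizations (Mastrangeli et al., Human Gene Therapy, 7: 79-87 (1996)).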
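Viral vectors based on human adenovirus type 5 (Ad5) have been developed for different gene therapy and vaccine applications. Although Ad5-based vectors are extremely efficient in animal models, clinical trials have shown that the pre-existing immunity against Ad5 wild-type virus present in humans reduces the efficiency of gene transduction. In particular, a clear reduction of immunization efficiency was demonstrated in subjects with neutralizing antibody titers above 200 who were enrolled in vaccine clinical trials based on Ad5 vectors. The most extensive characterization of an Ad5-vectored vaccine was obtained in the HIV vaccine STEP trial conducted by Merck (Moore JP et al., Science. 2008 May 9; 320(5877): 753-5). The vaccine study was based on the co-injection of three Ad5 vectors expressing different HIV antigens, as a proof-of-concept study in subjects at high risk of HIV infection. Surprisingly, the data revealed an increased HIV infection rate, rather than a protective effect, in vaccinated subjects with pre-existing anti-Ad5 immunity. Although the mechanism behind this paradoxical observation is not yet clear, the results raise additional questions about the safety and efficiency of vaccines based on adenoviruses of human origin when used in healthy subjects. Taken together, the results obtained so far in different vaccine and gene therapy clinical trials, such as the trials with Ad5 vectors, increase the need for adenoviruses characterized by very low or absent pre-existing immunity in humans.
Summary of the Invention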
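In a first aspect, the present invention provides an isolated polynucleotide that encodes an adenoviral fiber protein or a functional derivative thereof and that is selected from:
(a) a polynucleotide encoding a polypeptide having the amino acid sequence according to any of SEQ ID NO: 14-19, 50 and 53,
(b) a polynucleotide encoding a functional derivative of a polypeptide according to any of SEQ ID NO: 14-19, 50 and 53, wherein said functional derivative comprises the deletion, insertion and/or substitution of one or more amino acid residues, and
(c) a polynucleotide encoding a functional derivative having an amino acid sequence that is at least 85% identical over its entire length to the amino acid sequence of any of SEQ ID NO: 14-19, 50 and 53.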
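In a further aspect, the present invention relates to an isolated polynucleotide that encodes an adenoviral hexon protein or a functional derivative thereof and that is selected from:
(a) a polynucleotide encoding a polypeptide having the amino acid sequence according to any of SEQ ID NO: 20-25, 51 and 54,
(b) a polynucleotide encoding a functional derivative of a polypeptide according to any of SEQ ID NO: 20-25, 51 and 54, wherein said functional derivative comprises the deletion, insertion and/or substitution of one or more amino acid residues, and
(c) a polynucleotide encoding a functional derivative having an amino acid sequence that is at least 95% identical over its entire length to the amino acid sequence of any of SEQ ID NO: 20-25, 51 and 54.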
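Also provided is an isolated polynucleotide that encodes an adenoviral penton protein or a functional derivative thereof and that is selected from:
(a) a polynucleotide encoding a polypeptide having the amino acid sequence according to any of SEQ ID NO: 26-31, 52 and 55,
(b) a polynucleotide encoding a functional derivative of a polypeptide according to any of SEQ ID NO: 26-31, 52 and 55, wherein said functional derivative comprises the deletion, insertion and/or substitution of one or more amino acid residues, and
(c) a polynucleotide encoding a functional derivative having an amino acid sequence that is at least 85% identical over its entire length to the amino acid sequence of any of SEQ ID NO: 26-31, 52 and 55.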
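The invention also relates to a polynucleotide comprising at least one of the isolated polynucleotides according to the invention as outlined above. The invention further provides an isolated adenoviral capsid polypeptide, or a functional derivative thereof, encoded by the isolated polynucleotide according to the invention.
In a further aspect, the invention provides a vector comprising the isolated polynucleotide according to the invention.
Also provided is a recombinant adenovirus, preferably a replication-incompetent adenovirus, comprising an isolated polynucleotide according to the invention and/or at least one isolated adenoviral capsid polypeptide according to the invention.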
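A further aspect of the invention is a composition comprising an adjuvant and at least one of the following (i) to (iv):
(i) one or more isolated adenoviral capsid polypeptides according to the invention;
(ii) an isolated polynucleotide according to the invention;
(iii) a vector according to the invention;
(iv) a recombinant adenovirus according to the invention;
and, optionally, a pharmaceutically acceptable excipient.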
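The invention further relates to a cell comprising at least one of the following:
(i) one or more isolated adenoviral capsid polypeptides according to the invention;
(ii) an isolated polynucleotide according to the invention;
(iii) a vector according to the invention;
(iv) a recombinant adenovirus according to the invention.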
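A further aspect of the invention relates to the use of an isolated adenoviral capsid polypeptide according to the invention, an isolated polynucleotide according to the invention, a vector according to the invention, a recombinant adenovirus according to the invention and/or a composition according to the invention for the therapy or prophylaxis of a disease.
Detailed Description of the Invention
Before the present invention is described in detail below, it is to be understood that the invention is not limited to the particular methodology, protocols and reagents described herein, as these may vary. It is also to be understood that the terminology used herein serves only to describe particular embodiments and is not intended to limit the scope of the invention, which is limited only by the appended claims. Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of ordinary skill in the art.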
Preferably, the terms used herein are defined as described in "A multilingual glossary of biotechnological terms: (IUPAC Recommendations)", Leuenberger, H.G.W., Nagel, B. and Kölbl, H. eds. (1995), Helvetica Chimica Acta, CH-4010 Basel, Switzerland, and as described in "Pharmaceutical Substances: Syntheses, Patents, Applications" by Axel Kleemann and Jurgen Engel, Thieme Medical Publishing, 1999; the "Merck Index: An Encyclopedia of Chemicals, Drugs, and Biologicals", edited by Susan Budavari et al., CRC Press, 1996; and the United States Pharmacopeia-25/National Formulary-20, published by the United States Pharmacopeial Convention, Inc., Rockville Md., 2001.
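Throughout this specification and the claims that follow, unless the context requires otherwise, the word "comprise", and variations such as "comprises" and "comprising", will be understood to imply the inclusion of a stated feature, integer or step, or group of features, integers or steps, but not the exclusion of any other feature, integer or step, or group of integers or steps. Different aspects of the invention are defined in more detail in the following passages. Each aspect so defined may be combined with any other aspect or aspects unless clearly indicated to the contrary. In particular, any feature indicated as being preferred or advantageous may be combined with any other feature or features indicated as being preferred or advantageous.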
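Several documents are cited throughout the text of this specification. Each of the documents cited herein (including all patents, patent applications, scientific publications, manufacturer's specifications, instructions, etc.), whether above or below, is hereby incorporated by reference in its entirety. Nothing herein is to be construed as an admission that the invention is not entitled to antedate such disclosure by virtue of prior invention.
Definitions of some terms used frequently in this specification are provided below. Each time one of these terms is used in the remainder of the specification, it has the respectively defined meaning and preferred meanings.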
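Generally, adenoviral genomes are well characterized. The overall organization of the adenoviral genome is generally conserved with respect to specific open reading frames being similarly positioned, e.g. the location of the E1A, E1B, E2A, E2B, E3, E4, L1, L2, L3, L4 and L5 genes of each virus. Each end of the adenoviral genome comprises a sequence known as an inverted terminal repeat (ITR), which is necessary for viral replication. The virus also comprises a virus-encoded protease, which is necessary for processing some of the structural proteins required to produce infectious virions. The structure of the adenoviral genome is described on the basis of the order in which the viral genes are expressed following host cell transduction. More specifically, the viral genes are referred to as early (E) or late (L) genes according to whether transcription occurs before or after the onset of DNA replication. In the early phase of transduction, the E1A, E1B, E2A, E2B, E3 and E4 genes of the adenovirus are expressed to prepare the host cell for viral replication. During the late phase of infection, expression of the late genes L1-L5, which encode the structural components of the virus particles, is activated.
Table 1 below provides an overview of the sequences referred to herein:
Table 1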
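As used herein, the term "isolated" refers to a molecule that is substantially free of other molecules with which it is naturally associated. An isolated molecule is thus free of the other molecules that it would encounter or contact in a living animal in nature, i.e. outside an experimental setting.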
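As used herein, the terms "protein", "peptide" and "polypeptide" are used interchangeably throughout. These terms refer both to naturally occurring peptides (e.g. naturally occurring proteins) and to synthesized peptides that may include naturally or non-naturally occurring amino acids. Peptides can also be chemically modified by modifying a side chain or a free amino or carboxy terminus of a naturally or non-naturally occurring amino acid. Such chemical modification includes the addition of further chemical moieties as well as the modification of functional groups in side chains of the amino acids, such as glycosylation. A peptide is a polymer preferably having at least 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95 or at least 100 amino acids, most preferably at least 8 or at least 30 amino acids. Because the polypeptides and proteins disclosed herein are derived from adenovirus, the molecular mass of an isolated polypeptide or protein as used herein preferably does not exceed 200 kDa.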
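The term "vector" as used herein includes any vector known to the skilled person, including plasmid vectors, cosmid vectors, phage vectors such as lambda phage, viral vectors such as adenovirus (Ad) vectors (e.g. the non-replicating Ad5, Ad11, Ad26, Ad35, Ad49, ChAd3, ChAd4, ChAd5, ChAd7, ChAd8, ChAd9, ChAd10, ChAd11, ChAd16, ChAd17, ChAd19, ChAd20, ChAd22, ChAd24, ChAd26, ChAd30, ChAd31, ChAd37, ChAd38, ChAd44, ChAd63 and ChAd82 vectors known from the prior art (e.g. WO 2005/071093 A2), or the replication-competent Ad4 and Ad7 vectors), adeno-associated virus (AAV) vectors (e.g. AAV type 5), alphavirus vectors (e.g. Venezuelan equine encephalitis virus (VEE), Sindbis virus (SIN), Semliki Forest virus (SFV) and VEE-SIN chimeras), herpes virus vectors, measles virus vectors, pox virus vectors (e.g. vaccinia virus, modified vaccinia virus Ankara (MVA), NYVAC (derived from the Copenhagen strain of vaccinia virus), and avipox vectors: canarypox (ALVAC) and fowlpox (FPV) vectors), vesicular stomatitis virus vectors, virus-like particles, and bacterial spores. Vectors also include expression vectors, cloning vectors and vectors that are useful for generating recombinant adenoviruses in host cells.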
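The term "expression vector" refers to a nucleic acid molecule comprising at least one nucleic acid sequence to be expressed, together with its transcriptional and translational control sequences. Changing the expression cassette causes the vector that contains it to direct the expression of a different sequence or combination of sequences. Because restriction sites are preferably engineered at the 5' and 3' ends, the expression cassette can easily be inserted, removed, or replaced with another expression cassette. Preferably, an expression cassette includes cis-regulatory elements for efficient expression of a given gene, such as a promoter, an initiation site and/or a polyadenylation site, as further described below.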
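The term "antibody" refers to both monoclonal and polyclonal antibodies, i.e. any immunoglobulin or portion thereof that is capable of binding an antigen or hapten. Antigen-binding portions may be produced by recombinant DNA techniques or by enzymatic or chemical cleavage of intact antibodies. In some embodiments, antigen-binding portions include Fab, Fab', F(ab')2, Fd, Fv, dAb and complementarity determining region (CDR) variants, single-chain antibodies (scFv), chimeric antibodies, humanized antibodies, diabodies, and polypeptides that contain at least a portion of an antibody that is sufficient to confer specific antigen binding to the polypeptide.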
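In the context of the present invention, administering an immunogen/antigen to a mammal in order to elicit/generate an immune response is referred to as "priming", and administering an immunogen/antigen to a mammal in order to enhance the immune response against said immunogen/antigen, e.g. a particular pathogen (such as a virion or viral pathogen, an antigen of a pathogenic bacterium, or a tumor antigen), is referred to as "boosting". The phrase "heterologous prime-boost" means that the vector used to elicit/generate the immune response in a mammal (priming) and the vector used to enhance the immune response in the mammal (boosting) are different. Heterologous prime-boost is useful if a subject, e.g. a patient, has developed antibodies against the first vector and a boost is required. Thus, in a preferred embodiment of heterologous prime-boost, two different adenoviruses may be used, e.g. for vaccination and/or gene therapy. In this context, a first and a second adenovirus are sufficiently different if the antibody response elicited during priming with the first adenovirus does not prevent more than 70%, or preferably more than 80%, of the second adenovirus particles administered for boosting from entering the nucleus of cells of the animal that has undergone priming and boosting.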
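The term "replication-competent" recombinant adenovirus (AdV) refers to an adenovirus that can replicate in a host cell in the absence of any recombinant helper proteins in the cell. Preferably, a "replication-competent" adenovirus comprises the following intact or functionally essential early genes: E1A, E1B, E2A, E2B, E3 and E4. Wild-type adenoviruses isolated from a particular animal will be replication-competent in that animal.
The term "replication-defective" recombinant AdV refers to an adenovirus that has been rendered incapable of replication because it has been engineered to comprise at least a functional deletion, i.e. a deletion that impairs the function of a gene without removing it entirely (e.g. the introduction of artificial stop codons, the deletion or mutation of active sites or interaction domains, or the mutation or deletion of a regulatory sequence of a gene), or the complete removal of a gene encoding a gene product that is essential for viral replication, such as one or more of the adenoviral genes selected from E1, E2, E3 and E4. The recombinant chimpanzee adenoviral vectors of the invention are preferably replication-defective.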
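In the context of polynucleotide, polypeptide or protein sequences, the terms "identity" and "identical" refer to the number of residues that are the same in two sequences when they are aligned for maximum correspondence. Specifically, the percent sequence identity of two sequences, whether nucleic acid or amino acid sequences, is the number of exact matches between the two aligned sequences divided by the length of the shorter sequence and multiplied by 100. Alignment tools that can be used to align two sequences are well known to the person skilled in the art and can be obtained, for example, on the World Wide Web, e.g. ClustalW (www.ebi.ac.uk/clustalw) or Align (http://www.ebi.ac.uk/emboss/align/index.html). The alignment between two sequences may be carried out using standard settings; for Align EMBOSS::needle the preferred settings are: Matrix: Blosum62, Gap Open 10.0, Gap Extend 0.5. Those skilled in the art understand that it may be necessary to introduce gaps in either of the two sequences to produce a satisfactory alignment. The "best sequence alignment" between two polypeptides is defined as the alignment that produces the largest number of aligned identical residues.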
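The percent identity defined above (exact matches divided by the length of the shorter sequence, times 100) can be sketched in Python as follows. This is only an illustrative sketch: it assumes the two sequences have already been aligned by an external tool and that gaps are written as "-"; the function name `percent_identity` is hypothetical and is not part of any tool named in the text.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two sequences that are already aligned.

    Exact matches are divided by the length of the shorter (ungapped)
    sequence and multiplied by 100, following the definition in the text.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Count columns where both sequences carry the same residue (not a gap).
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    # Length of the shorter sequence, ignoring gap characters.
    shorter = min(
        len(aligned_a.replace(gap, "")), len(aligned_b.replace(gap, ""))
    )
    if shorter == 0:
        raise ValueError("each sequence must contain at least one residue")
    return 100.0 * matches / shorter
```

For instance, `percent_identity("MKTALV", "MKSALV")` compares six aligned residues of which five match. Because the denominator is the shorter sequence, a gap in one sequence does not by itself lower the value.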
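Adenoviruses
Adenoviruses (Ads) are non-enveloped, icosahedral viruses that have been identified in several avian and mammalian hosts. Human adenoviruses (hAds) belong to the genus Mastadenovirus, which includes all known human adenoviruses and many adenoviruses of animal (e.g. bovine, porcine, canine, murine, equine, simian and ovine) origin. Human adenoviruses are generally divided into six subgroups (A-F) on the basis of a number of biological, chemical, immunological and structural criteria, which include hemagglutination properties with rat and rhesus monkey erythrocytes, DNA homology, restriction enzyme cleavage patterns, percentage G+C content and oncogenicity (Straus, 1984, in The Adenoviruses, ed. H. Ginsberg, pp. 451-498, New York: Plenum Press, and Horwitz, 1990, in Virology, eds. B. N. Fields and D. M. Knipe, pp. 1679-1721).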
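The adenoviral virion has icosahedral symmetry and, depending on the serotype, a diameter of 60-90 nm. The icosahedral capsid comprises three major proteins: hexon (II), penton base (III) and a knobbed fiber (IV) protein (W. C. Russell, J. Gen. Virol., 81: 2573-2604 (2000)). One aspect of the pre-existing immunity observed in humans is humoral immunity, which can result in the production and persistence of antibodies specific for adenoviral proteins. The humoral response elicited by adenovirus is mainly directed against the three major structural proteins: hexon, penton and fiber.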
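To date, 51 distinct human adenovirus serotypes have been recognized and grouped into subgroups on the basis of their hemagglutination properties and of biophysical and biochemical criteria. Published reports have established that titers comprising antibodies against multiple serotypes are common (Dambrosio, E. (1982) J. Hyg. (London) 89: 209-219) and that a substantial portion of these titers have neutralizing activity.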
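As stated, recombinant adenoviruses are useful in gene therapy and as vaccines. Viral vectors based on chimpanzee adenoviruses offer an alternative to human-derived Ad vectors for the development of genetic vaccines (Farina SF, J Virol. 2001 Dec;75(23):11603-13; Fattori E, Gene Ther. 2006 Jul;13(14):1088-96). Adenoviruses isolated from chimpanzees are closely related to adenoviruses isolated from humans, as demonstrated by their efficient propagation in cells of human origin. However, because human and chimpanzee adenoviruses are close relatives, serologic cross-reactivity between the two virus species can be expected.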
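This presumption was confirmed when chimpanzee adenoviruses were isolated and characterized. Nevertheless, adenovirus isolates from chimpanzees showed reduced cross-reactivity with the common serotypes of human adenovirus epitopes. Chimpanzee adenoviruses (herein, adenoviruses of the common chimpanzee are abbreviated "ChAd" and bonobo adenoviruses "PanAd") therefore provide a basis for reducing the adverse effects associated with the pre-existing immunity in humans to common serotypes of human adenoviruses. However, low to intermediate neutralizing titers against the chimpanzee adenoviruses isolated so far have been detected in subsets of human sera, so all known chimpanzee adenovirus serotypes are still neutralized by human sera to some degree.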
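The present invention comprises the unexpected finding that novel chimpanzee adenovirus strains can be isolated, namely ChAd55, ChAd73, ChAd83, ChAd146 and ChAd147, isolated from the common chimpanzee (Pan troglodytes), and PanAd1, PanAd2 and PanAd3, isolated from the bonobo (Pan paniscus). All of these novel strains show no measurable seroprevalence in humans, i.e. these adenovirus strains are an exception among the chimpanzee adenoviruses described so far, because all human sera tested were completely negative for the presence of neutralizing antibodies. In this context, a neutralizing antibody is an antibody that binds to an epitope of the adenovirus and prevents it from producing a productive infection in a host cell, or prevents the transduction of a target cell by a replication-incompetent vector expressing a transgene, e.g. where the adenoviral DNA is capable of entering the host cell. Whereas neutralizing antibodies were observed for all chimpanzee-derived adenoviruses of the prior art, the novel adenovirus types ChAd55, ChAd73, ChAd83, ChAd146, ChAd147, PanAd1, PanAd2 and PanAd3 are characterized by a complete absence of pre-existing neutralizing antibodies against them in humans. These adenoviruses therefore provide a valuable medical tool that can be used, for example, for immunization and/or gene therapy.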
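As described in further detail below, the invention provides, in one aspect, novel sequences of adenoviral capsid proteins, namely hexon, penton and fiber protein, which represent the most surface-exposed adenoviral epitopes. As already mentioned, human sera contain no neutralizing antibodies specific for the viruses according to the invention. One advantage of the aforementioned novel chimpanzee hexon, penton and fiber protein sequences is therefore that they can be used to enhance prior-art adenoviruses that have been engineered for, e.g., medical purposes. For example, the capsid proteins of the invention, or functional fragments thereof, can be used to replace/substitute one or more of the major structural capsid proteins, or functional fragments thereof, of a different adenovirus (e.g. a prior-art adenovirus) to obtain improved recombinant adenoviruses with reduced seroprevalence in humans. Because the novel adenoviruses of the invention, as well as adenoviruses re-engineered as described above, will not encounter any significant inhibitory immune response in humans when administered, their overall transduction efficiency and infectivity will be increased. Such improved adenoviruses are therefore expected to be, for example, more effective vaccines, since entry of the virus into host cells and expression of the antigen cassette will not be hampered by any significant titer of neutralizing antibodies. Moreover, as shown in the examples, an effective immune response against HIV gag was elicited even in naive mice vaccinated with recombinant adenoviruses encoding HIV gag and comprising the hexon, penton and fiber proteins of the ChAd55, ChAd73, ChAd83, ChAd146, ChAd147, PanAd1, PanAd2 or PanAd3 isolates. The immune responses elicited by the ChAd55-gag, ChAd73-gag, ChAd83-gag, ChAd146-gag, ChAd147-gag, PanAd1-gag, PanAd2-gag and PanAd3-gag adenoviruses are comparable to those observed with the most potent vector developed so far, which is based on a prior-art recombinant human Ad5 vector expressing the HIV gag protein (see the enzyme-linked immunospot (ELISpot) assay data in Figures 5A, 5B and 5C).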
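As noted above, the humoral response elicited by adenovirus is mainly directed against the three major adenoviral structural proteins, hexon, penton and fiber, all of which comprise polypeptide sequences that form part of the adenoviral capsid and are exposed on the outside of the virus particle (see also Madisch I. et al., J. Virol. 2005 Dec;79(24):15265-76; Madisch I. et al., J. Virol. 2007 Aug;81(15):8270-81; and Pichla-Gollon SL et al., J. Virol. 2007 Feb;81(4):1680-9).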
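As shown in the multiple sequence alignment of Figure 1, the novel adenovirus isolates of the invention, i.e. the group consisting of PanAd1, PanAd2, PanAd3, ChAd55, ChAd73, ChAd83, ChAd146 and ChAd147, share very similar hexon protein sequences. The alignment also marks the hypervariable regions (HVRs), which occur in loops at the top of the hexon molecule that lie on the exterior of the virion and cover a large part of its surface (see John J. Rux et al., J. of Virology, Sept 2003, vol. 77, no. 17). Figures 2 and 3 show the sequence relatedness of the other capsid proteins of the novel chimpanzee adenoviruses, fiber and penton, respectively. All three structural capsid proteins are expected to contribute to the low seroprevalence and can therefore be used independently of each other or in combination to suppress the affinity of an adenovirus for pre-existing neutralizing antibodies, e.g. to produce a recombinant chimeric adenovirus with reduced seroprevalence.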
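Thus, in one aspect, the invention provides an isolated polynucleotide that encodes an adenoviral fiber protein or a functional derivative thereof and that is selected from:
(a) a polynucleotide encoding a polypeptide having the amino acid sequence according to any of SEQ ID NO: 14-19, 50 and 53, i.e. SEQ ID NO: 14, 15, 16, 17, 18, 19, 50 or 53;
(b) a polynucleotide encoding a functional derivative of a polypeptide according to any of SEQ ID NO: 14-19, 50 and 53, i.e. SEQ ID NO: 14, 15, 16, 17, 18, 19, 50 or 53, wherein said functional derivative comprises the deletion, insertion and/or substitution of one or more amino acid residues; and
(c) a polynucleotide encoding a functional derivative having an amino acid sequence that is at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%, more preferably at least 85% and most preferably at least 99%, identical over its entire length to the amino acid sequence of any of SEQ ID NO: 14-19, 50 and 53, i.e. SEQ ID NO: 14, 15, 16, 17, 18, 19, 50 or 53.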
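An "adenoviral fiber protein" means the knobbed fiber (IV) protein comprised in an adenovirus. In a preferred embodiment, the isolated polynucleotide of the first aspect of the invention, and of its preferred embodiments described below, encodes a fiber protein or a functional derivative thereof that has the same function as the fiber protein, or a fragment thereof, in an infectious adenovirus virion. Thus, a recombinant adenovirus comprising said fiber protein or functional fiber derivative, preferably as a capsid protein, is capable of entering a host cell. Whether a recombinant adenovirus can enter a host cell is easily determined. For example, after the host cell has been contacted with the adenovirus, the recombinant host cell can be washed and lysed, and an appropriate hybridization probe specific for adenoviral RNA and/or DNA can be used to determine whether adenoviral RNA and/or DNA is found in the host cell. Alternatively or additionally, the host cell that has been contacted with the recombinant adenovirus can be washed, lysed and probed with adenovirus-specific antibodies, e.g. by Western blot. In yet another alternative, one observes, e.g. in vivo, whether the host cell expresses a gene product, for example a fluorescent protein, after infection with a recombinant adenovirus that comprises a suitable expression cassette for expressing said gene product in the host cell.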
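It is further preferred that the fiber protein and functional derivatives thereof have an affinity for an adenoviral penton protein, e.g. for SEQ ID NO: 26-31, 52 and/or 55. The average skilled person is well aware of how to test protein-protein affinities. To determine whether a first protein is capable of binding a second protein, such as the penton protein of a chimpanzee-derived adenovirus, the skilled person may use, for example, a genetic yeast two-hybrid assay or a biochemical assay such as a pull-down, an enzyme-linked immunosorbent assay (ELISA), a fluorescence-activated cell sorting (FACS)-based assay, or a plasmon resonance assay. When using pull-down or plasmon resonance assays, it is useful to fuse at least one of the proteins to an affinity tag such as a HIS-tag, a GST-tag or another tag well known in the field of biochemistry. In its glycosylated form, the adenoviral fiber protein is further capable of trimerizing. It is therefore also preferred that the fiber protein, or fragment thereof, encoded by the polynucleotide according to the first aspect of the invention is capable of being glycosylated and/or of forming a trimer.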
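As used throughout this application, the phrase "functional derivative" of a protein or polypeptide generally refers to a modified form of the protein or polypeptide in which, e.g., one or more amino acids may be deleted, inserted, modified and/or substituted. As also mentioned above, a derivative is functional if a chimeric adenovirus comprising the functional derivative in its capsid is capable of infecting a host cell. Furthermore, in the context of a "functional derivative", an insertion refers to the insertion of one or more amino acids into the original polypeptide or protein. Preferably, a functional derivative contains no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100 amino acid changes (i.e. deleted, inserted, modified and/or substituted amino acids). In another embodiment, preferably no more than 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 15% or no more than 20% (most preferably no more than 5%) of all amino acids of the protein or polypeptide are changed (i.e. are deleted, inserted, modified and/or substituted amino acids). Amino acids of the protein or polypeptide may also be modified, e.g. chemically modified. For example, the side chain or a free amino or carboxy terminus of an amino acid of the protein or polypeptide may be modified by, e.g., glycosylation, amidation, phosphorylation or ubiquitination. As is well known in the art, the chemical modification may also take place in vivo, e.g. in a host cell. For example, a suitable chemical modification motif present in the amino acid sequence of the protein, such as a glycosylation sequence motif, will cause the protein to be glycosylated. A substitution in a derivative may be conservative or non-conservative, and is preferably conservative. In some embodiments, a substitution also includes the exchange of a naturally occurring amino acid for a non-naturally occurring amino acid. A conservative substitution replaces an amino acid with another amino acid whose chemical properties are similar to those of the amino acid that is replaced. Preferably, the conservative substitution is selected from the following:
(i) the substitution of a basic amino acid with another, different basic amino acid;
(ii) the substitution of an acidic amino acid with another, different acidic amino acid;
(iii) the substitution of an aromatic amino acid with another, different aromatic amino acid;
(iv) the substitution of a non-polar, aliphatic amino acid with another, different non-polar, aliphatic amino acid;
(v) the substitution of a polar, uncharged amino acid with another, different polar, uncharged amino acid.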
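A basic amino acid is preferably selected from arginine, histidine and lysine. An acidic amino acid is preferably aspartic acid or glutamic acid. An aromatic amino acid is preferably selected from phenylalanine, tyrosine and tryptophan. A non-polar, aliphatic amino acid is preferably selected from glycine, alanine, valine, leucine, methionine and isoleucine. A polar, uncharged amino acid is preferably selected from serine, threonine, cysteine, proline, asparagine and glutamine. In contrast to a conservative amino acid substitution, a non-conservative amino acid substitution is the exchange of one amino acid for any amino acid that does not fall under the conservative substitutions (i) to (v) listed above.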
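The five classes of conservative substitution listed above can be expressed as a small lookup. The sketch below is illustrative only: it uses one-letter amino acid codes, takes the acidic class to be aspartic acid (D) and glutamic acid (E), and the names `CLASSES` and `is_conservative` are hypothetical.

```python
# Amino acid classes (i) to (v) from the text, in one-letter code.
CLASSES = {
    "basic": set("RHK"),               # arginine, histidine, lysine
    "acidic": set("DE"),               # aspartic acid, glutamic acid (assumed)
    "aromatic": set("FYW"),            # phenylalanine, tyrosine, tryptophan
    "nonpolar_aliphatic": set("GAVLMI"),
    "polar_uncharged": set("STCPNQ"),
}


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if replacing `original` with `replacement` stays within
    one of the classes above; an identical residue is not a substitution."""
    if original == replacement:
        return False
    return any(
        original in members and replacement in members
        for members in CLASSES.values()
    )
```

Under this scheme, lysine to arginine (`is_conservative("K", "R")`) is conservative, whereas lysine to aspartic acid crosses classes and is therefore non-conservative.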
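If a functional derivative comprises a deletion, one or several amino acids that are present in the reference polypeptide or protein sequence have been removed in the derivative. The deletion must not, however, be so extensive that the derivative comprises fewer than 200 amino acids in total.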
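The numeric limits on a functional derivative given above (a maximum number of changed amino acids, a maximum changed fraction, and a minimum remaining length of 200 amino acids) can be checked mechanically once the derivative has been aligned to its reference. The sketch below is a hypothetical illustration: the defaults of 10 changes and 5% are preferred values named in the text, the count-based and percentage-based limits are presented there as alternative embodiments and are therefore reported separately, and chemically "modified" residues cannot be seen in a plain sequence and are not counted.

```python
from typing import Dict


def count_changes(aligned_ref: str, aligned_der: str) -> int:
    """Count alignment columns that differ (substitution, insertion, deletion)."""
    if len(aligned_ref) != len(aligned_der):
        raise ValueError("aligned sequences must have the same length")
    return sum(1 for r, d in zip(aligned_ref, aligned_der) if r != d)


def derivative_checks(
    aligned_ref: str,
    aligned_der: str,
    max_changes: int = 10,       # "preferably no more than 10" changes
    max_fraction: float = 0.05,  # "most preferably no more than 5%"
    min_length: int = 200,       # derivative keeps at least 200 residues
    gap: str = "-",
) -> Dict[str, bool]:
    """Report each limit separately instead of combining them."""
    changes = count_changes(aligned_ref, aligned_der)
    ref_len = len(aligned_ref.replace(gap, ""))
    der_len = len(aligned_der.replace(gap, ""))
    return {
        "max_changes": changes <= max_changes,
        "max_fraction": changes <= max_fraction * ref_len,
        "min_length": der_len >= min_length,
    }
```

Reporting the three checks separately leaves it to the caller to decide which embodiment applies.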
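Methods for determining sequence identity are described above. In addition, the percent identity between two sequences can be determined using the mathematical algorithm of Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90: 5873-5877. Such an algorithm is incorporated into the BLASTN and BLASTP programs of Altschul et al. (1990) J. Mol. Biol. 215: 403-410. When BLASTN and BLASTP are used, the default parameters of these programs are preferred.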
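As described above, the hypervariable domains of the adenoviral hexon protein are exposed on the outside of the adenovirus. These regions of the adenoviral capsid can therefore be recognized and bound by neutralizing antibodies. Consequently, an adenovirus whose capsid comprises a hexon protein derived from one of the novel adenovirus isolates of the invention will exhibit an improved, i.e. lower, seroprevalence in humans. Thus, in a second aspect, the invention provides an isolated polynucleotide that encodes an adenoviral hexon protein or a functional derivative thereof and that is selected from:
(a) a polynucleotide encoding a polypeptide having the amino acid sequence according to any of SEQ ID NO: 20-25, 51 and 54, i.e. SEQ ID NO: 20, 21, 22, 23, 24, 25, 51 or 54;
(b) a polynucleotide encoding a functional derivative of a polypeptide according to any of SEQ ID NO: 20-25, 51 and 54, i.e. SEQ ID NO: 20, 21, 22, 23, 24, 25, 51 or 54, wherein said functional derivative comprises the deletion, insertion and/or substitution of one or more amino acid residues; and
(c) a polynucleotide encoding a functional derivative having an amino acid sequence that is at least 95%, 98%, 99%, 99.5%, 99.9% or at least 99.95%, more preferably at least 98% and most preferably at least 99.95%, identical over its entire length to the amino acid sequence of any of SEQ ID NO: 20-25, 51 and 54, i.e. SEQ ID NO: 20, 21, 22, 23, 24, 25, 51 or 54.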
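In a preferred embodiment, the isolated polynucleotide of the second aspect of the invention, and of its preferred embodiments described below, encodes a hexon protein or a functional derivative thereof that has the same function as the hexon protein, or a fragment thereof, in an infectious adenovirus virion. Thus, a recombinant adenovirus comprising said hexon or functional derivative, preferably as a capsid protein, is capable of entering a host cell. One suitable method for generating functional derivatives of a hexon protein is described in US Patent 5,922,315, which is incorporated by reference. In this method, at least one loop region of the adenovirus hexon is changed to at least one loop region of another adenovirus serotype. For example, a loop region of a hexon protein of the invention can be used to replace the corresponding hexon loop of a prior-art adenovirus to generate an improved hybrid adenovirus. Derivatives of the penton and fiber proteins of the invention can be generated analogously.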
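In a third aspect, the invention provides an isolated polynucleotide that encodes an adenoviral penton protein or a functional derivative thereof and that is selected from:
(a) a polynucleotide encoding a polypeptide having the amino acid sequence according to any of SEQ ID NO: 26-31, 52 and 55, i.e. SEQ ID NO: 26, 27, 28, 29, 30, 31, 52 or 55;
(b) a polynucleotide encoding a functional derivative of a polypeptide according to any of SEQ ID NO: 26-31, 52 and 55, i.e. SEQ ID NO: 26, 27, 28, 29, 30, 31, 52 or 55, wherein said functional derivative comprises the deletion, insertion and/or substitution of one or more amino acid residues; and
(c) a polynucleotide encoding a functional derivative having an amino acid sequence that is at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%, more preferably at least 85% and most preferably at least 99%, identical over its entire length to the amino acid sequence of any of SEQ ID NO: 26-31, 52 and 55, i.e. SEQ ID NO: 26, 27, 28, 29, 30, 31, 52 or 55.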
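Preferably, the penton protein and functional derivatives thereof have an affinity for an adenoviral fiber protein, e.g. for SEQ ID NO: 14-19, 50 and/or 53. As noted above, the average skilled person is well aware of how to test protein-protein affinities. An "adenoviral penton protein" means the penton base (III) protein comprised in an adenovirus. An adenoviral penton protein is characterized in that it is located at the corners of the icosahedral symmetry of the capsid. As stated above, in preferred embodiments of the polynucleotides of the first, second and/or third aspect of the invention, and of their preferred embodiments described below, the polynucleotide encodes one or more polypeptides such that a recombinant adenovirus comprising said one or more polypeptides, preferably as capsid proteins, is capable of infecting, i.e. entering, a host cell.
Preferred embodiments of the first, second and third aspects of the invention are detailed below for each of the novel chimpanzee adenovirus isolates disclosed herein.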
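Adenovirus ChAd55
In a preferred embodiment of the first aspect of the invention, the isolated polynucleotide encodes an adenoviral fiber protein having the amino acid sequence according to SEQ ID NO: 14, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%, more preferably at least 85% and most preferably at least 99%, identical over its entire length to the amino acid sequence of SEQ ID NO: 14.
In a preferred embodiment of the second aspect of the invention, the isolated polynucleotide encodes an adenoviral hexon protein having the amino acid sequence according to SEQ ID NO: 20, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 95%, 98%, 99%, 99.5%, 99.9% or at least 99.95%, more preferably at least 98%, identical over its entire length to the amino acid sequence of SEQ ID NO: 20.
In a preferred embodiment of the third aspect of the invention, the isolated polynucleotide encodes an adenoviral penton protein having the amino acid sequence according to SEQ ID NO: 26, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 98%, 99%, 99.5%, 99.9% or at least 99.95%, more preferably at least 98% and most preferably at least 99.9%, identical over its entire length to the amino acid sequence of SEQ ID NO: 26.
In a further aspect, the invention relates to a polynucleotide comprising the polynucleotide(s) of the first, the second, the third, the first and second, the first and third, the second and third, or the first, second and third aspects. Preferably, the polynucleotide comprising this or these polynucleotides also comprises other adenoviral genes and nucleotide segments that are adjacent to the hexon, penton and/or fiber gene in the adenovirus genome (e.g. using the Ad5 genome as reference). Preferably, the polynucleotide also comprises the sequences required for packaging the polynucleotide into an adenoviral particle.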
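Adenovirus ChAd73
In a preferred embodiment of the first aspect of the invention, the isolated polynucleotide encodes an adenoviral fiber protein having the amino acid sequence according to SEQ ID NO: 15, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 98%, 99% or at least 99.9%, more preferably at least 99% and most preferably at least 99.9%, identical over its entire length to the amino acid sequence of SEQ ID NO: 15.
In a preferred embodiment of the second aspect of the invention, the isolated polynucleotide encodes an adenoviral hexon protein having the amino acid sequence according to SEQ ID NO: 21, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 95%, 98%, 99%, 99.5%, 99.9% or at least 99.95%, more preferably at least 98%, identical over its entire length to the amino acid sequence of SEQ ID NO: 21.
In a preferred embodiment of the third aspect of the invention, the isolated polynucleotide encodes an adenoviral penton protein having the amino acid sequence according to SEQ ID NO: 27, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 98%, 99%, 99.5%, 99.9% or at least 99.95%, more preferably at least 98% and most preferably at least 99%, identical over its entire length to the amino acid sequence of SEQ ID NO: 27.
In a further aspect, the invention relates to a polynucleotide comprising the polynucleotide(s) of the first, the second, the third, the first and second, the first and third, the second and third, or the first, second and third aspects. Preferably, the polynucleotide comprising this or these polynucleotides also comprises other adenoviral genes and nucleotide segments that are adjacent to the hexon, penton and/or fiber gene in the adenovirus genome (e.g. using the Ad5 genome as reference). Preferably, the polynucleotide also comprises the sequences required for packaging the polynucleotide into an adenoviral particle.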
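Adenovirus ChAd83
In a preferred embodiment of the first aspect of the invention, the isolated polynucleotide encodes an adenoviral fiber protein having the amino acid sequence according to SEQ ID NO: 16, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has the amino acid sequence of SEQ ID NO: 16.
In a preferred embodiment of the second aspect of the invention, the isolated polynucleotide encodes an adenoviral hexon protein having the amino acid sequence according to SEQ ID NO: 22, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 95%, 98%, 99%, 99.5%, 99.9% or at least 99.95%, more preferably at least 98%, identical over its entire length to the amino acid sequence of SEQ ID NO: 22.
In a preferred embodiment of the third aspect of the invention, the isolated polynucleotide encodes an adenoviral penton protein having the amino acid sequence according to SEQ ID NO: 28, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 98%, 99%, 99.5%, 99.9% or at least 99.95%, more preferably at least 98% and most preferably at least 99%, identical over its entire length to the amino acid sequence of SEQ ID NO: 28.
In a most preferred embodiment, the polynucleotide of the invention consists of or comprises a polynucleotide that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% or 98% identical, and most preferably at least 99% or 100% identical, over its entire length to a sequence consisting of SEQ ID NO: 65, or to a sequence consisting of SEQ ID NO: 65 but lacking any of the genomic regions E1A, E1B, E2A, E2B, E3 and/or E4 of SEQ ID NO: 65, most preferably lacking the genomic regions E1, E3 and E4 of SEQ ID NO: 65.
In a further aspect, the invention relates to a polynucleotide comprising the polynucleotide(s) of the first, the second, the third, the first and second, the first and third, the second and third, or the first, second and third aspects. Preferably, the polynucleotide comprising this or these polynucleotides also comprises other adenoviral genes and nucleotide segments that are adjacent to the hexon, penton and/or fiber gene in the adenovirus genome (e.g. using the ChAd83 genome as shown in SEQ ID NO: 65). Preferably, the polynucleotide also comprises the sequences required for packaging the polynucleotide into an adenoviral particle.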
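Adenovirus ChAd146
In a preferred embodiment of the first aspect of the invention, the isolated polynucleotide encodes an adenoviral fiber protein having the amino acid sequence according to SEQ ID NO: 17, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has the amino acid sequence of SEQ ID NO: 17.
In a preferred embodiment of the second aspect of the invention, the isolated polynucleotide encodes an adenoviral hexon protein having the amino acid sequence according to SEQ ID NO: 23, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 95%, 98%, 99%, 99.5%, 99.9% or at least 99.95%, more preferably at least 98%, identical over its entire length to the amino acid sequence of SEQ ID NO: 23.
In a preferred embodiment of the third aspect of the invention, the isolated polynucleotide encodes an adenoviral penton protein having the amino acid sequence according to SEQ ID NO: 29, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 98%, 99%, 99.5%, 99.9% or at least 99.95%, more preferably at least 98% and most preferably at least 99%, identical over its entire length to the amino acid sequence of SEQ ID NO: 29.
In a further aspect, the invention relates to a polynucleotide comprising the polynucleotide(s) of the first, the second, the third, the first and second, the first and third, the second and third, or the first, second and third aspects. Preferably, the polynucleotide comprising this or these polynucleotides also comprises other adenoviral genes and nucleotide segments that are adjacent to the hexon, penton and/or fiber gene in the adenovirus genome (e.g. using the Ad5 genome as reference). Preferably, the polynucleotide also comprises the sequences required for packaging the polynucleotide into an adenoviral particle.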
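Adenovirus ChAd147
In a preferred embodiment of the first aspect of the invention, the isolated polynucleotide encodes an adenoviral fiber protein having the amino acid sequence according to SEQ ID NO: 18, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%, more preferably at least 85% and most preferably at least 90%, identical over its entire length to the amino acid sequence of SEQ ID NO: 18.
In a preferred embodiment of the second aspect of the invention, the isolated polynucleotide encodes an adenoviral hexon protein having the amino acid sequence according to SEQ ID NO: 24, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 95%, 98%, 99%, 99.5%, 99.9% or at least 99.95%, more preferably at least 98%, identical over its entire length to the amino acid sequence of SEQ ID NO: 24.
In a preferred embodiment of the third aspect of the invention, the isolated polynucleotide encodes an adenoviral penton protein having the amino acid sequence according to SEQ ID NO: 30, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 98%, 99%, 99.5%, 99.9% or at least 99.95%, more preferably at least 98% and most preferably at least 99%, identical over its entire length to the amino acid sequence of SEQ ID NO: 30.
In a further aspect, the invention relates to a polynucleotide comprising the polynucleotide(s) of the first, the second, the third, the first and second, the first and third, the second and third, or the first, second and third aspects. Preferably, the polynucleotide comprising this or these polynucleotides also comprises other adenoviral genes and nucleotide segments that are adjacent to the hexon, penton and/or fiber gene in the adenovirus genome (e.g. using the Ad5 genome as reference). Preferably, the polynucleotide also comprises the sequences required for packaging the polynucleotide into an adenoviral particle.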
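Adenovirus PanAd1
In a preferred embodiment of the first aspect of the invention, the isolated polynucleotide encodes an adenoviral fiber protein having the amino acid sequence according to SEQ ID NO: 19, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%, more preferably at least 85% and most preferably at least 99%, identical over its entire length to the amino acid sequence of SEQ ID NO: 19.
In a preferred embodiment of the second aspect of the invention, the isolated polynucleotide encodes an adenoviral hexon protein having the amino acid sequence according to SEQ ID NO: 25, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 95%, 98%, 99%, 99.5%, 99.9% or at least 99.95%, more preferably at least 98% and most preferably at least 99%, identical over its entire length to the amino acid sequence of SEQ ID NO: 25.
In a preferred embodiment of the third aspect of the invention, the isolated polynucleotide encodes an adenoviral penton protein having the amino acid sequence according to SEQ ID NO: 31, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%, more preferably at least 85% and most preferably at least 90%, identical over its entire length to the amino acid sequence of SEQ ID NO: 31.
In a most preferred embodiment, the polynucleotide of the invention consists of or comprises a polynucleotide that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% or 98% identical, and most preferably at least 99% or 100% identical, over its entire length to a sequence consisting of SEQ ID NO: 13, or to a sequence consisting of SEQ ID NO: 13 but lacking any of the genomic regions E1A, E1B, E2A, E2B, E3 and/or E4 of SEQ ID NO: 13, most preferably lacking the genomic regions E1, E3 and E4 of SEQ ID NO: 13.
In a further aspect, the invention relates to a polynucleotide comprising the polynucleotide(s) of the first, the second, the third, the first and second, the first and third, the second and third, or the first, second and third aspects. Preferably, the polynucleotide comprising this or these polynucleotides also comprises other adenoviral genes and nucleotide segments that are adjacent to the hexon, penton and/or fiber gene in the adenovirus genome (e.g. using the PanAd1 genome as shown in SEQ ID NO: 13). Preferably, the polynucleotide also comprises the sequences required for packaging the polynucleotide into an adenoviral particle.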
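Adenovirus PanAd2
In a preferred embodiment of the first aspect of the invention, the isolated polynucleotide encodes an adenoviral fiber protein having the amino acid sequence according to SEQ ID NO: 50, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%, more preferably at least 85% and most preferably at least 99%, identical over its entire length to the amino acid sequence of SEQ ID NO: 50.
In a preferred embodiment of the second aspect of the invention, the isolated polynucleotide encodes an adenoviral hexon protein having the amino acid sequence according to SEQ ID NO: 51, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 95%, 98%, 99%, 99.5%, 99.9% or at least 99.95%, more preferably at least 98% and most preferably at least 99%, identical over its entire length to the amino acid sequence of SEQ ID NO: 51.
In a preferred embodiment of the third aspect of the invention, the isolated polynucleotide encodes an adenoviral penton protein having the amino acid sequence according to SEQ ID NO: 52, or a functional derivative thereof, wherein said functional derivative (i) comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90 or no more than 100, preferably no more than 10, deleted, inserted, modified and/or substituted amino acids, or (ii) has an amino acid sequence that is at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or at least 99%, more preferably at least 85% and most preferably at least 90%, identical over its entire length to the amino acid sequence of SEQ ID NO: 52.
In a most preferred embodiment, the polynucleotide of the invention consists of or comprises a polynucleotide that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% or 98% identical, and most preferably at least 99% or 100% identical, over its entire length to a sequence consisting of SEQ ID NO: 62, or to a sequence consisting of SEQ ID NO: 62 but lacking any of the genomic regions E1A, E1B, E2A, E2B, E3 and/or E4 of SEQ ID NO: 62, most preferably lacking the genomic regions E1, E3 and E4 of SEQ ID NO: 62.
In a further aspect, the invention relates to a polynucleotide comprising the polynucleotide(s) of the first, the second, the third, the first and second, the first and third, the second and third, or the first, second and third aspects. Preferably, the polynucleotide comprising this or these polynucleotides also comprises other adenoviral genes and nucleotide segments that are adjacent to the hexon, penton and/or fiber gene in the adenovirus genome (e.g. using the PanAd2 genome as shown in SEQ ID NO: 62). Preferably, the polynucleotide also comprises the sequences required for packaging the polynucleotide into an adenoviral particle.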
腺病毒
PanAd3
在本发明的第一个方面的优选实施方案中,分离的多核苷酸编码具有根据SEQ ID NO: 53的氨基酸序列的腺病毒纤维蛋白或其功能性衍生物,其中所述功能性衍生物(i)不包含超过1、2、3、4、5、6、7、8、9、10、15、20、25、30、35、40、45、50、60、70、80、90或超过100,优选不超过10个缺失的、插入的、修饰的和/或置换的氨基酸或(ii)具有在全长上与SEQ ID NO: 53的氨基酸序列至少85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%,更优选至少85%和最优选至少99%相同的氨基酸序列。
在本发明的第二个方面的优选实施方案中,分离的多核苷酸编码具有根据SEQ ID NO: 54的氨基酸序列的腺病毒六邻体蛋白或其功能性衍生物,其中所述功能性衍生物(i)不包含超过1、2、3、4、5、6、7、8、9、10、15、20、25、30、35、40、45、50、60、70、80、90或超过100,优选不超过10个缺失的、插入的、修饰的和/或置换的氨基酸或(ii)具有在全长上与SEQ ID NO: 54的氨基酸序列至少95%、98%、99%、99.5%、99.9%或至少99.95%,更优选至少98%和最优选至少99%相同的氨基酸序列。
在本发明的第三个方面的优选实施方案中,分离的多核苷酸编码具有根据SEQ ID NO: 55的氨基酸序列的腺病毒五邻体蛋白或其功能性衍生物,其中所述功能性衍生物(i)不包含超过1、2、3、4、5、6、7、8、9、10、15、20、25、30、35、40、45、50、60、70、80、90或超过100,优选不超过10个缺失的、插入的、修饰的和/或置换的氨基酸或(ii)具有在全长上与SEQ ID NO: 55的氨基酸序列至少85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%,更优选至少85%和最优选至少90%相同的氨基酸序列。
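In a most preferred embodiment, the polynucleotide of the invention consists of or comprises a polynucleotide that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97% or 98% identical, and most preferably at least 99% or 100% identical, over its entire length to a sequence consisting of SEQ ID NO: 63, or to a sequence consisting of SEQ ID NO: 63 but lacking any of the genomic regions E1A, E1B, E2A, E2B, E3 and/or E4 of SEQ ID NO: 63, most preferably lacking the genomic regions E1, E3 and E4 of SEQ ID NO: 63.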
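In a further aspect, the invention relates to a polynucleotide comprising the polynucleotide of the first, the second, the third, the first and second, the first and third, the second and third, or the first, second and third aspects. Preferably, a polynucleotide comprising this polynucleotide or these polynucleotides also comprises other adenoviral genes and nucleotide segments that are adjacent to the hexon, penton and/or fiber genes in the adenovirus genome (using, for example, the PanAd3 genome as set forth in SEQ ID NO: 63). Preferably, the polynucleotide also comprises the sequences required for packaging the polynucleotide into an adenoviral particle.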
在重组腺病毒中,根据本发明的第一、第二和第三个方面和根据本文公开的各个优选的实施方案的纤维、六邻体和五邻体蛋白各自单独地作用于降低所述重组腺病毒与人和/或啮齿类中和抗体的相互作用。由此,编码本发明的所述纤维、六邻体和/或五邻体蛋白的多核苷酸可用于构建增强的重组腺病毒。因此,本发明的另一个即第四个方面提供了包含至少一种、优选至少2种和最优选3种分离的多核苷酸的多核苷酸,所述分离的多核苷酸选自由根据本发明的第一个方面、本发明的第二个方面和本发明的第三个方面的多核苷酸组成的多核苷酸组。因此,最优选地,第四个方面是包含本发明的第一、第二和第三个方面的分离的多核苷酸。在优选的实施方案中,根据本发明的第四个方面的多核苷酸是选自下述的多核苷酸:
(i)包含根据本发明的第一、第二或第三个方面的一种多核苷酸的多核苷酸;
(ii)包含根据本发明的第一个方面的多核苷酸和根据本发明的第二个方面的多核苷酸的多核苷酸;
(iii)包含根据本发明的第一个方面的多核苷酸和根据本发明的第三个方面的多核苷酸的多核苷酸;
(iv)包含根据本发明的第二个方面的多核苷酸和根据本发明的第三个方面的多核苷酸的多核苷酸;
(v)包含根据本发明的第一、第二和第三个方面的多核苷酸的多核苷酸;
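wherein the polynucleotides comprised in the polynucleotides according to (i) to (v) are preferably selected from the same adenovirus isolate, e.g. all three polynucleotides encoding the fiber, hexon and penton, respectively, or functional derivatives thereof, are derived from only one of the following adenoviruses: ChAd55, ChAd73, ChAd83, ChAd146, ChAd147, PanAd1, PanAd2 or PanAd3. Furthermore, in the fourth aspect of the invention or its preferred embodiments, e.g. as outlined above, each "functional derivative" preferably does not comprise more than 10, more than 5 or more than 3 amino acid changes (i.e. deleted, inserted, modified and/or substituted amino acids).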
下表2列出了如上概述的本发明的第四个方面的多核苷酸的大量特别优选的实施方案。优选的多核苷酸选自表2所示的多核苷酸A1至AF1,其中所述多核苷酸包含根据本发明的第一、第二和第三个方面的可选项(c)的3种多核苷酸,每种多核苷酸分别编码腺病毒的纤维、六邻体和五邻体蛋白或其功能性衍生物。下表2显示了所述3种编码的蛋白质中的每一个在其全长上必须具有的与根据表2也显示的SEQ ID NO的氨基酸序列的最小序列同一性(即至少是指示的序列同一性):
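For example, the preferred polynucleotide A1 shown in Table 2 above comprises:
(i) a polynucleotide encoding a polypeptide having an amino acid sequence that is at least 85% identical over its entire length to SEQ ID NO: 14;
(ii) a polynucleotide encoding a polypeptide having an amino acid sequence that is at least 95% identical over its entire length to SEQ ID NO: 20;
(iii) a polynucleotide encoding a polypeptide having an amino acid sequence that is at least 98% identical over its entire length to SEQ ID NO: 26.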
如上所述最优选地在表2中所列的多核苷酸的所述“功能性衍生物”不包含超过10个氨基酸改变(即缺失、插入、修饰和/或置换的氨基酸)。
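Table 3 below lists further preferred embodiments of the polynucleotide of the fourth aspect of the invention. Preferred polynucleotides are selected from polynucleotides A2 to J2 of Table 3, wherein each polynucleotide comprises three polynucleotides designated "polynucleotide 1", "polynucleotide 2" and "polynucleotide 3", each of which has, over its entire length, the indicated sequence identity to the corresponding polynucleotide of the SEQ ID NO shown in Table 3: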
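Thus, by way of example, preferred embodiment A2 of Table 3 above ("A2 - ChAd55") is a polynucleotide comprising:
(i) a polynucleotide that is at least 98% identical over its entire length to SEQ ID NO: 32;
(ii) a polynucleotide that is at least 98% identical over its entire length to SEQ ID NO: 38;
(iii) a polynucleotide that is at least 98% identical over its entire length to SEQ ID NO: 44.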
下表4列出了上面概述的本发明的第四个方面的多核苷酸的大量进一步特别优选的实施方案。优选的多核苷酸选自表4所示的多核苷酸A3至H3,其中所述多核苷酸编码根据指示的SEQ ID NO的腺病毒纤维、六邻体和五邻体蛋白或其功能性衍生物,其中所有3种蛋白质和/或编码的功能性衍生物总共包含等于或少于1、2、3、4、5、6、7、8、9、10、15、20、25、30、35、40、45、50、60、70、80、90或大于100,优选不超过20个缺失的、插入的、修饰的和/或置换的氨基酸:
在本发明的第四个方面的多核苷酸的另一个实施方案中,所述多核苷酸编码根据表4所示的各个SEQ ID NO的同一毒株的腺病毒纤维和六邻体蛋白或其功能性衍生物。在本发明的第四个方面的多核苷酸的另一个实施方案中,所述多核苷酸编码根据表4所示的各个SEQ ID NO的同一毒株的腺病毒纤维和五邻体蛋白或其功能性衍生物。在本发明的第四个方面的多核苷酸的另一个实施方案中,所述多核苷酸编码根据表4所示的各个SEQ ID NO的同一毒株的腺病毒六邻体和五邻体蛋白或其功能性衍生物。在此上下文中,在每种情况下所述功能性衍生物包含少于1、2、3、4、5、6、7、8、9或少于10,最优选少于3个缺失的、插入的、修饰的和/或置换的氨基酸。
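In another preferred embodiment of the fourth aspect of the invention, the polynucleotide consists of or comprises a polynucleotide that is at least 90%, 91%, 92%, 94%, 95%, 96%, 97%, 98%, 99%, 99.9% or 100%, preferably 98%, identical over its entire length to (i) a sequence consisting of any one of SEQ ID NO: 13, 62, 63 or 65, or (ii) a sequence consisting of any one of SEQ ID NO: 13, 62, 63 or 65 and lacking one or more of the genomic regions E1A, E1B, E2A, E2B, E3 ORF1, E3 ORF2, E3 ORF3, E3 ORF4, E3 ORF5, E3 ORF6, E3 ORF7, E3 ORF8, E3 ORF9, E4 ORF7, E4 ORF6, E4 ORF5, E4 ORF4, E4 ORF3, E4 ORF2 and/or E4 ORF1. Accordingly, the one or more genomic regions mentioned above are preferably not considered in the alignment used to determine percent identity. In another preferred embodiment of the isolated polynucleotide of the invention, the polynucleotide comprises or consists of SEQ ID NO: 13, 62, 63 or 65, wherein one or more of the genomic regions E1A, E1B, E2A, E2B, E3 ORF1, E3 ORF2, E3 ORF3, E3 ORF4, E3 ORF5, E3 ORF6, E3 ORF7, E3 ORF8, E3 ORF9, E4 ORF7, E4 ORF6, E4 ORF5, E4 ORF4, E4 ORF3, E4 ORF2 and E4 ORF1 are deleted from SEQ ID NO: 13, 62, 63 or 65, respectively, or replaced by a transgene or an expression cassette encoding a heterologous protein (as described herein). In a most preferred embodiment, the adenoviral regions E1, E3 and/or E4 are deleted, as is also exemplified in Example 2.
The above preferred polynucleotides lacking one or more of the indicated genomic regions may further comprise a polynucleotide sequence encoding a heterologous protein, or an expression cassette comprising such a polynucleotide sequence. Said polynucleotide sequence and said expression cassette can be inserted, for example, into the deleted region of the polynucleotide of the invention, as is well known in the art and also described in the examples below. The heterologous protein may be a molecule for delivery into a target cell as described herein, e.g. a polynucleotide encoding an antigenic protein or a fragment thereof, preferably an antigenic protein or fragment of a pathogen, such as the HIV gag protein, a tumor antigen or a protein of herpes simplex virus, as described in the examples. Thus, in a preferred embodiment, the isolated polynucleotide according to the invention further comprises a polynucleotide encoding an antigen selected from a viral antigen, an antigen of a pathogenic bacterium and a tumor antigen. In one embodiment, the heterologous protein may thus be an antigen selected from an antigen of an RNA virus, an antigen of a pathogenic bacterium and a tumor antigen. An antigen refers to any protein or peptide capable of eliciting an immune response in a mammal. An antigen preferably comprises at least 8 amino acids and most preferably between 8 and 12 amino acids.
Accordingly, when determining sequence identity, the genomic regions E1A, E1B, E2A, E2B, E3 and/or E4 are preferably not considered in the alignment, i.e. the alignment is carried out with a sequence that consists of the entire sequence of SEQ ID NO: 13, 62, 63 or 65 but excludes the genomic regions E1A, E1B, E2A, E2B, E3, E4 and/or any polynucleotide encoding a heterologous polypeptide or any expression cassette comprising such a polynucleotide. As also stated above, it is preferred that the polynucleotides according to the fourth aspect of the invention and all its preferred embodiments encode functional hexon, penton and/or fiber capsid proteins or functional derivatives thereof, e.g. the encoded proteins have the same function as the respective capsid proteins in an infectious adenovirus virion. Thus, a recombinant adenovirus comprising said encoded recombinant penton, hexon and/or fiber proteins or functional derivatives thereof in its capsid is capable of entering a host cell. It is further preferred that the capsid proteins or functional derivatives thereof according to the invention, or encoded by the polynucleotides of the invention, have no seroprevalence in humans.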
本发明还提供了由根据本发明的分离的多核苷酸编码的分离的蛋白质,即由根据本发明的第一、第二和/或第三个方面的分离的多核苷酸编码的分离的腺病毒衣壳多肽或其功能性衍生物。在此上下文中,在一个实施方案中的“功能性衍生物”不包含超过5、10或不包括超过25个的氨基酸改变(即缺失的、插入的、修饰的和/或置换的氨基酸)。
本发明还涉及包含根据本发明的分离的多核苷酸的载体。
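Preferably, the vector does not comprise a gene in a genomic region selected from E1A, E1B, E2A, E2B, E3 and E4, and/or comprises at least one gene of a genomic region selected from E1A, E1B, E2A, E2B, E3 and E4, wherein said at least one gene comprises a deletion and/or mutation that renders it non-functional. One possibility for rendering one of these gene products non-functional is to introduce one or more artificial stop codons (e.g. TAA) into the open reading frame of these genes. Methods for rendering a virus replication-defective are well known in the art (see, e.g., Brody et al., 1994, Ann NY Acad Sci., 716: 90-101).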
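The idea of inactivating a gene by placing an artificial stop codon in its open reading frame can be sketched in a few lines. The Python example below is a hedged illustration only: the ORF is a made-up toy sequence, the function names are hypothetical, and real vector engineering would also have to consider overlapping reading frames and splice sites, which this sketch ignores.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def introduce_stop(orf, codon_index, stop="TAA"):
    """Replace the codon at 0-based codon_index with a stop codon (in frame)."""
    if len(orf) % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    start = codon_index * 3
    if not 0 <= start <= len(orf) - 3:
        raise ValueError("codon index outside the ORF")
    return orf[:start] + stop + orf[start + 3:]


def first_stop_index(orf):
    """Return the 0-based index of the first in-frame stop codon, or None."""
    for i in range(0, len(orf) - 2, 3):
        if orf[i:i + 3] in STOP_CODONS:
            return i // 3
    return None


toy_orf = "ATGGCTGCTAAATAA"            # toy ORF: ATG GCT GCT AAA TAA
mutant = introduce_stop(toy_orf, 2)     # codon 2 (GCT) becomes TAA
print(mutant, first_stop_index(toy_orf), first_stop_index(mutant))
```

The earlier the premature stop falls in the frame, the shorter the truncated product; the text itself names only TAA as an example of the codon to insert.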
在一些实施方案中,本发明的多核苷酸包含编码本发明的六邻体蛋白;五邻体蛋白;纤维蛋白;六邻体蛋白和五邻体蛋白;六邻体蛋白和纤维蛋白;五邻体蛋白和纤维蛋白;或六邻体蛋白、五邻体蛋白和纤维蛋白的多核苷酸和还包含另外的腺病毒多核苷酸。因此,在一个优选的实施方案中,根据本发明的分离的多核苷酸包含以下至少一项:
(a)腺病毒5'-反向末端重复(ITR);
(b)腺病毒E1a区,或其选自13S、12S和9S区的片段;
(c)腺病毒E1b区,或其选自小T、大T和IX区的片段;
(d)腺病毒E2b区,或其选自小pTP、聚合酶和IVa2区的片段;
(e)腺病毒L1区,或其片段,所述片段编码选自28.1 kD蛋白、聚合酶、agnoprotein、52/55 kDa蛋白和IIIa蛋白的腺病毒蛋白质;
(f)腺病毒L2区或包含编码本发明的五邻体蛋白的多核苷酸的L2区,或其片段,所述片段编码选自五邻体蛋白或本发明的五邻体蛋白、VII、V和Mu蛋白的腺病毒蛋白质;
(g)腺病毒L3区或包含编码本发明的六邻体蛋白的多核苷酸的L3区,或其片段,所述片段编码选自VI蛋白、六邻体蛋白或本发明的六邻体蛋白和内切蛋白酶的腺病毒蛋白质;
(h)腺病毒E2a区;
(i)腺病毒L4区,或其片段,所述片段编码选自100 kD蛋白、33 kD同源物和蛋白质VIII的腺病毒蛋白质;
(j)腺病毒E3区,或其选自E3 ORF1、E3 ORF2、E3 ORF3、E3 ORF4、E3 ORF5、E3 ORF6、E3 ORF7、E3 ORF8和E3 ORF9的片段;
(k)腺病毒L5区或包含编码本发明的纤维蛋白的多核苷酸的L5区,或其片段,所述片段编码纤维蛋白或本发明的纤维蛋白;
(l)腺病毒E4区,或其选自E4 ORF7、E4 ORF6、E4 ORF5、E4 ORF4、E4 ORF3、E4 ORF2和E4 ORF1的片段;特别是所述E4区的ORF6;
和/或
(m)腺病毒3'-ITR。
在上述多核苷酸的一些实施方案中可能也如上所述地希望,优选地,所述多核苷酸不包含如上概述的基因组区域的ORF(例如如在实施例2中定义的区域E3和/或E4)和/或包含下述腺病毒基因,其包含使至少一个基因无功能的缺失和/或突变。在这些优选的实施方案中,合适的腺病毒区将被修饰以不包含上述基因或使选定的基因无功能。任意腺病毒基因缺失将为插入转基因例如如本文描述的小基因盒制造空间。此外,基因缺失可用于产生下述腺病毒载体,其在不使用包装细胞系或辅助病毒的情况下不能复制,这是本领域熟知的。因此,包含如上所述的包含一个或多个特定的基因/区域缺失或丧失功能的突变的多核苷酸的最终重组腺病毒可为例如基因治疗或接种提供更安全的重组腺病毒。
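In a particularly preferred embodiment, the polynucleotide of the invention comprises at least one of the following:
(a) the 5'-inverted terminal repeat (ITR) region of any one of SEQ ID NO: 13, 62, 63 or 65;
(b) the adenoviral E1a region of any one of SEQ ID NO: 13, 62, 63 or 65, or a fragment thereof selected from the 13S, 12S and 9S regions;
(c) the adenoviral E1b region of any one of SEQ ID NO: 13, 62, 63 or 65, or a fragment thereof selected from the small T, large T and IX regions;
(d) the adenoviral E2b region of any one of SEQ ID NO: 13, 62, 63 or 65, or a fragment thereof selected from the small pTP, polymerase and IVa2 regions;
(e) the adenoviral L1 region of any one of SEQ ID NO: 13, 62, 63 or 65, or a fragment thereof encoding an adenoviral protein selected from the 28.1 kD protein, polymerase, agnoprotein, 52/55 kDa protein and IIIa protein;
(f) the adenoviral L2 region of any one of SEQ ID NO: 13, 62, 63 or 65, or a fragment thereof encoding an adenoviral protein selected from the penton protein having the amino acid sequence of SEQ ID NO: 31, 52 or 55, the VII, V and Mu proteins;
(g) the adenoviral L3 region of any one of SEQ ID NO: 13, 62, 63 or 65, or a fragment thereof encoding an adenoviral protein selected from the VI protein, the hexon protein having the amino acid sequence of SEQ ID NO: 25, 51 or 54, and the endoprotease;
(h) the adenoviral E2a region of any one of SEQ ID NO: 13, 62, 63 or 65;
(i) the adenoviral L4 region of any one of SEQ ID NO: 13, 62, 63 or 65, or a fragment thereof encoding an adenoviral protein selected from the 100 kD protein, the 33 kD homolog and protein VIII;
(j) the adenoviral E3 region of any one of SEQ ID NO: 13, 62, 63 or 65, or a fragment thereof selected from E3 ORF1, E3 ORF2, E3 ORF3, E3 ORF4, E3 ORF5, E3 ORF6, E3 ORF7, E3 ORF8 and E3 ORF9;
(k) the adenoviral L5 region of any one of SEQ ID NO: 13, 62, 63 or 65, or a fragment thereof encoding the fiber protein having the amino acid sequence of SEQ ID NO: 19, 50 or 53;
(l) the adenoviral E4 region of any one of SEQ ID NO: 13, 62, 63 or 65, or a fragment thereof selected from E4 ORF7, E4 ORF6, E4 ORF5, E4 ORF4, E4 ORF3, E4 ORF2 and E4 ORF1; or ORF6 of the Ad5 E4 region (SEQ ID NO: 64);
and
(m) the 3'-ITR of any one of SEQ ID NO: 13, 62, 63 or 65.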
在一个实施方案中,本发明的分离的多核苷酸还编码一个或多个优选所有以下腺病毒蛋白质:蛋白质VI、蛋白质VIII、蛋白质IX、蛋白质IIIa和蛋白质IVa2。优选地这些蛋白质由本文公开的PanAd1、PanAd2和PanAd3基因组序列的各自的开放阅读框编码。重组腺病毒领域的一般技术人员熟知如何测定编码上述特定腺病毒蛋白质的开放阅读框。他也知道腺病毒基因组的结构并可以不需过多劳动将本文概述的各个腺病毒区域和ORF定位至例如本发明的新腺病毒基因组PanAd1、PanAd2和PanAd3中的任一个。
为了表达多核苷酸,优选编码本发明的一个或多个腺病毒蛋白质的cDNA,可将所述多核苷酸亚克隆至表达载体,所述表达载体包含引导转录的强启动子,转录/翻译终止子和用于翻译起始的核糖体结合位点。合适的细菌启动子是本领域所熟知的,例如大肠杆菌、芽孢杆菌属物种和沙门氏菌,并且用于这些表达系统的试剂盒是可以商业获得的。类似的用于哺乳动物细胞、酵母和昆虫细胞的真核表达系统是本领域所熟知的并也可以商业获得。
除了启动子以外,表达载体通常包含转录单位或表达盒,所述转录单位或表达盒包含在宿主细胞中表达腺病毒蛋白质编码核酸所需的所有其他元件。因此一般的表达盒包含与编码腺病毒蛋白质/多肽的核酸序列可操作连接的启动子和转录物有效多聚腺苷酸化所需的信号,核糖体结合位点和翻译终止子。表达盒的其他元件可包括例如增强子。表达盒应也包含结构基因下游的转录终止区以提供有效终止。终止区可从与启动子序列相同的基因中得到,或可从不同的基因得到。
用于运送遗传信息至细胞的具体表达载体不是特别关键。可使用用于在真核或原核细胞中表达的任意传统载体。标准细菌表达载体包括质粒例如基于pBR322的质粒、pSKF、pET23D和融合表达系统例如GST和LacZ,但还有许多本领域技术人员已知的可被有效利用的载体。
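Expression vectors containing regulatory elements from eukaryotic viruses are typically used as eukaryotic expression vectors, e.g. SV40 vectors, papilloma virus vectors and vectors derived from Epstein-Barr virus. Other exemplary eukaryotic vectors include pMSG, pAV009/A+, pMTO10/A+, pMAMneo-5, baculovirus pDSVE, pcDNA3.1, pIRES and any other vector allowing expression of proteins under the direction of, for example, the HCMV immediate-early promoter, the SV40 early promoter, the SV40 late promoter, the metallothionein promoter, the murine mammary tumor virus promoter, the Rous sarcoma virus promoter, the polyhedrin promoter, or other promoters shown to be effective for expression in eukaryotic cells.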
一些表达系统具有提供基因扩增的标记例如胸苷激酶、潮霉素B磷酸转移酶和二氢叶酸还原酶。可选地,不涉及基因扩增的高产表达系统也是合适的。
在表达载体中也可包括的元件包括在大肠杆菌中有功能的复制子,编码药物抗性以允许选择含有重组质粒的细菌的基因,和在质粒的非必需区的唯一限制性位点以允许插入真核序列。选择的具体药物抗性基因不是关键的——本领域已知的许多药物抗性基因中的任一种都是合适的。如果需要,任选地选择原核序列使其不干扰DNA在真核细胞中的复制。
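Standard transfection methods can be used to produce bacterial, mammalian, yeast or insect cell lines. Any of the well-known procedures for introducing foreign polynucleotide sequences into host cells may be used. Examples include commercially available liposome-based transfection kits such as Lipofectamine™ (Invitrogen), commercially available lipid-based transfection kits such as Fugene (Roche Diagnostics), polyethylene glycol-based transfection, calcium phosphate precipitation, gene gun (biolistic), electroporation or viral infection, as well as any other well-known method for introducing cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell. It is only necessary that the particular genetic engineering procedure used is capable of successfully introducing at least one gene into a host cell capable of expressing the receptor.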
可使用标准技术任选地纯化表达的腺病毒蛋白质。例如,可在细胞进行沉淀和层析步骤前机械地或通过渗压震扰裂解细胞,重组蛋白质的性质和序列将取决于待回收的特定重组材料。可选地,重组蛋白质可以被分泌并从培养重组细胞的培养基中回收,如蛋白质表达领域已知的。
在一个优选的实施方案中,本发明的载体是质粒载体,例如表达载体。根据本发明的质粒也可用于产生重组腺病毒。
因此,本发明的另一个方面是重组腺病毒,优选不能复制的腺病毒,其包含根据本发明的分离的多核苷酸和/或根据本发明的至少一种分离的腺病毒衣壳多肽。优选地本发明的重组腺病毒包含本发明的六邻体、纤维和五邻体蛋白,例如如在上表2中概述的组合。在优选的实施方案中,重组腺病毒的特征在于其能够感染人细胞——优选在所述腺病毒在人血清中孵育1小时后能够感染人细胞,所述人血清来源于以前从未接触黑猩猩腺病毒的人。
由于提供了本发明的新六邻体、五邻体和纤维蛋白的序列信息,可以获得所述重组腺病毒,例如通过构建由通常的腺病毒蛋白质组成但具有包含至少一种根据本发明的分离的腺病毒衣壳多肽或其功能性衍生物的衣壳的重组腺病毒。在这点上优选重组腺病毒包含含有编码本发明的五邻体蛋白的多核苷酸序列的L2区,含有编码本发明的六邻体蛋白的多核苷酸序列的L3区和/或含有编码本发明的纤维蛋白的多核苷酸序列的L5区。最优选地所述重组腺病毒包含分别编码本发明的五邻体、六邻体和纤维蛋白的L2区、L3区和L5区。
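Methods for constructing recombinant adenoviruses are well known in the art. Useful techniques for preparing recombinant adenoviruses are reviewed, for example, in Graham & Prevec, 1991, In Methods in Molecular Biology: Gene Transfer and Expression Protocols (Ed. Murray, E.J.), p. 109; and Hitt et al., 1997, "Human Adenovirus Vectors for Gene Transfer into Mammalian Cells", Advances in Pharmacology 40: 137-206. Further methods are described in WO 2006/086284. To prepare replication-deficient adenoviruses, one or more of the E1A, E1B, E2A, E2B, E3 and E4 gene products can be expressed in a complementing cell line, which can be used to propagate and rescue recombinant adenoviruses that are replication-incompetent because they lack, e.g., one of the aforementioned gene products. The use of such cell lines is also described in the references cited above.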
在一个实施方案中,使用本发明的多核苷酸(或包含本文描述的本发明的所述多核苷酸的载体)产生重组腺病毒颗粒。重组腺病毒优选地如上述在一个或多个腺病毒区域例如E1a或E1b区功能性缺失,和任选地具有其他突变,例如温度敏感突变或在其他腺病毒基因中的缺失。在其他实施方案中,希望在重组腺病毒中保留完整的E1a和/或E1b区。这样的完整E1区可位于其在腺病毒基因组中的天然位置或被置于天然腺病毒基因组中的缺失位点(例如在E3区)。
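When constructing adenoviral vectors for delivery of a gene to a host cell, e.g. a human (or other mammalian) cell, a range of adenoviral nucleic acid sequences can be employed in the vectors of the invention. For example, all or part of the adenoviral delayed early gene E3 may be eliminated from the adenoviral sequence that forms part of the recombinant virus. The function of simian E3 is believed to be irrelevant to the function and production of the recombinant virus particle. In some embodiments, adenoviral vectors may also be constructed having a deletion of at least the ORF6 region of the E4 gene and, more desirably because of the redundancy in the function of this region, of the entire E4 region. Still another vector of the invention contains a deletion in the delayed early gene E2a. Deletions may also be made in any of the late genes L1 to L5 of the simian adenovirus genome. Similarly, deletions in the intermediate genes IX and IVa2 may be useful for some purposes. Other deletions may be made in other structural or non-structural adenoviral genes. The deletions discussed above may be used individually, i.e. an adenoviral sequence for use in the invention may contain deletions in only a single region. Alternatively, deletions of entire genes, or of portions thereof effective to destroy their biological activity, may be used in any combination. For example, in one exemplary vector according to the invention, the adenoviral sequence may have deletions of the E1 and E4 regions, or of the E1, E2a and E3 regions, or of the E1 and E3 regions, or of the E1, E2a and E4 regions, with or without deletion of E3, and so on. As discussed above, such deletions may be used in combination with other adenoviral gene mutations, such as temperature-sensitive mutations, to achieve a desired result.
An adenoviral vector lacking any essential adenoviral sequence (e.g. a region selected from E1a, E1b, E2a, E2b, E4 ORF6, L1 or L4) may be cultured in the presence of the missing adenoviral gene products that are required for viral infectivity and propagation of an adenoviral particle. These helper functions may be provided by culturing the adenoviral vector in the presence of one or more helper constructs (e.g. a plasmid or virus) or a packaging host cell (a complementing cell line as described above). See, for example, the examples included herein and the techniques for the preparation of a "minimal" human adenoviral vector described in International Patent Application WO96/13597, published on May 9, 1996 and incorporated herein by reference.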
有用的辅助病毒包含补充在本发明的腺病毒载体的优选实施方案中缺失的和/或所述载体转染的包装细胞系不表达的各个基因的选定的腺病毒基因序列。在一个实施方案中,辅助病毒是复制缺陷的并包含除了上述序列以外的多种腺病毒基因。
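The helper virus may also be formed into poly-cation conjugates as described in Wu et al., J. Biol. Chem., 264: 16985-16987 (1989); K. J. Fisher and J. M. Wilson, Biochem. J., 299: 49 (April 1, 1994). The helper virus may optionally contain a second reporter minigene. A number of such reporter genes are known in the art. The presence of a reporter gene on the helper virus that differs from the transgene on the adenoviral vector allows both the adenoviral vector and the helper virus to be monitored independently. This second reporter can be used to facilitate separation of the resulting recombinant virus from the helper virus upon purification.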
为了产生在本文的优选实施方案中描述的缺失任意基因的重组腺病毒(Ad),如果缺失的基因对病毒的复制和感染性是必需的,优选地通过辅助病毒或细胞系,即补充或包装细胞系提供缺失的基因的功能。在许多情况下,表达人E1的细胞系可用于反式补充用于产生重组腺病毒的载体。这是特别有利地,因为由于在本发明的多核苷酸序列和在目前可利用的包装细胞中发现的人腺病毒E1序列之间的差异,使用目前包含人E1的细胞在复制和生产过程中将避免产生能够复制的腺病毒。然而,在某些情况下,希望使用表达E1基因产物的细胞系以产生E1缺失的重组腺病毒。
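If desired, the sequences provided herein may be used to generate a packaging cell or cell line that expresses, at a minimum, the adenoviral E1 gene from a ChAd55, ChAd73, ChAd83, ChAd146, ChAd147, PanAd1, PanAd2 or PanAd3 adenovirus under the transcriptional control of a promoter for expression in a selected parent cell line, such as a HeLa cell. Inducible or constitutive promoters may be employed for this purpose. Examples of promoters are provided, e.g., in the examples described herein. Such E1-expressing cell lines are useful in the generation of recombinant adenovirus E1-deleted vectors. Additionally, or alternatively, the invention provides cell lines that express one or more adenoviral gene products, e.g. E1a, E1b, E2a and/or E4 ORF6, preferably Ad5 E4 ORF6 (see also the examples below), which can be constructed using essentially the same procedures as are used in the generation of recombinant adenoviral vectors. Such cell lines can be used to trans-complement adenoviral vectors deleted in the essential genes encoding those products, or to provide the helper functions necessary for packaging a helper-dependent virus (e.g. adeno-associated virus).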
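Generally, when a vector of the invention comprising, e.g., a minigene is delivered by transfection, the vector is delivered in an amount of from about 0.1 μg to about 100 μg DNA, and preferably about 10 to about 50 μg DNA, to about 1 × 10^4 cells to about 1 × 10^3 cells, and preferably about 10^5 cells. However, the relative amounts of vector DNA to host cells may be adjusted, taking into consideration such factors as the selected vector, the delivery method and the host cells selected. The vector may be introduced into the host cell by any means known in the art or disclosed herein, including transfection and infection, e.g. using CaPO4 transfection or electroporation.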
为了构建和组装期望的包含小基因的重组腺病毒,在一个实施例中如本领域熟知的可在辅助病毒的存在下将载体体外转染至包装细胞系,从而允许辅助病毒和载体序列之间发生同源重组,使载体中的腺病毒转基因序列能够复制和包装进病毒粒子衣壳,得到重组腺病毒载体颗粒。本发明的重组腺病毒可用于例如将选定的转基因转移至选定的宿主细胞。
在本发明的腺病毒的优选实施方案中,腺病毒在人受试者中具有低于5%的血清阳性率和优选地在人受试者中没有血清阳性率,最优选在以前没有接触黑猩猩腺病毒的人受试者中没有血清阳性率。在此上下文中优选人受试者属于选自欧洲、非洲土著人、亚洲、美洲土著人和大洋洲土著人的种族群。鉴定人受试者的种族起源的方法是本领域包括的(见例如WO2003/102236)。
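In another preferred embodiment of the recombinant adenovirus according to the invention, the adenoviral DNA is capable of entering a mammalian target cell, i.e. it is infectious. An infectious recombinant adenovirus of the invention can be used as a vaccine and for gene therapy, as also described below. Thus, in another embodiment, it is preferred that the recombinant adenovirus comprises a molecule for delivery into a target cell. Preferably, the target cell is a mammalian cell, e.g. a chimpanzee cell, a rodent cell or a human cell. For example, the molecule for delivery into a target cell can be an expression cassette as defined herein. Methods for introducing an expression cassette into the genome of an adenovirus are well known in the art (see, for example, the literature citations provided above). In one example, a recombinant adenovirus of the invention comprising an expression cassette encoding, e.g., a minigene or an antisense gene can be generated by replacing a genomic region of the adenovirus selected from E1A, E1B, E2A, E2B, E3 and E4 with said expression cassette. The genomic regions E1A, E1B, E2A, E2B, E3 and E4 of the adenoviruses of the invention can easily be identified by alignment with known and annotated adenoviral genomes, such as that of human Ad5 (see: Birgitt Täuber and Thomas Dobner, Oncogene (2001) 20, p. 7847-7854; and see also: Andrew J. Davison et al., "Genetic content and evolution of adenoviruses", Journal of General Virology (2003), 84, p. 2895-2908). Non-limiting examples of how to generate modified adenoviruses comprising a molecule for delivery into a target cell are also provided in Examples 1 and 2 and Figure 4 below.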
用于递送至靶细胞的分子优选地为多核苷酸,但也可为优选具有治疗或诊断活性的多肽或小化学化合物。在一个特别优选的实施方案中,用于递送至靶细胞的分子是包含腺病毒5’反向末端重复序列(ITR),基因例如SEQ ID NO: 1和3’ITR的多核苷酸。对技术人员而言显而易见地,必须选择分子的分子大小使当重组腺病毒在例如包装细胞系中产生时衣壳可在分子周围形成并包装分子。因此,优选地基因是小基因,其可具有例如多达7000和最多可达8000个碱基对。
在优选的实施方案中,在根据本发明的重组腺病毒中包含的用于递送至靶细胞的分子是编码抗原蛋白质或其片段的多核苷酸。抗原蛋白质或其片段能够在哺乳动物中引起免疫应答,在一个特别优选的实施方案中可为如实施例中所示的HIV的gag蛋白并由根据SEQ ID NO: 1的多核苷酸编码。
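In a particularly preferred embodiment, the recombinant adenovirus of the invention is an adenovirus that has been deposited at the ECACC (European Collection of Cell Culture, Porton Down, Salisbury, SP4 OJG, UK) and has a deposit number selected from 08110601 (ChAd83), 08110602 (ChAd73), 08110603 (ChAd55), 08110604 (ChAd147) and 08110605 (ChAd146). The deposits of the aforementioned adenoviral strains (Latin name: Mastadenovirus, Adenoviridae) were made on November 6, 2008 by Okairos AG, Elisabethenstr. 3, 4051 Basel, Switzerland.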
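These deposits will be maintained under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure. The deposits were made merely as a convenience for those of skill in the art and are not an admission that a deposit is required under 35 U.S.C. 112. All restrictions on the availability of the deposited material to the public will be irrevocably removed upon the granting of a patent, except for the requirements specified in 37 C.F.R. 1.808(b).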
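Another preferred embodiment of the recombinant adenovirus of the invention is an adenovirus derived from an adenovirus selected from 08110601 (ChAd83), 08110602 (ChAd73), 08110603 (ChAd55), 08110604 (ChAd147) and 08110605 (ChAd146). Preferably, the adenovirus derived from one of the aforementioned deposited adenoviruses has been altered by introducing a functional deletion, deletion or modification into its genome, e.g. to obtain a replication-incompetent adenovirus and/or an adenovirus capable of expressing a transgene in a host cell. For example, one or more genes selected from the E1A, E1B, E2A, E2B, E3 and E4 genes can be deleted, rendered non-functional and/or replaced by an expression cassette as outlined above. Additionally, one or more genes of another adenovirus may be introduced, preferably to replace a deleted gene. The skilled person is well aware of how to introduce these genomic alterations into the deposited strains. In this respect, methods of generating modified adenoviruses comprising a molecule for delivery into a target cell, which is a preferred modification of the deposited strains, have been described above.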
在另一个方面提供了组合物,其包含免疫佐剂和以下(i)至(iv)中至少一种:
(i)根据本发明的分离的蛋白质;
(ii)根据本发明的分离的多核苷酸;
(iii)根据本发明的载体;
(iv)根据本发明的重组腺病毒;
和任选地药学上可接受的赋形剂。
根据本发明的包含佐剂的组合物可用作疫苗,例如用于人受试者。本文中也被简称为“佐剂”的免疫佐剂与单独施用抗原相比加快、延长和/或增强对抗原/免疫原的免疫应答的质量和/或强度,因此减少在任意给定疫苗中必需的抗原/免疫原的量和/或对目的抗原/免疫原产生足够免疫应答所必需的注射频率。
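Examples of adjuvants that may be used in the context of the composition according to the invention are gel-like precipitates of aluminum hydroxide (alum); AlPO4; alhydrogel; bacterial products from the outer membrane of Gram-negative bacteria, in particular monophosphoryl lipid A (MPLA), lipopolysaccharides (LPS), muramyl dipeptides and derivatives thereof; Freund's incomplete adjuvant; liposomes, in particular neutral liposomes, liposomes containing the composition and optionally cytokines; non-ionic block copolymers; ISCOMATRIX adjuvant (Drane et al., 2007); unmethylated DNA comprising CpG dinucleotides (CpG motif), in particular CpG ODN with a phosphorothioate (PTO) backbone (CpG PTO ODN) or a phosphodiester (PO) backbone (CpG PO ODN); synthetic lipopeptide derivatives, in particular Pam3Cys; lipoarabinomannan; peptidoglycan; zymosan; heat shock proteins (HSP), in particular HSP70; dsRNA and synthetic derivatives thereof, in particular Poly I:poly C; polycationic peptides, in particular poly-L-arginine; taxol; fibronectin; flagellin; imidazoquinoline; cytokines with adjuvant activity, in particular GM-CSF, interleukin-(IL-)2, IL-6, IL-7, IL-18, type I and II interferons, in particular interferon-γ, TNF-α; 25-dihydroxyvitamin D3 (calcitriol); and synthetic oligopeptides, in particular MHCII-presented peptides. Non-ionic block copolymers containing polyoxyethylene (POE) and polyoxypropylene (POP), such as POE-POP-POE block copolymers, may be used as an adjuvant (Newman et al., 1998). This type of adjuvant is particularly useful for compositions comprising nucleic acids as the active ingredient.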
任选地,可使用各种药学上可接受的赋形剂。优选的药学上可接受的赋形剂在下面讨论根据本发明的用途时描述。
特定受体的活化可刺激免疫应答。这些受体是熟练技术人员已知的并包括例如细胞因子受体,特别是I型细胞因子受体,II型细胞因子受体,TNF受体;和担当转录因子的维生素D受体;和Toll样受体1(TLR1)、TLR-2、TLR 3、TLR4、TLR5、TLR-6、TLR7和TLR9。这些受体的激动剂具有佐剂活性,即为免疫刺激性的。在优选的实施方案中,本发明的组合物的佐剂可为一种或多种Toll样受体激动剂。在更优选的实施方案中,佐剂是Toll样受体4激动剂。在特别优选的实施方案中,佐剂是Toll样受体9激动剂,其优选地由核苷酸tccatgacgttcctgacgtt (SEQ ID NO: 2)编码。
在另一个方面,本发明提供了细胞,优选非猿细胞,其包含以下至少一种:
(i)根据本发明的分离的蛋白质;
(ii)根据本发明的分离的多核苷酸;
(iii)根据本发明的载体;
(iv)根据本发明的重组腺病毒。
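The cell may be selected from a bacterial cell such as an E. coli cell, a yeast cell such as Saccharomyces cerevisiae or Pichia pastoris, a plant cell, an insect cell such as SF9 or Hi5 cells, or a mammalian cell. Preferred examples of mammalian cells are Chinese hamster ovary (CHO) cells, human embryonic kidney (HEK 293) cells, HeLa cells, human hepatoma cells (e.g. Huh7.5), Hep G2 human hepatoma cells, Hep 3B human hepatoma cells and the like.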
如果细胞包含根据(ii)的分离的多核苷酸,细胞中存在的此多核苷酸可(i)本身自由地分散,或(ii)整合至宿主细胞基因组或线粒体DNA。
在另一个优选的实施方案中,细胞是表达至少一种选自E1a、E1b、E2a、E2b、E4、L1、L2、L3、L4和L5的腺病毒基因的宿主细胞,优选293细胞或PER.C6TM细胞。
也提供了根据本发明的分离的多核苷酸、根据本发明的分离的蛋白质、根据本发明的载体、根据本发明的重组腺病毒和/或根据本发明的药物组合物用于疾病的治疗或预防的用途。
已证明了腺病毒载体作为疫苗载体的巨大潜力。临床前和临床研究已证明使用此系统在载体设计、稳健的抗原表达和保护性免疫方面的可行性。因此,优选的实施方案是根据本发明的用途,其中治疗或预防是例如对人受试者的接种。本领域包含的大量文献提供了如何使用腺病毒和将其制备用于接种的详细说明,这是技术人员已知的。
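If the use is a vaccination, a recombinant adenovirus of the invention can be administered in an immunologically and/or prophylactically effective dose, which is preferably 1 × 10^8 to 1 × 10^11 viral particles (i.e. 1 × 10^8, 5 × 10^8, 1 × 10^9, 5 × 10^9, 1 × 10^10, 2.5 × 10^10 or 5 × 10^10 particles). Furthermore, for a vaccination that requires a boost, it is preferred to apply a "heterologous prime-boost" method as defined above. In addition, when an isolated polynucleotide according to the invention, an isolated protein according to the invention, a vector according to the invention, a recombinant adenovirus according to the invention and/or a pharmaceutical composition according to the invention is used in a vaccine, it is preferred that the vaccine comprises an adjuvant. Preferred immunological adjuvants have been described herein and can be used in such a vaccine.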
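Recombinant adenoviruses prepared using a polynucleotide or a recombinant adenoviral protein or fragment thereof according to the invention can be used to transduce a host cell with a polynucleotide, e.g. DNA. Thus, an adenovirus that is infectious, i.e. capable of entering a host cell, but preferably replication-deficient can be prepared to express any custom protein or polypeptide in a host cell. Thus, in a preferred embodiment, the therapy recited in the use according to the invention is gene therapy. If an isolated polynucleotide, isolated protein, vector, recombinant adenovirus and/or pharmaceutical composition according to the invention is used for gene therapy and administered to a subject to be treated, it is preferably administered in a dose large enough that the treatment results in one or more cells of the patient being transfected, i.e. transduced. If a recombinant adenovirus and/or pharmaceutical composition according to the invention is administered by any of the preferred routes of administration disclosed herein, it is preferred that an effective dose of 1 × 10^8 to 5 × 10^11 viral particles (i.e. 1 × 10^8, 5 × 10^8, 1 × 10^9, 5 × 10^9, 1 × 10^10, 2.5 × 10^10, 5 × 10^10, 1 × 10^11 or, most preferably, 5 × 10^11 particles) is administered. In a preferred embodiment, the heterologous polynucleotide comprised in the recombinant adenovirus of the invention is capable of expressing a protein or polypeptide in a host cell of the subject, wherein the protein or polypeptide comprises a signal peptide that effects secretion of the protein or polypeptide from said host cell. For example, a patient in need of a certain protein can be treated using an adenovirus of the invention comprising a cDNA encoding a secretable form of that protein.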
在本发明的用途的另一个实施方案中,将根据本发明的分离的多核苷酸、分离的蛋白质、载体、腺病毒和/或药物组合物(以下称为根据本发明的药物)配制为进一步包含一种或多种药学上可接受的稀释剂;载体;赋形剂,包括填充料、粘结剂、润滑剂、助流剂、崩解剂和吸附剂;和/或防腐剂。
可通过各种熟知的途径施用本发明的药物,包括口服、直肠、胃肠内和胃肠外施用,例如静脉内、肌肉内、鼻内、皮内、皮下和类似的施用途径。优选胃肠外、肌肉内和静脉内施用。优选地将根据本发明的药物配制为糖浆、输注或注射溶液、药片、胶囊、囊片(capslet)、锭剂、脂质体、栓剂、膏药、创可贴、延迟胶囊、粉末或缓释制剂。优选地稀释剂为水、缓冲液、缓冲的盐溶液或盐溶液和载体优选地选自可可油和vitebesole。
用于在本发明的用途中施用根据本发明的药物的特别优选的药物形式是适于可注射的用途的形式并包含用于临时制备无菌可注射溶液或分散剂的无菌水溶液或分散剂和无菌粉末。通常,这样的溶液或分散剂将包含溶剂或分散介质,其包括例如水缓冲的水溶液,例如生物可相容的缓冲液,乙醇,多元醇,例如甘油,丙二醇,聚乙二醇,其合适的混合物,表面活性剂或植物油。
可通过本领域已知的许多技术制备输注或注射溶液,包括但不限于加入防腐剂例如抗细菌或抗真菌剂,例如对羟苯甲酸酯(parabene)、氯丁醇、酚、山梨酸或硫柳汞(thimersal)。此外,可在输注或注射溶液中加入等渗剂,例如糖或盐,特别是氯化钠。
本发明的优选稀释剂是水、生理上可接受的缓冲液、生理上可接受的缓冲盐溶液或盐溶液。优选的载体是可可油和vitebesole。与根据本发明的药物的多种药物形式可一起使用的赋形剂可选自以下非限制性列表:
a)粘结剂例如乳糖、甘露醇、结晶山梨醇、二元磷酸盐、磷酸钙、糖、微晶纤维素、羧甲基纤维素、羟乙基纤维素、聚乙烯吡咯烷酮等等;
b)润滑剂例如硬脂酸镁、滑石、硬脂酸钙、硬脂酸锌、硬脂酸、氢化植物油、亮氨酸、甘油酯和硬脂酰延胡索酸钠,
c)崩解剂例如淀粉、croscaramellose、甲基纤维素钠、琼脂、膨润土、藻酸、羧甲基纤维素、聚乙烯吡咯烷酮等等。
其他合适的赋形剂可见American Pharmaceutical Association出版的Handbook of Pharmaceutical Excipients,其在本文引入作为参考。
优选特定量的根据本发明的药物用于疾病的治疗或预防。但是应当理解取决于疾病的严重程度、疾病的类型以及待治疗的各个患者,例如患者的一般健康状态等等,需要不同剂量的根据本发明的药物引起治疗或预防作用。合适剂量的确定是主治医生的判断范围内的。
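If the medicament according to the invention is used for prophylaxis, it can be formulated as a vaccine. In this case, the medicament is preferably administered at the preferred and particularly preferred doses outlined above. Preferably, the vaccine is administered repeatedly, at least 2, 3, 4, 5, 6, 7, 8, 9 or at least 10 times, over a defined period until the vaccinated subject has generated sufficient antibodies against the medicament of the invention to reduce the risk of developing the respective disease. The period is usually variable, depending on the antigenicity of the vaccine. Preferably, the period is no more than 4 weeks, 3 months, 6 months or 3 years. In one embodiment, if an adenovirus according to the invention is used for vaccination purposes, at least one hypervariable domain of the hexon protein can be replaced by an immunogenic epitope of the respective disease agent against which the vaccination is directed. Vaccines typically contain one or more adjuvants as outlined above. A detailed summary of the use of adenoviruses for vaccination and methods pertaining thereto is provided in Bangari DS and Mittal SK (2006) Vaccine, 24(7), p. 849-862; see also: Zhou D, et al., Expert Opin Biol Ther. 2006 Jan;6(1):63-72; Folgori A, et al., Nat Med. 2006 Feb;12(2):190-7; and Draper SJ, et al., Nat Med. 2008 Aug;14(8):819-21, Epub 2008 Jul 27.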
在不背离本发明的范围下本发明的多种修改和变化对本领域技术人员而言将是显而易见的。尽管已结合特定优选的实施方案描述了本发明,应当理解要求保护的本发明不应当不适当地受限于这些特定实施方案。实际上,本发明旨在覆盖对相关领域技术人员而言显而易见的实施本发明的描述方式的多种修改。
下图仅为了说明本发明而不应被解释为限制本发明的范围,在任何方面,本发明的范围由附加的权利要求指出。
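Brief description of the figures
Figure 1: Multiple sequence alignment between the hexon proteins of several adenovirus isolates of the invention, using Clustal-W with default settings. The hexon proteins of the novel chimpanzee adenovirus isolates (designated PanAd1, PanAd2, PanAd3, ChAd55, ChAd73, ChAd83, ChAd146 and ChAd147) are shown. Hypervariable domains 1 to 7 are designated "HVR 1-6" and "HVR 7", respectively.
Figure 2: Multiple sequence alignment between the fiber proteins of adenovirus ChAd55 and the other novel chimpanzee adenovirus isolates (designated PanAd1, PanAd2, PanAd3, ChAd73, ChAd83, ChAd146 and ChAd147), using Clustal-W with default settings.
Figure 3: Multiple sequence alignment between the penton proteins of adenovirus ChAd55 and the other novel chimpanzee adenovirus isolates (designated PanAd1, PanAd2, PanAd3, ChAd73, ChAd83, ChAd146 and ChAd147), using Clustal-W with default settings.
Figure 4: Schematic of the construction of replication-defective adenoviral vectors by homologous recombination between the wild-type viral genome and the corresponding shuttle plasmid. See also Example 2.
Figure 5: Cell-mediated immune response in mice vaccinated with recombinant adenoviruses comprising an expression cassette for the expression of HIV gag protein (SEQ ID NO: 1). The vaccination potency was compared for recombinant human Ad5 and chimpanzee ChAd55 (Figure 5A), recombinant human Ad5 and the bonobo adenoviruses PanAd1, PanAd2 and PanAd3 (Figure 5B), and recombinant ChAd55, ChAd73, ChAd83, ChAd146 and ChAd147 (Figure 5C). The immune response was measured by interferon-γ ELISpot assay after incubating cells with the CD8 HIV gag epitope mapped in Balb/C mice. Results are reported as spot-forming cells per 10^6 splenocytes.
Figure 6: The seroprevalence of the novel adenoviral vectors was evaluated in a panel of human sera of European origin. The seroprevalence of human adenovirus type 5 (Ad5) and of the chimpanzee adenoviruses ChAd55, ChAd73, ChAd83, ChAd146, ChAd147, PanAd1, PanAd2, PanAd3 and CV-68 was evaluated in parallel on the same panel. Data are expressed as the % of subjects showing seropositivity. Neutralizing antibodies were detected only against Ad5 and CV-68, but not against any of the novel adenoviruses of the invention.
Figure 7: PanAd HSV immunization of BALB/c mice is shown in Figure 7A and PanAd cancer Ag immunization of BALB/c mice is shown in Figure 7B.
Figure 8: PanAd HIV gag immunization of cynomolgus macaques (Macaca fascicularis) in a prime/boost vaccination experiment.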
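Examples
Example 1: Isolation and characterization of adenoviruses
ChAd55, ChAd73, ChAd83, ChAd146 and ChAd147 are a group of chimpanzee adenoviruses obtained from healthy animals housed in different European and US facilities. ChAd55, ChAd73, ChAd83, ChAd146 and ChAd147 have the property of showing no detectable reactivity with human sera. PanAd1, PanAd2 and PanAd3 are novel adenoviruses isolated from healthy bonobos (Pan paniscus) housed in different European and US facilities. PanAd1, PanAd2 and PanAd3 likewise show no detectable reactivity with human sera.
Stocks of common chimpanzee and bonobo adenoviruses were cloned after a first passage of amplification by infecting 293 cells seeded in 96-well plates. Virus cloning was performed by limiting dilution of the cell lysate obtained at the first passage of virus amplification. Five isolated clones were picked and serially propagated. After 3-4 serial passages of amplification, a large-scale preparation of adenovirus was performed on cells seeded in five two-layer cell factories (NUNC) (200 million cells per cell factory). Purified viral particles were obtained from the cell lysate by two ultracentrifugation steps on cesium chloride density gradients.
Genomic DNA was isolated from 3 × 10^12 pp of purified virus preparation by digestion with proteinase K (0.5 mg/ml) in 1% SDS-TEN (2 hours at 55 °C). After phenol-chloroform extraction and ethanol precipitation, the genomic DNA was resuspended in water and used for genome sequencing.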
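An initial classification of the new isolates was obtained by sequence analysis of hypervariable region 7 (HVR7) of the hexon gene. To this end, two primers were designed on the highly conserved regions flanking HVR7: TGTCCTACCARCTCTTGCTTGA (SEQ ID NO: 3) and GTGGAARGGCACGTAGCG (SEQ ID NO: 4). HVR7 was amplified by PCR using purified viral DNA or crude 293 lysate as template and then sequenced. More detailed information on the isolates was obtained by sequencing hypervariable regions 1 to 6. The DNA region containing HVR1-6 was amplified by PCR using oligonucleotides HVR1-6fd, CAYGATGTGACCACCGACCG (SEQ ID NO: 5), and HVR1-6rev, GTGTTYCTGTCYTGCAAGTC (SEQ ID NO: 6). Based on HVR sequence analysis, the newly isolated viruses were classified into subgroup E (ChAd55, ChAd73, ChAd83, ChAd146, ChAd147) and subgroup C (PanAd1, PanAd2 and PanAd3) of the human Ad virus classification (Horowitz, MS (1990), Adenoviridae and their replication. In Virology, B.N. Fields and D.M. Knipe, eds. (Raven Press, New York), pp. 1679-1740).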
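The HVR primers contain the IUPAC degeneracy codes R (A or G) and Y (C or T), so each primer stands for a small pool of concrete oligonucleotides. As a hedged illustration, the Python sketch below expands the four primers given above into their explicit variants. The primer sequences are taken from the text; the function name and the dictionary layout are assumptions made for this example.

```python
from itertools import product

# Standard IUPAC nucleotide codes (only the subset needed here plus the plain bases).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "Y": "CT"}

HVR_PRIMERS = {
    "SEQ ID NO: 3": "TGTCCTACCARCTCTTGCTTGA",
    "SEQ ID NO: 4": "GTGGAARGGCACGTAGCG",
    "SEQ ID NO: 5": "CAYGATGTGACCACCGACCG",
    "SEQ ID NO: 6": "GTGTTYCTGTCYTGCAAGTC",
}


def expand_degenerate(primer):
    """Return every concrete sequence represented by a degenerate primer."""
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


for name, primer in HVR_PRIMERS.items():
    variants = expand_degenerate(primer)
    print(name, len(variants), variants)
```

A primer with k degenerate two-fold positions expands to 2^k sequences, so the HVR1-6rev primer with its two Y positions represents four oligonucleotides.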
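A phylogenetic tree was obtained by aligning the hexon amino acid sequences of human and chimpanzee adenoviruses. The results are consistent with the initial classification based on nucleotide sequence alignment restricted to hexon HVR1-6 and HVR7 using the Align X program (Informax, Inc), demonstrating a close phylogenetic relationship of the ChAd55, ChAd73, ChAd83, ChAd146 and ChAd147 isolates with human Ad4 (subgroup E), whereas the bonobo adenovirus isolates PanAd1, PanAd2 and PanAd3 are related to human Ad1, 2, 5 and 6 (subgroup C).
Example 2: Vector construction
The PanAd1, PanAd2 and PanAd3 as well as the ChAd55, ChAd73, ChAd83, ChAd146 and ChAd147 viral genomes were cloned into plasmid vectors according to the strategy described below. All manipulations of the vector genomes were carried out in E. coli according to standard techniques. The vector systems were constructed by deleting the E1 and E3 regions from the ChAd and PanAd backbones. The E1 region was replaced with expression cassettes based on the human CMV IE promoter and the BGH pA signal containing the HCV non-structural region (HCV NS) and HIV gag (SEQ ID NO: 1) genes, for evaluation of immunological potency in animal models. In addition, ChAd and PanAd vectors expressing the secreted alkaline phosphatase gene (SEAP) were constructed for the neutralization assay. The vectors were propagated in 293 cells according to standard protocols and purified by CsCl gradients.
The PanAd1, PanAd2 and PanAd3 ΔE1 vectors were constructed according to the steps provided below.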
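I. Construction of the PanAd shuttle vector
The PanAd1 genome was used to construct a shuttle vector for cloning the complete PanAd1, PanAd2 and PanAd3 genomes by homologous recombination. Briefly, the shuttle vector used to clone bonobo adenovirus 1, referred to herein as pBAd1RLD_EGFP, was constructed as follows:
The PanAd1 left end (nt 1-450) was amplified by PCR using oligonucleotides 5'-ATCTGGAATTCGTTTAAACCATCATCAATAATATACCTTATTTTG-3' (SEQ ID NO: 7) and 5'-TCAGGAACTAGTTCCGTATACCTATAATAATAAAACGGAGACTTTG-3' (SEQ ID NO: 8), digested with SpeI and EcoRI, and then ligated into a plasmid vector already containing the HCMV-EGFP-bgh polyA cassette, thereby generating pBAd1-L. The PanAd1 right end (nt 37362-37772) was then amplified by PCR using oligonucleotides 5'-TCCAGCGGCGCGCCAGACCCGAGTCTTACCAGGA-3' (SEQ ID NO: 9) and 5'-ATTCAGGATCCGAATTCGTTTAAACCATCATCAATAATATACCTTATTTTG-3' (SEQ ID NO: 10) and cloned into pBAd1-L, thereby generating plasmid pBAd1-RL.
A PanAd1 DNA fragment containing the pIX coding region (nt 3498-4039) was then amplified by PCR using oligonucleotides 5'-TATTCTGCGATCGCTGAGGTGGGTGAGTGGGCG-3' (SEQ ID NO: 11) and 5'-TTACTGGCGCGCCTGCCTCGAGTAAACGGCATTTGCAGGAGAAG-3' (SEQ ID NO: 12) and cloned into pBAd1-RL, yielding the pBAd1RLD EGFP shuttle plasmid. Shuttle plasmids containing expression cassettes for the secreted alkaline phosphatase (SEAP), HIV gag and HCV non-structural region (NS) genes were also constructed by replacing the EGFP gene in the pBAd1RLD EGFP shuttle plasmid.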
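The cloning primers above carry restriction sites in their 5' tails, which is what makes the SpeI/EcoRI digestion and the later PmeI release possible. The following Python sketch, offered as an illustration rather than as part of the described method, scans primers SEQ ID NO: 7 to 9 for the recognition sequences of four enzymes. The primer sequences come from the text; the recognition sequences (EcoRI GAATTC, SpeI ACTAGT, PmeI GTTTAAAC, AscI GGCGCGCC) are the standard published ones and are not stated in the text, and the function name is hypothetical.

```python
RECOGNITION_SITES = {
    "EcoRI": "GAATTC",
    "SpeI": "ACTAGT",
    "PmeI": "GTTTAAAC",
    "AscI": "GGCGCGCC",
}

PRIMERS = {
    "SEQ ID NO: 7": "ATCTGGAATTCGTTTAAACCATCATCAATAATATACCTTATTTTG",
    "SEQ ID NO: 8": "TCAGGAACTAGTTCCGTATACCTATAATAATAAAACGGAGACTTTG",
    "SEQ ID NO: 9": "TCCAGCGGCGCGCCAGACCCGAGTCTTACCAGGA",
}


def find_sites(seq):
    """Map enzyme name -> list of 0-based start positions of its site in seq."""
    seq = seq.upper()
    hits = {}
    for enzyme, site in RECOGNITION_SITES.items():
        positions = []
        pos = seq.find(site)
        while pos != -1:
            positions.append(pos)
            pos = seq.find(site, pos + 1)
        if positions:
            hits[enzyme] = positions
    return hits


for name, primer in PRIMERS.items():
    print(name, find_sites(primer))
```

All four sites are palindromic, so scanning one strand is sufficient for these enzymes; a non-palindromic site would also require a search of the reverse complement.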
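The HIV gag, HCV NS region, SEAP and EGFP expression cassettes, based on the human cytomegalovirus (HCMV) promoter and the bovine growth hormone polyadenylation signal (Bgh polyA), were constructed as described in Emini et al., International Publication No. WO 03/031588. The viral DNA cassette was designed to contain restriction enzyme sites (PmeI) present only at the ends of the two ITRs, to allow release of the viral DNA from the plasmid DNA.
II. Construction of the ΔE1 PanAd1, PanAd2 and PanAd3 vectors
The PanAd1, PanAd2 and PanAd3 vectors were constructed by homologous recombination in E. coli strain BJ5183. BJ5183 cells were co-transformed with purified viral DNA of PanAd1, 2 and 3 and with pBAd1RLD-EGFP or pBAd1RLD-Gag. Homologous recombination between the pIX gene and right ITR DNA sequences present at the ends of linearized pBAd1RLD-EGFP or pBAd1RLD-Gag and the viral genomic DNA allowed insertion of the viral genomic DNA into the plasmid vector with simultaneous deletion of the E1 region, which was replaced by the expression cassette. This strategy allowed the construction of the pre-adeno plasmids pPanAd1, pPanAd2 and pPanAd3 expressing the EGFP or HIV gag transgene. The SEAP or HCV-NS expression cassette was then cloned into the pPanAd1, 2 and 3 vectors by replacing the EGFP or Gag expression cassette.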
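III. Deletion of the E3 region
A deletion of the E3 region was introduced into the PanAd1, PanAd2 and PanAd3 vector backbones using a strategy involving several steps of cloning and homologous recombination in E. coli. The PanAd1 E3 deletion spans nucleotide 28636 to nucleotide 32596 of the genomic PanAd1 sequence (SEQ ID NO: 13); the PanAd2 E3 deletion spans nucleotide 28653 to nucleotide 32599 of the genomic PanAd2 sequence (SEQ ID NO: 62); the PanAd3 E3 deletion spans nucleotide 28684 to nucleotide 32640 of the genomic PanAd3 sequence (SEQ ID NO: 63).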
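IV. Deletion of the E4 region
The native E4 regions of PanAd1, PanAd2 and PanAd3 were deleted and replaced with the Ad5 E4 ORF6 coding sequence (SEQ ID NO: 64). The coordinates of the E4 deletions introduced into the PanAd1, 2 and 3 backbones are as follows:
PanAd1 E4 deletion spans nucleotides 34690 to 37369 (SEQ ID NO: 13);
PanAd2 E4 deletion spans nucleotides 34696 to 37400 (SEQ ID NO: 62);
PanAd3 E4 deletion spans nucleotides 34690 to 37369 (SEQ ID NO: 63).
The deleted region includes all PanAd E4 ORFs, whereas the native E4 promoter and polyadenylation signal were not deleted.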
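The E3 and E4 coordinates given above can be turned into a small worked calculation. The Python sketch below is an illustration under explicit assumptions: coordinates are treated as 1-based and inclusive at both ends (the text says only that a deletion "spans" the two positions), the genome is a mock string of N characters with the PanAd1 length of 37772 nt given in the sequence listing, and the 5-character insert is a placeholder rather than the real Ad5 E4 ORF6 sequence of SEQ ID NO: 64. Function names are hypothetical.

```python
E3_DELETIONS = {"PanAd1": (28636, 32596), "PanAd2": (28653, 32599), "PanAd3": (28684, 32640)}
E4_DELETIONS = {"PanAd1": (34690, 37369), "PanAd2": (34696, 37400), "PanAd3": (34690, 37369)}


def region_length(start, end):
    """Length of a 1-based inclusive interval."""
    return end - start + 1


def replace_region(genome, start, end, insert=""):
    """Delete genome[start..end] (1-based, inclusive) and put `insert` in its place."""
    if not 1 <= start <= end <= len(genome):
        raise ValueError("coordinates outside the genome")
    return genome[:start - 1] + insert + genome[end:]


def apply_edits(genome, edits):
    """Apply (start, end, insert) edits from the highest coordinate down,
    so that the coordinates of the remaining edits stay valid."""
    for start, end, insert in sorted(edits, reverse=True):
        genome = replace_region(genome, start, end, insert)
    return genome


for virus in E3_DELETIONS:
    print(virus, "E3:", region_length(*E3_DELETIONS[virus]), "nt;",
          "E4:", region_length(*E4_DELETIONS[virus]), "nt")

mock_panad1 = "N" * 37772          # mock genome, same length as SEQ ID NO: 13
placeholder_orf6 = "XXXXX"         # placeholder, NOT the Ad5 E4 ORF6 sequence
edited = apply_edits(mock_panad1, [
    (*E3_DELETIONS["PanAd1"], ""),
    (*E4_DELETIONS["PanAd1"], placeholder_orf6),
])
print(len(edited))
```

Applying the edits from the 3' end backwards is a design choice that avoids having to shift the E4 coordinates after the E3 deletion has shortened the sequence.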
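HIV gag and HCV NS region expression cassettes based on the human cytomegalovirus (HCMV) promoter and the bovine growth hormone polyadenylation signal (Bgh polyA) were constructed as described in Emini et al., International Publication No. WO 03/031588, and inserted into the PanAd1, 2 and 3 ΔE1 EGFP vectors by homologous recombination in E. coli strain BJ5183, exploiting the homology between the HCMV and Bgh polyA DNA sequences.
V. Construction and rescue of ChAd55 ΔE1 expression vectors
Construction of the shuttle vector for ChAd55 cloning
The ChAd55 shuttle plasmid was constructed following the same strategy described above for the PanAd vectors and was then used to clone the ChAd55 viral genome. To this end, the shuttle vector pARS ChAd55, containing the right and left ends of the viral genome (the left end extending from the ITR to the pIX gene, with the E1 region deleted and replaced by the expression cassette), was linearized with the AscI restriction enzyme and co-transformed with purified ChAd55 viral DNA into E. coli strain BJ5183. Homologous recombination between the pIX gene and right ITR DNA sequences present at the ends of linearized pARS ChAd55 and the purified viral genomic DNA of ChAd55, ChAd73, ChAd83, ChAd146 and ChAd147 allowed insertion of the viral genomic DNA into the plasmid vector with simultaneous deletion of the E1 region. Figure 4 provides a schematic of the cloning strategy for the chimpanzee adenovirus 55 (ChAd55) genome.
Expression cassettes based on the human cytomegalovirus (HCMV) promoter and the bovine growth hormone polyadenylation signal (Bgh polyA) were constructed to express the secreted alkaline phosphatase (SEAP), EGFP, HIV gag and HCV NS genes. All expression cassettes were inserted into the single SnaBI site of the pARS ChAd55 vector, to be transferred into the ΔE1 adeno pre-plasmid by homologous recombination.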
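Example 3: Immunization experiments
The efficacy of the ChAd55, ChAd73, ChAd83, ChAd146, ChAd147, PanAd1, PanAd2 and PanAd3 vectors as potential recombinant vaccines was evaluated in mice using vectors expressing the HIV gag transgene. The vector potency of ChAd55 gag was compared with that of human Ad5 gag in immunization experiments performed in parallel. Experimental groups of 10 animals were injected in the quadriceps with a vector dose of 10^8 vp/mouse of Ad5gag or ChAd55gag (Figure 5A). In a separate experiment, groups of 5 animals were injected with a vector dose of 10^8 vp/mouse of Ad5gag or of PanAd1gag, PanAd2gag and PanAd3gag (Figure 5B). The potency of ChAd73 gag, ChAd83 gag, ChAd146 gag and ChAd147 gag was also determined by immunizing groups of 5 mice in parallel with ChAd55 gag at a vector dose of 10^8 vp/mouse (Figure 5C). The immune response elicited against HIV gag was measured by interferon-γ ELISpot assay on splenocytes. Compared with the human Ad5 gag vector, the results obtained with ChAd55, ChAd73, ChAd83, ChAd146 and ChAd147 as well as PanAd1, PanAd2 and PanAd3 in the immunization experiments show that the novel adenoviruses of the invention are at least as effective as the prior-art recombinant adenovirus Ad5 in eliciting a specific immune response.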
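Example 4: Neutralization studies
Neutralization assays were carried out to evaluate the prevalence in human sera of neutralizing antibodies against common chimpanzee adenoviruses 55, 73, 83, 146 and 147 and bonobo adenoviruses type 1, 2 and 3. The assay evaluated the effect of serum pre-incubation on the ability of ChAd55, ChAd73, ChAd83, ChAd146, ChAd147, PanAd1, PanAd2 and PanAd3 carrying the secreted alkaline phosphatase (SEAP) gene to transduce human 293 cells. The neutralization titer is defined as the serum dilution that reduces by 50% the SEAP activity observed in the positive control with virus and without serum. Each serum sample was tested at multiple dilutions (five 4-fold increments, starting from a 1:18 dilution up to 1:4608). Samples were pre-incubated for 1 hour at 37 °C and then added to 293 cells seeded in 96-well plates (3 × 10^4 cells/well). A panel of human sera was tested for neutralizing activity. The same panel was tested in parallel against Ad5 and the chimpanzee and bonobo Ad SEAP vectors. The results are provided in Figure 6. They show that the seroprevalence of the chimpanzee adenoviruses is lower than that of human adenovirus Ad5. However, the presence of neutralizing antibodies against the previously described ChAd (CV-68) was generally detectable in a subset of subjects. In contrast, none of the human sera tested so far neutralized ChAd55 or PanAd1, PanAd2 and PanAd3, even at very low titers. The same result was observed for ChAd73, ChAd83, ChAd146 and ChAd147. Therefore, the novel adenovirus isolates ChAd55, ChAd73, ChAd83, ChAd146 and ChAd147 as well as PanAd1, PanAd2 and PanAd3 represent an ideal solution to the problem of pre-existing anti-human Ad immunity, which limits the administration of viral vectors based on common human Ad serotypes such as Ad5.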
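The dilution scheme and the 50% criterion described in this example can be sketched in code. The Python example below is a hedged illustration: the dilution series (five 4-fold steps from 1:18) follows the text, but the way the titer is read out (the highest tested dilution that still gives at least a 50% reduction in SEAP activity, with no interpolation between dilutions) is an assumption, the SEAP readings are invented numbers, and the function names are hypothetical.

```python
def dilution_series(start=18, factor=4, steps=5):
    """Reciprocal serum dilutions: 1:18, 1:72, ... (five 4-fold steps by default)."""
    return [start * factor ** i for i in range(steps)]


def neutralization_titer(seap_by_dilution, control_seap):
    """Highest reciprocal dilution whose SEAP signal is <= 50% of the
    virus-only control; None if no tested dilution neutralizes."""
    titer = None
    for dilution in sorted(seap_by_dilution):
        if seap_by_dilution[dilution] <= 0.5 * control_seap:
            titer = dilution
    return titer


dilutions = dilution_series()
print(dilutions)

# Invented readings (arbitrary units) for one neutralizing and one non-neutralizing serum.
neutralizing = dict(zip(dilutions, [10, 20, 45, 80, 95]))
non_neutralizing = dict(zip(dilutions, [90, 95, 98, 100, 100]))
print(neutralization_titer(neutralizing, control_seap=100))
print(neutralization_titer(non_neutralizing, control_seap=100))
```

A serum for which the function returns None would be reported as having a titer below the lowest dilution tested (below 18 in this scheme).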
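Example 5: Immunological potency of PanAd1 and PanAd3 vectors compared with Ad5 vectors
The efficacy of the PanAd1 and PanAd3 vectors as potential recombinant vaccines was evaluated in BALB/c mice using vectors expressing a herpes simplex virus (HSV) antigen and vectors expressing a cancer antigen. The vector potency of PanAd1 and PanAd3 expressing the HSV Ag and the cancer Ag was compared with that of the corresponding vectors based on human Ad5.
To evaluate the antiviral potency, nine groups of BALB/c mice were injected in parallel in the quadriceps with PanAd1-HSV, PanAd3-HSV and Ad5-HSV at escalating vector doses from 10^7 vp/mouse up to 10^9 vp/mouse (see Figure 7A). The immune response elicited against the HSV antigen was measured by interferon-γ ELISpot assay on mouse splenocytes incubated with a peptide pool covering the entire amino acid sequence of the antigen. The results of the immunization experiments with PanAd1, PanAd2 and PanAd3 compared with the human Ad5 vector, reported in Figure 7, show that the novel adenoviruses of the invention elicit a specific immune response more effectively than the prior-art recombinant adenovirus Ad5 at every dose tested. This is clearly demonstrated by the higher frequency of antigen-specific T cells observed in mice immunized with the PanAd1 and PanAd3 vectors.
The efficacy of the PanAd vectors in eliciting an anti-tumor T-cell response was evaluated by immunizing groups of BALB/c mice by injection in the quadriceps with escalating vector doses from 10^7 vp/mouse up to 10^9 vp/mouse. Two groups of BALB/c mice were injected with an Ad5 vector expressing the tumor antigen at 10^7 vp/mouse and 10^9 vp/mouse. In parallel, three groups of BALB/c mice were immunized with 10^7, 10^8 or 10^9 vp of PanAd1 or PanAd3 vector carrying the same tumor antigen. T-cell responses were measured by interferon-γ ELISpot assay on splenocytes using a single peptide representing a mapped CD8 epitope. The results shown in Figure 7B demonstrate that, compared with the groups immunized with the Ad5 vector, the groups immunized with the PanAd vectors showed a higher frequency of responding animals at the lowest vector dose and a higher frequency of antigen-specific T cells.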
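Example 6: Immunization of cynomolgus macaques with PanAd vectors
Two groups of three macaques each were immunized by intramuscular injection of CsCl-purified PanAd1 and PanAd3 using a heterologous prime/boost regimen. At week 0, each animal in group 1 received a dose of 10^8 vp of PanAd3 Gag vector in the deltoid muscle, whereas the animals in group 2 received a dose of 10^10 vp. All animals in both groups were then boosted at week 13 with a single dose of 10^10 vp of PanAd1 Gag.
Cell-mediated immunity (CMI) was measured at different time points by IFN-γ ELISPOT assay. This assay measures HIV antigen-specific CD8+ and CD4+ T-lymphocyte responses. Peptides based on the amino acid sequence of the HIV Gag protein were prepared for use in these assays to measure immune responses in monkeys vaccinated with the adenoviral vectors. The individual peptides are overlapping 20-mers, offset by 10 amino acids.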
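How a pool of overlapping 20-mers offset by 10 amino acids tiles a protein can be shown with a short sketch. The Python code below is illustrative only: the input is a mock sequence, the handling of the C-terminal remainder (the last peptide may be shorter than 20 residues) is an assumption, and the function name is hypothetical. The 500-residue length used in the example follows from the 1503-nt gag coding sequence of SEQ ID NO: 1 (501 codons including the stop), but the peptide count it yields describes this tiling scheme, not necessarily the peptide set the authors used.

```python
def overlapping_peptides(protein, length=20, offset=10):
    """Tile a protein with peptides of `length` residues, each start shifted by `offset`."""
    peptides = []
    for start in range(0, len(protein), offset):
        peptides.append(protein[start:start + length])
        if start + length >= len(protein):
            break  # the C-terminus has been covered
    return peptides


mock_protein = "".join(chr(65 + i % 20) for i in range(45))   # 45-residue mock sequence
pool = overlapping_peptides(mock_protein)
for peptide in pool:
    print(peptide)

# Tiling a 500-residue protein (the length encoded by a 1503-nt ORF with stop codon):
print(len(overlapping_peptides("A" * 500)))
```

With a 10-residue offset, every position except the first and last ten residues is covered by two peptides, which reduces the chance that an epitope is split across a peptide boundary.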
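The IFN-γ ELISPOT assay provides a quantitative determination of antigen-specific T-lymphocyte responses. PBMC are serially diluted and placed in microplate wells coated with anti-rhesus IFN-γ antibody (MD-1, U-Cytech). They are cultured with an HIV Gag peptide pool for 20 hours, resulting in restimulation of precursor cells and secretion of IFN-γ. The cells are washed away, leaving the secreted IFN bound to the antibody-coated wells in concentrated areas where the cells were located. The captured IFN is detected with biotinylated anti-rhesus IFN antibody (detector Ab, U-Cytech) followed by alkaline phosphatase-conjugated streptavidin (Pharmingen 13043E). Addition of an insoluble alkaline phosphatase substrate results in dark spots in the wells at the sites where the cells were located, leaving one spot for each T cell that secreted IFN-γ.
The number of spots per well is directly related to the precursor frequency of antigen-specific T cells. Interferon-γ was selected as the cytokine visualized in this assay (using a specific anti-interferon-γ monoclonal antibody) because it is one of the most common and most abundant cytokines synthesized and secreted by activated T lymphocytes. For this assay, the number of spot-forming cells (SFC) per million PBMC is determined for samples in the presence and absence (medium control) of peptide antigen. Figure 8 shows data from macaques for PBMC obtained at different time points post-dose 1 and post-dose 2. All animals primed with PanAd3 at either dose showed a T-cell response against HIV Gag. The effective boost achieved by the second injection with PanAd1 demonstrates that, as already suggested by the alignment of the hexon, penton and fiber protein sequences, PanAd1 and PanAd3 are distinct serotypes and can be combined in a heterologous prime-boost immunization regimen. Thus, in a further aspect, the invention provides the use of two recombinant adenoviruses of the invention in a heterologous prime-boost immunization, wherein the two recombinant adenoviruses are different adenoviral serotypes, most preferably PanAd1 and PanAd3 as described herein.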
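The readout described above, spot-forming cells per million PBMC with and without peptide, amounts to a simple normalization. The Python sketch below illustrates it under stated assumptions: the spot counts and the number of cells per well are invented, subtracting the medium control from the peptide well is one common way of using the two measurements but is not spelled out in the text, and the function names are hypothetical.

```python
def sfc_per_million(spots, cells_per_well):
    """Normalize a spot count to spot-forming cells (SFC) per 10^6 PBMC."""
    if cells_per_well <= 0:
        raise ValueError("cells_per_well must be positive")
    return spots * 1_000_000 / cells_per_well


def net_response(peptide_spots, medium_spots, cells_per_well):
    """Peptide-stimulated SFC/10^6 minus the medium-control SFC/10^6 (assumed convention)."""
    return (sfc_per_million(peptide_spots, cells_per_well)
            - sfc_per_million(medium_spots, cells_per_well))


# Invented example: 50 spots with peptide, 4 spots with medium only, 200,000 PBMC per well.
print(sfc_per_million(50, 200_000))
print(net_response(50, 4, 200_000))
```

Because PBMC are serially diluted in the assay, the same normalization lets wells seeded at different cell numbers be compared on one scale.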
序列表
<110> Okairos AG
<120> 猿腺病毒核酸和氨基酸序列,包含其的载体及其用途
<130> 578-13 PCT 2
<140>
<141> 2010-02-02
<150> PCT/EP2009/000672
<151> 2009-02-02
<150> US 61/172,624
<151> 2009-04-24
<150> US 61/174,852
<151> 2009-05-01
<150> US 61/266,342
<151> 2009-12-03
<160> 65
<170> PatentIn 版本3.5
<210> 1
<211> 1503
<212> DNA
<213> 人免疫缺陷病毒
<400> 1
atgggtgcta gggcttctgt gctgtctggt
ggtgagctgg acaagtggga gaagatcagg 60
ctgaggcctg gtggcaagaa gaagtacaag
ctaaagcaca ttgtgtgggc ctccagggag 120
ctggagaggt ttgctgtgaa ccctggcctg
ctggagacct ctgaggggtg caggcagatc 180
ctgggccagc tccagccctc cctgcaaaca
ggctctgagg agctgaggtc cctgtacaac 240
acagtggcta ccctgtactg tgtgcaccag
aagattgatg tgaaggacac caaggaggcc 300
ctggagaaga ttgaggagga gcagaacaag
tccaagaaga aggcccagca ggctgctgct 360
ggcacaggca actccagcca ggtgtcccag
aactacccca ttgtgcagaa cctccagggc 420
cagatggtgc accaggccat ctccccccgg
accctgaatg cctgggtgaa ggtggtggag 480
gagaaggcct tctcccctga ggtgatcccc
atgttctctg ccctgtctga gggtgccacc 540
ccccaggacc tgaacaccat gctgaacaca
gtggggggcc atcaggctgc catgcagatg 600
ctgaaggaga ccatcaatga ggaggctgct
gagtgggaca ggctgcatcc tgtgcacgct 660
ggccccattg cccccggcca gatgagggag
cccaggggct ctgacattgc tggcaccacc 720
tccaccctcc aggagcagat tggctggatg
accaacaacc cccccatccc tgtgggggaa 780
atctacaaga ggtggatcat cctgggcctg
aacaagattg tgaggatgta ctcccccacc 840
tccatcctgg acatcaggca gggccccaag
gagcccttca gggactatgt ggacaggttc 900
tacaagaccc tgagggctga gcaggcctcc
caggaggtga agaactggat gacagagacc 960
ctgctggtgc agaatgccaa ccctgactgc
aagaccatcc tgaaggccct gggccctgct 1020
gccaccctgg aggagatgat gacagcctgc
cagggggtgg ggggccctgg tcacaaggcc 1080
agggtgctgg ctgaggccat gtcccaggtg
accaactccg ccaccatcat gatgcagagg 1140
ggcaacttca ggaaccagag gaagacagtg
aagtgcttca actgtggcaa ggtgggccac 1200
attgccaaga actgtagggc ccccaggaag
aagggctgct ggaagtgtgg caaggagggc 1260
caccagatga aggactgcaa tgagaggcag
gccaacttcc tgggcaaaat ctggccctcc 1320
cacaagggca ggcctggcaa cttcctccag
tccaggcctg agcccacagc ccctcccgag 1380
gagtccttca ggtttgggga ggagaagacc
acccccagcc agaagcagga gcccattgac 1440
aaggagctgt accccctggc ctccctgagg
tccctgtttg gcaacgaccc ctcctcccag 1500
taa 1503
<210> 2
<211> 20
<212> DNA
<213> 人工
<220>
<223> TLR9激动剂
<400> 2
tccatgacgt tcctgacgtt 20
<210> 3
<211> 22
<212> DNA
<213> 人工
<220>
<223> 引物: HVR7 引物1
<400> 3
tgtcctacca rctcttgctt ga 22
<210> 4
<211> 18
<212> DNA
<213> 人工
<220>
<223> 引物: HVR7 引物2
<400> 4
gtggaarggc acgtagcg 18
<210> 5
<211> 20
<212> DNA
<213> 人工
<220>
<223> 引物: HVR1-6fd
<400> 5
caygatgtga ccaccgaccg 20
<210> 6
<211> 20
<212> DNA
<213> 人工
<220>
<223> 引物: HVR1-6rev
<400> 6
gtgttyctgt cytgcaagtc 20
<210> 7
<211> 45
<212> DNA
<213> 人工
<220>
<223> 引物: PanAd1左端P1
<400> 7
atctggaatt cgtttaaacc atcatcaata atatacctta ttttg 45
<210> 8
<211> 46
<212> DNA
<213> 人工
<220>
<223> 引物: PanAd1左端P2
<400> 8
tcaggaacta gttccgtata cctataataa taaaacggag actttg 46
<210> 9
<211> 34
<212> DNA
<213> 人工
<220>
<223> 引物: PanAd1右端P1
<400> 9
tccagcggcg cgccagaccc gagtcttacc agga 34
<210> 10
<211> 51
<212> DNA
<213> 人工
<220>
<223> 引物: PanAd1右端P2
<400> 10
attcaggatc cgaattcgtt taaaccatca tcaataatat accttatttt g 51
<210> 11
<211> 33
<212> DNA
<213> 人工
<220>
<223> 引物: pIX P1
<400> 11
tattctgcga tcgctgaggt gggtgagtgg gcg 33
<210> 12
<211> 44
<212> DNA
<213> 人工
<220>
<223> 引物: pIX P2
<400> 12
ttactggcgc gcctgcctcg agtaaacggc atttgcagga gaag 44
<210> 13
<211> 37772
<212> DNA
<213> 腺病毒科 - 哺乳动物腺病毒
<400> 13
catcatcaat aatatacctt attttggatt
gaagccaata tgataatgag gtgggcggag 60
cggggcgggg cggggaggag cggcggcgcg
gggcgggccg ggaggtgtgg cggaagttga 120
gtttgtaagt gtggcggatg tgacttgcta
gcgccggatg tggtaaaagt gacgtttttt 180
ggagtgcgac aacgcccacg ggaagtgaca
tttttcccgc ggtttttacc ggatgtcgta 240
gtgaatttgg gcgttaccaa gtaagatttg
gccattttcg cgggaaaact gaaatgggga 300
agtgaaatct gattaatttc gcgttagtca
taccgcgtaa tatttgccga gggccgaggg 360
actttgaccg attacgtgga ggaatcgccc
aggtgttttt tgaggtgaat ttccgcgttc 420
cgggtcaaag tctccgtttt attattatag
tcagctgacg cggagtgtat ttatacccgc 480
tgatctcgtc aagaggccac tcttgagtgc
cagcgagtag agttttctcc tctgccgctc 540
cgctctgaca ccgggggaaa aatgagacat
ttcacctacg atggcggtgt cctcaccggc 600
cagctggctg cctcggtcct ggacgccctg
atcgaggagg tattggccga caattatcct 660
cctccagctc attttgagcc acctactctt
cacgaactgt atgatttgga cgtggtggca 720
cctagcgacc cgaacgagca ggcggtttcc
agtttttttc ctgactctat gctgttggcc 780
agccaggagg gggtcgagct cgagacccct
cctccaatcg ccgtttctcc tgagcctccg 840
accctgacca ggcagcccga tcgccgtgtt
ggacctgcga ctatgcccca tctgctgccc 900
gaggtgatcg atctcacctg taacgagtct
ggttttccac ccagcgagga tgaggacgaa 960
gagggtgagc agtttgtgtt agattctgtg
gaggaacccg ggcgcggttg cagatcttgt 1020
caataccatc ggaaaaatac aggagacccc
caaattatgt gttccctgtg ttatatgaag 1080
acgacctgta tgtttattta cagtaagttt
gtgattggtg ggtcggtggg ctgtagtgtg 1140
ggtaggtggt ctgtggtttt ttttttttta
atatcagctt gggctaaaaa actgctatgg 1200
taattttttt aaggtccggt gtctgaacct
gagcaggaag ctgaaccgga gcctgagagt 1260
cgccccagga gaaggcctgc aattctaact
agaccgagtg cacctgtagc gagggacctc 1320
agcagtgcag agaccaccga ttccggtcct
tcctcatccc ctccagagat tcatcccgtg 1380
gtgcctttgt gtcccctcaa gcccgttgcc
gtgagagtta gtgggcggag ggccgccgtg 1440
gagagcattg aggacttgct taatgagaca
caggaacctt tggacttgag ctgtaaacgc 1500
cctaggcaat aaacctgctt acctggactg
aatgagttga cgcctatgtt tgcttttgaa 1560
tgacttaatg tgtatataat aaagagtgag
ataatgttta attgcatggt gtgtttgatt 1620
ggggcggggt ttgttgggta tataagcttc
cctgggctaa acttggttac acttgacctc 1680
atggaggcct gggagtgttt agagagcttt
gccgaagtgc gtgccttgct ggaagagagc 1740
tctaataata cctctgggtg gtggaggtat
ttttggggct ctccccaggc taagttagtt 1800
tgtagaatca aggaggatta caagtgggaa
tttgaacagc ttttgaaatc ctgtggtgag 1860
ctcttggatt ctttgaatct gggccaccag
gctcttttcc aggacaagat catcaggact 1920
ttggattttt ccacaccggg gcgcattgct
gccggggttg cttttctagc ttttttgaag 1980
gataaatgga gcgaagagac ccacttgagt
tcgggatacg tcctggattt tctggccata 2040
caactgtgga gagcatggat caggcacaag
aacagaatgc aactgttgtc ttccgtccgt 2100
ccgttgctga ttcagccgga ggagcagcag
accgggccgg aggaccgggc tcgtctggaa 2160
ccagaagaga gggcaccgga gaggagcgcg
tggaacctgg gagccggcct gaacggccat 2220
ccacatcggg agtgaatgtt ggacaggtgg
cggatctctt tccagaactg cgacgaatct 2280
taactatcag ggaggatgga caatttgtta
aggggcttaa gagggagcgg ggggcttctg 2340
aacataacga ggaggccagt aatttagctt
ttagtctgat gaccagacac cgtcccgagt 2400
gcattacttt tcagcagatt aaggataatt
gtgccaatga gttagatctg ctgggtcaga 2460
agtacagcat agagcagttg accacttact
ggctgcagcc gggtgatgat ctggaggaag 2520
ctattagggt gtatgccaag gtggccctga
ggcccgattg caagtacaag ctcaaggggc 2580
tggtgaatat caggaattgt tgctacattt
ctgggaacgg ggcggaggtg gagatagaga 2640
ccgatgacag ggtggccttt aggtgtagca
tgatgaatat gtggcctggg gtgctgggca 2700
tggacggggt ggtgattatg aatgtgaggt
tcacggggcc caattttaat ggcacggtgt 2760
tcctgggcaa caccaacttg gtgctgcacg
gggtgagctt ctatggcttt aacaacacct 2820
gtgtggaggc ctggaccgat gtgaaggtcc
gtggctgtgc cttctacgga tgttggaagg 2880
cggtagtgtg tcgccccaag agcaggagtt
ccattaaaaa atgcttgttt gagaggtgca 2940
ccctgggggt gctggcggag ggcaactgtc
gggtgcgcca caatgtggcc tcagaatgcg 3000
gttgcttcat gctagtcaag agcgtggcgg
tcatcaagca taacatggtg tgcggcaaca 3060
gcgaggacaa ggcctcgcag atgctgacct
gctcggatgg caactgccac ttactgaaga 3120
ccgtacatat aaccagccac agccgcaagg
cctggcccgt gttcgagcac aacgtgttga 3180
cccgctgctc tttgcatctg ggcaacagga
ggggtgtgtt cctgccctat caatgcaact 3240
tgagccacac caagatcttg ctagagcccg
aaagcatgtc caaggtgaac ctgaacgggg 3300
tgtttgacat gaccctgaag atatggaagg
tgctgaggta cgacgagacc aggtctcgat 3360
gcaggccctg cgagtgcggg ggcaagcata
tgaggaacca gcctgtgatg ctggatgtga 3420
ccgaggagct gaggcctgac cacttggttc
tggcctgcac cagggccgag tttggttcta 3480
gcgatgaaga cacagactga ggtgggtgag
tgggcgtggt ctgggggtgg gaagcaatat 3540
ataagttggg ggtcttaggg tctctgtgtc
tgttttgcag agggaccgcc ggcgccatga 3600
gcgggagcag tagcagcaac gccttggatg
gcagcatcgt gagcccttat ttgacgacgc 3660
gcatgcccca ctgggccggg gtgcgtcaga
atgtgatggg ctccagcatc gacggacgac 3720
ccgtgctgcc cgcaaattcc gccacgctga
cctacgcgac cgtcgcgggg accccgttgg 3780
acgccaccgc cgccgccgcc gccaccgccg
ccgcctcggc cgtgcgcagc ctggccacgg 3840
actttgcatt cttgggaccc ttggccaccg
gggcggccgc ccgtgccgcc gttcgcgatg 3900
acaagctgac cgccctgctg gcgcagttgg
atgcgcttac ccgggaactg ggtgaccttt 3960
cgcagcaggt cgtggccctg cgccagcagg
tctccgccct gcaggctagc gggaatgctt 4020
ctcctgcaaa tgccgtttaa gataaataaa
accagactct gtttggatta aagaaaagta 4080
gcaagtgcat tgctctcttt atttcataat
tttccgcgcg cgataggccc gagtccagcg 4140
ttctcggtcg ttgagggtgc ggtgtatctt
ctccaggacg tggtagaggt ggctctggac 4200
gttgagatac atgggcatga gcccgtcccg
ggggtggagg tagcaccact gcagagcttc 4260
atgctccggg gtggtgttgt agatgatcca
gtcgtagcag gagcgctggg catggtgcct 4320
aaaaatgtcc ttaagcagca ggccgatggc
cagggggagg cccttggtgt aagtgtttac 4380
aaaacggttg agttgggaag ggtgcatgcg
gggtgagatg atgtgcatct tagattgtat 4440
ttttagattg gcgatgtttc ctcccagatc
ccttctggga ttcatgttgt ggaggaccac 4500
cagcacagta tatccggtgc acttgggaaa
tttgtcatgc agcttagagg gaaatgcgtg 4560
gaagaacttg gagacgccct tgtggcctcc
cagattctcc atgcattcgt ccatgatgat 4620
ggcaatgggc ccgcgggagg cggcctgggc
aaagatgttt ctggggtcac tgacatcgta 4680
gttgtgttcc agggtgagat cgtcataggc
catttttata aagcgcgggc ggagggtgcc 4740
cgactggggg atgatggttc cctcgggccc
cggggcgtag ttgccttcgc agatctgcat 4800
ttcccaggcc ttaatctctg aggggggaat
catatccact tgcggggcga tgaagaaaac 4860
ggtttccgga gccggggaga ttaactggga
tgagagcagg tttctcagca gctgtgactt 4920
tccacagccg gtgggtccat aaataacacc
tataaccggc tgcagctggt agttgagcga 4980
gctgcagctg ccgtcgtccc ggaggagggg
ggccacctca ttgagcatgt cccggacgcg 5040
cttgttctcc tcgaccaggt ccgccagaag
gcgctcgccg cccagggaca gcagctcttg 5100
caaggaagca aagtttttca gcggtttgag
gccgtccgcc gtgggcatgt ttttcagggt 5160
ctggccgagc agctccaggc ggtcccagag
ctcggtgacg tgctctacgg catctctatc 5220
cagcatatct cctcgtttcg cgggttgggg
cggctttcgc tgtagggcac caggcgatgg 5280
tcgtccagcg cggccagagt catgtccttc
catgggcgca gggtcctcgt cagggtggtc 5340
tgggtcacgg tgaaggggtg cgccccgggc
tgggcgctgg ccagggtgcg cttgagactg 5400
gtcctgctgg tgctgaagcg ctgccggtct
tcgccctgcg cgtcggccag gtagcatttg 5460
accatggtgt cgtagtccag cccctccgcg
gcgtgtccct tggcgcgcag cttgcccttg 5520
gaggtggcgc cgcacgcggg gcactgcagg
ctcttgagcg cgtagagctt gggggcgagg 5580
aagaccgatt cgggggagta ggcgtccgcg
ccgcaggccc cgcacacggt ctcgcactcc 5640
accagccagg tgagctcggg gcgctcgggg
tcaaaaacca ggtttccccc atgctttttg 5700
atgcgtttct tacctcgggt ctccatgagg
cggtgtcccc gttcggtgac gaagaggctg 5760
tccgtgtctc cgtagaccga cttgaggggt
ctgtcctcca ggggggtccc tcggtcctct 5820
tcgtagagaa actcggacca ctctgagaca
aaggcccgcg tccaggccag gacgaaggag 5880
gccaggtggg aggggtaccg gtcgttgtcc
actagggggt ccaccttctc caaggtgtga 5940
agacacatgt cgccctcctc ggcgtccagg
aaggtgattg gcttgtaggt gtaggccacg 6000
tgacccgggg ttccggacgg gggggtataa
aagggggtgg gggcgcgctc gtcctcactc 6060
tcttccgcat cgctgtctgc gagggccagc
tgctggggtg agtattccct ctcgaaggcg 6120
ggcatgacct cagcgctgag gctgtcagtt
tctaaaaacg aggaggattt gatgttcacc 6180
tgtcccgagc tgatgccttt gagggtgccc
gcgtccatct ggtcagaaaa cacgatcttt 6240
ttattgtcca gcttggtggc gaacgacccg
tagagggcgt tggagagcag cttggcgatg 6300
gagcgcaggg tctgattctt gtcccggtcg
gcgcgctcct tggccgcgat gttgagctgc 6360
acgtactcgc gcgcgacgca gcgccactcg
gggaagacgg tggtgcgctc gtcgggcacc 6420
aggcgcacgc gccagccgcg gttgtgcagg
gtgacgaggt ccacgctggt ggcgacctcg 6480
ccgcgcaggc gctcgttggt ccagcagagg
cgcccgccct tgcgcgagca gaaggggggc 6540
agggggtcga gttgggtttc gtccgggggg
tccgcgtcca ccgtgaagac cccggggcgc 6600
aggcgcgcgt cgaagtagtc gatcttgcat
ccttgcaagt ccagcgcccg ctgccagtcg 6660
cgggcggcga gcgcgcgctc gtaggggttg
agcggcgggc cccagggcat ggggtgggtg 6720
agcgcggagg cgtacatgcc gcagatgtca
tagacgtaga ggggctcccg gaggatgccc 6780
aggtaggtgg ggtagcagcg gccgccgcgg
atgctggcgc gcacgtagtc gtagagctcg 6840
tgcgaggggg cgaggaggtc ggggcccagg
ttggtgcggg cggggcgctc cgcgcggaag 6900
acgatctgcc tgaagatggc atgcgagttg
gaagagatgg tggggcgctg gaagacgttg 6960
aagctggcgt cctgcaggcc gacggcgtcg
cgcacgaagg aggcgtagga ctcgcgcagc 7020
ttgtgcacca gctcggcggt gacctgcacg
tcgagcgcgc agtagtcgag ggtctcgcgg 7080
atgatgtcat acttagcctg ccccttcttt
ttccacagct cgcggttgag gacgaactct 7140
tcgcggtctt tccagtactc ttggatcggg
aaaccgtccg gctccgaacg gtaagagccc 7200
agcatgtaga actggttgac ggcctggtag
gcgcagcagc ccttctccac gggcagggcg 7260
taggcctgcg cggccttgcg gagcgaggtg
tgggtcaggg cgaaggtgtc cctgaccatg 7320
accttgaggt actggtgttt gaagtcggag
tcgtcgcagc cgccccgctc ccagagcgag 7380
aagtcggtgc gctttttgga gcgggggttg
ggcagcgcga aggtgacatc gttgtagagg 7440
atcttgcccg cgcgaggcat gaagttgcgg
gtgatgcgga agggccccgg cacttccgag 7500
cggttgttga tgacctgggc ggcgagcacg
atctcgtcga agccgttgat gttgtggccc 7560
acgatgtaga gttccaggaa gcggggccgg
cccttgacgc tgggcagctt ctttagctct 7620
tcgtaggtga gctcctcggg cgaggcgagg
ccgtgctcgg ccagggccca gtccgccagg 7680
tgcgggttgt ccgcgaggaa ggaccgccag
aggtcgcggg ccaggagggt ctgcaggcgg 7740
tccctgaagg tcctgaactg gcggcctacg
gccatctttt cgggggtgac gcagtagaag 7800
gtgagggggt cttgctgcca ggggtcccag
tcgagctcca gggcgaggtc gcgcgcggcg 7860
gcgaccaggc gctcgtcgcc cccgaatttc
atgaccagca tgaagggcac gagctgcttt 7920
ccgaaggcgc ccatccaagt gtaggtctct
acatcgtagg tgacaaagag acgttccgtg 7980
cgaggatgcg agccgatcgg gaagaactgg
atctcccgcc accagttgga ggagtggctg 8040
ttgatgtggt gaaagtagaa gtcccgtcgg
cgggccgagc actcgtgctg gcttttgtaa 8100
aagcgagcgc agtactggca gcgctgcacg
ggctgtacct cttgcacgag atgcacctgc 8160
cgaccgcgga cgaggaagct gagtgggaat
ctgagccccc cgcatggctc gcggcctggc 8220
tggtgctctt ctactttgga tgcgtggccg
tcaccgtctg gctcctcgag gggtgttacg 8280
gtggagcgga tcaccacgcc gcgcgagccg
caggtccaga tatcggcgcg cggcggtcgg 8340
agtttgatga cgacatcgcg cagctgggag
ctgtccatgg tctggagctc ccgcggcggc 8400
ggcaggtcag ccgggagttc ttgcaggttt
acctcgcaga gacgggccag ggcgcggggc 8460
aggtccaggt ggtacttgaa ttcgagaggc
gtgttggtgg cggcgtcgat ggcttgcagt 8520
atgccgcagc cccggggcgc gacgacggtg
ccccgcgggg cggtgaagct cccgccgccg 8580
ctcctgctgt cgccgccggt ggcggggctt
agaagcggtg ccgcggtcgg gcccccggag 8640
gtaggggggg ctccggtccc gcgggcaggg
gcggcagcgg cacgtcggcg ccgcgcgcgg 8700
gcaggagctg gtgctgcgcc cggaggttgc
tggcgaaggc gacgacgcgg cggttgatct 8760
cctggatctg gcgcctctgc gtgaagacga
cgggtccggt gagcttgaac ctgaaagaga 8820
gttcgacaga atcaatctcg gtgtcattga
ccgcgacctg gcgcaggatc tcctgcacgt 8880
cgcccgagtt gtcttggtag gcgatctcgg
ccatgaactg ttcaatctct tcctcctgga 8940
ggtctccgcg tccggcgcgc tccacggtgg
ccgccaggtc gttggagatg cgcgccatga 9000
gctgcgagaa ggcgttgagt ccgccctcgt
tccacactcg gctgtagacc acgccgccct 9060
ggtcgtcgcg ggcgcgcatg accacctgcg
cgaggttgag ttccacgtgg cgcgcaaaga 9120
cggcgtagtt gcgcaggcgc tggaagaggt
agttgagggt ggtggcggtg tgctcggcca 9180
caaagaagta catgacccag cggcgcaacg
tggattcgtt gatgtccccc aaggcctcca 9240
gtcgctccat ggcctcgtag aagtccacgg
cgaagttgaa aaactgggag ttgcgcgccg 9300
acacggtcaa ctcctcctcc agaagacgga
tgagctcggc gacggtgtcg cgcacctcgc 9360
gctcgaaggc tatgggaatc tcttcctccg
ccagcatcac cacctcttcc tcttcttcct 9420
cctctggcac ttccatgatg gcttcctcct
cttcgggggg tggcggcggg ggagggggcg 9480
ctcggcgccg gcggcggcgc accgggaggc
ggtccacgaa gcgctcgatc atctccccgc 9540
ggcggcgacg catggtctcg gtgacggcgc
ggccgttctc tcggggacgc agctggaaga 9600
cgccgccggt catctggtgc tggggcgggt
ggccgtgggg cagcgagacc gcgctgacga 9660
tgcatcttaa caattgctgc gtaggtacgc
cgccgaggga cctgagggag tccagatcca 9720
ccggatccga aaacctttcg aggaaggcat
ctaaccagtc gcagtcgcaa ggtaggctga 9780
gcaccgtggc gggcggcggg gggtgggggg
agtgtctggc ggaggtgctg ctgatgatgt 9840
aattgaagta ggcggtcttg acacggcgga
tggtcgacag gagcaccata tctttgggcc 9900
cggcctgctg gatgcggagg cggtcggcca
tgccccaggc ttcgttctgg catctgcgca 9960
ggtctttgta gtagtcttgc atgagccttt
ccaccggcac ctcttctcct tcttcttctg 10020
acatctctgc tgcatctgcg gccctggggc
gacggcgcgc gcccctgccc cccatgcgcg 10080
tcaccccgaa ccccctgagc ggctggagca
gggccaggtc ggcgacgacg cgctcggcca 10140
ggatggcctg ctggacctgc gtgagggtgg
tttggaagtc atccaagtcc acgaagcggt 10200
ggtaggcgcc cgtgttgatg gtgtaggtgc
agttggccat gacggaccag ttgacggtct 10260
ggtggcccgg ttgcgtcatc tcggtgtacc
tgaggcgcga gtaggcgcgc gagtcgaaga 10320
tgtagtcgtt gcaagtccgc accaggtact
ggtagcccac caggaagtgc ggcggcggct 10380
ggcggtagag gggccagcgg agggtggcgg
gggctccggg ggccaggtct tccagcatga 10440
ggcggtggta ttcgtagatg tacctggaca
tccaggtgat gcccgcggcg gtggtggagg 10500
cgcgcgggaa gtcgcgcacc cggttccaga
tgttgcgcag cggcagaaag tgctccatgg 10560
taggcgtgct ctggccggtc aggcgcgcgc
agtcgttgat actctagacc agggaaaacg 10620
aaagccggtc agcgggcact cttccgtggt
ctggtggata aattcgcaag ggtatcatgg 10680
cggagggcct cggttcgagc cccgggcccg
ggccggacgg tccgccatga tccacgcggt 10740
taccgcccgc gtgtcgaacc caggtggcga
cgtcagacaa cggtggagtg ttccttttgg 10800
gttttttttc caaatttttc tggccgggcg
ccgacgccgc cgcgtaagag actagagtgc 10860
aaaagcgaaa gcagtaagtg gctcgctccc
tgtagcccgg aggatccttg ctaagggttg 10920
cgttgcggcg aaccccggtt cgagtctggc
tctcgcgggc cgctcgggtc ggccggaacc 10980
gcggctaagg cgggattggc ctccccctca
ttaaagaccc cgcttgcgga ttcctccgga 11040
cacaggggac gagccccttt ttacttttgc
ttttctcaga tgcatccggt gctgcggcag 11100
atgcgccccc cgccccagca gcagcagcaa
catcagcaag agcggcacca gcagcagcgg 11160
gagtcatgca gggccccctc gcccacgctc
ggcggtccgg cgacctcggc gtccgcggcc 11220
gtgtctggag ccggcggcgg ggggctggcg
gacgacccgg aggagccccc gcggcgcagg 11280
gccagacagt acctggacct ggaggagggc
gagggcctgg cgcgactggg ggcgccgtcc 11340
cccgagcgcc acccgcgggt gcagctgaag
cgcgactcgc gcgaggcgta cgtgcctcgg 11400
cagaacctgt tcagagaccg cgcgggcgag
gagcccgagg agatgcggga ccgcaggttc 11460
gccgcggggc gggagctgcg gcaggggctg
aaccgggagc ggctgctgcg cgaggaggac 11520
tttgagcccg acgcgcggac ggggatcagc
cccgcgcgcg cgcacgtggc ggccgccgac 11580
ctggtgacgg catacgagca gacggtgaac
caggagatca acttccaaaa aagcttcaac 11640
aaccacgtgc gcacgctggt ggcgcgcgag
gaggtgacca tcggcctgat gcacctgtgg 11700
gactttgtga gcgcgctgga gcagaacccc
aacagcaagc ctctgacggc gcagctgttc 11760
ctgatagtgc agcacagcag ggacaacgag
gcgttcaggg acgcgctgct gaacatcacc 11820
gagcccgagg gtcggtggct cctggacctg
attaacatct tgcagagcat agtggtgcag 11880
gagcgcagcc tgagcctggc cgacaaggtg
gcggccatca attactcgat gctcagtctg 11940
ggcaagtttt acgcgcgcaa aatctaccag
acgccgtacg tgcccataga caaggaggtg 12000
aagatcgacg gcttctacat gcgcatggcg
ctgaaggtgc tgaccctgag cgacgacctg 12060
ggcgtgtacc gcaacgagcg catccacaag
gccgtgagcg tgagccggcg gcgcgagctg 12120
agcgaccgcg agctgatgca cagcctgcag
cgggcgctgg cgggggccgg cagcggcgac 12180
agggaggccg agtcctactt cgaggcgggg
gcggacctgc gctgggtgcc cagccggagg 12240
gccctggagg ccgcgggggc ccgccgcgag
gactatgcag acgaggagga ggaggatgac 12300
gaggagtacg agctagagga gggcgagtac
ctggactaaa ccgcaggtgg tgtttttggt 12360
agatgcaaga cccgaacgtg gtggacccgg
cgctgcgggc ggctctgcag agccagccgt 12420
ccggccttaa ctctacagac gactggcgac
aggtcatgga ccgcatcatg tcgctgacgg 12480
cgcgcaatcc ggacgcgttc cggcagcagc
cgcaggccaa caggctctcc gccatcttgg 12540
aggcggtggt gcctgcgcgc gcgaacccca
cgcacgagaa ggtgctggcc atagtgaacg 12600
cgctggccga gaacagggcc atccgcccgg
acgaggccgg gctggtgtac gacgcgctgc 12660
tgcagcgcgt ggcccgctac aacagcggca
acgtgcagac caacctggac cggctggtgg 12720
gggacgtgcg cgaggcggtg gcgcagcggg
agcgcgcgga gcggcagggc aacctgggct 12780
ccatggtggc gctgaacgcc ttcctgagca
cgcagccggc caacgtgccg cgggggcagg 12840
aggactacac caactttgta agcgcgctgc
ggctgatggt gaccgagacc ccccagagcg 12900
aggtgtacca gtcggggccg gactacttct
tccagaccag cagacagggc ctgcagacgg 12960
tgaacctgag ccaggctttc aagaacctgc
gggggctgtg gggggtgaag gcgcccaccg 13020
gggaccgggc gacggtgtcc agcctgctga
cgcccaactc gcgcctgctg ctgctgctga 13080
tcgcgccgtt cacggacagc ggcagcgtgt
cccgggagac ctacctcggg cacctgctga 13140
cgctgtaccg cgaggccatc gggcagaccc
aggtggacga gcacaccttc caggagatca 13200
ccagcgtgag ccgcgcgctg gggcaggagg
acacgggcag cctggaggcg accctgaact 13260
acctgctgac caaccggcgg cagaagatcc
cctcgctgca tagtttgacc accgaggagg 13320
agcgcatcct gcgctacgtg cagcagagcg
tgagcctgaa cctgatgcgc gacggggtga 13380
cgcccagcgt ggcgctggac atgaccgcgc
gcaacatgga accgggcatg tacgccgcgc 13440
accggcctta catcaaccgc ctgatggact
acttgcatcg cgcggcggcc gtgaaccccg 13500
agtacttcac caacgccatc ctgaacccgc
actggctccc gccgcccggg ttctacagcg 13560
ggggcttcga ggtccccgag gccaacgacg
gcttcctgtg ggacgacatg gacgacagcg 13620
tgttctcccc gcggccgcag gcgctggcgg
aggcgtcgct gctccgcctc cccaagaagg 13680
aagagagccg ccggcccagc agcgcggcgg
cctctctgtc cgagctgggg gcggcggccg 13740
cgcggcccgg gtccctgggg ggcagcccct
ttcccagcct ggtggggtct ctgcagagcg 13800
ggcgcaccac ccggccccgg ctgctgggcg
aggacgagta cctgaacaac tccctgatgc 13860
agccggtgcg ggagaaaaac ctgccccccg
ccttccccaa caacgggata gagagcctgg 13920
tagacaagat gagcagatgg aagacctatg
cgcaggagca cagggactcg cccgtgctcc 13980
gtccgcccac gcggcgccag cgccacgacc
ggcagcgggg gctggtgtgg gatgacgagg 14040
actccgcgga cgatagcagc gtgctggacc
tgggggggag cggcggcaac ccgttcgcgc 14100
acctgcgccc ccgcctgggg aggatgtttc
aataaaaaaa aaaaaaaatc aagcatgatg 14160
caaggttttt taagcggata aataaaaaac
tcaccaaggc catggcgacc gagcgttgtt 14220
ggtttcttgt tgtgttccct tagtatgcgg
cgcgcggcga tgtaccacga gggacctcct 14280
ccctcttatg agagcgtggt gggcgcggcg
gcggcctctc cctttgcgtc gcagctggag 14340
ccgccgtacg tgcctccgcg gtacctgcgg
cctacggggg gaagaaacag catccgttac 14400
tcggagctgg cgcccctgta cgacaccacc
cgggtgtacc tggtggacaa caagtcggcg 14460
gacgtggcct ccctgaacta ccagaacgac
cacagcaatt ttttgaccac ggtcatccag 14520
aacaatgact acaccccgag cgaggccagc
acccagacca tcaatctgga tgaccggtcg 14580
cactggggcg gcgacctgaa aaccatcctg
cacaccaaca tgcccaacgt gaacgagttc 14640
atgttcacca ataagttcaa ggcgcgggtg
atggtgtcgc gctcgcacac caaggacgac 14700
cgggtggagc tgaagtacga gtgggtagag
ttcgagctgc ccgagggcaa ctactcggag 14760
accatgacca tagacctgat gaacaacgcg
atcgtggagc actatctgaa agtgggcagg 14820
cagaacgggg tcctggagag cgacatcggg
gtcaagttcg acaccaggaa cttccgcctg 14880
gggctggacc cggtcaccgg gctggttatg
cccggggtct acaccaacga ggccttccac 14940
cccgacatca tcctgctgcc cggctgcggg
gtggacttca cctacagccg cctgagcaac 15000
ctgctgggca tccgcaagcg gcagcccttc
caggagggct tcaggatcac ctacgaggac 15060
ctggaggggg gcaacatccc cgcgctcctg
gatgtggagg cctaccagga tagcttgaag 15120
gaagaagagg cgggagaggg cagcggcggt
ggcgccggtc aggaggaggg cggggcctcc 15180
tctgaggcct ctgcggaccc agccgctgcc
gccgaggcgg aggcggccga ccccgcgatg 15240
gtggtagagg aagagaagga tatgaacgac
gaggcggtgc gcggcgacac ctttgccact 15300
cggggggagg agaagaaagc ggaggccgag
gccgcggcag aggaggcggc agcagcggcg 15360
gcggcagtag aggcggcggc cgaggcggag
aagcccccca aggagcccgt gattaagccc 15420
ctgaccgaag atagcaagaa gcgcagttac
aacgtgctca aggacagcac caacaccgag 15480
taccgcagct ggtacctggc ctacaactac
ggcgacccgg cgacgggggt gcgctcctgg 15540
accctgctgt gtacgccgga cgtgacctgc
ggctcggagc aggtgtactg gtcgctgccc 15600
gacatgatgc aagaccccgt gaccttccgc
tccacgcggc aggtcagcaa cttcccggtg 15660
gtgggcgccg agctgctgcc cgtgcactcc
aagagcttct acaacgacca ggccgtctac 15720
tcccagctca tccgccagtt cacctctctg
acccacgtgt tcaatcgctt tcctgagaac 15780
cagattctgg cgcgcccgcc cgcccccacc
atcaccaccg tcagtgaaaa cgttcctgct 15840
ctcacagatc acgggacgct accgctgcgc
aacagcatcg gaggagtcca gcgagtgacc 15900
gtaactgacg ccagacgccg cacctgcccc
tacgtttaca aggccctggg catagtctcg 15960
ccgcgcgtcc tttccagccg cactttttaa
gcatgtccat cctcatctcg cccagcaata 16020
acaccggctg gggcctgctg cgcgcgccca
gcaagatgtt cggaggggcg aggaagcgct 16080
ccgaccagca ccccgtgcgc gtgcgcgggc
actaccgcgc tccctggggc gcgcacaaac 16140
gcgggcgcac cggcaccgcg gggcgcacca
ccgtggacga agccatcgac tcggtggtgg 16200
agcaggcgcg caactacacg cccgcggtct
ccaccgtgga cgcggctatc gagagcgtgg 16260
tgcgaggcgc gcggcggtac gccaaggcga
agagccgccg gaggcgcgtg gcccgccgcc 16320
accgccgccg acccgggagc gccgccaagc
gcgccgccgc cgccttgctc cgtcgggcca 16380
gacgcacggg ccgccgtgcc gccatgaggg
ccgcgcgccg cctggccgcc ggcatcacca 16440
ccgtggcccc ccgcgccaga agacgcgcgg
ccgccgccgc cgccgcggcc atcagcgacc 16500
tggccaccag gcgccggggc aacgtgtact
gggtgcgcga ctcggtgagc ggcacgcgcg 16560
tgcccgtgcg cttccgcccc ccgcggactt
gagaggagag gacaggaaaa agcaacaaca 16620
tcaacaacac caccactgag tctcctgctg
ttgtgtgtat cccagcggcg cgcgcgcaca 16680
cggcgacatg tccaagcgca aaatcaaaga
agagatgctc caggtcgtcg cgccggagat 16740
ctatgggccc ccgaagaagg aagagcagga
tttcaagccc cgcaagataa agcgggtcaa 16800
aaagaaaaag aaagatgacg atgatggcga
ggtggagttt ctgcgcgcca cggcgcccag 16860
gcgcccgctg cagtggaagg gtcggcgcgt
aaagcgcgtt ctgcgccccg gcaccgcggt 16920
ggtcttcacg cccggcgagc gctccacccg
cactttcaag cgcgtctatg acgaggtgta 16980
cggcgacgaa gacctgctgg agcaggccaa
cgatcgctcc ggagagtttg cttacgggaa 17040
gcggcaccgg gcgatggaga aggacgaggt
gctggcgctg ccgctggacc ggggcaaccc 17100
cacccccagc ctgaagcccg tgaccctgca
gcaggtgcta ccggccagcg cgccctccga 17160
gatgaagcgg ggcctgaagc gcgagggcgg
cgacctggcg cccaccgtgc agctaatggt 17220
gcccaagcgg cagaggctgg aggacgtgct
ggagaaaatg aaagtagacc ccggcctgca 17280
gccggacatc agggtccgcc ccatcaagca
ggtggcgccg ggcctcggcg tgcagaccgt 17340
ggacgtggtc atccccaccg gcgcctcctc
ttccagcgcc gccgccgccg ccactagcac 17400
cgcggacatg gagacgcaga ctagccccgc
cgccacctcc tcggcggagg tacagacgga 17460
cccctggttg ccgccgccgg cgaccgcccc
ctcgcgcgca cgccgcgggc gcaggaagta 17520
cggcgccgcc agcgcgctca tgcccgagta
cgccttgcat ccttccatcg cgcccacccc 17580
cggctaccga ggctacagtt accgcccgcg
aagagccaag ggctccaccc gccgcagccg 17640
ccgcgccgcc acctctaccc gccgccgcag
tcgccgccgc cgccggcagc ccgcgctggc 17700
tccgatctcc gtgaaaagag tggcgcgcaa
cgggaacacc ttggtgctgc ccagggcgcg 17760
ctaccacccc agcatcgttt aaaaagcctg
ttgtggttct tgcagatatg gccctcactt 17820
gccgcctccg tttcccggtg ccgggatacc
gaggaagatc gcgccgcagg aggggtatgg 17880
ccggacgcgg cctgagcgga ggcagtcgcc
gtgcgcaccg gcggcgacgc gccaccagcc 17940
gacgcatgcg cggcggagtg ctgcctctgc
tgatccccct gatcgccgcg gcgatcggcg 18000
ccgtgcccgg gatcgcctcc gtggccttgc
aggcgtccca gaggcgttga cacagacttc 18060
ttgcaagctt gcaaaaatat ggaaaaatcc
ccccaataaa aaagtctaga ctctcacgct 18120
cgcttggtcc tgtgactatt ttgtagaaaa
aagatggaag acatcaactt tgcgtcgctg 18180
gccccgcgtc acggctcgcg cccgttcctg
ggacactgga acgatatcgg caccagcaac 18240
atgagcggtg gcgccttcag ttggggctct
ctgtggagcg gcattaaaaa tatcggttct 18300
gccgttaaga attacggcac caaggcctgg
aacagcagca cgggccagat gttgagagac 18360
aagttgaaag agcagaactt ccagcagaag
gtggtggagg gtctggcctc cggcatcaac 18420
ggggtggtgg acctggccaa tcaggccgtg
caaaataaga tcaacagcag actggacccc 18480
cggccgccgg tggaggagct gccgccggcg
ctggagacgg tgtcccccga tgggcggggc 18540
gaaaagcgcc cgcggcccga cagggaagag
accactctgg tcacgcacac cgatgagccg 18600
cccccctacg aggaagccct gaagcaaggc
ttgcccacca ctcggcccat cgcgcccatg 18660
gccaccgggg tggtgggccg ccacaccccc
gccacgctgg acctgcctcc tcctcctgtt 18720
tcttcttcgg ccgccgatgc gcagcagcag
aaggcggcgc tgcccggtcc gcccgcggcc 18780
gccccccgtc ccaccgccag tcgagcgccc
ctgcgtcgcg cggccagcgg cccccgcggg 18840
gtcgcgaggc acagcagcgg caactggcag
aacacgctga acagcatcgt gggtctgggg 18900
gtgcagtccg tgaagcgccg ccgatgctac
tgaatagctt agctaacggt gttgtatgtg 18960
tgtatgcgtc ctatgtcacc gccagaggag
ctgctgagtc gccgccgttc gcgcgcccac 19020
cgccactacc accgccggta ccactccagc
gcccctcaag atggcgaccc catcgatgat 19080
gccgcagtgg tcgtacatgc acatctcggg
ccaggacgcc tcggagtacc tgagccccgg 19140
gctggtgcag ttcgcccgcg ccaccgacag
ctacttcagc ctgagtaaca agtttaggaa 19200
ccccacggtg gcgcccacgc acgatgtgac
caccgaccgg tcccagcgcc tgacgctgcg 19260
gttcatcccc gtggaccgcg aggacaccgc
gtactcttac aaggcgcggt tcaccctggc 19320
cgtgggcgac aaccgcgtgc tggacatggc
ctccacctac tttgacatcc gcggcgtgct 19380
ggacaggggc cccaccttca agccctactc
cggcaccgcc tacaactccc tggcccccaa 19440
gggcgccccc aactcctgcg agtgggagca
agtggagcca gctgaagagg cagcagaaaa 19500
tgaagatgaa gaagaagaag aggatgttgt
tgatcctcag gaacaggagc ccactactaa 19560
aacacatgta tatgctcaag ctcccctttc
tggcgagaaa attaccaaag atggtctgca 19620
aataggaact gaggctacgg cagcaggagg
cactaaagac ttatttgcag accctacatt 19680
ccagccagaa ccccaagttg gcgaatctca
gtggaatgag gcggatgcta cagcagctgg 19740
aggtagagtg ctcaaaaaga ccactcccat
gaaaccttgc tatggctcat atgcccgccc 19800
cacaaatgcc aatgggggcc aaggtgtgct
aaaggcaaat gcccagggag tgctcgagtc 19860
tcaggttgag atgcagttct tttccacttc
tacaaatgcc acaaacgagc aaaacaacat 19920
ccagcccaaa ttggtgctgt acagcgagga
tgtgcatatg gagaccccag acacacacat 19980
ctcctacaag cctacaaaaa gcgatgataa
ttcaaaagtc atgctgggtc agcagtccat 20040
gcccaacagg ccaaattaca tcgccttcag
agacaacttt atcgggctca tgtattataa 20100
cagcactggc aacatggggg tgctggcagg
tcaggcctca cagttgaatg cagtggtgga 20160
cctgcaagac agaaacacag aactgtccta
ccagctcttg cttgattcca tgggagacag 20220
aaccagatac ttttccatgt ggaatcaggc
cgtggacagt tatgacccag atgtcagaat 20280
tattgaaaat catggaaccg aagatgagct
gcccaactat tgtttccctc tgggaggcat 20340
agggataact gacacttacc aggccattaa
gactaatggc aatggggcag gagatcaagc 20400
caccacgtgg cagaaagact cacaatttgc
agaccgcaac gaaatagggg tgggaaacaa 20460
cttcgccatg gagatcaacc tcagtgccaa
cctgtggagg aacttcctct actccaacgt 20520
ggccctgtac ctgccagaca agcttaagta
caacccctcc aacgtggaaa tctctgacaa 20580
ccccaacacc tacgactaca tgaacaagcg
agtggtggcc ccggggctgg tggactgcta 20640
catcaacctg ggcgcgcgct ggtccctgga
ctacatggac aacgtcaacc ccttcaacca 20700
ccaccgcaat gcgggcctgc gctaccgctc
catgcttctg ggcaacgggc gctacgtgcc 20760
cttccacatc caggtgcccc agaagttctt
tgccatcaag aacctcctcc tcctgccggg 20820
ctcctacacc tacgagtgga acttcaggaa
ggatgtcaac atggtcctgc agagctctct 20880
gggcaacgac ctcagggtcg acggggccag
catcaagttc gagagcatct gcctctacgc 20940
caccttcttc cccatggccc acaacacggc
ctccacgctc gaggccatgc tcaggaacga 21000
caccaacgac cagtccttca acgactacct
ctccgccgcc aacatgctct accccatccc 21060
cgccaacgcc accaacgtcc ccatctccat
cccctcgcgc aactgggcgg ccttccgcgg 21120
ctgggccttc acccgcctta agaccaagga
gaccccctcc ctgggctcgg gtttcgaccc 21180
ctactacacc tactcgggct ccatacccta
cctggacgga accttctacc tcaaccacac 21240
tttcaagaag gtctcggtca ccttcgactc
ctcggtcagc tggccgggca acgaccgcct 21300
gctcaccccc aacgagttcg agatcaagcg
ctcggtcgac ggggagggct acaacgtagc 21360
ccagtgcaac atgaccaagg actggttcct
catccagatg ctggccaact acaacatcgg 21420
ctatcagggc ttctacatcc cagagagcta
caaggacagg atgtactcct tctttaggaa 21480
cttccagccc atgagccggc aggtggtgga
cgaaaccaag tacaaggact accagcaggt 21540
gggcatcatc caccagcaca acaactcggg
cttcgtgggc tacctcgccc ccaccatgcg 21600
cgagggacag gcctaccccg ccaacttccc
ctacccgctc attggcaaga ccgcggtcga 21660
cagcatcacc cagaaaaagt tcctctgcga
ccgcaccctc tggcgcatcc ccttctccag 21720
caacttcatg tccatgggtg cgctcacgga
cctgggccag aacctgctct atgccaactc 21780
cgcccacgcg ctcgacatga ccttcgaggt
cgaccccatg gacgagccca cccttctcta 21840
tgttctgttc gaagtctttg acgtggttcg
ggtccaccag ccgcaccgcg gcgtcatcga 21900
gaccgtgtac ctgcgcacgc ccttctcggc
cggcaacgcc accacctaaa gaagcaagcc 21960
gccaccgcca ccacctgcat gtcgtcgggt
tccaccgagc aggagctcaa ggccatcgtc 22020
agagacctgg gatgcgggcc ctattttttg
ggcaccttcg acaaacgctt cccgggcttc 22080
gtcgccccgc acaagctggc ctgcgccatc
gtcaacacgg ccggccgcga gaccgggggc 22140
gtgcactggc tggccttcgc ctggaacccg
cgctccaaaa catgctacct ctttgacccc 22200
ttcggattct cggaccagcg gctcaagcag
atctaccagt tcgagtacga gggcctgctg 22260
cgccgcagcg ccatcgcctc ctcgcccgac
cgctgcgtca ccctcgagaa gtccacccag 22320
accgtgcagg ggcccgactc ggccgcctgc
ggtctcttct gctgcatgtt cctgcatgcc 22380
tttgtgcact ggccccagag tcccatggac
cgcaacccca ccatgaactt gctgacgggg 22440
atccccaact ccatgctcca gagcccccag
gccgcgccca ccctgcgccg caaccaggag 22500
cggctctaca gcttcctgga gcgccactcg
ccctacttcc gccgccacag cgcgcagatc 22560
aggggggcca cctctttctg ccgcatgcaa
gagatgcaag ggaaaatgca atgatgtaca 22620
cagacacttt ctttttctca ataaatggca
actttattta tacatgctct ctctcgggta 22680
ttcatttccc caccacccac cacccgccgc
cgtaaccatc tgctgctggc tttttaaaaa 22740
tcgaaagggt tctgccggga atcgccgtgc
gccacgggca gggacacgtt gcggaactgg 22800
tagcgggtgc cccacttgaa ctcgggcacc
accatgcggg gcaagtcggg gaagttgtcg 22860
gcccacaggc cgcgggtcag caccagcgcg
ttcatcaggt cgggcgccga gatcttgaag 22920
tcgcagttgg ggccgccgcc ctgcgcgcgc
gagttgcggt acaccgggtt gcaacactgg 22980
aacaccagca gcgccggata attcacgctg
gccagcacgc tccggtcgga gatcagctcg 23040
gcgtccaggt cctccgcgtt gctcagcgcg
aacggggtca gcttgggcac ctgccgcccc 23100
aggaagggag cgtgccccgg cttcgagttg
cagtcgcagc gcagcgggat cagcaggtgc 23160
ccgcggccgg actcggcgtt ggggtacagc
gcgcgcatga aggcctccat ctggcggaag 23220
gccatctggg ccttggcgcc ctccgagaag
aacatgccgc aggacttgcc cgagaactgg 23280
ttcgcggggc agctagcgtc gtgcaggcag
cagcgcgcgt cggtgttggc aatctgcacc 23340
acgttgcgcc cccaccggtt cttcacgatc
ttggccttgg aagcctgctc cttcagcgcg 23400
cgctgcccgt tctcgctggt cacatccatc
tcgatcacgt gctccttgtt caccatgctg 23460
ctgccgtgca gacacttcag ctcgccctcc
acctcggtgc agcggtgctg ccacagcgcg 23520
cagcccgtgg gctcgaaatg cttgtaggtc
acctccgcgt aggactgcag gtaggcctgc 23580
aggaagcgcc ccatcatggt cacgaaggtc
ttgttgctgc tgaaggtcag ctgcagcccg 23640
cggtgctcct cgttcagcca ggccttgcac
acggccgcca gcgcctccac ctggtcgggc 23700
agcatcttga agttcagctt cagctcattc
tccacatggt acttgtccat cagcgcgcgc 23760
gcagcctcca tgcccttctc ccaggccgac
accagcggca ggctcaaggg gttcaccacc 23820
gtcgcagtcg ccgccgcgct ttcgctttcc
gctccgctgt tctcttcttc ctcctcctct 23880
tcttcctcgc cgcccgcgcg cagcccccgc
accacggggt cgtcttcctg caggcgccgc 23940
accgagcgct tgccgctcct gccctgcttg
atgcgcacgg gcgggttgct gaagcctacc 24000
atcaccagcg cggcctcttc ttgctcgtcc
tcgctgtcca ctatgacctc gggggagggc 24060
gacctcagaa ccgtggcgcg ctgcctcttc
tttttcctgg gggcgtttgc aagctccgcg 24120
gccgcggccg ccgccgaggt cgaaggccga
gggctgggcg tgcgcggcac cagcgcgtcc 24180
tgcgagccgt cctcgtcctc ggactcgagg
cggcagcgag cccgcttctt tgggggcgcg 24240
cggggcggcg gcggcggggg cggcggcgac
ggagacgggg acgagacatc gtccagggtg 24300
ggaggacggc gggccgcgcc gcgtccgcgc
tcgggggtgg tttcgcgctg gtcctcttcc 24360
cgactggcca tctcccactg ctccttctcc
tataggcaga aagagatcat ggagtctctc 24420
atgcaagtcg agaaggagga ggacagccta
accaccgccc cctctgagcc ctccgccgcc 24480
accgccgcgg acgacgcgcc taccaccgcc
gccaccacca ccaccattac caccctaccc 24540
ggcgacgcag ccccgatcga gaaggaagtg
ttgatcgagc aggacccggg ttttgtgagc 24600
gaagaggagg atgaggagga tgaaaaggag
aaggataccg ccgcctcagt gccaaaagag 24660
gataaaaagc aagaccagga cgacgcagag
acagatgagg cagcaatcgg gcggggggac 24720
gagaggcatg atgatgatga tgatgacggc
tacctagacg tgggagacga cgtgctgctt 24780
aagcacctgc accgccagtg cgtcatcgtc
tgcgacgcgc tgcaggagcg ctgcgaagtg 24840
cccctggacg tggcggaggt cagccgcgcc
tacgagcggc acctcttcgc gccacacgtg 24900
ccccccaagc gccgggagaa cggcacctgc
gagcccaacc cgcgcctcaa cttctacccg 24960
gtcttcgcgg tacccgaggt gctggccacc
taccacatct tcttccaaaa ctgcaagatc 25020
cccctctcct gccgcgccaa ccgcacccgc
gccgacaagg cgctggccct gcggcagggc 25080
gcccacatac ctgatatcgc ctctctggag
gaggtgccca agatcttcga gggtctcggt 25140
cgcgacgaga aacgggcggc gaacgctctg
caaggagaca gcgaaaacga gagtcactcg 25200
ggggtgctgg tggagctcga gggcgacaac
gcgcgcctgg ccgtgctcaa gcgcagcatc 25260
gaagtcaccc acttcgccta cccggcgctc
aacctgcccc ccaaggtcat gagtgtggtc 25320
atgagcgagc tcatcatgcg ccgcgcccag
cccctggacg cggatgcaaa cttgcaagag 25380
ccctccgagg aaggcctgcc cgcggtcagc
gacgagcagc tggcgcgctg gctggagacc 25440
cgcgaccccg cccagctgga ggagcggcgc
aagctcatga tggccgcggt gctcgtcacc 25500
gtggagctcg agtgtctgca gcgcttcttc
ggggaccccg agatgcagcg caagctcgag 25560
gagaccctgc actacacctt ccgccagggc
tacgtgcgcc aggcctgcaa gatctccaac 25620
gtggagctct gcaacctggt ctcctacctg
ggcatcctgc acgagaaccg cctcgggcag 25680
aacgtcctgc actccaccct caaaggggag
gcgcgccgcg actacgtccg cgactgcgtc 25740
tacctcttcc tctgctacac gtggcagaca
gccatggggg tctggcagca gtgcctggag 25800
gagcgcaacc tcaaggagct ggagaagctc
ctcaggcgcg ccctcaggga cctctggagg 25860
ggcttcaacg agcgctcggt ggccgccgcg
ctggcggaca tcatcttccc cgagcgcctg 25920
ctcaaaaccc tgcagcaggg cctgcccgac
ttcaccagcc agagcatgct gcagaacttc 25980
aggaccttca tcctggagcg ctcgggcatc
ctgccggcca cctgctgcgc gctgcccagc 26040
gacttcgtgc ccatcaggta cagggagtgc
ccgccgccgc tctggggcca ctgctacctc 26100
ttccagctgg ccaactacct cgcctaccac
tcggatctca tggaagacgt gagcggcgag 26160
ggcctgctcg agtgccactg ccgctgcaac
ctgtgcacgc cccaccgctc tctagtctgc 26220
aacccgcagc tgctcagcga gagtcagatt
atcggtacct tcgagctgca gggtccctcg 26280
cccgacgaaa agtccgcggc tccggggttg
aaactcactc cggggctgtg gacttccgcc 26340
tacctacgca aatttgtacc tgaagactac
cacgcccacg agatcaggtt ttacgaagac 26400
caatcccgcc cgcccaaggc ggagctcacc
gcctgcgtca ttacccaggg ccacatcctg 26460
ggccaattgc aagccatcaa caaagcccgc
caagagttct tgctgaaaaa gggtcggggg 26520
gtgtacctgg acccccagtc cggcgaggag
ctaaacccgc tacccccgcc gccgccccag 26580
cagcgggacc ttgcttccca ggatggcacc
cagaaagaag cagccgccgc cgccgccagc 26640
atacatgctt ctggaggaag aggaggactg
ggacagtcag gcagaggagg tttcggacga 26700
ggacgaggag gaggagatga tggaagactg
ggaggaggac agcctagacg aggaagcttc 26760
agaggccgaa gaggtggcag acgcaacacc
atcaccctcg gccgcagccc cctcgccggc 26820
gcccccgaaa tcctccgacc ccagcagcag
cgctataacc tccgctcctc cggcgccggc 26880
gcccacccgc agcagaccca accgtagatg
ggacactaca ggaaccgggg tcggtaagtc 26940
caagtgcccc ccagcgccgc ccccgcaaca
ggagcaacag cagcagcagc ggcgacaggg 27000
ctaccgctcg tggcgcggac acaagaacgc
catagtcgcc tgcttgcaag actgcggggg 27060
caacatctcc ttcgcccgcc gcttcctgct
cttccaccac ggggtggctt ttccccgcaa 27120
tgtcctgcat tactaccgtc atctctacag
cccctactgc ggcggcagcg gcgacccaga 27180
gggagcggcg gcagcagcag cgccagccac
agcggcgacc acctaggaag acctccgcgg 27240
gcaagacggc gggagccggg agacccgcgg
cggcggcggt agcggcggcg gcgggcgcac 27300
tgcgcctctc gcccaacgaa cccctctcga
cccgggagct cagacacagg atcttcccca 27360
ctctgtatgc tatcttccag cagagcagag
gccaggaaca ggagctgaaa ataaaaaaca 27420
gatctctgcg ctccctcacc cgcagctgtc
tgtatcacaa aagcgaagat cagcttcggc 27480
gcacgctgga ggacgcggag gcactcttca
gcaaatactg cgcgctgact cttaaggact 27540
agccgcgcgc ccttctcgaa tttaggcggg
agaaagacta cgtcatcgcc gaccgccgcc 27600
cagcccaccc agccgacatg agcaaagaga
ttcccacgcc ctacatgtgg agctaccagc 27660
cgcagatggg actcgcggcg ggagcggccc
aagactactc cacccgcatg aactacatga 27720
gcgcggggcc ccacatgatc tcacgggtta
atgggatccg cgcccagcga aaccaaatac 27780
tgctggaaca ggcggccata accgccacac
cccgtcatga cctcaatccc cgaaattggc 27840
ccgccgccct cgtgtaccag gaaaccccct
ctgccaccac cgtggtactt ccgcgtgaca 27900
cccaggccga agtccagatg actaactcag
gggcgcagct cgcgggcggc tttcgtcacg 27960
gggtgcggcc gcaccggccg ggtatattac
acctggcgat cagaggccga ggtattcagc 28020
tcaacgacga gtcggtgagc tcttcgctcg
gtctccgtcc ggacggaacc ttccagatcg 28080
ccggatcagg tcgctcctca ttcacgcctc
gccaggcgta cctgactctg cagacctcct 28140
cctcggagcc tcgctccggc ggcatcggca
ccctccagtt cgtggaggag ttcgtgccct 28200
cggtctactt caaccccttc tcgggacctc
ccggacgcta ccccgaccag ttcatcccga 28260
actttgacgc ggtgaaggac tcggcggacg
gctacgactg aatgtcaagt gctgaggcag 28320
agagcgttcg cctgaaacac ctccagcact
gccgccgctt cgcctgcttt gcccgcagct 28380
ccggtgagtt ctgctacttt cagctgcccg
aggagcatac cgaggggccg gcgcacggcg 28440
tccgcctaac cacccagggc gaggttacct
gtacccttat ccgggagttt accctccgtc 28500
ccctgctagt ggagcgggag cggggttctt
gtgtcataac tatcgcctgc aactgcccta 28560
accctggatt acatcaagat ctttgttgtc
acctgtgcgc tgagtataat aaacgctgag 28620
atcagactct actggggctc ctgtcgccat
cctgtgaacg ccaccgtctt cacccacccc 28680
gagcagcccc aggcgaacct cacctgcggc
ctgcgtcgga gggccaagaa gtacctcacc 28740
tggtacttca acggcacccc ctttgtggtt
tacaacagct tcgaccagga cggagttgcc 28800
ttgagagacg acctttccgg tctcagctac
tccattcaca agaacaccac cctccacctc 28860
ttccctccct acctgccggg aacctacgag
tgcgtcaccg gccgctgcac ccacctcctc 28920
cgcctgatcg taaaccagac ctttccggga
acacacctct tccccagaac aggaggtgag 28980
ctcaggaaac cccctggggc ccagggcgga
gacttacctt cgacccttgt ggggttagga 29040
ttttttatcg ccgggttgct ggctctcctg
atcaaagctt ccttgagatt tgttctctcc 29100
ctttactttt atgaacagct caacttctaa
taacgctacc ttttctcagg aatcgggtag 29160
taacttctct tctgaaatcg ggctgggtgt
gctgcttact ctgttgattt ttttccttat 29220
catacttagc cttctgtgcc tcaggctcgc
cgcctgctgc gcacatatct acatctacag 29280
ccggttgctt aactgctggg gtcgccatcc
aagatgaacg gggctcaggt gctatgtctg 29340
ctggccctgg tggcctgcag tgccgccgtc
aattttgagg aacccgcttg caatgtgact 29400
ttcaagcctg aaggcgcaca ttgcaccact
ctggttaaat gtgtgacctc tcatgagaaa 29460
ctgctcatcg cctacaaaaa caaaacaggc
gagttcgcgg tctatagcgt gtggcaaccc 29520
ggagaccata ataactactc agtcaccgtc
ttcgagggtg cggagtctaa gaaattcgat 29580
tacacctttc ccttcgagga gatgtgtgaa
gcggtcatgt acctgtccaa acagtacaag 29640
ctgtggcccc ccacccccga ggcgtgtgtg
gaaaacactg ggtctttctg ctgtctctct 29700
ctgacaatca ctgtgcttgc tctaatctgc
acgctgctgt acatgaaatt caggcagagg 29760
cgaatcttta tcgatgagaa aaaaatgcct
tgatcgctaa caccggcttt ctgtctgcag 29820
aatgaaagca atcacctccc tactaatcag
caccaccctc cttgcgattg cccatgggtt 29880
gacacgaatc gaagtgccag tggggtccaa
tgtcaccatg gtgggccccg ccggcaattc 29940
ctccctgatg tgggaaaaat atgtccgtaa
tcaatgggat cattactgct ctaatcgaat 30000
ctgtatcaag cccagagcca tctgcgacgg
gcaaaatcta actttgattg atgtgcaaat 30060
gacggatgct gggtactatt acgggcagcg
gggagaaatg attaattact ggcgacccca 30120
caaggactac atgctgcatg tagtcaaggc
agtccccact actaccaccc ccaccactac 30180
cactcccacc actcccacta ctaccacccc
caccactact actagcactg ctactaccgc 30240
tgcccgcaaa gctattaccc gcaaaagcac
catgcttagc accaagcccc attctcactc 30300
ccacgccggc gggcccaccg gtgcggcctc
agaaaccacc gagctttgct tctgccaatg 30360
cactaacgcc agcgcccacg aactgttcga
cctggagaat gaggatgatg accagctgag 30420
ctccgcttgc ccggtcccgc tgcccgcaga
gccggtcgcc ctgaagcagc tcggtgatcc 30480
atttaatgac tctcctgttt atccctctcc
cgaatacccg cccgactcta ccttccacat 30540
cacgggcacc aacgacccca acctctcctt
ctacctgatg ctgctgcttt gtatctctgt 30600
ggtatcttcc gcgctcatgt tactgggcat
gttctgctgc ctcatctgcc gcagaaagag 30660
aaagtctcgc tctcagggcc aaccactgat
gcccttcccc taccccccag attttgcaga 30720
taacaagata tgagcacgct gctgacacta
accgctttac tcgcctgcgc tctaaccctt 30780
gtcgcttgcg aatccagata ccacaatgtc
acagttgtga caggagaaaa tgttacattc 30840
aactccacgg ccgacaccca gtggtcgtgg
agcggccacg gtagctatgt atacatctgc 30900
aatagctcca cctcccctag catgtcctct
cccaagtacc actgcaatgc cagcctgttc 30960
accctcatca acgcctccac ctcggacaat
ggactctatg taggctatgt gacacccggt 31020
gggcggggaa agacccacgc ctacaacctg
caagttcgcc acccctccac caccgccacc 31080
acctctgccg cccctacccg cagcagcagc
agcatcagca gcagcagcag cagcagcaga 31140
ttcctgactt taatcctagc cagctcaaca
accaccgcca ccgctgagac cacccacagc 31200
tccgcgcccg aaaccaccca cacccaccac
ccagagacga ccgcggcctc cagtgaccag 31260
atgtcggcca acatcaccgc ctcgggtctt
gaacttgctt caacccccac cccaaaacca 31320
gtggatgcag ccgacgtctc cgccctcgtc
aatgactggg cggggctggg aatgtggtgg 31380
ttcgccatag gcatgatggc gctctgcctg
cttctgctct ggctcatctg ctgcctcaac 31440
cgcaggcggg ccagacccat ctatagaccc
atcattgttc tcaaccccgc tgatgatggg 31500
atccatagat tggatggtct gaaaaaccta
cttttctctt ttacagtatg ataaattgag 31560
acatgcctcg cattttcatg tacttgacac
ttctcccact ttttctgggg tgttctacgc 31620
tggccgccgt ctctcacctc gaggtagact
gcctcacacc cttcactgtc tacctgattt 31680
acggattggt caccctcact ctcatctgca
gcctaatcac agtagtcatc gccttcatcc 31740
agtgcattga ctacatctgt gtgcgcctcg
catacctgag acaccacccg cagtaccgag 31800
acaggaacat tgcccaactc ctaagactgc
tctaatcatg cataagactg tgatctgcct 31860
cctcatcctc ctctccctgc ccgctctcgt
ctcatgccag cccgccacaa aacctccacg 31920
aaaaagacat gcctcctgtc gcttgagcca
actgtggaat attcccaaat gctacaatga 31980
aaagagcgag ctttccgaag cctggctata
tgcggtcatg tgtgtccttg tcttctgcag 32040
cacaatcttt gccctcatga tctaccccca
ctttgatttg ggatggaatg cggtcgatgc 32100
catgaattac cctacctttc ccgcgcccga
tatgattcca ctccgacagg ttgtggtgcc 32160
cgtcgccctc aatcaacgcc ccccatcccc
tacacccact gaggtcagct actttaatct 32220
aacaggcgga gatgactgac actctagatc
tagaaatgga cggcatcggc accgagcagc 32280
gtctcctaca gaggcgcaag caggcggctg
aacaagagcg cctcaatcag gagctccgag 32340
atctcattaa cctgcaccag tgcaaaaaag
gcatcttttg cctggtcaag caggccgatg 32400
tcacctacga gaaaaccggt aacagccacc
gcctcagcta caagctgccc acccaacgcc 32460
agaagttggt gctcatggtg ggtcagaatc
ccatcaccgt cacccagcac tcggtggaga 32520
ccgaggggtg tctgcactcc ccctgtcagg
gtccggaaga cctctgcacc ctggtaaaga 32580
ccctgtgtgg tcttagagat ttaatcccct
ttaactaatc aaacactgga atcaataaaa 32640
agaatcactt actttaaatc agtcagcagg
tctctgtcca ctttattcag cagcacctcc 32700
ttcccctcct cccaactctg gtactccaaa
cgcctcctgg cggcaaactt cctccacacc 32760
ctgaagggaa tgtcagattc ttgctcctgt
ccctccgcac ccactatctt catgttgttg 32820
cagatgaagc gcgccaaaac gtctgacgag
accttcaacc ccgtgtaccc ctatgacacg 32880
gaaaacgggc ctccctccgt ccctttcctc
acccctccct tcgtgtcccc cgacggattt 32940
caagaaagcc ccccaggggt cctgtctctg
cgcctgtcag agcccctggt cacttcccac 33000
ggcatgcttg ccctgaaaat gggaaatggc
ctctccctgg atgacgccgg caacctcacc 33060
tctcaagatg tcaccaccgt cacccctccc
ctcaaaaaaa ccaagaccaa cctcagcctc 33120
cagacctcag cccccctgac cgttagctct
gggtccctca ccgtcgcggc cgccgctcca 33180
ctggcggtgg ccggcacctc tctcaccatg
caatctcagg cccccttgac agtgcaagat 33240
gcaaaactcg gcctggccac ccagggaccc
ctgaccgtgt ctgaaggcaa actcaccttg 33300
cagacatcgg ctccactgac ggccgctgac
agcagcactc tcactgttag tgccacacct 33360
cccctcagca caagcaatgg tagtttgagc
attgacatgc aggccccgat ttataccacc 33420
aatggaaaac tggcacttaa cattggtgct
cccctgcatg tggtagacac cctaaatgca 33480
ctaactgtag taactggcca gggtcttacc
ataaatggaa gagccctgca aactagagtc 33540
acgggtgccc tcagttatga cacagaaggc
aacatccaac tgcaagccgg agggggtatg 33600
cgcattgaca ataatggcca acttatcctt
aatgtagctt atccatttga tgctcaaaac 33660
aacctcagcc ttagacttgg ccaaggtccc
ctaattgtta actctgccca caacttggat 33720
cttaacctta acagaggcct ttacttattt
acatctggaa acacgaaaaa actggaagtt 33780
aacataaaaa cagccaaagg tctattttac
gatggcaccg ctatagcaat caatgcaggt 33840
gacgggctac agtttgggtc tggttcagat
acaaatccat tgcaaactaa acttggattg 33900
gggctggaat atgactccaa caaagctata
atcactaaac ttggaactgg cctaagcttt 33960
gacaacacag gtgccatcac agtaggcaac
aaaaatgatg acaagcttac cttgtggacc 34020
acaccagacc cctccccaaa ctgcagaatt
aattcagaaa aagatgctaa actcacacta 34080
gttttgacta aatgcggcag ccaggtgtta
gccagcgttt ctgttttatc tgtaaaaggc 34140
agccttgccc ccatcagcgg cacagtaact
agcgcccaga ttgttttaag atttgatgaa 34200
aacggagttt tattgagcaa ttcttctctt
gacccccaat actggaacta tagaaaaggc 34260
gattctacag aaggcactgc atatactaat
gctgtgggat ttatgcccaa cctcacagca 34320
taccctaaaa cacagagcca gactgctaaa
agcaacattg taagtcaagt ttacttgaat 34380
ggggacaaaa caaaacccat gaccctaacc
atcaccctca atggaactaa tgaaacaggg 34440
gatgctacag taagcacata ctccatgtca
ttttcatgga actggaatgg aagtaattac 34500
attaatgaca ccttccaaac caactccttt
accttctcct acatcgccca agaataaaaa 34560
agcatgacgc tttgttctct gattcagtgt
gtttctttta ttttttttca attacaacag 34620
aatcattcaa gtcattctcc atttagctta
atagacccag tagtgcaaag ccccatacta 34680
gcttatttca gacagtataa attaaaccat
accttttgat ttcaatatta aaaaaatcat 34740
cacaggatcc tagtcgtcag gccgccccct
ccctgccaag acacagaata cacaatcctc 34800
tccccccggc tggctttaaa caacaccatc
tggttggtga cagacaggtt cttcggggtt 34860
atattccaca cggtctcctg gcgggccagg
cgctcgtcgg tgatgctgat aaactctccc 34920
ggcagctcgc tcaagttcac gtcgctgtcc
agcggctgaa cctcatgctg acgcggtaac 34980
tgcgcgaccg gctgctgaac aaacggaggc
cgcgcctaca agggggtaga gtcataatcc 35040
tccgtcagga tagggcggtt atgcagcagc
agcgagcgaa tcatctgctg ccgccgccgc 35100
tccgtccggc aggaaaacaa catcccggtg
gtctcctccg ctataatccg caccgcccgc 35160
agcataagcc tcctcgttct ccgcgcgcag
caccgcaccc tgatctcact caggttggcg 35220
cagtaggtac agcacatcac cacgatgtta
ttcatgatcc cacagtgcaa ggcgctgtat 35280
ccaaagctca tgcccgggac caccgccccc
acgtgaccgt cgtaccagaa gcgcaggtaa 35340
atcaagtgcc gacccctcat gaacgtgctg
gacataaaca tcacctcctt gggcatgttg 35400
taattcacca cctcccggta ccagatgaat
ctctgattga acacggcccc ttccaccacc 35460
atcctgaacc aagaggctag gacctgccca
ccggctatgc actgcaggga acccgggtta 35520
gaacaatgac aatgcagact ccagggctcg
taaccgtgga tcatccggct gctgaagaca 35580
tcgatgttgg cgcaacacag acacacgtgc
atacacttcc tcatgattag cagctcctcc 35640
ctcgtcagga tcatatccca agggataacc
cattcttgaa tcaacgtaaa gcccacagag 35700
cagggaaggc ctcgcacata actcacgttg
tgcatggtta gcgtgttgca ttccggaaac 35760
agcggatgat cctccagtat cgaggcgcgg
gtctcgttct cacagggagg taaaggggcc 35820
ctgctgtacg gactgtggcg ggacgaccga
gatcgtgttg agcgtaacgt catggaaaag 35880
ggaacgccgg acgtggtcat acttcttgaa
gcagaaccag gctcgcgcgt gacagacctc 35940
cttgcgtcta cggtctcgcc gcttagctcg
ctccgtgtga tagttgtagt acagccactc 36000
tctcaaagcg tcgaggcgac acctggcgtc
aggatgtatg tagactccgt cttgcaccgc 36060
ggccctgata atatccacca ccgtagaata
agccacacca agccaagcaa tacactcgct 36120
ttgcgagcgg cagacaggag gagcggggag
agacggaagg accatcataa aattttaaag 36180
aatattttcc aatacttcga aatcaagatc
taccaaatgg caacgctccc ctccactggc 36240
gcggtcaaac tctacggcca aagaacagat
aacggcattt ttaagatgtt cccggacggc 36300
gtctaaaaga caaaccgctc tcaagtcgac
ataaattata agccaaaagc catcgggatc 36360
catatccact atggacgcgc cggcggcgtc
caccaaaccc aaataatttt cttctctcca 36420
gcgcagcaaa atcccagtaa gcaactccct
gatattaaga tgaaccatgc caaaaatctg 36480
ttcaagagcg ccctccacct tcattctcaa
gcagcgcatc atgattgcaa aaattcaggt 36540
tcctcagaca cctgtatgag attcaaaacg
ggaatattaa caaaaattcc tctgtcgcgc 36600
agatcccttc gcagggcaag ctgaacataa
tcagacaggt ctgaacgaac cagcgaggcc 36660
aaatccccgc caggaaccag atccagagac
cctatgctga ttatgacgcg catactcggg 36720
gctatgctaa ccagcgtagc gccgatgtag
gcgtgctgca tgggcggcga aataaaatgc 36780
aaggtgctgg ttaaaaaatc aggcaaagcc
tcgcgcaaaa aagctaagac atcataatca 36840
tgctcatgca ggtagttgca ggtaagctca
ggaaccaaaa cggaataaca cacgattttc 36900
ctctcaaaca tgacttccag gtgactgcat
aagaaaaaaa ttataaataa taaatattaa 36960
ttaaataaat taaacattgg aagcctgtct
cacaacagga aaaaccactc tgatcaacat 37020
aagacgggcc acgggcatgc ccgcgtgacc
ataaaaaaat cggtctccgt gattacaaag 37080
caccacagat agctccccgg tcatgtcggg
ggtcatcatg tgagactgtg tatacacgtc 37140
cgggctgttg acatcggtca aagaaagaaa
tcgagctaca tagcccggag gaatcaacac 37200
ccgcacgcgg aggtacagca aaacggtccc
cataggagga atcacaaaat tagtaggaga 37260
aaaaaaaaca taaacaccag aaaaaccctc
ttgccgaggc aaaacagcgc cctcccgttc 37320
caaaacaaca taaagcgctt ccacaggagc
agccatgaca aagacccgag tcttaccagg 37380
aaaattttaa aaaagattcc tcaacgcagc
accagcacca acacctgtca gtgtaaaatg 37440
ccaagcgccg agcgagtata tataggaata
aaaagtgacg taaacggtta aagtccagaa 37500
aacgcccaga aaaaccgcac gcgaacctac
gccccgaaac gaaagccaaa aaacagtgaa 37560
cacgcccttt cggcgtcaac ttccgctttc
ccacggtacg tcacttccgc atatagtaaa 37620
actacgctac ccaacatgca agaagccacg
ccccaaaaca cgtcacacct cccggcccgc 37680
cccgcgccgc cgctcctccc cgccccgccc
cgctccgccc acctcattat catattggct 37740
tcaatccaaa ataaggtata ttattgatga
tg 37772
<210> 14
<211> 440
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 14
Met Ser Lys Lys Arg Val Arg Val
Asp Asp Asp Phe Asp Pro Val Tyr
1 5 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro
Thr Val Pro Phe Ile Asn Pro Pro
20 25 30
Phe Val Ser Ser Asp Gly Phe Gln
Glu Lys Pro Leu Gly Val Leu Ser
35 40 45
Leu Arg Leu Ala Asp Pro Val Thr
Thr Lys Asn Gly Glu Ile Thr Leu
50 55 60
Lys Leu Gly Glu Gly Val Asp Leu
Asp Asp Ser Gly Lys Leu Ile Ser
65 70 75 80
Lys Asn Ala Thr Lys Ala Thr Ala
Pro Leu Ser Ile Ser Asn Ser Thr
85 90 95
Ile Ser Leu Asn Met Asp Ala Pro
Leu Tyr Asn Asn Asn Gly Lys Leu
100 105 110
Gly Ile Arg Ile Gly Ala Pro Leu
Lys Val Val Asp Leu Leu Asn Thr
115 120 125
Leu Ala Val Ala Tyr Gly Ser Gly
Leu Gly Leu Lys Asn Asn Ala Leu
130 135 140
Thr Val Gln Leu Val Ser Pro Leu
Thr Phe Asp Asn Lys Gly Asn Val
145 150 155 160
Lys Ile Asn Leu Gly Asn Gly Pro
Leu Thr Val Ala Ala Asn Arg Leu
165 170 175
Ser Val Thr Cys Lys Arg Gly Leu
Tyr Val Thr Thr Thr Gly Asp Ala
180 185 190
Leu Glu Ser Asn Ile Ser Trp Ala
Lys Gly Ile Arg Phe Glu Gly Asn
195 200 205
Ala Ile Ala Ala Asn Ile Gly Lys
Gly Leu Glu Phe Gly Thr Thr Ser
210 215 220
Ser Glu Ser Asp Val Ser Asn Ala
Tyr Pro Ile Gln Val Lys Leu Gly
225 230 235 240
Thr Gly Leu Thr Phe Asp Ser Thr
Gly Ala Ile Val Ala Trp Asn Lys
245 250 255
Glu Asp Asp Lys Leu Thr Leu Trp
Thr Thr Ala Asp Pro Ser Pro Asn
260 265 270
Cys His Ile Tyr Ser Asp Lys Asp
Ala Lys Leu Thr Leu Cys Leu Thr
275 280 285
Lys Cys Gly Ser Gln Ile Leu Gly
Thr Val Ser Leu Ile Ala Val Asp
290 295 300
Thr Gly Ser Leu Asn Pro Ile Thr
Gly Gln Val Thr Thr Ala Leu Val
305 310 315 320
Ser Leu Lys Phe Asp Ala Asn Gly
Val Leu Gln Thr Ser Ser Thr Leu
325 330 335
Asp Lys Glu Tyr Trp Asn Phe Arg
Lys Gly Asp Val Thr Pro Ala Glu
340 345 350
Pro Tyr Thr Asn Ala Ile Gly Phe
Met Pro Asn Ile Lys Ala Tyr Pro
355 360 365
Lys Asn Thr Asn Ser Ala Ala Lys
Ser His Ile Val Gly Lys Val Tyr
370 375 380
Leu His Gly Glu Val Ser Lys Pro
Leu Asp Leu Ile Ile Thr Phe Asn
385 390 395 400
Glu Thr Ser Asn Glu Thr Cys Thr
Tyr Cys Ile Asn Phe Gln Trp Gln
405 410 415
Trp Gly Thr Asp Lys Tyr Lys Asn
Glu Thr Leu Ala Val Ser Ser Phe
420 425 430
Thr Phe Ser Tyr Ile Ala Gln Glu
435 440
<210> 15
<211> 443
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 15
Met Ser Lys Lys Arg Val Arg Val
Asp Asp Asp Phe Asp Pro Val Tyr
1 5 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro
Thr Val Pro Phe Ile Asn Pro Pro
20 25 30
Phe Val Ser Ser Asp Gly Phe Gln
Glu Lys Pro Leu Gly Val Leu Ser
35 40 45
Leu Arg Leu Ala Asp Pro Val Thr
Thr Lys Asn Gly Glu Ile Thr Leu
50 55 60
Lys Leu Gly Glu Gly Val Asp Leu
Asp Asp Ser Gly Lys Leu Ile Ser
65 70 75 80
Lys Asn Ala Thr Lys Ala Thr Ala
Pro Leu Ser Ile Ser Asn Ser Thr
85 90 95
Ile Ser Leu Asn Met Ala Ala Pro
Phe Tyr Asn Asn Asn Gly Thr Leu
100 105 110
Ser Leu Asn Val Ser Thr Pro Leu
Ala Val Phe Pro Thr Phe Asn Thr
115 120 125
Leu Gly Ile Ser Leu Gly Asn Gly
Leu Gln Thr Ser Asn Lys Leu Leu
130 135 140
Ala Val Gln Leu Thr His Pro Leu
Thr Phe Ser Ser Asn Ser Ile Thr
145 150 155 160
Val Lys Thr Asp Lys Gly Leu Tyr
Ile Asn Ser Ser Gly Asn Arg Gly
165 170 175
Leu Glu Ala Asn Ile Ser Leu Lys
Arg Gly Leu Ile Phe Asp Gly Asn
180 185 190
Ala Ile Ala Thr Tyr Leu Gly Ser
Gly Leu Asp Tyr Gly Ser Tyr Asp
195 200 205
Ser Asp Gly Lys Thr Arg Pro Ile
Ile Thr Lys Ile Gly Ala Gly Leu
210 215 220
Asn Phe Asp Ser Asn Asn Ala Met
Ala Val Lys Leu Gly Thr Gly Leu
225 230 235 240
Ser Phe Asp Ser Ala Gly Ala Leu
Thr Ala Gly Asn Lys Glu Asp Asp
245 250 255
Lys Leu Thr Leu Trp Thr Thr Pro
Asp Pro Ser Pro Asn Cys Gln Leu
260 265 270
Leu Ser Asp Arg Asp Ala Lys Phe
Thr Leu Cys Leu Thr Lys Cys Gly
275 280 285
Ser Gln Ile Leu Gly Thr Val Ala
Val Ala Ala Val Thr Val Ser Ser
290 295 300
Ala Leu Asn Pro Ile Asn Asp Thr
Val Lys Ser Ala Ile Val Phe Leu
305 310 315 320
Arg Phe Asp Ser Asp Gly Val Leu
Met Ser Asn Ser Ser Met Val Gly
325 330 335
Asp Tyr Trp Asn Phe Arg Glu Gly
Gln Thr Thr Gln Ser Val Ala Tyr
340 345 350
Thr Asn Ala Val Gly Phe Met Pro
Asn Leu Gly Ala Tyr Pro Lys Thr
355 360 365
Gln Ser Lys Thr Pro Lys Asn Ser
Ile Val Ser Gln Val Tyr Leu Asn
370 375 380
Gly Glu Thr Thr Met Pro Met Thr
Leu Thr Ile Thr Phe Asn Gly Thr
385 390 395 400
Asp Glu Lys Asp Thr Thr Pro Val
Ser Thr Tyr Ser Met Thr Phe Thr
405 410 415
Trp Gln Trp Thr Gly Asp Tyr Lys
Asp Lys Asn Ile Thr Phe Ala Thr
420 425 430
Asn Ser Phe Thr Phe Ser Tyr Met
Ala Gln Glu
435 440
<210> 16
<211> 425
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 16
Met Ser Lys Lys Arg Val Arg Val
Asp Asp Asp Phe Asp Pro Val Tyr
1 5 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro
Thr Val Pro Phe Ile Asn Pro Pro
20 25 30
Phe Val Ser Ser Asp Gly Phe Gln
Glu Lys Pro Leu Gly Val Leu Ser
35 40 45
Leu Arg Leu Ala Asp Pro Val Thr
Thr Lys Asn Gly Glu Ile Thr Leu
50 55 60
Lys Leu Gly Glu Gly Val Asp Leu
Asp Ser Ser Gly Lys Leu Ile Ser
65 70 75 80
Asn Thr Ala Thr Lys Ala Ala Ala
Pro Leu Ser Phe Ser Asn Asn Thr
85 90 95
Ile Ser Leu Asn Met Asp His Pro
Phe Tyr Thr Lys Asp Gly Lys Leu
100 105 110
Ala Leu Gln Val Ser Pro Pro Leu
Asn Ile Leu Arg Thr Ser Ile Leu
115 120 125
Asn Thr Leu Ala Leu Gly Phe Gly
Ser Gly Leu Gly Leu Arg Gly Ser
130 135 140
Ala Leu Ala Val Gln Leu Val Ser
Pro Leu Thr Phe Asp Thr Asp Gly
145 150 155 160
Asn Ile Lys Leu Thr Leu Asp Arg
Gly Leu His Val Thr Thr Gly Asp
165 170 175
Ala Ile Glu Ser Asn Ile Ser Trp
Ala Lys Gly Leu Lys Phe Glu Asp
180 185 190
Gly Ala Ile Ala Thr Asn Ile Gly
Asn Gly Leu Glu Phe Gly Ser Ser
195 200 205
Ser Thr Glu Thr Gly Val Asp Asp
Ala Tyr Pro Ile Gln Val Lys Leu
210 215 220
Gly Ser Gly Leu Ser Phe Asp Ser
Thr Gly Ala Ile Met Ala Gly Asn
225 230 235 240
Lys Glu Asp Asp Lys Leu Thr Leu
Trp Thr Thr Pro Asp Pro Ser Pro
245 250 255
Asn Cys Gln Ile Leu Ala Glu Asn
Asp Ala Lys Leu Thr Leu Cys Leu
260 265 270
Thr Lys Cys Gly Ser Gln Ile Leu
Ala Thr Val Ser Val Leu Val Val
275 280 285
Gly Ser Gly Asn Leu Asn Pro Ile
Thr Gly Thr Val Ser Ser Ala Gln
290 295 300
Val Phe Leu Arg Phe Asp Ala Asn
Gly Val Leu Leu Thr Glu His Ser
305 310 315 320
Thr Leu Lys Lys Tyr Trp Gly Tyr
Arg Gln Gly Asp Ser Ile Asp Gly
325 330 335
Thr Pro Tyr Val Asn Ala Val Gly
Phe Met Pro Asn Leu Lys Ala Tyr
340 345 350
Pro Lys Ser Gln Ser Ser Thr Thr
Lys Asn Asn Ile Val Gly Gln Val
355 360 365
Tyr Met Asn Gly Asp Val Ser Lys
Pro Met Leu Leu Thr Ile Thr Leu
370 375 380
Asn Gly Thr Asp Asp Ser Asn Ser
Thr Tyr Ser Met Ser Phe Ser Tyr
385 390 395 400
Thr Trp Thr Asn Gly Ser Tyr Val
Gly Ala Thr Phe Gly Ala Asn Ser
405 410 415
Tyr Thr Phe Ser Tyr Ile Ala Gln
Glu
420 425
<210> 17
<211> 425
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 17
Met Ser Lys Lys Arg Val Arg Val
Asp Asp Asp Phe Asp Pro Val Tyr
1 5 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro
Thr Val Pro Phe Ile Asn Pro Pro
20 25 30
Phe Val Ser Ser Asp Gly Phe Gln
Glu Lys Pro Leu Gly Val Leu Ser
35 40 45
Leu Arg Leu Ala Asp Pro Val Thr
Thr Lys Asn Gly Glu Ile Thr Leu
50 55 60
Lys Leu Gly Glu Gly Leu Asp Leu
Asp Ser Ser Gly Lys Leu Ile Ser
65 70 75 80
Asn Thr Ala Thr Lys Ala Ala Ala
Pro Leu Ser Phe Ser Asn Asn Thr
85 90 95
Ile Ser Leu Asn Met Asp His Pro
Phe Tyr Thr Lys Asp Gly Lys Leu
100 105 110
Ser Leu Gln Val Ser Pro Pro Leu
Asn Ile Leu Arg Thr Ser Ile Leu
115 120 125
Asn Thr Leu Ala Leu Gly Phe Gly
Ser Gly Leu Gly Leu Arg Gly Ser
130 135 140
Ala Leu Ala Val Gln Leu Val Ser
Pro Leu Thr Phe Asp Thr Asp Gly
145 150 155 160
Asn Ile Lys Leu Thr Leu Asp Arg
Gly Leu His Val Thr Thr Gly Asp
165 170 175
Ala Ile Glu Ser Asn Ile Ser Trp
Ala Lys Gly Leu Lys Phe Glu Asp
180 185 190
Gly Ala Ile Ala Thr Asn Ile Gly
Asn Gly Leu Glu Phe Gly Ser Ser
195 200 205
Ser Thr Glu Thr Gly Val Asp Asp
Ala Tyr Pro Ile Gln Val Lys Leu
210 215 220
Gly Ser Gly Leu Ser Phe Asp Ser
Thr Gly Ala Ile Met Ala Gly Asn
225 230 235 240
Lys Glu Asp Asp Lys Leu Thr Leu
Trp Thr Thr Pro Asp Pro Ser Pro
245 250 255
Asn Cys Gln Ile Leu Ala Glu Asn
Asp Ala Lys Leu Thr Leu Cys Leu
260 265 270
Thr Lys Cys Gly Ser Gln Ile Leu
Ala Thr Val Ser Val Leu Val Val
275 280 285
Gly Ser Gly Asn Leu Asn Pro Ile
Thr Gly Thr Val Ser Ser Ala Gln
290 295 300
Val Phe Leu Arg Phe Asp Ala Asn
Gly Val Leu Leu Thr Glu His Ser
305 310 315 320
Thr Leu Lys Lys Tyr Trp Gly Tyr
Arg Gln Gly Asp Ser Ile Asp Gly
325 330 335
Thr Pro Tyr Thr Asn Ala Val Gly
Phe Met Pro Asn Leu Lys Ala Tyr
340 345 350
Pro Lys Ser Gln Ser Ser Thr Thr
Lys Asn Asn Ile Val Gly Gln Val
355 360 365
Tyr Met Asn Gly Asp Val Ser Lys
Pro Met Leu Leu Thr Ile Thr Leu
370 375 380
Asn Gly Thr Asp Asp Ser Asn Ser
Thr Tyr Ser Met Ser Phe Ser Tyr
385 390 395 400
Thr Trp Thr Asn Gly Ser Tyr Val
Gly Ala Thr Phe Gly Ala Asn Ser
405 410 415
Tyr Thr Phe Ser Tyr Ile Ala Gln
Glu
420 425
<210> 18
<211> 442
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 18
Met Ser Lys Lys Arg Ala Arg Val
Asp Asp Gly Phe Asp Pro Val Tyr
1 5 10 15
Pro Tyr Asp Ala Asp Asn Ala Pro
Thr Val Pro Phe Ile Asn Pro Pro
20 25 30
Phe Val Ser Ser Asp Gly Phe Gln
Glu Lys Pro Leu Gly Val Leu Ser
35 40 45
Leu Arg Leu Ala Asp Pro Val Thr
Thr Lys Asn Gly Ala Val Pro Leu
50 55 60
Lys Leu Gly Glu Gly Val Asp Leu
Asp Asp Ser Gly Lys Leu Ile Ser
65 70 75 80
Lys Lys Ser Thr Lys Ala Asn Ser
Pro Leu Ser Ile Ser Asn Asn Thr
85 90 95
Ile Ser Leu Asn Met Asp Thr Pro
Phe Tyr Thr Lys Asp Gly Lys Leu
100 105 110
Thr Met Gln Val Thr Ala Pro Leu
Lys Leu Ala Asn Thr Ala Ile Leu
115 120 125
Asn Thr Leu Ala Met Ala Tyr Gly
Asn Gly Leu Gly Leu Asn Asn Asn
130 135 140
Ala Leu Thr Val Gln Val Thr Ser
Pro Leu Thr Phe Asp Asn Ser Lys
145 150 155 160
Val Lys Ile Asn Leu Gly Asn Gly
Pro Leu Met Val Ser Ala Asn Lys
165 170 175
Leu Ser Ile Asn Cys Leu Arg Gly
Leu Tyr Val Ala Pro Asn Asn Thr
180 185 190
Gly Leu Glu Thr Asn Ile Ser Trp
Ala Asn Ala Met Arg Phe Glu Gly
195 200 205
Asn Ala Met Ala Val Tyr Ile Asp
Thr Asn Lys Gly Leu Gln Phe Gly
210 215 220
Thr Thr Ser Thr Glu Thr Gly Val
Thr Asn Ala Tyr Pro Ile Gln Val
225 230 235 240
Lys Leu Gly Ala Gly Leu Ala Phe
Asp Ser Thr Gly Ala Ile Val Ala
245 250 255
Trp Asn Lys Glu Asn Asp Ser Leu
Thr Leu Trp Thr Thr Pro Asp Pro
260 265 270
Ser Pro Asn Cys Lys Ile Ala Ser
Glu Lys Asp Ala Lys Leu Thr Leu
275 280 285
Cys Leu Thr Lys Cys Gly Ser Gln
Ile Leu Gly Thr Val Ser Leu Leu
290 295 300
Ala Val Ser Gly Ser Leu Ala Pro
Ile Thr Gly Ala Val Ser Thr Ala
305 310 315 320
Leu Val Ser Leu Lys Phe Asn Ala
Asn Gly Ala Leu Leu Asp Lys Ser
325 330 335
Thr Leu Asn Lys Glu Tyr Trp Asn
Tyr Arg Gln Gly Asp Leu Ile Pro
340 345 350
Gly Thr Pro Tyr Thr His Ala Val
Gly Phe Met Pro Asn Lys Lys Ala
355 360 365
Tyr Pro Lys Asn Thr Thr Ala Ala
Ser Lys Ser His Ile Val Gly Asp
370 375 380
Val Tyr Leu Asp Gly Asp Ala Asp
Lys Pro Leu Ser Leu Ile Ile Thr
385 390 395 400
Phe Asn Glu Thr Asp Asp Glu Thr
Cys Asp Tyr Cys Ile Asn Phe Gln
405 410 415
Trp Lys Trp Gly Ala Asp Gln Tyr
Lys Asp Lys Thr Leu Ala Thr Ser
420 425 430
Ser Phe Thr Phe Ser Tyr Ile Ala
Gln Glu
435 440
<210> 19
<211> 577
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 19
Met Lys Arg Ala Lys Thr Ser Asp
Glu Thr Phe Asn Pro Val Tyr Pro
1 5 10 15
Tyr Asp Thr Glu Asn Gly Pro Pro
Ser Val Pro Phe Leu Thr Pro Pro
20 25 30
Phe Val Ser Pro Asp Gly Phe Gln
Glu Ser Pro Pro Gly Val Leu Ser
35 40 45
Leu Arg Leu Ser Glu Pro Leu Val
Thr Ser His Gly Met Leu Ala Leu
50 55 60
Lys Met Gly Asn Gly Leu Ser Leu
Asp Asp Ala Gly Asn Leu Thr Ser
65 70 75 80
Gln Asp Val Thr Thr Val Thr Pro
Pro Leu Lys Lys Thr Lys Thr Asn
85 90 95
Leu Ser Leu Gln Thr Ser Ala Pro
Leu Thr Val Ser Ser Gly Ser Leu
100 105 110
Thr Val Ala Ala Ala Ala Pro Leu
Ala Val Ala Gly Thr Ser Leu Thr
115 120 125
Met Gln Ser Gln Ala Pro Leu Thr
Val Gln Asp Ala Lys Leu Gly Leu
130 135 140
Ala Thr Gln Gly Pro Leu Thr Val
Ser Glu Gly Lys Leu Thr Leu Gln
145 150 155 160
Thr Ser Ala Pro Leu Thr Ala Ala
Asp Ser Ser Thr Leu Thr Val Ser
165 170 175
Ala Thr Pro Pro Leu Ser Thr Ser
Asn Gly Ser Leu Ser Ile Asp Met
180 185 190
Gln Ala Pro Ile Tyr Thr Thr Asn
Gly Lys Leu Ala Leu Asn Ile Gly
195 200 205
Ala Pro Leu His Val Val Asp Thr
Leu Asn Ala Leu Thr Val Val Thr
210 215 220
Gly Gln Gly Leu Thr Ile Asn Gly
Arg Ala Leu Gln Thr Arg Val Thr
225 230 235 240
Gly Ala Leu Ser Tyr Asp Thr Glu
Gly Asn Ile Gln Leu Gln Ala Gly
245 250 255
Gly Gly Met Arg Ile Asp Asn Asn
Gly Gln Leu Ile Leu Asn Val Ala
260 265 270
Tyr Pro Phe Asp Ala Gln Asn Asn
Leu Ser Leu Arg Leu Gly Gln Gly
275 280 285
Pro Leu Ile Val Asn Ser Ala His
Asn Leu Asp Leu Asn Leu Asn Arg
290 295 300
Gly Leu Tyr Leu Phe Thr Ser Gly
Asn Thr Lys Lys Leu Glu Val Asn
305 310 315 320
Ile Lys Thr Ala Lys Gly Leu Phe
Tyr Asp Gly Thr Ala Ile Ala Ile
325 330 335
Asn Ala Gly Asp Gly Leu Gln Phe
Gly Ser Gly Ser Asp Thr Asn Pro
340 345 350
Leu Gln Thr Lys Leu Gly Leu Gly
Leu Glu Tyr Asp Ser Asn Lys Ala
355 360 365
Ile Ile Thr Lys Leu Gly Thr Gly
Leu Ser Phe Asp Asn Thr Gly Ala
370 375 380
Ile Thr Val Gly Asn Lys Asn Asp
Asp Lys Leu Thr Leu Trp Thr Thr
385 390 395 400
Pro Asp Pro Ser Pro Asn Cys Arg
Ile Asn Ser Glu Lys Asp Ala Lys
405 410 415
Leu Thr Leu Val Leu Thr Lys Cys
Gly Ser Gln Val Leu Ala Ser Val
420 425 430
Ser Val Leu Ser Val Lys Gly Ser
Leu Ala Pro Ile Ser Gly Thr Val
435 440 445
Thr Ser Ala Gln Ile Val Leu Arg
Phe Asp Glu Asn Gly Val Leu Leu
450 455 460
Ser Asn Ser Ser Leu Asp Pro Gln
Tyr Trp Asn Tyr Arg Lys Gly Asp
465 470 475 480
Ser Thr Glu Gly Thr Ala Tyr Thr
Asn Ala Val Gly Phe Met Pro Asn
485 490 495
Leu Thr Ala Tyr Pro Lys Thr Gln
Ser Gln Thr Ala Lys Ser Asn Ile
500 505 510
Val Ser Gln Val Tyr Leu Asn Gly
Asp Lys Thr Lys Pro Met Thr Leu
515 520 525
Thr Ile Thr Leu Asn Gly Thr Asn
Glu Thr Gly Asp Ala Thr Val Ser
530 535 540
Thr Tyr Ser Met Ser Phe Ser Trp
Asn Trp Asn Gly Ser Asn Tyr Ile
545 550 555 560
Asn Asp Thr Phe Gln Thr Asn Ser
Phe Thr Phe Ser Tyr Ile Ala Gln
565 570 575
Glu
<210> 20
<211> 937
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 20
Met Ala Thr Pro Ser Met Leu Pro
Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu
Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser
Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val
Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp
Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val
Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg
Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala
Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Thr Ser Gln Trp Ile
Thr Lys Asp Asn Gly Thr Asp Lys
130 135 140
Thr Tyr Ser Phe Gly Asn Ala Pro
Val Arg Gly Leu Asp Ile Thr Glu
145 150 155 160
Glu Gly Leu Gln Ile Gly Pro Asp
Glu Ser Gly Gly Glu Ser Lys Lys
165 170 175
Ile Phe Ala Asp Lys Thr Tyr Gln
Pro Glu Pro Gln Leu Gly Asp Glu
180 185 190
Glu Trp His Asp Thr Ile Gly Ala
Glu Asp Lys Tyr Gly Gly Arg Ala
195 200 205
Leu Lys Pro Ala Thr Asn Met Lys
Pro Cys Tyr Gly Ser Phe Ala Lys
210 215 220
Pro Thr Asn Ala Lys Gly Gly Gln
Ala Lys Ser Arg Thr Lys Asp Asp
225 230 235 240
Gly Thr Thr Glu Pro Asp Ile Asp
Met Ala Phe Phe Asp Asp Arg Ser
245 250 255
Gln Gln Ala Ser Phe Ser Pro Glu
Leu Val Leu Tyr Thr Glu Asn Val
260 265 270
Asp Leu Asp Thr Pro Asp Thr His
Ile Ile Tyr Lys Pro Gly Thr Asp
275 280 285
Glu Thr Ser Ser Ser Phe Asn Leu
Gly Gln Gln Ser Met Pro Asn Arg
290 295 300
Pro Asn Tyr Ile Gly Phe Arg Asp
Asn Phe Ile Gly Leu Met Tyr Tyr
305 310 315 320
Asn Ser Thr Gly Asn Met Gly Val
Leu Ala Gly Gln Ala Ser Gln Leu
325 330 335
Asn Ala Val Val Asp Leu Gln Asp
Arg Asn Thr Glu Leu Ser Tyr Gln
340 345 350
Leu Leu Leu Asp Ser Leu Gly Asp
Arg Thr Arg Tyr Phe Ser Met Trp
355 360 365
Asn Gln Ala Val Asp Ser Tyr Asp
Pro Asp Val Arg Ile Ile Glu Asn
370 375 380
His Gly Val Glu Asp Glu Leu Pro
Asn Tyr Cys Phe Pro Leu Asn Gly
385 390 395 400
Val Gly Phe Thr Asp Thr Phe Gln
Gly Ile Lys Val Lys Thr Thr Asn
405 410 415
Asn Gly Thr Ala Asn Ala Thr Glu
Trp Glu Ser Asp Thr Ser Val Asn
420 425 430
Asn Ala Asn Glu Ile Ala Lys Gly
Asn Pro Phe Ala Met Glu Ile Asn
435 440 445
Ile Gln Ala Asn Leu Trp Arg Asn
Phe Leu Tyr Ala Asn Val Ala Leu
450 455 460
Tyr Leu Pro Asp Ser Tyr Lys Tyr
Thr Pro Ala Asn Ile Thr Leu Pro
465 470 475 480
Thr Asn Thr Asn Thr Tyr Asp Tyr
Met Asn Gly Arg Val Val Ala Pro
485 490 495
Ser Leu Val Asp Ala Tyr Ile Asn
Ile Gly Ala Arg Trp Ser Leu Asp
500 505 510
Pro Met Asp Asn Val Asn Pro Phe
Asn His His Arg Asn Ala Gly Leu
515 520 525
Arg Tyr Arg Ser Met Leu Leu Gly
Asn Gly Arg Tyr Val Pro Phe His
530 535 540
Ile Gln Val Pro Gln Lys Phe Phe
Ala Ile Lys Ser Leu Leu Leu Leu
545 550 555 560
Pro Gly Ser Tyr Thr Tyr Glu Trp
Asn Phe Arg Lys Asp Val Asn Met
565 570 575
Ile Leu Gln Ser Ser Leu Gly Asn
Asp Leu Arg Thr Asp Gly Ala Ser
580 585 590
Ile Ala Phe Thr Ser Ile Asn Leu
Tyr Ala Thr Phe Phe Pro Met Ala
595 600 605
His Asn Thr Ala Ser Thr Leu Glu
Ala Met Leu Arg Asn Asp Thr Asn
610 615 620
Asp Gln Ser Phe Asn Asp Tyr Leu
Ser Ala Ala Asn Met Leu Tyr Pro
625 630 635 640
Ile Pro Ala Asn Ala Thr Asn Val
Pro Ile Ser Ile Pro Ser Arg Asn
645 650 655
Trp Ala Ala Phe Arg Gly Trp Ser
Phe Thr Arg Leu Lys Thr Arg Glu
660 665 670
Thr Pro Ser Leu Gly Ser Gly Phe
Asp Pro Tyr Phe Val Tyr Ser Gly
675 680 685
Ser Ile Pro Tyr Leu Asp Gly Thr
Phe Tyr Leu Asn His Thr Phe Lys
690 695 700
Lys Val Ser Ile Thr Phe Asp Ser
Ser Val Ser Trp Pro Gly Asn Asp
705 710 715 720
Arg Leu Leu Thr Pro Asn Glu Phe
Glu Ile Lys Arg Thr Val Asp Gly
725 730 735
Glu Gly Tyr Asn Val Ala Gln Cys
Asn Met Thr Lys Asp Trp Phe Leu
740 745 750
Val Gln Met Leu Ala His Tyr Asn
Ile Gly Tyr Gln Gly Phe Tyr Val
755 760 765
Pro Glu Gly Tyr Lys Asp Arg Met
Tyr Ser Phe Phe Arg Asn Phe Gln
770 775 780
Pro Met Ser Arg Gln Val Val Asp
Glu Val Asn Tyr Lys Asp Tyr Gln
785 790 795 800
Ala Val Thr Leu Ala Tyr Gln His
Asn Asn Ser Gly Phe Val Gly Tyr
805 810 815
Leu Ala Pro Thr Met Arg Gln Gly
Gln Pro Tyr Pro Ala Asn Tyr Pro
820 825 830
Tyr Pro Leu Ile Gly Lys Ser Ala
Val Ala Ser Val Thr Gln Lys Lys
835 840 845
Phe Leu Cys Asp Arg Val Met Trp
Arg Ile Pro Phe Ser Ser Asn Phe
850 855 860
Met Ser Met Gly Ala Leu Thr Asp
Leu Gly Gln Asn Met Leu Tyr Ala
865 870 875 880
Asn Ser Ala His Ala Leu Asp Met
Asn Phe Glu Val Asp Pro Met Asp
885 890 895
Glu Ser Thr Leu Leu Tyr Val Val
Phe Glu Val Phe Asp Val Val Arg
900 905 910
Val His Gln Pro His Arg Gly Val
Ile Glu Ala Val Tyr Leu Arg Thr
915 920 925
Pro Phe Ser Ala Gly Asn Ala Thr
Thr
930 935
<210> 21
<211> 937
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 21
Met Ala Thr Pro Ser Met Leu Pro
Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu
Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser
Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val
Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp
Gly Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val
Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg
Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala
Tyr Asn Ala Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Thr Ser Gln Trp Ile
Thr Lys Asp Asn Gly Thr Asp Lys
130 135 140
Thr Tyr Ser Phe Gly Asn Ala Pro
Val Arg Gly Leu Asp Ile Thr Glu
145 150 155 160
Glu Gly Leu Gln Ile Arg Thr Asp
Glu Ser Gly Gly Glu Ser Lys Lys
165 170 175
Ile Phe Ala Asp Lys Thr Tyr Gln
Pro Glu Pro Gln Leu Gly Asp Glu
180 185 190
Glu Trp His Asp Thr Ile Gly Ala
Glu Asp Lys Tyr Gly Gly Arg Ala
195 200 205
Leu Lys Pro Ala Thr Asn Met Lys
Pro Cys Tyr Gly Ser Phe Ala Lys
210 215 220
Pro Thr Asn Ala Lys Gly Gly Gln
Ala Lys Ser Arg Thr Lys Asp Asp
225 230 235 240
Gly Thr Thr Glu Pro Asp Ile Asp
Met Ala Phe Phe Asp Asp Arg Ser
245 250 255
Gln Gln Ala Ser Phe Ser Pro Glu
Leu Val Leu Tyr Thr Glu Asn Val
260 265 270
Asp Leu Asp Thr Pro Asp Thr His
Ile Ile Tyr Lys Pro Gly Thr Asp
275 280 285
Glu Thr Ser Ser Ser Phe Asn Leu
Gly Gln Gln Ser Met Pro Asn Arg
290 295 300
Pro Asn Tyr Ile Gly Phe Arg Asp
Asn Phe Ile Gly Leu Met Tyr Tyr
305 310 315 320
Asn Ser Thr Gly Asn Met Gly Val
Leu Ala Gly Gln Ala Ser Gln Leu
325 330 335
Asn Ala Val Val Asp Leu Gln Asp
Arg Asn Thr Glu Leu Ser Tyr Gln
340 345 350
Leu Leu Leu Asp Ser Leu Gly Asp
Arg Thr Arg Tyr Phe Ser Met Trp
355 360 365
Asn Gln Ala Val Asp Ser Tyr Asp
Pro Asp Val Arg Ile Ile Glu Asn
370 375 380
His Gly Val Glu Asp Glu Leu Pro
Asn Tyr Cys Phe Pro Leu Asn Gly
385 390 395 400
Val Gly Phe Thr Asp Thr Phe Gln
Gly Ile Lys Val Lys Thr Thr Asn
405 410 415
Asn Gly Thr Ala Asn Ala Thr Glu
Trp Glu Ser Asp Thr Ser Val Asn
420 425 430
Asn Ala Asn Glu Ile Ala Lys Gly
Asn Pro Phe Ala Met Glu Ile Asn
435 440 445
Ile Gln Ala Asn Leu Trp Arg Asn
Phe Leu Tyr Ala Asn Val Ala Leu
450 455 460
Tyr Leu Pro Asp Ser Tyr Lys Tyr
Thr Pro Ala Asn Ile Thr Leu Pro
465 470 475 480
Thr Asn Thr Asn Thr Tyr Asp Tyr
Met Asn Gly Arg Val Val Ala Pro
485 490 495
Ser Leu Val Asp Ala Tyr Ile Asn
Ile Gly Ala Arg Trp Ser Leu Asp
500 505 510
Pro Met Asp Asn Val Asn Pro Phe
Asn His His Arg Asn Ala Gly Leu
515 520 525
Arg Tyr Arg Ser Met Leu Leu Gly
Asn Gly Arg Tyr Val Pro Phe His
530 535 540
Ile Gln Val Pro Gln Lys Phe Phe
Ala Ile Lys Ser Leu Leu Leu Leu
545 550 555 560
Pro Gly Ser Tyr Thr Tyr Glu Trp
Asn Phe Arg Lys Asp Val Asn Met
565 570 575
Ile Leu Gln Ser Ser Leu Gly Asn
Asp Leu Arg Thr Asp Gly Ala Ser
580 585 590
Ile Ala Phe Thr Ser Ile Asn Leu
Tyr Ala Thr Phe Phe Pro Met Ala
595 600 605
His Asn Thr Ala Ser Thr Leu Glu
Ala Met Leu Arg Asn Asp Thr Asn
610 615 620
Asp Gln Ser Phe Asn Asp Tyr Leu
Ser Ala Ala Asn Met Leu Tyr Pro
625 630 635 640
Ile Pro Ala Asn Ala Thr Asn Val
Pro Ile Ser Ile Pro Ser Arg Asn
645 650 655
Trp Ala Ala Phe Arg Gly Trp Ser
Phe Thr Arg Leu Lys Thr Arg Glu
660 665 670
Thr Pro Ser Leu Gly Ser Gly Phe
Asp Pro Tyr Phe Val Tyr Ser Gly
675 680 685
Ser Ile Pro Tyr Leu Asp Gly Thr
Phe Tyr Leu Asn His Thr Phe Lys
690 695 700
Lys Val Ser Ile Thr Phe Asp Ser
Ser Val Ser Trp Pro Gly Asn Asp
705 710 715 720
Arg Leu Leu Thr Pro Asn Glu Phe
Glu Ile Lys Arg Thr Val Asp Gly
725 730 735
Glu Gly Tyr Asn Val Ala Gln Cys
Asn Met Thr Lys Asp Trp Phe Leu
740 745 750
Val Gln Met Leu Ala His Tyr Asn
Ile Gly Tyr Gln Gly Phe Tyr Val
755 760 765
Pro Glu Gly Tyr Lys Asp Arg Met
Tyr Ser Phe Phe Arg Asn Phe Gln
770 775 780
Pro Met Ser Arg Gln Val Val Asp
Glu Val Asn Tyr Lys Asp Tyr Gln
785 790 795 800
Ala Val Thr Leu Ala Tyr Gln His
Asn Asn Ser Gly Phe Val Gly Tyr
805 810 815
Leu Ala Pro Thr Met Arg Gln Gly
Gln Pro Tyr Pro Ala Asn Tyr Pro
820 825 830
Tyr Pro Leu Ile Gly Lys Ser Ala
Val Ala Ser Val Thr Gln Lys Lys
835 840 845
Phe Leu Cys Asp Arg Val Met Trp
Arg Ile Pro Phe Ser Ser Asn Phe
850 855 860
Met Ser Met Gly Ala Leu Thr Asp
Leu Gly Gln Asn Met Leu Tyr Ala
865 870 875 880
Asn Ser Ala His Ala Leu Asp Met
Asn Phe Glu Val Asp Pro Met Asp
885 890 895
Glu Ser Thr Leu Leu Tyr Val Val
Phe Glu Val Phe Asp Val Val Arg
900 905 910
Val His Gln Pro His Arg Gly Val
Ile Lys Ala Val Tyr Leu Arg Thr
915 920 925
Pro Phe Ser Ala Gly Asn Ala Thr
Thr
930 935
<210> 22
<211> 937
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 22
Met Ala Thr Pro Ser Met Leu Pro
Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu
Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser
Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val
Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp
Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val
Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg
Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala
Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Thr Ser Gln Trp Ile
Thr Lys Asp Asn Gly Thr Asp Lys
130 135 140
Thr Tyr Ser Phe Gly Asn Ala Pro
Val Arg Gly Leu Asp Ile Thr Glu
145 150 155 160
Glu Gly Leu Gln Ile Gly Thr Asp
Glu Ser Gly Gly Glu Ser Lys Lys
165 170 175
Ile Phe Ala Asp Lys Thr Tyr Gln
Pro Glu Pro Gln Leu Gly Asp Glu
180 185 190
Glu Trp His Asp Thr Ile Gly Ala
Glu Asp Lys Tyr Gly Gly Arg Ala
195 200 205
Leu Lys Pro Ala Thr Asn Met Lys
Pro Cys Tyr Gly Ser Phe Ala Lys
210 215 220
Pro Thr Asn Ala Lys Gly Gly Gln
Ala Lys Ser Arg Thr Lys Asp Asp
225 230 235 240
Gly Thr Thr Glu Pro Asp Ile Asp
Met Ala Phe Phe Asp Asp Arg Ser
245 250 255
Gln Gln Ala Ser Phe Ser Pro Glu
Leu Val Leu Tyr Thr Glu Asn Val
260 265 270
Asp Leu Asp Thr Pro Asp Thr His
Ile Ile Tyr Lys Pro Gly Thr Asp
275 280 285
Glu Thr Ser Ser Ser Phe Asn Leu
Gly Gln Gln Ser Met Pro Asn Arg
290 295 300
Pro Asn Tyr Ile Gly Phe Arg Asp
Asn Phe Ile Gly Leu Met Tyr Tyr
305 310 315 320
Asn Ser Thr Gly Asn Met Gly Val
Leu Ala Gly Gln Ala Ser Gln Leu
325 330 335
Asn Ala Val Val Asp Leu Gln Asp
Arg Asn Thr Glu Leu Ser Tyr Gln
340 345 350
Leu Leu Leu Asp Ser Leu Gly Asp
Arg Thr Arg Tyr Phe Ser Met Trp
355 360 365
Asn Gln Ala Val Asp Ser Tyr Asp
Pro Asp Val Arg Ile Ile Glu Asn
370 375 380
His Gly Val Glu Asp Glu Leu Pro
Asn Tyr Cys Phe Pro Leu Asn Gly
385 390 395 400
Val Gly Phe Thr Asp Thr Phe Gln
Gly Ile Lys Val Lys Thr Thr Asn
405 410 415
Asn Gly Thr Ala Asn Ala Thr Glu
Trp Glu Ser Asp Thr Ser Val Asn
420 425 430
Asn Ala Asn Glu Ile Ala Lys Gly
Asn Pro Phe Ala Met Glu Ile Asn
435 440 445
Ile Gln Ala Asn Leu Trp Arg Asn
Phe Leu Tyr Ala Asn Val Ala Leu
450 455 460
Tyr Leu Pro Asp Ser Tyr Lys Tyr
Thr Pro Ala Asn Ile Thr Leu Pro
465 470 475 480
Thr Asn Thr Asn Thr Tyr Asp Tyr
Met Asn Gly Arg Val Val Ala Pro
485 490 495
Ser Leu Val Asp Ala Tyr Ile Asn
Ile Gly Ala Arg Trp Ser Leu Asp
500 505 510
Pro Met Asp Asn Val Asn Pro Phe
Asn His His Arg Asn Ala Gly Leu
515 520 525
Arg Tyr Arg Ser Met Leu Leu Gly
Asn Gly Arg Tyr Val Pro Phe His
530 535 540
Ile Gln Val Pro Gln Lys Phe Phe
Ala Ile Lys Ser Leu Leu Leu Leu
545 550 555 560
Pro Gly Ser Tyr Thr Tyr Glu Trp
Asn Phe Arg Lys Asp Val Asn Met
565 570 575
Ile Leu Gln Ser Ser Leu Gly Asn
Asp Leu Arg Thr Asp Gly Ala Ser
580 585 590
Ile Ala Phe Thr Ser Ile Asn Leu
Tyr Ala Thr Phe Phe Pro Met Ala
595 600 605
His Asn Thr Ala Ser Thr Leu Glu
Ala Met Leu Arg Asn Asp Thr Asn
610 615 620
Asp Gln Ser Phe Asn Asp Tyr Leu
Ser Ala Ala Asn Met Leu Tyr Pro
625 630 635 640
Ile Pro Ala Asn Ala Thr Asn Val
Pro Ile Ser Ile Pro Ser Arg Asn
645 650 655
Trp Ala Ala Phe Arg Gly Trp Ser
Phe Thr Arg Leu Lys Thr Arg Glu
660 665 670
Thr Pro Ser Leu Gly Ser Gly Phe
Asp Pro Tyr Phe Val Tyr Ser Gly
675 680 685
Ser Ile Pro Tyr Leu Asp Gly Thr
Phe Tyr Leu Asn His Thr Phe Lys
690 695 700
Lys Val Ser Ile Thr Phe Asp Ser
Ser Val Ser Trp Pro Gly Asn Asp
705 710 715 720
Arg Leu Leu Thr Pro Asn Glu Phe
Glu Ile Lys Arg Thr Val Asp Gly
725 730 735
Glu Gly Tyr Asn Val Ala Gln Cys
Asn Met Thr Lys Asp Trp Phe Leu
740 745 750
Val Gln Met Leu Ala His Tyr Asn
Ile Gly Tyr Gln Gly Phe Tyr Val
755 760 765
Pro Glu Gly Tyr Lys Asp Arg Met
Tyr Ser Phe Phe Arg Asn Phe Gln
770 775 780
Pro Met Ser Arg Gln Val Val Asp
Glu Val Asn Tyr Lys Asp Tyr Gln
785 790 795 800
Ala Val Thr Leu Ala Tyr Gln His
Asn Asn Ser Gly Phe Val Gly Tyr
805 810 815
Leu Ala Pro Thr Met Arg Gln Gly
Gln Pro Tyr Pro Ala Asn Tyr Pro
820 825 830
Tyr Pro Leu Ile Gly Lys Ser Ala
Val Ala Ser Val Thr Gln Lys Lys
835 840 845
Phe Leu Cys Asp Arg Val Met Trp
Arg Ile Pro Phe Ser Ser Asn Phe
850 855 860
Met Ser Met Gly Ala Leu Thr Asp
Leu Gly Gln Asn Met Leu Tyr Ala
865 870 875 880
Asn Ser Ala His Ala Leu Asp Met
Asn Phe Glu Val Asp Pro Met Asp
885 890 895
Glu Ser Thr Leu Leu Tyr Val Val
Phe Glu Val Phe Asp Val Val Arg
900 905 910
Val His Gln Pro His Arg Gly Val
Ile Glu Ala Val Tyr Leu Arg Thr
915 920 925
Pro Phe Ser Ala Gly Asn Ala Thr
Thr
930 935
<210> 23
<211> 937
<212> PRT
<213> Adenoviridae - Mastadenovirus
<220>
<221> misc_feature
<222> (296)..(296)
<223> Xaa can be any naturally occurring amino acid
<400> 23
Met Ala Thr Pro Ser Met Leu Pro
Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu
Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser
Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val
Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp
Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val
Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg
Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala
Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Thr Ser Gln Trp Ile
Thr Lys Asp Asn Gly Thr Asp Lys
130 135 140
Thr Tyr Ser Phe Gly Asn Ala Pro
Val Arg Gly Leu Asp Ile Thr Glu
145 150 155 160
Glu Gly Leu Gln Ile Gly Thr Asp
Glu Ser Gly Gly Lys Ser Lys Lys
165 170 175
Ile Phe Ala Asp Lys Thr Tyr Gln
Pro Glu Pro Gln Leu Gly Asp Glu
180 185 190
Glu Trp His Asp Thr Ile Gly Ala
Glu Asp Lys Tyr Gly Gly Arg Ala
195 200 205
Leu Lys Pro Ala Thr Asn Met Lys
Pro Cys Tyr Gly Ser Phe Ala Lys
210 215 220
Pro Thr Asn Ala Lys Gly Gly Gln
Ala Lys Ser Arg Thr Lys Asp Asp
225 230 235 240
Gly Thr Thr Glu Pro Asp Ile Asp
Met Ala Phe Phe Asp Asp Arg Ser
245 250 255
Gln Gln Ala Ser Phe Ser Pro Glu
Leu Val Leu Tyr Thr Glu Asn Val
260 265 270
Asp Leu Asp Thr Pro Asp Thr His
Ile Ile Tyr Lys Pro Gly Thr Asp
275 280 285
Glu Thr Ser Ser Ser Phe Asn Xaa
Gly Gln Gln Ser Met Pro Asn Arg
290 295 300
Pro Asn Tyr Ile Gly Phe Arg Asp
Asn Phe Ile Gly Leu Met Tyr Tyr
305 310 315 320
Asn Ser Thr Gly Asn Met Gly Val
Leu Ala Gly Gln Ala Ser Gln Leu
325 330 335
Asn Ala Val Val Asp Leu Gln Asp
Arg Asn Thr Glu Leu Ser Tyr Gln
340 345 350
Leu Leu Leu Asp Ser Leu Gly Asp
Arg Thr Arg Tyr Phe Ser Met Trp
355 360 365
Asn Gln Ala Val Asp Ser Tyr Asp
Pro Asp Val Arg Ile Ile Glu Asn
370 375 380
His Gly Val Glu Asp Glu Leu Pro
Asn Tyr Cys Phe Pro Leu Asn Gly
385 390 395 400
Val Gly Phe Thr Asp Thr Phe Gln
Gly Ile Lys Val Lys Thr Thr Asn
405 410 415
Asn Gly Thr Ala Asn Ala Thr Glu
Trp Glu Ser Asp Thr Ser Val Asn
420 425 430
Asn Ala Asn Glu Ile Ala Lys Gly
Asn Pro Phe Ala Met Glu Ile Asn
435 440 445
Ile Gln Ala Asn Leu Trp Arg Asn
Phe Leu Tyr Ala Asn Val Ala Leu
450 455 460
Tyr Leu Pro Asp Ser Tyr Lys Tyr
Thr Pro Ala Asn Ile Thr Leu Pro
465 470 475 480
Thr Asn Thr Asn Thr Tyr Asp Tyr
Met Asn Gly Arg Val Val Ala Pro
485 490 495
Ser Leu Val Asp Ala Tyr Ile Asn
Ile Gly Ala Arg Trp Ser Leu Asp
500 505 510
Pro Met Asp Asn Val Asn Pro Phe
Asn His His Arg Asn Ala Gly Leu
515 520 525
Arg Tyr Arg Ser Met Leu Leu Gly
Asn Gly Arg Tyr Val Pro Phe His
530 535 540
Ile Gln Val Pro Gln Lys Phe Phe
Ala Ile Lys Asn Leu Leu Leu Leu
545 550 555 560
Pro Gly Ser Tyr Thr Tyr Glu Trp
Asn Phe Arg Lys Asp Val Asn Met
565 570 575
Ile Leu Gln Ser Ser Leu Gly Asn
Asp Leu Arg Thr Asp Gly Ala Ser
580 585 590
Ile Ala Phe Thr Ser Ile Asn Leu
Tyr Ala Thr Phe Phe Pro Met Ala
595 600 605
His Asn Thr Ala Ser Thr Leu Glu
Ala Met Leu Arg Asn Asp Thr Asn
610 615 620
Asp Gln Ser Phe Asn Asp Tyr Leu
Ser Ala Ala Asn Met Leu Tyr Pro
625 630 635 640
Ile Pro Ala Asn Ala Thr Asn Val
Pro Ile Ser Ile Pro Ser Arg Asn
645 650 655
Trp Ala Ala Phe Arg Gly Trp Ser
Phe Thr Arg Leu Lys Thr Arg Glu
660 665 670
Thr Pro Ser Leu Gly Ser Gly Phe
Asp Pro Tyr Phe Val Tyr Ser Gly
675 680 685
Ser Ile Pro Tyr Leu Asp Gly Thr
Phe Tyr Leu Asn His Thr Phe Lys
690 695 700
Lys Val Ser Ile Thr Phe Asp Ser
Ser Val Ser Trp Pro Gly Asn Asp
705 710 715 720
Arg Leu Leu Thr Pro Asn Glu Phe
Glu Ile Lys Arg Thr Val Asp Gly
725 730 735
Glu Gly Tyr Asn Val Ala Gln Cys
Asn Met Thr Lys Asp Trp Phe Leu
740 745 750
Val Gln Met Leu Ala His Tyr Asn
Ile Gly Tyr Gln Gly Phe Tyr Val
755 760 765
Pro Glu Gly Tyr Lys Asp Arg Met
Tyr Ser Phe Phe Arg Asn Phe Gln
770 775 780
Pro Met Ser Arg Gln Val Val Asp
Glu Val Asn Tyr Lys Asp Tyr Gln
785 790 795 800
Ala Val Thr Leu Ala Tyr Gln His
Asn Asn Ser Gly Phe Val Gly Tyr
805 810 815
Leu Ala Pro Thr Met Arg Gln Gly
Gln Pro Tyr Pro Ala Asn Tyr Pro
820 825 830
Tyr Pro Leu Ile Gly Lys Ser Ala
Val Ala Ser Val Thr Gln Lys Lys
835 840 845
Phe Leu Cys Asp Arg Val Met Trp
Arg Ile Pro Phe Ser Ser Asn Phe
850 855 860
Met Ser Met Gly Ala Leu Thr Asp
Leu Gly Gln Asn Met Leu Tyr Ala
865 870 875 880
Asn Ser Ala His Ala Leu Asp Met
Asn Phe Glu Val Asp Pro Met Asp
885 890 895
Glu Ser Thr Leu Leu Tyr Val Val
Phe Glu Val Phe Asp Val Val Arg
900 905 910
Val His Gln Pro His Arg Gly Val
Ile Glu Ala Val Tyr Leu Arg Thr
915 920 925
Pro Phe Ser Ala Gly Asn Ala Thr
Thr
930 935
<210> 24
<211> 937
<212> PRT
<213> Adenoviridae - Mastadenovirus
<220>
<221> misc_feature
<222> (538)..(538)
<223> Xaa can be any naturally occurring amino acid
<400> 24
Met Ala Thr Pro Ser Met Leu Pro
Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu
Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Thr Tyr Phe Ser
Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val
Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Val Pro Val Asp
Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80
Lys Val Arg Tyr Thr Leu Ala Val
Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg
Gly Val Leu Asp Arg Gly Pro Ser
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala
Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Thr Ser Gln Trp Val
Thr Lys Asp Asn Gly Thr Asp Lys
130 135 140
Thr Tyr Ser Phe Gly Asn Ala Pro
Val Arg Gly Leu Asp Ile Thr Glu
145 150 155 160
Glu Gly Leu Gln Ile Gly Thr Asp
Asp Ser Ser Thr Glu Ser Lys Lys
165 170 175
Ile Phe Ala Asp Lys Thr Tyr Gln
Pro Glu Pro Gln Val Gly Asp Glu
180 185 190
Glu Trp His Asp Thr Ile Gly Ala
Glu Asp Lys Tyr Gly Gly Arg Ala
195 200 205
Leu Lys Pro Ala Thr Asn Met Lys
Pro Cys Tyr Gly Ser Phe Ala Lys
210 215 220
Pro Thr Asn Ala Lys Gly Gly Gln
Ala Lys Thr Arg Thr Lys Asp Asp
225 230 235 240
Gly Thr Thr Glu Pro Asp Ile Asp
Met Ala Phe Phe Asp Asp Arg Ser
245 250 255
Gln Gln Ala Ser Phe Ser Pro Glu
Leu Val Leu Tyr Thr Glu Asn Val
260 265 270
Asp Leu Glu Thr Pro Asp Thr His
Ile Ile Tyr Lys Pro Gly Thr Asp
275 280 285
Glu Thr Ser Ser Ser Phe Asn Leu
Gly Gln Gln Ser Met Pro Asn Arg
290 295 300
Pro Asn Tyr Ile Gly Phe Arg Asp
Asn Phe Ile Gly Leu Met Tyr Tyr
305 310 315 320
Asn Ser Thr Gly Asn Met Gly Val
Leu Ala Gly Gln Ala Ser Gln Leu
325 330 335
Asn Ala Val Val Asp Leu Gln Asp
Arg Asn Thr Glu Leu Ser Tyr Gln
340 345 350
Leu Leu Leu Asp Ser Leu Gly Asp
Arg Thr Arg Tyr Phe Ser Met Trp
355 360 365
Asn Gln Ala Val Asp Ser Tyr Asp
Pro Asp Val Arg Ile Ile Glu Asn
370 375 380
His Gly Val Glu Asp Glu Leu Pro
Asn Tyr Cys Phe Pro Leu Asn Gly
385 390 395 400
Val Gly Phe Thr Asp Thr Phe Gln
Gly Ile Lys Val Lys Thr Thr Asn
405 410 415
Asn Gly Thr Ala Asn Ala Thr Glu
Trp Glu Ser Asp Thr Ser Val Asn
420 425 430
Asn Ala Asn Glu Ile Ala Lys Gly
Asn Pro Phe Ala Met Glu Ile Asn
435 440 445
Ile Gln Ala Asn Leu Trp Arg Asn
Phe Leu Tyr Ala Asn Val Ala Leu
450 455 460
Tyr Leu Pro Asp Ser Tyr Lys Tyr
Thr Pro Ala Asn Val Thr Leu Pro
465 470 475 480
Thr Asn Thr Asn Thr Tyr Glu Tyr
Met Asn Gly Arg Val Val Ala Pro
485 490 495
Ser Leu Val Asp Ser Tyr Ile Asn
Ile Gly Ala Arg Trp Ser Leu Asp
500 505 510
Pro Met Asp Asn Val Asn Pro Phe
Asn His His Arg Asn Ala Gly Leu
515 520 525
Arg Tyr Arg Ser Met Leu Leu Gly
Asn Xaa Arg Phe Val Pro Phe His
530 535 540
Ile Gln Val Pro Gln Lys Phe Phe
Ala Ile Lys Ser Leu Leu Leu Leu
545 550 555 560
Pro Gly Ser Tyr Thr Tyr Glu Trp
Asn Phe Arg Lys Asp Val Asn Met
565 570 575
Ile Leu Gln Ser Ser Leu Gly Asn
Asp Leu Arg Thr Asp Gly Ala Ser
580 585 590
Ile Ser Phe Thr Ser Ile Asn Leu
Tyr Ala Thr Phe Phe Pro Met Ala
595 600 605
His Asn Thr Ala Ser Thr Leu Glu
Ala Met Leu Arg Asn Asp Thr Asn
610 615 620
Asp Gln Ser Phe Asn Asp Tyr Leu
Ser Ala Ala Asn Met Leu Tyr Pro
625 630 635 640
Ile Pro Ala Asn Ala Thr Asn Val
Pro Ile Ser Ile Pro Ser Arg Asn
645 650 655
Trp Ala Ala Phe Arg Gly Trp Ser
Phe Thr Arg Leu Lys Thr Lys Glu
660 665 670
Thr Pro Ser Leu Gly Ser Gly Phe
Asp Pro Tyr Phe Val Tyr Ser Gly
675 680 685
Ser Ile Pro Tyr Leu Asp Gly Thr
Phe Tyr Leu Asn His Thr Phe Lys
690 695 700
Lys Val Ser Ile Thr Phe Asp Ser
Ser Val Ser Trp Pro Gly Asn Asp
705 710 715 720
Arg Leu Leu Thr Pro Asn Glu Phe
Glu Ile Lys Arg Thr Val Asp Gly
725 730 735
Glu Gly Tyr Asn Val Ala Gln Cys
Asn Met Thr Lys Asp Trp Phe Leu
740 745 750
Val Gln Met Leu Ala His Tyr Asn
Ile Gly Tyr Gln Gly Phe Tyr Val
755 760 765
Pro Glu Gly Tyr Lys Asp Arg Met
Tyr Ser Phe Phe Arg Asn Phe Gln
770 775 780
Pro Met Ser Arg Gln Val Val Asp
Glu Val Asn Tyr Lys Asp Tyr Gln
785 790 795 800
Ala Val Thr Leu Ala Tyr Gln His
Asn Asn Ser Gly Phe Val Gly Tyr
805 810 815
Leu Ala Pro Thr Met Arg Gln Gly
Gln Pro Tyr Pro Ala Asn Tyr Pro
820 825 830
Tyr Pro Leu Ile Gly Lys Ser Ala
Val Thr Ser Val Thr Gln Lys Lys
835 840 845
Phe Leu Cys Asp Arg Val Met Trp
Arg Ile Pro Phe Ser Ser Asn Phe
850 855 860
Met Ser Met Gly Ala Leu Thr Asp
Leu Gly Gln Asn Met Leu Tyr Ala
865 870 875 880
Asn Ser Ala His Ala Leu Asp Met
Asn Phe Glu Val Asp Pro Met Asp
885 890 895
Glu Ser Thr Leu Leu Tyr Val Val
Phe Glu Val Phe Asp Val Val Arg
900 905 910
Val His Gln Pro His Arg Gly Val
Ile Glu Ala Val Tyr Leu Arg Thr
915 920 925
Pro Phe Ser Ala Gly Asn Ala Thr
Thr
930 935
<210> 25
<211> 962
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 25
Met Ala Thr Pro Ser Met Met Pro
Gln Trp Ser Tyr Met His Ile Ser
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu
Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Ser Tyr Phe Ser
Leu Ser Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val
Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Ile Pro Val Asp
Arg Glu Asp Thr Ala Tyr Ser Tyr
65 70 75 80
Lys Ala Arg Phe Thr Leu Ala Val
Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg
Gly Val Leu Asp Arg Gly Pro Thr
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala
Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Ser Cys Glu Trp Glu
Gln Val Glu Pro Ala Glu Glu Ala
130 135 140
Ala Glu Asn Glu Asp Glu Glu Glu
Glu Glu Asp Val Val Asp Pro Gln
145 150 155 160
Glu Gln Glu Pro Thr Thr Lys Thr
His Val Tyr Ala Gln Ala Pro Leu
165 170 175
Ser Gly Glu Lys Ile Thr Lys Asp
Gly Leu Gln Ile Gly Thr Glu Ala
180 185 190
Thr Ala Ala Gly Gly Thr Lys Asp
Leu Phe Ala Asp Pro Thr Phe Gln
195 200 205
Pro Glu Pro Gln Val Gly Glu Ser
Gln Trp Asn Glu Ala Asp Ala Thr
210 215 220
Ala Ala Gly Gly Arg Val Leu Lys
Lys Thr Thr Pro Met Lys Pro Cys
225 230 235 240
Tyr Gly Ser Tyr Ala Arg Pro Thr
Asn Ala Asn Gly Gly Gln Gly Val
245 250 255
Leu Lys Ala Asn Ala Gln Gly Val
Leu Glu Ser Gln Val Glu Met Gln
260 265 270
Phe Phe Ser Thr Ser Thr Asn Ala
Thr Asn Glu Gln Asn Asn Ile Gln
275 280 285
Pro Lys Leu Val Leu Tyr Ser Glu
Asp Val His Met Glu Thr Pro Asp
290 295 300
Thr His Ile Ser Tyr Lys Pro Thr
Lys Ser Asp Asp Asn Ser Lys Val
305 310 315 320
Met Leu Gly Gln Gln Ser Met Pro
Asn Arg Pro Asn Tyr Ile Ala Phe
325 330 335
Arg Asp Asn Phe Ile Gly Leu Met
Tyr Tyr Asn Ser Thr Gly Asn Met
340 345 350
Gly Val Leu Ala Gly Gln Ala Ser
Gln Leu Asn Ala Val Val Asp Leu
355 360 365
Gln Asp Arg Asn Thr Glu Leu Ser
Tyr Gln Leu Leu Leu Asp Ser Met
370 375 380
Gly Asp Arg Thr Arg Tyr Phe Ser
Met Trp Asn Gln Ala Val Asp Ser
385 390 395 400
Tyr Asp Pro Asp Val Arg Ile Ile
Glu Asn His Gly Thr Glu Asp Glu
405 410 415
Leu Pro Asn Tyr Cys Phe Pro Leu
Gly Gly Ile Gly Ile Thr Asp Thr
420 425 430
Tyr Gln Ala Ile Lys Thr Asn Gly
Asn Gly Ala Gly Asp Gln Ala Thr
435 440 445
Thr Trp Gln Lys Asp Ser Gln Phe
Ala Asp Arg Asn Glu Ile Gly Val
450 455 460
Gly Asn Asn Phe Ala Met Glu Ile
Asn Leu Ser Ala Asn Leu Trp Arg
465 470 475 480
Asn Phe Leu Tyr Ser Asn Val Ala
Leu Tyr Leu Pro Asp Lys Leu Lys
485 490 495
Tyr Asn Pro Ser Asn Val Glu Ile
Ser Asp Asn Pro Asn Thr Tyr Asp
500 505 510
Tyr Met Asn Lys Arg Val Val Ala
Pro Gly Leu Val Asp Cys Tyr Ile
515 520 525
Asn Leu Gly Ala Arg Trp Ser Leu
Asp Tyr Met Asp Asn Val Asn Pro
530 535 540
Phe Asn His His Arg Asn Ala Gly
Leu Arg Tyr Arg Ser Met Leu Leu
545 550 555 560
Gly Asn Gly Arg Tyr Val Pro Phe
His Ile Gln Val Pro Gln Lys Phe
565 570 575
Phe Ala Ile Lys Asn Leu Leu Leu
Leu Pro Gly Ser Tyr Thr Tyr Glu
580 585 590
Trp Asn Phe Arg Lys Asp Val Asn
Met Val Leu Gln Ser Ser Leu Gly
595 600 605
Asn Asp Leu Arg Val Asp Gly Ala
Ser Ile Lys Phe Glu Ser Ile Cys
610 615 620
Leu Tyr Ala Thr Phe Phe Pro Met
Ala His Asn Thr Ala Ser Thr Leu
625 630 635 640
Glu Ala Met Leu Arg Asn Asp Thr
Asn Asp Gln Ser Phe Asn Asp Tyr
645 650 655
Leu Ser Ala Ala Asn Met Leu Tyr
Pro Ile Pro Ala Asn Ala Thr Asn
660 665 670
Val Pro Ile Ser Ile Pro Ser Arg
Asn Trp Ala Ala Phe Arg Gly Trp
675 680 685
Ala Phe Thr Arg Leu Lys Thr Lys
Glu Thr Pro Ser Leu Gly Ser Gly
690 695 700
Phe Asp Pro Tyr Tyr Thr Tyr Ser
Gly Ser Ile Pro Tyr Leu Asp Gly
705 710 715 720
Thr Phe Tyr Leu Asn His Thr Phe
Lys Lys Val Ser Val Thr Phe Asp
725 730 735
Ser Ser Val Ser Trp Pro Gly Asn
Asp Arg Leu Leu Thr Pro Asn Glu
740 745 750
Phe Glu Ile Lys Arg Ser Val Asp
Gly Glu Gly Tyr Asn Val Ala Gln
755 760 765
Cys Asn Met Thr Lys Asp Trp Phe
Leu Ile Gln Met Leu Ala Asn Tyr
770 775 780
Asn Ile Gly Tyr Gln Gly Phe Tyr
Ile Pro Glu Ser Tyr Lys Asp Arg
785 790 795 800
Met Tyr Ser Phe Phe Arg Asn Phe
Gln Pro Met Ser Arg Gln Val Val
805 810 815
Asp Glu Thr Lys Tyr Lys Asp Tyr
Gln Gln Val Gly Ile Ile His Gln
820 825 830
His Asn Asn Ser Gly Phe Val Gly
Tyr Leu Ala Pro Thr Met Arg Glu
835 840 845
Gly Gln Ala Tyr Pro Ala Asn Phe
Pro Tyr Pro Leu Ile Gly Lys Thr
850 855 860
Ala Val Asp Ser Ile Thr Gln Lys
Lys Phe Leu Cys Asp Arg Thr Leu
865 870 875 880
Trp Arg Ile Pro Phe Ser Ser Asn
Phe Met Ser Met Gly Ala Leu Thr
885 890 895
Asp Leu Gly Gln Asn Leu Leu Tyr
Ala Asn Ser Ala His Ala Leu Asp
900 905 910
Met Thr Phe Glu Val Asp Pro Met
Asp Glu Pro Thr Leu Leu Tyr Val
915 920 925
Leu Phe Glu Val Phe Asp Val Val
Arg Val His Gln Pro His Arg Gly
930 935 940
Val Ile Glu Thr Val Tyr Leu Arg
Thr Pro Phe Ser Ala Gly Asn Ala
945 950 955 960
Thr Thr
<210> 26
<211> 531
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 26
Met Met Arg Arg Val Tyr Pro Glu
Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Val Ala Ala
Ala Met Gln Pro Pro Leu Glu Ala
20 25 30
Pro Tyr Val Pro Pro Arg Tyr Leu
Ala Pro Thr Glu Gly Arg Asn Ser
35 40 45
Ile Arg Tyr Ser Glu Leu Ala Pro
Leu Tyr Asp Thr Thr Arg Leu Tyr
50 55 60
Leu Val Asp Asn Lys Ser Ala Asp
Ile Ala Ser Leu Asn Tyr Gln Asn
65 70 75 80
Asp His Ser Asn Phe Leu Thr Thr
Val Val Gln Asn Asn Asp Phe Thr
85 90 95
Pro Thr Glu Ala Ser Thr Gln Thr
Ile Asn Phe Asp Glu Arg Ser Arg
100 105 110
Trp Gly Gly Gln Leu Lys Thr Ile
Met His Thr Asn Met Pro Asn Val
115 120 125
Asn Glu Phe Met Tyr Ser Asn Lys
Phe Lys Ala Arg Val Met Val Ser
130 135 140
Arg Lys Thr Pro Asn Gly Val Ala
Val Gly Asp Asp Tyr Asp Gly Gly
145 150 155 160
Gln Asp Glu Leu Thr Tyr Glu Trp
Val Glu Phe Glu Leu Pro Glu Gly
165 170 175
Asn Phe Ser Val Thr Met Thr Ile
Asp Leu Met Asn Asn Ala Ile Ile
180 185 190
Asp Asn Tyr Leu Ala Val Gly Arg
Gln Asn Gly Val Leu Glu Ser Asp
195 200 205
Ile Gly Val Lys Phe Asp Thr Arg
Asn Phe Arg Leu Gly Trp Asp Pro
210 215 220
Val Thr Glu Leu Val Met Pro Gly
Val Tyr Thr Asn Glu Ala Phe His
225 230 235 240
Pro Asp Ile Val Leu Leu Pro Gly
Cys Gly Val Asp Phe Thr Glu Ser
245 250 255
Arg Leu Ser Asn Leu Leu Gly Ile
Arg Lys Arg Gln Pro Phe Gln Glu
260 265 270
Gly Phe Gln Ile Leu Tyr Glu Asp
Leu Glu Gly Gly Asn Ile Pro Ala
275 280 285
Leu Leu Asp Val Glu Ala Tyr Glu
Lys Ser Lys Glu Glu Ser Ala Ala
290 295 300
Ala Ala Thr Ala Ala Val Ala Thr
Ala Ser Thr Glu Val Arg Gly Asp
305 310 315 320
Asn Phe Ala Ser Ala Ala Ala Val
Ala Glu Ala Ala Glu Thr Glu Ser
325 330 335
Lys Ile Val Ile Gln Pro Val Glu
Lys Asp Ser Lys Asp Arg Ser Tyr
340 345 350
Asn Val Leu Ala Asp Lys Lys Asn
Thr Ala Tyr Arg Ser Trp Tyr Leu
355 360 365
Ala Tyr Asn Tyr Gly Asp Pro Glu
Lys Gly Val Arg Ser Trp Thr Leu
370 375 380
Leu Thr Thr Ser Asp Val Thr Cys
Gly Val Glu Gln Val Tyr Trp Ser
385 390 395 400
Leu Pro Asp Met Met Gln Asp Pro
Val Thr Phe Arg Ser Thr Arg Gln
405 410 415
Val Ser Asn Tyr Pro Val Val Gly
Ala Glu Leu Leu Pro Val Tyr Ser
420 425 430
Lys Ser Phe Phe Asn Glu Gln Ala
Val Tyr Ser Gln Gln Leu Arg Ala
435 440 445
Phe Thr Ser Leu Thr His Val Phe
Asn Arg Phe Pro Glu Asn Gln Ile
450 455 460
Leu Val Arg Pro Pro Ala Pro Thr
Ile Thr Thr Val Ser Glu Asn Val
465 470 475 480
Pro Ala Leu Thr Asp His Gly Thr
Leu Pro Leu Arg Ser Ser Ile Arg
485 490 495
Gly Val Gln Arg Val Thr Val Thr
Asp Ala Arg Arg Arg Thr Cys Pro
500 505 510
Tyr Val Tyr Lys Ala Leu Gly Val
Val Ala Pro Arg Val Leu Ser Ser
515 520 525
Arg Thr Phe
530
<210> 27
<211> 541
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 27
Met Met Arg Arg Val Tyr Pro Glu
Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Val Ala Val
Ala Met Gln Pro Pro Leu Glu Ala
20 25 30
Pro Tyr Val Pro Pro Arg Tyr Leu
Ala Pro Thr Glu Gly Arg Asn Ser
35 40 45
Ile Arg Tyr Ser Glu Leu Ala Pro
Leu Tyr Asp Thr Thr Arg Leu Tyr
50 55 60
Leu Val Asp Asn Lys Ser Ala Asp
Ile Ala Ser Leu Asn Tyr Gln Asn
65 70 75 80
Asp His Ser Asn Phe Leu Thr Thr
Val Val Gln Asn Asn Asp Phe Thr
85 90 95
Pro Thr Glu Ala Ser Thr Gln Thr
Ile Asn Phe Asp Glu Arg Ser Arg
100 105 110
Trp Gly Gly Gln Leu Lys Thr Ile
Met His Thr Asn Met Pro Asn Val
115 120 125
Asn Glu Phe Met Tyr Ser Asn Lys
Phe Lys Ala Arg Val Met Val Ser
130 135 140
Arg Lys Thr Pro Asn Gly Val Thr
Val Gly Asp Asp Tyr Asp Gly Ser
145 150 155 160
Gln Asp Glu Leu Thr Tyr Glu Trp
Val Glu Phe Glu Leu Pro Glu Gly
165 170 175
Asn Phe Ser Val Thr Met Thr Ile
Asp Leu Met Asn Asn Ala Ile Ile
180 185 190
Asp Asn Tyr Leu Ala Val Gly Arg
Gln Asn Gly Val Leu Glu Ser Asp
195 200 205
Ile Gly Val Lys Phe Asp Thr Arg
Asn Phe Arg Leu Gly Trp Asp Pro
210 215 220
Val Thr Glu Leu Val Met Pro Gly
Val Tyr Thr Asn Glu Ala Phe His
225 230 235 240
Pro Asp Ile Val Leu Leu Pro Gly
Cys Gly Val Asp Phe Thr Glu Ser
245 250 255
Arg Leu Ser Asn Leu Leu Gly Ile
Arg Lys Arg Gln Pro Phe Gln Glu
260 265 270
Gly Phe Gln Ile Leu Tyr Glu Asp
Leu Glu Gly Gly Asn Ile Pro Ala
275 280 285
Leu Leu Asp Val Glu Ala Tyr Glu
Lys Ser Lys Glu Asp Ser Ala Ala
290 295 300
Ala Thr Thr Ala Ala Val Ala Thr
Ala Ala Thr Thr Asp Ala Asp Ala
305 310 315 320
Thr Thr Thr Arg Gly Asp Thr Phe
Ala Thr Gln Ala Glu Glu Ala Ala
325 330 335
Ala Leu Ala Ala Thr Asp Asp Ser
Glu Ser Lys Ile Val Ile Lys Pro
340 345 350
Val Glu Lys Asp Ser Lys Asp Arg
Ser Tyr Asn Val Leu Ala Asp Lys
355 360 365
Lys Asn Thr Ala Tyr Arg Ser Trp
Tyr Leu Ala Tyr Asn Tyr Gly Asp
370 375 380
Pro Glu Lys Gly Val Arg Ser Trp
Thr Leu Leu Thr Thr Ser Asp Val
385 390 395 400
Thr Cys Gly Val Glu Gln Val Tyr
Trp Ser Leu Pro Asp Met Met Gln
405 410 415
Asp Pro Val Thr Phe Arg Ser Thr
Arg Gln Val Ser Asn Tyr Pro Val
420 425 430
Val Gly Ala Glu Leu Leu Pro Val
Tyr Ser Lys Ser Phe Phe Asn Glu
435 440 445
Gln Ala Val Tyr Ser Gln Gln Leu
Arg Ala Phe Thr Ser Leu Thr His
450 455 460
Val Phe Asn Arg Phe Pro Glu Asn
Gln Ile Leu Val Arg Pro Pro Ala
465 470 475 480
Pro Thr Ile Thr Thr Val Ser Glu
Asn Val Pro Ala Leu Thr Asp His
485 490 495
Gly Thr Leu Pro Leu Arg Ser Ser
Ile Arg Gly Val Gln Arg Val Thr
500 505 510
Val Thr Asp Ala Arg Arg Arg Thr
Cys Pro Tyr Val Tyr Lys Ala Leu
515 520 525
Gly Val Val Ala Pro Arg Val Leu
Ser Ser Arg Thr Phe
530 535 540
<210> 28
<211> 532
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 28
Met Met Arg Arg Val Tyr Pro Glu
Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Val Ala Ala
Ala Met Gln Pro Pro Leu Glu Ala
20 25 30
Pro Tyr Val Pro Pro Arg Tyr Leu
Ala Pro Thr Glu Gly Arg Asn Ser
35 40 45
Ile Arg Tyr Ser Glu Leu Ala Pro
Leu Tyr Asp Thr Thr Arg Leu Tyr
50 55 60
Leu Val Asp Asn Lys Ser Ala Asp
Ile Ala Ser Leu Asn Tyr Gln Asn
65 70 75 80
Asp His Ser Asn Phe Leu Thr Thr
Val Val Gln Asn Asn Asp Phe Thr
85 90 95
Pro Thr Glu Ala Ser Thr Gln Thr
Ile Asn Phe Asp Glu Arg Ser Arg
100 105 110
Trp Gly Gly Gln Leu Lys Thr Ile
Met His Thr Asn Met Pro Asn Val
115 120 125
Asn Glu Phe Met Tyr Ser Asn Lys
Phe Lys Ala Arg Val Met Val Ser
130 135 140
Arg Lys Thr Pro Asn Gly Val Thr
Val Thr Asp Gly Ser Gln Asp Glu
145 150 155 160
Leu Thr Tyr Glu Trp Val Glu Phe
Glu Leu Pro Glu Gly Asn Phe Ser
165 170 175
Val Thr Met Thr Ile Asp Leu Met
Asn Asn Ala Ile Ile Asp Asn Tyr
180 185 190
Leu Ala Val Gly Arg Gln Asn Gly
Val Leu Glu Ser Asp Ile Gly Val
195 200 205
Lys Phe Asp Thr Arg Asn Phe Arg
Leu Gly Trp Asp Pro Val Thr Glu
210 215 220
Leu Val Met Pro Gly Val Tyr Thr
Asn Glu Ala Phe His Pro Asp Ile
225 230 235 240
Val Leu Leu Pro Gly Cys Gly Val
Asp Phe Thr Glu Ser Arg Leu Ser
245 250 255
Asn Leu Leu Gly Ile Arg Lys Arg
Gln Pro Phe Gln Glu Gly Phe Gln
260 265 270
Ile Leu Tyr Glu Asp Leu Glu Gly
Gly Asn Ile Pro Ala Leu Leu Asp
275 280 285
Val Glu Ala Tyr Glu Lys Ser Lys
Glu Asp Ser Thr Ala Val Ala Thr
290 295 300
Ala Ala Thr Val Ala Asp Ala Thr
Val Thr Arg Gly Asp Thr Phe Ala
305 310 315 320
Thr Gln Ala Glu Glu Ala Ala Ala
Leu Ala Ala Thr Asp Asp Ser Glu
325 330 335
Ser Lys Ile Val Ile Lys Pro Val
Glu Lys Asp Ser Lys Asp Arg Ser
340 345 350
Tyr Asn Val Leu Ser Asp Gly Lys
Asn Thr Ala Tyr Arg Ser Trp Tyr
355 360 365
Leu Ala Tyr Asn Tyr Gly Asp Pro
Glu Lys Gly Val Arg Ser Trp Thr
370 375 380
Leu Leu Thr Thr Ser Asp Val Thr
Cys Gly Val Glu Gln Val Tyr Trp
385 390 395 400
Ser Leu Pro Asp Met Met Gln Asp
Pro Val Thr Phe Arg Ser Thr Arg
405 410 415
Gln Val Ser Asn Tyr Pro Val Val
Gly Ala Glu Leu Leu Pro Val Tyr
420 425 430
Ser Lys Ser Phe Phe Asn Glu Gln
Ala Val Tyr Ser Gln Gln Leu Arg
435 440 445
Ala Phe Thr Ser Leu Thr His Val
Phe Asn Arg Phe Pro Glu Asn Gln
450 455 460
Ile Leu Val Arg Pro Pro Ala Pro
Thr Ile Thr Thr Val Ser Glu Asn
465 470 475 480
Val Pro Ala Leu Thr Asp His Gly
Thr Leu Pro Leu Arg Ser Ser Ile
485 490 495
Arg Gly Val Gln Arg Val Thr Val
Thr Asp Ala Arg Arg Arg Thr Cys
500 505 510
Pro Tyr Val Tyr Lys Ala Leu Gly
Val Val Ala Pro Arg Val Leu Ser
515 520 525
Ser Arg Thr Phe
530
<210> 29
<211> 528
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 29
Met Met Arg Arg Val Tyr Pro Glu
Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Val Ala Ala
Ala Met Gln Pro Pro Leu Glu Ala
20 25 30
Pro Tyr Val Pro Pro Arg Tyr Leu
Ala Pro Thr Glu Gly Arg Asn Ser
35 40 45
Ile Arg Tyr Ser Glu Leu Ala Pro
Leu Tyr Asp Thr Thr Arg Leu Tyr
50 55 60
Leu Val Asp Asn Lys Ser Ala Asp
Ile Ala Ser Leu Asn Tyr Gln Asn
65 70 75 80
Asp His Ser Asn Phe Leu Thr Thr
Val Val Gln Asn Asn Asp Phe Thr
85 90 95
Pro Thr Glu Ala Ser Thr Gln Thr
Ile Asn Phe Asp Glu Arg Ser Arg
100 105 110
Trp Gly Gly Gln Leu Lys Thr Ile
Met His Thr Asn Met Pro Asn Val
115 120 125
Asn Glu Phe Leu Tyr Ser Asn Lys
Phe Lys Ala Arg Val Met Val Ser
130 135 140
Arg Lys Thr Pro Asn Gly Val Thr
Val Thr Asp Gly Ser Gln Asp Glu
145 150 155 160
Leu Thr Tyr Glu Trp Val Glu Phe
Glu Leu Pro Glu Gly Asn Phe Ser
165 170 175
Val Thr Met Thr Ile Asp Leu Met
Asn Asn Ala Ile Ile Asp Asn Tyr
180 185 190
Leu Ala Val Gly Arg Gln Asn Gly
Val Leu Glu Ser Asp Ile Gly Val
195 200 205
Lys Phe Asp Thr Arg Asn Phe Arg
Leu Gly Trp Asp Pro Val Thr Glu
210 215 220
Leu Val Met Pro Gly Val Tyr Thr
Asn Glu Ala Phe His Pro Asp Ile
225 230 235 240
Val Leu Leu Pro Gly Cys Gly Val
Asp Phe Thr Glu Ser Arg Leu Ser
245 250 255
Asn Leu Leu Gly Ile Arg Lys Arg
Gln Pro Phe Gln Glu Gly Phe Gln
260 265 270
Ile Met Tyr Glu Asp Leu Glu Gly
Gly Asn Ile Pro Ala Leu Leu Asp
275 280 285
Val Glu Ala Tyr Glu Lys Ser Lys
Glu Asp Ser Ala Ala Ala Ala Thr
290 295 300
Ala Ala Val Ala Thr Ala Ser Thr
Glu Val Arg Gly Asp Asn Phe Ala
305 310 315 320
Ser Ala Ala Ala Val Ala Glu Ala
Ala Glu Thr Glu Ser Lys Ile Val
325 330 335
Ile Gln Pro Val Glu Lys Asp Ser
Lys Asp Arg Ser Tyr Asn Val Leu
340 345 350
Ala Asp Lys Lys Asn Thr Ala Tyr
Arg Ser Trp Tyr Leu Ala Tyr Asn
355 360 365
Tyr Gly Asp Pro Glu Lys Gly Val
Arg Ser Trp Thr Leu Leu Thr Thr
370 375 380
Ser Asp Val Thr Cys Gly Val Glu
Gln Val Tyr Trp Ser Leu Pro Asp
385 390 395 400
Met Met Gln Asp Pro Val Thr Phe
Arg Ser Thr Arg Gln Val Ser Asn
405 410 415
Tyr Pro Val Val Gly Ala Glu Leu
Leu Pro Val Tyr Ser Lys Ser Phe
420 425 430
Phe Asn Glu Gln Ala Val Tyr Ser
Gln Gln Leu Arg Ala Phe Thr Ser
435 440 445
Leu Thr His Val Phe Asn Arg Phe
Pro Glu Asn Gln Ile Leu Val Arg
450 455 460
Pro Pro Ala Pro Thr Ile Thr Thr
Val Ser Glu Asn Val Pro Ala Leu
465 470 475 480
Thr Asp His Gly Thr Leu Pro Leu
Arg Ser Ser Ile Arg Gly Val Gln
485 490 495
Arg Val Thr Val Thr Asp Ala Arg
Arg Arg Thr Cys Pro Tyr Val Tyr
500 505 510
Lys Ala Leu Gly Val Val Ala Pro
Arg Val Leu Ser Ser Arg Thr Phe
515 520 525
<210> 30
<211> 535
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 30
Met Met Arg Arg Ala Tyr Pro Glu
Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15
Val Met Gln Gln Ala Met Ala Ala
Ala Ala Ala Met Gln Pro Pro Leu
20 25 30
Glu Ala Pro Tyr Val Pro Pro Arg
Tyr Leu Ala Pro Thr Glu Gly Arg
35 40 45
Asn Ser Ile Arg Tyr Ser Glu Leu
Ala Pro Leu Tyr Asp Thr Thr Arg
50 55 60
Leu Tyr Leu Val Asp Asn Lys Ser
Ala Asp Ile Ala Ser Leu Asn Tyr
65 70 75 80
Gln Asn Asp His Ser Asn Phe Leu
Thr Thr Val Val Gln Asn Asn Asp
85 90 95
Phe Thr Pro Thr Glu Ala Ser Thr
Gln Thr Ile Asn Phe Asp Glu Arg
100 105 110
Ser Arg Trp Gly Gly Gln Leu Lys
Thr Ile Met His Thr Asn Met Pro
115 120 125
Asn Val Asn Glu Phe Met Tyr Ser
Asn Lys Phe Lys Ala Arg Val Met
130 135 140
Val Ser Arg Lys Thr Pro Asn Gly
Val Thr Val Thr Glu Asp Tyr Asp
145 150 155 160
Gly Ser Gln Asp Glu Leu Lys Tyr
Glu Trp Val Glu Phe Glu Leu Pro
165 170 175
Glu Gly Asn Phe Ser Val Thr Met
Thr Ile Asp Leu Met Asn Asn Ala
180 185 190
Ile Ile Asp Asn Tyr Leu Ala Val
Gly Arg Gln Asn Gly Val Leu Glu
195 200 205
Ser Asp Ile Gly Val Lys Phe Asp
Thr Arg Asn Phe Arg Leu Gly Trp
210 215 220
Asp Pro Val Thr Glu Leu Val Met
Pro Gly Val Tyr Thr Asn Glu Ala
225 230 235 240
Phe His Pro Asp Ile Val Leu Leu
Pro Gly Cys Gly Val Asp Phe Thr
245 250 255
Glu Ser Arg Leu Ser Asn Leu Leu
Gly Ile Arg Lys Arg Gln Pro Phe
260 265 270
Gln Glu Gly Phe Gln Ile Met Tyr
Glu Asp Leu Glu Gly Gly Asn Ile
275 280 285
Pro Ala Leu Leu Asp Val Asp Ala
Tyr Glu Lys Ser Lys Glu Glu Ser
290 295 300
Ala Ala Ala Ala Thr Ala Ala Val
Ala Thr Ala Ser Thr Glu Val Arg
305 310 315 320
Gly Asp Asn Phe Ala Ser Ala Ala
Ala Val Ala Ala Ala Glu Ala Ala
325 330 335
Glu Thr Glu Ser Lys Ile Val Ile
Gln Pro Val Glu Lys Asp Ser Lys
340 345 350
Asp Arg Ser Tyr Asn Val Leu Pro
Asp Lys Ile Asn Thr Ala Tyr Arg
355 360 365
Ser Trp Tyr Leu Ala Tyr Asn Tyr
Gly Asp Pro Glu Lys Gly Val Arg
370 375 380
Ser Trp Thr Leu Leu Thr Thr Ser
Asp Val Thr Cys Gly Val Glu Gln
385 390 395 400
Val Tyr Trp Ser Leu Pro Asp Met
Met Gln Asp Pro Val Thr Phe Arg
405 410 415
Ser Thr Arg Gln Val Ser Asn Tyr
Pro Val Val Gly Ala Glu Leu Leu
420 425 430
Pro Val Tyr Ser Lys Ser Phe Phe
Asn Glu Gln Ala Val Tyr Ser Gln
435 440 445
Gln Leu Arg Ala Phe Thr Ser Leu
Thr His Val Phe Asn Arg Phe Pro
450 455 460
Glu Asn Gln Ile Leu Val Arg Pro
Pro Ala Pro Thr Ile Thr Thr Val
465 470 475 480
Ser Glu Asn Val Pro Ala Leu Thr
Asp His Gly Thr Leu Pro Leu Arg
485 490 495
Ser Ser Ile Arg Gly Val Gln Arg
Val Thr Val Thr Asp Ala Arg Arg
500 505 510
Arg Thr Cys Pro Tyr Val Tyr Lys
Ala Leu Gly Ile Val Ala Pro Arg
515 520 525
Val Leu Ser Ser Arg Thr Phe
530 535
<210> 31
<211> 581
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 31
Met Arg Arg Ala Ala Met Tyr His
Glu Gly Pro Pro Pro Ser Tyr Glu
1 5 10 15
Ser Val Val Gly Ala Ala Ala Ala
Ser Pro Phe Ala Ser Gln Leu Glu
20 25 30
Pro Pro Tyr Val Pro Pro Arg Tyr
Leu Arg Pro Thr Gly Gly Arg Asn
35 40 45
Ser Ile Arg Tyr Ser Glu Leu Ala
Pro Leu Tyr Asp Thr Thr Arg Val
50 55 60
Tyr Leu Val Asp Asn Lys Ser Ala
Asp Val Ala Ser Leu Asn Tyr Gln
65 70 75 80
Asn Asp His Ser Asn Phe Leu Thr
Thr Val Ile Gln Asn Asn Asp Tyr
85 90 95
Thr Pro Ser Glu Ala Ser Thr Gln
Thr Ile Asn Leu Asp Asp Arg Ser
100 105 110
His Trp Gly Gly Asp Leu Lys Thr
Ile Leu His Thr Asn Met Pro Asn
115 120 125
Val Asn Glu Phe Met Phe Thr Asn
Lys Phe Lys Ala Arg Val Met Val
130 135 140
Ser Arg Ser His Thr Lys Asp Asp
Arg Val Glu Leu Lys Tyr Glu Trp
145 150 155 160
Val Glu Phe Glu Leu Pro Glu Gly
Asn Tyr Ser Glu Thr Met Thr Ile
165 170 175
Asp Leu Met Asn Asn Ala Ile Val
Glu His Tyr Leu Lys Val Gly Arg
180 185 190
Gln Asn Gly Val Leu Glu Ser Asp
Ile Gly Val Lys Phe Asp Thr Arg
195 200 205
Asn Phe Arg Leu Gly Leu Asp Pro
Val Thr Gly Leu Val Met Pro Gly
210 215 220
Val Tyr Thr Asn Glu Ala Phe His
Pro Asp Ile Ile Leu Leu Pro Gly
225 230 235 240
Cys Gly Val Asp Phe Thr Tyr Ser
Arg Leu Ser Asn Leu Leu Gly Ile
245 250 255
Arg Lys Arg Gln Pro Phe Gln Glu
Gly Phe Arg Ile Thr Tyr Glu Asp
260 265 270
Leu Glu Gly Gly Asn Ile Pro Ala
Leu Leu Asp Val Glu Ala Tyr Gln
275 280 285
Asp Ser Leu Lys Glu Glu Glu Ala
Gly Glu Gly Ser Gly Gly Gly Ala
290 295 300
Gly Gln Glu Glu Gly Gly Ala Ser
Ser Glu Ala Ser Ala Asp Pro Ala
305 310 315 320
Ala Ala Ala Glu Ala Glu Ala Ala
Asp Pro Ala Met Val Val Glu Glu
325 330 335
Glu Lys Asp Met Asn Asp Glu Ala
Val Arg Gly Asp Thr Phe Ala Thr
340 345 350
Arg Gly Glu Glu Lys Lys Ala Glu
Ala Glu Ala Ala Ala Glu Glu Ala
355 360 365
Ala Ala Ala Ala Ala Ala Val Glu
Ala Ala Ala Glu Ala Glu Lys Pro
370 375 380
Pro Lys Glu Pro Val Ile Lys Pro
Leu Thr Glu Asp Ser Lys Lys Arg
385 390 395 400
Ser Tyr Asn Val Leu Lys Asp Ser
Thr Asn Thr Glu Tyr Arg Ser Trp
405 410 415
Tyr Leu Ala Tyr Asn Tyr Gly Asp
Pro Ala Thr Gly Val Arg Ser Trp
420 425 430
Thr Leu Leu Cys Thr Pro Asp Val
Thr Cys Gly Ser Glu Gln Val Tyr
435 440 445
Trp Ser Leu Pro Asp Met Met Gln
Asp Pro Val Thr Phe Arg Ser Thr
450 455 460
Arg Gln Val Ser Asn Phe Pro Val
Val Gly Ala Glu Leu Leu Pro Val
465 470 475 480
His Ser Lys Ser Phe Tyr Asn Asp
Gln Ala Val Tyr Ser Gln Leu Ile
485 490 495
Arg Gln Phe Thr Ser Leu Thr His
Val Phe Asn Arg Phe Pro Glu Asn
500 505 510
Gln Ile Leu Ala Arg Pro Pro Ala
Pro Thr Ile Thr Thr Val Ser Glu
515 520 525
Asn Val Pro Ala Leu Thr Asp His
Gly Thr Leu Pro Leu Arg Asn Ser
530 535 540
Ile Gly Gly Val Gln Arg Val Thr
Val Thr Asp Ala Arg Arg Arg Thr
545 550 555 560
Cys Pro Tyr Val Tyr Lys Ala Leu
Gly Ile Val Ser Pro Arg Val Leu
565 570 575
Ser Ser Arg Thr Phe
580
<210> 32
<211> 1323
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 32
atgtccaaaa agcgcgtccg ggtggatgat
gacttcgacc ccgtctaccc ctacgatgca 60
gacaacgcac cgaccgtgcc cttcatcaac
cctcccttcg tctcttcaga tggattccaa 120
gaaaagcccc tgggggtgtt gtccctgcga
ctggctgacc ccgtcaccac caagaacggg 180
gaaatcaccc tcaagctggg agagggggtg
gacctcgacg actcgggaaa actcatctcc 240
aaaaatgcca ccaaggccac tgcccctctc
agtatttcca acagcaccat ttcccttaac 300
atggatgccc ctctttacaa caacaatgga
aagttaggca taagaatagg agcacctcta 360
aaggtagtag acttactaaa cactttagct
gtagcctatg gatcgggtct aggtctcaag 420
aataatgccc ttacagttca gttagtttct
ccactcactt ttgataacaa aggcaatgta 480
aaaattaact tagggaatgg cccattaaca
gttgcggcaa accgactgag tgttacctgc 540
aaaagaggtt tatatgtcac tactacagga
gatgcactcg aaagcaacat aagctgggct 600
aaaggtataa gatttgaagg aaatgcaata
gcagcaaata ttggcaaagg gcttgaattt 660
ggtactacta gttcagagtc agatgtcagc
aatgcttatc ctatccaagt aaaactaggt 720
actggtctca cctttgacag cacaggtgca
attgtcgctt ggaacaaaga agatgacaaa 780
cttacactgt ggaccacagc cgatccatct
ccaaactgtc acatatattc tgacaaggat 840
gctaagctta cactctgctt gacaaagtgt
ggcagtcaga tactgggcac tgtttctctc 900
atagctgttg atactggtag cttaaatcca
ataacaggac aagtaaccac tgctcttgtt 960
tcacttaaat tcgatgccaa tggagttttg
caaaccagtt caacattgga caaagaatat 1020
tggaatttta gaaaaggaga tgtgacacct
gctgagccat atactaatgc tataggtttt 1080
atgcccaata taaaggcata tccgaaaaac
acaaattcag ctgcaaaaag tcacattgtg 1140
ggaaaagtat acctacatgg ggaagtaagc
aagccactag acttgataat tacatttaat 1200
gaaaccagta atgaaacctg tacctattgc
attaactttc agtggcagtg gggaactgac 1260
aaatataaaa atgaaacgct tgctgtcagt
tcattcacct tttcctacat tgcccaagaa 1320
taa 1323
<210> 33
<211> 1332
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 33
atgtccaaaa agcgcgtccg ggtggatgat
gacttcgacc ccgtctaccc ctacgatgca 60
gacaacgcac cgaccgtgcc cttcatcaac
ccccccttcg tctcttcaga tggattccaa 120
gagaagcccc tgggggtgtt gtccctgcga
ctggccgacc ccgtcaccac caagaacggg 180
gaaatcaccc tcaagctggg agagggggtg
gacctcgacg actcgggaaa actcatctcc 240
aaaaatgcca ccaaggccac tgcccctctc
agtatttcca acagcaccat ttcccttaac 300
atggctgccc ctttttacaa caacaatgga
acgttaagtc tcaatgtttc tacaccatta 360
gcagtatttc ccacttttaa cactttaggt
atcagtcttg gcaacggtct tcaaacttct 420
aataagttgc tggctgtaca gttaactcat
cctcttacat tcagctcaaa tagcatcaca 480
gtaaaaacag acaaaggact ctatattaat
tctagtggaa acagagggct tgaggctaac 540
ataagcctaa aaagaggact gatttttgat
ggtaatgcta ttgcaacata ccttggaagt 600
ggtttagact atggatccta tgatagcgat
ggaaaaacaa gacccatcat caccaaaatt 660
ggagcaggct tgaattttga ttctaataat
gccatggctg tgaagctagg cacaggttta 720
agttttgact ctgccggtgc cttaacagct
ggaaacaaag aggatgacaa gctaacactt 780
tggactacac ctgaccccag ccctaattgt
caattacttt cagacagaga tgccaaattt 840
accctatgtc ttacaaaatg cggtagtcaa
atactaggca ctgttgcagt agctgctgtt 900
actgtaagtt cagcactaaa tccaattaat
gacacagtaa aaagcgccat agtattcctt 960
agatttgact ctgacggtgt gctcatgtca
aactcatcaa tggtaggtga ttactggaac 1020
tttagggaag gacagaccac ccaaagtgtg
gcctatacaa atgctgtggg attcatgccc 1080
aatctaggtg catatcctaa aacccaaagc
aaaacaccaa aaaatagtat agtaagccag 1140
gtatatttaa atggagaaac tactatgcca
atgacactga caataacttt caatggcact 1200
gatgaaaaag acacaacacc tgtcagcact
tactctatga cttttacatg gcagtggact 1260
ggagactata aggacaagaa tattaccttt
gctaccaact cctttacttt ctcctacatg 1320
gcccaagaat aa 1332
<210> 34
<211> 1278
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 34
atgtccaaaa agcgcgtccg ggtggatgat
gacttcgacc ccgtctaccc ctacgatgca 60
gacaacgcac cgaccgtgcc cttcatcaac
ccccccttcg tctcttcaga tggattccaa 120
gagaagcccc tgggggtgtt gtccctgcga
ctggccgacc ccgtcaccac caagaacggg 180
gaaatcaccc tcaagctggg agagggggtg
gacctcgact cctcgggaaa actcatctcc 240
aacacggcca ccaaggccgc tgcccctctc
agtttttcca acaacaccat ttcccttaac 300
atggatcacc ccttttacac taaagatgga
aaattagcct tacaagtttc tccaccatta 360
aatatactga gaacaagcat tctaaacaca
ctagctttag gttttggatc aggtttagga 420
ctccgtggct ctgccttggc agtacagtta
gtctctccac ttacatttga tactgatgga 480
aacataaagc ttaccttaga cagaggtttg
catgttacaa caggagatgc aattgaaagc 540
aacataagct gggctaaagg tttaaaattt
gaagatggag ccatagcaac caacattgga 600
aatgggttag agtttggaag cagtagtaca
gaaacaggtg tcgatgatgc ttacccaatc 660
caagttaaac ttggatctgg ccttagcttt
gacagtacag gagccataat ggctggtaac 720
aaagaagacg ataaactcac tttgtggaca
acacctgatc catcaccaaa ctgtcaaata 780
ctcgcagaaa atgatgcaaa actaacactt
tgcttgacta aatgtggtag tcaaatactg 840
gccactgtgt cagtcttagt tgtaggaagt
ggaaacctaa accccattac tggcaccgta 900
agcagtgctc aggtgtttct acgttttgat
gcaaacggtg ttcttttaac agaacattct 960
acactaaaaa aatactgggg gtataggcag
ggagatagca tagatggcac tccatatgtc 1020
aatgctgtag gattcatgcc caatttaaaa
gcttatccaa agtcacaaag ttctactact 1080
aaaaataata tagtagggca agtatacatg
aatggagatg tttcaaaacc tatgcttctc 1140
actataaccc tcaatggtac tgatgacagc
aacagtacat attcaatgtc attttcatac 1200
acctggacta atggaagcta tgttggagca
acatttggag ctaactctta taccttctcc 1260
tacatcgccc aagaatga 1278
<210> 35
<211> 1278
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 35
atgtccaaaa agcgcgtccg ggtggatgat
gacttcgacc ccgtctaccc ctacgatgca 60
gacaacgcac cgaccgtgcc cttcatcaac
ccccccttcg tctcttcaga tggattccaa 120
gagaagcccc tgggggtgct gtccctgcga
ctggccgacc ccgtcaccac caagaacggg 180
gaaatcaccc tcaagctggg agaggggctg
gacctcgact cctcgggaaa actcatctcc 240
aacacggcca ccaaggccgc cgcccctctc
agtttttcca acaacaccat ttcccttaac 300
atggatcacc ccttttacac taaagatgga
aaattatcct tacaagtttc tccaccatta 360
aatatactga gaacaagcat tctaaacaca
ctagctttag gttttggatc aggtttagga 420
ctccgtggct ctgccttggc agtacagtta
gtctctccac ttacatttga tactgatgga 480
aacataaagc ttaccttaga cagaggtttg
catgttacaa caggagatgc aattgaaagc 540
aacataagct gggctaaagg tttaaaattt
gaagatggag ccatagcaac caacattgga 600
aatgggttag agtttggaag cagtagtaca
gaaacaggtg ttgatgatgc ttacccaatc 660
caagttaaac ttggatctgg ccttagcttt
gacagtacag gagccataat ggctggtaac 720
aaagaagacg ataaacttac tttgtggaca
acacctgatc catcaccaaa ctgtcaaata 780
ctcgcagaaa atgatgcaaa actaacactt
tgcttgacta aatgtggtag tcaaatactg 840
gccactgtgt cagtcttagt tgtaggaagt
ggaaacctaa accccattac tggcaccgta 900
agcagtgctc aggtgtttct acgttttgat
gcaaacggtg ttcttttaac agaacattct 960
acactaaaaa aatactgggg gtataggcag
ggagatagca tagatggcac tccatatacc 1020
aatgctgtag gattcatgcc caatttaaaa
gcttatccaa agtcacaaag ttctactact 1080
aaaaataata tagtagggca agtatacatg
aatggagatg tttcaaaacc tatgcttctc 1140
actataaccc tcaatggtac tgatgacagc
aacagtacat attcaatgtc attttcatac 1200
acctggacta atggaagcta tgttggagcg
acatttgggg ctaactctta taccttctca 1260
tacatcgccc aagaatga 1278
<210> 36
<211> 1329
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 36
atgtccaaaa agcgcgcgcg ggtggatgat
ggcttcgacc ccgtgtaccc ctacgatgca 60
gacaacgcac cgactgtgcc cttcatcaac
cctcccttcg tctcttcaga tggattccaa 120
gaaaagcccc tgggggtgtt gtccctgcgt
ctggccgacc ccgtcaccac caagaatggg 180
gctgtccccc tcaagctcgg ggagggggtg
gacctcgacg actcgggaaa actcatctcc 240
aaaaaatcca ccaaggccaa ttcccctctc
agtatttcca acaacaccat ttcccttaac 300
atggataccc ctttttatac caaagatgga
aaattaacca tgcaggtaac tgcaccatta 360
aagttagcaa acacggccat actaaacaca
ctagctatgg cctatggaaa tggtttaggt 420
ctaaacaaca atgctctcac tgttcaggta
acatctccac tcacatttga taatagcaaa 480
gtcaagatta acctagggaa tggaccacta
atggtatctg ctaacaagct ttcaatcaac 540
tgcttacggg gtctatatgt tgcccctaat
aataccggac tagaaaccaa cataagctgg 600
gcaaacgcaa tgcgctttga gggtaatgca
atggctgttt atatagacac aaataaaggc 660
ctacaatttg gcactactag cacagaaaca
ggtgtcacca atgcttaccc catacaagtc 720
aaacttggcg caggccttgc atttgatagc
acaggagcta ttgttgcttg gaacaaagaa 780
aatgacagcc tcactttgtg gactacacca
gatccctctc caaattgtaa aatagcatct 840
gaaaaggatg caaaactcac actttgcttg
acaaagtgtg gtagtcaaat cctaggcact 900
gtctccctat tagcagtcag tggcagcttg
gctcctatca caggggctgt tagtactgca 960
cttgtatcac tcaaattcaa tgctaatgga
gcccttttgg acaaatcaac tctgaacaaa 1020
gaatactgga actacagaca aggagatcta
attccaggta caccatatac acatgctgtg 1080
ggtttcatgc ctaacaaaaa agcctaccct
aaaaacacaa ctgcagcttc caagagccac 1140
attgtgggtg atgtgtattt agatggagat
gcagataagc ctttatctct tatcatcact 1200
ttcaatgaaa ctgatgatga aacctgtgat
tactgcatca actttcaatg gaaatgggga 1260
gctgatcaat ataaggataa gacactcgca
accagttcat tcaccttctc atacatcgcc 1320
caagaataa 1329
<210> 37
<211> 1731
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 37
atgaagcgcg ccaaaacgtc tgacgagacc
ttcaaccccg tgtaccccta tgacacggaa 60
aacgggcctc cctccgtccc tttcctcacc
cctcccttcg tgtcccccga cggatttcaa 120
gaaagccccc caggggtcct gtctctgcgc
ctgtcagagc ccctggtcac ttcccacggc 180
atgcttgccc tgaaaatggg aaatggcctc
tccctggatg acgccggcaa cctcacctct 240
caagatgtca ccaccgtcac ccctcccctc
aaaaaaacca agaccaacct cagcctccag 300
acctcagccc ccctgaccgt tagctctggg
tccctcaccg tcgcggccgc cgctccactg 360
gcggtggccg gcacctctct caccatgcaa
tctcaggccc ccttgacagt gcaagatgca 420
aaactcggcc tggccaccca gggacccctg
accgtgtctg aaggcaaact caccttgcag 480
acatcggctc cactgacggc cgctgacagc
agcactctca ctgttagtgc cacacctccc 540
ctcagcacaa gcaatggtag tttgagcatt
gacatgcagg ccccgattta taccaccaat 600
ggaaaactgg cacttaacat tggtgctccc
ctgcatgtgg tagacaccct aaatgcacta 660
actgtagtaa ctggccaggg tcttaccata
aatggaagag ccctgcaaac tagagtcacg 720
ggtgccctca gttatgacac agaaggcaac
atccaactgc aagccggagg gggtatgcgc 780
attgacaata atggccaact tatccttaat
gtagcttatc catttgatgc tcaaaacaac 840
ctcagcctta gacttggcca aggtccccta
attgttaact ctgcccacaa cttggatctt 900
aaccttaaca gaggccttta cttatttaca
tctggaaaca cgaaaaaact ggaagttaac 960
ataaaaacag ccaaaggtct attttacgat
ggcaccgcta tagcaatcaa tgcaggtgac 1020
gggctacagt ttgggtctgg ttcagataca
aatccattgc aaactaaact tggattgggg 1080
ctggaatatg actccaacaa agctataatc
actaaacttg gaactggcct aagctttgac 1140
aacacaggtg ccatcacagt aggcaacaaa
aatgatgaca agcttacctt gtggaccaca 1200
ccagacccct ccccaaactg cagaattaat
tcagaaaaag atgctaaact cacactagtt 1260
ttgactaaat gcggcagcca ggtgttagcc
agcgtttctg ttttatctgt aaaaggcagc 1320
cttgccccca tcagcggcac agtaactagc
gcccagattg ttttaagatt tgatgaaaac 1380
ggagttttat tgagcaattc ttctcttgac
ccccaatact ggaactatag aaaaggcgat 1440
tctacagaag gcactgcata tactaatgct
gtgggattta tgcccaacct cacagcatac 1500
cctaaaacac agagccagac tgctaaaagc
aacattgtaa gtcaagttta cttgaatggg 1560
gacaaaacaa aacccatgac cctaaccatc
accctcaatg gaactaatga aacaggggat 1620
gctacagtaa gcacatactc catgtcattt
tcatggaact ggaatggaag taattacatt 1680
aatgacacct tccaaaccaa ctcctttacc
ttctcctaca tcgcccaaga a 1731
<210> 38
<211> 2814
<212> DNA
<213> Adenoviridae - Mastadenovirus
<220>
<221> misc_feature
<222> (345)..(345)
<223> n is a, c, g, or t
<220>
<221> misc_feature
<222> (360)..(360)
<223> n is a, c, g, or t
<400> 38
atggccaccc catcgatgct gccccagtgg
gcgtacatgc acatcgccgg acaggacgct 60
tcggagtacc tgagtccggg tctggtgcag
ttcgcccgcg ccacagacac ctacttcagt 120
ctggggaaca agtttaggaa ccccacggtg
gcgcccacgc acgatgtgac caccgaccgc 180
agccagcggc tgacgctgcg cttcgtgccc
gtggaccgcg aggacaacac ctactcgtac 240
aaagtgcgct acacgctggc cgtgggcgac
aaccgcgtgc tggacatggc cagcacctac 300
tttgacatcc gcggcgtgct ggatcggggc
cccagcttca aaccntactc cggcaccgcn 360
tacaacagcc tggctcccaa gggagcgccc
aacacctcac agtggataac caaagacaat 420
ggaactgata agacatacag ttttggaaat
gctccagtca gaggattgga cattacagaa 480
gagggtctcc aaataggacc cgatgagtca
gggggtgaaa gcaagaaaat ttttgcagac 540
aaaacctatc agcctgaacc tcagcttgga
gatgaggaat ggcatgatac tattggagct 600
gaagacaagt atggaggcag agcgcttaaa
cctgccacca acatgaaacc ctgctatggg 660
tctttcgcca agccaactaa tgctaaggga
ggtcaggcta aaagcagaac caaggacgat 720
ggcactactg agcctgatat tgacatggcc
ttctttgacg atcgcagtca gcaagctagt 780
ttcagtccag aacttgtttt gtatactgag
aatgtcgatc tggacacccc ggatacccac 840
attatttaca aacctggcac tgatgaaaca
agttcttctt tcaacttggg tcagcagtcc 900
atgcccaaca gacccaacta catcggcttc
agagacaact ttatcggtct catgtactac 960
aacagtactg gcaatatggg tgtactagct
ggacaggcct cccagctgaa tgctgtggtg 1020
gacttgcagg acagaaacac tgaactgtcc
taccagctct tgcttgactc tctgggtgac 1080
agaaccaggt atttcagtat gtggaaccag
gcggtggaca gctacgaccc cgatgtgcgc 1140
attattgaaa atcacggtgt ggaggatgaa
ctacccaact attgcttccc tttgaatggt 1200
gtgggcttta cagatacatt ccagggaatt
aaggttaaaa ctaccaataa cggaacagca 1260
aatgctacag agtgggaatc tgatacctct
gtcaataatg ctaatgagat tgccaagggc 1320
aatcctttcg ccatggagat caacatccag
gccaacctgt ggcggaactt cctctacgcg 1380
aacgtggcgc tgtacctgcc cgactcctac
aagtacacgc cggccaacat cacgctgccc 1440
accaacacca acacctacga ttacatgaac
ggccgcgtgg tagcgccctc gctggtggac 1500
gcctacatca acatcggggc gcgctggtcg
ctggacccca tggacaacgt caaccccttc 1560
aaccaccacc gcaacgcggg cctgcgctac
cgctccatgc tcctgggcaa cgggcgctac 1620
gtgcccttcc acatccaggt gccccaaaag
tttttcgcca tcaagagcct cctgctcctg 1680
cccgggtcct acacctacga gtggaacttc
cgcaaggacg tcaacatgat cctgcagagc 1740
tccctcggca acgacctgcg cacggacggg
gcctccatcg ccttcaccag catcaacctc 1800
tacgccacct tcttccccat ggcgcacaac
accgcctcca cgctcgaggc catgctgcgc 1860
aacgacacca acgaccagtc cttcaacgac
tacctctcgg cggccaacat gctctacccc 1920
atcccggcca acgccaccaa cgtgcccatc
tccatcccct cgcgcaactg ggccgccttc 1980
cgcggctggt ccttcacgcg cctcaagacc
cgcgagacgc cctcgctggg ctccgggttc 2040
gacccctact tcgtctactc gggctccatc
ccctacctcg acggcacctt ctacctcaac 2100
cacaccttca agaaggtctc catcaccttc
gactcctccg tcagctggcc cggcaacgac 2160
cgcctcctga cgcccaacga gttcgaaatc
aagcgcaccg tcgacggaga ggggtacaac 2220
gtggcccagt gcaacatgac caaggactgg
ttcctggttc agatgctggc ccactacaac 2280
atcggctacc agggcttcta cgtgcccgag
ggctacaagg accgcatgta ctccttcttc 2340
cgcaacttcc agcccatgag ccgccaggtc
gtggacgagg tcaactacaa ggactaccag 2400
gccgtcaccc tggcctacca gcacaacaac
tcgggcttcg tcggctacct cgcgcccacc 2460
atgcgccagg gacagcccta ccccgccaac
tacccctacc cgctcatcgg caagagcgcc 2520
gtcgccagcg tcacccagaa aaagttcctc
tgcgaccggg tcatgtggcg catccccttc 2580
tccagcaact tcatgtccat gggcgcgctc
accgacctcg gccagaacat gctctacgcc 2640
aactccgccc acgcgctaga catgaatttc
gaagtcgacc ccatggatga gtccaccctt 2700
ctctatgttg tcttcgaagt cttcgacgtc
gtccgagtgc accagcccca ccgcggcgtc 2760
atcgaggccg tctacctgcg cacgcccttc
tcggccggta acgccaccac ctaa 2814
<210> 39
<211> 2814
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 39
atggccaccc catcgatgct gccccagtgg
gcgtacatgc acatcgccgg acaggacgct 60
tcggagtacc tgagtccggg tctggtgcag
tttgcccgcg ccacagacac ctacttcagt 120
ctggggaaca agtttaggaa ccccacggtg
gcgcccacgc acgatgtgac caccgaccgc 180
agccagcggc tgacgctgcg cttcgtgccc
gtggacggcg aggacaacac ctactcgtac 240
aaagtgcgct acacgctggc cgtgggcgac
aaccgcgtgc tggacatggc cagcacctac 300
tttgacatcc gcggcgtgct ggatcggggc
cccagcttca aaccctactc cggcaccgcc 360
tacaacgctc tggctcccaa gggagcgccc
aacacctcac agtggataac caaagacaat 420
ggaactgata agacatacag ttttggaaat
gctccagtca gaggattgga cattacagaa 480
gagggtctcc aaataagaac cgatgagtca
gggggtgaaa gcaagaaaat ttttgcagac 540
aaaacctatc agcctgaacc tcagcttgga
gatgaggaat ggcatgatac tattggagct 600
gaagacaagt atggaggcag agcgcttaaa
cctgccacca acatgaaacc ctgctatggg 660
tctttcgcca agccaactaa tgctaaggga
ggtcaggcta aaagcagaac caaggacgat 720
ggcactactg agcctgatat tgacatggcc
ttctttgacg atcgcagtca gcaagctagt 780
ttcagtccag aacttgtttt gtatactgag
aatgtcgatc tggacacccc ggatacccac 840
attatttaca aacctggcac tgatgaaaca
agttcttctt tcaacttggg tcagcagtcc 900
atgcccaaca gacccaacta cattgggttc
agagacaact ttatcgggct catgtactac 960
aacagcactg gcaatatggg tgtactggct
ggtcaggcct cccagctgaa tgctgtggtg 1020
gacttgcagg acagaaacac cgaactgtcc
taccagctct tgcttgactc tctgggtgac 1080
agaaccaggt atttcagtat gtggaatcag
gcggtggaca gttatgaccc cgatgtgcgc 1140
attattgaaa atcacggtgt ggaggatgaa
ctccccaact attgcttccc tttgaatggt 1200
gtgggcttta cagatacatt ccagggaatt
aaggttaaaa ctaccaataa cggaacagca 1260
aatgctacag agtgggaatc tgatacctct
gtcaataatg ctaatgagat tgccaagggc 1320
aatcctttcg ccatggagat caacatccag
gccaacctgt ggcggaactt cctctacgcg 1380
aacgtggcgc tgtacctgcc cgactcctac
aagtacacgc cggccaacat cacgctgccg 1440
accaacacca acacctacga ttacatgaac
ggccgcgtgg tggcgccctc gctggtggac 1500
gcctacatca acatcggggc gcgctggtcg
ctggacccca tggacaacgt caaccccttc 1560
aaccaccacc gaaacgcggg cctgcgatac
cgctccatgc tcctgggcaa cgggcgctac 1620
gtgcccttcc acatccaggt gccccaaaag
tttttcgcca tcaagagcct cctgctcctg 1680
cccgggtcct acacctacga gtggaacttc
cgcaaggacg tcaacatgat cctgcagagc 1740
tccctcggca acgacctgcg cacggacggg
gcttccatcg ccttcaccag catcaacctc 1800
tacgccacct tcttccccat ggcgcacaac
accgcctcca cgctcgaggc catgctgcgc 1860
aacgacacca acgaccagtc cttcaacgac
tacctctcgg cggccaacat gctctacccc 1920
atcccggcca acgccaccaa cgtgcccatc
tccatcccct cgcgcaactg ggccgccttc 1980
cgcggmtggt ccttcacgcg cctcaagacc
cgcgagacgc cctcgctagg ctccgggttc 2040
gacccctact tcgtctactc gggctccatc
ccctaccttg acggcacctt ctacctcaac 2100
cacaccttca agaaggtctc catcaccttc
gactcctccg tcagctggcc cggcaacgac 2160
cgcctcctga cgcccaacga gttcgaaatc
aagcgcaccg tcgacggaga ggggtacaac 2220
gtggcccagt gcaacatgac caaggactgg
ttcctggtcc agatgctggc ccactacaac 2280
atcggctacc agggcttcta cgtgcccgag
ggctacaagg accgcatgta ctccttcttc 2340
cgcaacttcc agcccatgag ccgccaggtc
gtggacgagg tcaactacaa ggactaccag 2400
gccgtcaccc tggcctacca gcacaacaac
tcgggcttcg tcggctacct cgcgcccacc 2460
atgcgccagg gacagcccta ccccgccaac
tacccctacc cgctcatcgg caagagcgcc 2520
gtcgccagcg tcacccagaa aaagttcctc
tgcgaccggg tcatgtggcg catccccttc 2580
tccagcaact tcatgtccat gggcgcgctc
accgacctcg gccaaaacat gctttacgcc 2640
aactccgccc acgcgctaga catgaatttc
gaagtcgacc ccatggatga gtccaccctt 2700
ctctatgttg tcttcgaagt cttcgacgtc
gtccgagtgc accagcccca ccgcggcgtc 2760
atcaaggccg tctacctgcg cacccccttc
tcggccggta acgccaccac ctaa 2814
<210> 40
<211> 2814
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 40
atggccaccc catcgatgct gccccagtgg
gcgtacatgc acatcgccgg acaggacgct 60
tcggagtacc tgagtccggg tctggtgcag
ttcgcccgcg ccacagacac ctacttcagt 120
ctggggaaca agtttaggaa ccccacggtg
gcacccacgc acgatgtgac caccgaccgc 180
agccagcggc tgacgctgcg cttcgtgccc
gtggaccgcg aggacaacac ctactcgtac 240
aaagtgcgct acacgctggc cgtgggcgac
aaccgcgtgc tggacatggc cagcacctac 300
tttgacatcc gcggcgtgct ggatcggggc
cccagcttca aaccctactc cggcaccgcc 360
tacaacagcc tggctcccaa gggagcgccc
aacacctcac agtggataac caaagacaat 420
ggaactgata agacatacag ttttggaaat
gctccagtca gaggattgga cattacagaa 480
gagggtctcc aaataggaac cgatgagtca
gggggtgaaa gcaagaaaat ttttgcagac 540
aaaacctatc agcctgaacc tcagcttgga
gatgaggaat ggcatgatac tattggagct 600
gaagacaagt atggaggcag agcgcttaaa
cctgccacca acatgaaacc ctgctatggg 660
tctttcgcca agccaactaa tgctaaggga
ggtcaggcta aaagcagaac caaggacgat 720
ggcactactg agcctgatat tgacatggcc
ttctttgacg atcgcagtca gcaagctagt 780
ttcagtccag aacttgtttt gtatactgag
aatgtcgatc tggacacccc ggatacccac 840
attatttaca aacctggcac tgatgaaaca
agttcttctt tcaacttggg tcagcagtcc 900
atgcccaaca gacccaacta cattggcttc
agagacaact ttatcgggct catgtactac 960
aacagcactg gcaatatggg tgtactggcc
ggtcaggcct cccagctgaa tgctgtggtg 1020
gacttgcagg acagaaacac tgaactgtcc
taccagctct tgcttgactc tctgggtgac 1080
agaaccaggt atttcagtat gtggaatcag
gcggtggaca gctatgaccc cgatgtgcgc 1140
attattgaaa atcacggtgt ggaggatgaa
ctccccaact attgcttccc tttgaatggt 1200
gtgggcttta cagatacatt ccagggaatt
aaggttaaaa ctacaaataa cggaacagca 1260
aatgctacag agtgggaatc tgatacctct
gtcaataatg ctaatgagat tgccaagggc 1320
aatcctttcg ccatggagat caacatccag
gccaacctgt ggcggaactt cctctacgcg 1380
aacgtggcgc tgtacctgcc cgactcctac
aagtacacgc cggccaacat cacgctgccc 1440
accaacacca acacctacga ttacatgaac
ggccgcgtgg tggcgccctc gctggtggac 1500
gcctacatca acatcggggc gcgctggtcg
ctggacccca tggacaacgt caaccccttc 1560
aaccaccacc gcaacgcggg cctgcgctac
cgctccatgc tcctgggcaa cgggcgctac 1620
gtgcccttcc acatccaggt gccccaaaag
tttttcgcca tcaagagcct cctgctcctg 1680
cccgggtcct acacctacga gtggaacttc
cgcaaggacg tcaacatgat cctgcagagc 1740
tccctcggca acgacctgcg cacggacggg
gcctccatcg ccttcaccag catcaacctc 1800
tacgccacct tcttccccat ggcgcacaac
accgcctcca cgctcgaggc catgctgcgc 1860
aacgacacca acgaccagtc cttcaacgac
tacctctcgg cggccaacat gctctacccc 1920
atcccggcca acgccaccaa cgtgcccatc
tccatcccct cgcgcaactg ggccgccttc 1980
cgcggatggt ccttcacgcg cctcaagacc
cgcgagacgc cctcgctcgg ctccgggttc 2040
gacccctact tcgtctactc gggctccatc
ccctacctcg acggcacctt ctacctcaac 2100
cacaccttca agaaggtctc catcaccttc
gactcctccg tcagctggcc cggcaacgac 2160
cgcctcctga cgcccaacga gttcgaaatc
aagcgcaccg tcgacggaga ggggtacaac 2220
gtggcccagt gcaacatgac caaggactgg
ttcctggtcc agatgctggc ccactacaac 2280
atcggctacc agggcttcta cgtgcccgag
ggctacaagg accgcatgta ctccttcttc 2340
cgcaacttcc agcccatgag ccgccaggtc
gtggacgagg tcaactacaa ggactaccag 2400
gccgtcaccc tggcctacca gcacaacaac
tcgggcttcg tcggctacct cgcgcccacc 2460
atgcgccagg gccagcccta ccccgccaac
tacccctacc cgctcatcgg caagagcgcc 2520
gtcgccagcg tcacccagaa aaagttcctc
tgcgaccggg tcatgtggcg catccccttc 2580
tccagcaact tcatgtccat gggcgcgctc
accgacctcg gccagaacat gctctacgcc 2640
aactccgccc acgcgctaga catgaatttc
gaagtcgacc ccatggatga gtccaccctt 2700
ctctatgttg tcttcgaagt cttcgacgtc
gtccgagtgc accagcccca ccgcggcgtc 2760
atcgaggccg tctacctgcg cacgcccttc
tcggccggca acgccaccac ctaa 2814
<210> 41
<211> 2814
<212> DNA
<213> Adenoviridae - Mastadenovirus
<220>
<221> misc_feature
<222> (297)..(297)
<223> n is a, c, g, or t
<220>
<221> misc_feature
<222> (360)..(360)
<223> n is a, c, g, or t
<220>
<221> misc_feature
<222> (887)..(887)
<223> n is a, c, g, or t
<400> 41
atggccaccc catcgatgct gccccagtgg
gcgtacatgc acatcgccgg acaggacgct 60
tcggagtacc tgagtccggg tctggtgcag
ttcgcccgcg ccacagacac ctacttcagt 120
ctggggaaca agtttaggaa ccccacggtg
gcgcccacgc acgatgtgac caccgaccgc 180
agccagcggc tgacgctgcg cttcgtgccc
gtggaccgcg aggacaacac ctactcgtac 240
aaagtgcgct acacgctggc cgtgggcgac
aaccgcgtgc tggacatggc cagcacntac 300
tttgacatcc gcggcgtgct ggatcggggc
cccagcttca aaccctactc cggcaccgcn 360
tacaacagcc tggctcccaa gggagcgccc
aacacctcac agtggataac caaagacaat 420
ggaactgata agacatacag ttttggaaat
gctccagtca gaggattgga cattacagaa 480
gagggtctcc aaataggaac cgatgagtca
gggggtaaaa gcaagaaaat ttttgcagac 540
aaaacctatc agcctgaacc tcagcttgga
gatgaggaat ggcatgatac tattggagct 600
gaagacaagt atggaggcag agcgcttaaa
cctgccacca acatgaaacc ctgctatggg 660
tctttcgcca agccaactaa tgctaaggga
ggtcaggcta aaagcagaac caaggacgat 720
ggcactactg agcctgatat tgacatggcc
ttttttgacg atcgcagtca gcaagctagt 780
ttcagtccag aacttgtttt gtatactgag
aatgtcgatc tggacacccc ggatacccac 840
attatttaca aacctggcac tgatgaaaca
agttcttctt tcaactnggg tcagcagtcc 900
atgcccaaca gacccaatta cattggcttc
agagacaact ttatcggact catgtactac 960
aacagcactg gcaatatggg tgtactggct
ggacaggcct cccagctgaa tgctgtggtg 1020
gacttgcagg acagaaacac cgaactgtcc
taccagctct tgcttgactc tctgggcgac 1080
agaaccaggt atttcagtat gtggaatcag
gcggtggaca gctatgaccc cgatgtgcgc 1140
attattgaaa atcacggtgt ggaggatgaa
cttcccaact attgcttccc tttgaatggt 1200
gtgggcttta cagatacatt ccagggaatt
aaggttaaaa ctaccaataa cggaacagca 1260
aacgctacag agtgggaatc tgatacctct
gtcaataatg ctaatgagat tgccaagggc 1320
aatcctttcg ccatggagat caacatccag
gccaacctgt ggcggaactt cctctacgcg 1380
aacgtggcgc tgtacctgcc cgactcctac
aagtacacgc cggccaacat cacgctgccc 1440
accaacacca acacctacga ttacatgaac
ggccgcgtgg tggcgccctc gctggtggac 1500
gcctacatca acatcggggc gcgctggtcg
ctggacccca tggacaacgt caaccccttc 1560
aaccaccacc gcaacgcggg cctgcgatac
cgctccatgc tcctgggcaa cgggcgctac 1620
gtgcccttcc acatccaggt gccccaaaag
tttttcgcca tcaagaacct cctgctcctg 1680
cccgggtcct acacctacga gtggaacttc
cgcaaggacg tcaacatgat cctgcagagc 1740
tccctcggca acgacctgcg cacggacggg
gcctccatcg ccttcaccag catcaacctc 1800
tacgccacct tcttccccat ggcgcacaac
accgcctcca cgctcgaggc catgctgcgc 1860
aacgacacca acgaccagtc cttcaacgac
tacctctcgg cggccaacat gctctacccc 1920
atcccggcca acgccaccaa cgtgcccatc
tccatcccct cgcgcaactg ggccgccttc 1980
cgcggatggt ccttcacgcg cctcaagacc
cgcgagacgc cctcgctcgg ctccgggttt 2040
gacccctact tcgtctactc gggctccatc
ccctacctcg acggcacctt ctacctcaac 2100
cacaccttca agaaggtctc catcaccttc
gactcctccg tcagctggcc cggcaacgac 2160
cgcctcctga cgcccaacga gttcgaaatc
aagcgcaccg tcgacggaga ggggtacaac 2220
gtggcccagt gcaacatgac caaggactgg
ttcctggtcc agatgctggc ccactacaac 2280
atcggctacc agggcttcta cgtgcccgag
ggctacaagg accgcatgta ctccttcttc 2340
cgcaacttcc agcccatgag ccgccaggtc
gtggacgagg tcaactacaa ggactaccag 2400
gccgtcaccc tggcctacca gcacaacaac
tcgggcttcg tcggctacct cgcgcccacc 2460
atgcgccagg gccagcccta ccccgccaac
tacccctacc cgctcatcgg caagagcgcc 2520
gttgccagcg tcacccagaa aaagttcctc
tgcgaccggg tcatgtggcg catccccttc 2580
tccagcaact tcatgtccat gggcgcgctc
accgacctcg gccagaacat gctctacgcc 2640
aactccgccc acgcgctaga catgaatttc
gaagtcgacc ccatggatga gtccaccctt 2700
ctctatgttg tcttcgaagt cttcgacgtc
gtccgagtgc accagcccca ccgcggcgtc 2760
atcgaggccg tctacctgcg cacgcccttc
tcggccggca acgccaccac ctaa 2814
<210> 42
<211> 2814
<212> DNA
<213> Adenoviridae - Mastadenovirus
<220>
<221> misc_feature
<222> (1612)..(1612)
<223> n is a, c, g, or t
<400> 42
atggccaccc catcgatgct gccccagtgg
gcgtacatgc acatcgccgg acaggacgct 60
tcggagtacc tgagtccggg tctggtgcag
ttcgcccgcg ccacagacac ctacttcagt 120
ctggggaaca agtttaggaa ccccacggtg
gcgcccacgc acgatgtgac caccgaccgc 180
agccagcgac tgacgctgcg cttcgtgccc
gtggaccgcg aggacaacac ctactcgtac 240
aaagtgcgct acacgctggc cgtgggcgac
aaccgcgtgc tggacatggc cagcacctac 300
tttgacatcc gcggcgtgct ggaccggggc
cctagcttca aaccctactc cggcaccgcc 360
tacaacagcc tggcccccaa gggagcaccc
aacacctcac agtgggtgac caaagacaat 420
gggactgata aaacatacag ctttggtaat
gctcctgtca gaggcttgga cattacagaa 480
gagggtctcc aaataggaac cgatgactct
tcaaccgaaa gcaagaaaat ttttgcagac 540
aaaacatatc agcctgaacc tcaggttgga
gatgaggaat ggcatgacac cattggggct 600
gaagacaaat atggaggcag agctcttaaa
cctgccacca acatgaaacc ctgttatggt 660
tcttttgcca agccaactaa tgctaaggga
ggtcaggcta aaaccagaac caaagacgat 720
ggaactaccg agcctgatat tgacatggcc
ttctttgacg atcgcagtca gcaggctagt 780
ttcagcccag aacttgtttt gtatactgag
aatgtggatt tggagacccc agatacccac 840
attatttaca aacccggtac tgatgaaaca
agttcttctt tcaacttggg tcagcaatcc 900
atgcccaaca gacccaacta cattggtttc
agagacaact ttattggctt gatgtactac 960
aacagcactg gcaacatggg tgtgctggct
ggtcaggctt ctcagctgaa tgccgtggtt 1020
gacttgcaag acagaaacac cgagctgtcc
taccagctct tgcttgactc tctgggcgac 1080
agaacccggt atttcagtat gtggaatcag
gcggtggaca gctatgatcc tgatgtgcgc 1140
attattgaaa accatggtgt ggaagatgaa
ctgccaaact attgcttccc tttaaatggt 1200
gtgggcttta cagacacatt ccagggaatt
aaggttaaaa ctaccaacaa cggtactgct 1260
aatgctacag agtgggaatc tgatacttct
gtcaataatg ccaatgagat tgccaagggt 1320
aatccattcg ccatggaaat caacatccaa
gccaacctgt ggaggaactt cctctatgcc 1380
aacgtggccc tgtacttgcc cgattcttac
aagtacacgc cggccaacgt caccctgccc 1440
accaacacca acacctacga gtacatgaac
ggccgggtgg tggcgccctc gctggtggac 1500
tcctacatca acatcggggc gcgctggtcg
ctggacccca tggacaacgt caatcccttc 1560
aaccaccacc gcaatgcggg gctgcgctac
cgctccatgc tcctgggcaa cnggcgcttc 1620
gtgcccttcc acatccaggt gccccagaaa
tttttcgcca tcaagagcct cctgctcctg 1680
cccgggtcct acacctacga gtggaacttc
cgcaaggacg tcaacatgat cctgcagagc 1740
tccctcggca acgacctgcg cacggacggg
gcctccatct ccttcaccag catcaacctc 1800
tacgccacct tcttccccat ggcgcacaac
acggcctcca ctctcgaggc catgctgcgc 1860
aacgacacca acgaccagtc cttcaacgac
tacctctcgg cggccaacat gctctacccc 1920
atcccggcca acgccaccaa cgtgcccatc
tccatcccct cgcgcaactg ggccgccttc 1980
cgcggctggt ccttcacgcg cctcaagacc
aaggagacgc cctcgctggg ctccgggttc 2040
gacccctact tcgtctactc gggctccatc
ccctacctcg acggcacctt ctacctcaac 2100
cacaccttca agaaggtctc catcaccttc
gactcctccg tcagctggcc cggcaacgac 2160
cggctcctga cgcccaacga gttcgaaatc
aagcgcaccg tcgacggcga gggatacaac 2220
gtggcccagt gcaacatgac caaggactgg
ttcctggtcc agatgctggc ccactacaac 2280
atcggctacc agggcttcta cgtgcccgag
ggctacaagg accgcatgta ctccttcttc 2340
cgcaacttcc agcccatgag ccgccaggtg
gtggacgagg tcaactacaa ggactaccag 2400
gccgtcaccc tggcctacca gcacaacaac
tcgggcttcg tcggctacct cgcgcccacc 2460
atgcgtcagg gccagcccta ccccgccaac
tacccctacc cgctcatcgg caagagcgcc 2520
gtcaccagcg tcacccagaa aaagttcctc
tgcgaccgcg tcatgtggcg catccccttc 2580
tccagcaact tcatgtccat gggcgcgctc
accgacctcg gccagaacat gctctatgcc 2640
aactccgccc acgcgctaga catgaatttc
gaagtcgacc ccatggatga gtccaccctt 2700
ctctatgttg tcttcgaagt cttcgacgtc
gtccgagtgc accagcccca ccgcggcgtc 2760
atcgaggccg tctacctgcg cacccccttc
tcggccggta acgccaccac ctaa 2814
<210> 43
<211> 2886
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 43
atggcgaccc catcgatgat gccgcagtgg
tcgtacatgc acatctcggg ccaggacgcc 60
tcggagtacc tgagccccgg gctggtgcag
ttcgcccgcg ccaccgacag ctacttcagc 120
ctgagtaaca agtttaggaa ccccacggtg
gcgcccacgc acgatgtgac caccgaccgg 180
tcccagcgcc tgacgctgcg gttcatcccc
gtggaccgcg aggacaccgc gtactcttac 240
aaggcgcggt tcaccctggc cgtgggcgac
aaccgcgtgc tggacatggc ctccacctac 300
tttgacatcc gcggcgtgct ggacaggggc
cccaccttca agccctactc cggcaccgcc 360
tacaactccc tggcccccaa gggcgccccc
aactcctgcg agtgggagca agtggagcca 420
gctgaagagg cagcagaaaa tgaagatgaa
gaagaagaag aggatgttgt tgatcctcag 480
gaacaggagc ccactactaa aacacatgta
tatgctcaag ctcccctttc tggcgagaaa 540
attaccaaag atggtctgca aataggaact
gaggctacgg cagcaggagg cactaaagac 600
ttatttgcag accctacatt ccagccagaa
ccccaagttg gcgaatctca gtggaatgag 660
gcggatgcta cagcagctgg aggtagagtg
ctcaaaaaga ccactcccat gaaaccttgc 720
tatggctcat atgcccgccc cacaaatgcc
aatgggggcc aaggtgtgct aaaggcaaat 780
gcccagggag tgctcgagtc tcaggttgag
atgcagttct tttccacttc tacaaatgcc 840
acaaacgagc aaaacaacat ccagcccaaa
ttggtgctgt acagcgagga tgtgcatatg 900
gagaccccag acacacacat ctcctacaag
cctacaaaaa gcgatgataa ttcaaaagtc 960
atgctgggtc agcagtccat gcccaacagg
ccaaattaca tcgccttcag agacaacttt 1020
atcgggctca tgtattataa cagcactggc
aacatggggg tgctggcagg tcaggcctca 1080
cagttgaatg cagtggtgga cctgcaagac
agaaacacag aactgtccta ccagctcttg 1140
cttgattcca tgggagacag aaccagatac
ttttccatgt ggaatcaggc cgtggacagt 1200
tatgacccag atgtcagaat tattgaaaat
catggaaccg aagatgagct gcccaactat 1260
tgtttccctc tgggaggcat agggataact
gacacttacc aggccattaa gactaatggc 1320
aatggggcag gagatcaagc caccacgtgg
cagaaagact cacaatttgc agaccgcaac 1380
gaaatagggg tgggaaacaa cttcgccatg
gagatcaacc tcagtgccaa cctgtggagg 1440
aacttcctct actccaacgt ggccctgtac
ctgccagaca agcttaagta caacccctcc 1500
aacgtggaaa tctctgacaa ccccaacacc
tacgactaca tgaacaagcg agtggtggcc 1560
ccggggctgg tggactgcta catcaacctg
ggcgcgcgct ggtccctgga ctacatggac 1620
aacgtcaacc ccttcaacca ccaccgcaat
gcgggcctgc gctaccgctc catgcttctg 1680
ggcaacgggc gctacgtgcc cttccacatc
caggtgcccc agaagttctt tgccatcaag 1740
aacctcctcc tcctgccggg ctcctacacc
tacgagtgga acttcaggaa ggatgtcaac 1800
atggtcctgc agagctctct gggcaacgac
ctcagggtcg acggggccag catcaagttc 1860
gagagcatct gcctctacgc caccttcttc
cccatggccc acaacacggc ctccacgctc 1920
gaggccatgc tcaggaacga caccaacgac
cagtccttca acgactacct ctccgccgcc 1980
aacatgctct accccatccc cgccaacgcc
accaacgtcc ccatctccat cccctcgcgc 2040
aactgggcgg ccttccgcgg ctgggccttc
acccgcctta agaccaagga gaccccctcc 2100
ctgggctcgg gtttcgaccc ctactacacc
tactcgggct ccatacccta cctggacgga 2160
accttctacc tcaaccacac tttcaagaag
gtctcggtca ccttcgactc ctcggtcagc 2220
tggccgggca acgaccgcct gctcaccccc
aacgagttcg agatcaagcg ctcggtcgac 2280
ggggagggct acaacgtagc ccagtgcaac
atgaccaagg actggttcct catccagatg 2340
ctggccaact acaacatcgg ctatcagggc
ttctacatcc cagagagcta caaggacagg 2400
atgtactcct tctttaggaa cttccagccc
atgagccggc aggtggtgga cgaaaccaag 2460
tacaaggact accagcaggt gggcatcatc
caccagcaca acaactcggg cttcgtgggc 2520
tacctcgccc ccaccatgcg cgagggacag
gcctaccccg ccaacttccc ctacccgctc 2580
attggcaaga ccgcggtcga cagcatcacc
cagaaaaagt tcctctgcga ccgcaccctc 2640
tggcgcatcc ccttctccag caacttcatg
tccatgggtg cgctcacgga cctgggccag 2700
aacctgctct atgccaactc cgcccacgcg
ctcgacatga ccttcgaggt cgaccccatg 2760
gacgagccca cccttctcta tgttctgttc
gaagtctttg acgtggttcg ggtccaccag 2820
ccgcaccgcg gcgtcatcga gaccgtgtac
ctgcgcacgc ccttctcggc cggcaacgcc 2880
accacc 2886
<210> 44
<211> 1596
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 44
atgatgaggc gcgtgtaccc ggagggtcct
cctccctcgt acgagagcgt gatgcagcag 60
gcggtggcgg cggcgatgca gcccccgctg
gaggcgcctt acgtgccccc gcggtacctg 120
gcgcctacgg aggggcggaa cagcattcgt
tactcggagc tggcaccctt gtacgatacc 180
acccggttgt acctggtgga caacaagtcg
gcggacatcg cctcgctgaa ctaccagaac 240
gaccacagca acttcctgac caccgtggtg
cagaacaacg atttcacccc cacggaggcc 300
agcacccaga ccatcaactt tgacgagcgc
tcgcggtggg gcggccagct gaaaaccatc 360
atgcacacca acatgcccaa cgtgaacgag
ttcatgtaca gcaacaagtt caaggcgcgg 420
gtgatggtct cgcgcaagac ccccaacggg
gtcgcggtag gggatgatta tgatggtggt 480
caggacgagc tgacctacga gtgggtggag
tttgagctgc ccgagggcaa cttctcggtg 540
accatgacca tcgatctgat gaacaacgcc
atcatcgaca actacttggc ggtggggcgg 600
cagaacgggg tgctggagag cgacatcggc
gtgaagttcg acacgcgcaa cttccggctg 660
ggctgggacc ccgtgaccga gctggtgatg
ccgggcgtgt acaccaacga ggccttccac 720
cccgacattg tcctgctgcc cggctgcggc
gtggacttca ccgagagccg cctcagcaac 780
ctgctgggca tccgcaagcg gcagcccttc
caggagggct tccagatcct gtacgaggac 840
ctggaggggg gcaacatccc cgcgctcttg
gatgtcgaag cctacgagaa aagcaaggag 900
gagagcgccg ccgcggcgac cgcagccgta
gccaccgcct ctaccgaggt gcggggcgat 960
aattttgcta gcgccgcagc agtggccgag
gcggctgaaa ccgaaagtaa gatagtgatc 1020
cagccggtgg agaaggacag caaggacagg
agctacaacg tgctcgcgga caagaaaaac 1080
accgcctacc gcagctggta cctggcctac
aactacggcg accccgagaa gggcgtgcgc 1140
tcctggacgc tgctcaccac ctcggacgtc
acctgcggcg tggagcaagt ctactggtcg 1200
ctgcccgaca tgatgcaaga cccggtcacc
ttccgctcca cgcgtcaagt tagcaactac 1260
ccggtggtgg gcgccgagct cctgcccgtc
tactccaaga gcttcttcaa cgagcaggcc 1320
gtctactcgc agcagctgcg cgccttcacc
tcgctcacgc acgtcttcaa ccgcttcccc 1380
gagaaccaga tcctcgtccg cccgcccgcg
cccaccatta ccaccgtcag tgaaaacgtt 1440
cctgctctca cagatcacgg gaccctgccg
ctgcgcagca gtatccgggg agtccagcgc 1500
gtgaccgtca ctgacgccag acgccgcacc
tgcccctacg tctacaaggc cctgggcgta 1560
gtcgcgccgc gcgtcctctc gagccgcacc
ttctaa 1596
<210> 45
<211> 1626
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 45
atgatgaggc gcgtgtaccc ggagggtcct
cctccctcgt acgagagcgt gatgcagcag 60
gcggtggcgg tggcgatgca gcccccgctg
gaggcgcctt acgtgccccc gcggtacctg 120
gcgcctacgg aggggcggaa cagcattcgt
tactcggagc tggcaccctt gtacgatacc 180
acccggttgt acctggtgga caacaagtcg
gcggacatcg cctcgctgaa ctaccagaac 240
gaccacagca acttcctgac caccgtggtg
cagaacaacg atttcacccc cacggaggcc 300
agcacccaga ccatcaactt tgacgagcgc
tcgcggtggg gcggccagct gaaaaccatc 360
atgcacacca acatgcccaa cgtgaacgag
ttcatgtaca gcaacaagtt caaggcgcgg 420
gtgatggtct cgcgcaagac ccccaacggg
gtgacggtag gggatgatta tgatggtagt 480
caggacgagc tgacctacga gtgggtggag
tttgagctgc ctgagggcaa cttctcggtg 540
accatgacca tcgatctgat gaacaacgcc
atcatcgaca actacttggc ggtggggcgg 600
cagaacgggg tgctggaaag cgacatcggc
gtgaagttcg acacgcgcaa cttccggctg 660
ggctgggacc ccgtgaccga gctggtgatg
ccgggcgtgt acaccaacga ggccttccac 720
cccgacatcg tcctgctgcc cggctgcggc
gtggacttca ccgagagccg cctcagcaac 780
ctgctgggca tccgcaagcg gcagcccttc
caggagggct tccagatcct gtacgaggac 840
ctggaggggg gcaacatccc cgcgctcttg
gatgtcgaag cctatgagaa aagcaaggag 900
gatagcgccg cagcgacgac cgcagccgtg
gctactgccg cgaccaccga tgcagatgca 960
actactacca ggggcgatac atttgccacc
caggcggagg aagcagccgc cctagcggcg 1020
accgatgata gtgaaagtaa gatagtcatc
aagccggtgg agaaggacag caaggacagg 1080
agctacaacg tgctcgcgga caagaaaaac
accgcctacc gcagctggta cctggcctac 1140
aactacggcg accccgagaa gggcgtgcgc
tcctggacgc tgctcaccac ctcggacgtc 1200
acctgcggcg tggagcaagt ctactggtcg
ctgcccgaca tgatgcaaga cccggtcacc 1260
ttccgctcca cgcgtcaagt tagcaactac
ccggtggtgg gcgccgagct cctgcccgtc 1320
tactccaaga gcttcttcaa cgagcaggcc
gtctactcgc agcagctgcg cgccttcacc 1380
tcgctcacgc acgtcttcaa ccgcttcccc
gagaaccaga tcctcgttcg cccgcccgcg 1440
cccaccatta ccaccgtcag tgaaaacgtt
cctgctctca cagatcacgg gaccctgccg 1500
ctgcgcagca gtatccgggg agtccagcgc
gtgaccgtca ctgacgccag acgccgcacc 1560
tgcccctacg tctacaaggc cctgggcgta
gtcgcgccgc gcgtcctctc gagccgcacc 1620
ttctaa 1626
<210> 46
<211> 1599
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 46
atgatgaggc gcgtgtaccc ggagggtcct
cctccctcgt acgagagcgt gatgcagcag 60
gcggtggcgg cggcgatgca gcccccgctg
gaggcgcctt acgtgccccc gcggtacctg 120
gcgcctacgg aggggcggaa cagcattcgt
tactcggagc tggcaccctt gtacgatacc 180
acccggttgt acctggtgga caacaagtcg
gcggacatcg cctcgctgaa ctaccagaac 240
gaccacagca acttcctgac caccgtggtg
cagaacaacg atttcacccc cacggaggcc 300
agcacccaga ccatcaactt tgacgagcgc
tcgcggtggg gcggccagct gaaaaccatc 360
atgcacacca acatgcccaa cgtgaacgag
ttcatgtaca gcaacaagtt caaggcgcgg 420
gtgatggtct cgcgcaagac ccccaacggg
gtcacagtaa cagatggtag tcaggacgag 480
ctgacctacg agtgggtgga gtttgagctg
cccgagggca acttctcggt gaccatgacc 540
atcgatctga tgaacaacgc catcatcgac
aactacttgg cggtggggcg gcagaacggg 600
gtgctggaga gcgacatcgg cgtgaagttc
gacacgcgca acttccggct gggctgggac 660
cccgtgaccg agctggtgat gccgggcgtg
tacaccaacg aggccttcca ccccgacatc 720
gtcctgctgc ccggctgcgg cgtggacttc
accgagagcc gcctcagcaa cctgctgggc 780
atccgcaagc ggcagccctt ccaggagggc
ttccagatcc tgtacgagga cctggagggg 840
ggcaacatcc ccgcgctctt ggatgtcgaa
gcctacgaga aaagcaagga ggatagcacc 900
gccgtggcta ccgccgcgac tgtggcagat
gccactgtca ccaggggcga tacattcgcc 960
acccaggcgg aggaagcagc cgccctagcg
gcgaccgatg atagtgaaag taagatagtt 1020
atcaagccgg tggagaagga cagcaaggac
aggagctaca acgttctatc ggatggaaag 1080
aacaccgcct accgcagctg gtacctggcc
tacaactacg gcgaccccga gaagggcgtg 1140
cgctcctgga cgctgctcac cacctcggac
gtcacctgcg gcgtggagca agtctactgg 1200
tcgctgcccg acatgatgca agacccggtc
accttccgct ccacgcgtca agttagcaac 1260
tacccggtgg tgggcgccga gctcctgccc
gtctactcca agagcttctt caacgagcag 1320
gccgtctact cgcagcagct gcgcgccttc
acctcgctca cgcacgtctt caaccgcttc 1380
cccgagaacc agatcctcgt ccgcccgccc
gcgcccacca ttaccaccgt cagtgaaaac 1440
gttcctgctc tcacagatca cgggaccctg
ccgctgcgca gcagtatccg gggagtccag 1500
cgcgtgaccg tcactgacgc cagacgccgc
acctgcccct acgtctacaa ggccctgggc 1560
gtagtcgcgc cgcgcgtcct ctcgagccgc
accttctaa 1599
<210> 47
<211> 1587
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 47
atgatgaggc gcgtgtaccc ggagggtcct
cctccctcgt acgagagcgt gatgcagcag 60
gcggtggcgg cggcgatgca gcccccgctg
gaggcgcctt acgtgccccc gcggtacctg 120
gcgcctacgg aggggcggaa cagcattcgt
tactcggagc tggcaccctt gtacgatacc 180
acccggttgt acctggtgga caacaagtcg
gcggacatcg cctcgctgaa ctaccagaac 240
gaccacagca acttcctgac caccgtggtg
cagaacaacg atttcacccc cacggaggcc 300
agcacccaga ccatcaactt tgacgagcgc
tcgcggtggg gcggccagct gaaaaccatc 360
atgcacacca acatgcccaa cgtgaacgag
ttcctgtaca gcaacaagtt caaggcgcgg 420
gtgatggtct cgcgcaagac ccccaacggg
gtcacagtaa cagatggtag tcaggacgag 480
ctgacctacg agtgggtgga gtttgagctg
cccgagggca acttctcggt gaccatgacc 540
atcgatctga tgaacaacgc cattatcgac
aattacttgg cggtggggcg gcagaacggg 600
gtgctggaga gcgacatcgg cgtgaagttc
gacacgcgca acttcaggct cggttgggac 660
cccgtgaccg agctggtcat gccgggcgtg
tacaccaacg aggccttcca ccccgacatc 720
gtcctgctgc ccggctgcgg cgtggacttc
accgagagcc gcctcagcaa cctgctgggc 780
attcgcaaga ggcagccctt ccaggagggt
ttccagatca tgtacgagga tctggagggg 840
ggcaacatcc ccgcgctcct ggatgtcgag
gcctacgaga aaagcaagga ggatagcgcc 900
gccgcggcga ccgcagccgt ggccaccgcc
tctaccgagg tgcggggcga taattttgct 960
agcgccgcgg cagtggccga ggcggctgaa
accgaaagta agatagtgat ccagccggtg 1020
gagaaggaca gcaaggacag gagctacaac
gtgctcgcgg acaagaaaaa caccgcctac 1080
cgcagctggt acctggccta caactacggc
gaccccgaga agggcgtgcg ctcctggacg 1140
ctgctcacca cctcggacgt cacctgcggc
gtggagcaag tctactggtc gctgcccgac 1200
atgatgcaag acccggtcac cttccgctcc
acgcgtcaag ttagcaacta cccggtggtg 1260
ggcgccgagc tcctgcccgt ctactccaag
agcttcttca acgagcaggc cgtctactcg 1320
cagcagctgc gcgccttcac ctcgctcacg
cacgtcttca accgcttccc cgagaaccag 1380
atcctcgtcc gcccgcccgc gcccaccatt
accaccgtca gtgaaaacgt tcctgctctc 1440
acagatcacg ggaccctgcc gctgcgcagc
agtatccggg gagtccagcg cgtgaccgtc 1500
actgacgcca gacgccgcac ctgcccctac
gtctacaagg ccctgggcgt agtcgcgccg 1560
cgcgtcctct cgagccgcac cttctaa 1587
<210> 48
<211> 1608
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 48
atgatgaggc gtgcgtaccc ggagggtcct
cctccctcgt acgagagcgt gatgcagcag 60
gcgatggcgg cggcggcggc gatgcagccc
ccgctggagg ctccttacgt gcccccgcgg 120
tacctggcgc ctacggaggg gcggaacagc
attcgttact cggagctggc acccttgtac 180
gataccaccc ggttgtacct ggtggacaac
aagtcggcgg acatcgcctc gctgaactac 240
cagaacgacc acagcaactt cctgaccacc
gtggtgcaga acaatgactt cacccccacg 300
gaggccagca cccagaccat caactttgac
gagcgctcgc ggtggggcgg ccagctgaaa 360
accatcatgc acaccaacat gcccaacgtg
aacgagttca tgtacagcaa caagttcaag 420
gcgcgggtca tggtctcccg caagaccccc
aacggggtga cagtgacaga ggattatgat 480
ggtagtcagg atgagctgaa atacgagtgg
gtggagtttg agctgcccga aggcaacttc 540
tcggtgacca tgactatcga cctgatgaac
aacgccatca tcgacaatta cttggcggtg 600
gggcggcaga acggggtgct ggagagcgac
atcggcgtga agttcgacac taggaacttc 660
aggctgggct gggaccccgt gaccgagctg
gtcatgcccg gggtgtacac caacgaggcc 720
ttccatcccg atattgtctt gctgcccggc
tgcggggtgg acttcaccga gagccgcctc 780
agcaacctgc tgggcattcg caagaggcag
cccttccagg agggcttcca gatcatgtac 840
gaggatctgg aggggggtaa catccccgcg
ctcctggatg tcgacgccta tgagaaaagc 900
aaggaggaga gcgccgccgc ggcgaccgca
gccgtagcca ccgcctctac cgaggtcagg 960
ggcgataatt ttgctagcgc cgcagcagtg
gcagcggccg aggcggctga aaccgaaagt 1020
aagatagtca ttcagccggt ggagaaggat
agcaaagaca ggagctacaa cgtgctgccg 1080
gacaagataa acaccgccta ccgcagctgg
tacctggcct acaactatgg cgaccccgag 1140
aagggcgtgc gctcctggac gctgctcacc
acctcggacg tcacctgcgg cgtggagcaa 1200
gtctactggt cgctgcccga catgatgcaa
gacccggtca ccttccgctc cacgcgtcaa 1260
gttagcaact acccggtggt gggcgccgag
ctcctgcccg tctactccaa gagcttcttc 1320
aacgagcagg ccgtctactc gcagcagctg
cgcgccttca cctcgctcac gcacgtcttc 1380
aaccgcttcc ccgagaacca gatcctcgtc
cgcccgcccg cgcccaccat taccaccgtc 1440
agtgaaaacg ttcctgctct cacagatcac
gggaccctgc cgctgcgcag cagtatccgg 1500
ggagtccagc gcgtgaccgt tactgacgcc
agacgccgca cctgccccta cgtctacaag 1560
gccctgggca tagtcgcgcc gcgcgtcctc
tcgagccgca ccttctaa 1608
<210> 49
<211> 1743
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 49
atgcggcgcg cggcgatgta ccacgaggga
cctcctccct cttatgagag cgtggtgggc 60
gcggcggcgg cctctccctt tgcgtcgcag
ctggagccgc cgtacgtgcc tccgcggtac 120
ctgcggccta cggggggaag aaacagcatc
cgttactcgg agctggcgcc cctgtacgac 180
accacccggg tgtacctggt ggacaacaag
tcggcggacg tggcctccct gaactaccag 240
aacgaccaca gcaatttttt gaccacggtc
atccagaaca atgactacac cccgagcgag 300
gccagcaccc agaccatcaa tctggatgac
cggtcgcact ggggcggcga cctgaaaacc 360
atcctgcaca ccaacatgcc caacgtgaac
gagttcatgt tcaccaataa gttcaaggcg 420
cgggtgatgg tgtcgcgctc gcacaccaag
gacgaccggg tggagctgaa gtacgagtgg 480
gtagagttcg agctgcccga gggcaactac
tcggagacca tgaccataga cctgatgaac 540
aacgcgatcg tggagcacta tctgaaagtg
ggcaggcaga acggggtcct ggagagcgac 600
atcggggtca agttcgacac caggaacttc
cgcctggggc tggacccggt caccgggctg 660
gttatgcccg gggtctacac caacgaggcc
ttccaccccg acatcatcct gctgcccggc 720
tgcggggtgg acttcaccta cagccgcctg
agcaacctgc tgggcatccg caagcggcag 780
cccttccagg agggcttcag gatcacctac
gaggacctgg aggggggcaa catccccgcg 840
ctcctggatg tggaggccta ccaggatagc
ttgaaggaag aagaggcggg agagggcagc 900
ggcggtggcg ccggtcagga ggagggcggg
gcctcctctg aggcctctgc ggacccagcc 960
gctgccgccg aggcggaggc ggccgacccc
gcgatggtgg tagaggaaga gaaggatatg 1020
aacgacgagg cggtgcgcgg cgacaccttt
gccactcggg gggaggagaa gaaagcggag 1080
gccgaggccg cggcagagga ggcggcagca
gcggcggcgg cagtagaggc ggcggccgag 1140
gcggagaagc cccccaagga gcccgtgatt
aagcccctga ccgaagatag caagaagcgc 1200
agttacaacg tgctcaagga cagcaccaac
accgagtacc gcagctggta cctggcctac 1260
aactacggcg acccggcgac gggggtgcgc
tcctggaccc tgctgtgtac gccggacgtg 1320
acctgcggct cggagcaggt gtactggtcg
ctgcccgaca tgatgcaaga ccccgtgacc 1380
ttccgctcca cgcggcaggt cagcaacttc
ccggtggtgg gcgccgagct gctgcccgtg 1440
cactccaaga gcttctacaa cgaccaggcc
gtctactccc agctcatccg ccagttcacc 1500
tctctgaccc acgtgttcaa tcgctttcct
gagaaccaga ttctggcgcg cccgcccgcc 1560
cccaccatca ccaccgtcag tgaaaacgtt
cctgctctca cagatcacgg gacgctaccg 1620
ctgcgcaaca gcatcggagg agtccagcga
gtgaccgtaa ctgacgccag acgccgcacc 1680
tgcccctacg tttacaaggc cctgggcata
gtctcgccgc gcgtcctttc cagccgcact 1740
ttt 1743
<210> 50
<211> 577
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 50
Met Lys Arg Ala Lys Thr Ser Asp
Glu Thr Phe Asn Pro Val Tyr Pro
1 5 10 15
Tyr Asp Thr Glu Asn Gly Pro Pro
Ser Val Pro Phe Leu Thr Pro Pro
20 25 30
Phe Val Ser Pro Asp Gly Phe Gln
Glu Ser Pro Pro Gly Val Leu Ser
35 40 45
Leu Arg Leu Ser Glu Pro Leu Val
Thr Ser His Gly Met Leu Ala Leu
50 55 60
Lys Met Gly Asn Gly Leu Ser Leu
Asp Asp Ala Gly Asn Leu Thr Ser
65 70 75 80
Gln Asp Val Thr Thr Val Thr Pro
Pro Leu Lys Lys Thr Lys Thr Asn
85 90 95
Leu Ser Leu Gln Thr Ser Ala Pro
Leu Thr Val Ser Ser Gly Ser Leu
100 105 110
Thr Val Ala Ala Ala Ala Pro Leu
Ala Val Ala Gly Thr Ser Leu Thr
115 120 125
Met Gln Ser Gln Ala Pro Leu Thr
Val Gln Asp Ala Lys Leu Gly Leu
130 135 140
Ala Thr Gln Gly Pro Leu Thr Val
Ser Glu Gly Lys Leu Thr Leu Gln
145 150 155 160
Thr Ser Ala Pro Leu Thr Ala Ala
Asp Ser Ser Thr Leu Thr Val Ser
165 170 175
Ala Thr Pro Pro Leu Ser Thr Ser
Asn Gly Ser Leu Ser Ile Asp Met
180 185 190
Gln Ala Pro Ile Tyr Thr Thr Asn
Gly Lys Leu Ala Leu Asn Ile Gly
195 200 205
Ala Pro Leu His Val Val Asp Thr
Leu Asn Ala Leu Thr Val Val Thr
210 215 220
Gly Gln Gly Leu Thr Ile Asn Gly
Arg Ala Leu Gln Thr Arg Val Thr
225 230 235 240
Gly Ala Leu Ser Tyr Asp Thr Glu
Gly Asn Ile Gln Leu Gln Ala Gly
245 250 255
Gly Gly Met Arg Ile Asp Asn Asn
Gly Gln Leu Ile Leu Asn Val Ala
260 265 270
Tyr Pro Phe Asp Ala Gln Asn Asn
Leu Ser Leu Arg Leu Gly Gln Gly
275 280 285
Pro Leu Ile Val Asn Ser Ala His
Asn Leu Asp Leu Asn Leu Asn Arg
290 295 300
Gly Leu Tyr Leu Phe Thr Ser Gly
Asn Thr Lys Lys Leu Glu Val Asn
305 310 315 320
Ile Lys Thr Ala Lys Gly Leu Phe
Tyr Asp Gly Thr Ala Ile Ala Ile
325 330 335
Asn Ala Gly Asp Gly Leu Gln Phe
Gly Ser Gly Ser Asp Thr Asn Pro
340 345 350
Leu Gln Thr Lys Leu Gly Leu Gly
Leu Glu Tyr Asp Ser Asn Lys Ala
355 360 365
Ile Ile Thr Lys Leu Gly Thr Gly
Leu Ser Phe Asp Asn Thr Gly Ala
370 375 380
Ile Thr Val Gly Asn Lys Asn Asp
Asp Lys Leu Thr Leu Trp Thr Thr
385 390 395 400
Pro Asp Pro Ser Pro Asn Cys Arg
Ile Asn Ser Glu Lys Asp Ala Lys
405 410 415
Leu Thr Leu Val Leu Thr Lys Cys
Gly Ser Gln Val Leu Ala Ser Val
420 425 430
Ser Val Leu Ser Val Lys Gly Ser
Leu Ala Pro Ile Ser Gly Thr Val
435 440 445
Thr Ser Ala Gln Ile Val Leu Arg
Phe Asp Glu Asn Gly Val Leu Leu
450 455 460
Ser Asn Ser Ser Leu Asp Pro Gln
Tyr Trp Asn Tyr Arg Lys Gly Asp
465 470 475 480
Ser Thr Glu Gly Thr Ala Tyr Thr
Asn Ala Val Gly Phe Met Pro Asn
485 490 495
Leu Thr Ala Tyr Pro Lys Thr Gln
Ser Gln Thr Ala Lys Ser Asn Ile
500 505 510
Val Ser Gln Val Tyr Leu Asn Gly
Asp Lys Thr Lys Pro Met Thr Leu
515 520 525
Thr Ile Thr Leu Asn Gly Thr Asn
Glu Thr Gly Asp Ala Thr Val Ser
530 535 540
Thr Tyr Ser Met Ser Phe Ser Trp
Asn Trp Asn Gly Ser Asn Tyr Ile
545 550 555 560
Asn Asp Thr Phe Gln Thr Asn Ser
Phe Thr Phe Ser Tyr Ile Ala Gln
565 570 575
Glu
<210> 51
<211> 955
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 51
Met Ala Thr Pro Ser Met Met Pro
Gln Trp Ser Tyr Met His Ile Ser
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu
Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Ser Tyr Phe Ser
Leu Ser Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val
Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Ile Pro Val Asp
Arg Glu Asp Thr Ala Tyr Ser Tyr
65 70 75 80
Lys Ala Arg Phe Thr Leu Ala Val
Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg
Gly Val Leu Asp Arg Gly Pro Thr
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala
Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Pro Cys Glu Trp Asp
Glu Ala Val Thr Ala Val Asp Ile
130 135 140
Asn Leu Asp Glu Leu Gly Glu Asp
Glu Asp Asp Ala Glu Gly Glu Ala
145 150 155 160
Glu Gln Gln Lys Ser His Val Phe
Gly Gln Ala Pro Tyr Ser Gly Gln
165 170 175
Asn Ile Thr Lys Glu Gly Ile Gln
Ile Gly Val Asp Thr Thr Ser Gln
180 185 190
Ala Gln Thr Pro Leu Tyr Ala Asp
Lys Thr Phe Gln Pro Glu Pro Gln
195 200 205
Val Gly Glu Ser Gln Trp Asn Glu
Thr Glu Ile Asn Tyr Gly Ala Gly
210 215 220
Arg Val Leu Lys Lys Thr Thr Leu
Met Lys Pro Cys Tyr Gly Ser Tyr
225 230 235 240
Ala Arg Pro Thr Asn Glu Asn Gly
Gly Gln Gly Ile Leu Leu Glu Lys
245 250 255
Glu Gly Gly Lys Pro Glu Ser Gln
Val Glu Met Gln Phe Phe Ser Thr
260 265 270
Thr Gln Ala Ala Ala Ala Gly Asn
Ser Asp Asn Leu Thr Pro Lys Val
275 280 285
Val Leu Tyr Ser Glu Asp Val His
Leu Glu Thr Pro Asp Thr His Ile
290 295 300
Ser Tyr Met Pro Thr Ser Asn Glu
Ala Asn Ser Arg Glu Leu Leu Gly
305 310 315 320
Gln Gln Ala Met Pro Asn Arg Pro
Asn Tyr Ile Ala Phe Arg Asp Asn
325 330 335
Phe Ile Gly Leu Met Tyr Tyr Asn
Ser Thr Gly Asn Met Gly Val Leu
340 345 350
Ala Gly Gln Ala Ser Gln Leu Asn
Ala Val Val Asp Leu Gln Asp Arg
355 360 365
Asn Thr Glu Leu Ser Tyr Gln Leu
Leu Leu Asp Ser Met Gly Asp Arg
370 375 380
Thr Arg Tyr Phe Ser Met Trp Asn
Gln Ala Val Asp Ser Tyr Asp Pro
385 390 395 400
Asp Val Arg Ile Ile Glu Asn His
Gly Thr Glu Asp Glu Leu Pro Asn
405 410 415
Tyr Cys Phe Pro Leu Gly Gly Ile
Ile Asn Thr Glu Thr Leu Thr Lys
420 425 430
Val Lys Pro Lys Thr Gly Gln Asp
Ala Gln Trp Glu Lys Asp Thr Glu
435 440 445
Phe Ser Glu Lys Asn Glu Ile Arg
Val Gly Asn Asn Phe Ala Met Glu
450 455 460
Ile Asn Leu Asn Ala Asn Leu Trp
Arg Asn Phe Leu Tyr Ser Asn Val
465 470 475 480
Ala Leu Tyr Leu Pro Asp Lys Leu
Lys Tyr Thr Pro Ala Asn Val Gln
485 490 495
Ile Ser Ser Asn Ser Asn Ser Tyr
Asp Tyr Met Asn Lys Arg Val Val
500 505 510
Ala Pro Gly Leu Val Asp Cys Tyr
Ile Asn Leu Gly Ala Arg Trp Ser
515 520 525
Leu Asp Tyr Met Asp Asn Val Asn
Pro Phe Asn His His Arg Asn Ala
530 535 540
Gly Leu Arg Tyr Arg Ser Met Leu
Leu Gly Asn Gly Arg Tyr Val Pro
545 550 555 560
Phe His Ile Gln Val Pro Gln Lys
Phe Phe Ala Ile Lys Asn Leu Leu
565 570 575
Leu Leu Pro Gly Ser Tyr Thr Tyr
Glu Trp Asn Phe Arg Lys Asp Val
580 585 590
Asn Met Val Leu Gln Ser Ser Leu
Gly Asn Asp Leu Arg Val Asp Gly
595 600 605
Ala Ser Ile Lys Phe Glu Ser Ile
Cys Leu Tyr Ala Thr Phe Phe Pro
610 615 620
Met Ala His Asn Thr Ala Ser Thr
Leu Glu Ala Met Leu Arg Asn Asp
625 630 635 640
Thr Asn Asp Gln Ser Phe Asn Asp
Tyr Leu Ser Ala Ala Asn Met Leu
645 650 655
Tyr Pro Ile Pro Ala Asn Ala Thr
Asn Val Pro Ile Ser Ile Pro Ser
660 665 670
Arg Asn Trp Ala Ala Phe Arg Gly
Trp Ala Phe Thr Arg Leu Lys Thr
675 680 685
Lys Glu Thr Pro Ser Leu Gly Ser
Gly Phe Asp Pro Tyr Tyr Thr Tyr
690 695 700
Ser Gly Ser Ile Pro Tyr Leu Asp
Gly Thr Phe Tyr Leu Asn His Thr
705 710 715 720
Phe Lys Lys Val Ser Val Thr Phe
Asp Ser Ser Val Ser Trp Pro Gly
725 730 735
Asn Asp Arg Leu Leu Thr Pro Asn
Glu Phe Glu Ile Lys Arg Ser Val
740 745 750
Asp Gly Glu Gly Tyr Asn Val Ala
Gln Cys Asn Met Thr Lys Asp Trp
755 760 765
Phe Leu Ile Gln Met Leu Ala Asn
Tyr Asn Ile Gly Tyr Gln Gly Phe
770 775 780
Tyr Ile Pro Glu Ser Tyr Lys Asp
Arg Met Tyr Ser Phe Phe Arg Asn
785 790 795 800
Phe Gln Pro Met Ser Arg Gln Val
Val Asp Glu Thr Lys Tyr Lys Asp
805 810 815
Tyr Gln Gln Val Gly Ile Ile His
Gln His Asn Asn Ser Gly Phe Val
820 825 830
Gly Tyr Leu Ala Pro Thr Met Arg
Glu Gly Gln Ala Tyr Pro Ala Asn
835 840 845
Phe Pro Tyr Pro Leu Ile Gly Lys
Thr Ala Val Asp Ser Val Thr Gln
850 855 860
Lys Lys Phe Leu Cys Asp Arg Thr
Leu Trp Arg Ile Pro Phe Ser Ser
865 870 875 880
Asn Phe Met Ser Met Gly Ala Leu
Thr Asp Leu Gly Gln Asn Leu Leu
885 890 895
Tyr Ala Asn Ser Ala His Ala Leu
Asp Met Thr Phe Glu Val Asp Pro
900 905 910
Met Asp Glu Pro Thr Leu Leu Tyr
Val Leu Phe Glu Val Phe Asp Val
915 920 925
Val Arg Val His Gln Pro His Arg
Gly Val Ile Glu Thr Val Tyr Leu
930 935 940
Arg Thr Pro Phe Ser Ala Gly Asn
Ala Thr Thr
945 950 955
<210> 52
<211> 582
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 52
Met Arg Arg Ala Ala Met Tyr His
Glu Gly Pro Pro Pro Ser Tyr Glu
1 5 10 15
Ser Val Val Gly Ala Ala Ala Ala
Ser Pro Phe Ala Ser Gln Leu Glu
20 25 30
Pro Pro Tyr Val Pro Pro Arg Tyr
Leu Arg Pro Thr Gly Gly Arg Asn
35 40 45
Ser Ile Arg Tyr Ser Glu Leu Ala
Pro Leu Tyr Asp Thr Thr Arg Val
50 55 60
Tyr Leu Val Asp Asn Lys Ser Ala
Asp Val Ala Ser Leu Asn Tyr Gln
65 70 75 80
Asn Asp His Ser Asn Phe Leu Thr
Thr Val Ile Gln Asn Asn Asp Tyr
85 90 95
Thr Pro Ser Glu Ala Ser Thr Gln
Thr Ile Asn Leu Asp Asp Arg Ser
100 105 110
His Trp Gly Gly Asp Leu Lys Thr
Ile Leu His Thr Asn Met Pro Asn
115 120 125
Val Asn Glu Phe Met Phe Thr Asn
Lys Phe Lys Ala Arg Val Met Val
130 135 140
Ser Arg Ser His Thr Lys Asp Asp
Arg Val Glu Leu Lys Tyr Glu Trp
145 150 155 160
Val Glu Phe Glu Leu Pro Glu Gly
Asn Tyr Ser Glu Thr Met Thr Ile
165 170 175
Asp Leu Met Asn Asn Ala Ile Val
Glu His Tyr Leu Lys Val Gly Arg
180 185 190
Gln Asn Gly Val Leu Glu Ser Asp
Ile Gly Val Lys Phe Asp Thr Arg
195 200 205
Asn Phe Arg Leu Gly Leu Asp Pro
Val Thr Gly Leu Val Met Pro Gly
210 215 220
Val Tyr Thr Asn Glu Ala Phe His
Pro Asp Ile Ile Leu Leu Pro Gly
225 230 235 240
Cys Gly Val Asp Phe Thr Tyr Ser
Arg Leu Ser Asn Leu Leu Gly Ile
245 250 255
Arg Lys Arg Gln Pro Phe Gln Glu
Gly Phe Arg Ile Thr Tyr Glu Asp
260 265 270
Leu Glu Gly Gly Asn Ile Pro Ala
Leu Leu Asp Val Glu Ala Tyr Gln
275 280 285
Asn Ser Leu Lys Glu Glu Glu Ala
Gly Glu Gly Ser Gly Gly Gly Gly
290 295 300
Ala Gly Gln Glu Glu Gly Gly Ala
Ser Ser Glu Ala Ser Ala Asp Ala
305 310 315 320
Ala Ala Ala Glu Ala Glu Glu Ala
Ala Asp Pro Ala Met Val Val Glu
325 330 335
Glu Glu Lys Asp Met Asn Asp Glu
Ala Val Arg Gly Asp Thr Phe Ala
340 345 350
Thr Arg Gly Glu Glu Lys Lys Ala
Glu Ala Glu Ala Ala Ala Glu Glu
355 360 365
Ala Ala Ala Ala Ala Ala Ala Val
Glu Ala Ala Ala Glu Ala Glu Lys
370 375 380
Pro Pro Lys Glu Pro Val Ile Lys
Pro Leu Thr Glu Asp Ser Lys Lys
385 390 395 400
Arg Ser Tyr Asn Val Leu Lys Asp
Ser Thr Asn Thr Glu Tyr Arg Ser
405 410 415
Trp Tyr Leu Ala Tyr Asn Tyr Gly
Asp Pro Ala Thr Gly Val Arg Ser
420 425 430
Trp Thr Leu Leu Cys Thr Pro Asp
Val Thr Cys Gly Ser Glu Gln Val
435 440 445
Tyr Trp Ser Leu Pro Asp Met Met
Gln Asp Pro Val Thr Phe Arg Ser
450 455 460
Thr Arg Gln Val Ser Asn Phe Pro
Val Val Gly Ala Glu Leu Leu Pro
465 470 475 480
Val His Ser Lys Ser Phe Tyr Asn
Asp Gln Ala Val Tyr Ser Gln Leu
485 490 495
Ile Arg Gln Phe Thr Ser Leu Thr
His Val Phe Asn Arg Phe Pro Glu
500 505 510
Asn Gln Ile Leu Ala Arg Pro Pro
Ala Pro Thr Ile Thr Thr Val Ser
515 520 525
Glu Asn Val Pro Ala Leu Thr Asp
His Gly Thr Leu Pro Leu Arg Asn
530 535 540
Ser Ile Gly Gly Val Gln Arg Val
Thr Val Thr Asp Ala Arg Arg Arg
545 550 555 560
Thr Cys Pro Tyr Val Tyr Lys Ala
Leu Gly Ile Val Ser Pro Arg Val
565 570 575
Leu Ser Ser Arg Thr Phe
580
<210> 53
<211> 542
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 53
Met Lys Arg Ala Lys Thr Ser Asp
Glu Thr Phe Asn Pro Val Tyr Pro
1 5 10 15
Tyr Asp Thr Glu Asn Gly Pro Pro
Ser Val Pro Phe Leu Thr Pro Pro
20 25 30
Phe Val Ser Pro Asp Gly Phe Gln
Glu Ser Pro Pro Gly Val Leu Ser
35 40 45
Leu Arg Leu Ser Glu Pro Leu Val
Thr Ser His Gly Met Leu Ala Leu
50 55 60
Lys Met Gly Asn Gly Leu Ser Leu
Asp Asp Ala Gly Asn Leu Thr Ser
65 70 75 80
Gln Asp Val Thr Thr Val Thr Pro
Pro Leu Lys Lys Thr Lys Thr Asn
85 90 95
Leu Ser Leu Gln Thr Ser Ala Pro
Leu Thr Val Ser Ser Gly Ser Leu
100 105 110
Thr Val Ala Ala Ala Ala Pro Leu
Ala Val Ala Gly Thr Ser Leu Thr
115 120 125
Met Gln Ser Gln Ala Pro Leu Thr
Val Gln Asp Ala Lys Leu Gly Leu
130 135 140
Ala Thr Gln Gly Pro Leu Thr Val
Ser Glu Gly Lys Leu Thr Leu Gln
145 150 155 160
Thr Ser Ala Pro Leu Thr Ala Ala
Asp Ser Ser Thr Leu Thr Val Gly
165 170 175
Thr Thr Pro Pro Ile Ser Val Ser
Ser Gly Ser Leu Gly Leu Asp Met
180 185 190
Glu Asp Pro Met Tyr Thr His Asp
Gly Lys Leu Gly Ile Arg Ile Gly
195 200 205
Gly Pro Leu Gln Val Val Asp Ser
Leu His Thr Leu Thr Val Val Thr
210 215 220
Gly Asn Gly Ile Thr Val Ala Asn
Asn Ala Leu Gln Thr Lys Val Ala
225 230 235 240
Gly Ala Leu Gly Tyr Asp Ser Ser
Gly Asn Leu Glu Leu Arg Ala Ala
245 250 255
Gly Gly Met Arg Ile Asn Thr Gly
Gly Gln Leu Ile Leu Asp Val Ala
260 265 270
Tyr Pro Phe Asp Ala Gln Asn Asn
Leu Ser Leu Arg Leu Gly Gln Gly
275 280 285
Pro Leu Tyr Val Asn Thr Asn His
Asn Leu Asp Leu Asn Cys Asn Arg
290 295 300
Gly Leu Thr Thr Thr Thr Ser Ser
Asn Thr Thr Lys Leu Glu Thr Lys
305 310 315 320
Ile Asp Ser Gly Leu Asp Tyr Asn
Ala Asn Gly Ala Ile Ile Ala Lys
325 330 335
Leu Gly Thr Gly Leu Thr Phe Asp
Asn Thr Gly Ala Ile Thr Val Gly
340 345 350
Asn Thr Gly Asp Asp Lys Leu Thr
Leu Trp Thr Thr Pro Asp Pro Ser
355 360 365
Pro Asn Cys Arg Ile His Ala Asp
Lys Asp Cys Lys Phe Thr Leu Val
370 375 380
Leu Thr Lys Cys Gly Ser Gln Ile
Leu Ala Ser Val Ala Ala Leu Ala
385 390 395 400
Val Ser Gly Asn Leu Ser Ser Met
Thr Gly Thr Val Ser Ser Val Thr
405 410 415
Ile Phe Leu Arg Phe Asp Gln Asn
Gly Val Leu Met Glu Asn Ser Ser
420 425 430
Leu Asp Lys Glu Tyr Trp Asn Phe
Arg Asn Gly Asn Ser Thr Asn Ala
435 440 445
Thr Pro Tyr Thr Asn Ala Val Gly
Phe Met Pro Asn Leu Ser Ala Tyr
450 455 460
Pro Lys Thr Gln Ser Gln Thr Ala
Lys Asn Asn Ile Val Ser Glu Val
465 470 475 480
Tyr Leu His Gly Asp Lys Ser Lys
Pro Met Ile Leu Thr Ile Thr Leu
485 490 495
Asn Gly Thr Asn Glu Ser Ser Glu
Thr Ser Gln Val Ser His Tyr Ser
500 505 510
Met Ser Phe Thr Trp Ser Trp Asp
Ser Gly Lys Tyr Ala Thr Glu Thr
515 520 525
Phe Ala Thr Asn Ser Phe Thr Phe
Ser Tyr Ile Ala Glu Gln
530 535 540
<210> 54
<211> 964
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 54
Met Ala Thr Pro Ser Met Met Pro
Gln Trp Ser Tyr Met His Ile Ser
1 5 10 15
Gly Gln Asp Ala Ser Glu Tyr Leu
Ser Pro Gly Leu Val Gln Phe Ala
20 25 30
Arg Ala Thr Asp Ser Tyr Phe Ser
Leu Ser Asn Lys Phe Arg Asn Pro
35 40 45
Thr Val Ala Pro Thr His Asp Val
Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60
Thr Leu Arg Phe Ile Pro Val Asp
Arg Glu Asp Thr Ala Tyr Ser Tyr
65 70 75 80
Lys Ala Arg Phe Thr Leu Ala Val
Gly Asp Asn Arg Val Leu Asp Met
85 90 95
Ala Ser Thr Tyr Phe Asp Ile Arg
Gly Val Leu Asp Arg Gly Pro Thr
100 105 110
Phe Lys Pro Tyr Ser Gly Thr Ala
Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125
Ala Pro Asn Ser Cys Glu Trp Glu
Gln Glu Glu Thr Gln Thr Ala Glu
130 135 140
Glu Ala Gln Asp Glu Glu Glu Asp
Glu Ala Glu Ala Glu Glu Glu Met
145 150 155 160
Pro Gln Glu Glu Gln Ala Pro Val
Lys Lys Thr His Val Tyr Ala Gln
165 170 175
Ala Pro Leu Ser Gly Glu Lys Ile
Thr Lys Asp Gly Leu Gln Ile Gly
180 185 190
Thr Asp Ala Thr Ala Thr Glu Gln
Lys Pro Ile Tyr Ala Asp Pro Thr
195 200 205
Phe Gln Pro Glu Pro Gln Ile Gly
Glu Ser Gln Trp Asn Glu Ala Asp
210 215 220
Ala Ser Val Ala Gly Gly Arg Val
Leu Lys Lys Thr Thr Pro Met Lys
225 230 235 240
Pro Cys Tyr Gly Ser Tyr Ala Arg
Pro Thr Asn Ala Asn Gly Gly Gln
245 250 255
Gly Val Leu Val Glu Lys Asp Gly
Gly Lys Met Glu Ser Gln Val Asp
260 265 270
Met Gln Phe Phe Ser Thr Ser Glu
Asn Ala Arg Asn Glu Ala Asn Asn
275 280 285
Ile Gln Pro Lys Leu Val Leu Tyr
Ser Glu Asp Val His Met Glu Thr
290 295 300
Pro Asp Thr His Ile Ser Tyr Lys
Pro Ala Lys Ser Asp Asp Asn Ser
305 310 315 320
Lys Val Met Leu Gly Gln Gln Ser
Met Pro Asn Arg Pro Asn Tyr Ile
325 330 335
Gly Phe Arg Asp Asn Phe Ile Gly
Leu Met Tyr Tyr Asn Ser Thr Gly
340 345 350
Asn Met Gly Val Leu Ala Gly Gln
Ala Ser Gln Leu Asn Ala Val Val
355 360 365
Asp Leu Gln Asp Arg Asn Thr Glu
Leu Ser Tyr Gln Leu Leu Leu Asp
370 375 380
Ser Met Gly Asp Arg Thr Arg Tyr
Phe Ser Met Trp Asn Gln Ala Val
385 390 395 400
Asp Ser Tyr Asp Pro Asp Val Arg
Ile Ile Glu Asn His Gly Thr Glu
405 410 415
Asp Glu Leu Pro Asn Tyr Cys Phe
Pro Leu Gly Gly Ile Gly Val Thr
420 425 430
Asp Thr Tyr Gln Ala Ile Lys Thr
Asn Gly Asn Gly Asn Gly Gly Gly
435 440 445
Asn Thr Thr Trp Thr Lys Asp Glu
Thr Phe Ala Asp Arg Asn Glu Ile
450 455 460
Gly Val Gly Asn Asn Phe Ala Met
Glu Ile Asn Leu Ser Ala Asn Leu
465 470 475 480
Trp Arg Asn Phe Leu Tyr Ser Asn
Val Ala Leu Tyr Leu Pro Asp Lys
485 490 495
Leu Lys Tyr Asn Pro Ser Asn Val
Glu Ile Ser Asp Asn Pro Asn Thr
500 505 510
Tyr Asp Tyr Met Asn Lys Arg Val
Val Ala Pro Gly Leu Val Asp Cys
515 520 525
Tyr Ile Asn Leu Gly Ala Arg Trp
Ser Leu Asp Tyr Met Asp Asn Val
530 535 540
Asn Pro Phe Asn His His Arg Asn
Ala Gly Leu Arg Tyr Arg Ser Met
545 550 555 560
Leu Leu Gly Asn Gly Arg Tyr Val
Pro Phe His Ile Gln Val Pro Gln
565 570 575
Lys Phe Phe Ala Ile Lys Asn Leu
Leu Leu Leu Pro Gly Ser Tyr Thr
580 585 590
Tyr Glu Trp Asn Phe Arg Lys Asp
Val Asn Met Val Leu Gln Ser Ser
595 600 605
Leu Gly Asn Asp Leu Arg Val Asp
Gly Ala Ser Ile Lys Phe Glu Ser
610 615 620
Ile Cys Leu Tyr Ala Thr Phe Phe
Pro Met Ala His Asn Thr Ala Ser
625 630 635 640
Thr Leu Glu Ala Met Leu Arg Asn
Asp Thr Asn Asp Gln Ser Phe Asn
645 650 655
Asp Tyr Leu Ser Ala Ala Asn Met
Leu Tyr Pro Ile Pro Ala Asn Ala
660 665 670
Thr Asn Val Pro Ile Ser Ile Pro
Ser Arg Asn Trp Ala Ala Phe Arg
675 680 685
Gly Trp Ala Phe Thr Arg Leu Lys
Thr Lys Glu Thr Pro Ser Leu Gly
690 695 700
Ser Gly Phe Asp Pro Tyr Tyr Thr
Tyr Ser Gly Ser Ile Pro Tyr Leu
705 710 715 720
Asp Gly Thr Phe Tyr Leu Asn His
Thr Phe Lys Lys Val Ser Val Thr
725 730 735
Phe Asp Ser Ser Val Ser Trp Pro
Gly Asn Asp Arg Leu Leu Thr Pro
740 745 750
Asn Glu Phe Glu Ile Lys Arg Ser
Val Asp Gly Glu Gly Tyr Asn Val
755 760 765
Ala Gln Cys Asn Met Thr Lys Asp
Trp Phe Leu Ile Gln Met Leu Ala
770 775 780
Asn Tyr Asn Ile Gly Tyr Gln Gly
Phe Tyr Ile Pro Glu Ser Tyr Lys
785 790 795 800
Asp Arg Met Tyr Ser Phe Phe Arg
Asn Phe Gln Pro Met Ser Arg Gln
805 810 815
Val Val Asp Glu Thr Lys Tyr Lys
Asp Tyr Gln Gln Val Gly Ile Ile
820 825 830
His Gln His Asn Asn Ser Gly Phe
Val Gly Tyr Leu Ala Pro Thr Met
835 840 845
Arg Glu Gly Gln Ala Tyr Pro Ala
Asn Phe Pro Tyr Pro Leu Ile Gly
850 855 860
Lys Thr Ala Val Asp Ser Val Thr
Gln Lys Lys Phe Leu Cys Asp Arg
865 870 875 880
Thr Leu Trp Arg Ile Pro Phe Ser
Ser Asn Phe Met Ser Met Gly Ala
885 890 895
Leu Thr Asp Leu Gly Gln Asn Leu
Leu Tyr Ala Asn Ser Ala His Ala
900 905 910
Leu Asp Met Thr Phe Glu Val Asp
Pro Met Asp Glu Pro Thr Leu Leu
915 920 925
Tyr Val Leu Phe Glu Val Phe Asp
Val Val Arg Val His Gln Pro His
930 935 940
Arg Gly Val Ile Glu Thr Val Tyr
Leu Arg Thr Pro Phe Ser Ala Gly
945 950 955 960
Asn Ala Thr Thr
<210> 55
<211> 584
<212> PRT
<213> Adenoviridae - Mastadenovirus
<400> 55
Met Arg Arg Ala Ala Met Tyr His
Glu Gly Pro Pro Pro Ser Tyr Glu
1 5 10 15
Ser Val Val Gly Ala Ala Ala Ala
Ser Pro Phe Ala Ser Gln Leu Glu
20 25 30
Pro Pro Tyr Val Pro Pro Arg Tyr
Leu Arg Pro Thr Gly Gly Arg Asn
35 40 45
Ser Ile Arg Tyr Ser Glu Leu Ala
Pro Leu Tyr Asp Thr Thr Arg Val
50 55 60
Tyr Leu Val Asp Asn Lys Ser Ala
Asp Val Ala Ser Leu Asn Tyr Gln
65 70 75 80
Asn Asp His Ser Asn Phe Leu Thr
Thr Val Ile Gln Asn Asn Asp Tyr
85 90 95
Thr Pro Ser Glu Ala Ser Thr Gln
Thr Ile Asn Leu Asp Asp Arg Ser
100 105 110
His Trp Gly Gly Asp Leu Lys Thr
Ile Leu His Thr Asn Met Pro Asn
115 120 125
Val Asn Glu Phe Met Phe Thr Asn
Lys Phe Lys Ala Arg Val Met Val
130 135 140
Ser Arg Ser His Thr Lys Asp Asp
Arg Val Glu Leu Lys Tyr Glu Trp
145 150 155 160
Val Glu Phe Glu Leu Pro Glu Gly
Asn Tyr Ser Glu Thr Met Thr Ile
165 170 175
Asp Leu Met Asn Asn Ala Ile Val
Glu His Tyr Leu Lys Val Gly Arg
180 185 190
Gln Asn Gly Val Leu Glu Ser Asp
Ile Gly Val Lys Phe Asp Thr Arg
195 200 205
Asn Phe Arg Leu Gly Leu Asp Pro
Val Thr Gly Leu Val Met Pro Gly
210 215 220
Val Tyr Thr Asn Glu Ala Phe His
Pro Asp Ile Ile Leu Leu Pro Gly
225 230 235 240
Cys Gly Val Asp Phe Thr Tyr Ser
Arg Leu Ser Asn Leu Leu Gly Ile
245 250 255
Arg Lys Arg Gln Pro Phe Gln Glu
Gly Phe Arg Ile Thr Tyr Glu Asp
260 265 270
Leu Glu Gly Gly Asn Ile Pro Ala
Leu Leu Asp Val Glu Ala Tyr Gln
275 280 285
Asp Ser Leu Lys Glu Glu Glu Ala
Gly Glu Gly Ser Gly Gly Gly Gly
290 295 300
Gly Ala Gly Gln Glu Glu Gly Gly
Ala Ser Ser Glu Ala Ser Ala Asp
305 310 315 320
Ala Ala Ala Ala Ala Glu Ala Glu
Ala Ala Asp Pro Ala Met Val Val
325 330 335
Glu Glu Glu Lys Asp Met Asn Asp
Glu Ala Val Arg Gly Asp Thr Phe
340 345 350
Ala Thr Arg Gly Glu Glu Lys Lys
Ala Glu Ala Glu Ala Ala Ala Glu
355 360 365
Glu Ala Ala Ala Ala Ala Ala Ala
Ala Val Glu Ala Ala Ala Glu Ala
370 375 380
Glu Lys Pro Pro Lys Glu Pro Val
Ile Lys Ala Leu Thr Glu Asp Ser
385 390 395 400
Lys Lys Arg Ser Tyr Asn Val Leu
Lys Asp Ser Thr Asn Thr Ala Tyr
405 410 415
Arg Ser Trp Tyr Leu Ala Tyr Asn
Tyr Gly Asp Pro Ala Thr Gly Val
420 425 430
Arg Ser Trp Thr Leu Leu Cys Thr
Pro Asp Val Thr Cys Gly Ser Glu
435 440 445
Gln Val Tyr Trp Ser Leu Pro Asp
Met Met Gln Asp Pro Val Thr Phe
450 455 460
Arg Ser Thr Arg Gln Val Ser Asn
Phe Pro Val Val Gly Ala Glu Leu
465 470 475 480
Leu Pro Val His Ser Lys Ser Phe
Tyr Asn Asp Gln Ala Val Tyr Ser
485 490 495
Gln Leu Ile Arg Gln Phe Thr Ser
Leu Thr His Val Phe Asn Arg Phe
500 505 510
Pro Glu Asn Gln Ile Leu Ala Arg
Pro Pro Ala Pro Thr Ile Thr Thr
515 520 525
Val Ser Glu Asn Val Pro Ala Leu
Thr Asp His Gly Thr Leu Pro Leu
530 535 540
Arg Asn Ser Ile Gly Gly Val Gln
Arg Val Thr Val Thr Asp Ala Arg
545 550 555 560
Arg Arg Thr Cys Pro Tyr Val Tyr
Lys Ala Leu Gly Ile Val Ser Pro
565 570 575
Arg Val Leu Ser Ser Arg Thr Phe
580
<210> 56
<211> 1734
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 56
atgaagcgcg ccaaaacgtc tgacgagacc
ttcaaccccg tgtaccccta tgacacggaa 60
aacgggcctc cctccgtccc tttcctcacc
cctcccttcg tgtcccccga cggatttcaa 120
gaaagccccc caggggtcct gtctctgcgc
ctgtcagagc ccctggtcac ttcccacggc 180
atgcttgccc tgaaaatggg aaatggcctc
tccctggatg acgccggcaa cctcacctct 240
caagatgtca ccaccgtcac ccctcccctc
aaaaaaacca agaccaacct cagcctccag 300
acctcagccc ccctgaccgt tagctctggg
tccctcaccg tcgcggccgc cgctccactg 360
gcggtggccg gcacctctct caccatgcaa
tctcaggccc ccttgacagt gcaagatgca 420
aaactcggcc tggccaccca gggacccctg
accgtgtctg aaggcaaact caccttgcag 480
acatcggctc cactgacggc cgctgacagc
agcactctca ctgttagtgc cacacctccc 540
ctcagcacaa gcaatggtag tttgagcatt
gacatgcagg ccccgattta taccaccaat 600
ggaaaactgg cacttaacat tggtgctccc
ctgcatgtgg tagacaccct aaatgcacta 660
actgtagtaa ctggccaggg tcttaccata
aatggaagag ccctgcaaac tagagtcacg 720
ggtgccctca gttatgacac agaaggcaac
atccaactgc aagccggagg gggtatgcgc 780
attgacaata atggccaact tatccttaat
gtagcttatc catttgatgc tcaaaacaac 840
ctcagcctta gacttggcca aggtccccta
attgttaact ctgcccacaa cttggatctt 900
aaccttaaca gaggccttta cttatttaca
tctggaaaca cgaaaaaact ggaagttaac 960
ataaaaacag ccaaaggtct attttacgat
ggcaccgcta tagcaatcaa tgcaggtgac 1020
gggctacagt ttgggtctgg ttcagataca
aatccattgc aaactaaact tggattgggg 1080
ctggaatatg actccaacaa agctataatc
actaaacttg gaactggcct aagctttgac 1140
aacacaggtg ccatcacagt aggcaacaaa
aatgatgaca agcttacctt gtggaccaca 1200
ccagacccct ccccaaactg cagaattaat
tcagaaaaag atgctaaact cacactagtt 1260
ttgactaaat gcggcagcca ggtgttagcc
agcgtttctg ttttatctgt aaaaggcagc 1320
cttgccccca tcagcggcac agtaactagc
gcccagattg ttttaagatt tgatgaaaac 1380
ggagttttat tgagcaattc ttctcttgac
ccccaatact ggaactatag aaaaggcgat 1440
tctacagaag gcactgcata tactaatgct
gtgggattta tgcccaacct cacagcatac 1500
cctaaaacac agagccagac tgctaaaagc
aacattgtaa gtcaagttta cttgaatggg 1560
gacaaaacaa aacccatgac cctaaccatc
accctcaatg gaactaatga aacaggggat 1620
gctacagtaa gcacatactc catgtcattt
tcatggaact ggaatggaag taattacatt 1680
aatgacacct tccaaaccaa ctcctttacc
ttctcctaca tcgcccaaga ataa 1734
<210> 57
<211> 2868
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 57
atggcgaccc catcgatgat gccgcagtgg
tcgtacatgc acatctcggg ccaggacgcc 60
tcggagtacc tgagccccgg gctggtgcag
ttcgcccgcg ccaccgacag ctacttcagc 120
ctgagtaaca agtttaggaa ccccacggtg
gcgcccacgc acgatgtgac caccgaccgg 180
tcccagcgcc tgacgctgcg gttcatcccc
gtggaccgcg aggacaccgc gtactcttac 240
aaggcgcggt tcaccctggc cgtgggcgac
aaccgcgtgc tggacatggc ctccacctac 300
tttgacatcc gcggcgtgct ggacaggggc
cccaccttta agccctactc cggcactgcc 360
tacaactccc tggcccccaa gggcgccccc
aacccctgtg agtgggatga agccgttact 420
gctgttgaca ttaacctgga tgagctcggc
gaagatgaag acgacgccga aggggaagca 480
gaacagcaaa aaagtcatgt atttggtcaa
gcgccctact caggacaaaa cattacgaag 540
gagggcatac aaattggggt agataccacc
agccaagccc aaacaccttt atacgctgac 600
aaaacattcc aacccgaacc tcaggttgga
gaatcccaat ggaatgagac agaaatcaat 660
tatggagcgg gacgagtgct aaaaaagacc
accctcatga aaccatgcta tgggtcatat 720
gcaagaccta ctaatgaaaa cggcggtcag
ggcatactgc tggagaaaga gggtggtaaa 780
ccagaaagtc aagttgaaat gcaatttttt
tctactactc aggccgccgc ggctggtaat 840
tcagataatc ttactccaaa agttgttttg
tatagcgagg atgttcacct ggaaacgcca 900
gatacacaca tttcatatat gcccactagc
aacgaagcca attcaagaga actgttggga 960
caacaagcta tgcccaacag acccaactac
attgccttca gagacaactt tattggcctt 1020
atgtattaca acagcactgg caacatggga
gtgctggcag gtcaggcctc acagttgaat 1080
gcagtggtgg acttgcaaga cagaaacaca
gaactgtcct accagctctt gcttgattcc 1140
atgggagaca gaaccagata cttttccatg
tggaatcagg cggtggacag ttatgatcca 1200
gatgttagaa ttattgaaaa tcatggaact
gaagatgagc tgcccaacta ttgtttcccc 1260
ctgggcggca taattaacac cgaaacttta
actaaagtga aacctaagac tggacaagac 1320
gctcagtggg aaaaagatac tgagttttca
gagaaaaatg aaataagggt gggaaacaac 1380
ttcgccatgg agattaacct caatgccaac
ctgtggagga atttcctgta ctccaacgtg 1440
gccctgtacc tgccagacaa acttaagtac
actccagcca acgtgcagat ttccagcaac 1500
tccaactcct acgactacat gaacaagcga
gtggtggccc cggggctggt ggactgctac 1560
atcaacctgg gcgcgcgctg gtccctggac
tacatggaca acgtcaaccc cttcaaccac 1620
caccgcaatg cgggcctgcg ctaccgctcc
atgcttctgg gcaacgggcg ctacgtgccc 1680
ttccacatcc aggtgcccca gaagttcttt
gccatcaaga acctcctcct cctgccgggc 1740
tcctacacct acgagtggaa cttcaggaag
gatgtcaaca tggtcctcca gagctctctg 1800
ggtaacgacc tcagggtcga cggggccagc
atcaagttcg agagcatctg cctctacgcc 1860
accttcttcc ccatggccca caacacggcc
tccacgctcg aggccatgct caggaacgac 1920
accaacgacc agtccttcaa cgactacctc
tccgccgcca acatgctcta ccccatcccc 1980
gccaacgcca ccaacgtccc catctccatc
ccctcgcgca actgggcggc cttccgcggc 2040
tgggccttca ctcgcctcaa gaccaaggag
accccctccc tgggctcggg tttcgacccc 2100
tactacacct actcgggctc cataccctac
ctggacggaa ccttctacct caaccacacc 2160
ttcaagaagg tctcggtcac cttcgactcc
tcggtcagct ggccgggcaa cgaccgcctg 2220
ctcaccccca acgagttcga gatcaagcgc
tcggtcgacg gggagggcta caacgtggcc 2280
cagtgcaaca tgaccaagga ctggttcctc
atccagatgc tggccaacta caacatcggc 2340
tatcagggct tctacatccc agagagctac
aaggacagga tgtactcctt ctttaggaac 2400
ttccagccca tgagccggca ggtggtggac
gaaaccaagt acaaggacta ccagcaggtg 2460
ggcatcatcc accagcacaa caactcgggc
ttcgtgggct acctcgcccc caccatgcgc 2520
gagggacagg cctaccccgc caacttcccc
tacccgctca ttggcaagac cgcggtcgac 2580
agcgtcaccc agaaaaagtt cctctgcgac
cgcaccctct ggcgcatccc cttctccagc 2640
aacttcatgt ccatgggtgc gctcacggac
ctgggccaga acctgctcta tgccaactcc 2700
gcccacgcgc tcgacatgac cttcgaggtc
gaccccatgg acgagcccac ccttctctat 2760
gttctgttcg aagtctttga cgtggtccgg
gtccaccagc cgcaccgcgg cgtcatcgag 2820
accgtgtacc tgcgcacgcc cttctcggcc
ggcaacgcca ccacctaa 2868
<210> 58
<211> 1749
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 58
atgcggcgcg cggcgatgta ccacgaggga
cctcctccct cttatgagag cgtggtgggc 60
gcggcggcgg cctctccctt tgcgtcgcag
ctggagccgc cgtacgtgcc tccgcggtac 120
ctgcggccta cggggggaag aaacagcatc
cgttactcgg agctggcgcc cctgtacgac 180
accacccggg tgtacctggt ggacaacaag
tcggcggacg tggcctccct gaactaccag 240
aacgaccaca gcaatttttt gaccacggtc
atccagaaca atgactacac cccgagcgag 300
gccagcaccc agaccatcaa tctggatgac
cggtcgcact ggggcggcga cctgaaaacc 360
atcctgcaca ccaacatgcc caacgtgaac
gagttcatgt tcaccaataa gttcaaggcg 420
cgggtgatgg tgtcgcgttc gcacaccaag
gacgaccggg tggagctgaa gtacgagtgg 480
gtagagttcg agctgcccga gggcaactac
tcggagacca tgaccataga cctgatgaac 540
aacgcgatcg tggagcacta tctgaaagtg
ggcaggcaga acggggtcct ggagagcgac 600
atcggggtca agttcgacac caggaacttc
cgcctggggc tggacccggt caccgggctg 660
gtcatgcccg gggtctacac caacgaggcc
ttccaccccg acatcatcct gctgcccggc 720
tgcggggtgg acttcaccta cagccgcctg
agcaacctgc tgggcatccg caagcggcag 780
cccttccagg agggctttag gatcacctac
gaggacctgg aggggggcaa catccccgcg 840
ctcctggatg tggaggccta ccagaatagc
ttgaaggaag aagaggcggg agagggcagc 900
ggcggcggcg gcgccggtca ggaggagggc
ggggcctcct ctgaggcctc tgcggacgca 960
gctgccgccg aggcggagga ggcggccgac
cccgcgatgg tggtagagga agagaaggat 1020
atgaatgacg aggcggtgcg cggcgacacc
tttgccaccc ggggggagga gaagaaagcg 1080
gaggccgagg ccgcggcaga ggaggcggca
gcagcggcgg cggcagtaga ggcggcggcc 1140
gaggcggaga agccccccaa ggagcccgtg
attaagcccc tgaccgaaga tagcaagaag 1200
cgcagttaca acgtgctcaa ggacagcacc
aacaccgagt accgcagctg gtacctggcc 1260
tacaactacg gcgacccggc gacgggggtg
cgctcctgga ccctgctgtg tacgccggac 1320
gtgacctgcg gctcggagca ggtgtactgg
tcgctgcccg acatgatgca agaccccgtg 1380
accttccgct ccacgcggca ggtcagcaac
tttccggtgg tgggcgccga gctgctgccc 1440
gtgcactcca agagcttcta caacgaccag
gccgtctact cccagctcat ccgccagttc 1500
acctctctga cccacgtgtt caatcgcttt
cctgagaacc agattctggc gcgcccgccc 1560
gcccccacca tcaccaccgt cagtgaaaac
gttcctgctc tcacagatca cgggacgcta 1620
ccgctgcgca acagcatcgg aggagtccag
cgagtgaccg taactgacgc cagacgccgc 1680
acctgtccct acgtttacaa ggccctgggc
atagtctcgc cgcgcgtcct ttccagccgc 1740
actttttaa 1749
<210> 59
<211> 1629
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 59
atgaagcgcg ccaaaacgtc tgacgagacc
ttcaaccccg tgtaccccta tgacacggaa 60
aacgggcctc cctccgttcc tttcctcacc
cctcccttcg tgtcccccga cggatttcaa 120
gaaagccccc caggggtcct gtctctgcgc
ctgtcagagc ccctggtcac ttcccacggc 180
atgcttgccc tgaaaatggg aaatggcctc
tccctggatg acgccggcaa cctcacctct 240
caagatgtca ccaccgtcac ccctcccctc
aaaaaaacca agaccaacct cagcctccag 300
acctcagccc ccctgaccgt tagctctggg
tccctcaccg tcgcggccgc cgctccactg 360
gcggtggccg gcacctctct caccatgcaa
tctcaggccc ccttgacggt gcaagatgca 420
aaactgggtc tggccaccca gggacccctg
accgtgtctg aaggcaaact caccttgcag 480
acatcggctc cactgacggc cgccgacagc
agcactctca ctgttggcac cacaccgcca 540
atcagtgtga gcagtggaag tctaggctta
gatatggaag accccatgta tactcacgat 600
ggaaaactgg gaatcagaat tggtggccca
ctgcaagtag tagacagctt gcacacactc 660
actgtagtta ctggaaacgg aataactgta
gctaacaatg cccttcaaac taaagttgcg 720
ggtgccctgg gttatgactc atctggcaat
ctagaattgc gagccgcagg gggtatgcga 780
attaacacag ggggtcaact cattcttgat
gtggcttatc catttgatgc tcagaacaat 840
ctcagcctta gactcggcca gggaccttta
tatgtgaaca ccaatcacaa cctagattta 900
aattgcaaca gaggtctgac cacaaccacc
agcagtaaca caaccaaact tgaaactaaa 960
atcgattcgg gcttagacta taacgccaat
ggggctatca ttgctaaact tggcactggg 1020
ttaacctttg acaacacagg tgccataact
gtgggaaaca ctggggatga caaactcact 1080
ctgtggacta ccccagatcc ctctcctaac
tgcagaattc acgcagacaa agactgcaag 1140
tttactctag tcctgactaa gtgtggaagt
caaattctgg cctccgtcgc cgccctggcg 1200
gtgtctggaa acctatcatc aatgacaggc
actgtctcca gcgttaccat ctttctcaga 1260
ttcgatcaga atggagttct tatggaaaat
tcctcgctag acaaggagta ctggaacttc 1320
agaaatggta attccaccaa tgccaccccc
tacaccaatg cggttgggtt catgcccaac 1380
ctcagcgcct accccaaaac ccagagtcaa
actgcaaaaa acaacattgt aagtgaggtt 1440
tacttacatg gggacaaatc taaacccatg
atccttacca ttacccttaa tggcacaaat 1500
gaatccagtg aaactagtca ggtgagtcac
tactccatgt catttacatg gtcctgggac 1560
agtgggaaat atgccaccga aacctttgcc
accaactctt ttaccttctc ctacattgct 1620
gaacaataa 1629
<210> 60
<211> 2895
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 60
atggcgaccc catcgatgat gccgcagtgg
tcgtacatgc acatctcggg ccaggacgcc 60
tcggagtacc tgagccccgg gctggtgcag
ttcgcccgcg ccaccgacag ctacttcagc 120
ctgagtaaca agtttaggaa ccccacggtg
gcgcccacgc acgatgtgac caccgaccgg 180
tcccagcgcc tgacgctgcg gttcatcccc
gtggaccgcg aggacaccgc gtactcttac 240
aaggcgcggt tcaccctggc cgtgggcgac
aaccgcgtgc tggacatggc ctccacctac 300
tttgacatcc gcggcgtgct ggacaggggc
cccaccttca agccctactc cggcaccgcc 360
tacaactccc tggcccccaa gggcgccccc
aactcctgcg agtgggagca agaggagact 420
cagacagctg aagaggcaca agacgaagaa
gaagatgaag ctgaagctga ggaggaaatg 480
cctcaggaag agcaagcacc tgtcaaaaag
actcatgtat atgctcaggc tcccctttct 540
ggcgaaaaaa ttactaaaga cggtctgcag
ataggaacgg acgctacagc taccgaacaa 600
aaacctattt atgcagatcc cacattccag
ccagaacccc aaattggtga atctcagtgg 660
aatgaggcag atgcttcagt tgccggcggt
agagtgctga agaaaactac tcccatgaaa 720
ccctgttatg gttcctatgc caggcccaca
aatgccaatg gaggtcaggg tgtattggtg 780
gagaaagacg gtggaaagat ggaaagccaa
gtagatatgc aattcttttc gacttctgaa 840
aacgcccgta acgaggctaa caacattcag
cccaaattgg tgctgtacag cgaggatgtg 900
catatggaga ccccagacac acacatttct
tacaagcctg caaaaagcga tgataattcg 960
aaagtcatgc tgggtcagca gtccatgccc
aacaggccaa attacatcgg cttcagagac 1020
aactttatcg ggctcatgta ttacaacagc
actggcaaca tgggggtgct ggcaggtcag 1080
gcctcacagt tgaatgcggt ggtggacttg
caagacagaa acacagaact gtcctaccag 1140
ctcttgcttg attccatggg agacagaacc
agatactttt ccatgtggaa tcaggcggtg 1200
gacagttatg atccagatgt cagaattatt
gaaaatcatg gaactgaaga tgagctgccc 1260
aactattgtt tccctctggg aggcataggg
gtaactgaca cttaccaggc cattaagact 1320
aatggcaatg gcaacggcgg gggcaatacc
acttggacca aggatgaaac ttttgcagac 1380
cgcaacgaga taggggtggg aaacaatttc
gccatggaga tcaacctcag tgccaacctg 1440
tggaggaact tcctctactc caacgtggcc
ctgtacctgc cagacaagct taagtacaac 1500
ccctccaacg tggaaatctc tgacaacccc
aacacctacg actacatgaa caagcgagtg 1560
gtggccccgg ggctggtgga ctgctacatc
aacctgggcg cgcgctggtc cctggactac 1620
atggacaacg tcaacccctt caaccaccac
cgcaacgcgg gcctgcgcta ccgctccatg 1680
cttctgggca acgggcgcta cgtgcccttc
cacatccagg tgccccagaa gttctttgcc 1740
atcaagaacc tcctcctcct gccgggctcc
tacacctacg agtggaactt caggaaggat 1800
gtcaacatgg tcctccagag ctctctgggt
aacgacctca gggtcgacgg ggccagcatc 1860
aagttcgaga gcatctgcct ctacgccacc
ttcttcccca tggcccacaa cacggcctcc 1920
acgctcgagg ccatgctcag gaacgacacc
aacgaccagt ccttcaacga ctacctctcc 1980
gccgccaaca tgctctaccc catccccgcc
aacgccacca acgttcccat ctccatcccc 2040
tcgcgcaact gggcggcctt ccgcggctgg
gccttcaccc gcctcaagac caaggagacc 2100
ccctccctgg gctcgggttt cgacccctac
tacacctact cgggctccat accctacctg 2160
gacggaacct tctacctcaa ccacactttc
aagaaggtct cggtcacctt cgactcctcg 2220
gtcagctggc cgggcaacga tcgcctgctc
acccccaacg agttcgagat caagcgctcg 2280
gtcgacgggg agggctacaa cgtggcccag
tgcaacatga ccaaggactg gttcctcatc 2340
caaatgctgg ccaactacaa catcggctat
cagggcttct acatcccaga gagctacaag 2400
gacaggatgt actccttctt taggaacttc
cagcccatga gccggcaggt ggtggacgaa 2460
accaagtaca aggactacca gcaggtgggc
atcatccacc agcacaacaa ctcgggcttc 2520
gtgggctacc tcgcccccac catgcgcgag
ggacaggcct accccgccaa cttcccctac 2580
ccgctcattg gcaagaccgc ggtcgacagc
gtcacccaga aaaagttcct ctgcgaccgc 2640
accctctggc gcatcccctt ctccagcaac
ttcatgtcca tgggtgcgct cacggacctg 2700
ggccagaacc tgctctatgc caactccgcc
cacgcgctcg acatgacctt cgaggtcgac 2760
cccatggacg agcccaccct tctctatgtt
ctgttcgaag tctttgacgt ggtccgggtc 2820
caccagccgc accgcggcgt catcgagacc
gtgtacctgc gcacgccctt ctcggccggc 2880
aacgccacca cctaa 2895
<210> 61
<211> 1755
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 61
atgcggcgcg cggcgatgta ccacgaggga
cctcctccct cttatgagag cgtggtgggc 60
gcggcggcgg cctctccctt tgcgtcgcag
ctggagccgc cgtacgtgcc tccgcggtac 120
ctgcggccta cggggggaag aaacagcatc
cgttactcgg agctggcgcc cctgtacgac 180
accacccggg tgtacctggt ggacaacaag
tcggcggacg tggcctccct gaactaccag 240
aacgaccaca gcaatttttt gaccacggtc
atccagaaca atgactacac cccgagcgag 300
gccagcaccc agaccatcaa tctggatgac
cggtcgcact ggggcggcga cctgaaaacc 360
atcctgcaca ccaacatgcc caacgtgaac
gagttcatgt tcaccaataa gttcaaggcg 420
cgggtgatgg tgtcgcgttc gcacaccaag
gacgaccggg tggagctgaa gtacgagtgg 480
gtagagttcg agctgcccga gggcaactac
tcggagacca tgaccataga cctgatgaac 540
aacgcgatcg tggagcacta tctgaaagtg
ggcaggcaga acggggtcct ggagagcgac 600
atcggggtca agttcgacac caggaacttc
cgcctggggc tggacccggt caccgggctg 660
gtcatgcccg gggtctacac caacgaggcc
ttccaccccg acatcatcct gctgcccggc 720
tgcggggtgg acttcaccta cagccgcctg
agcaacctgc tgggcatccg caagcggcag 780
cccttccagg agggctttag gatcacctac
gaggacctgg aggggggcaa catccccgcg 840
ctcctggatg tggaggccta ccaggatagc
ttgaaggaag aagaggcggg agagggcagc 900
ggcggcggcg gcggcgccgg tcaggaggag
ggcggggcct cctctgaggc ctctgcggac 960
gccgccgctg ccgccgaggc ggaggcggcc
gaccccgcga tggtggtaga ggaagagaag 1020
gatatgaatg acgaggcggt gcgcggcgac
acctttgcca cccgggggga ggagaagaaa 1080
gcggaggccg aggccgcggc agaggaggcg
gcagcggcgg cggcggcggc agtagaggcg 1140
gcggccgagg cggagaagcc ccccaaggag
cccgtgatta aggccctgac cgaagatagc 1200
aagaagcgca gttacaacgt gctcaaggac
agcaccaaca ccgcgtaccg cagctggtac 1260
ctggcctaca actacggcga cccggcgacg
ggggtgcgct cctggaccct gctgtgtacg 1320
ccggacgtga cctgcggctc ggagcaggtg
tactggtcgc tgcccgacat gatgcaagac 1380
cccgtgacct tccgctccac gcggcaggtc
agcaacttcc cggtggtggg cgccgagctg 1440
ctgcccgtgc actccaagag cttctacaac
gaccaggccg tctactccca gctcatccgc 1500
cagttcacct ctctgaccca cgtgttcaat
cgctttcctg agaaccagat tctggcgcgc 1560
ccgcccgccc ccaccatcac caccgtcagt
gaaaacgttc ctgctctcac agatcacggg 1620
acgctaccgc tgcgcaacag catcggagga
gtccagcgag tgaccgtaac tgacgccaga 1680
cgccgcacct gtccctacgt ttacaaggcc
ctgggcatag tctcgccgcg cgtcctttcc 1740
agccgcactt tttaa 1755
<210> 62
<211> 37776
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 62
catcatcaat aatatacctt attttggatt
gaagccaata tgataatgag gtgggcggag 60
cggggcgggg cggggaggag cggcggcgcg
gggcgggccg ggaggtgtgg cggaagttga 120
gtttgtaagt gtggcggatg tgacttgcta
gcgccggatg tggtaaaagt gacgtttttg 180
gagtgcgaca acgcccacgg gaagtgacat
ttttcccgcg gtttttaccg gatgtcgtag 240
tgaatttggg cgttaccaag taagatttgg
ccattttcgc gggaaaactg aaatggggaa 300
gtgaaatctg attaatttcg cgttagtcat
accgcgtaat atttgccgag ggccgaggga 360
ctttgaccga ttacgtggag gaatcgccca
ggtgtttttt gaggtgaatt tccgcgttcc 420
gggtcaaagt ctccgtttta ttattatagt
cagctgacgc ggagtgtatt tatacccgct 480
gatctcgtca agaggccact cttgagtgcc
agcgagtaga gttttctcct ctgccgctcc 540
gctccgctct gacaccgggg gaaaaatgag
acatttcacc tacgatggcg gtgtgcttac 600
cggccagctg gctgcctcgg tcctggacgc
cctgattgag gacgtattgg ccgacaatta 660
tcctcctcca gctcattttg agccacctac
tcttcacgaa ctgtatgatt tggacgtggt 720
ggcacctagc gacccgaacg agcaggcggt
ttccagtttt tttcctgact ctatgctgtt 780
ggccagccag gagggggtcg agctcgagac
ccctcctcca atcgccgttt ctcctgagcc 840
tccgaccctg accaggcagc ccgatcgccg
tgttggacct gcgactatgc cccatctgct 900
gcccgaggtg atcgatctca cctgtaacga
gtctggtttt ccacccagcg aggatgagga 960
cgaagagggt gagcagtttg tgttagattc
tgtggaggaa cccgggcgcg gttgcagatc 1020
ttgtcaatac catcggaaaa atacaggaga
cccccaaatt atgtgttccc tgtgttatat 1080
gaagacgacc tgtatgttta tttacagtaa
gtttgtgatt ggtgggtcgg tgggctgtag 1140
tgtgggtagg tggtctgtgg tttttttttt
ttttaatatc agcttgggct aaaaaactgc 1200
tatggtaatt tttttaaggt ccggtgtctg
aacctgagca ggaagctgaa ccggagcctg 1260
agagtcgccc caggagaagg cctgcaattc
taactagacc gagtgcacct gtagcgaggg 1320
acctcagcag tgcagagacc accgattccg
gtccttcctc atcccctcca gagattcatc 1380
ccgtggtgcc tttgtgtccc ctcaagcccg
ttgccgtgag agttagtggg cggagggccg 1440
ccgtggagag cattgaggac ttgcttaatg
agacacagga acctttggac ttgagctgta 1500
aacgccctag gcaataaacc tgcttacctg
gactgaatga gttgacgcct atgtttgctt 1560
ttgaatgact taatgtgtat ataataaaga
gtgagataat gtttaattgc atggtgtgtt 1620
tgattggggc ggggtttgtt gggtatataa
gcttccctgg gctaaacttg gttacacttg 1680
acctcatgga ggcctgggag tgtttagaga
gctttgccga agtgcgtgcc ttgctggaag 1740
agagctctaa taatacctct gggtggtgga
ggtatttttg gggctctccc caggctaagt 1800
tagtttgtag aatcaaggag gattacaagt
gggaatttga acagcttttg aaatcctgtg 1860
gtgagctctt ggattctttg aatctgggcc
accaggctct tttccaggac aagatcatca 1920
ggactttgga tttttccaca ccggggcgca
ttgctgccgg ggttgctttt ctagcttttt 1980
tgaaggataa atggagcgaa gagacccact
tgagttcggg atacgtcctg gattttctgg 2040
ccatacaact gtggagagca tggatcaggc
acaagaacag aatgcaactg ttgtcttccg 2100
tccgtccgtt gctgattcag ccggaggagc
agcagaccgg gccggaggac cgggctcgtc 2160
tggaaccaga agagagggcg ccggagagga
gcgcgtggaa cctgggagcc ggcctgaacg 2220
gccatccaca tcgggagtga atgttggaca
ggtggcggat ctctttccag aactgcgacg 2280
aatcttaact atcagggagg atggacaatt
tgttaagggg cttaagaggg agcggggggc 2340
ttctgaacat aacgaggagg ccagtaattt
agcttttagt ctgatgacca gacaccgtcc 2400
cgagtgcatt acttttcagc agattaagga
taattgtgcc aatgagttag atctgctggg 2460
tcagaagtac agcatagagc agttgaccac
ttactggctg cagccgggtg atgatctgga 2520
ggaagctatt agggtgtatg ccaaggtggc
cctgaggccc gattgcaagt acaagctcaa 2580
ggggctggtg aatatcagga attgttgcta
catttctggg aacggggcgg aggtggagat 2640
agagaccgat gacagggtgg cctttaggtg
cagcatgatg aatatgtggc ctggggtgct 2700
gggcatggac ggggtggtga ttatgaatgt
gaggttcacg gggcccaatt ttaatggcac 2760
ggtgttcctg ggcaacacca acttggtgct
gcacggggtg agcttctatg gctttaacaa 2820
cacctgtgtg gaggcctgga ccgatgtgaa
ggtccgtggc tgtgccttct acggatgttg 2880
gaaggcggta gtgtgtcgcc ccaagagcag
gagttccatt aaaaaatgct tgtttgagag 2940
gtgcaccctg ggggtgctgg cggagggcaa
ctgtcgggtg cgccacaatg tggcctcaga 3000
atgcggttgc ttcatgctag tcaagagcgt
ggcggtcatc aagcataaca tggtgtgcgg 3060
caacagcgag gacaaggcct cgcagatgct
gacctgctcg gatggcaact gccacttact 3120
gaagaccgta catataacca gccacagccg
caaggcctgg cccgtgttcg agcacaacgt 3180
gttgacccgc tgctctttgc atctgggcaa
caggaggggt gtgttcctgc cctatcaatg 3240
caacttgagc cacaccaaga tcttgctaga
gcccgaaagc atgtccaagg tgaacctgaa 3300
cggggtgttt gacatgaccc tgaagatatg
gaaggtgctg aggtacgacg agaccaggtc 3360
tcgatgcagg ccctgcgagt gcgggggcaa
gcatatgagg aaccagcctg tgatgctgga 3420
tgtgaccgag gagctgaggc ctgaccactt
ggttctggcc tgcaccaggg ccgagtttgg 3480
ttctagcgat gaagacacag actgaggtgg
gtgagtgggc gtggtctggg ggtgggaagc 3540
aatatataag ttgggggtct tagggtctct
gtgtctgttt tgcagaggga ccgccggcgc 3600
catgagcggg agcagtagca gcaacgcctt
ggatggcagc atcgtgagcc cttatttgac 3660
gacgcgcatg ccccactggg ccggggtgcg
tcagaatgtg atgggctcca gcatcgacgg 3720
acgacccgtg ctgcccgcaa attccgccac
gctgacctac gcgaccgtcg cggggacccc 3780
gttggacgcc accgccgccg ccgccgccac
cgccgccgcc tcggccgtgc gcagcctggc 3840
cacggacttt gcattcttgg gacccttggc
caccggggcg gccgcccgtg ccgccgttcg 3900
cgatgacaag ctgaccgccc tgctggcgca
gttggatgcg cttacccggg aactgggtga 3960
cctttcgcag caggtcgtgg ccctgcgcca
gcaggtctcc gccctgcagg ctagcgggaa 4020
tgcttctcct gcaaatgccg tttaagataa
ataaaaccag actctgtttg gattaaagaa 4080
aagtagcaag tgcattgctc tctttatttc
ataattttcc gcgcgcgata ggcccgagtc 4140
cagcgttctc ggtcgttgag ggtgcggtgt
atcttctcca ggacgtggta gaggtggctc 4200
tggacgttga gatacatggg catgagcccg
tcccgggggt ggaggtagca ccactgcaga 4260
gcttcatgct ccggggtggt gttgtagatg
atccagtcgt agcaggagcg ctgggcatgg 4320
tgcctaaaaa tgtccttaag cagcaggccg
atggccaggg ggaggccctt ggtgtaagtg 4380
tttacaaaac ggttgagttg ggaagggtgc
atgcggggtg agatgatgtg catcttagat 4440
tgtattttta gattggcgat gtttcctccc
agatcccttc tgggattcat gttgtggagg 4500
accaccagca cagtatatcc ggtgcacttg
ggaaatttgt catgcagctt agagggaaat 4560
gcgtggaaga acttggagac gcccttgtgg
cctcccagat tctccatgca ttcgtccatg 4620
atgatggcaa tgggcccgcg ggaggcggcc
tgggcaaaga tgtttctggg gtcactgaca 4680
tcgtagttgt gttccagggt gagatcgtca
taggccattt ttataaagcg cgggcggagg 4740
gtgcccgact gggggatgat ggttccctcg
ggccccgggg cgtagttgcc ttcgcagatc 4800
tgcatttccc aggccttaat ctctgagggg
ggaatcatat ccacttgcgg ggcgatgaag 4860
aaaacggttt ccggagccgg ggagattaac
tgggatgaga gcaggtttct cagcagctgt 4920
gactttccac agccggtggg gccataaata
acacctataa ccggctgcag ctggtagttg 4980
agcgagctgc agctgccgtc gtcccggagg
aggggggcca cctcattgag catgtcccgg 5040
acgcgcttgt tctcctcgac caggtccgcc
agaaggcgct cgccgcccag ggacagcagc 5100
tcttgcaagg aagcaaagtt tttcagcggc
ttgaggccgt ccgccgtggg catgtttttc 5160
agggtctggc cgagcagctc caggcggtcc
cagagctcgg tgacgtgctc tacggcatct 5220
ctatccagca tatctcctcg tttcgcgggt
tggggcggct ttcgctgtag ggcaccaggc 5280
gatggtcgtc cagcgcggcc agagtcatgt
ccttccatgg gcgcagggtc ctcgtcaggg 5340
tggtctgggt cacggtgaag gggtgcgccc
cgggctgggc gctggccagg gtgcgcttga 5400
gactggtcct gctggtgctg aagcgctgcc
ggtcttcgcc ctgcgcgtcg gccaggtagc 5460
atttgaccat ggtgtcgtag tccagcccct
ccgcggcgtg tcccttggcg cgcagcttgc 5520
ccttggaggt ggcgccgcac gcggggcact
gcaggctctt gagcgcgtag agcttggggg 5580
cgaggaagac cgattcgggg gagtaggcgt
ccgcgccgca ggccccgcac acggtctcgc 5640
actccaccag ccaggtgagc tcggggcgct
cggggtcaaa aaccaggttt cccccatgct 5700
ttttgatgcg tttcttacct cgggtctcca
tgaggcggtg tccccgctcg gtgacgaaga 5760
ggctgtccgt gtctccgtag accgacttga
ggggtctgtc ctccaggggg gtccctcggt 5820
cctcttcgta gagaaactcg gaccactctg
agacgaaggc ccgcgtccag gccaggacga 5880
aggaggccag gtgggagggg tagcggtcgt
tgtccactag ggggtccacc ttctccaagg 5940
tgtgaagaca catgtcgccc tcctcggcgt
ccaggaaggt gattggcttg taggtgtagg 6000
ccacgtgacc cggggttccg gacggggggg
tataaaaggg ggtgggggcg cgctcgtcct 6060
cactctcttc cgcatcgctg tctgcgaggg
ccagctgctg gggtgagtat tccctctcga 6120
aggcgggcat gacctcagcg ctgaggctgt
cagtttctaa aaacgaggag gatttgatgt 6180
tcacctgtcc cgagctgatg cctttgaggg
tgcccgcgtc catctggtca gaaaacacga 6240
tctttttatt gtccagcttg gtggcgaacg
acccgtagag ggcgttggag agcagcttgg 6300
cgatggagcg cagggtctga ttcttgtccc
ggtcggcgcg ctccttggcc gcgatgttga 6360
gctgcacgta ctcgcgcgcg acgcagcgcc
actcggggaa gacggtggtg cgctcgtcgg 6420
gcaccaggcg cacgcgccag ccgcggttgt
gcagggtgac gaggtccacg ctggtggcga 6480
cctcgccgcg caggcgctcg ttggtccagc
agaggcgccc gcccttgcgc gagcagaagg 6540
ggggcagggg gtcgagttgg gtttcgtccg
gggggtccgc gtccaccgtg aagaccccgg 6600
ggcgcaggcg cgcgtcgaag tagtcgatct
tgcatccttg caagtccagc gcctgctgcc 6660
agtcgcgggc ggcgagcgcg cgctcgtagg
ggttgagcgg cgggccccag ggcatggggt 6720
gggtgagcgc ggaggcgtac atgccgcaga
tgtcatagac gtagaggggc tcccggagga 6780
tgcccaggta ggtggggtag cagcggccgc
cgcggatgct ggcgcgcacg tagtcgtaga 6840
gctcgtgcga gggggcgagg aggtcggggc
ccaggttggt gcgggcgggg cgctccgcgc 6900
ggaagacgat ctgcctgaag atggcatgcg
agttggaaga gatggtgggg cgctggaaga 6960
cgttgaagct ggcgtcctgc aggccgacgg
cgtcgcgcac gaaggaggcg taggactcgc 7020
gcagcttgtg caccagctcg gcggtgacct
gcacgtcgag cgcgcagtag tcgagggtct 7080
cgcggatgat gtcatactta gcctgcccct
tctttttcca cagctcgcgg ttgaggacga 7140
actcttcgcg gtctttccag tactcttgga
tcgggaaacc gtccggctcc gaacggtaag 7200
agcccagcat gtagaactgg ttgacggcct
ggtaggcgca gcagcccttc tccacgggca 7260
gggcgtaggc ctgcgcggcc ttgcggagcg
aggtgtgggt cagggcgaag gtgtccctga 7320
ccatgacctt gaggtactgg tgtttgaagt
cggagtcgtc gcagccgccc cgctcccaga 7380
gcgagaagtc ggtgcgcttt ttggagcggg
ggttgggcag cgcgaaggtg acatcgttgt 7440
agaggatctt gcccgcgcga ggcatgaagt
tgcgggtgat gcggaagggc cccggcactt 7500
ccgagcggtt gttgatgacc tgggcggcga
gcacgatctc gtcgaagccg ttgatgttgt 7560
ggcccacgat gtagagttcc aggaagcggg
gccggccctt gacgctgggc agcttcttta 7620
gctcttcgta ggtgagctcc tcgggcgagg
cgaggccgtg ctcggccagg gcccagtccg 7680
ccaggtgcgg gttgtccgcg aggaaggacc
gccagaggtc gcgggccagg agggtctgca 7740
ggcggtccct gaaggtcctg aactggcggc
ctacggccat cttttcgggg gtgacgcagt 7800
agaaggtgag ggggtcttgc tgccaggggt
cccagtcgag ctccagggcg aggtcgcgcg 7860
cggcggcgac caggcgctcg tcgcccccga
atttcatgac cagcatgaag ggcacgagct 7920
gctttccgaa ggcgcccatc caagtgtagg
tctctacatc gtaggtgaca aagagacgtt 7980
ccgtgcgagg atgcgagccg atcgggaaga
actggatctc ccgccaccag ttggaggagt 8040
ggctgttgat gtggtgaaag tagaagtccc
gtcggcgggc cgagcactcg tgctggcttt 8100
tgtaaaagcg agcgcagtac tggcagcgct
gcacgggctg tacctcttgc acgagatgca 8160
cctgccgacc gcggacgagg aagctgagtg
ggaatctgag ccccccgcat ggctcgcggc 8220
ctggctggtg ctcttctact ttggatgcgt
ggccgtcacc gtctggctcc tcgaggggtg 8280
ttacggtgga gcggatcacc acgccgcgcg
agccgcaggt ccagatatcg gcgcgcggcg 8340
gtcggagttt gatgacgaca tcgcgcagct
gggagctgtc catggtctgg agctcccgcg 8400
gcggcggcag gtcagccggg agttcttgca
ggtttacctc gcagagacgg gccagggcgc 8460
ggggcaggtc caggtggtac ttgaattcga
gaggcgtgtt ggtggcggcg tcgatggctt 8520
gcaggaggcc gcagccccgg ggcgcgacga
cggtgccccg cggggcggtg aagctcccgc 8580
cgccgctcct gctgtcgccg ccggtggcgg
ggcttagaag cggtgccgcg gtcgggcccc 8640
cggaggtagg gggggctccg gtcccgcggg
caggggcggc agcggcacgt cggcgccgcg 8700
cgcgggcagg agctggtgct gcgcccggag
gttgctggcg aaggcgacga cgcggcggtt 8760
gatctcctgg atctggcgcc tctgcgtgaa
gacgacgggt ccggtgagct tgaacctgaa 8820
agagagttcg acagaatcaa tctcggtgtc
attgaccgcg acctggcgca ggatctcctg 8880
cacgtcgccc gagttgtctt ggtaggcgat
ctcggccatg aactgttcga tctcttcctc 8940
ctggaggtct ccgcgtccgg cgcgctccac
ggtggccgcc aggtcgttgg agatgcgcgc 9000
catgagctgc gagaaggcgt tgagtccgcc
ctcgttccag actcggctgt agaccacgcc 9060
gccctggtcg tcgcgggcgc gcatgaccac
ctgcgcgagg ttgagttcca cgtggcgcgc 9120
aaagacggcg tagttgcgca ggcgctggaa
gaggtagttg agggtggtgg cggtgtgctc 9180
ggccacgaag aagtacatga cccagcggcg
caacgtggat tcgttgatgt cccccaaggc 9240
ctccagtcgc tccatggcct cgtagaagtc
cacggcgaag ttgaaaaact gggagttgcg 9300
cgccgacacg gtcaactcct cctccagaag
acggatgagc tcggcgacgg tgtcgcgcac 9360
ctcgcgctcg aaggctatgg gaatctcttc
ctccgccagc atcaccacct cttcctcttc 9420
ttcctcctct ggcacttcca tgatggcttc
ctcctcttcg gggggtggcg gcgggggagg 9480
gggcgctcgg cgccggcggc ggcgcaccgg
gaggcggtcc acgaagcgct cgatcatctc 9540
cccgcggcgg cgacgcatgg tctcggtgac
ggcgcggccg ttctctcggg gacgcagctg 9600
gaagacgccg ccggtcatct ggtgctgggg
cgggtggccg tggggcagcg agaccgcgct 9660
gacgatgcat cttaacaatt gctgcgtagg
tacgccgccg agggacctga gggagtccag 9720
atccaccgga tccgaaaacc tttcgaggaa
ggcatctaac cagtcgcagt cgcaaggtag 9780
gctgagcacc gtggcgggcg gcggggggtg
gggggagtgt ctggcggagg tgctgctgat 9840
gatgtaattg aagtaggcgg tcttgacacg
gcggatggtc gacaggagca ccatatcttt 9900
gggcccggcc tgctggatgc ggaggcggtc
ggccatgccc caggcttcgt tctggcatct 9960
gcgcaggtct ttgtagtagt cttgcatgag
cctttccacc ggcacctctt ctccttcttc 10020
ttctgacatc tctgctgcat ctgcggccct
ggggcgacgg cgcgcgcccc tgccccccat 10080
gcgcgtcacc ccgaaccccc tgagcggctg
gagcagggcc aggtcggcga cgacgcgctc 10140
ggccaggatg gcctgctgga cctgcgtgag
ggtggtttgg aagtcatcca agtccacgaa 10200
gcggtggtag gcgcccgtgt tgatggtgta
ggtgcagttg gccatgacgg accagttgac 10260
ggtctggtgg cccggttgcg tcatctcggt
gtacctgagg cgcgagtagg cgcgcgagtc 10320
gaagatgtag tcgttgcaag tccgcaccag
gtactggtag cccaccagga agtgcggcgg 10380
cggctggcgg tagaggggcc agcggagggt
ggcgggggct ccgggggcca ggtcttccag 10440
catgaggcgg tggtattcgt agatgtacct
ggacatccag gtgatgcccg cggcggtggt 10500
ggaggcgcgc gggaagtcgc gcacccggtt
ccagatgttg cgcagcggca gaaagtgctc 10560
catggtaggc gtgctctggc cggtcaggcg
cgcgcagtcg ttgatactct agaccaggga 10620
aaacgaaagc cggtcagcgg gcactcttcc
gtggtctggt ggataaattc gcaagggtat 10680
catggcggag ggcctcggtt cgagccccgg
gcccgggccg gacggtccgc catgatccac 10740
gcggttaccg cccgcgtgtc gaacccaggt
ggcgacgtca gacaacggtg gagtgttcct 10800
tttgggtttt ttttaatttt tctggccggg
cgccgacgcc gccgcgtaag agactagagt 10860
gcaaaagcga aagcagtaag tggctcgctc
cctgtagccc ggaggatcct tgctaagggt 10920
tgcgttgcgg cgaaccccgg ttcgagtctg
gctctcgctg ggccgctcgg gtcggccgga 10980
accgcggcta aggcgggatt ggcctccccc
tcattaaaga ccccgcttgc ggattcctcc 11040
ggacacaggg gacgagcccc tttttacttt
tgcttttctc agatgcatcc ggtgctgcgg 11100
cagatgcgcc ccccgcccca gcagcagcag
cagcaacatc agcaagagcg gcaccagcag 11160
cagcgggagt catgcagggc cccctcgccc
acgctcggcg gtccggcgac ctcggcgtcc 11220
gcggccgtgt ctggagccgg cggcggtggg
ctggcggacg acccggagga gcccccgcgg 11280
cgcagggcca gacagtacct ggacctggag
gagggcgagg gcctggcgcg actgggggcg 11340
ccgtcccccg agcgccaccc gcgggtgcag
ctgaagcgcg actcgcgcga ggcgtacgtg 11400
cctcggcaga acctgttcag agaccgcgcg
ggcgaggagc ccgaggagat gcgggaccgc 11460
aggttcgccg cggggcggga gctgcggcag
gggctgaacc gggagcggct gctgcgcgag 11520
gaggactttg agcccgacgc gcggacgggg
atcagccccg cgcgcgcgca cgtggcggcc 11580
gccgacctgg tgacggcgta cgagcagacg
gtgaaccagg agatcaactt ccaaaaaagc 11640
ttcaacaacc acgtgcgcac gctggtggcg
cgcgaggagg tgaccatcgg cctgatgcac 11700
ctgtgggact ttgtgagcgc gctggagcag
aaccccaaca gcaagcctct gacggcgcag 11760
ctgttcctga tagtgcagca cagcagggac
aacgaggcgt tcagggacgc gctgctgaac 11820
atcaccgagc ccgagggtcg gtggctcctg
gacctgatta acatcttgca gagcatagtg 11880
gtgcaggagc gcagcctgag cctggccgac
aaggtggcgg ccatcaatta ctcgatgctc 11940
agtctgggca agttttacgc gcgcaagatc
taccagacgc cgtacgtgcc catagacaag 12000
gaggtgaaga tcgacggctt ctacatgcgc
atggcgctga aggtgctgac cctgagcgac 12060
gacctgggcg tgtaccgcaa cgagcgcatc
cacaaggccg tgagcgtgag ccggcggcgc 12120
gagctgagcg accgcgagct gatgcacagc
ctgcagcggg cgctggcggg ggccggcagc 12180
ggcgacaggg aggccgagtc ctacttcgag
gcgggggcgg acctgcgctg ggtgcccagc 12240
cggagggccc tggaggccgc gggggcccgc
cgcgaggact atgcagacga ggaggaggag 12300
gatgacgagg agtacgagct agaggagggc
gagtacctgg actaaaccgc aggtggtgtt 12360
tttggtagat gcaagacccg aacgtggtgg
acccggcgct gcgggcggct ctgcagagcc 12420
agccgtccgg ccttaactct acagacgact
ggcgacaggt catggaccgc atcatgtcgc 12480
tgacggcgcg caatccggac gcgttccggc
agcagccgca ggccaacagg ctctccgcca 12540
tcttggaggc ggtggtgcct gcgcgcgcga
accccacgca cgagaaggtg ctggccatag 12600
tgaacgcgct cgccgagaac agggccatcc
gcccggacga ggccgggctg gtgtacgacg 12660
cgctgctgca gcgcgtggcc cgctacaaca
gcggcaacgt gcagaccaac ctggaccggc 12720
tggtggggga cgtgcgcgag gcggtggcgc
agcgggagcg cgcggagcgg cagggaaacc 12780
tgggctccat ggtggcgctg aacgccttcc
tgagcacgca gccggccaac gtgccgcggg 12840
ggcaggagga ctacaccaac tttgtgagcg
cgctgcggct gatggtgacc gagacccccc 12900
agagcgaggt gtaccagtcg gggccggact
actttttcca gaccagcaga cagggcctgc 12960
agacggtgaa cctgagccag gctttcaaga
acctgcgggg gctgtggggc gtgaaggcgc 13020
ccaccgggga ccgggcgacg gtgtccagcc
tgctgacgcc caactcgcgc ctgctgctgc 13080
tgctgatcgc gccgttcacg gacagcggca
gcgtgtcccg ggagacctac ctcgggcacc 13140
tgctgacgct gtaccgcgag gccatcgggc
agacccaggt ggacgagcac accttccagg 13200
agatcaccag cgtgagccgc gcgctggggc
aggaggacac gggcagcctg gaggcgaccc 13260
tgaactacct gctgaccaac cggcggcaga
agatcccctc gctgcatagt ttgaccaccg 13320
aggaggagcg catcctgcgc tacgtgcagc
agagcgtgag cctgaacctg atgcgcgacg 13380
gggtgacgcc cagcgtggcg ctggacatga
ccgcgcgcaa catggaaccg ggcatgtacg 13440
ccgcgcatcg gccttacatc aaccgcctga
tggactactt gcatcgcgcg gcggccgtga 13500
accccgagta cttcaccaac gccatcctga
acccgcactg gctcccgccg cccgggttct 13560
acagcggggg cttcgaggtc cccgaggcca
acgacggctt cctgtgggac gacatggacg 13620
acagcgtgtt ctccccgcgg ccgcaggcgc
tggcggaggc gtcgctgctc cgcctcccca 13680
agaaagaaga gagccgccgg cccagcagcg
cggcggcctc tctgtccgag ctgggggcgg 13740
cggccgcgcg gcccgggtcc ctggggggca
gcccctttcc cagtctggtg gggtctctgc 13800
agagcgggcg caccacccgg ccccggctgc
tgggcgagga cgagtacctg aacaactccc 13860
tgatgcagcc ggtgcgggag aaaaacctgc
cccccgcctt ccccaacaac gggatagaga 13920
gcctggtaga caagatgagc agatggaaga
cctatgcgca ggagcacagg gactcgcccg 13980
tgctccgtcc gcccacgcgg cgccagcgcc
acgaccggca gcgggggctg gtatgggatg 14040
acgaggactc cgcggacgat agcagcgtgc
tggacctggg ggggagcggc ggtaacccgt 14100
tcgcgcacct gcgcccccgc ctggggagga
tgtttcaata agaaaaatca agcatgatgc 14160
aaggtttttt aagcggataa ataaaaaact
caccaaggcc atggcgaccg agcgttgttg 14220
gtttcttgtt gtgttccctt agtatgcggc
gcgcggcgat gtaccacgag ggacctcctc 14280
cctcttatga gagcgtggtg ggcgcggcgg
cggcctctcc ctttgcgtcg cagctggagc 14340
cgccgtacgt gcctccgcgg tacctgcggc
ctacgggggg aagaaacagc atccgttact 14400
cggagctggc gcccctgtac gacaccaccc
gggtgtacct ggtggacaac aagtcggcgg 14460
acgtggcctc cctgaactac cagaacgacc
acagcaattt tttgaccacg gtcatccaga 14520
acaatgacta caccccgagc gaggccagca
cccagaccat caatctggat gaccggtcgc 14580
actggggcgg cgacctgaaa accatcctgc
acaccaacat gcccaacgtg aacgagttca 14640
tgttcaccaa taagttcaag gcgcgggtga
tggtgtcgcg ttcgcacacc aaggacgacc 14700
gggtggagct gaagtacgag tgggtagagt
tcgagctgcc cgagggcaac tactcggaga 14760
ccatgaccat agacctgatg aacaacgcga
tcgtggagca ctatctgaaa gtgggcaggc 14820
agaacggggt cctggagagc gacatcgggg
tcaagttcga caccaggaac ttccgcctgg 14880
ggctggaccc ggtcaccggg ctggtcatgc
ccggggtcta caccaacgag gccttccacc 14940
ccgacatcat cctgctgccc ggctgcgggg
tggacttcac ctacagccgc ctgagcaacc 15000
tgctgggcat ccgcaagcgg cagcccttcc
aggagggctt taggatcacc tacgaggacc 15060
tggagggggg caacatcccc gcgctcctgg
atgtggaggc ctaccagaat agcttgaagg 15120
aagaagaggc gggagagggc agcggcggcg
gcggcgccgg tcaggaggag ggcggggcct 15180
cctctgaggc ctctgcggac gcagctgccg
ccgaggcgga ggaggcggcc gaccccgcga 15240
tggtggtaga ggaagagaag gatatgaatg
acgaggcggt gcgcggcgac acctttgcca 15300
cccgggggga ggagaagaaa gcggaggccg
aggccgcggc agaggaggcg gcagcagcgg 15360
cggcggcagt agaggcggcg gccgaggcgg
agaagccccc caaggagccc gtgattaagc 15420
ccctgaccga agatagcaag aagcgcagtt
acaacgtgct caaggacagc accaacaccg 15480
agtaccgcag ctggtacctg gcctacaact
acggcgaccc ggcgacgggg gtgcgctcct 15540
ggaccctgct gtgtacgccg gacgtgacct
gcggctcgga gcaggtgtac tggtcgctgc 15600
ccgacatgat gcaagacccc gtgaccttcc
gctccacgcg gcaggtcagc aactttccgg 15660
tggtgggcgc cgagctgctg cccgtgcact
ccaagagctt ctacaacgac caggccgtct 15720
actcccagct catccgccag ttcacctctc
tgacccacgt gttcaatcgc tttcctgaga 15780
accagattct ggcgcgcccg cccgccccca
ccatcaccac cgtcagtgaa aacgttcctg 15840
ctctcacaga tcacgggacg ctaccgctgc
gcaacagcat cggaggagtc cagcgagtga 15900
ccgtaactga cgccagacgc cgcacctgtc
cctacgttta caaggccctg ggcatagtct 15960
cgccgcgcgt cctttccagc cgcacttttt
aagcatgtcc atcctcatct cgcccagcaa 16020
taacaccggc tggggcctgc tgcgcgcgcc
cagcaagatg ttcggagggg cgaggaagcg 16080
ctccgaccag caccccgtgc gcgtgcgcgg
gcactaccgc gccccctggg gcgcgcacaa 16140
acgcgggcgc accggcaccg cggggcgcac
caccgtggac gaagccatcg actcggtggt 16200
ggagcaggcg cgcaactaca cgcccgcggt
ctccaccgtg gacgcggcta tcgagagcgt 16260
ggtgcgaggc gcgcggcggt acgccaaggc
gaagagccgc cggaggcgcg tggcccgccg 16320
ccaccgccgc cgacccggga gcgccgccaa
gcgcgccgcc gccgccttgc ttcgccgggc 16380
cagacgcacg ggccgccgcg ccgccatgag
ggccgcgcgc cgcctggccg ccggcatcac 16440
caccgtggcc ccccgcgcca gaagacgcgc
ggccgctgcc gccgctgcgg ccatcagcga 16500
cctggccacc aggcgccggg gcaacgtgta
ctgggtgcgc gactcggtga gcggcacgcg 16560
cgtgcccgtg cgcttccgcc ccccgcggac
ttgagaggag aggacaggaa aaaagcatca 16620
acaacaacac cactgagtct cctgctgttg
tgtgtatccc agcggcgcgc gcgcacacgg 16680
cgacatgtcc aagcgcaaaa tcaaagaaga
gatgctccag gtcgtcgcgc cggagatcta 16740
tgggcccccg aagaaggaag agcaggattt
caagccccgc aagataaagc gggtcaaaaa 16800
gaaaaagaaa gatgacgatg atggcgaggt
ggagtttctg cgcgccacgg cgcccaggcg 16860
cccgctgcag tggaagggtc ggcgcgtaaa
gcgcgttctg cgccccggca ccgcggtggt 16920
cttcacgccc ggcgagcgct ccacccgcac
tttcaagcgc gtctatgacg aggtgtacgg 16980
cgacgaagac ctgctggagc aggccaacga
tcgctccgga gagtttgctt acgggaagcg 17040
gcaccgggcg atggagaagg acgaggtgct
ggcgctgccg ctggaccggg gcaaccccac 17100
ccccagcctg aagcccgtga ccttgcagca
ggtgctgccg agcagcgcgc cctccgagat 17160
gaagcggggc ctgaagcgcg agggcggcga
cctggcgccc accgtgcagc tgatggtgcc 17220
caagcggcag aggctggagg acgtgctgga
gaaaatgaaa gtagaccccg gcctgcagcc 17280
ggacatcagg gtccgcccca tcaagcaggt
ggcgccgggc ctcggcgtgc agaccgtgga 17340
cgtggtcatc cccaccggcg cctcctcttc
cagcgccgcc gccgccacta gcaccgcgga 17400
catggagacg cagactagct ccgccctcgc
cgcccccgcg gccgccgccg ccgccgccac 17460
ctcctcggcg gaggtacaga cggacccctg
gatgccgccg ccggcggccg ccccctcgcg 17520
cgcacgccgc gggcgcagga agtacggtgc
cgccagcgcg ctcatgcccg agtacgcctt 17580
gcatccttcc atcgcgccca cccccggcta
ccgaggctac agctaccgcc cgcgaagagc 17640
caagggctcc acccgccgca gccgccgcgc
cgccacctct acccgccgcc gcagtcgccg 17700
ccgccgccgc cggcagcccg cgctggctcc
gatctccgtg aggagagtgg cgcgcaacgg 17760
ggacaccttg gtgctgccca gggcgcgcta
ccaccccagc atcgtttaaa agcctgttgt 17820
ggttcttgca gatatggccc tcacttgccg
cctccgtttc ccggtgccgg gataccgagg 17880
aagatcgcgc cgtagaaggg gtatggccgg
acgcggcctg agcggaggca gccgccgtgc 17940
gcaccggcgg cgacgcgcca ccagccgacg
catgcgcggc ggggtgctgc ctctgctgat 18000
ccccctgatc gccgcggcga tcggcgccgt
gcccgggatc gcctccgtgg ccttgcaggc 18060
gtcccagagg cgttgacaca gacttcttgc
aagcttgcaa aatatggaaa aaatcccccc 18120
aataaaaaag tctagactct cacgctcgct
tggtcctgtg actattttgt agaaaaaaga 18180
tggaagacat caactttgcg tcgctggccc
cgcgtcacgg ctcgcgcccg ttcctgggac 18240
actggaacga tatcggcacc agcaacatga
gcggtggcgc cttcagttgg ggctctctgt 18300
ggagcggcat taaaaatatc ggttctgccg
ttaagaatta cggcaccaag gcctggaaca 18360
gcagcacggg ccagatgttg agagacaagt
tgaaagagca gaacttccag cagaaggtgg 18420
tggagggcct ggcctccggc atcaacgggg
tggtggacct ggccaatcag gccgtgcaaa 18480
ataagatcaa cagcaaactg gacccccggc
cgccggtgga agagctgccg ccggcgctgg 18540
agacggtgtc ccccgatggg cggggcgaaa
agcgcccgcg gcccgacagg gaagagacca 18600
ctctggtcac gcacaccgat gagccgcccc
cctacgagga agccctgaag caaggcttgc 18660
ccaccactcg gcccatcgcg cccatggcca
ccggggtggt gggccgccac acccccgcca 18720
cgctggacct gcctcctcct cctgtttctt
cttcggccgc cgatgcgcag cagcagaagg 18780
cggcgctgcc cggtccgccc gcggccgccc
cccgtcccac cgccagtcga gcgcccctgc 18840
gtcgcgcggc cagcggcccc cgcggggtcg
cgaggcacag cagcggcaac tggcagaaca 18900
cgctgaacag catcgtgggt ctgggggtgc
agtccgtgaa gcgccgccga tgctactgaa 18960
tagcttagct aacggtgttg tatgtgtgta
tgcgtcctat gtcaccgcca gaggagctgc 19020
tgagtcgccg ccgttcgcgc gcccaccgcc
actaccaccg ccggtactac tccagcgccc 19080
ctcaagatgg cgaccccatc gatgatgccg
cagtggtcgt acatgcacat ctcgggccag 19140
gacgcctcgg agtacctgag ccccgggctg
gtgcagttcg cccgcgccac cgacagctac 19200
ttcagcctga gtaacaagtt taggaacccc
acggtggcgc ccacgcacga tgtgaccacc 19260
gaccggtccc agcgcctgac gctgcggttc
atccccgtgg accgcgagga caccgcgtac 19320
tcttacaagg cgcggttcac cctggccgtg
ggcgacaacc gcgtgctgga catggcctcc 19380
acctactttg acatccgcgg cgtgctggac
aggggcccca cctttaagcc ctactccggc 19440
actgcctaca actccctggc ccccaagggc
gcccccaacc cctgtgagtg ggatgaagcc 19500
gttactgctg ttgacattaa cctggatgag
ctcggcgaag atgaagacga cgccgaaggg 19560
gaagcagaac agcaaaaaac tcatgtattt
ggtcaagcgc cctactcagg acaaaacatt 19620
acgaaggagg gcatacaaat tggggtagat
accaccagcc aagcccaaac acctttatac 19680
gctgacaaaa cattccaacc cgaacctcag
gttggagaat cccaatggaa tgagacagaa 19740
atcaattatg gagcgggacg agtgctaaaa
aagaccaccc tcatgaaacc atgctatggg 19800
tcatatgcaa gacctactaa tgaaaacggc
ggtcagggca tactgctgga gaaagagggt 19860
ggtaaaccag aaagtcaagt tgaaatgcaa
tttttttcta ctactcaggc cgccgcggct 19920
ggtaattcag ataatcttac tccaaaagtt
gttttgtata gcgaggatgt tcacctggaa 19980
acgccagata cacacatttc atatatgccc
actagcaacg aagccaattc aagagaactg 20040
ttgggacaac aagctatgcc caacagaccc
aactacattg ccttcagaga caactttatt 20100
ggccttatgt attacaacag cactggcaac
atgggagtgc tggcaggtca ggcctcacag 20160
ttgaatgcag tggtggactt gcaagacaga
aacacagaac tgtcctacca gctcttgctt 20220
gattccatgg gagacagaac cagatacttt
tccatgtgga atcaggcggt ggacagttat 20280
gatccagatg ttagaattat tgaaaatcat
ggaactgaag atgagctgcc caactattgt 20340
ttccccctgg gcggcataat taacaccgaa
actttaacta aagtgaaacc taagactgga 20400
caagacgctc agtgggaaaa agatactgag
ttttcagaga aaaatgaaat aagggtggga 20460
aacaacttcg ccatggagat taacctcaat
gccaacctgt ggaggaattt cctgtactcc 20520
aacgtggccc tgtacctgcc agacaaactt
aagtacactc cagccaacgt gcagatttcc 20580
agcaactcca actcctacga ctacatgaac
aagcgagtgg tggccccggg gctggtggac 20640
tgctacatca acctgggcgc gcgctggtcc
ctggactaca tggacaacgt caaccccttc 20700
aaccaccacc gcaatgcggg cctgcgctac
cgctccatgc ttctgggcaa cgggcgctac 20760
gtgcccttcc acatccaggt gccccagaag
ttctttgcca tcaagaacct cctcctcctg 20820
ccgggctcct acacctacga gtggaacttc
aggaaggatg tcaacatggt cctccagagc 20880
tctctgggta acgacctcag ggtcgacggg
gccagcatca agttcgagag catctgcctc 20940
tacgccacct tcttccccat ggcccacaac
acggcctcca cgctcgaggc catgctcagg 21000
aacgacacca acgaccagtc cttcaacgac
tacctctccg ccgccaacat gctctacccc 21060
atccccgcca acgccaccaa cgtccccatc
tccatcccct cgcgcaactg ggcggccttc 21120
cgcggctggg ccttcactcg cctcaagacc
aaggagaccc cctccctggg ctcgggtttc 21180
gacccctact acacctactc gggctccata
ccctacctgg acggaacctt ctacctcaac 21240
cacaccttca agaaggtctc ggtcaccttc
gactcctcgg tcagctggcc gggcaacgac 21300
cgcctgctca cccccaacga gttcgagatc
aagcgctcgg tcgacgggga gggctacaac 21360
gtggcccagt gcaacatgac caaggactgg
ttcctcatcc agatgctggc caactacaac 21420
atcggctatc agggcttcta catcccagag
agctacaagg acaggatgta ctccttcttt 21480
aggaacttcc agcccatgag ccggcaggtg
gtggacgaaa ccaagtacaa ggactaccag 21540
caggtgggca tcatccacca gcacaacaac
tcgggcttcg tgggctacct cgcccccacc 21600
atgcgcgagg gacaggccta ccccgccaac
ttcccctacc cgctcattgg caagaccgcg 21660
gtcgacagcg tcacccagaa aaagttcctc
tgcgaccgca ccctctggcg catccccttc 21720
tccagcaact tcatgtccat gggtgcgctc
acggacctgg gccagaacct gctctatgcc 21780
aactccgccc acgcgctcga catgaccttc
gaggtcgacc ccatggacga gcccaccctt 21840
ctctatgttc tgttcgaagt ctttgacgtg
gtccgggtcc accagccgca ccgcggcgtc 21900
atcgagaccg tgtacctgcg cacgcccttc
tcggccggca acgccaccac ctaaagaagc 21960
aagccgccac cgccaccacc tgcatgtcgt
cgggttccac cgagcaggag ctcaaggcca 22020
tcgtcagaga cctgggatgc gggccctatt
ttttgggcac cttcgacaaa cgcttcccgg 22080
gcttcgtcgc cccgcacaag ctggcctgcg
ccatcgtcaa cacggccggc cgcgagaccg 22140
ggggcgtgca ctggctggcc ttcgcctgga
acccgcgctc caaaacatgc tacctctttg 22200
accccttcgg attctcggac cagcggctca
agcagatcta ccagttcgag tacgagggcc 22260
tgctgcgccg cagcgccatc gcctcctcgc
ccgaccgctg cgtcaccctc gagaagtcca 22320
cccagaccgt gcaggggccc gactcggccg
cctgcggtct cttctgctgc atgttcctgc 22380
atgcctttgt gcgctggccc cagagtccca
tggaccgcaa ccccaccatg aacttgctga 22440
cggggatccc caactccatg ctccagagcc
cccaggccgc gcccaccctg cgccgcaatc 22500
aggagcgact ctacagcttc ctggagcgcc
actcgcccta cttccgccgc cacagcgcgc 22560
agatcagggg ggccacctct ttctgccgca
tgcaagagat gcaagggaaa atgcaatgat 22620
gtacacagac actttttctt ttctcaataa
atggcaactt tatttataca tgctctctct 22680
ctcgggtatt catttcccca ccacccacca
cccgccgccg ccgtaaccat ctgctgctgg 22740
cttttttaaa aatcgaaagg gttctgccgg
gaatcgccgt gcgccacggg cagggacacg 22800
ttgcggaact ggtagcgggt gccccacttg
aactcgggca ccaccatgcg gggcaagtcg 22860
gggaagttgt cggcccacag gccgcgggtc
agcaccagcg cgttcatcag gtcgggcgcc 22920
gagatcttga agtcgcagtt ggggccgccg
ccctgcgcgc gcgagttgcg gtacaccggg 22980
ttgcaacact ggaacaccag cagcgccgga
taattcacgc tggccagcac gctccggtcg 23040
gagatcagct cggcgtccag gtcctccgcg
ttgctcagcg cgaacggggt cagcttgggc 23100
acctgccgcc ccaggaaggg agcgtgtccc
ggcttggaat tgcagtcgca gcgcagcggg 23160
atcagcaggt gcccgcggcc ggactcggcg
ttggggtaca gcgcgcgcat gaaggcctcc 23220
atctggcgga aggccatctg ggccttggcg
ccctccgaga aaaacatgcc gcaggacttg 23280
cccgagaact ggttcgcggg gcagctcgcg
tcgtgcaggc agcagcgcgc gtcggtgttg 23340
gcgatctgca ccacgttgcg cccccaccgg
ttcttcacga tcttggcctt ggaagcctgc 23400
tccttcagcg cgcgctgccc gttctcgctg
gtcacatcca tctcgatcac gtgctccttg 23460
ttcaccatgc tgctgccgtg cagacacttc
agctcgccct ccacctcggt gcagcggtgc 23520
tgccacagcg cgcagcccgt gggctcgaaa
tgcttgtagg tcacctccgc gtaggactgc 23580
aggtaggcct gcaggaagcg ccccatcatg
gtcacgaagg tcttgttgct gctgaaggtc 23640
agctgcagcc cgcggtgctc ctcgttcagc
caggccttgc acacggccgc cagcgcctcc 23700
acctggtcgg gcagcatctt gaagttcagc
ttcagctcat tctccacatg gtacttgtcc 23760
atcagcgcgc gcgcagcctc catgcccttc
tcccaggccg acaccagcgg caggctcaag 23820
gggttcacca ccgtcgcagt cgccgccgcg
ctttcgcttt ccgctccgct gttctcttct 23880
tcctcctcct cctcttcttc ctcgccgccc
gcgcgcagcc cccgcaccac ggggtcgtct 23940
tcctgcaggc gccgcaccga gcgcttgccg
ctcctgccct gcttgatgcg cacgggcggg 24000
ttgctgaagc ctaccatcac cagcgcggcc
tcttcttgct cgtcctcgct gtccactatg 24060
acctcggggg agggcgacct cagtaccgtg
gcgcgctgcc tcttcttttt cctgggggcg 24120
tttgcaagct ccgcggccgc ggccgccgcc
gaggtcgaag gccgagggct gggcgtgcgc 24180
ggcaccagcg cgtcctgcga gccgtcctcg
tcctcggact cgaggcggca gcgagcccgc 24240
ttcttcgggg gcgcgcgggg cggcggcggc
gggggcggcg gcgacggaga cggggacgag 24300
acatcgtcca gggtgggagg acggcgggcc
gcgccgcgtc cgcgctcggg ggtggtttcg 24360
cgctggtcct cttcccgact ggccatctcc
cactgctcct tctcctatag gcagaaagag 24420
atcatggagt ctctcatgca agtcgagaag
gaggaggaca gcctaaccac caccgccccc 24480
tctgagccct ccgccgccgc cgccgcggac
gacgcgccca ccaccgccgc cgccaccacc 24540
accattacca ccctacccgg cgacgcagcc
ccgatcgaga aggaagtgtt gatcgagcag 24600
gacccgggtt ttgtgagcga agaggaggat
gaggaggatg aaaaggagaa ggataccgcc 24660
gcctcagtgc caaaagagga taaaaagcaa
gaccaggacg acgcagagac agatgaggca 24720
gcagtcgggc ggggggacga gaggcatgat
gatgatgacg gctacctaga cgtgggagac 24780
gacgtgctgc ttaagcacct gcaccgccag
tgcgtcatcg tctgcgacgc gctgcaggag 24840
cgctgcgaag tgcccctgga cgtggcggag
gtcagccgcg cctacgagcg gcacctcttc 24900
gcgccacacg tgccccccaa gcgccgggag
aacggcacct gcgagcccaa cccgcgcctc 24960
aacttctacc cggtcttcgc ggtacccgag
gtgctggcca cctaccacat cttcttccaa 25020
aactgcaaga tccccctctc ctgccgcgcc
aaccgcaccc gcgccgacaa gacgctggcc 25080
ctgcggcagg gcgcccacat acctgatatc
gcctctctgg aggaggtgcc caagatcttc 25140
gagggtctcg gtcgcgacga gaaacgggcg
gcgaacgctc tgcaaggaga cagcgaaaac 25200
gagagtcact cgggggtgct ggtggagctc
gagggcgaca acgcgcgcct ggccgtgctc 25260
aagcgcagca tcgaagtcac ccacttcgcc
tacccggcgc tcaacctgcc ccccaaggtc 25320
atgagtgtgg tcatgagcga gctcatcatg
cgccgcgccc agcccctgga cgcggatgca 25380
aacttgcaag agccctccga ggaaggcctg
cccgcggtca gcgacgagca gctggcgcgc 25440
tggctggaga cccgcgaccc cgcccagctg
gaggagcggc gcaagctcat gatggccgcg 25500
gtgctcgtca ccgtggagct cgagtgtctg
cagcgcttct tcggggaccc cgagatgcag 25560
cgcaagctcg aggagaccct gcactacacc
ttccgccagg gctacgtgcg ccaggcctgc 25620
aagatctcca acgtggagct ctgcaacctg
gtctcctacc tgggcatcct gcacgagaac 25680
cgcctcgggc agaacgtcct gcactccacc
ctcaaagggg aggcgcgccg cgactacgtc 25740
cgcgactgcg tctacctctt cctctgctac
acgtggcaga cagccatggg ggtctggcag 25800
cagtgcctgg aggagcgcaa cctcaaggag
ctggagaagc tcctccggcg cgccctcagg 25860
gacctctgga cgggcttcaa cgagcgctcg
gtggccgccg cgctggcgga catcatcttc 25920
cccgagcgcc tgctcaaaac cctgcagcag
ggcctgcccg acttcaccag ccagagcatg 25980
ctgcagaact tcaggacctt catcctggag
cgctcgggca tcctgccggc cacctgctgc 26040
gcgctgccca gcgacttcgt gcccatcagg
tacagggagt gcccgccgcc gctctggggc 26100
cactgctacc tcttccagct ggccaactac
ctcgcctacc actcggatct catggaagac 26160
gtgagcggcg agggcctgct cgagtgccac
tgccgctgca acctgtgcac gccccaccgc 26220
tctctagtct gcaacccgca gctgctcagc
gagagtcaga ttatcggtac ctttgagctg 26280
cagggtccct cgcccgacga aaagtccgcg
gctccggggt tgaaactcac tccggggctg 26340
tggacttccg cctacctacg caaatttgta
cctgaagact accacgccca cgagatcagg 26400
ttttacgaag accaatcccg cccgcccaag
gcggagctca ccgcctgcgt cattacccag 26460
ggccacatcc tgggccaatt gcaagccatc
aacaaagccc gccaagagtt cttgctgaaa 26520
aagggtcggg gggtgtacct ggacccccag
tccggcgagg agctaaaccc gctacccccg 26580
ccgccgcccc agcagcggga ccttgcttcc
caggatggca cccagaaaga agcagccgcc 26640
gccgccgcca gcatacatgc ttctggagga
agaggaggac tgggacagtc aggcagagga 26700
ggtttcggac gaggacgagg aggaggagat
gatggaagac tgggaggagg acagcctaga 26760
cgaggaagct tcagaggccg aagaggtggc
agacgcaaca ccatcaccct cggccgcagc 26820
cccctcgccg gcgcccccga aatcctccga
ccccagcagc agcgctataa cctccgctcc 26880
tccggcgccg gcgcccaccc gcagcagacc
caaccgtaga tgggacacta caggaaccgg 26940
ggtcggtaag tccaagtgcc ccccagcgcc
gcccccgcaa caggagcaac agcagcagca 27000
gcggcgacag ggctaccgct cgtggcgcgg
acacaaaaac gccatagtcg cctgcttgca 27060
agactgcggg ggcaacatct ccttcgcccg
ccgcttcctg ctcttccacc acggggtggc 27120
ttttccccgc aatgtcctgc attactaccg
tcatctctac agcccctact gcggcggcag 27180
cggcgaccca gagggagcgg cggcagcagc
agcgccagcc acagcggcga ccacctagga 27240
agacctccgc gggcaagacg gcgggagccg
ggagacccgc ggcggcggcg gtagcggcgg 27300
cggcgggcgc actgcgcctc tcgcccaacg
aacccctctc gacccgggag ctcagacaca 27360
ggatcttccc cactctgtat gctatcttcc
agcagagcag aggccaggaa caggagctga 27420
aaataaaaaa cagatctctg cgctccctca
cccgcagctg tctgtatcac aaaagcgaag 27480
atcagcttcg gcgcacgctg gaggacgcgg
aggcactctt cagcaaatac tgcgcgctga 27540
ctcttaagga ctagccgcgc gcccttctcg
aatttaggcg ggagaaagac tacgtcatcg 27600
ccgaccgccg cccagcccac ccagccgaca
tgagcaaaga gattcccacg ccctacatgt 27660
ggagctacca gccgcagatg ggactcgcgg
cgggagcggc ccaagactac tccacccgca 27720
tgaactacat gagcgcgggg ccccacatga
tctcacgggt taatgggatc cgcgcccagc 27780
gaaaccaaat actgctggaa caggcggcca
taaccgccac accccgtcat gacctcaatc 27840
cccgaaattg gcccgccgcc ctcgtgtacc
aggaaacccc ctctgccacc accgtggtac 27900
ttccgcgtga cacccaggcc gaagtccaga
tgactaactc aggggcgcag ctcgcgggcg 27960
gctttcgtca cggggtgcgg ccgcaccggc
cgggtatatt acacctggcg atcagaggcc 28020
gaggtattca gctcaacgac gagtcggtga
gctcttcgct cggtctccgt ccggacggaa 28080
ccttccagat cgccggatca ggtcgctcct
cattcacgcc tcgccaggcg tatctgactc 28140
tgcagacctc ctcctcggag cctcgctccg
gcggcatcgg caccctccag ttcgtggagg 28200
agttcgtgcc ctcggtctac ttcaacccct
tctcgggacc tcccggacgc taccccgacc 28260
agttcatccc gaactttgac gcggtgaagg
actcggcgga cggctacgac tgaatgtcaa 28320
gtgctgaggc agagagcgtt cgcctgaaac
acctccagca ctgccgccgc ttcgcctgct 28380
tcgcccgcag ctccggtgag ttctgctact
ttcagctgcc cgaggagcat accgaggggc 28440
cggcgcacgg cgtccgccta accacccagg
gcgaggttac ctgtaccctt atccgggagt 28500
ttaccctccg tcccctgcta gtggagcggg
agcggggttc ttgtgtcata actatcgcct 28560
gcaactgccc taaccctgga ttacatcaag
atctttgttg tcacctgtgc gctgagtata 28620
ataaacgctg agatcagact ctactggggc
tcctgtcgcc atcctgtgaa cgccaccgtc 28680
ttcacccacc ccgagcagcc ccaggcgaac
ctcacctgcg gcctgcgtcg gagggccaag 28740
aagtacctca cctggtactt caacggcacc
ccctttgtgg tttacaacag cttcgaccag 28800
gacggagttg ccttgagaga cgacctttcc
ggtctcagct actccattca caagaacacc 28860
accctccacc tcttccctcc ctacctgccg
ggaacctacg agtgcgtcac cggccgctgc 28920
acccacctcc tccgcctgat cgtaaaccag
acctttccgg gaacacacct cttccccaga 28980
acaggaggtg agctcaggaa accccctggg
gcccagggcg gagacttacc ttcgaccctt 29040
gtggggttag gattttttat cgccgggttg
ctggctctcc tgatcaaagc ttccttcaga 29100
tttgttctct ccctttactt ttatgaacag
ctcaacttct aataacacta ccttttctca 29160
ggaatcgggt agtgacttct cttctgaaat
cgggctgggt gtgctgctta ctctgttgat 29220
ttttttcctt atcatactta gccttctgtg
cctcaggctc gccgcctgct gcgcacatat 29280
ctacatctac agccggttgc ttaactgctg
gggtcgccat ccaagatgaa cggggctcag 29340
gtgctatgtc tgctggccct ggtggcctgc
agtgccgccc tcaattttga ggaacccgct 29400
tgcaatgtga ctttcaagcc tgagggcgca
cattgcacca ctctggttaa atgtgtgacc 29460
tctcatgaaa aactgctcat cgcctacaaa
aacaaaacag gcgagttcgc ggtctatagt 29520
gtgtggcaac ccggagacca taataactac
tcagtcaccg tcttcgaggg tgcggagtct 29580
aagaaattcg attacacctt tcccttcgag
gagatgtgtg atgcggtcat gtacctgtcc 29640
aaacagcaca agctgtggcc ccccaccccc
gaggcgtgtg tggaaaacac tgggtctttc 29700
tgctgtctct ctctggcaat cactgtgctt
gctctaatct gcacgctgct atacatgaga 29760
ttcaggcaga ggcgaatctt tatcgatgag
aaaaaaatgc cttgatcgct aacaccggct 29820
ttctgtctgc agaatgaaag caatcacctc
cctactaatc agcaccaccc tccttgcgat 29880
tgcccatggg ttgacacgaa tcgaagtgcc
agtggggtcc aatgtcacca tggtgggccc 29940
cgccggcaat tcctccctga tgtgggaaaa
atatgtccgt aatcaatggg atcattactg 30000
ctctaatcga atctgtatca agcccagagc
catctgcgac gggcaaaatc taactttgat 30060
tgatgtgcaa atgacggatg ctgggtacta
ttacgggcag cggggagaaa tgattaatta 30120
ctggcgaccc cacaaggact acatgctgca
tgtagtcaag gcagtcccca ctactaccac 30180
ccccaccact accactccca ctactaccac
ccccactact accactagca ctgctactac 30240
cgctgcccgc aaagctatta cccgcaaaag
caccatgctt agcaccaagc cccattctca 30300
ctcccacgcc ggcgggccca ccggtgcggc
ctcagaaacc accgagcttt gcttctgcca 30360
atgcactaac gccagcgccc acgaactgtt
cgacctggag aatgaggacg atgaccagct 30420
gagctccgct tgcccggtcc cgctgcccgc
agagccggtc gccctgaagc agctcggtga 30480
tccatttaat gactctcctg tttatccctc
tcccgaatac ccgcccgact ctaccttcca 30540
catcacgggc accaacgacc ccaacctctc
cttctacctg atgctgctgc tttgtatctc 30600
tgtggtatct tccgcgctca tgttactggg
catgttctgc tgcctcatct gccgcagaaa 30660
gagaaagtct cgctctcagg gccaaccact
gatgcccttc ccctaccccc cagattttgc 30720
agataacaag atatgagcac gctgctgaca
ctaaccgctt tactcgcctg cgctctaacc 30780
cttgtcgctt gcgaatccag ataccacaat
gtcacagttg tgacaggaga aaatgttaca 30840
ttcaactcca cggccgacac ccagtggtcg
tggagcggcc acggtagcta tgtatacatc 30900
tgcaatagct ccacctcccc tagcatgtcc
tctcccaagt accactgcaa tgccagcctg 30960
ttcaccctca tcaacgcctc cacctcggac
aatggactct atgtaggcta tgtgacaccc 31020
ggtgggcggg gaaagaccca cgcctacaac
ctgcaagttc gccacccctc caccaccgcc 31080
accacctctg ccgcccctac ccgcagcagc
agcagcatca gcagcagcag cagcagcagc 31140
agattcctga ctttaatcct agccagctca
acaaccaccg ccaccgctga gaccacccac 31200
agctccgcgc ccgaaaccac ccacacccac
cacccagaga cgaccgcggc ctccagtgac 31260
cagatgtcgg ccaacatcac cgcctcgggt
cttgaacttg cttcaacccc caccccaaaa 31320
ccagtggatg cagccgacgt ctccgccctc
gtcaatgact gggcggggct gggaatgtgg 31380
tggttcgcca taggcatgat ggcgctctgc
ctgcttctgc tctggctcat ctgctgcctc 31440
aaccgcaggc gggccagacc catctataga
cccatcattg ttctcaaccc cgctgatgat 31500
gggatccata gattggatgg tctgaaaaac
ctacttttct cttttacagt atgataaatt 31560
gagacatgcc tcgcattttc atgtacttga
cacttctccc actttttctg gggtgttcta 31620
cgctggccgc cgtctctcac ctcgaggtag
actgcctcac acccttcact gtctacctga 31680
tttacggatt ggtcaccctc actctcatct
gcagcctaat cacagtagtc atcgccttca 31740
tccagtgcat tgactacatc tgtgtgcgcc
tcgcatacct gagacaccac ccgcagtacc 31800
gagacaggaa cattgcccaa ctcctaagac
tgctctaatc atgcataaga ctgtgatctg 31860
cctcctcatc ctcctctccc tgcccgctct
cgtctcatgc cagcccacca caaaacctcc 31920
acgaaaaaga catgcctcct gtcgcttgag
ccaactgtgg aatattccca aatgctacaa 31980
tgaaaagagc gagctttccg aagcctggct
atatgcggtc atgtgtgtcc ttgtcttctg 32040
cagcacaatc tttgccctca tgatctaccc
ccactttgat ttgggatgga atgcggtcga 32100
tgccatgaat taccctacct ttcccgcgcc
cgatatgatt ccactccgac aggttgtggt 32160
gcccgtcgcc ctcaatcaac gccccccatc
ccctacaccc actgaggtca gctactttaa 32220
tctaacaggc ggagatgact gacactctag
atctagaaat ggacggcatc ggcaccgagc 32280
agcgtctcct acagaggcgc aagcaggcgg
ctgaacaaga gcgcctcaat caggagctcc 32340
gagatctcat taacctgcac cagtgcaaaa
aaggcatctt ttgcctggtc aagcaggccg 32400
atgtcaccta cgagaaaacc ggtaacagcc
accgcctcag ctacaagctg cccacccaac 32460
gccagaagtt ggtgctcatg gtgggtcaga
atcccatcac cgtcacccag cactcggtgg 32520
agaccgaggg gtgtctgcac tccccctgtc
agggtccgga agacctctgc accctggtaa 32580
agaccctgtg tggtcttaga gatttaatcc
cctttaacta atcaaacact ggaatcaata 32640
aaaagaatca cttactttaa atcagtcagc
aggtctctgt ccactttatt cagcagcacc 32700
tccttcccct cctcccaact ctggtactcc
aaacgcctcc tggcggcaaa cttcctccac 32760
accctgaagg gaatgtcaga ttcttgctcc
tgtccctccg cacccactat cttcatgttg 32820
ttgcagatga agcgcgccaa aacgtctgac
gagaccttca accccgtgta cccctatgac 32880
acggaaaacg ggcctccctc cgtccctttc
ctcacccctc ccttcgtgtc ccccgacgga 32940
tttcaagaaa gccccccagg ggtcctgtct
ctgcgcctgt cagagcccct ggtcacttcc 33000
cacggcatgc ttgccctgaa aatgggaaat
ggcctctccc tggatgacgc cggcaacctc 33060
acctctcaag atgtcaccac cgtcacccct
cccctcaaaa aaaccaagac caacctcagc 33120
ctccagacct cagcccccct gaccgttagc
tctgggtccc tcaccgtcgc ggccgccgct 33180
ccactggcgg tggccggcac ctctctcacc
atgcaatctc aggccccctt gacagtgcaa 33240
gatgcaaaac tcggcctggc cacccaggga
cccctgaccg tgtctgaagg caaactcacc 33300
ttgcagacat cggctccact gacggccgct
gacagcagca ctctcactgt tagtgccaca 33360
cctcccctca gcacaagcaa tggtagtttg
agcattgaca tgcaggcccc gatttatacc 33420
accaatggaa aactggcact taacattggt
gctcccctgc atgtggtaga caccctaaat 33480
gcactaactg tagtaactgg ccagggtctt
accataaatg gaagagccct gcaaactaga 33540
gtcacgggtg ccctcagtta tgacacagaa
ggcaacatcc aactgcaagc cggagggggt 33600
atgcgcattg acaataatgg ccaacttatc
cttaatgtag cttatccatt tgatgctcaa 33660
aacaacctca gccttagact tggccaaggt
cccctaattg ttaactctgc ccacaacttg 33720
gatcttaacc ttaacagagg cctttactta
tttacatctg gaaacacgaa aaaactggaa 33780
gttaacataa aaacagccaa aggtctattt
tacgatggca ccgctatagc aatcaatgca 33840
ggtgacgggc tacagtttgg gtctggttca
gatacaaatc cattgcaaac taaacttgga 33900
ttggggctgg aatatgactc caacaaagct
ataatcacta aacttggaac tggcctaagc 33960
tttgacaaca caggtgccat cacagtaggc
aacaaaaatg atgacaagct taccttgtgg 34020
accacaccag acccctcccc aaactgcaga
attaattcag aaaaagatgc taaactcaca 34080
ctagttttga ctaaatgcgg cagccaggtg
ttagccagcg tttctgtttt atctgtaaaa 34140
ggcagccttg cccccatcag cggcacagta
actagcgccc agattgtttt aagatttgat 34200
gaaaacggag ttttattgag caattcttct
cttgaccccc aatactggaa ctatagaaaa 34260
ggcgattcta cagaaggcac tgcatatact
aatgctgtgg gatttatgcc caacctcaca 34320
gcatacccta aaacacagag ccagactgct
aaaagcaaca ttgtaagtca agtttacttg 34380
aatggggaca aaacaaaacc catgacccta
accatcaccc tcaatggaac taatgaaaca 34440
ggggatgcta cagtaagcac atactccatg
tcattttcat ggaactggaa tggaagtaat 34500
tacattaatg acaccttcca aaccaactcc
tttaccttct cctacatcgc ccaagaataa 34560
aaaagcatga cgctttgttc tctgattcag
tgtgtttctt ttattttttt ttcaattaca 34620
acagaatcat tcaagtcatt ctccatttag
cttaatagac ccagtagtgc aaagccccat 34680
actagcttat ttcagacagt ataaattaaa
ccataccttt tgatttcaat attaaaaaaa 34740
tcatcacagg atcctagtcg tcaggccgcc
ccctccctgc caagacacag aatacacaat 34800
cctctccccc cggctggctt taaacaacac
catctggttg gtgacagaca ggttcttcgg 34860
ggttatattc cacacggtct cctggcgggc
caggcgctcg tcggtgatgc tgataaactc 34920
tcccggcagc tcgctcaagt tcacgtcgct
gtccagcggc tgaacctcat gctgacgcgg 34980
taactgcgcg accggctgct gaacaaacgg
aggccgcgcc tacaaggggg tagagtcata 35040
atcctccgtc aggatagggc ggttatgcag
cagcagcgag cgaatcatct gctgccgccg 35100
ccgctccgtc cggcaggaaa acaacatccc
ggtggtctcc tccgctataa tccgcaccgc 35160
ccgcagcata agcctcctcg ttctccgcgc
gcagcaccgc accctgatct cgctcaggtt 35220
ggcgcagtag gtacagcaca tcaccacgat
gttattcatg atcccacagt gcaaggcgct 35280
gtatccaaag ctcatgcccg ggaccaccgc
ccccacgtga ccgtcgtacc agaagcgcag 35340
gtaaatcaag tgacgacccc tcatgaacgt
gctggacata aacatcacct ccttgggcat 35400
gttgtaattc accacctccc ggtaccagat
gaatctctga ttgaacacgg ccccttccac 35460
caccatcctg aaccaagagg ctaggacctg
cccaccggct atgcactgca gggaacccgg 35520
gttggaacaa tgacaatgca gactccaggg
ctcgtaaccg tggatcatcc ggctgctgaa 35580
gacatcgatg ttggcgcaac acagacacac
gtgcatacac ttcctcatga ttagcagctc 35640
ctccctcgtc aggatcatat cccaagggat
aacccattct tgaatcaacg taaagcccac 35700
agagcaggga aggcctcgca cataactcac
gttgtgcatg gtcagcgtgt tgcattccgg 35760
aaacagcgga tgatcctcca gtatcgaggc
gcgggtctcg ttctcacagg gaggtaaagg 35820
ggccctgctg tacggactgt ggcgggacga
ccgagatcgt gttgagcgta acgtcatgga 35880
aaagggaacg ccggacgtgg tcatacttct
tgaagcagaa ccaggctcgc gcgtgacaga 35940
cctccttgcg tctacggtct cgccgcttag
ctcgctccgt gtgatagttg tagtacagcc 36000
actctctcaa agcgtcgagg cgacacctgg
cgtcaggatg tatgtagact ccgtcttgca 36060
ccgcggccct gataatatcc accaccgtag
aataagccac accaagccaa gcaatacact 36120
cgctttgcga gcggcagaca ggaggagcgg
ggagagacgg aaggaccatc ataaaatttt 36180
aaagaatatt ttccaatatt tcgaaatcaa
gatctaccaa atggcagcgc tcccctccac 36240
tggcgcggtc aaactctacg gccaaagaac
agataacggc atttttaaga tgttcccgga 36300
cggcgtctaa aagacaaacc gctctcaagt
cgacataaat tataagccaa aagccatcgg 36360
gttcaagatc cactatggac gcgccggcgg
cgtccaccaa acccaaataa ttttcttctc 36420
tccagcgctg caaaatccca gtaagcaact
ccctgatatt aagatgaacc atgccaaaaa 36480
tctgttcaag agcgccctcc accttcattc
tcaagcagcg catcatgatt gcaaaaattc 36540
aggttcctca gacacctgta tgagattcaa
aacgggaata ttaacaaaaa ttcctctgtc 36600
gcgcagatcc cttcgcaggg caagctgaac
ataatcagac aggtctgaac gaaccagcga 36660
ggccaaatcc ccgccaggaa ccagatccag
agaccctatg ctgattatga cgcgcatact 36720
cggggctatg ctaaccagcg tagcgccgat
gtaggcgtgc tgcatgggcg gcgaaataaa 36780
atgcaaggtg ctggttaaaa aatcaggcaa
agcctcgcgc aaaaaagcta agacatcata 36840
atcatgctca tgcaggtagt tgcaggtaag
ctcaggaacc aaaacggaat aacacacgat 36900
tttcctctca aacatgactt ccaggtgact
gcataagaaa aaaattataa ataataaata 36960
ttaattaaat aaattaaaca ttggaagcct
gtctcacaac aggaaaaacc actctgatca 37020
acataagacg ggccacgggc atgcccgcgt
gaccataaaa aaatcggtct ccgtgattac 37080
aaagcaccac agatagctcc ccggtcatgt
cgggggtcat catgtgagac tgtgtataca 37140
cgtccgggct gttgacatcg gtcaaagaaa
gaaatcgagc tacatagccc ggaggaatca 37200
acacccgcac gcggaggtac agcaaaacgg
tccccatagg aggaatcaca aaattagtag 37260
gagaaaaaaa aacataaaca ccagaaaaac
cctcttgccg aggcaaaaca gcgccctccc 37320
gttccaaaac aacataaagc gcttccacag
gagcagccat gacaaagacc cgagtcttac 37380
caggaaaatt ttaaaaaaga ttcctcaacg
cagcaccagc accaacacct gtcagtgtaa 37440
aatgccaagc gccgagcgag tatatatagg
aataaaaagt gacgtaaacg gttaaagtcc 37500
agaaaacgcc cagaaaaacc gcacgcgaac
ctacgccccg aaacgaaagc caaaaaacag 37560
tgaacacgcc ctttcggcgt caacttccgg
tttcccacgg tacgtcactt ccgcatataa 37620
gaaaactacg ctacccaaca tgcaagaagc
cacgccccaa aaaacgtcac acctcccggc 37680
ccgccccgcg ccgccgctcc tccccgcccc
gccccgctcc gcccacctca ttatcatatt 37740
ggcttcaatc caaaataagg tatattattg
atgatg 37776
<210> 63
<211> 37713
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 63
catcatcaat aatatacctt attttggatt
gaagccaata tgataatgag gtgggcggag 60
cggggcgggg cggggaggag cggcggcgcg
gggcgggccg ggaggtgtgg cggaagttga 120
gtttgtaagt gtggcggatg tgacttgcta
gcgccggatg tggtaaaagt gacgtttttg 180
gagtgcgaca acgcccacgg gaagtgacat
ttttcccgcg gtttttaccg gatgtcgtag 240
tgaatttggg cgttaccaag taagatttgg
ccattttcgc gggaaaactg aaatggggaa 300
gtgaaatctg attaatttcg cgttagtcat
accgcgtaat atttgccgag ggccgaggga 360
ctttgaccga ttacgtggag gaatcgccca
ggtgtttttg aggtgaattt ccgcgttccg 420
ggtcaaagtc tccgttttat tattatagtc
agctgacgcg gagtgtattt atacccgctg 480
atctcgtcaa gaggccactc ttgagtgcca
gcgagtagag ttttctcctc tgccgctccg 540
ctccgctctg acaccggggg aaaaaaatga
gacatttcac ctacgatggc ggtgtgctta 600
ccggccagct ggctgcctcg gtcctggacg
ccctgattga ggacgtattg gccgacaatt 660
atcctcctcc agctcatttt gagccaccta
ctcttcacga actgtatgat ttggacgtgg 720
tggcacctag cgacccgaac gagcaggcgg
tttccagttt ttttcctgac tctatgctgt 780
tggccagcca ggagggggtc gagctcgaga
cccctcctcc aatcgccgtt tctcctgagc 840
ctccgaccct gaccaggcag cccgatcgcc
gtgttggacc tgcgactatg ccccatctgc 900
tgcccgaggt gatcgatctc acctgtaacg
agtctggttt tccacccagc gaggatgagg 960
acgaagaggg tgagcagttt gtgttagatt
ctgtggagga acccgggcgc ggttgcagat 1020
cttgtcaata ccatcggaaa aatacaggag
acccccaaat tatgtgttcc ctgtgttata 1080
tgaagacgac ctgtatgttt atttacagta
agtttgtgat tggtgggtcg gtgggctgta 1140
gtgtgggtaa gtggtctgtg gttttttttt
tttaatatca gcttgggcta aaaaactgct 1200
atggtaattt ttttaaggtc cggtgtctga
acctgagcag gaagctgaac cggagcctga 1260
gagtcgcccc aggagaaggc ctgcaattct
aactagaccg agtgcacctg tagcgaggga 1320
cctcagcagt gcagagacca ccgattctgg
tccttcctca tcccctccag agattcatcc 1380
cgtggtgcct ttgtgtcccc tcaagcccgt
tgccgtgaga gttagtgggc ggagggccgc 1440
cgtggagagc attgaggact tgcttaatga
gacacaggaa cctttggact tgagctgtaa 1500
acgccctagg caataaacct gcttacctgg
actgaatgag ttgacgccta tgtttgcttt 1560
tgaatgactt aatgtgtata taataaagag
tgagataatg tttaattgca tggtgtgttt 1620
gattggggcg gggtttgttg ggtatataag
cttccctggg ctaaacttgg ttacacttga 1680
cctcatggag gcctgggagt gtttagagag
ctttgccgaa gtgcgtgcct tgctggaaga 1740
gagctctaat aatacctctg ggtggtggag
gtatttttgg ggctctcccc aggctaagtt 1800
agtttgtaga atcaaggagg attacaagtg
ggaatttgaa cagcttttga aatcctgtgg 1860
tgagctcttg gattctttga atctgggcca
ccaggctctt ttccaggaca agatcatcag 1920
gactttggat ttttccacac cggggcgcat
tgctgccggg gttgcttttc tagctttttt 1980
gaaggataaa tggagcgaag agacccactt
gagttcggga tacgtcctgg attttctggc 2040
catacaactg tggagagcat ggatcaggca
caagaacaga atgcaactgt tgtcttccgt 2100
ccgtccgttg ctgattcagc cggaggagca
gcagaccggg ccggaggacc gggctcgtct 2160
ggaaccagaa gagagggcac cggagaggag
cgcgtggaac ctgggagccg gcctgaacgg 2220
ccatccacat cgggagtgaa tgttggacag
gtggcggatc tctttccaga actgcgacga 2280
atcttaacta tcagggagga tggacaattt
gttaaggggc ttaagaggga gcggggggct 2340
tctgaacata acgaggaggc cagtaattta
gcttttagtc tgatgaccag acaccgtccc 2400
gagtgcatta cttttcagca gattaaggat
aattgtgcca atgagttaga tctgctgggt 2460
cagaagtaca gcatagagca gttgaccact
tactggctgc agccgggtga tgatctggag 2520
gaagctatta gggtgtatgc caaggtggcc
ctgaggcccg attgcaagta caagctcaag 2580
gggctggtga atatcaggaa ttgttgctac
atttctggga acggggcgga ggtggagata 2640
gagaccgatg acagggtggc ctttaggtgt
agcatgatga atatgtggcc tggggtgctg 2700
ggcatggacg gggtggtgat tatgaatgtg
aggttcacgg ggcccaattt taatggcacg 2760
gtgttcctgg gcaacaccaa cttggtgctg
cacggggtga gcttctatgg ctttaacaac 2820
acctgtgtgg aggcctggac cgatgtgaag
gtccgtggct gtgccttcta cggatgttgg 2880
aaggcggtag tgtgtcgccc caagagcagg
agttccatta aaaaatgctt gtttgagagg 2940
tgcaccctgg gggtgctggc ggagggcaac
tgtcgggtgc gccacaatgt ggcctcagaa 3000
tgcggttgct tcatgctagt caagagcgtg
gcggtcatca agcataacat ggtgtgcggc 3060
aacagcgagg acaaggcctc gcagatgctg
acctgctcgg atggcaactg ccacttactg 3120
aagaccgtac atataaccag ccacagccgc
aaggcctggc ccgtgttcga gcacaacgtg 3180
ttgacccgct gctctttgca tctgggcaac
aggaggggtg tgttcctgcc ctatcaatgc 3240
aacttgagcc acaccaagat cttgctagag
cccgaaagca tgtccaaggt gaacctgaac 3300
ggggtgtttg acatgaccct gaagatatgg
aaggtgctga ggtacgacga gaccaggtct 3360
cgatgcaggc cctgcgagtg cgggggcaag
catatgagga accagcctgt gatgctggat 3420
gtgaccgagg agctgaggcc tgaccacttg
gttctggcct gcaccagggc cgagtttggt 3480
tctagcgatg aagacacaga ctgaggtggg
tgagtgggcg tggtctgggg gtgggaagca 3540
atatataagt tgggggtctt agggtctctg
tgtctgtttt gcagagggac cgccggcgcc 3600
atgagcggga gcagtagcag caacgccttg
gatggcagca tcgtgagccc ttatttgacg 3660
acgcgcatgc cccactgggc cggggtgcgt
cagaatgtga tgggctccag catcgacgga 3720
cgacccgtgc tgcccgcaaa ttccgccacg
ctgacctacg cgaccgtcgc ggggaccccg 3780
ttggacgcca ccgccgccgc cgccgccacc
gccgccgcct cggccgtgcg cagcctggcc 3840
acggactttg cattcttggg acccttggcc
accggggcgg ccgcccgtgc cgccgttcgc 3900
gatgacaagc tgaccgccct gctggcgcag
ttggatgcgc ttacccggga actgggtgac 3960
ctttcgcagc aggtcgtggc cctgcgccag
caggtctccg ccctgcaggc tagcgggaat 4020
gcttctcctg caaatgccgt ttaagataaa
taaaaccaga ctctgtttgg attaaagaaa 4080
agtagcaagt gcattgctct ctttatttca
taattttccg cgcgcgatag gcccgagtcc 4140
agcgttctcg gtcgttgagg gtgcggtgta
tcttctccag gacgtggtag aggtggctct 4200
ggacgttgag atacatgggc atgagcccgt
cccgggggtg gaggtagcac cactgcagag 4260
cttcatgctc cggggtggtg ttgtagatga
tccagtcgta gcaggagcgc tgggcatggt 4320
gcctaaaaat gtccttaagc agcaggccga
tggccagggg gaggcccttg gtgtaagtgt 4380
ttacaaaacg gttgagttgg gaagggtgca
tgcggggtga gatgatgtgc atcttagatt 4440
gtatttttag attggcgatg tttcctccca
gatcccttct gggattcatg ttgtggagga 4500
ccaccagcac agtatatccg gtgcacttgg
gaaatttgtc atgcagctta gagggaaatg 4560
cgtggaagaa cttggagacg cccttgtggc
ctcccagatt ctccatgcat tcgtccatga 4620
tgatggcaat gggcccgcgg gaggcggcct
gggcaaagat gtttctgggg tcactgacat 4680
cgtagttgtg ttccagggtg agatcgtcat
aggccatttt tataaagcgc gggcggaggg 4740
tgcccgactg ggggatgatg gttccctcgg
gccccggggc gtagttgcct tcgcagatct 4800
gcatttccca ggccttaatc tctgaggggg
gaatcatatc cacttgcggg gcgatgaaga 4860
aaacggtttc cggagccggg gagattaact
gggatgagag caggtttctc agcagctgtg 4920
actttccaca gccggtgggt ccataaataa
cacctataac cggctgcagc tggtagttga 4980
gcgagctgca gctgccgtcg tcccggagga
ggggggccac ctcattgagc atgtcccgga 5040
cgcgcttgtt ctcctcgacc aggtccgcca
gaaggcgctc gccgcccagg gacagcagct 5100
cttgcaagga agcaaagttt ttcagcggtt
tgaggccgtc cgccgtgggc atgtttttca 5160
gggtctggcc gagcagctcc aggcggtccc
agagctcggt gacgtgctct acggcatctc 5220
tatccagcat atctcctcgt ttcgcgggtt
ggggcggctt tcgctgtagg gcaccaggcg 5280
atggtcgtcc agcgcggcca gagtcatgtc
cttccatggg cgcagggtcc tcgtcagggt 5340
ggtctgggtc acggtgaagg ggtgcgcccc
gggctgggcg ctggccaggg tgcgcttgag 5400
actggtcctg ctggtgctga agcgctgccg
gtcttcgccc tgcgcgtcgg ccaggtagca 5460
tttgaccatg gtgtcgtagt ccagcccctc
cgcggcgtgt cccttggcgc gcagcttgcc 5520
cttggaggtg gcgccgcacg cggggcactg
caggctcttg agcgcgtaga gcttgggggc 5580
gaggaagacc gattcggggg agtaggcgtc
cgcgccgcag gccccgcaca cggtctcgca 5640
ctccaccagc caggtgagct cggggcgctc
ggggtcaaaa accaggtttc ccccatgctt 5700
tttgatgcgt ttcttacctc gggtctccat
gaggcggtgt ccccgttcgg tgacgaagag 5760
gctgtccgtg tctccgtaga ccgacttgag
gggtctgtcc tccagggggg tccctcggtc 5820
ctcttcgtag agaaactcgg accactctga
gacaaaggcc cgcgtccagg ccaggacgaa 5880
ggaggccagg tgggaggggt accggtcgtt
gtccactagg gggtccacct tctccaaggt 5940
gtgaagacac atgtcgccct cctcggcgtc
caggaaggtg attggcttgt aggtgtaggc 6000
cacgtgaccc ggggttccgg acgggggggt
ataaaagggg gtgggggcgc gctcgtcctc 6060
actctcttcc gcatcgctgt ctgcgagggc
cagctgctgg ggtgagtatt ccctctcgaa 6120
ggcgggcatg acctcagcgc tgaggctgtc
agtttctaaa aacgaggagg atttgatgtt 6180
cacctgtccc gagctgatgc ctttgagggt
gcccgcgtcc atctggtcag aaaacacgat 6240
ctttttattg tccagcttgg tggcgaacga
cccgtagagg gcgttggaga gcagcttggc 6300
gatggagcgc agggtctgat tcttgtcccg
gtcggcgcgc tccttggccg cgatgttgag 6360
ctgcacgtac tcgcgcgcga cgcagcgcca
ctcggggaag acggtggtgc gctcgtcggg 6420
caccaggcgc acgcgccagc cgcggttgtg
cagggtgacg aggtccacgc tggtggcgac 6480
ctcgccgcgc aggcgctcgt tggtccagca
gaggcgcccg cccttgcgcg agcagaaggg 6540
gggcaggggg tcgagttggg tttcgtccgg
ggggtccgcg tccaccgtga agaccccggg 6600
gcgcaggcgc gcgtcgaagt agtcgatctt
gcatccttgc aagtccagcg cccgctgcca 6660
gtcgcgggcg gcgagcgcgc gctcgtaggg
gttgagcggc gggccccagg gcatggggtg 6720
ggtgagcgcg gaggcgtaca tgccgcagat
gtcatagacg tagaggggct cccggaggat 6780
gcccaggtag gtggggtagc agcggccgcc
gcggatgctg gcgcgcacgt agtcgtagag 6840
ctcgtgcgag ggggcgagga ggtcggggcc
caggttggtg cgggcggggc gctccgcgcg 6900
gaagacgatc tgcctgaaga tggcatgcga
gttggaagag atggtggggc gctggaagac 6960
gttgaagctg gcgtcctgca ggccgacggc
gtcgcgcacg aaggaggcgt aggactcgcg 7020
cagcttgtgc accagctcgg cggtgacctg
cacgtcgagc gcgcagtagt cgagggtctc 7080
gcggatgatg tcatacttag cctgcccctt
ctttttccac agctcgcggt tgaggacgaa 7140
ctcttcgcgg tctttccagt actcttggat
cgggaaaccg tccggctccg aacggtaaga 7200
gcccagcatg tagaactggt tgacggcctg
gtaggcgcag cagcccttct ccacgggcag 7260
ggcgtaggcc tgcgcggcct tgcggagcga
ggtgtgggtc agggcgaagg tgtccctgac 7320
catgaccttg aggtactggt gtttgaagtc
ggagtcgtcg cagccgcccc gctcccagag 7380
cgagaagtcg gtgcgctttt tggagcgggg
gttgggcagc gcgaaggtga catcgttgta 7440
gaggatcttg cccgcgcgag gcatgaagtt
gcgggtgatg cggaagggcc ccggcacttc 7500
cgagcggttg ttgatgacct gggcggcgag
cacgatctcg tcgaagccgt tgatgttgtg 7560
gcccacgatg tagagttcca ggaagcgggg
ccggcccttg acgctgggca gcttctttag 7620
ctcttcgtag gtgagctcct cgggcgaggc
gaggccgtgc tcggccaggg cccagtccgc 7680
caggtgcggg ttgtccgcga ggaaggaccg
ccagaggtcg cgggccagga gggtctgcag 7740
gcggtccctg aaggtcctga actggcggcc
tacggccatc ttttcggggg tgacgcagta 7800
gaaggtgagg gggtcttgct gccaggggtc
ccagtcgagc tccagggcga ggtcgcgcgc 7860
ggcggcgacc aggcgctcgt cgcccccgaa
tttcatgacc agcatgaagg gcacgagctg 7920
ctttccgaag gcgcccatcc aagtgtaggt
ctctacatcg taggtgacaa agagacgttc 7980
cgtgcgagga tgcgagccga tcgggaagaa
ctggatctcc cgccaccagt tggaggagtg 8040
gctgttgatg tggtgaaagt agaagtcccg
tcggcgggcc gagcactcgt gctggctttt 8100
gtaaaagcga gcgcagtact ggcagcgctg
cacgggctgt acctcttgca cgagatgcac 8160
ctgccgaccg cggacgagga agctgagtgg
gaatctgagc cccccgcatg gctcgcggcc 8220
tggctggtgc tcttctactt tggatgcgtg
gccgtcaccg tctggctcct cgaggggtgt 8280
tacggtggag cggatcacca cgccgcgcga
gccgcaggtc cagatatcgg cgcgcggcgg 8340
tcggagtttg atgacgacat cgcgcagctg
ggagctgtcc atggtctgga gctcccgcgg 8400
cggcggcagg tcagccggga gttcttgcag
gtttacctcg cagagacggg ccagggcgcg 8460
gggcaggtcc aggtggtact tgaattcgag
aggcgtgttg gtggcggcgt cgatggcttg 8520
cagtatgccg cagccccggg gcgcgacgac
ggtgccccgc ggggcggtga agctcccgcc 8580
gccgctcctg ctgtcgccgc cggtggcggg
gcttagaagc ggtgccgcgg tcgggccccc 8640
ggaggtaggg ggggctccgg tcccgcgggc
aggggcggca gcggcacgtc ggcgccgcgc 8700
gcgggcagga gctggtgctg cgcccggagg
ttgctggcga aggcgacgac gcggcggttg 8760
atctcctgga tctggcgcct ctgcgtgaag
acgacgggtc cggtgagctt gaacctgaaa 8820
gagagttcga cagaatcaat ctcggtgtca
ttgaccgcga cctggcgcag gatctcctgc 8880
acgtcgcccg agttgtcttg gtaggcgatc
tcggccatga actgttcaat ctcttcctcc 8940
tggaggtctc cgcgtccggc gcgctccacg
gtggccgcca ggtcgttgga gatgcgcgcc 9000
atgagctgcg agaaggcgtt gagtccgccc
tcgttccaca ctcggctgta gaccacgccg 9060
ccctggtcgt cgcgggcgcg catgaccacc
tgcgcgaggt tgagttccac gtggcgcgca 9120
aagacggcgt agttgcgcag gcgctggaag
aggtagttga gggtggtggc ggtgtgctcg 9180
gccacaaaga agtacatgac ccagcggcgc
aacgtggatt cgttgatgtc ccccaaggcc 9240
tccagtcgct ccatggcctc gtagaagtcc
acggcgaagt tgaaaaactg ggagttgcgc 9300
gccgacacgg tcaactcctc ctccagaaga
cggatgagct cggcgacggt gtcgcgcacc 9360
tcgcgctcga aggctatggg aatctcttcc
tccgccagca tcaccacctc ttcctcttct 9420
tcctcctctg gcacttccat gatggcttcc
tcctcttcgg ggggtggcgg cgggggaggg 9480
ggcgctcggc gccggcggcg gcgcaccggg
aggcggtcca cgaagcgctc gatcatctcc 9540
ccgcggcggc gacgcatggt ctcggtgacg
gcgcggccgt tctctcgggg acgcagctgg 9600
aagacgccgc cggtcatctg gtgctggggc
gggtggccgt ggggcagcga gaccgcgctg 9660
acgatgcatc ttaacaattg ctgcgtaggt
acgccgccga gggacctgag ggagtccaga 9720
tccaccggat ccgaaaacct ttcgaggaag
gcatctaacc agtcgcagtc gcaaggtagg 9780
ctgagcaccg tggcgggcgg cggggggtgg
ggggagtgtc tggcggaggt gctgctgatg 9840
atgtaattga agtaggcggt cttgacacgg
cggatggtcg acaggagcac catgtctttg 9900
ggcccggcct gctggatgcg gaggcggtcg
gccatgcccc aggcttcgtt ctggcatctg 9960
cgcaggtctt tgtagtagtc ttgcatgagc
ctttccaccg gcacctcttc tccttcttct 10020
tctgacatct ctgctgcatc tgcggccctg
gggcgacggc gcgcgcccct gccccccatg 10080
cgcgtcaccc cgaaccccct gagcggctgg
agcagggcca ggtcggcgac gacgcgctcg 10140
gccaggatgg cctgctggac ctgcgtgagg
gtggtttgga agtcatccaa gtccacgaag 10200
cggtggtagg cgcccgtgtt gatggtgtag
gtgcagttgg ccatgacgga ccagttgacg 10260
gtctggtggc ccggttgcgt catctcggtg
tacctgaggc gcgagtaggc gcgcgagtcg 10320
aagatgtagt cgttgcaagt ccgcaccagg
tactggtagc ccaccaggaa gtgcggcggc 10380
ggctggcggt agaggggcca gcggagggtg
gcgggggctc cgggggccag gtcttccagc 10440
atgaggcggt ggtattcgta gatgtacctg
gacatccagg tgatgcccgc ggcggtggtg 10500
gaggcgcgcg ggaagtcgcg cacccggttc
cagatgttgc gcagcggcag aaagtgctcc 10560
atggtaggcg tgctctggcc ggtcaggcgc
gcgcagtcgt tgatactcta gaccagggaa 10620
aacgaaagcc ggtcagcggg cactcttccg
tggtctggtg gataaattcg caagggtatc 10680
atggcggagg gcctcggttc gagccccggg
cccgggccgg acggtccgcc atgatccacg 10740
cggttaccgc ccgcgtgtcg aacccaggtg
gcgacgtcag acaacggtgg agtgttcctt 10800
ttgggttttt ttccaaattt ttctggccgg
gcgccgacgc cgccgcgtaa gagactagag 10860
tgcaaaagcg aaagcagtaa gtggctcgct
ccctgtagcc cggaggatcc ttgctaaggg 10920
ttgcgttgcg gcgaaccccg gttcgagtct
ggctctcgct gggccgctcg ggtcggccgg 10980
aaccgcggct aaggcgggat tggcctcccc
ctcattaaag accccgcttg cggattcctc 11040
cggacacagg ggacgagccc ctttttactt
ttgcttttct cagatgcatc cggtgctgcg 11100
gcagatgcgc cccccgcccc agcagcagca
gcagcaacat cagcaagagc ggcaccagca 11160
gcagcgggag tcatgcaggg ccccctcgcc
cacgctcggc ggtccggcga cctcggcgtc 11220
cgcggccgtg tctggagccg gcggcggtgg
gctggcggac gacccggagg agcccccgcg 11280
gcgcagggcc agacagtacc tggacctgga
ggagggcgag ggcctggcgc gactgggggc 11340
gccgtccccc gagcgccacc cgcgggtgca
gctgaagcgc gactcgcgcg aggcgtacgt 11400
gcctcggcag aacctgttca gagaccgcgc
gggcgaggag cccgaggaga tgcgggaccg 11460
caggttcgcc gcggggcggg agctgcggca
ggggctgaac cgggagcggc tgctgcgcga 11520
ggaggacttt gagcccgacg cgcggacggg
gatcagcccc gcgcgcgcgc acgtggcggc 11580
cgccgacctg gtgacggcgt acgagcagac
ggtgaaccag gagatcaact tccaaaaaag 11640
cttcaacaac cacgtgcgca cgctggtggc
gcgcgaggag gtgaccatcg gcctgatgca 11700
cctgtgggac tttgtgagcg cgctggagca
gaaccccaac agcaagcctc tgacggcgca 11760
gctgttcctg atagtgcagc acagcaggga
caacgaggcg ttcagggacg cgctgctgaa 11820
catcaccgag cccgagggtc ggtggctgct
ggacctgatt aacatcttgc agagcatagt 11880
ggtgcaggag cgcagcctga gcctggccga
caaggtggcg gccatcaatt actcgatgct 11940
cagtctgggc aagttttacg cgcgcaagat
ctaccagacg ccgtacgtgc ccatagacaa 12000
ggaggtgaag atcgacggct tctacatgcg
catggcgctg aaggtgctga ccctgagcga 12060
cgacctgggc gtgtaccgca acgagcgcat
ccacaaggcc gtgagcgtga gccggcggcg 12120
cgagctgagc gaccgcgagc tgatgcacag
cctgcagcgg gcgctggcgg gggccggcag 12180
cggcgacagg gaggccgagt cctacttcga
ggcgggggcg gacctgcgct gggtgcccag 12240
ccggagggcc ctggaggccg cgggggcccg
ccgcgaggac tatgcagacg aggaggagga 12300
ggatgacgag gagtacgagc tagaggaggg
cgagtacctg gactaaaccg caggtggtgt 12360
ttttggtaga tgcaagaccc gaacgtggtg
gacccggcgc tgcgggcggc tctgcagagc 12420
cagccgtccg gccttaactc tacagacgac
tggcgacagg tcatggaccg catcatgtcg 12480
ctgacggcgc gcaatccgga cgcgttccgg
cagcagccgc aggccaacag gctctccgcc 12540
atcttggagg cggtggtgcc tgcgcgcgcg
aaccccacgc acgagaaggt gctggccata 12600
gtgaacgcgc tggccgagaa cagggccatc
cgcccggacg aggccgggct ggtgtacgac 12660
gcgctgctgc agcgcgtggc ccgctacaac
agcggcaacg tgcagaccaa cctggaccgg 12720
ctggtggggg acgtgcgcga ggcggtggcg
cagcgggagc gcgcggagcg gcagggaaac 12780
ctgggctcca tggtggcgct gaacgccttc
ctgagcacgc agccggccaa cgtgccgcgg 12840
gggcaggagg actacaccaa ctttgtgagc
gcgctgcggc tgatggtgac cgagaccccc 12900
cagagcgagg tgtaccagtc ggggccggac
tactttttcc agaccagcag acagggcctg 12960
cagacggtga acctgagcca ggctttcaag
aacctgcggg ggctgtgggg cgtgaaggcg 13020
cccaccgggg accgggcgac ggtgtccagc
ctgctgacgc ccaactcgcg cctgctgctg 13080
ctgctgatcg cgccgttcac ggacagcggc
agcgtgtccc gggagaccta cctcgggcac 13140
ctgctgacgc tgtaccgcga ggccatcggg
cagacccagg tggacgagca caccttccag 13200
gagatcacca gcgtgagccg cgcgctgggg
caggaggaca cgggcagcct ggaggcgacc 13260
ctgaactacc tgctgaccaa ccggcggcag
aagatcccct cgctgcatag tttgaccacc 13320
gaggaggagc gcatcctgcg ctacgtgcag
cagagcgtga gcctgaacct gatgcgcgac 13380
ggggtgacgc ccagcgtggc gctggacatg
accgcgcgca acatggaacc gggcatgtac 13440
gccgcgcatc ggccttacat caaccgcctg
atggactact tgcatcgcgc ggcggccgtg 13500
aaccccgagt acttcaccaa cgccatcctg
aacccgcact ggctcccgcc gcccgggttc 13560
tacagcgggg gcttcgaggt ccccgaggcc
aacgacggct tcctgtggga cgacatggac 13620
gacagcgtgt tctccccgcg gccgcaggcg
ctggcggagg cgtcgctgct ccgcctcccc 13680
aagaaagaag agagccgccg gcccagcagc
gcggcggcct ctctgtccga gctgggggcg 13740
gcggccgcgc ggcccgggtc cctggggggc
agcccctttc ccagtctggt ggggtctctg 13800
cagagcgggc gcaccacccg gccccggctg
ctgggcgagg acgagtacct gaacaactcc 13860
ctgatgcagc cggtgcggga gaaaaacctg
ccccccgcct tccccaacaa cgggatagag 13920
agcctggtag acaagatgag cagatggaag
acctatgcgc aggagcacag ggactcgccc 13980
gtgctccgtc cgcccacgcg gcgccagcgc
cacgaccggc agcgggggct ggtatgggat 14040
gacgaggact ccgcggacga tagcagcgtg
ctggacctgg gggggagcgg cggtaacccg 14100
ttcgcgcacc tgcgcccccg cctggggagg
atgtttcaat aagaaaaatc aagcatgatg 14160
caaggttttt taagcggata aataaaaaac
tcaccaaggc catggcgacc gagcgttgtt 14220
ggtttcttgt tgtgttccct tagtatgcgg
cgcgcggcga tgtaccacga gggacctcct 14280
ccctcttatg agagcgtggt gggcgcggcg
gcggcctctc cctttgcgtc gcagctggag 14340
ccgccgtacg tgcctccgcg gtacctgcgg
cctacggggg gaagaaacag catccgttac 14400
tcggagctgg cgcccctgta cgacaccacc
cgggtgtacc tggtggacaa caagtcggcg 14460
gacgtggcct ccctgaacta ccagaacgac
cacagcaatt ttttgaccac ggtcatccag 14520
aacaatgact acaccccgag cgaggccagc
acccagacca tcaatctgga tgaccggtcg 14580
cactggggcg gcgacctgaa aaccatcctg
cacaccaaca tgcccaacgt gaacgagttc 14640
atgttcacca ataagttcaa ggcgcgggtg
atggtgtcgc gttcgcacac caaggacgac 14700
cgggtggagc tgaagtacga gtgggtagag
ttcgagctgc ccgagggcaa ctactcggag 14760
accatgacca tagacctgat gaacaacgcg
atcgtggagc actatctgaa agtgggcagg 14820
cagaacgggg tcctggagag cgacatcggg
gtcaagttcg acaccaggaa cttccgcctg 14880
gggctggacc cggtcaccgg gctggtcatg
cccggggtct acaccaacga ggccttccac 14940
cccgacatca tcctgctgcc cggctgcggg
gtggacttca cctacagccg cctgagcaac 15000
ctgctgggca tccgcaagcg gcagcccttc
caggagggct ttaggatcac ctacgaggac 15060
ctggaggggg gcaacatccc cgcgctcctg
gatgtggagg cctaccagga tagcttgaag 15120
gaagaagagg cgggagaggg cagcggcggc
ggcggcggcg ccggtcagga ggagggcggg 15180
gcctcctctg aggcctctgc ggacgccgcc
gctgccgccg aggcggaggc ggccgacccc 15240
gcgatggtgg tagaggaaga gaaggatatg
aatgacgagg cggtgcgcgg cgacaccttt 15300
gccacccggg gggaggagaa gaaagcggag
gccgaggccg cggcagagga ggcggcagcg 15360
gcggcggcgg cggcagtaga ggcggcggcc
gaggcggaga agccccccaa ggagcccgtg 15420
attaaggccc tgaccgaaga tagcaagaag
cgcagttaca acgtgctcaa ggacagcacc 15480
aacaccgcgt accgcagctg gtacctggcc
tacaactacg gcgacccggc gacgggggtg 15540
cgctcctgga ccctgctgtg tacgccggac
gtgacctgcg gctcggagca ggtgtactgg 15600
tcgctgcccg acatgatgca agaccccgtg
accttccgct ccacgcggca ggtcagcaac 15660
ttcccggtgg tgggcgccga gctgctgccc
gtgcactcca agagcttcta caacgaccag 15720
gccgtctact cccagctcat ccgccagttc
acctctctga cccacgtgtt caatcgcttt 15780
cctgagaacc agattctggc gcgcccgccc
gcccccacca tcaccaccgt cagtgaaaac 15840
gttcctgctc tcacagatca cgggacgcta
ccgctgcgca acagcatcgg aggagtccag 15900
cgagtgaccg taactgacgc cagacgccgc
acctgtccct acgtttacaa ggccctgggc 15960
atagtctcgc cgcgcgtcct ttccagccgc
actttttaag catgtccatc ctcatctcgc 16020
ccagcaataa caccggctgg ggcctgctgc
gcgcgcccag caagatgttt ggaggggcga 16080
ggaagcgctc cgaccagcac cccgtgcgcg
tgcgcgggca ctaccgcgcc ccctggggcg 16140
cgcacaaacg cgggcgcacc ggcaccgcgg
ggcgcaccac cgtggacgaa gccatcgact 16200
cggtggtgga gcaggcgcgc aactacacgc
ccgcggtctc caccgtggac gcggctatcg 16260
agagcgtggt gcgaggcgcg cggcggtacg
ccaaggcgaa gagccgccgg aggcgcgtgg 16320
cccgccgcca ccgccgtcga cccggaagcg
ccgccaagcg cgccgccgcc gccttgcttc 16380
gtcgggccag acgcacgggc cgccgcgccg
ccatgagggc cgcgcgccgc ctggccgccg 16440
gcatcaccac cgtggccccc cgcgccagaa
gacgcgcggc cgctgccgcc gccgcggcca 16500
tcagcgacct ggccaccagg cgccggggca
acgtgtactg ggtgcgcgac tcggtgagcg 16560
gcacgcgcgt gcccgtgcgc ttccgccccc
cgcggacttg agaggagagg acaggaaaaa 16620
agcatcaaca acaccaccac tgagtctcct
gctgttgtgt gtatcccagc ggcgcgcgcg 16680
cacacggcga catgtccaag cgcaaaatca
aagaagagat gctccaggtc gtcgcgccgg 16740
aaatctatgg gcccccgaag aaggaagagc
aggatttcaa gccccgcaag ataaagcggg 16800
tcaaaaagaa aaagaaagat gacgatgatg
gcgaggtgga gtttctgcgc gccacggcgc 16860
ccaggcgccc gctgcagtgg aagggtcggc
gcgtaaagcg cgttctgcgc cccggcaccg 16920
cggtggtctt cacgcccggc gagcgctcca
cccgcacttt caagcgcgtc tatgacgagg 16980
tgtacggcga cgaagacctg ctggagcagg
ccaacgatcg ctccggagag tttgcttacg 17040
ggaagcggca ccgggcgatg gagaaggacg
aggtgctggc gctgccgctg gaccggggca 17100
accccacccc cagcctgaag cccgtgaccc
tgcagcaggt gctgccggcc agcgcgccct 17160
ccgagatgaa gcggggcctg aagcgcgagg
gcggcgacct ggcgcccacc gtgcagctga 17220
tggtgcccaa gcggcagagg ctggaggacg
tgctggagaa aatgaaagta gaccccggcc 17280
tgcagccgga catcagggtc cgccccatca
agcaggtggc gccgggcctc ggcgtgcaga 17340
ccgtggacgt ggtcatcccc accggcgcct
cctcttccag cgccgccgcc gccactagca 17400
ccgcggacat ggagacgcag actagctccg
ccctcgccgc ccccgcggcc gccgccgccg 17460
ccacctcctc ggcggaggta cagacggacc
cctggatgcc gccgccggcg gccgccccct 17520
cgcgcgcacg ccgcgggcgc aggaagtacg
gcgccgccag cgcgctcatg cccgagtacg 17580
ccttgcatcc ttccatcgcg cccacccccg
gctaccgagg ctacagctac cgcccgcgaa 17640
gagccaaggg ctccacccgc cgcagccgcc
gcgccgccac ctctacccgc cgccgcagtc 17700
gccgccgccg ccggcagccc gcgctggctc
cgatctccgt gaggagagtg gcgcgcaacg 17760
gggacacctt ggtgctgccc agggcgcgct
accaccccag catcgtttaa aagcctgttg 17820
tggttcttgc agatatggcc ctcacttgcc
gcctccgttt cccggtgccg ggataccgag 17880
gaagatcgcg ccgtagaagg ggtatggccg
gacgcggcct gagcggaggc agccgccgtg 17940
cgcaccggcg gcgacgcgcc accagccgac
gcatgcgcgg cggggtgctg cctctgctga 18000
tccccctgat cgccgcggcg atcggcgccg
tgcccgggat cgcctccgtg gccttgcagg 18060
cgtcccagag gcgttgacac agacttcttg
caagcttgca aaaatatgga aaaaatcccc 18120
ccaataaaaa agtctagact ctcacgctcg
cttggtcctg tgactatttt gtagaaaaaa 18180
agatggaaga catcaacttt gcgtcgctgg
ccccgcgtca cggctcgcgc ccgttcctgg 18240
gacactggaa cgatatcggc accagcaaca
tgagcggtgg cgccttcagt tggggctctc 18300
tgtggagcgg cattaaaaat atcggttctg
ccgttaagaa ttacggctcc aaggcctgga 18360
acagcagcac gggccagatg ttgagagaca
agttgaaaga gcagaacttc cagcagaagg 18420
tggtggaggg cctggcctcc ggcatcaacg
gggtggtgga cctggccaat caggccgtgc 18480
aaaataagat caacagcaga ctggaccccc
ggccgccggt ggaagagctg ccgccggcgc 18540
tggagacggt gtcccccgat gggcggggcg
aaaagcgccc gcggcccgac agggaagaga 18600
ccactctggt cacgcacacc gatgagccgc
ccccctacga ggaagctctg aagcaaggct 18660
tgcccaccac tcggcccatc gcgcccatgg
ccaccggggt ggtgggccgc cacacccccg 18720
ccaggctgga cctgcctcct cctcctgttt
cttcttcggc cgccgatgcg cagcagcaga 18780
aggcggcgct gcccggtccg cccgcggccg
ccccccgtcc caccgccagt cgagcgcccc 18840
tgcgtcgcgc ggccagcggc ccccgcgggg
tcgcgaggca cagcagcggc aactggcaga 18900
acacgctgaa cagcatcgtg ggtctggggg
tgcagtccgt gaagcgccgc cgatgctact 18960
gaatagctta gctaacggtg ttgtatgtgt
gtatgcgtcc tatgtcaccg ccagaggagc 19020
tgctgagtcg ccgccgttcg cgcgcccacc
gccactacca ccgccggtac cactccagcg 19080
cccctcaaga tggcgacccc atcgatgatg
ccgcagtggt cgtacatgca catctcgggc 19140
caggacgcct cggagtacct gagccccggg
ctggtgcagt tcgcccgcgc caccgacagc 19200
tacttcagcc tgagtaacaa gtttaggaac
cccacggtgg cgcccacgca cgatgtgacc 19260
accgaccggt cccagcgcct gacgctgcgg
ttcatccccg tggaccgcga ggacaccgcg 19320
tactcttaca aggcgcggtt caccctggcc
gtgggcgaca accgcgtgct ggacatggcc 19380
tccacctact ttgacatccg cggcgtgctg
gacaggggcc ccaccttcaa gccctactcc 19440
ggcaccgcct acaactccct ggcccccaag
ggcgccccca actcctgcga gtgggagcaa 19500
gaggagactc agacagctga agaggcacaa
gacgaagaag aagatgaagc tgaagctgag 19560
gaggaaatgc ctcaggaaga gcaagcacct
gtcaaaaaga ctcatgtata tgctcaggct 19620
cccctttctg gcgaaaaaat tactaaagac
ggtctgcaga taggaacgga cgctacagct 19680
accgaacaaa aacctattta tgcagatccc
acattccagc cagaacccca aattggtgaa 19740
tctcagtgga atgaggcaga tgcttcagtt
gccggcggta gagtgctgaa gaaaactact 19800
cccatgaaac cctgttatgg ttcctatgcc
aggcccacaa atgccaatgg aggtcagggt 19860
gtattggtgg agaaagacgg tggaaagatg
gaaagccaag tagatatgca attcttttcg 19920
acttctgaaa acgcccgtaa cgaggctaac
aacattcagc ccaaattggt gctgtacagc 19980
gaggatgtgc atatggagac cccagacaca
cacatttctt acaagcctgc aaaaagcgat 20040
gataattcga aagtcatgct gggtcagcag
tccatgccca acaggccaaa ttacatcggc 20100
ttcagagaca actttatcgg gctcatgtat
tacaacagca ctggcaacat gggggtgctg 20160
gcaggtcagg cctcacagtt gaatgcggtg
gtggacttgc aagacagaaa cacagaactg 20220
tcctaccagc tcttgcttga ttccatggga
gacagaacca gatacttttc catgtggaat 20280
caggcggtgg acagttatga tccagatgtc
agaattattg aaaatcatgg aactgaagat 20340
gagctgccca actattgttt ccctctggga
ggcatagggg taactgacac ttaccaggcc 20400
attaagacta atggcaatgg caacggcggg
ggcaatacca cttggaccaa ggatgaaact 20460
tttgcagacc gcaacgagat aggggtggga
aacaatttcg ccatggagat caacctcagt 20520
gccaacctgt ggaggaactt cctctactcc
aacgtggccc tgtacctgcc agacaagctt 20580
aagtacaacc cctccaacgt ggaaatctct
gacaacccca acacctacga ctacatgaac 20640
aagcgagtgg tggccccggg gctggtggac
tgctacatca acctgggcgc gcgctggtcc 20700
ctggactaca tggacaacgt caaccccttc
aaccaccacc gcaacgcggg cctgcgctac 20760
cgctccatgc ttctgggcaa cgggcgctac
gtgcccttcc acatccaggt gccccagaag 20820
ttctttgcca tcaagaacct cctcctcctg
ccgggctcct acacctacga gtggaacttc 20880
aggaaggatg tcaacatggt cctccagagc
tctctgggta acgacctcag ggtcgacggg 20940
gccagcatca agttcgagag catctgcctc
tacgccacct tcttccccat ggcccacaac 21000
acggcctcca cgctcgaggc catgctcagg
aacgacacca acgaccagtc cttcaacgac 21060
tacctctccg ccgccaacat gctctacccc
atccccgcca acgccaccaa cgttcccatc 21120
tccatcccct cgcgcaactg ggcggccttc
cgcggctggg ccttcacccg cctcaagacc 21180
aaggagaccc cctccctggg ctcgggtttc
gacccctact acacctactc gggctccata 21240
ccctacctgg acggaacctt ctacctcaac
cacactttca agaaggtctc ggtcaccttc 21300
gactcctcgg tcagctggcc gggcaacgat
cgcctgctca cccccaacga gttcgagatc 21360
aagcgctcgg tcgacgggga gggctacaac
gtggcccagt gcaacatgac caaggactgg 21420
ttcctcatcc aaatgctggc caactacaac
atcggctatc agggcttcta catcccagag 21480
agctacaagg acaggatgta ctccttcttt
aggaacttcc agcccatgag ccggcaggtg 21540
gtggacgaaa ccaagtacaa ggactaccag
caggtgggca tcatccacca gcacaacaac 21600
tcgggcttcg tgggctacct cgcccccacc
atgcgcgagg gacaggccta ccccgccaac 21660
ttcccctacc cgctcattgg caagaccgcg
gtcgacagcg tcacccagaa aaagttcctc 21720
tgcgaccgca ccctctggcg catccccttc
tccagcaact tcatgtccat gggtgcgctc 21780
acggacctgg gccagaacct gctctatgcc
aactccgccc acgcgctcga catgaccttc 21840
gaggtcgacc ccatggacga gcccaccctt
ctctatgttc tgttcgaagt ctttgacgtg 21900
gtccgggtcc accagccgca ccgcggcgtc
atcgagaccg tgtacctgcg cacgcccttc 21960
tcggccggca acgccaccac ctaaagaagc
aagccgccac cgccaccacc tgcatgtcgt 22020
cgggttccac cgagcaggag ctcaaggcca
tcgtcagaga cctgggatgc gggccctatt 22080
ttttgggcac cttcgacaaa cgcttcccgg
gcttcgtcgc cccgcacaag ctggcctgcg 22140
ccatcgtcaa cacggccggc cgcgagaccg
ggggcgtgca ctggctggcc ttcgcctgga 22200
acccgcgctc caaaacatgc tacctctttg
accccttcgg attctcggac cagcggctca 22260
agcagatcta ccagttcgag tacgagggcc
tgctgcgccg cagcgccatc gcctcctcgc 22320
ccgaccgctg cgtcaccctc gagaagtcca
cccagaccgt gcaggggccc gactcggccg 22380
cctgcggtct cttctgctgc atgttcctgc
atgcctttgt gcactggccc cagagtccca 22440
tggaccgcaa ccccaccatg aacttgctga
cggggatccc caactccatg ctccagagcc 22500
cccaggtcgc gcccaccctg cgccgcaacc
aggagcggct ctacagcttc ctggaacgcc 22560
actcgcccta cttccgccgc cacagcgcgc
agatcagggg ggccacctct ttctgccgca 22620
tgcaagagat gcaagggaaa atgcaatgat
gtacacagac actttttctt ttctcaataa 22680
atggcaactt tatttataca tgctctctct
cgggtattca tttccccacc acccaccacc 22740
cgccgccgcc gtaaccatct gctgctggct
tttttttttt tttttaaaaa tcgaaagggt 22800
tctgccggga atcgccgtgc gccacgggca
gggacacgtt gcggaactgg tagcgggtgc 22860
cccacttgaa ctcgggcacc accatgcggg
gcaagtcggg gaagttgtcg gcccacaggc 22920
tgcgggtcag caccagcgcg ttcattaggt
cgggcgccga gatcttgaag tcgcagttgg 22980
ggccgccgcc ctgcgcgcgc gagttgcggt
acaccgggtt gcaacactgg aacaccagca 23040
gcgccggata attcacactg gccagcacgc
tccggtcgga gatcagctcg gcgtccaggt 23100
cctccgcgtt gctcagcgcg aacggggtca
gcttgggcac ctgccgcccc aggaagggag 23160
cgtgccccgg cttcgagttg cagtcgcagc
gcagcgggat cagcaggtgc ccgcggccgg 23220
actcggcgtt ggggtacagc gcgcgcatga
aggcctccat ctggcggaag gccatctggg 23280
ccttggcgcc ctccgagaag aacatgccgc
aggacttgcc cgagaactgg ttcgcggggc 23340
agctagcgtc gtgcaggcag cagcgcgcgt
cggtgttggc gatctgcacc acgttgcgcc 23400
cccaccggtt cttcacgatt ttggccttgg
aagcctgctc cttcagcgcg cgctgcccgt 23460
tctcgctggt cacatccatc tcgatcacgt
gctccttgtt caccatgctg ctgccgtgca 23520
gacacttcag ctcgccctcc acctcggtgc
agcggtgctg ccatagcgcg cagcccgtgg 23580
gctcgaaatg cttgtaggtc acctccgcgt
aggactgcag gtaggcctgc aggaagcgcc 23640
ccatcatggt cacgaaggtc ttgttgctgc
tgaaggtcag ctgcagcccg cggtgctcct 23700
cgttcagcca ggccttgcac acggccgcca
gcgcctccac ctggtcgggc agcatcttga 23760
agttcagctt cagctcattc tccacatggt
acttgtccat cagcgcgcgc gcagcctcca 23820
tgcccttctc ccaggccgac accagcggca
ggctcaaggg gttcaccacc gtcgcagccg 23880
ccgctgcgct ttcgctttcc gctccgctgt
tctcttcttc ctcctcctct tcttcctcgc 23940
cgcccgcgcg cagcccccgc accacggggt
cgtcttcctg caggcgccgc accgagcgct 24000
tgccgctcct gccctgcttg atacgcacgg
gcgggttgct gaagcctacc atcaccagcg 24060
cggcctcttc ttgctcgtcc tcgctgtcca
ctatgacctc gggggagggc gacctcagaa 24120
ccgtggcgcg ctgcctcttc tttttcctgg
gggcgtttgc cagctccgcg gccgcggccg 24180
ccgccgaggt cgaaggccga gggctgggcg
tgcgcggcac cagcgcgtcc tgcgagccgt 24240
cctcgtcctc ggactcgagg cggcagcgag
cccgcttctt cgggggcgcg cggggcggcg 24300
gcggcggggg cggcggcgac ggagacgggg
acgagacatc gtccagggtg ggaggacggc 24360
gggccgcgcc gcgtccgcgc tcgggggtgg
tttcgcgctg gtcctcttcc cgactggcca 24420
tctcccactg ctccttctcc tataggcaga
aagagatcat ggagtctctc atgcaagtcg 24480
agaaggagga ggacagccta accaccaccg
ccccctctga gccctccgcc gccgccgcgg 24540
acgacgcgcc caccaccacc gccgccgcca
ccaccaccat taccacccta cccggcgacg 24600
cagccccgat cgagaaggaa gtgttgatcg
agcaggaccc gggttttgtg agcgaagagg 24660
aggatgagga ggatgaaaag gagaaggata
ccgccgcctc agtgccaaaa gaggataaaa 24720
agcaagacca ggacgacgca gagacagatg
aggcagcagt cgggcggggg gacggaaggc 24780
atgatgatga tgacggctac ctagacgtgg
gagacgacgt gctgcttaag cacctgcacc 24840
gccagtgcgt catcgtctgc gacgcgctgc
aggagcgctg cgaagtgccc ctggacgtgg 24900
cggaggtcag ccgcgcctac gagcggcacc
tcttcgcgcc acacgtgccc cccaagcgcc 24960
gggagaacgg cacctgcgag cccaacccgc
gcctcaactt ctacccggtc ttcgcggtac 25020
ccgaggtgct ggccacctac cacatcttct
tccaaaactg caagatcccc ctctcctgcc 25080
gcgccaaccg cacccgcgcc gacaagacgc
tggccctgcg gcagggcgcc cacatacctg 25140
atatcgcctc tctggaggag gtgcccaaga
tcttcgaggg tctcggtcgc gacgagaaac 25200
gggcggcgaa cgctctgcaa ggagacagcg
aaaacgagag tcactcgggg gtgctggtgg 25260
agctcgaggg cgacaacgcg cgcctggccg
tgctcaagcg cagcatcgaa gtcacccact 25320
tcgcctaccc ggcgctcaac ctgcccccca
aggtcatgag tgtggtcatg agtgagctca 25380
tcatgcgccg cgcccagccc ctggacgcgg
atgcaaactt gcaagagccc tccgaggaag 25440
gcctgcccgc ggtcagcgac gagcagctgg
cgcgctggct ggagacccgc gaccccgccc 25500
agctggagga gcggcgcaag ctcatgatgg
ccgcggtgct cgtcaccgtg gagctcgagt 25560
gtctgcagcg cttcttcggg gaccccgaga
tgcagcgcaa gctcgaggag accctgcact 25620
acaccttccg ccagggctac gtgcgccagg
cctgcaagat ctccaacgtg gagctctgca 25680
acctggtctc ctacctgggc atcctgcacg
agaaccgcct cgggcagaac gtcctgcact 25740
ccaccctcaa aggggaggcg cgccgcgact
acgtccgcga ctgcgtctac ctcttcctct 25800
gctacacgtg gcagacggcc atgggggtct
ggcagcagtg cctggaggag cgcaacctca 25860
aggagctgga gaagctcctc cggcgcgccc
tcagggacct ctggacgggc ttcaacgagc 25920
gctcggtggc cgccgcgctg gcggacatca
tcttccccga gcgcctgctc aaaaccctgc 25980
agcagggcct gcccgacttc accagccaga
gcatgctgca gaacttcagg accttcatcc 26040
tggagcgctc gggcatcctg ccggccacct
gctgcgcgct gcccagcgac ttcgtgccca 26100
tcaggtacag ggagtgcccg ccgccgctct
ggggccactg ctacctcttc cagctggcca 26160
actacctcgc ctaccactcg gatctcatgg
aagacgtgag cggcgagggc ctgctcgagt 26220
gccactgccg ctgcaacctg tgcacgcccc
accgctctct agtctgcaat ccgcagctgc 26280
tcagcgagag tcagattatc ggtaccttcg
agctgcaggg tccctcgccc gacgaaaagt 26340
ccgcggctcc ggggttgaaa ctcactccgg
ggctgtggac ttccgcctac ctacgcaaat 26400
ttgtacctga agactaccac gcccacgaga
tcaggtttta cgaagaccaa tcccgcccgc 26460
ccaaggcgga gctcaccgcc tgcgtcatta
cccagggcca catcctgggc caattgcaag 26520
ccatcaacaa agcccgccaa gagttcttgc
tgaaaaaggg tcggggggtg tacctggacc 26580
cccagtccgg cgaggagcta aacccgctac
ccccgccgcc gccccagcag cgggaccttg 26640
cttcccagga tggcacccag aaagaagcag
ccgccgccgc cgccagcata catgcttctg 26700
gaggaagagg aggactggga cagtcaggca
gaggaggttt cggacgagga cgaggaggag 26760
gagatgatgg aagactggga ggaggacagc
ctagacgagg aagcttcaga ggccgaagag 26820
gtggcagacg caacaccatc accctcggcc
gcagccccct cgccggcgcc cccgaaatcc 26880
tccgacccca gcagcagcgc tataacctcc
gctcctccgg cgccggcgcc cacccgcagc 26940
agacccaacc gtagatggga cactacagga
accggggtcg gtaagtccaa gtgcccccca 27000
gcgccgcccc cgcaacagga gcaacagcag
cagcagcggc gacagggcta ccgctcgtgg 27060
cgcggacaca agaacgccat agtcgcctgc
ttgcaagact gcgggggcaa catctccttc 27120
gcccgccgct tcctgctctt ccaccacggg
gtggcttttc cccgcaatgt cctgcattac 27180
taccgtcatc tctacagccc ctactgcggc
ggcagcggcg acccagaggg agcggcggca 27240
gcagcagcgc cagccacagc ggcgaccacc
taggaagacc tccgcgggca agacggcggg 27300
agccgggaga cccgcggcgg cggcggtagc
ggcggcggcg ggcgcactgc gcctctcgcc 27360
caacgaaccc ctctcgaccc gggagctcag
acacaggatc ttccccactc tgtatgctat 27420
cttccagcag agcagaggcc aggaacagga
gctcaaaata aaaaacagat ctctgcgctc 27480
cctcacccgc agctgtctgt atcacaaaag
cgaagatcag cttcggcgca cgctggagga 27540
cgcggaggca ctcttcagca aatactgcgc
gctgactctt aaggactagc cgcgcgccct 27600
tctcgaattt aggcgggaga aagactacgt
catcgccgac cgccgcccag cccacccagc 27660
cgacatgagc aaagagattc ccacgcccta
catgtggagc taccagccgc agatgggact 27720
cgcggcggga gcggcccaag actactccac
ccgcatgaac tacatgagcg cggggcccca 27780
catgatctca cgggttaatg ggatccgcgc
ccagcgaaac caaatactgc tggaacaggc 27840
ggccataacc gccacacccc gtcatgacct
caatccccga aattggcccg ccgccctcgt 27900
gtaccaggaa accccctctg ccaccaccgt
ggtacttccg cgtgacaccc aggccgaagt 27960
ccagatgact aactcagggg cgcagctcgc
gggcggcttt cgtcacgggg tgcggccgca 28020
ccggccgggt atattacacc tggcgatcag
aggccgaggt attcagctca acgacgagtc 28080
ggtgagctct tcgctcggtc tccgtccgga
cggaaccttc cagatcgccg gatcaggtcg 28140
ctcctcattc acgcctcgcc aggcgtatct
gactctgcag acctcctcct cggagcctcg 28200
ctccggcggc atcggcaccc tccagttcgt
ggaggagttc gtgccctcgg tctacttcaa 28260
ccccttctcg ggacctcccg gacgctaccc
cgaccagttc atcccgaact ttgacgcggt 28320
gaaggactcg gcggacggct acgactgaat
gtcaagtgct gaggcagaga gcgttcgcct 28380
gaaacacctc cagcactgcc gccgcttcgc
ctgctttgcc cgcagctccg gtgagttctg 28440
ctactttcag ctgcccgagg agcataccga
agggccggcg cacggcgtcc gcctaaccac 28500
ccagggcgag gttacctgta cccttatccg
ggagtttacc ctccgtcccc tgctagtgga 28560
gcgggagcgg ggttcttgtg tcataactat
cgcctgcaac tgccctaacc ctggattaca 28620
tcaagatctt tgttgtcacc tgtgcgctga
gtataataaa cgctgagatc agactctact 28680
ggggctcctg tcgccatcct gtgaacgcca
ccgtcttcac ccaccccgag cagccccagg 28740
cgaacctcac ctgcggcctg cgtcggaggg
ccaagaagta cctcacctgg tacttcaacg 28800
gcaccccctt tgtggtttac aacagcttcg
accaggacgg agttgccttg agagacgacc 28860
tttccggtct cagctactcc attcacaaga
acaccaccct ccacctcttc cctccctacc 28920
tgccgggaac ctacgagtgc gtcaccggcc
gctgcaccca cctcctccgc ctgatcgtaa 28980
accagacctt tccgggaaca cacctcttcc
ccagaacagg aggtgagctc aggaaacccc 29040
ctggggccca gggcggagac ttaccttcga
cccttgtggg gttaggattt tttatcgccg 29100
ggttgctggc tctcctgatc aaagcttcct
tcagatttgt tctctccctt tacttttatg 29160
aacagctcaa cttctaataa cgctaccttt
tctcaggaat cgagtagtaa cttctcttcc 29220
gaaatcgggc tgggtgtgct gcttactctg
ttgatttttt tccttatcat acttagcctt 29280
ctgtgcctca ggctcgccgc ctgctgcgca
catatctaca tctacagccg gttgcttaac 29340
tgctggggtc gccatccaag atgaacgggg
ctcaggtgct atgtctgctg gccctggtgg 29400
cctgcagtgc cgccgtcaat tttgaggaac
ccgcttgcaa tgtgactttc aagcctgagg 29460
gcgcacattg caccactctg gttaaatgtg
tgacctctca tgaaaaactg ctcatcgcct 29520
acaaaaacaa aacaggccag atcgcagtct
atagcgagtg gctacccgga gaccataata 29580
actactcagt caccgtcttc gagggtgcgg
agtctaagaa attcgattac acctttccct 29640
tcgaggagat gtgtgatgcg gtcatgtacc
tgtccaaaca gtacaagctg tggcccccca 29700
cccccaaggc gtgtgtggaa aacactgggt
ctttctgctg tctctctctg gcaatcactg 29760
tgcttgctct aatctgcacg ctgctataca
tgagattcag gcagaggcga atctttatcg 29820
atgagaaaaa aatgccttga tcgctaacac
cggctttctg tctgcagaat gaaagcaatc 29880
acctccctac taatcagcac caccctcctt
gcgattgccc atgggttgac acgaatcgaa 29940
gtgccagtgg ggtccaatgt caccatggtg
ggccccgccg gcaattcctc cctgatgtgg 30000
gaaaaatatg tccgtaatca atgggatcat
tactgctcta atcgaatctg tatcaagccc 30060
agagccacct gcgacgggca aaatctaact
ttgattgatg tgcaaatgac ggatgctggg 30120
tactattacg ggcagcgggg agaaatgatt
aattactggc gaccccacaa ggactacatg 30180
ctgcatgtag tcaaggcagt cccaactact
accaccccca ccactaccac tcccactacc 30240
accaccccca ccactaccac tagcactgct
actaccgctg cccgcaaagc tattacccgc 30300
aaaagcacca tgcttagcac caagccccat
tctcactccc acgccggcgg gcccaccggt 30360
gcggcctcag aaaccaccga gctttgcttc
tgccaatgca ctaacgccag cgcccacgaa 30420
ctgttcgacc tggagaatga ggacgatgac
cagctgagct ccgcttgccc ggtcccgctg 30480
cccgcagagc cggtcgccct gaagcagctc
ggtgatccat ttaatgactc tcctgtttat 30540
ccctctcccg aataccctcc cgactctacc
ttccacatca cgggcaccaa agaccccaac 30600
ctctccttct acctgatgct gctgctctgt
atctctgtgg tatcttccgc gctcatgtta 30660
ctgggcatgt tctgctgcct catctgccgc
agaaaaagaa agtctcgctc tcagggccaa 30720
ccactgatgc ccttccccta ccccccagat
tttgcagata acaagatatg agcacgctgc 30780
tgacactaac cgctttactc gcctgcgctc
taacccttgt cgcttgcgaa tccagatacc 30840
acaatgtcac agttgtgaca ggagaaaatg
ttacattcaa ctccacggcc gacacccagt 30900
ggtcgtggag tggccacggt agctatgtat
acatctgcaa tagctccacc tcccctagca 30960
tgtcctctcc caagtaccac tgcaatgaca
gcctgttcac cctcatcaac gcctccacct 31020
cggacaatgg actctatgta ggctatgtga
cacccggtgg gcagggaaag acccacgcct 31080
acaacctgca agttcgccac ccctccacca
ccgccaccac ctctgccgcc cctacccgca 31140
gcagcagcag cagcagcagc agcagcagca
gcagcagcag cagattcctg actttaatcc 31200
tagccagctc aacaaccacc gccaccgctg
agaccaccca cagctccgcg cccgaaacca 31260
cccacaccca ccacccagag acgaccgcgg
cctccagcga ccagatgtcg gccaacatca 31320
ccgcctcggg tcttgaactt gcttcaaccc
ccaccccaaa accagtggat gcagccgacg 31380
tctccgccct cgtcaatgac tgggcggggc
tgggaatgtg gtggttcgcc ataggcatga 31440
tggcgctctg cctgcttctg ctctggctca
tctgctgcct caaccgcagg cgggccagac 31500
ccatctatag acccatcatt gttctcaacc
ccgctgatga tgggatccat agattggatg 31560
gtctgaaaaa cctacttttc tcttttacag
tatgataaat tgagacatgc ctcgcatttt 31620
catgtacttg acacttctcc cactttttct
ggggtgttct acgctggccg ccgtctctca 31680
cctcgaggta gactgcctca cacccttcac
tgtctacctg atttacggat tggtcaccct 31740
cactctcatc tgcagcctaa tcacagtagt
catcgccttc atccagtgca ttgactacat 31800
ctgtgtgcgc ctcgcatacc tgagacacca
cccgcagtac cgagacagga acattgccca 31860
actcctaaga ctgctctaat catgcataag
actgtgatct gcctcctcat cctcctctcc 31920
ctgcccgctc tcgtctcatg ccagcccacc
acaaaacctc cacgaaaaag acatgcctcc 31980
tgtcgcttga gccaactgtg gaatattccc
aaatgctaca atgaaaagag cgagctttcc 32040
gaagcctggc tatatgcggt catgtgtgtc
cttgtcttct gcagcacaat ctttgccctc 32100
atgatctacc cccactttga tttgggatgg
aatgcggtcg atgccatgaa ttaccctacc 32160
tttcccgcgc ccgatatgat tccactccga
caggttgtgg tgcccgtcgc cctcaatcaa 32220
cgccccccat cccctacacc cactgaggtc
agctacttta atctaacagg cggagatgac 32280
tgacactcta gatctagaaa tggacggcat
cggcaccgag cagcgtctcc tacagaggcg 32340
caagcaggcg gctgaacaag agcgcctcaa
tcaggagctc cgagatctca ttaacctgca 32400
ccagtgcaaa aaaggcatct tttgcctggt
caagcaggcc gatgtcacct acgagaaaac 32460
cggtaacagc caccgcctca gctacaagct
gcccacccaa cgccagaagt tggtgctcat 32520
ggtgggtcag aatcccatca ccgtcaccca
gcactcggtg gagaccgagg ggtgtctgca 32580
ctccccctgt cagggtccgg aagacctctg
caccctggta aagaccctgt gtggtcttag 32640
agatttaatc ccctttaact aatcaaacac
tggaatcaat aaaaagaatc acttacttta 32700
aatcagtcag caggtctctg tccactttat
tcagcagcac ctccttcccc tcctcccaac 32760
tctggtactc caaacgcctc ctggcggcaa
acttcctcca caccctgaag ggaatgtcag 32820
attcttgctc ctgtccctcc gcacccacta
tcttcatgtt gttgcagatg aagcgcgcca 32880
aaacgtctga cgagaccttc aaccccgtgt
acccctatga cacggaaaac gggcctccct 32940
ccgttccttt cctcacccct cccttcgtgt
cccccgacgg atttcaagaa agccccccag 33000
gggtcctgtc tctgcgcctg tcagagcccc
tggtcacttc ccacggcatg cttgccctga 33060
aaatgggaaa tggcctctcc ctggatgacg
ccggcaacct cacctctcaa gatgtcacca 33120
ccgtcacccc tcccctcaaa aaaaccaaga
ccaacctcag cctccagacc tcagcccccc 33180
tgaccgttag ctctgggtcc ctcaccgtcg
cggccgccgc tccactggcg gtggccggca 33240
cctctctcac catgcaatct caggccccct
tgacggtgca agatgcaaaa ctgggtctgg 33300
ccacccaggg acccctgacc gtgtctgaag
gcaaactcac cttgcagaca tcggctccac 33360
tgacggccgc cgacagcagc actctcactg
ttggcaccac accgccaatc agtgtgagca 33420
gtggaagtct aggcttagat atggaagacc
ccatgtatac tcacgatgga aaactgggaa 33480
tcagaattgg tggcccactg caagtagtag
acagcttgca cacactcact gtagttactg 33540
gaaacggaat aactgtagct aacaatgccc
ttcaaactaa agttgcgggt gccctgggtt 33600
atgactcatc tggcaatcta gaattgcgag
ccgcaggggg tatgcgaatt aacacagggg 33660
gtcaactcat tcttgatgtg gcttatccat
ttgatgctca gaacaatctc agccttagac 33720
tcggccaggg acctttatat gtgaacacca
atcacaacct agatttaaat tgcaacagag 33780
gtctgaccac aaccaccagc agtaacacaa
ccaaacttga aactaaaatc gattcgggct 33840
tagactataa cgccaatggg gctatcattg
ctaaacttgg cactgggtta acctttgaca 33900
acacaggtgc cataactgtg ggaaacactg
gggatgacaa actcactctg tggactaccc 33960
cagatccctc tcctaactgc agaattcacg
cagacaaaga ctgcaagttt actctagtcc 34020
tgactaagtg tggaagtcaa attctggcct
ccgtcgccgc cctggcggtg tctggaaacc 34080
tatcatcaat gacaggcact gtctccagcg
ttaccatctt tctcagattc gatcagaatg 34140
gagttcttat ggaaaattcc tcgctagaca
aggagtactg gaacttcaga aatggtaatt 34200
ccaccaatgc caccccctac accaatgcgg
ttgggttcat gcccaacctc agcgcctacc 34260
ccaaaaccca gagtcaaact gcaaaaaaca
acattgtaag tgaggtttac ttacatgggg 34320
acaaatctaa acccatgatc cttaccatta
cccttaatgg cacaaatgaa tccagtgaaa 34380
ctagtcaggt gagtcactac tccatgtcat
ttacatggtc ctgggacagt gggaaatatg 34440
ccaccgaaac ctttgccacc aactctttta
ccttctccta cattgctgaa caataaagaa 34500
gcataacgct gctgttcatt tgtaatcaag
tgttactttt ttatttttca attacaacag 34560
aatcattcaa gtcattctcc atttagctta
atagacccca gtagtgcaaa gccccatact 34620
agcttatttc agacagtata aattaaacca
taccttttga tttcaacatt aaaaaaatca 34680
tcacaggatc ctagtcgtca ggccgccccc
tcccttccaa gacacagaat acacaatcct 34740
ctccccccgg ctagctttaa acaacaccat
ctgattggtg acagacaggt tcttcggggt 34800
tatattccac acggtctcct ggcgggccag
gcgctcgtcg gtgatgctga taaactctcc 34860
cggcagctcg ctcaagttca cgtcgctgtc
cagcggctga acctcatgct gacgcggtaa 34920
ctgcgcgacc ggctgctgaa caaacggagg
ccgcgcctac aagggggtag agtcataatc 34980
ctccgtcagg atagggcggt tatgcagcag
cagcgagcga atcatctgct gccgccgccg 35040
ctccgtccgg caggaaaaca acatcccggt
ggtctcctcc gctataatcc gcaccgcccg 35100
cagcataagc ctcctcgttc tccgcgcgca
gcaccgcacc ctgatctcgc tcaggttggc 35160
gcagtaggta cagcacatca ccacgatgtt
attcatgatc ccacagtgca aggcgctgta 35220
tccaaagctc atgcccggga ccaccgcccc
cacgtgaccg tcgtaccaga agcgcaggta 35280
aatcaagtgc cgacccctca tgaacgtgct
ggacataaac atcacctcct tgggcatgtt 35340
gtaattcacc acctcccggt accagatgaa
tctctgattg aacacggccc cttccaccac 35400
catcctgaac caagaggcta ggacctgccc
accggctatg cactgcaggg aacccgggtt 35460
agaacaatga caatgcagac tccagggctc
gtaaccgtgg atcatccggc tgctgaagac 35520
atcgatgttg gcgcaacaca gacacacgtg
catacacttc ctcatgatta gcagctcctc 35580
cctcgtcagg atcatatccc aagggataac
ccattcttga atcaacgtaa agcccacaga 35640
gcagggaagg cctcgcacat aactcacgtt
gtgcatggtt agcgtgttgc attccggaaa 35700
cagcggatga tcctccagta tcgaggcgcg
ggtctcgttc tcacagggag gtaaaggggc 35760
cctgctgtac ggactgtggc gggacgaccg
agatcgtgtt gagcgtaacg tcatggaaaa 35820
gggaacgccg gacgtggtca tacttcttga
agcagaacca ggctcgcgcg tgacagacct 35880
ccttgcgtct acggtctcgc cgcttagctc
gctccgtgtg atagttgtag tacagccact 35940
ctctcaaagc gtcgaggcga cacctggcgt
caggatgtat gtagactccg tcttgcaccg 36000
cggccctgat aatatccacc accgtagaat
aagccacacc aagccaagca atacactcgc 36060
tttgcgagcg gcagacagga ggagcgggga
gagacggaag gaccatcata aaattttaaa 36120
gaatattttc caatacttcg aaatcaagat
ctaccaaatg gcaacgctcc cctccactgg 36180
cgcggtcaaa ctctacggcc aaagaacaga
taacggcatt tttaagatgt tcccggacgg 36240
cgtctaaaag acaaaccgct ctcaagtcga
cataaattat aagccaaaag ccatcgggat 36300
ccatatccac tatggacgcg ccggcggcgt
ccaccaaacc caaataattt tcttctctcc 36360
agcgcagcaa aatcccagta agcaactccc
tgatattaag atgaaccatg ccaaaaatct 36420
gttcaagagc gccctccacc ttcattctca
agcagcgcat catgattgca aaaattcagg 36480
ttcctcagac acctgtatga gattcaaaac
gggaatatta acaaaaattc ctctgtcgcg 36540
cagatccctt cgcagggcaa gctgaacata
atcagacagg tctgaacgaa ccagcgaggc 36600
caaatccccg ccaggaacca gatccagaga
ccctatgctg attatgacgc gcatactcgg 36660
ggctatgcta accagcgtag cgccgatgta
ggcgtgctgc atgggcggcg aaataaaatg 36720
caaggtgctg gttaaaaaat caggcaaagc
ctcgcgcaaa aaagctaaga catcataatc 36780
atgctcatgc aggtagttgc aggtaagctc
aggaaccaaa acggaataac acacgatttt 36840
cctctcaaac atgacttcca ggtgactgca
taagaaaaaa attataaata ataaatatta 36900
attaaataaa ttaaacattg gaagcctgtc
tcacaacagg aaaaaccact ctgatcaaca 36960
taagacgggc cacgggcatg cccgcgtgac
cataaaaaaa tcggtctccg tgattacaaa 37020
gcaccacaga tagctccccg gtcatgtcgg
gggtcatcat gtgagactgt gtatacacgt 37080
ccgggctgtt gacatcggtc aaagaaagaa
atcgagctac atagcccgga ggaatcaaca 37140
cccgcacgcg gaggtacagc aaaacggtcc
ccataggagg aatcacaaaa ttagtaggag 37200
aaaaaaaaac ataaacacca gaaaaaccct
cttgccgagg caaaacagcg ccctcccgtt 37260
ccaaaacaac ataaagcgct tccacaggag
cagccatgac aaagacccga gtcttaccag 37320
gaaaatttta aaaaagattc ctcaacgcag
caccagcacc aacacctgtc agtgtaaaat 37380
gccaagcgcc gagcgagtat atataggaat
aaaaagtgac gtaaacggtt aaagtccaga 37440
aaacgcccag aaaaaccgca cgcgaaccta
cgccccgaaa cgaaagccaa aaaacagtga 37500
acacgccctt tcggcgtcaa cttccgcttt
cccacggtac gtcacttccg catatagtaa 37560
aactacgcta cccaacatgc aagaagccac
gccccaaaaa acgtcacacc tcccggcccg 37620
ccccgcgccg ccgctcctcc ccgccccgcc
ccgctccgcc cacctcatta tcatattggc 37680
ttcaatccaa aataaggtat attattgatg
atg 37713
<210> 64
<211> 882
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 64
atgactacgt ccggcgttcc atttggcatg
acactacgac caacacgatc tcggttgtct 60
cggcgcactc cgtacagtag ggatcgtcta
cctccttttg agacagaaac ccgcgctacc 120
atactggagg atcatccgct gctgcccgaa
tgtaacactt tgacaatgca caacgtgagt 180
tacgtgcgag gtcttccctg cagtgtggga
tttacgctga ttcaggaatg ggttgttccc 240
tgggatatgg ttctaacgcg ggaggagctt
gtaatcctga ggaagtgtat gcacgtgtgc 300
ctgtgttgtg ccaacattga tatcatgacg
agcatgatga tccatggtta cgagtcctgg 360
gctctccact gtcattgttc cagtcccggt
tccctgcagt gtatagccgg cgggcaggtt 420
ttggccagct ggtttaggat ggtggtggat
ggcgccatgt ttaatcagag gtttatatgg 480
taccgggagg tggtgaatta caacatgcca
aaagaggtaa tgtttatgtc cagcgtgttt 540
atgaggggtc gccacttaat ctacctgcgc
ttgtggtatg atggccacgt gggttctgtg 600
gtccccgcca tgagctttgg atacagcgcc
ttgcactgtg ggattttgaa caatattgtg 660
gtgctgtgct gcagttactg tgctgattta
agtgagatca gggtgcgctg ctgtgcccgg 720
aggacaaggc gccttatgct gcgggcggtg
cgaatcatcg ctgaggagac cactgccatg 780
ttgtattcct gcaggacgga gcggcggcgg
cagcagttta ttcgcgcgct gctgcagcac 840
caccgcccta tcctgatgca cgattatgac
tctaccccca tg 882
<210> 65
<211> 36571
<212> DNA
<213> Adenoviridae - Mastadenovirus
<400> 65
catcatcaat aatatacctc aaacttttgg
tgcgcgttaa tatgcaaatg agctgtttga 60
atttggggat gcggggcgct gattggctgc
gggagcggcg accgttaggg gcggggcggg 120
tgacgttttg atgacgtggc cgtgaggcgg
agccggtttg caagttctcg tgggaaaagt 180
gacgtcaaac gaggtgtggt ttgaacacgg
aaatactcaa ttttcccgcg ctctctgaca 240
ggaaatgagg tgtttctggg cggatgcaag
tgaaaacggg ccattttcgc gcgaaaactg 300
aatgaggaag tgaaaatctg agtaatttcg
cgtttatggc agggaggagt atttgccgag 360
ggccgagtag actttgaccg attacgtggg
ggtttcgatt accgtatttt tcacctaaat 420
ttccgcgtac ggtgtcaaag tccggtgttt
ttacgtaggc gtcagctgat cgccagggta 480
tttaaacctg cgctcactag tcaagaggcc
actcttgagt gccagcgagt agagttttct 540
cctccgcgcc gcgagtcaga tctacacttt
gaaagatgag gcacttgaga gacctgcccg 600
gtaatgtttt cctggctact gggaacgaga
ttctggaatt ggtggtggac gccatgatgg 660
gtgacgaccc tcccgagccc cctaccccat
ttgaggcgcc ttcgctgtac gatttgtatg 720
atctggaggt ggatgtgccc gagaacgacc
ccaacgagga ggcggtgaat gatttgttta 780
gcgatgccgc gctgctggct gccgagcagg
ctaatacgga ctttggctca gacagcgatt 840
cttctctcca taccccgaga cccggcagag
gtgagaaaaa gatccccgag cttaaagggg 900
aagagctcga cctgcgctgc tatgaggaat
gcttgcctcc gagcgatgat gaggaggacg 960
aggaggcgat tcgagctgca gcgaaccagg
gagtgaaagc tgcgggcgaa agctttagcc 1020
tggactgtcc tactctgccc ggacacggct
gtaagtcttg tgaatttcat cgcatgaata 1080
ctggagataa gaatgtgatg tgtgccctgt
gctatatgag agcttacaac cattgtgttt 1140
acagtaagtg tgattaactt tagttgggaa
ggcagagggt gactgggtgc tgactggttt 1200
atttatgtat atgtttttta tgtgtaggtc
ccgtctctga cgcagatgag acccccactt 1260
cagagtgcat ttcatcaccc ccagaaattg
gcgaggaacc gcccgaagat attattcata 1320
gaccagttgc agtgagagtc accgggcgga
gagcagctgt ggagagtttg gatgacttgc 1380
tacagggtgg ggatgaacct ttggacttgt
gtacccggaa acgccccagg cactaagtgc 1440
cacacatgtg tgtttactta aggtgatgtc
agtatttata gggtgtggag tgcaataaaa 1500
tccgtgttga ctttaagtgc gtggtttatg
actcaggggt ggggactgtg ggtatataag 1560
caggtgcaga cctgtgtggt cagttcagag
caggactcat ggagatctgg acggtcttgg 1620
aagactttca ccagactaga cagctgctag
agaactcatc ggagggagtc tcttacctgt 1680
ggagattctg cttcggtggg cctctagcta
agctagtcta tagggccaag caggattata 1740
aggatcaatt tgaggatatt ttgagagagt
gtcctggtat ttttgactct ctcaacttgg 1800
gccatcagtc tcactttaac cagagtattc
tgagagccct tgacttttcc actcctggca 1860
gaactaccgc cgcggtagcc ttttttgcct
ttatccttga caaatggagt caagaaaccc 1920
atttcagcag ggattaccgt ctggactgct
tagcagtagc tttgtggaga acatggaggt 1980
gccagcgcct gaatgcaatc tccggctact
tgccagtaca gccggtagac acgctgagga 2040
tcctgagtct ccagtcaccc caggaacacc
aacgccgcca gcagccgcag caggagcagc 2100
agcaagagga ggaccgagaa gagaacccga
gagccggtct ggaccctccg gtggcggagg 2160
aggaggagta gctgacttgt ttcccgagct
gcgccgggtg ctgactaggt cttccagtgg 2220
acgggagagg gggattaagc gggagaggca
tgaggagact agtcacagaa ctgaactgac 2280
tgtcagtctg atgagccgca ggcgcccaga
atcggtgtgg tggcatgagg tgcagtcgca 2340
ggggatagat gaggtctcgg tgatgcatga
gaaatattcc ctagaacaag tcaagacttg 2400
ttggttggag cctgaggatg attgggaggt
agccatcagg aattatgcca agctagctct 2460
gaagccagac aagaagtaca agattaccaa
actgattaat atcagaaatt cctgctacat 2520
ttcagggaat ggggccgagg tggagatcag
tacccaggag agggtggcct tcagatgctg 2580
catgatgaat atgtacccgg gggtggtggg
catggaggga gtcaccttta tgaacgcgag 2640
gttcaggggc gatgggtata atggggtggt
ctttatggcc aacaccaagc tgacagtgca 2700
cggatgctcc ttctttggct tcaataacat
gtgcatcgag gcctggggca gtgtttcagt 2760
gaggggatgc agtttttcag ccaactggat
gggggtcgtg ggcagaacca agagcaaggt 2820
gtcagtgaag aaatgcctgt tcgagaggtg
ccacctgggg gtgatgagcg agggcgaagc 2880
caaagtcaaa cactgcgcct ctactgagac
gggctgcttt gtgctgatca agggcaatgc 2940
ccaagtcaag cataacatga tctgtggggc
ctcggatgag cgcggctacc agatgctgac 3000
ctgcgccggt gggaacagcc atatgctggc
caccgtgcat gtgacctcgc acccccgcaa 3060
gacatggccc gagttcgagc acaacgtcat
gacccgctgc aatgtgcacc tgggctcccg 3120
ccgaggcatg ttcatgccct accagtgcaa
catgcaattt gtgaaggtgc tgctggagcc 3180
cgatgccatg tccagagtga gcctgacggg
ggtgtttgac atgaatgtgg agatgtggaa 3240
aattctgaga tatgatgaat ccaagaccag
gtgccgggcc tgcgaatgcg gaggcaagca 3300
cgccaggctt cagcccgtgt gtgtggaggt
gacggaggac ctgcgacccg atcatttggt 3360
gttgtcctgc aacgggacgg agttcggctc
cagcggggaa gaatctgact agagtgagta 3420
gtgtttgggg gaggtggagg gcctggatga
ggggcagaat gactaaaatc tgtgtttttc 3480
tgcgcagcag catgagcgga agcgcctcct
ttgagggagg ggtattcagc ccttatctga 3540
cggggcgtct cccctcctgg gcgggagtgc
gtcagaatgt gatgggatcc acggtggacg 3600
gccggcccgt gcagcccgcg aactcttcaa
ccctgaccta cgcgaccctg agctcctcgt 3660
ccgtggacgc agctgccgcc gcagctgctg
cttccgccgc cagcgccgtg cgcggaatgg 3720
ccctgggcgc cggctactac agctctctgg
tggccaactc gagttccacc aataatcccg 3780
ccagcctgaa cgaggagaag ctgctgctgc
tgatggccca gctcgaggcc ctgacccagc 3840
gcctgggcga gctgacccag caggttgctc
agctgcaggc ggagacgcgg gccgcggttg 3900
ccacggtgaa aaccaaataa aaaatgaatc
aataaataaa cggagacggt tgttgatttt 3960
aacacagagt cttgaatctt tatttgattt
ttcgcgcgcg gtaggccctg gaccaccggt 4020
ctcgatcatt gagcacccgg tggatctttt
ccaggacccg gtagaggtgg gcttggatgt 4080
tgaggtacat gggcatgagc ccgtcccggg
ggtggaggta gctccattgc agggcctcgt 4140
gctcgggggt ggtgttgtaa atcacccagt
catagcaggg gcgcagggcg tggtgctgca 4200
cgatgtcctt gaggaggaga ctgatggcca
cgggcagccc cttggtgtag gtgttgacga 4260
acctgttgag ctgggaggga tgcatgcggg
gggagatgag atgcatcttg gcctggatct 4320
tgagattggc gatgttcccg cccagatccc
gccgggggtt catgttgtgc aggaccacca 4380
gcacggtgta tccggtgcac ttggggaatt
tgtcatgcaa cttggaaggg aaggcgtgaa 4440
agaatttgga gacgcccttg tggccgccca
ggttttccat gcactcatcc atgatgatgg 4500
cgatgggccc gtgggcggcg gcctgggcaa
agacgtttcg ggggtcggac acatcgtagt 4560
tgtggtcctg ggtgagctcg tcataggcca
ttttaatgaa tttggggcgg agggtgcccg 4620
actgggggac gaaggtgccc tcgatcccgg
gggcgtagtt gccctcgcag atctgcatct 4680
cccaggcctt gagctcggag ggggggatca
tgtccacctg cggggcgatg aaaaaaacgg 4740
tttccggggc gggggagatg agctgcgccg
aaagcaggtt ccggagcagc tgggacttgc 4800
cgcagccggt ggggccgtag atgaccccga
tgaccggctg caggtggtag ttgagggaga 4860
gacagctgcc gtcctcgcgg aggagggggg
ccacctcgtt catcatctcg cgcacatgca 4920
tgttctcgcg cacgagttcc gccaggaggc
gctcgccccc cagcgagagg agctcttgca 4980
gcgaggcgaa gtttttcagc ggcttgagcc
cgtcggccat gggcattttg gagagggtct 5040
gttgcaagag ttccagacgg tcccagagct
cggtgatgtg ctctagggca tctcgatcca 5100
gcagacctcc tcgtttcgcg ggttggggcg
actgcgggag tagggcacca ggcgatgggc 5160
gtccagcgag gccagggtcc ggtccttcca
gggtcgcagg gtccgcgtca gcgtggtctc 5220
cgtcacggtg aaggggtgcg cgccgggctg
ggcgcttgcg agggtgcgct tcaggctcat 5280
ccggctggtc gagaaccgct cccggtcggc
gccctgtgcg tcggccaggt agcaattgag 5340
catgagttcg tagttgagcg cctcggccgc
gtggcccttg gcgcggagct tacctttgga 5400
agtgtgtccg cagacgggac agaggaggga
cttgagggcg tagagcttgg gggcgaggaa 5460
gacggactcg ggggcgtagg cgtccgcgcc
gcagctggcg cagacggtct cgcactccac 5520
gagccaggtg aggtcggggc ggtcggggtc
aaaaacgagg tttcctccgt gctttttgat 5580
gcgtttctta cctctggtct ccatgagctc
gtgtccccgc tgggtgacaa agaggctgtc 5640
cgtgtccccg tagaccgact ttatgggccg
gtcctcgagc ggggtgccgc ggtcctcgtc 5700
gtagaggaac cccgcccact ccgagacgaa
ggcccgggtc caggccagca cgaaggaggc 5760
cacgtgggag gggtagcggt cgttgtccac
cagcgggtcc accttctcca gggtatgcaa 5820
gcacatgtcc ccctcgtcca catccaggaa
ggtgattggc ttgtaagtgt aggccacgtg 5880
accgggggtc ccggccgggg gggtataaaa
gggggcgggc ccctgctcgt cctcactgtc 5940
ttccggatcg ctgtccagga gcgccagctg
ttggggtagg tattccctct cgaaggcggg 6000
catgacctcg gcactcaggt tgtcagtttc
tagaaacgag gaggatttga tattgacggt 6060
gccgttggag acgcctttca tgagcccctc
gtccatctgg tcagaaaaga cgatcttttt 6120
gttgtcgagc ttggtggcga aggagccgta
gagggcgttg gagagcagct tggcgatgga 6180
gcgcatggtc tggttctttt ccttgtcggc
gcgctccttg gcggcgatgt tgagctgcac 6240
gtactcgcgc gccacgcact tccattcggg
gaagacggtg gtgagctcgt cgggcacgat 6300
tctgacccgc cagccgcggt tgtgcagggt
gatgaggtcc acgctggtgg ccacctcgcc 6360
gcgcaggggc tcgttggtcc agcagaggcg
cccgcccttg cgcgagcaga aggggggcag 6420
cgggtccagc atgagctcgt cgggggggtc
ggcgtccacg gtgaagatgc cgggcaggag 6480
ctcggggtcg aagtagctga tgcaggtgcc
cagatcgtcc agcgccgctt gccagtcgcg 6540
cacggccagc gcgcgctcgt aggggctgag
gggcgtgccc cagggcatgg ggtgcgtgag 6600
cgcggaggcg tacatgccgc agatgtcgta
gacgtagagg ggctcctcga ggacgccgat 6660
gtaggtgggg tagcagcgcc ccccgcggat
gctggcgcgc acgtagtcgt acagctcgtg 6720
cgagggcgcg aggagccccg tgccgaggtt
ggagcgttgc ggcttttcgg cgcggtagac 6780
gatctggcgg aagatggcgt gggagttgga
ggagatggtg ggcctctgga agatgttgaa 6840
gtgggcgtgg ggcaggccga ccgagtccct
gatgaagtgg gcgtaggagt cctgcagctt 6900
ggcgacgagc tcggcggtga cgaggacgtc
cagggcgcag tagtcgaggg tctcttggat 6960
gatgtcgtac ttgagctggc ccttctgctt
ccacagctcg cggttgagaa ggaactcttc 7020
gcggtccttc cagtactctt cgagggggaa
cccgtcctga tcggcacggt aagagcccac 7080
catgtagaac tggttgacgg ccttgtaggc
gcagcagccc ttctccacgg ggagggcgta 7140
agcttgcgcg gccttgcgca gggaggtgtg
ggtgagggcg aaggtgtcgc gcaccatgac 7200
tttgaggaac tggtgcttga agtcgaggtc
gtcgcagccg ccctgctccc agagttggaa 7260
gtccgtgcgc ttcttgtagg cggggttggg
caaagcgaaa gtaacatcgt tgaagaggat 7320
cttgcccgcg cggggcatga agttgcgagt
gatgcggaaa ggctggggca cctcggcccg 7380
gttgttgatg acctgggcgg cgaggacgat
ctcgtcgaag ccgttgatgt tgtgcccgac 7440
gatgtagagt tccacgaatc gcgggcagcc
cttgacgtgg ggcagcttct tgagctcgtc 7500
gtaggtgagc tcggcggggt cgctgagccc
gtgctgctcg agggcccagt cggcgacgtg 7560
ggggttggcg ctgaggaagg aagtccagag
atccacggcc agggcggtct gcaagcggtc 7620
ccggtactga cggaactgct ggcccacggc
cattttttcg ggggtgacgc agtagaaggt 7680
gcgggggtcg ccgtgccagc ggtcccactt
gagttggagg gcgaggtcgt gggcgagctc 7740
gacgagcggc gggtccccgg agagtttcat
gaccagcatg aaggggacga gctgcttgcc 7800
gaaggacccc atccaggtgt aggtttccac
atcgtaggtg aggaagagcc tttcggtgcg 7860
aggatgcgag ccgatgggga agaactggat
ctcctgccac cagttggagg aatggctgtt 7920
gatgtgatgg aagtagaaat gccgacggcg
cgccgagcac tcgtgcttgt gtttatacaa 7980
gcgtccgcag tgctcgcaac gctgcacggg
atgcacgtgc tgcacgagct gtacctgggt 8040
tcctttgacg aggaatttca gtgggcagtg
gagcgctggc ggctgcatct ggtgctgtac 8100
tacgtcctgg ccatcggcgt ggccatcgtc
tgcctcgatg gtggtcatgc tgacgagccc 8160
gcgcgggagg caggtccaga cctcggctcg
gacgggtcgg agagcgagga cgagggcgcg 8220
caggccggag ctgtccaggg tcctgagacg
ctgcggagtc aggtcagtgg gcagcggcgg 8280
cgcgcggttg acttgcagga gcttttccag
ggcgcgcggg aggtccagat ggtacttgat 8340
ctccacggcg ccgttggtgg cgacgtccac
ggcttgcagg gtcccgtgcc cctggggcgc 8400
caccaccgtg ccccgtttct tcttgggcgg
cggcggctcc atgcttagaa gcggcggcga 8460
ggacgcgcgc cgggcggcag gggcggctcg
gggcccggag gcaggggcgg caggggcacg 8520
tcggcgccgc gcgcgggcag gttctggtac
tgcgcccgga gaagactggc gtgagcgacg 8580
acgcgacggt tgacgtcctg gatctgacgc
ctctgggtga aggccacggg acccgtgagt 8640
ttgaacctga aagagagttc gacagaatca
atttcggtat cgttgacggc ggcctgccgc 8700
aggatctctt gcacgtcgcc cgagttgtcc
tggtaggcga tctcggtcat gaactgctcg 8760
atctcctcct cctgaaggtc tccgcggccg
gcgcgctcga cggtggccgc gaggtcgttg 8820
gagatgcggc ccatgagctg cgagaaggcg
ttcatgccgg cctcgttcca gacgcggctg 8880
tagaccacgg ctccgttggg gtcgcgcgcg
cgcatgacca cctgggcgag gttaagctcg 8940
acgtggcgcg tgaagaccgc gtagttgcag
aggcgctggt agaggtagtt gagcgtggtg 9000
gcgatgtgct cggtgacgaa gaagtacatg
atccagcggc ggagcggcat ctcgctgacg 9060
tcgcccaggg cttccaagcg ctccatggtc
tcgtagaagt ccacggcgaa gttgaaaaac 9120
tgggagttgc gcgccgagac ggtcaactcc
tcctccagaa gacggatgag ctcggcgatg 9180
gtggcgcgca cctcgcgctc gaaggccccg
gggggctcct cttcttccat ctcctcctcc 9240
tcttcctcct ccactaacat ctcttctact
tcctcctcag gaggcggcgg cgggggaggg 9300
gccctgcgtc gccggcggcg cacgggcaga
cggtcgatga agcgctcgat ggtctccccg 9360
cgccggcgac gcatggtctc ggtgacggcg
cgcccgtcct cgcggggccg cagcgtgaag 9420
acgccgccgc gcatctccag gtggccgccg
ggggggtctc cgttgggcag ggagagggcg 9480
ctgacgatgc atcttatcaa ttggcccgta
gggactccgc gcaaggacct gagcgtctcg 9540
agatccacgg gatccgaaaa ccgctgaacg
aaggcttcga gccagtcgca gtcgcaaggt 9600
aggctgagcc cggtttcttg ttcttcgggt
atttggtcgg gaggcgggcg ggcgatgctg 9660
ctggtgatga agttgaagta ggcggtcctg
agacggcgga tggtggcgag gagcaccagg 9720
tccttgggcc cggcttgctg gatgcgcaga
cggtcggcca tgccccaggc gtggtcctga 9780
cacctggcga ggtccttgta gtagtcctgc
atgagccgct ctacgggcac gtcctcctcg 9840
cccgcgcggc cgtgcatgcg cgtgagcccg
aacccgcgct gcggctggac gagcgccagg 9900
tcggcgacga cgcgctcggc gaggatggcc
tgctggatct gggtgagggt ggtctggaag 9960
tcgtcgaagt cgacgaagcg gtggtaggct
ccggtgttga tggtgtagga gcagttggcc 10020
atgacggacc agttgacggt ctggtggccg
gggcgcacga gctcgtggta cttgaggcgc 10080
gagtaggcgc gcgtgtcgaa gatgtagtcg
ttgcaggtgc gcacgaggta ctggtatccg 10140
acgaggaagt gcggcggcgg ctggcggtag
agcggccatc gctcggtggc gggggcgccg 10200
ggcgcgaggt cctcgagcat gaggcggtgg
tagccgtaga tgtacctgga catccaggtg 10260
atgccggcgg cggtggtgga ggcgcgcggg
aactcgcgga cgcggttcca gatgttgcgc 10320
agcggcagga agtagttcat ggtggccgcg
gtctggcccg tgaggcgcgc gcagtcgtgg 10380
atgctctaga catacgggca aaaacgaaag
cggtcagcgg ctcgactccg tggcctggag 10440
gctaagcgaa cgggttgggc tgcgcgtgta
ccccggttcg aatctcgaat caggctggag 10500
ccgcagctaa cgtggtactg gcactcccgt
ctcgacccaa gcctgctaac gaaacctcca 10560
ggatacggag gcgggtcgtt ttttggcctt
ggtcgctggt catgaaaaac tagtaagcgc 10620
ggaaagcggc cgcccgcgat ggctcgctgc
cgtagtctgg agaaagaatc gccagggttg 10680
cgttgcggtg tgccccggtt cgagcctcag
cgctcggtgc cggccggatt ccgcggctaa 10740
cgtgggcgtg gctgccccgt cgtttccaag
accccttagc cagccgactt ctccagttac 10800
ggagcgagcc cctctttttc ttgtgttttt
gccagatgca tcccgtactg cggcagatgc 10860
gcccccaccc tccaccacaa ccgcccctac
cgcagcagca gcaacagccg gcgcttctgc 10920
ccccgcccca gcagcagcag ccagccacta
ccgcggcggc cgccgtgagc ggagccggcg 10980
ttcagtatga cctggccttg gaagagggcg
aggggctggc gcggctgggg gcgtcgtcgc 11040
cggagcggca cccgcgcgtg cagatgaaaa
gggacgctcg cgaggcctac gtgcccaagc 11100
agaacctgtt cagagacagg agcggcgagg
agcccgagga gatgcgcgcc tcccgcttcc 11160
acgcggggcg ggagctgcgg cgcggcctgg
accgaaagcg ggtgctgagg gacgaggatt 11220
tcgaggcgga cgagctgacg gggatcagcc
ccgcgcgcgc gcacgtggcc gcggccaacc 11280
tggtcacggc gtacgagcag accgtgaagg
aggagagcaa ctttcaaaaa tccttcaaca 11340
accacgtgcg cacgctgatc gcgcgcgagg
aggtgaccct gggcctgatg cacctgtggg 11400
acctgctgga ggccatcgtg cagaacccca
cgagcaagcc gctgacggcg cagctgtttc 11460
tggtggtgca gcacagtcgg gacaacgaga
cgttcaggga ggcgctgctg aatatcaccg 11520
agcccgaggg ccgctggctc ctggacctgg
tgaacattct gcagagcatc gtggtgcagg 11580
agcgcgggct gccgctgtcc gagaagctgg
cggccatcaa cttctcggtg ctgagcctgg 11640
gcaagtacta cgctaggaag atctacaaga
ccccgtacgt gcccatagac aaggaggtga 11700
agatcgacgg gttttacatg cgcatgaccc
tgaaagtgct gaccctgagc gacgatctgg 11760
gggtgtaccg caacgacagg atgcaccgcg
cggtgagcgc cagccgccgg cgcgagctga 11820
gcgaccagga gctgatgcac agcctgcagc
gggccctgac cggggccggg accgaggggg 11880
agagctactt tgacatgggc gcggacctgc
gctggcagcc cagccgccgg gccttggaag 11940
ctgccggcgg cgtgccctac gtggaggagg
tggacgatga ggaggaggag ggcgagtacc 12000
tggaagactg atggcgcgac cgtatttttg
ctagatgcag caacagccac cgccgcctcc 12060
tgatcccgcg atgcgggcgg cgctgcagag
ccagccgtcc ggcattaact cctcggacga 12120
ttggacccag gccatgcaac gcatcatggc
gctgacgacc cgcaatcccg aagcctttag 12180
acagcagcct caggccaacc ggctctcggc
catcctggag gccgtggtgc cctcgcgctc 12240
gaaccccacg cacgagaagg tgctggccat
cgtgaacgcg ctggtggaga acaaggccat 12300
ccgcggcgac gaggccgggc tggtgtacaa
cgcgctgctg gagcgcgtgg cccgctacaa 12360
cagcaccaac gtgcagacga acctggaccg
catggtgacc gacgtgcgcg aggcggtgtc 12420
gcagcgcgag cggttccacc gcgagtcgaa
cctgggctcc atggtggcgc tgaacgcctt 12480
cctgagcacg cagcccgcca acgtgccccg
gggccaggag gactacacca acttcatcag 12540
cgcgctgcgg ctgatggtgg ccgaggtgcc
ccagagcgag gtgtaccagt cggggccgga 12600
ctacttcttc cagaccagtc gccagggctt
gcagaccgtg aacctgagcc aggctttcaa 12660
gaacttgcag ggactgtggg gcgtgcaggc
cccggtcggg gaccgcgcga cggtgtcgag 12720
cctgctgacg ccgaactcgc gcctgctgct
gctgctggtg gcgcccttca cggacagcgg 12780
cagcgtgagc cgcgactcgt acctgggcta
cctgcttaac ctgtaccgcg aggccatcgg 12840
gcaggcgcac gtggacgagc agacctacca
ggagatcacc cacgtgagcc gcgcgctggg 12900
ccaggaggac ccgggcaacc tggaggccac
cctgaacttc ctgctgacca accggtcgca 12960
gaagatcccg ccccagtacg cgctgagcac
cgaggaggag cgcatcctgc gctacgtgca 13020
gcagagcgtg gggctgttcc tgatgcagga
gggggccacg cccagcgccg cgctcgacat 13080
gaccgcgcgc aacatggagc ccagcatgta
cgcccgcaac cgcccgttca tcaataagct 13140
gatggactac ttgcatcggg cggccgccat
gaactcggac tactttacca acgccatctt 13200
gaacccgcac tggctcccgc cgcccgggtt
ctacacgggc gagtacgaca tgcccgaccc 13260
caacgacggg ttcctgtggg atgacgtgga
cagcagcgtg ttctcgccgc gtcccaccac 13320
caccgtgtgg aagaaagagg gcggggaccg
gcggccgtcc tcggcgctgt ccggtcgcgc 13380
gggtgctgcc gcggcggtgc ccgaggccgc
cagccccttt ccgagcctgc ccttttcgct 13440
gaacagcgtg cgcagcagcg agctgggtcg
gctgacgcgg ccgcgcctgc tgggcgagga 13500
ggagtacctg aacgactcct tgttgaggcc
cgagcgcgaa aagaacttcc ccaataacgg 13560
gatagagagc ctggtggaca agatgagccg
ctggaagacg tacgcgcacg agcacaggga 13620
cgagccccga gctagcagcg caggcacccg
tagacgccag cggcacgaca ggcagcgggg 13680
tctggtgtgg gacgatgagg attccgccga
cgacagcagc gtgttggact tgggtgggag 13740
tggtggtggt aacccgttcg ctcacttgcg
cccccgtatc gggcgcctga tgtaagaatc 13800
tgaaaaataa aaaacggtac tcaccaaggc
catggcgacc agcgtgcgtt cttctctgtt 13860
gtttgtagta gtatgatgag gcgcgtgtac
ccggagggtc ctcctccctc gtacgagagc 13920
gtgatgcagc aggcggtggc ggcggcgatg
cagcccccgc tggaggcgcc ttacgtgccc 13980
ccgcggtacc tggcgcctac ggaggggcgg
aacagcattc gttactcgga gctggcaccc 14040
ttgtacgata ccacccggtt gtacctggtg
gacaacaagt cggcggacat cgcctcgctg 14100
aactaccaga acgaccacag caacttcctg
accaccgtgg tgcagaacaa cgatttcacc 14160
cccacggagg ccagcaccca gaccatcaac
tttgacgagc gctcgcggtg gggcggccag 14220
ctgaaaacca tcatgcacac caacatgccc
aacgtgaacg agttcatgta cagcaacaag 14280
ttcaaggcgc gggtgatggt ctcgcgcaag
acccccaacg gggtcacagt aacagatggt 14340
agtcaggacg agctgaccta cgagtgggtg
gagtttgagc tgcccgaggg caacttctcg 14400
gtgaccatga ccatcgatct gatgaacaac
gccatcatcg acaactactt ggcggtgggg 14460
cggcagaacg gggtgctgga gagcgacatc
ggcgtgaagt tcgacacgcg caacttccgg 14520
ctgggctggg accccgtgac cgagctggtg
atgccgggcg tgtacaccaa cgaggccttc 14580
caccccgaca tcgtcctgct gcccggctgc
ggcgtggact tcaccgagag ccgcctcagc 14640
aacctgctgg gcatccgcaa gcggcagccc
ttccaggagg gcttccagat cctgtacgag 14700
gacctggagg ggggcaacat ccccgcgctc
ttggatgtcg aagcctacga gaaaagcaag 14760
gaggatagca ccgccgtggc taccgccgcg
actgtggcag atgccactgt caccaggggc 14820
gatacattcg ccacccaggc ggaggaagca
gccgccctag cggcgaccga tgatagtgaa 14880
agtaagatag ttatcaagcc ggtggagaag
gacagcaagg acaggagcta caacgttcta 14940
tcggatggaa agaacaccgc ctaccgcagc
tggtacctgg cctacaacta cggcgacccc 15000
gagaagggcg tgcgctcctg gacgctgctc
accacctcgg acgtcacctg cggcgtggag 15060
caagtctact ggtcgctgcc cgacatgatg
caagacccgg tcaccttccg ctccacgcgt 15120
caagttagca actacccggt ggtgggcgcc
gagctcctgc ccgtctactc caagagcttc 15180
ttcaacgagc aggccgtcta ctcgcagcag
ctgcgcgcct tcacctcgct cacgcacgtc 15240
ttcaaccgct tccccgagaa ccagatcctc
gtccgcccgc ccgcgcccac cattaccacc 15300
gtcagtgaaa acgttcctgc tctcacagat
cacgggaccc tgccgctgcg cagcagtatc 15360
cggggagtcc agcgcgtgac cgtcactgac
gccagacgcc gcacctgccc ctacgtctac 15420
aaggccctgg gcgtagtcgc gccgcgcgtc
ctctcgagcc gcaccttcta aaaaatgtcc 15480
attctcatct cgcccagtaa taacaccggt
tggggcctgc gcgcgcccag caagatgtac 15540
ggaggcgctc gccaacgctc cacgcaacac
cccgtgcgcg tgcgcgggca cttccgcgct 15600
ccctggggcg ccctcaaggg tcgcgtgcgc
tcgcgcacca ccgtcgacga cgtgatcgac 15660
caggtggtgg ccgacgcgcg caactacacg
cccgccgccg cgcccgcctc caccgtggac 15720
gccgtcatcg acagcgtggt ggccgacgcg
cgccggtacg cccgcgccaa gagccggcgg 15780
cggcgcatcg cccggcggca ccggagcacc
cccgccatgc gcgcggcgcg agccttgctg 15840
cgcagggcca ggcgcacggg acgcagggcc
atgctcaggg cggccagacg cgcggcctcc 15900
ggcagcagca gcgccggcag gacccgcaga
cgcgcggcca cggcggcggc ggcggccatc 15960
gccagcatgt cccgcccgcg gcgcggcaac
gtgtactggg tgcgcgacgc cgccaccggt 16020
gtgcgcgtgc ccgtgcgcac ccgcccccct
cgcacttgaa gatgctgact tcgcgatgtt 16080
gatgtgtccc agcggcgagg aggatgtcca
agcgcaaata caaggaagag atgctccagg 16140
tcatcgcgcc tgagatctac ggccccgcgg
cggcggtgaa ggaggaaaga aagccccgca 16200
aactgaagcg ggtcaaaaag gacaaaaagg
aggaggaaga tgtggacgga ctggtggagt 16260
ttgtgcgcga gttcgccccc cggcggcgcg
tgcagtggcg cgggcggaaa gtgaaaccgg 16320
tgctgcggcc cggcaccacg gtggtcttca
cgcccggcga gcgttccggc tccgcctcca 16380
agcgctccta cgacgaggtg tacggggacg
aggacatcct cgagcaggcg gccgagcgtc 16440
tgggcgagtt tgcttacggc aagcgcagcc
gccccgcgcc cttgaaagag gaggcggtgt 16500
ccatcccgct ggaccacggc aaccccacgc
cgagcctgaa gccggtgacc ctgcagcagg 16560
tgctgccgag cgcggcgccg cgccggggct
tcaagcgcga gggcggcgag gatctgtacc 16620
cgaccatgca gctgatggtg cccaagcgcc
agaagctgga ggacgtgctg gagcacatga 16680
aggtggaccc cgaggtgcag cccgaggtca
aggtgcggcc catcaagcag gtggccccgg 16740
gcctgggcgt gcagaccgtg gacatcaaga
tccccacgga gcccatggaa acgcagaccg 16800
agcccgtgaa gcccagcacc agcaccatgg
aggtgcagac ggatccctgg atgccggcgc 16860
cggcttccac caccactcgc cgaagacgca
agtacggcgc ggccagcctg ctgatgccca 16920
actacgcgct gcatccttcc atcatcccca
cgccgggcta ccgcggcacg cgcttctacc 16980
gcggctacag cagccgccgc aagaccacca
cccgccgccg ccgtcgccgc acccgccgca 17040
gcaccaccgc gacttccgcc gccgccttgg
tgcggagagt gtaccgcagc gggcgtgagc 17100
ctctgaccct gccgcgcgcg cgctaccacc
cgagcatcgc catttaactc tgccgtcgcc 17160
tccttgcaga tatggccctc acatgccgcc
tccgcgtccc cattacgggc taccgaggaa 17220
gaaagccgcg ccgtagaagg ctgacgggga
acgggctgcg tcgccatcac caccggcggc 17280
ggcgcgccat cagcaagcgg ttggggggag
gcttcctgcc cgcgctgatc cccatcatcg 17340
ccgcggcgat cggggcgatc cccggcatag
cttccgtggc ggtgcaggcc tctcagcgcc 17400
actgagacac agcttggaaa atttgtaata
aaaaaatgga ctgacgctcc tggtcctgtg 17460
atgtgtgttt ttagatggaa gacatcaatt
tttcgtccct ggcaccgcga cacggcacgc 17520
ggccgtttat gggcacctgg agcgacatcg
gcaacagcca actgaacggg ggcgccttca 17580
attggagcag tctctggagc gggcttaaga
atttcgggtc cacgctcaaa acctatggca 17640
acaaggcgtg gaacagcagc acagggcagg
cgctgaggga aaagctgaaa gagcagaact 17700
tccagcagaa ggtggtcgat ggcctggcct
cgggcatcaa cggggtggtg gacctggcca 17760
accaggccgt gcagaaacag atcaacagcc
gcctggacgc ggtcccgccc gcggggtccg 17820
tggagatgcc ccaggtggag gaggagctgc
ctcccctgga caagcgcggc gacaagcgac 17880
cgcgtcccga cgcggaggag acgctgctga
cgcacacgga cgagccgccc ccgtacgagg 17940
aggcggtgaa actgggtctg cccaccacgc
ggcccgtggc gcctctggcc accggggtgc 18000
tgaaacccag cagcagcagc agccagcccg
cgaccctgga cttgcctcca cctcgcccct 18060
ccacagtggc taagcccctg ccgccggtgg
ccgtcgcgtc gcgcgccccc cgaggccgcc 18120
cccaggcgaa ctggcagagc actctgaaca
gcatcgtggg tctgggagtg cagagtgtga 18180
agcgccgccg ctgctattaa aagacactgt
agcgcttaac ttgcttgtct gtgtgtatat 18240
gtatgtccgc cgaccagaag gaggaggaag
aggcgcgtcg ccgagttgca agatggccac 18300
cccatcgatg ctgccccagt gggcgtacat
gcacatcgcc ggacaggacg cttcggagta 18360
cctgagtccg ggtctggtgc agttcgcccg
cgccacagac acctacttca gtctggggaa 18420
caagtttagg aaccccacgg tggcacccac
gcacgatgtg accaccgacc gcagccagcg 18480
gctgacgctg cgcttcgtgc ccgtggaccg
cgaggacaac acctactcgt acaaagtgcg 18540
ctacacgctg gccgtgggcg acaaccgcgt
gctggacatg gccagcacct actttgacat 18600
ccgcggcgtg ctggatcggg gccccagctt
caaaccctac tccggcaccg cctacaacag 18660
cctggctccc aagggagcgc ccaacacctc
acagtggata accaaagaca atggaactga 18720
taagacatac agttttggaa atgctccagt
cagaggattg gacattacag aagagggtct 18780
ccaaatagga accgatgagt cagggggtga
aagcaagaaa atttttgcag acaaaaccta 18840
tcagcctgaa cctcagcttg gagatgagga
atggcatgat actattggag ctgaagacaa 18900
gtatggaggc agagcgctta aacctgccac
caacatgaaa ccctgctatg ggtctttcgc 18960
caagccaact aatgctaagg gaggtcaggc
taaaagcaga accaaggacg atggcactac 19020
tgagcctgat attgacatgg ccttctttga
cgatcgcagt cagcaagcta gtttcagtcc 19080
agaacttgtt ttgtatactg agaatgtcga
tctggacacc ccggataccc acattattta 19140
caaacctggc actgatgaaa caagttcttc
tttcaacttg ggtcagcagt ccatgcccaa 19200
cagacccaac tacattggct tcagagacaa
ctttatcggg ctcatgtact acaacagcac 19260
tggcaatatg ggtgtactgg ccggtcaggc
ctcccagctg aatgctgtgg tggacttgca 19320
ggacagaaac actgaactgt cctaccagct
cttgcttgac tctctgggtg acagaaccag 19380
gtatttcagt atgtggaatc aggcggtgga
cagctatgac cccgatgtgc gcattattga 19440
aaatcacggt gtggaggatg aactccccaa
ctattgcttc cctttgaatg gtgtgggctt 19500
tacagataca ttccagggaa ttaaggttaa
aactacaaat aacggaacag caaatgctac 19560
agagtgggaa tctgatacct ctgtcaataa
tgctaatgag attgccaagg gcaatccttt 19620
cgccatggag atcaacatcc aggccaacct
gtggcggaac ttcctctacg cgaacgtggc 19680
gctgtacctg cccgactcct acaagtacac
gccggccaac atcacgctgc ccaccaacac 19740
caacacctac gattacatga acggccgcgt
ggtggcgccc tcgctggtgg acgcctacat 19800
caacatcggg gcgcgctggt cgctggaccc
catggacaac gtcaacccct tcaaccacca 19860
ccgcaacgcg ggcctgcgct accgctccat
gctcctgggc aacgggcgct acgtgccctt 19920
ccacatccag gtgccccaaa agtttttcgc
catcaagagc ctcctgctcc tgcccgggtc 19980
ctacacctac gagtggaact tccgcaagga
cgtcaacatg atcctgcaga gctccctcgg 20040
caacgacctg cgcacggacg gggcctccat
cgccttcacc agcatcaacc tctacgccac 20100
cttcttcccc atggcgcaca acaccgcctc
cacgctcgag gccatgctgc gcaacgacac 20160
caacgaccag tccttcaacg actacctctc
ggcggccaac atgctctacc ccatcccggc 20220
caacgccacc aacgtgccca tctccatccc
ctcgcgcaac tgggccgcct tccgcggatg 20280
gtccttcacg cgcctcaaga cccgcgagac
gccctcgctc ggctccgggt tcgaccccta 20340
cttcgtctac tcgggctcca tcccctacct
cgacggcacc ttctacctca accacacctt 20400
caagaaggtc tccatcacct tcgactcctc
cgtcagctgg cccggcaacg accgcctcct 20460
gacgcccaac gagttcgaaa tcaagcgcac
cgtcgacgga gaggggtaca acgtggccca 20520
gtgcaacatg accaaggact ggttcctggt
ccagatgctg gcccactaca acatcggcta 20580
ccagggcttc tacgtgcccg agggctacaa
ggaccgcatg tactccttct tccgcaactt 20640
ccagcccatg agccgccagg tcgtggacga
ggtcaactac aaggactacc aggccgtcac 20700
cctggcctac cagcacaaca actcgggctt
cgtcggctac ctcgcgccca ccatgcgcca 20760
gggccagccc taccccgcca actaccccta
cccgctcatc ggcaagagcg ccgtcgccag 20820
cgtcacccag aaaaagttcc tctgcgaccg
ggtcatgtgg cgcatcccct tctccagcaa 20880
cttcatgtcc atgggcgcgc tcaccgacct
cggccagaac atgctctacg ccaactccgc 20940
ccacgcgcta gacatgaatt tcgaagtcga
ccccatggat gagtccaccc ttctctatgt 21000
tgtcttcgaa gtcttcgacg tcgtccgagt
gcaccagccc caccgcggcg tcatcgaggc 21060
cgtctacctg cgcacgccct tctcggccgg
caacgccacc acctaagcct cttgcttctt 21120
gcaagatgac ggcctgtggc tccggcgagc
aggagctcag ggccatcctc cgcgacctgg 21180
gctgcgggcc ctacttcctg ggcaccttcg
acaagcgctt cccgggattc atggccccgc 21240
acaagctggc ctgcgccatc gtcaacacgg
ccggccgcga gaccgggggc gagcactggc 21300
tggccttcgc ctggaacccg cgcacccaca
cctgctacct cttcgacccc ttcgggttct 21360
cggacgagcg cctcaagcag atctaccagt
tcgagtacga gggcctgctg cgccgcagcg 21420
ccctggccac cgaggaccgc tgcgtcaccc
tggaaaagtc cacccagacc gtgcagggtc 21480
cgcgctcggc cgcctgcggg ctcttctgct
gcatgttcct gcacgccttc gtgcactggc 21540
ccgaccgccc catggacaag aaccccacca
tgaacttgct gacgggggtg cccaacggca 21600
tgctccagtc gccccaggtg gaacccaccc
tgcgccgcaa ccaggaggcg ctctaccgct 21660
tcctcaacgc ccactccgcc tactttcgct
cccaccgcgc gcgcatcgag aaggccaccg 21720
ccttcgaccg catgaatcaa gacatgtaaa
ctgtgtgtat gtgaatgctt tattcataat 21780
aaacagcaca tgtttatgcc accttctctg
aggctctgac tttatttaga aatcgaaggg 21840
gttctgccgg ctctcggcgt gccccgcggg
cagggatacg ttgcggaact ggtacttggg 21900
cagccacttg aactcgggga tcagcagctt
cggcacgggg aggtcgggga acgagtcgct 21960
ccacagcttg cgcgtgagtt gcagggcgcc
cagcaggtcg ggcgcggata tcttgaaatc 22020
acagttggga cccgcgttct gcgcgcgaga
gttgcggtac acggggttgc agcactggaa 22080
caccatcagg gccgggtgct tcacgctcgc
cagcaccgtc gcgtcggtga tgccctccac 22140
gtccagatcc tcggcgttgg ccatcccgaa
gggggtcatc ttgcaggtct gccgccccat 22200
gctgggcacg cagccgggct tgtggttgca
atcgcagtgc agggggatca gcatcatctg 22260
ggcctgctcg gagctcatgc ccgggtacat
ggccttcatg aaagcctcca gctggcggaa 22320
ggcctgctgc gccttgccgc cctcggtgaa
gaagaccccg caggacttgc tagagaactg 22380
gttggtggcg cagccggcgt cgtgcacgca
gcagcgcgcg tcgttgttgg ccagctgcac 22440
cacgctgcgc ccccagcggt tctgggtgat
cttggcccgg tcggggttct ccttcagcgc 22500
gcgctgcccg ttctcgctcg ccacatccat
ctcgatcgtg tgctccttct ggatcatcac 22560
ggtcccgtgc aggcaccgca gcttgccctc
ggcttcggtg catccgtgca gccacagcgc 22620
gcagccggtg cactcccagt tcttgtgggc
gatctgggag tgcgagtgca cgaagccctg 22680
caggaagcgg cccatcatcg cggtcagggt
cttgttgctg gtgaaggtca gcgggatgcc 22740
gcggtgctcc tcgttcacat acaggtggca
gatgcggcgg tacacctcgc cctgctcggg 22800
catcagctgg aaggcggact tcaggtcgct
ctccacgcgg taccgctcca tcagcagcgt 22860
catgacttcc atgcccttct cccaggccga
aacgatcggc aggctcaggg ggttcttcac 22920
cgttgtcatc ttagtcgccg ccgccgaggt
cagggggtcg ttctcgtcca gggtctcaaa 22980
cactcgcttg ccgtccttct cggtgatgcg
cacgggggga aagctgaagc ccacggccgc 23040
cagctcctcc tcggcctgcc tttcgtcctc
gctgtcctgg ctgatgtctt gcaaaggcac 23100
atgcttggtc ttgcggggtt tctttttggg
cggcagaggc ggcggcggag acgtgctggg 23160
cgagcgcgag ttctcgctca ccacgactat
ttcttcttct tggccgtcgt ccgagaccac 23220
gcggcggtag gcatgcctct tctggggcag
aggcggaggc gacgggctct cgcggttcgg 23280
cgggcggctg gcagagcccc ttccgcgttc
gggggtgcgc tcctggcggc gctgctctga 23340
ctgacttcct ccgcggccgg ccattgtgtt
ctcctaggga gcaagcatgg agactcagcc 23400
atcgtcgcca acatcgccat ctgcccccgc
cgccgccgac gagaaccagc agcagcagaa 23460
tgaaagctta accgccccgc cgcccagccc
cacctccgac gccgcggccc cagacatgca 23520
agagatggag gaatccatcg agattgacct
gggctacgtg acgcccgcgg agcacgagga 23580
ggagctggca gcgcgctttt cagccccgga
agagaaccac caagagcagc cagagcagga 23640
agcagagagc gagcagagcc aggctgggct
cgagcatggc gactacctga gcggggcaga 23700
ggacgtgctc atcaagcatc tggcccgcca
atgcatcatc gtcaaggatg cgctgctcga 23760
ccgcgccgag gtgcccctca gcgtggcgga
gctcagccgc gcctacgagc gcaacctctt 23820
ctcgccgcgc gtgcccccca agcgccagcc
caacggcacc tgcgagccca acccgcgcct 23880
caacttctac ccggtcttcg cggtgcccga
ggccctggcc acctaccacc tctttttcaa 23940
gaaccaaagg atccccgtct cctgccgcgc
caaccgcacc cgcgccgacg ccctgctcaa 24000
cctgggcccc ggcgcccgcc tacctgatat
cgcctccttg gaagaggttc ccaagatctt 24060
cgagggtctg ggcagcgacg agactcgggc
cgcgaacgct ctgcaaggaa gcggagagga 24120
gcatgagcac cacagcgccc tggtggagtt
ggaaggcgac aacgcgcgcc tggcggtcct 24180
caagcgcacg gtcgagctga cccacttcgc
ctacccggcg ctcaacctgc cccccaaggt 24240
catgagcgcc gtcatggacc aggtgctcat
caagcgcgcc tcgcccctct cggaggagga 24300
gatgcaggac cccgagagct cggacgaggg
caagcccgtg gtcagcgacg agcagctggc 24360
gcgctggctg ggagcgagta gcacccccca
gagcctggaa gagcggcgca agctcatgat 24420
ggccgtggtc ctggtgaccg tggagctgga
gtgtctgcgc cgcttcttcg ccgacgcgga 24480
gaccctgcgc aaggtcgagg agaacctgca
ctacctcttc aggcacgggt tcgtgcgcca 24540
ggcctgcaag atctccaacg tggagctgac
caacctggtc tcctacatgg gcatcctgca 24600
cgagaaccgc ctggggcaga acgtgctgca
caccaccctg cgcggggagg cccgccgcga 24660
ctacatccgc gactgcgtct acctgtacct
ctgccacacc tggcagacgg gcatgggcgt 24720
gtggcagcag tgcctggagg agcagaacct
gaaagagctc tgcaagctcc tgcagaagaa 24780
cctgaaggcc ctgtggaccg ggttcgacga
gcgcaccacc gcctcggacc tggccgacct 24840
catcttcccc gagcgcctgc ggctgacgct
gcgcaacggg ctgcccgact ttatgagcca 24900
aagcatgttg caaaactttc gctctttcat
cctcgaacgc tccgggatcc tgcccgccac 24960
ctgctccgcg ctgccctcgg acttcgtgcc
gctgaccttc cgcgagtgcc ccccgccgct 25020
ctggagccac tgctacctgc tgcgtctggc
caactacctg gcctaccact cggacgtgat 25080
cgaggacgtc agcggcgagg gtctgctcga
gtgccactgc cgctgcaacc tctgcacgcc 25140
gcaccgctcc ctggcctgca acccccagct
gctgagcgag acccagatca tcggcacctt 25200
cgagttgcaa ggccccggcg aggagggcaa
ggggggtctg aaactcaccc cggggctgtg 25260
gacctcggcc tacttgcgca agttcgtgcc
cgaggactac catcccttcg agatcaggtt 25320
ctacgaggac caatcccagc cgcccaaggc
cgagctgtcg gcctgcgtca tcacccaggg 25380
ggccatcctg gcccaattgc aagccatcca
gaaatcccgc caagaatttc tgctgaaaaa 25440
gggccacggg gtctacttgg acccccagac
cggagaggag ctcaacccca gcttccccca 25500
ggatgcccag aggaagcagc aagaagctga
aagtggagct gccgctgccg ccggaggatt 25560
tggaggaaga ctgggagagc agtcaggcag
aggaggagga gatggaagac tgggacagca 25620
ctcaggcaga ggaggacagc ctgcaagaca
gtctggaaga cgaggtggag gaggaggcag 25680
aggaagaagc agccgccgcc agaccgtcgt
cctcggcgga gaaagcaagc agcacggata 25740
ccatctccgc tccgggtcgg ggtctcggcg
gccgggccca cagtaggtgg gacgagaccg 25800
ggcgcttccc gaaccccacc acccagaccg
gtaagaagga gcggcaggga tacaagtcct 25860
ggcgggggca caaaaacgcc atcgtctcct
gcttgcaagc ctgcgggggc aacatctcct 25920
tcacccggcg ctacctgctc ttccaccgcg
gggtgaactt cccccgcaac atcttgcatt 25980
actaccgtca cctccacagc ccctactact
gtttccaaga agaggcagaa acccagcagc 26040
agcagaaaac cagcagcagc tagaaaatcc
acagcggcgg cggcggcagg tggactgagg 26100
atcgcggcga acgagccggc gcagacccgg
gagctgagga accggatctt tcccaccctc 26160
tatgccatct tccagcagag tcgggggcag
gagcaggaac tgaaagtcaa gaaccgttct 26220
ctgcgctcgc tcacccgcag ttgtctgtat
cacaagagcg aagaccaact tcagcgcact 26280
ctcgaggacg ccgaggctct cttcaacaag
tactgcgcgc tcactcttaa agagtagccc 26340
gcgcccgccc acacacggaa aaaggcggga
attacgtcac cacctgcgcc cttcgcccga 26400
ccatcatcat gagcaaagag attcccacgc
cttacatgtg gagctaccag ccccagatgg 26460
gcctggccgc cggcgccgcc caggactact
ccacccgcat gaactggctc agtgccgggc 26520
ccgcgatgat ctcacgggtg aatgacatcc
gcgcccgccg aaaccagata ctcctagaac 26580
agtcagcgat caccgccacg ccccgccatc
accttaatcc gcgtaattgg cccgccgccc 26640
tggtgtacca ggaaattccc cagcccacga
ccgtactact tccgcgagac gcccaggccg 26700
aagtccagct gactaactca ggtgtccagc
tggccggcgg cgccgccctg tgtcgtcacc 26760
gccccgctca gggtataaag cggctggtga
tccgaggcag aggcacacag ctcaacgacg 26820
aggtggtgag ctcttcgctg ggtctgcgac
ctgacggagt cttccaactc gccggatcgg 26880
ggagatcttc cttcacgcct cgtcaggccg
tcctgacttt ggagagttcg tcctcgcagc 26940
cccgctcggg tggcatcggc actctccagt
tcgtggagga gttcactccc tcggtctact 27000
tcaacccctt ctccggctcc cccggccact
acccggacga gttcatcccg aacttcgacg 27060
ccatcagcga gtcggtggac ggctacgatt
gaatgtccca tggtggcgcg gctgacctag 27120
ctcggcttcg acacctggac cactgccgcc
gcttccgctg cttcgctcgg gatctcgccg 27180
agtttgccta ctttgagctg cccgaggagc
accctcaggg cccggcccac ggagtgcgga 27240
tcatcgtcga agggggcctc gactcccacc
tgcttcggat cttcagccag cgtccgatcc 27300
tggtcgagcg cgagcaagga cagacccgtc
tgaccctgta ctgcatctgc aaccaccccg 27360
gcctgcatga aagtctttgt tgtctgctgt
gtactgagta taataaaagc tgagatcagc 27420
gactactccg gacttccgtg tgttcctgaa
tccatcaacc agtccctgtt cttcaccggg 27480
aacgagaccg agctccagct ccagtgtaag
ccccacaaga agtacctcac ctggctgttc 27540
cagggctccc cgatcgccgt tgtcaaccac
tgcgacaacg acggagtcct gctgagcggc 27600
cctgccaacc ttactttttc cacccgcaga
agcaagctcc agctcttcca acccttcctc 27660
cccgggacct atcagtgcgt ctcgggaccc
tgccatcaca ccttccacct gatcccgaat 27720
accacagcgt cgctccccgc tactaacaac
caaactaccc accaacgcca ccgtcgcgac 27780
ctttcctctg aatctaatac cactaccgga
ggtgagctcc gaggtcgacc aacctctggg 27840
atttactacg gcccctggga ggtggtgggg
ttaatagcgc taggcctagt tgtgggtggg 27900
cttttggctc tctgctacct atacctccct
tgctgttcgt acttagtggt gctgtgttgc 27960
tggtttaaga aatggggcag atcaccctag
tgagctgcgg tgtgctggtg gcggtggtgc 28020
tttcgattgt gggactgggc ggcgcggctg
tagtgaagga gaaggccgat ccctgcttgc 28080
atttcaatcc cgacaaatgc cagctgagtt
ttcagcccga tggcaatcgg tgcgcggtgc 28140
tgatcaagtg cggatgggaa tgcgagaacg
tgagaatcga gtacaataac aagactcgga 28200
acaatactct cgcgtccgtg tggcagcccg
gggaccccga gtggtacacc gtctctgtcc 28260
ccggtgctga cggctccccg cgcaccgtga
ataatacttt catttttgcg cacatgtgcg 28320
acacggtcat gtggatgagc aagcagtacg
atatgtggcc ccccacgaag gagaacatcg 28380
tggtcttctc catcgcttac agcctgtgca
cggtgctaat caccgctatc gtgtgcctga 28440
gcattcacat gctcatcgct attcgcccca
gaaataatgc cgaaaaagag aaacagccat 28500
aacacgtttt ttcacacacc ttgtttttac
agacaatgcg tctgttaaat tttttaaaca 28560
ttgtgctcag tattgcttat gcctctggct
atgcaaacat acagaaaacc ctctatgtag 28620
gatctgatga tacactagag ggtacccaat
cacaagctag ggtttcatgg tatttttata 28680
aaagctcaga taatcctatt actctttgca
aaggtgatca ggggcggaca acaaagccgc 28740
ctatcacatt tagctgtacc agaacaaatc
tcacgctttt ctcaattaca aaacaatatg 28800
ctggtattta ttacagtaca aactttcata
gtgggcaaga taaatattat actgttaagg 28860
tagaaaatcc taccactcct agaactacca
ccaccaccac caccaccacc actactgcga 28920
agcccactaa acctaaaact accaagaaaa
ccactgtgaa aactacaact agaaccacca 28980
caactacaga aaccaccacc agcacaacac
ttgctgcaac tacacacaca cacactgagc 29040
taaccttaca gaccactaat gatttgatag
ccctgttgca aaagggggat aacagcacca 29100
cttccaatga ggagataccc aaatccatga
ttggcattat tgttgctgta gtggtgtgca 29160
tgttgatcat cgccttgtgc atggtgtact
atgccttctg ctacagaaag cacagactga 29220
acgacaagct ggaacactta ctaagtgttg
aattttaatt ttttagaacc atgaagatcc 29280
taggcctttt agttttttct atcattacct
ctgctctatg caattctgac aatgaggacg 29340
ttactgtcgt tgtcggatca aattatacac
tgaaaggtcc agcgaagggt atgctttcgt 29400
ggtattgctg gtttggaact gacactgatc
aaactgagct ttgcaatgca atgaaaggtc 29460
aaataccaac ctcaaaaatt aaacataaat
gcaatggtac tgacttagta ctactcaata 29520
tcacgaaatc atatgctggc agctattcat
gccctggaga tgatgctgag aacatgattt 29580
tttacaaagt aactgttgtt gatcccacta
ctccaccacc caccaccaca actactcaca 29640
ccacacacac agaacaaaca ccagaggcag
cagaagcaga gttggccttc caggttcacg 29700
gagattcctt tgctgtcaat acccctacac
ccgatcatcg gtgtccgggg ctgctagtca 29760
gcggcattgt cggtgtgctt tcgggattag
cagtcataat catctgcatg ttcatttttg 29820
cttgctgcta tagaaggctt taccgacaaa
aatcagaccc actgctgaac ctctatgttt 29880
aattttttcc agagccatga aggcagttag
cgctctagtt ttttgttctt tgattggcat 29940
tgttttttgc aatcctatta ctagagttag
ctttattaaa gatgtgaatg ttactgaggg 30000
gggcaatgtg acactggtag gtgtagaggg
tgctaaaaac accacctgga caaaatacca 30060
ccttgggtgg aaagatattt gcaattggag
tgtcactgtg tacacatgtg agggagttaa 30120
tcttaccatt gtcaatgcca cctcagctca
aaatggtaga attcaaggac aaagtgttag 30180
tgtgaccagt gatgggtatt ttacccaaca
tacttttatc tatgacgtta aagtcatacc 30240
actgcctacg cctagcccac ctagcaccac
tacacaaaca acccacacta cacagacaac 30300
cacatacagt acatcaaatc agcctaccac
cactacagca gcagaggttg ccagctcgtc 30360
tggagttcaa gtggcatttt tgttgttgcc
cccatctagc agtcccactg ctattaccaa 30420
tgagcagact actgcatttt tgtccactgt
cgagagccac accacagcta cctccagtgc 30480
cttctctagc accgccaatc tctcctcgct
ttcctctaca ccaatcagtc ccgctactac 30540
tactaccccc gctattcttc ccactcccct
gaagcaaaca gacggcggca tgcaatggca 30600
gatcaccctg ctcattgtga tcgggttggt
catcctagcc gtgttgctct actacatctt 30660
ctgccgccgc attcccaacg cgcaccgcaa
gccggtctac aagcccatca ttgtcgggca 30720
gccggagccg cttcaggtgg aagggggtct
aaggaatctt ctcttctctt ttacagtatg 30780
gtgattgaac tatgattcct agacaattct
tgatcactat tcttatctgc ctcctccaag 30840
tctgtgccac cctcgctctg gtggccaacg
ccagtccaga ctgtattggg cccttcgcct 30900
cctacgtgct ctttgccttc atcacctgca
tctgctgctg tagcatagtc tgcctgctta 30960
tcaccttctt ccagttcatt gactggatct
ttgtgcgcat cgcctacctg cgccaccacc 31020
cccagtaccg cgaccagcga gtggcgcagc
tgctcaggct cctctgataa gcatgcgggc 31080
tctgctactt ctcgcgcttc tgctgttagt
gctcccccgt cccgttgacc cccggccccc 31140
cactcagtcc cccgaggagg tccgcaaatg
caaattccaa gaaccctgga aattcctcaa 31200
atgctaccgc caaaaatcag acatgcatcc
cagctggatc atgatcattg ggatcgtgaa 31260
cattctggcc tgcaccctca tctcctttgt
gatttacccc tgctttgact ttggttggaa 31320
ctcgccagag gcgctctatc tcccgcctga
acctgacaca ccaccacagc aacctcaggc 31380
acacgcacta ccaccaccac agcctaggcc
acaatacatg cccatattag actatgaggc 31440
cgagccacag cgacccatgc tccccgctat
tagttacttc aatctaaccg gcggagatga 31500
ctgacccact ggccaacaac aacgtcaacg
accttctcct ggacatggac ggccgcgcct 31560
cggagcagcg actcgcccaa cttcgcattc
gccagcagca ggagagagcc gtcaaggagc 31620
tgcaggacgg catagccatc caccagtgca
agaaaggcat cttctgcctg gtgaaacagg 31680
ccaagatctc ctacgaggtc acccagaccg
accatcgcct ctcctacgag ctcctgcagc 31740
agcgccagaa gttcacctgc ctggtcggag
tcaaccccat cgtcatcacc cagcagtcgg 31800
gcgataccaa ggggtgcatc cactgctcct
gcgactcccc cgactgcgtc cacactctga 31860
tcaagaccct ctgcggcctc cgcgacctcc
tccccatgaa ctaatcaccc acttatccag 31920
tgaaataaaa aaataatcat ttgatttgaa
ataaagatac aatcatattg atgatttgag 31980
tttaacaaaa ataaagaatc acttacttga
aatctgatac caggtctctg tccatatttt 32040
ctgccaacac cacctcactc ccctcttccc
agctctggta ctgcaggccc cggcgggctg 32100
caaacttcct ccacacgctg aaggggatgt
caaattcctc ctgcccctca atcttcattt 32160
tatcttctat cagatgtcca aaaagcgcgt
ccgggtggat gatgacttcg accccgtcta 32220
cccctacgat gcagacaacg caccgaccgt
gcccttcatc aaccccccct tcgtctcttc 32280
agatggattc caagagaagc ccctgggggt
gttgtccctg cgactggccg accccgtcac 32340
caccaagaac ggggaaatca ccctcaagct
gggagagggg gtggacctcg actcctcggg 32400
aaaactcatc tccaacacgg ccaccaaggc
cgctgcccct ctcagttttt ccaacaacac 32460
catttccctt aacatggatc acccctttta
cactaaagat ggaaaattag ccttacaagt 32520
ttctccacca ttaaatatac tgagaacaag
cattctaaac acactagctt taggttttgg 32580
atcaggttta ggactccgtg gctctgcctt
ggcagtacag ttagtctctc cacttacatt 32640
tgatactgat ggaaacataa agcttacctt
agacagaggt ttgcatgtta caacaggaga 32700
tgcaattgaa agcaacataa gctgggctaa
aggtttaaaa tttgaagatg gagccatagc 32760
aaccaacatt ggaaatgggt tagagtttgg
aagcagtagt acagaaacag gtgtcgatga 32820
tgcttaccca atccaagtta aacttggatc
tggccttagc tttgacagta caggagccat 32880
aatggctggt aacaaagaag acgataaact
cactttgtgg acaacacctg atccatcacc 32940
aaactgtcaa atactcgcag aaaatgatgc
aaaactaaca ctttgcttga ctaaatgtgg 33000
tagtcaaata ctggccactg tgtcagtctt
agttgtagga agtggaaacc taaaccccat 33060
tactggcacc gtaagcagtg ctcaggtgtt
tctacgtttt gatgcaaacg gtgttctttt 33120
aacagaacat tctacactaa aaaaatactg
ggggtatagg cagggagata gcatagatgg 33180
cactccatat gtcaatgctg taggattcat
gcccaattta aaagcttatc caaagtcaca 33240
aagttctact actaaaaata atatagtagg
gcaagtatac atgaatggag atgtttcaaa 33300
acctatgctt ctcactataa ccctcaatgg
tactgatgac agcaacagta catattcaat 33360
gtcattttca tacacctgga ctaatggaag
ctatgttgga gcaacatttg gagctaactc 33420
ttataccttc tcctacatcg cccaagaatg
aatactgtat cccaccctgc atgcccaacc 33480
ctcccccacc tctgtctata tggaaaactc
tgaaacacaa aataaaataa agttcaagtg 33540
ttttattgat tcaacagttt tacaggattc
gagcagttat ttttcctcca ccctcccagg 33600
acatggaata caccaccctc tccccccgca
cagccttgaa catctgaatg ccattggtga 33660
tggacatgct tttggtctcc acgttccaca
cagtttcaga gcgagccagt ctcgggtcgg 33720
tcagggagat gaaaccctcc gggcactccc
gcatctgcac ctcacagctc aacagctgag 33780
gattgtcctc ggtggtcggg atcacggtta
tctggaagaa gcagaagagc ggcggtggga 33840
atcatagtcc gcgaacggga tcggccggtg
gtgtcgcatc aggccccgca gcagtcgctg 33900
ccgccgccgc tccgtcaagc tgctgctcag
ggggtccggg tccagggact ccctcagcat 33960
gatgcccacg gccctcagca tcagtcgtct
ggtgcggcgg gcgcagcagc gcatgcggat 34020
ctcgctcagg tcgctgcagt acgtgcaaca
caggaccacc aggttgttca acagtccata 34080
gttcaacacg ctccagccga aactcatcgc
gggaaggatg ctacccacgt ggccgtcgta 34140
ccagatcctc aggtaaatca agtggcgccc
cctccagaac acgctgccca tgtacatgat 34200
ctccttgggc atgtggcggt tcaccacctc
ccggtaccac atcaccctct ggttgaacat 34260
gcagccccgg atgatcctgc ggaaccacag
ggccagcacc gccccgcccg ccatgcagcg 34320
aagagacccc gggtcccggc aatggcaatg
gaggacccac cgctcgtacc cgtggatcat 34380
ctgggagctg aacaagtcta tgttggcaca
gcacaggcac acgctcatgc atctcttcag 34440
cactctcagc tcctcggggg tcaaaaccat
atcccagggc acgggaaact cttgcaggac 34500
agcgaagccc gcagaacagg gcaatcctcg
cacataactt acattgtgca tggacagggt 34560
atcgcaatca ggcagcaccg ggtgatcctc
caccagagaa gcgcgggtct cggtctcctc 34620
acagcgtggt aagggggccg gccgatacgg
gtgatggcgg gacgcggctg atcgtgttcg 34680
cgaccgtgtc atgatgcagt tgctttcgga
cattttcgta cttgctgaag cagaacctgg 34740
tccgggcgct gcacaccgat cgccggcggc
ggtctcggcg cttggaacgc tcggtgttga 34800
agttgtaaaa cagccactct ctcagaccgt
gcagcagatc tagggcctca ggagtgatga 34860
agatcccatc atgcctgatg gctctgatca
catcgaccac cgtggaatgg gccagaccca 34920
gccagatgat gcaattttgt tgggtttcgg
tgacggcggg ggagggaaga acaggaagaa 34980
ccatgattaa cttttaatcc aaacggtctc
ggagcacttc aaaatgaagg tcgcggagat 35040
ggcacctctc gcccccgctg tgttggtgga
aaataacagc caggtcaaag gtgatacggt 35100
tctcgagatg ttccacggtg gcttccagca
aagcctccac gcgcacatcc agaaacaaga 35160
caatagcgaa agcgggaggg ttctctaatt
cctcaatcat catgttacac tcctgcacca 35220
tccccagata attttcattt ttccagcctt
gaatgattcg aactagttcc tgaggtaaat 35280
ccaagccagc catgataaag agctcgcgca
gagcgccctc caccggcatt cttaagcaca 35340
ccctcataat tccaagatat tctgctcctg
gttcacctgc agcagattga caagcgggat 35400
atcaaaatct ctgccgcgat ccctgagctc
ctccctcagc aataactgta agtactcttt 35460
catatcctct ccgaaatttt tagccatagg
acccccagga ataagagaag ggcaagccac 35520
attacagata aaccgaagtc ccccccagtg
agcattgcca aatgtaagat tgaaataagc 35580
atgctggcta gacccggtga tatcttccag
ataactggac agaaaatcgg gcaagcaatt 35640
tttaagaaaa tcaacaaaag aaaaatcttc
caggtgcacg tttagggcct cgggaacaac 35700
gatggagtaa gtgcaagggg tgcgttccag
catggttagt tagctgatct gtaaaaaaac 35760
aaaaaataaa acattaaacc atgctagcct
ggcgaacagg tgggtaaatc gttctctcca 35820
gcaccaggca ggccacgggg tctccggcgc
gaccctcgta aaaattgtcg ctatgattga 35880
aaaccatcac agagagacgt tcccggtggc
cggcgtgaat gattcgagaa gaagcataca 35940
cccccggaac attggagtcc gtgagtgaaa
aaaagcggcc gaggaagcaa tgaggcacta 36000
caacgctcac tctcaagtcc agcaaagcga
tgccatgcgg atgaagcaca aaattttcag 36060
gtgcgtaaaa aatgtaatta ctcccctcct
gcacaggcag cgaagctccc gatccctcca 36120
gatacacata caaagcctca gcgtccatag
cttaccgagc ggcagcagca gcggcacaca 36180
acaggcgcaa gagtcagaga aaagactgag
ctctaacctg tccgcccgct ctctgctcaa 36240
tatatagccc cagatctaca ctgacgtaaa
ggccaaagtc taaaaatacc cgccaaataa 36300
tcacacacgc ccagcacacg cccagaaacc
ggtgacacac tcaaaaaaat acgcgcactt 36360
cctcaaacgc ccaaactgcc gtcatttccg
ggttcccacg ctacgtcatc aaaacacgac 36420
tttcaaattc cgtcgaccgt taaaaacgtc
acccgccccg cccctaacgg tcgccgctcc 36480
cgcagccaat cagcgccccg catccccaaa
ttcaaacagc tcatttgcat attaacgcgc 36540
accaaaagtt
tgaggtatat tattgatgat g 36571
Claims (23)
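1. An isolated polynucleotide that encodes an adenovirus fiber protein or a functional derivative thereof, wherein said polynucleotide is selected from:
(a) a polynucleotide encoding a polypeptide having an amino acid sequence according to any one of SEQ ID NOs: 14-19, 50 and 53,
(b) a polynucleotide encoding a functional derivative of a polypeptide according to any one of SEQ ID NOs: 14-19, 50 and 53, wherein said functional derivative comprises a deletion, insertion and/or substitution of one or more amino acid residues, and
(c) a polynucleotide encoding a functional derivative having an amino acid sequence that is at least 85% identical over its entire length to an amino acid sequence of any one of SEQ ID NOs: 14-19, 50 and 53.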
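2. An isolated polynucleotide that encodes an adenovirus hexon protein or a functional derivative thereof, wherein said polynucleotide is selected from:
(a) a polynucleotide encoding a polypeptide having an amino acid sequence according to any one of SEQ ID NOs: 20-25, 51 and 54,
(b) a polynucleotide encoding a functional derivative of a polypeptide according to any one of SEQ ID NOs: 20-25, 51 and 54, wherein said functional derivative comprises a deletion, insertion and/or substitution of one or more amino acid residues, and
(c) a polynucleotide encoding a functional derivative having an amino acid sequence that is at least 95% identical over its entire length to an amino acid sequence of any one of SEQ ID NOs: 20-25, 51 and 54.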
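3. An isolated polynucleotide that encodes an adenovirus penton protein or a functional derivative thereof, wherein said polynucleotide is selected from:
(a) a polynucleotide encoding a polypeptide having an amino acid sequence according to any one of SEQ ID NOs: 26-31, 52 and 55,
(b) a polynucleotide encoding a functional derivative of a polypeptide according to any one of SEQ ID NOs: 26-31, 52 and 55, wherein said functional derivative comprises a deletion, insertion and/or substitution of one or more amino acid residues, and
(c) a polynucleotide encoding a functional derivative having an amino acid sequence that is at least 85% identical over its entire length to an amino acid sequence of any one of SEQ ID NOs: 26-31, 52 and 55.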
4. A polynucleotide comprising at least one isolated polynucleotide according to any one of claims 1, 2 and 3.
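5. The isolated polynucleotide according to any one of claims 1-4, wherein said polynucleotide comprises at least one of the following:
(a) an adenovirus 5'-end, preferably an adenovirus 5' inverted terminal repeat,
(b) an adenovirus E1a region or a fragment thereof selected from the 13S, 12S and 9S regions,
(c) an adenovirus E1b region or a fragment thereof selected from the small T, large T and IX regions,
(d) an adenovirus E2b region or a fragment thereof selected from the small pTP, polymerase and IVa2 regions,
(e) an adenovirus L1 region or a fragment thereof, said fragment encoding an adenovirus protein selected from the 28.1 kD protein, polymerase, agnoprotein, 52/55 kDa protein and IIIa protein,
(f) an adenovirus L2 region or a fragment thereof, said fragment encoding an adenovirus protein selected from the penton protein according to claim 3, VII, V and Mu protein,
(g) an adenovirus L3 region or a fragment thereof, said fragment encoding an adenovirus protein selected from the VI protein, the hexon protein according to claim 2 and endoprotease,
(h) an adenovirus E2a region,
(i) an adenovirus L4 region or a fragment thereof, said fragment encoding an adenovirus protein selected from the 100 kD protein, the 33 kD homolog and protein VIII,
(j) an adenovirus E3 region or a fragment thereof selected from E3 ORF1, E3 ORF2, E3 ORF3, E3 ORF4, E3 ORF5, E3 ORF6, E3 ORF7, E3 ORF8 and E3 ORF9,
(k) an adenovirus L5 region or a fragment thereof, said fragment encoding the fiber protein according to claim 1,
(l) an adenovirus E4 region or a fragment thereof selected from E4 ORF7, E4 ORF6, E4 ORF5, E4 ORF4, E4 ORF3, E4 ORF2 and E4 ORF1, and/or
(m) an adenovirus 3'-end, preferably an adenovirus 3' inverted terminal repeat.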
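6. The isolated polynucleotide according to claim 4, wherein said polynucleotide consists of or comprises a polynucleotide that is at least 90% identical over its entire length to a sequence consisting essentially of any one of SEQ ID NO: 13, 62, 63 or 65, or to a sequence consisting of any one of SEQ ID NO: 13, 62, 63 or 65 but lacking the genomic regions E1A, E1B, E2A, E2B, E3 and/or E4 of SEQ ID NO: 13, 62, 63 or 65.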
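7. An isolated adenovirus capsid polypeptide encoded by the isolated polynucleotide according to any one of claims 1-3, or a functional derivative thereof.
8. A vector comprising the isolated polynucleotide according to any one of claims 1-6.
9. The vector according to claim 8, wherein said vector does not comprise a gene in a genomic region selected from E1A, E1B, E2A, E2B, E3 and E4, and/or comprises at least one gene of a genomic region selected from E1A, E1B, E2A, E2B, E3 and E4, wherein said at least one gene comprises a deletion and/or mutation that renders said at least one gene non-functional.
10. A recombinant adenovirus, preferably a replication-incompetent adenovirus, comprising the isolated polynucleotide according to any one of claims 1-6 and/or at least one isolated adenovirus capsid polypeptide according to claim 7.
11. The recombinant adenovirus of claim 10, wherein said recombinant adenovirus comprises a molecule for delivery into a target cell.
12. The recombinant adenovirus according to claim 10 or 11, wherein said adenovirus has a seroprevalence of less than 5% in human subjects and preferably no seroprevalence in human subjects.
13. The recombinant adenovirus according to any one of claims 10-12, wherein said adenovirus is capable of infecting a mammalian cell.
14. The recombinant adenovirus according to any one of claims 11-13, wherein said molecule for delivery into a target cell is a polynucleotide encoding an antigenic protein or a fragment thereof.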
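15. The recombinant adenovirus of any one of claims 10-14, wherein said adenovirus is a deposited adenovirus having a deposit number selected from 08110601 (ChAd83), 08110602 (ChAd73), 08110603 (ChAd55), 08110604 (ChAd147) and 08110605 (ChAd146).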
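16. A composition comprising an adjuvant and at least one of the following (i) to (iv):
(i) one or more isolated adenovirus capsid polypeptides according to claim 7,
(ii) the isolated polynucleotide according to any one of claims 1-6,
(iii) the vector according to any one of claims 8-9,
(iv) the recombinant adenovirus according to any one of claims 10-15,
and optionally, a pharmaceutically acceptable excipient.
17. The composition according to claim 16, wherein said adjuvant is an agonist of a receptor selected from type I cytokine receptors, type II cytokine receptors, TNF receptors, the vitamin D receptor acting as a transcription factor, and Toll-like receptor 1 (TLR1), TLR-2, TLR 3, TLR4, TLR5, TLR-6, TLR7 and TLR9.
18. The composition according to claim 17, wherein said adjuvant is a Toll-like receptor 4 or 9 agonist.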
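19. A cell comprising at least one of the following:
(i) one or more isolated adenovirus capsid polypeptides according to claim 7,
(ii) the isolated polynucleotide according to any one of claims 1-6,
(iii) the vector according to any one of claims 8-9,
(iv) the recombinant adenovirus according to any one of claims 10-15.
20. The cell according to claim 18, wherein said cell is a host cell that expresses at least one adenoviral gene selected from E1a, E1b, E2a, E2b, E4, L1, L2, L3, L4 and L5.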
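21. Use of the isolated adenovirus capsid polypeptide according to claim 7, the isolated polynucleotide according to any one of claims 1-6, the vector according to any one of claims 8-9, the recombinant adenovirus according to any one of claims 10-15 and/or the composition according to claim 18 for the treatment or prevention of a disease.
22. The use according to claim 21, wherein said treatment or prevention is vaccination.
23. The use according to claim 21, wherein said treatment is gene therapy.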
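Claims 1(c), 2(c), 3(c) and 6 set thresholds for sequence identity "over the entire length" (85%, 95%, 85% and 90%). As an illustration only, and not the method defined by the patent, the following sketch shows one common way such a figure can be computed: align the two sequences globally (Needleman-Wunsch) and divide the number of identical aligned positions by the alignment length. The function names and the scoring values (match 1, mismatch -1, gap -1) are assumptions made for this example; real comparisons would normally use an established alignment tool with protein substitution matrices.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch global alignment; returns the two gapped strings."""
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i, j = i - 1, j - 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Identical aligned positions divided by full alignment length, in percent."""
    if not a and not b:
        raise ValueError("both sequences are empty")
    al_a, al_b = global_align(a, b)
    matches = sum(1 for x, y in zip(al_a, al_b) if x == y and x != "-")
    return 100.0 * matches / len(al_a)


def meets_threshold(a, b, threshold):
    """True if the full-length identity reaches the given percentage."""
    return percent_identity(a, b) >= threshold
```

Under these assumptions, `meets_threshold(candidate, reference, 85)` would express the kind of check implied by claim 1(c); because gaps count toward the alignment length, a truncated candidate scores lower than a full-length one.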
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510427025.4A CN105112428B (zh) | 2009-02-02 | 2010-02-02 | Simian adenovirus nucleic acid and amino acid sequences, vectors containing same, and uses thereof |
Applications Claiming Priority (9)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/EP2009/000672 WO2010085984A1 (en) | 2009-02-02 | 2009-02-02 | Simian adenovirus nucleic acid- and amino acid-sequences, vectors containing same, and uses thereof |
EPPCT/EP2009/000672 | 2009-02-02 | ||
US17262409P | 2009-04-24 | 2009-04-24 | |
US61/172624 | 2009-04-24 | ||
US17485209P | 2009-05-01 | 2009-05-01 | |
US61/174852 | 2009-05-01 | ||
US26634209P | 2009-12-03 | 2009-12-03 | |
US61/266342 | 2009-12-03 | ||
PCT/EP2010/000616 WO2010086189A2 (en) | 2009-02-02 | 2010-02-02 | Simian adenovirus nucleic acid- and amino acid-sequences, vectors containing same, and uses thereof |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510427025.4A Division CN105112428B (zh) | 2009-02-02 | 2010-02-02 | Simian adenovirus nucleic acid and amino acid sequences, vectors containing same, and uses thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN102300872A (zh) | 2011-12-28 |
Family
ID=45561183
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201080006197XA Pending CN102300872A (zh) | 2009-02-02 | 2010-02-02 | Simian adenovirus nucleic acid and amino acid sequences, vectors containing same, and uses thereof |
Country Status (17)
Country | Link |
---|---|
US (3) | US9718863B2 (zh) |
EP (2) | EP2391638B1 (zh) |
JP (2) | JP5882741B2 (zh) |
KR (2) | KR101763093B1 (zh) |
CN (1) | CN102300872A (zh) |
AU (3) | AU2010209938A1 (zh) |
BR (1) | BRPI1008018A2 (zh) |
CA (2) | CA2749325C (zh) |
ES (1) | ES2898235T3 (zh) |
IL (2) | IL214097B (zh) |
MX (1) | MX2011007980A (zh) |
NZ (1) | NZ594355A (zh) |
PL (1) | PL2391638T3 (zh) |
RU (1) | RU2604815C2 (zh) |
SG (2) | SG172935A1 (zh) |
SI (1) | SI2391638T1 (zh) |
WO (1) | WO2010086189A2 (zh) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
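CN105473723A (zh) * | 2012-05-18 | 2016-04-06 | The Trustees of the University of Pennsylvania | Subfamily E simian adenoviruses A1302, A1320, A1331 and A1337 and uses thereof |
CN108025058A (zh) * | 2015-06-12 | 2018-05-11 | GlaxoSmithKline Biologicals SA | Adenovirus polynucleotides and polypeptides |
CN108135991A (zh) * | 2015-07-27 | 2018-06-08 | GlaxoSmithKline Biologicals SA | Novel adenovirus |
CN110997011A (zh) * | 2017-05-10 | 2020-04-10 | University of Utah Research Foundation | Compositions and methods of use of Arc capsids |
CN111372943A (zh) * | 2017-10-31 | 2020-07-03 | Janssen Vaccines & Prevention B.V. | Adenovirus and uses thereof |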
Families Citing this family (124)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
HUE039908T2 (hu) * | 2009-02-02 | 2019-02-28 | Glaxosmithkline Biologicals Sa | Simian adenovirus nucleic acid and amino acid sequences, vectors containing same, and uses thereof |
EP2853266B1 (en) | 2009-11-09 | 2018-01-31 | Genvec, Inc. | Method of propagating monkey adenoviral vectors |
US9526777B2 (en) | 2010-04-16 | 2016-12-27 | The United States Of America As Represented By The Department Of Health And Human Services | Methods for the induction of ebola virus-specific immune responses comprising administering a replication-defective chimpanzee adenovirus vector expressing the ebola virus glycoprotein |
AU2011332025B2 (en) * | 2010-11-23 | 2015-06-25 | The Trustees Of The University Of Pennsylvania | Subfamily E simian adenoviruses A1321, A1325, A1295, A1309 and A1322 and uses thereof |
WO2012089231A1 (en) | 2010-12-30 | 2012-07-05 | Okairòs Ag | Paramyxovirus vaccines |
TWI623618B (zh) | 2011-07-12 | 2018-05-11 | Transgene SA | HBV polymerase mutants |
TW201318637A (zh) | 2011-09-29 | 2013-05-16 | Transgene Sa | Immunotherapy composition and regimen for treating hepatitis C virus infection (I) |
WO2013045668A2 (en) | 2011-09-29 | 2013-04-04 | Transgene Sa | Immunotherapy composition and regimen for treating hepatitis c virus infection |
CA2850629C (en) | 2011-10-05 | 2024-05-21 | Genvec, Inc. | Affenadenovirus (gorilla) or adenoviral vectors and methods of use |
US9629906B2 (en) | 2011-10-05 | 2017-04-25 | Genvec, Inc. | Affenadenovirus (gorilla) or adenoviral vectors and methods of use |
EP2764014B1 (en) | 2011-10-05 | 2022-02-09 | GenVec, Inc. | Adenoviral vectors and methods of use |
EP2780034A1 (en) | 2011-11-14 | 2014-09-24 | Crucell Holland B.V. | Heterologous prime-boost immunization using measles virus-based vaccines |
SG11201405228VA (en) | 2012-03-12 | 2014-11-27 | Crucell Holland Bv | Batches of recombinant adenovirus with altered terminal ends |
US8932607B2 (en) | 2012-03-12 | 2015-01-13 | Crucell Holland B.V. | Batches of recombinant adenovirus with altered terminal ends |
CN105457021A (zh) | 2012-05-04 | 2016-04-06 | Pfizer Inc. | Prostate-associated antigens and vaccine-based immunotherapy regimens |
US9676824B2 (en) | 2012-05-29 | 2017-06-13 | Genvec, Inc. | Herpes simplex virus vaccine |
WO2014005643A1 (en) | 2012-07-05 | 2014-01-09 | Okairos Ag | Novel prime-boosting regimens involving immunogenic polypeptides encoded by polynucleotides |
HRP20211756T1 (hr) | 2012-07-05 | 2022-02-18 | Glaxosmithkline Biologicals Sa | Novel prime-boost regimens involving immunogenic polypeptides encoded by polynucleotides |
CA2891349C (en) | 2012-11-16 | 2023-07-18 | Beth Israel Deaconess Medical Center, Inc. | Recombinant adenoviruses and use thereof |
JP6576326B2 (ja) | 2013-03-14 | 2019-09-18 | Salk Institute for Biological Studies | Oncolytic adenovirus compositions |
WO2014139587A1 (en) | 2013-03-15 | 2014-09-18 | Okairòs Ag | Improved poxviral vaccines |
EA035522B1 (ru) | 2013-04-25 | 2020-06-29 | Janssen Vaccines & Prevention B.V. | Stable soluble RSV F polypeptides in the pre-fusion conformation |
WO2015092710A1 (en) | 2013-12-19 | 2015-06-25 | Glaxosmithkline Biologicals, S.A. | Contralateral co-administration of vaccines |
EP3154576A1 (en) | 2014-06-13 | 2017-04-19 | GlaxoSmithKline Biologicals S.A. | Immunogenic combinations |
UA125013C2 (uk) | 2014-09-03 | 2021-12-29 | Bavarian Nordic A/S | Vaccine combination and method for inducing an immune response against a filovirus in a subject |
SG11201701745TA (en) | 2014-09-03 | 2017-04-27 | Bavarian Nordic As | Methods and compositions for enhancing immune responses |
WO2017025782A1 (en) | 2014-09-17 | 2017-02-16 | Glaxosmithkline Biologicals Sa | Improved poxviral vaccines |
PT3197489T (pt) | 2014-09-26 | 2021-04-30 | Beth Israel Deaconess Medical Ct Inc | Methods and compositions for inducing protective immunity against human immunodeficiency virus infection |
WO2016131945A1 (en) | 2015-02-20 | 2016-08-25 | Transgene Sa | Combination product with autophagy modulator |
EP3283634B1 (en) | 2015-04-14 | 2019-05-22 | Janssen Vaccines & Prevention B.V. | Recombinant adenovirus expressing two transgenes with a bidirectional promoter |
BR112017017949A2 (pt) | 2015-05-15 | 2018-04-10 | Curevac Ag | Prime-boost regimens involving administration of at least one mRNA construct |
GB201514772D0 (en) * | 2015-08-19 | 2015-09-30 | Glaxosmithkline Biolog Sa | Adenovirus polynucleotides and polypeptides |
BE1024824B1 (fr) * | 2015-06-12 | 2018-07-13 | Glaxosmithkline Biologicals Sa | Adenovirus polynucleotides and polypeptides |
US10457708B2 (en) | 2015-07-07 | 2019-10-29 | Janssen Vaccines & Prevention B.V. | Stabilized soluble pre-fusion RSV F polypeptides |
EP3821906A1 (en) | 2015-07-07 | 2021-05-19 | Janssen Vaccines & Prevention B.V. | Vaccine against rsv comprising modified f polypeptide |
HUE045993T2 (hu) | 2015-12-15 | 2020-01-28 | Janssen Vaccines & Prevention Bv | Human immunodeficiency virus antigens, vectors, compositions, and methods of use thereof |
CN108778321A (zh) | 2016-01-19 | 2018-11-09 | Pfizer Inc. | Cancer vaccines |
CA3013637A1 (en) | 2016-02-23 | 2017-08-31 | Salk Institute For Biological Studies | High throughput assay for measuring adenovirus replication kinetics |
AU2017223589B2 (en) | 2016-02-23 | 2023-08-03 | Salk Institute For Biological Studies | Exogenous gene expression in therapeutic adenovirus for minimal impact on viral kinetics |
DK3436591T5 (da) | 2016-03-31 | 2024-09-16 | The European Molecular Biology Laboratory | Adenoviral coat protein-derived delivery vehicles |
IL262109B2 (en) | 2016-04-05 | 2023-04-01 | Janssen Vaccines Prevention B V | vaccine against rsv |
PE20190420A1 (es) | 2016-04-05 | 2019-03-19 | Janssen Vaccines And Prevention B V | Stabilized soluble pre-fusion respiratory syncytial virus (RSV) F proteins |
EP3452081A1 (en) | 2016-05-04 | 2019-03-13 | Transgene SA | Combination therapy with cpg tlr9 ligand |
EP3455358B1 (en) | 2016-05-12 | 2020-08-26 | Janssen Vaccines & Prevention B.V. | Potent and balanced bidirectional promoter |
MY194419A (en) | 2016-05-30 | 2022-11-30 | Janssen Vaccines & Prevention Bv | Stabilized pre-fusion rsv f proteins |
US11001858B2 (en) | 2016-06-20 | 2021-05-11 | Janssen Vaccines & Prevention B.V. | Potent and balanced bidirectional promoter |
GB2549809C (en) * | 2016-06-23 | 2022-11-30 | Univ Oxford Innovation Ltd | Vector |
US10925956B2 (en) | 2016-07-15 | 2021-02-23 | Janssen Vaccines & Prevention B.V. | Methods and compositions for inducing protective immunity against a marburg virus infection |
WO2018011198A1 (en) | 2016-07-15 | 2018-01-18 | Janssen Vaccines & Prevention B.V. | Methods and compositions for inducing protective immunity against a marburg virus infection |
US11498956B2 (en) | 2016-08-23 | 2022-11-15 | Glaxosmithkline Biologicals Sa | Fusion peptides with antigens linked to short fragments of invariant chain(CD74) |
EP3518966A1 (en) | 2016-09-29 | 2019-08-07 | GlaxoSmithKline Biologicals S.A. | Compositions and methods of treatment of persistent hpv infection |
GB201616904D0 (en) | 2016-10-05 | 2016-11-16 | Glaxosmithkline Biologicals Sa | Vaccine |
US20190328869A1 (en) | 2016-10-10 | 2019-10-31 | Transgene Sa | Immunotherapeutic product and mdsc modulator combination therapy |
GB201620968D0 (en) | 2016-12-09 | 2017-01-25 | Glaxosmithkline Biologicals Sa | Adenovirus polynucleotides and polypeptides |
CA3045976A1 (en) | 2016-12-09 | 2018-06-14 | Glaxosmithkline Biologicals Sa | Chimpanzee adenovirus constructs with lyssavirus antigens |
WO2018111767A1 (en) | 2016-12-12 | 2018-06-21 | Salk Institute For Biological Studies | Tumor-targeting synthetic adenoviruses and uses thereof |
GB201701239D0 (en) * | 2017-01-25 | 2017-03-08 | Glaxosmithkline Biologicals Sa | Novel formulation |
CN110268061B (zh) | 2017-02-09 | 2024-07-16 | Janssen Vaccines & Prevention B.V. | Potent and short promoter for expression of heterologous genes |
WO2018185732A1 (en) | 2017-04-06 | 2018-10-11 | Janssen Vaccines & Prevention B.V. | Mva-bn and ad26.zebov or ad26.filo prime-boost regimen |
BR112019023477A2 (pt) | 2017-05-08 | 2020-06-30 | Gritstone Oncology, Inc. | Alphavirus neoantigen vectors |
EP3624844A1 (en) | 2017-05-17 | 2020-03-25 | Janssen Vaccines & Prevention B.V. | Methods and compositions for inducing protective immunity against rsv infection |
WO2018210871A1 (en) | 2017-05-17 | 2018-11-22 | Janssen Vaccines & Prevention B.V. | Methods and compositions for inducing protective immunity against rsv infection |
JP7272965B2 (ja) | 2017-06-15 | 2023-05-12 | Janssen Vaccines & Prevention B.V. | Poxvirus vectors encoding HIV antigens, and methods of use thereof |
AU2018295421B2 (en) * | 2017-07-05 | 2024-01-25 | Nouscom Ag | Non human great apes adenovirus nucleic acid- and amino acid-sequences, vectors containing same, and uses thereof |
RU2020100072A (ru) | 2017-07-11 | 2021-08-11 | Pfizer Inc. | Immunogenic compositions comprising CEA, MUC1 and TERT |
JP7298926B2 (ja) | 2017-07-12 | 2023-06-27 | Nouscom AG | Neoantigen vaccine composition for treatment of cancer |
US11649467B2 (en) | 2017-07-21 | 2023-05-16 | Glaxosmithkline Biologicals Sa | Chikungunya virus antigen constructs |
SG11202000019RA (en) | 2017-07-28 | 2020-02-27 | Janssen Vaccines & Prevention Bv | Methods and compositions for heterologous reprna immunizations |
MX2020002876A (es) | 2017-09-15 | 2020-07-22 | Janssen Vaccines & Prevention Bv | Method for the safe induction of immunity against RSV |
WO2019076892A1 (en) | 2017-10-16 | 2019-04-25 | Glaxosmithkline Biologicals Sa | Enhanced promoter |
WO2019076880A1 (en) | 2017-10-16 | 2019-04-25 | Glaxosmithkline Biologicals Sa | Simian adenoviral vectors comprising two expression cassettes |
EA202090700A1 (ru) | 2017-10-16 | 2020-10-21 | GlaxoSmithKline Biologicals SA | Replication-competent adenoviral vectors |
MX2020003748A (es) | 2017-10-16 | 2020-11-06 | Glaxosmithkline Biologicals Sa | Adenoviral vectors with two expression cassettes encoding RSV antigenic proteins or fragments thereof |
EA202091074A1 (ru) | 2017-10-31 | 2020-07-22 | Janssen Vaccines & Prevention B.V. | Adenovirus and uses thereof |
CA3077630A1 (en) | 2017-10-31 | 2019-05-09 | Janssen Vaccines & Prevention B.V. | Adenovirus vectors and uses thereof |
KR20200083510A (ko) * | 2017-10-31 | 2020-07-08 | Janssen Vaccines & Prevention B.V. | Adenovirus and uses thereof |
EP3703744A1 (en) | 2017-11-03 | 2020-09-09 | Nouscom AG | Vaccine t cell enhancer |
WO2019099970A1 (en) | 2017-11-20 | 2019-05-23 | Janssen Pharmaceuticals Inc. | Method of providing safe administration of adenoviral vectors encoding a zika virus antigen |
EP3723771A4 (en) | 2017-12-11 | 2022-04-06 | Beth Israel Deaconess Medical Center, Inc. | RECOMBINANT ADENOVIRUS AND THEIR USES |
GB201721069D0 (en) | 2017-12-15 | 2018-01-31 | Glaxosmithkline Biologicals Sa | Hepatitis B Immunisation regimen and compositions |
GB201721068D0 (en) | 2017-12-15 | 2018-01-31 | Glaxosmithkline Biologicals Sa | Hepatitis B immunisation regimen and compositions |
MX2020006471A (es) | 2017-12-19 | 2020-09-22 | Janssen Sciences Ireland Unlimited Co | Methods and compositions for inducing an immune response against hepatitis B virus (HBV) |
BR112020012361A2 (pt) | 2017-12-20 | 2020-11-24 | Glaxosmithkline Biologicals S.A. | Epstein-Barr virus antigen constructs |
EP3807298A1 (en) | 2018-06-12 | 2021-04-21 | GlaxoSmithKline Biologicals S.A. | Adenovirus polynucleotides and polypeptides |
EP3581201A1 (en) | 2018-06-15 | 2019-12-18 | GlaxoSmithKline Biologicals S.A. | Escherichia coli o157:h7 proteins and uses thereof |
EP3587581A1 (en) | 2018-06-26 | 2020-01-01 | GlaxoSmithKline Biologicals S.A. | Formulations for simian adenoviral vectors having enhanced storage stability |
US11713469B2 (en) | 2018-07-20 | 2023-08-01 | Janssen Vaccines & Prevention B.V. | Recombinant adenoviral vector expressing Zika antigen with improved productivity |
GB201812647D0 (en) * | 2018-08-03 | 2018-09-19 | Chancellor Masters And Scholars Of The Univ Of Oxford | Viral vectors and methods for the prevention or treatment of cancer |
CA3109541A1 (en) | 2018-10-19 | 2020-04-23 | Nouscom Ag | Teleost invariant chain cancer vaccine |
WO2020099614A1 (en) | 2018-11-15 | 2020-05-22 | Nouscom Ag | Selection of cancer mutations for generation of a personalized cancer vaccine |
CN113573729A (zh) | 2019-01-10 | 2021-10-29 | Janssen Biotech, Inc. | Prostate neoantigens and their uses |
CA3132601A1 (en) | 2019-03-05 | 2020-09-10 | Glaxosmithkline Biologicals Sa | Hepatitis b immunisation regimen and compositions |
MX2021014525A (es) | 2019-05-30 | 2022-03-17 | Gritstone Bio Inc | Modified adenoviruses |
IL293051A (en) | 2019-11-18 | 2022-07-01 | Janssen Biotech Inc | calr and jak2 mutant-based vaccines and their uses |
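CN113088530A (zh) * | 2020-01-08 | 2021-07-09 | 怡道生物科技(苏州)有限公司 | Expression vector based on chimpanzee adenovirus ChAd63 and construction method thereof |
TW202204380A (zh) | 2020-01-31 | 2022-02-01 | Janssen Pharmaceuticals, Inc. | Compositions and methods for preventing and treating coronavirus infection - SARS-CoV-2 vaccines |
TW202144388A (zh) | 2020-02-14 | 2021-12-01 | Janssen Biotech, Inc. | Neoantigens expressed in ovarian cancer and their uses |
TW202144389A (zh) | 2020-02-14 | 2021-12-01 | Janssen Biotech, Inc. | Neoantigens expressed in multiple myeloma and their uses |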
WO2021209897A1 (en) | 2020-04-13 | 2021-10-21 | Janssen Biotech, Inc. | Psma and steap1 vaccines and their uses |
WO2021228842A1 (en) | 2020-05-11 | 2021-11-18 | Janssen Pharmaceuticals, Inc. | Stabilized coronavirus spike protein fusion proteins |
CN116096406A (zh) | 2020-06-29 | 2023-05-09 | Janssen Vaccines & Prevention B.V. | Vaccine combination against respiratory syncytial virus infection |
US20230024133A1 (en) | 2020-07-06 | 2023-01-26 | Janssen Biotech, Inc. | Prostate Neoantigens And Their Uses |
EP4175664A2 (en) | 2020-07-06 | 2023-05-10 | Janssen Biotech, Inc. | Prostate neoantigens and their uses |
WO2022009051A1 (en) | 2020-07-06 | 2022-01-13 | Janssen Biotech, Inc. | A method for determining responsiveness to prostate cancer treatment |
CA3187149A1 (en) | 2020-07-06 | 2022-01-13 | Janssen Pharmaceuticals, Inc. | Stabilized corona virus spike protein fusion proteins |
BR112022027038A2 (pt) | 2020-07-08 | 2023-01-24 | Janssen Sciences Ireland Unlimited Co | RNA replicon vaccines against HBV |
CA3189238A1 (en) | 2020-07-13 | 2022-01-20 | Transgene | Treatment of immune depression |
KR20230046313A (ko) * | 2020-08-06 | 2023-04-05 | Gritstone Bio, Inc. | Multiepitope vaccine cassettes |
WO2022175479A1 (en) | 2021-02-19 | 2022-08-25 | Janssen Vaccines & Prevention B.V. | Vaccine combinations against respiratory syncytial virus strain a and b infections |
WO2022175477A1 (en) | 2021-02-19 | 2022-08-25 | Janssen Vaccines & Prevention B.V. | Stabilized pre-fusion rsv fb antigens |
US20240197859A1 (en) | 2021-04-01 | 2024-06-20 | Janssen Vaccines & Prevention B.V. | Stabilized Pre-Fusion PIV3 F Proteins |
WO2022218997A1 (en) | 2021-04-12 | 2022-10-20 | Centre National De La Recherche Scientifique (Cnrs) | Novel universal vaccine presenting system |
AU2022299252A1 (en) | 2021-06-21 | 2023-11-23 | Nouscom Ag | Vaccine composition comprising encoded adjuvant |
WO2023020939A1 (en) | 2021-08-17 | 2023-02-23 | Janssen Pharmaceuticals, Inc. | Sars-cov-2 vaccines |
WO2023026182A1 (en) | 2021-08-24 | 2023-03-02 | Janssen Pharmaceuticals, Inc. | Sars-cov-2 vaccines |
WO2023047349A1 (en) | 2021-09-24 | 2023-03-30 | Janssen Pharmaceuticals, Inc. | Stabilized coronavirus spike protein fusion proteins |
WO2023047348A1 (en) | 2021-09-24 | 2023-03-30 | Janssen Pharmaceuticals, Inc. | Stabilized corona virus spike protein fusion proteins |
WO2023111725A1 (en) | 2021-12-14 | 2023-06-22 | Janssen Pharmaceuticals, Inc. | Sars-cov-2 vaccines |
EP4448802A1 (en) | 2021-12-16 | 2024-10-23 | Janssen Vaccines & Prevention B.V. | Stabilized pre-fusion hmpv fusion proteins |
WO2023198815A1 (en) | 2022-04-14 | 2023-10-19 | Janssen Vaccines & Prevention B.V. | Sequential administration of adenoviruses |
WO2023213764A1 (en) | 2022-05-02 | 2023-11-09 | Transgene | Fusion polypeptide comprising an anti-pd-l1 sdab and a member of the tnfsf |
WO2024061759A1 (en) | 2022-09-23 | 2024-03-28 | Janssen Vaccines & Prevention B.V. | Stabilized coronavirus s proteins |
WO2024061757A1 (en) | 2022-09-23 | 2024-03-28 | Janssen Vaccines & Prevention B.V. | Pre-fusion human piv1 f proteins |
WO2024074584A1 (en) | 2022-10-06 | 2024-04-11 | Janssen Vaccines & Prevention B.V. | Stabilized pre-fusion piv3 f proteins |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
PT787200E (pt) | 1994-10-28 | 2005-08-31 | Univ Pennsylvania | Improved adenovirus and methods of use thereof |
AU722042B2 (en) * | 1995-11-30 | 2000-07-20 | Board Of Regents, The University Of Texas System | Methods and compositions for the diagnosis and treatment of cancer |
US5922315A (en) | 1997-01-24 | 1999-07-13 | Genetic Therapy, Inc. | Adenoviruses having altered hexon proteins |
AU2002322285A1 (en) * | 2001-06-22 | 2003-01-08 | The Trustees Of The University Of Pennsylvania | Method for rapid screening of bacterial transformants and novel simian adenovirus proteins |
US7598362B2 (en) | 2001-10-11 | 2009-10-06 | Merck & Co., Inc. | Hepatitis C virus vaccine |
NZ532383A (en) * | 2001-11-21 | 2007-03-30 | Univ Pennsylvania | Pan-7 simian adenovirus nucleic acid and amino acid sequences, vectors containing same, and methods of use |
US20030224372A1 (en) | 2002-05-31 | 2003-12-04 | Denise Syndercombe-Court | Method for determining ethnic origin by means of STR profile |
RU2267496C2 (ru) | 2004-01-15 | 2006-01-10 | Sergey Ivanovich Chernysh | Antitumor and antiviral peptides |
PT1711518E (pt) * | 2004-01-23 | 2010-02-26 | Isti Di Ric Di Bio Moleco P An | Chimpanzee adenovirus vaccine carriers |
JP2008538894A (ja) | 2005-02-11 | 2008-11-13 | Merck & Co., Inc. | Adenovirus serotype 26 vectors, nucleic acids and viruses produced thereby |
JP5475279B2 (ja) | 2005-06-17 | 2014-04-16 | Istituto di Ricerche di Biologia Molecolare P. Angeletti S.R.L. | Hepatitis C virus nucleic acid vaccine |
CN101883858B (zh) * | 2007-11-28 | 2015-07-22 | The Trustees of the University of Pennsylvania | Simian subfamily E adenoviruses SAdV-39, -25.2, -26, -30, -37, and -38 and uses thereof |
EP2463362B1 (en) * | 2007-11-28 | 2017-11-08 | The Trustees Of The University Of Pennsylvania | Simian subfamily c adenovirus SAdv-31 and uses thereof |
EP2325298B1 (en) * | 2008-03-04 | 2016-10-05 | The Trustees Of The University Of Pennsylvania | SIMIAN ADENOVIRUSES SAdV-36, -42.1, -42.2, AND -44 AND USES THEREOF |
CA2726914A1 (en) * | 2008-06-03 | 2009-12-10 | Okairos Ag | A vaccine for the prevention and therapy of hcv infections |
-
2010
- 2010-02-02 ES ES18173908T patent/ES2898235T3/es active Active
- 2010-02-02 KR KR1020167024396A patent/KR101763093B1/ko active IP Right Grant
- 2010-02-02 SG SG2011050275A patent/SG172935A1/en unknown
- 2010-02-02 KR KR1020117017573A patent/KR101761425B1/ko active IP Right Grant
- 2010-02-02 SG SG2014007959A patent/SG2014007959A/en unknown
- 2010-02-02 PL PL10702615T patent/PL2391638T3/pl unknown
- 2010-02-02 SI SI201031745T patent/SI2391638T1/sl unknown
- 2010-02-02 NZ NZ59435510A patent/NZ594355A/xx not_active IP Right Cessation
- 2010-02-02 CA CA2749325A patent/CA2749325C/en not_active Expired - Fee Related
- 2010-02-02 EP EP10702615.5A patent/EP2391638B1/en active Active
- 2010-02-02 CN CN201080006197XA patent/CN102300872A/zh active Pending
- 2010-02-02 JP JP2011546719A patent/JP5882741B2/ja active Active
- 2010-02-02 BR BRPI1008018A patent/BRPI1008018A2/pt not_active Application Discontinuation
- 2010-02-02 WO PCT/EP2010/000616 patent/WO2010086189A2/en active Application Filing
- 2010-02-02 MX MX2011007980A patent/MX2011007980A/es active IP Right Grant
- 2010-02-02 RU RU2011136282/10A patent/RU2604815C2/ru not_active Application Discontinuation
- 2010-02-02 AU AU2010209938A patent/AU2010209938A1/en not_active Abandoned
- 2010-02-02 CA CA3108979A patent/CA3108979A1/en not_active Abandoned
- 2010-02-02 EP EP18173908.7A patent/EP3385387B1/en active Active
- 2010-02-06 US US13/147,193 patent/US9718863B2/en active Active
-
2011
- 2011-07-14 IL IL214097A patent/IL214097B/en active IP Right Grant
-
2015
- 2015-09-10 IL IL241494A patent/IL241494A0/en unknown
- 2015-10-08 AU AU2015238866A patent/AU2015238866B2/en not_active Ceased
-
2016
- 2016-02-04 JP JP2016019877A patent/JP6262779B2/ja active Active
-
2017
- 2017-06-15 US US15/623,723 patent/US10544192B2/en active Active
- 2017-06-23 AU AU2017204292A patent/AU2017204292B2/en not_active Ceased
-
2019
- 2019-12-11 US US16/710,131 patent/US11214599B2/en active Active
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
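CN105473723A (zh) * | 2012-05-18 | 2016-04-06 | The Trustees of the University of Pennsylvania | Subfamily E simian adenoviruses A1302, A1320, A1331 and A1337 and uses thereof |
CN108025058A (zh) * | 2015-06-12 | 2018-05-11 | GlaxoSmithKline Biologicals SA | Adenovirus polynucleotides and polypeptides |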
US11254710B2 (en) | 2015-06-12 | 2022-02-22 | Glaxosmithkline Biologicals Sa | Adenovirus polynucleotides and polypeptides |
US11254711B2 (en) | 2015-06-12 | 2022-02-22 | Glaxosmithkline Biologicals Sa | Adenovirus polynucleotides and polypeptides |
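CN108025058B (zh) * | 2015-06-12 | 2022-12-16 | GlaxoSmithKline Biologicals SA | Adenovirus polynucleotides and polypeptides |
CN108135991A (zh) * | 2015-07-27 | 2018-06-08 | GlaxoSmithKline Biologicals SA | Novel adenovirus |
CN110997011A (zh) * | 2017-05-10 | 2020-04-10 | University of Utah Research Foundation | Compositions and methods of use of ARC capsids |
CN111372943A (zh) * | 2017-10-31 | 2020-07-03 | Janssen Vaccines & Prevention B.V. | Adenovirus and uses thereof |
CN111372943B (zh) * | 2017-10-31 | 2023-12-05 | Janssen Vaccines & Prevention B.V. | Adenovirus and uses thereof |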
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101763093B1 (ko) | Simian adenovirus nucleic acid- and amino acid-sequences, vectors containing same, and uses thereof | |
AU2019204982B2 (en) | Recombinant HCMV and RhCMV Vectors and Uses Thereof | |
DK2163260T3 (en) | Chimpanzee adenovirus vaccine carriers | |
AU2019271972B2 (en) | Adenovirus polynucleotides and polypeptides | |
RU2762854C2 (ru) | Non-human great ape adenovirus nucleic acid and amino acid sequences, vectors containing same, and uses thereof | |
CN1833027B (zh) | Methods of generating chimeric adenoviruses and uses for such chimeric adenoviruses | |
AU2011332025B2 (en) | Subfamily E simian adenoviruses A1321, A1325, A1295, A1309 and A1322 and uses thereof | |
AU2022203504A1 (en) | Oncolytic tumor viruses and methods of use | |
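KR102403547B1 (ko) | Human cytomegalovirus comprising exogenous antigens | |
KR102471633B1 (ko) | Exogenous gene expression in therapeutic adenovirus for minimal impact on viral kinetics | |
KR20180034589A (ko) | Novel methods for inducing an immune response | |
JP2024073576A (ja) | Modified adenoviruses | |
CN107574154A (zh) | Monkey (gorilla) adenovirus or adenoviral vectors and methods of use | |
CN107937440A (zh) | Monkey adenovirus (gorilla) or adenoviral vectors and methods of use | |
JP2023145678A (ja) | Epstein-Barr virus antigen constructs | |
CN116940589A (zh) | Recombinant SARS-CoV-2 vaccine | |
KR20210053923A (ko) | Chimeric oncolytic herpesvirus that stimulates an antitumor immune response | |
DK2391638T3 (en) | Simian adenovirus nucleic acid and amino acid sequences, vectors containing them, and uses thereof | |
CN114761030A (zh) | Oncolytic virotherapy with induced anti-tumor immunity | |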
NL2023464B1 (en) | Oncolytic Non-human adenoviruses and uses thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| C06 | Publication | |
| PB01 | Publication | |
| C10 | Entry into substantive examination | |
| SE01 | Entry into force of request for substantive examination | |
| ASS | Succession or assignment of patent right | Owner name: SMITHKLINE BEECHAM BIOLOG; Free format text: FORMER OWNER: OKAIROS AG; Effective date: 20150324 |
| C41 | Transfer of patent application or patent right or utility model | |
| TA01 | Transfer of patent application right | Effective date of registration: 20150324; Address after: Rixensart, Belgium; Applicant after: Glaxo Smithkline Biologicals S.A.; Address before: Basel; Applicant before: Okairos AG Switzerland |
| C12 | Rejection of a patent application after its publication | |
| RJ01 | Rejection of invention patent application after publication | Application publication date: 20111228 |