CN114269363A - 用于hiv疫苗应用的复制缺陷型腺病毒载体 - Google Patents
用于hiv疫苗应用的复制缺陷型腺病毒载体 Download PDFInfo
- Publication number
- CN114269363A CN114269363A CN201980097540.7A CN201980097540A CN114269363A CN 114269363 A CN114269363 A CN 114269363A CN 201980097540 A CN201980097540 A CN 201980097540A CN 114269363 A CN114269363 A CN 114269363A
- Authority
- CN
- China
- Prior art keywords
- composition
- asn
- thr
- leu
- mammal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 239000013598 vector Substances 0.000 title claims description 116
- 241000701161 unidentified adenovirus Species 0.000 title abstract description 9
- 229940033330 HIV vaccine Drugs 0.000 title description 3
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 235
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 160
- 239000000203 mixture Substances 0.000 claims abstract description 108
- 238000000034 method Methods 0.000 claims abstract description 89
- 230000014509 gene expression Effects 0.000 claims abstract description 87
- 241000124008 Mammalia Species 0.000 claims abstract description 84
- 150000007523 nucleic acids Chemical group 0.000 claims abstract description 54
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 48
- 241000282577 Pan troglodytes Species 0.000 claims abstract description 17
- 210000004027 cell Anatomy 0.000 claims description 43
- 230000028993 immune response Effects 0.000 claims description 39
- 241000701022 Cytomegalovirus Species 0.000 claims description 19
- 210000003719 b-lymphocyte Anatomy 0.000 claims description 14
- 239000003623 enhancer Substances 0.000 claims description 14
- 230000002163 immunogen Effects 0.000 claims description 14
- 239000002671 adjuvant Substances 0.000 claims description 13
- 101000621943 Acholeplasma phage L2 Probable integrase/recombinase Proteins 0.000 claims description 9
- 101000768957 Acholeplasma phage L2 Uncharacterized 37.2 kDa protein Proteins 0.000 claims description 9
- 101000823746 Acidianus ambivalens Uncharacterized 17.7 kDa protein in bps2 3'region Proteins 0.000 claims description 9
- 101000916369 Acidianus ambivalens Uncharacterized protein in sor 5'region Proteins 0.000 claims description 9
- 101000769342 Acinetobacter guillouiae Uncharacterized protein in rpoN-murA intergenic region Proteins 0.000 claims description 9
- 101000823696 Actinobacillus pleuropneumoniae Uncharacterized glycosyltransferase in aroQ 3'region Proteins 0.000 claims description 9
- 101000786513 Agrobacterium tumefaciens (strain 15955) Uncharacterized protein outside the virF region Proteins 0.000 claims description 9
- 101000618005 Alkalihalobacillus pseudofirmus (strain ATCC BAA-2126 / JCM 17055 / OF4) Uncharacterized protein BpOF4_00885 Proteins 0.000 claims description 9
- 101000618348 Allochromatium vinosum (strain ATCC 17899 / DSM 180 / NBRC 103801 / NCIMB 10441 / D) Uncharacterized protein Alvin_0065 Proteins 0.000 claims description 9
- 102100020724 Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Human genes 0.000 claims description 9
- 101000781117 Autographa californica nuclear polyhedrosis virus Uncharacterized 12.4 kDa protein in CTL-LEF2 intergenic region Proteins 0.000 claims description 9
- 101000967489 Azorhizobium caulinodans (strain ATCC 43989 / DSM 5975 / JCM 20966 / LMG 6465 / NBRC 14845 / NCIMB 13405 / ORS 571) Uncharacterized protein AZC_3924 Proteins 0.000 claims description 9
- 101000708323 Azospirillum brasilense Uncharacterized 28.8 kDa protein in nifR3-like 5'region Proteins 0.000 claims description 9
- 101000770311 Azotobacter chroococcum mcd 1 Uncharacterized 19.8 kDa protein in nifW 5'region Proteins 0.000 claims description 9
- 101000823761 Bacillus licheniformis Uncharacterized 9.4 kDa protein in flaL 3'region Proteins 0.000 claims description 9
- 101000819719 Bacillus methanolicus Uncharacterized N-acetyltransferase in lysA 3'region Proteins 0.000 claims description 9
- 101000789586 Bacillus subtilis (strain 168) UPF0702 transmembrane protein YkjA Proteins 0.000 claims description 9
- 101000748761 Bacillus subtilis (strain 168) Uncharacterized MFS-type transporter YcxA Proteins 0.000 claims description 9
- 101000792624 Bacillus subtilis (strain 168) Uncharacterized protein YbxH Proteins 0.000 claims description 9
- 101000790792 Bacillus subtilis (strain 168) Uncharacterized protein YckC Proteins 0.000 claims description 9
- 101000765620 Bacillus subtilis (strain 168) Uncharacterized protein YlxP Proteins 0.000 claims description 9
- 101000819705 Bacillus subtilis (strain 168) Uncharacterized protein YlxR Proteins 0.000 claims description 9
- 101000916134 Bacillus subtilis (strain 168) Uncharacterized protein YqxJ Proteins 0.000 claims description 9
- 101000948218 Bacillus subtilis (strain 168) Uncharacterized protein YtxJ Proteins 0.000 claims description 9
- 101000718627 Bacillus thuringiensis subsp. kurstaki Putative RNA polymerase sigma-G factor Proteins 0.000 claims description 9
- 101000641200 Bombyx mori densovirus Putative non-structural protein Proteins 0.000 claims description 9
- 101000754349 Bordetella pertussis (strain Tohama I / ATCC BAA-589 / NCTC 13251) UPF0065 protein BP0148 Proteins 0.000 claims description 9
- 101000827633 Caldicellulosiruptor sp. (strain Rt8B.4) Uncharacterized 23.9 kDa protein in xynA 3'region Proteins 0.000 claims description 9
- 101000947628 Claviceps purpurea Uncharacterized 11.8 kDa protein Proteins 0.000 claims description 9
- 101000947633 Claviceps purpurea Uncharacterized 13.8 kDa protein Proteins 0.000 claims description 9
- 101000686796 Clostridium perfringens Replication protein Proteins 0.000 claims description 9
- 102100031725 Cortactin-binding protein 2 Human genes 0.000 claims description 9
- 101000948901 Enterobacteria phage T4 Uncharacterized 16.0 kDa protein in segB-ipI intergenic region Proteins 0.000 claims description 9
- 101000805958 Equine herpesvirus 4 (strain 1942) Virion protein US10 homolog Proteins 0.000 claims description 9
- 101000790442 Escherichia coli Insertion element IS2 uncharacterized 11.1 kDa protein Proteins 0.000 claims description 9
- 101000788129 Escherichia coli Uncharacterized protein in sul1 3'region Proteins 0.000 claims description 9
- 101000788370 Escherichia phage P2 Uncharacterized 12.9 kDa protein in GpA 3'region Proteins 0.000 claims description 9
- 101000788354 Escherichia phage P2 Uncharacterized 8.2 kDa protein in gpA 5'region Proteins 0.000 claims description 9
- 101000770304 Frankia alni UPF0460 protein in nifX-nifW intergenic region Proteins 0.000 claims description 9
- 101000797344 Geobacillus stearothermophilus Putative tRNA (cytidine(34)-2'-O)-methyltransferase Proteins 0.000 claims description 9
- 101000748410 Geobacillus stearothermophilus Uncharacterized protein in fumA 3'region Proteins 0.000 claims description 9
- 101000787096 Geobacillus stearothermophilus Uncharacterized protein in gldA 3'region Proteins 0.000 claims description 9
- 101000772675 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) UPF0438 protein HI_0847 Proteins 0.000 claims description 9
- 101000631019 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) Uncharacterized protein HI_0350 Proteins 0.000 claims description 9
- 101000976889 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 19.2 kDa protein in cox-rep intergenic region Proteins 0.000 claims description 9
- 101000768938 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 8.9 kDa protein in int-C1 intergenic region Proteins 0.000 claims description 9
- 101000785414 Homo sapiens Ankyrin repeat, SAM and basic leucine zipper domain-containing protein 1 Proteins 0.000 claims description 9
- 101000782488 Junonia coenia densovirus (isolate pBRJ/1990) Putative non-structural protein NS2 Proteins 0.000 claims description 9
- 101000827627 Klebsiella pneumoniae Putative low molecular weight protein-tyrosine-phosphatase Proteins 0.000 claims description 9
- 101000811523 Klebsiella pneumoniae Uncharacterized 55.8 kDa protein in cps region Proteins 0.000 claims description 9
- 101000818409 Lactococcus lactis subsp. lactis Uncharacterized HTH-type transcriptional regulator in lacX 3'region Proteins 0.000 claims description 9
- 101000878851 Leptolyngbya boryana Putative Fe(2+) transport protein A Proteins 0.000 claims description 9
- 101000758828 Methanosarcina barkeri (strain Fusaro / DSM 804) Uncharacterized protein Mbar_A1602 Proteins 0.000 claims description 9
- 101001122401 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF3 Proteins 0.000 claims description 9
- 101001130841 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF5 Proteins 0.000 claims description 9
- 101001055788 Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) Pentapeptide repeat protein MfpA Proteins 0.000 claims description 9
- 101710087110 ORF6 protein Proteins 0.000 claims description 9
- 101000740670 Orgyia pseudotsugata multicapsid polyhedrosis virus Protein C42 Proteins 0.000 claims description 9
- 101000769182 Photorhabdus luminescens Uncharacterized protein in pnp 3'region Proteins 0.000 claims description 9
- 101710197985 Probable protein Rev Proteins 0.000 claims description 9
- 101000961392 Pseudescherichia vulneris Uncharacterized 29.9 kDa protein in crtE 3'region Proteins 0.000 claims description 9
- 101000731030 Pseudomonas oleovorans Poly(3-hydroxyalkanoate) polymerase 2 Proteins 0.000 claims description 9
- 101001065485 Pseudomonas putida Probable fatty acid methyltransferase Proteins 0.000 claims description 9
- 101000711023 Rhizobium leguminosarum bv. trifolii Uncharacterized protein in tfuA 3'region Proteins 0.000 claims description 9
- 101000974028 Rhizobium leguminosarum bv. viciae (strain 3841) Putative cystathionine beta-lyase Proteins 0.000 claims description 9
- 101000756519 Rhodobacter capsulatus (strain ATCC BAA-309 / NBRC 16581 / SB1003) Uncharacterized protein RCAP_rcc00048 Proteins 0.000 claims description 9
- 101000948219 Rhodococcus erythropolis Uncharacterized 11.5 kDa protein in thcD 3'region Proteins 0.000 claims description 9
- 101000948156 Rhodococcus erythropolis Uncharacterized 47.3 kDa protein in thcA 5'region Proteins 0.000 claims description 9
- 101000917565 Rhodococcus fascians Uncharacterized 33.6 kDa protein in fasciation locus Proteins 0.000 claims description 9
- 101000790284 Saimiriine herpesvirus 2 (strain 488) Uncharacterized 9.5 kDa protein in DHFR 3'region Proteins 0.000 claims description 9
- 101000936719 Streptococcus gordonii Accessory Sec system protein Asp3 Proteins 0.000 claims description 9
- 101000936711 Streptococcus gordonii Accessory secretory protein Asp4 Proteins 0.000 claims description 9
- 101000929863 Streptomyces cinnamonensis Monensin polyketide synthase putative ketoacyl reductase Proteins 0.000 claims description 9
- 101000788499 Streptomyces coelicolor Uncharacterized oxidoreductase in mprA 5'region Proteins 0.000 claims description 9
- 101000788468 Streptomyces coelicolor Uncharacterized protein in mprR 3'region Proteins 0.000 claims description 9
- 101001102841 Streptomyces griseus Purine nucleoside phosphorylase ORF3 Proteins 0.000 claims description 9
- 101000708557 Streptomyces lincolnensis Uncharacterized 17.2 kDa protein in melC2-rnhH intergenic region Proteins 0.000 claims description 9
- 101000845085 Streptomyces violaceoruber Granaticin polyketide synthase putative ketoacyl reductase 1 Proteins 0.000 claims description 9
- 101000649826 Thermotoga neapolitana Putative anti-sigma factor antagonist TM1081 homolog Proteins 0.000 claims description 9
- 101000711771 Thiocystis violacea Uncharacterized 76.5 kDa protein in phbC 3'region Proteins 0.000 claims description 9
- 101710110895 Uncharacterized 7.3 kDa protein in cox-rep intergenic region Proteins 0.000 claims description 9
- 101710095001 Uncharacterized protein in nifU 5'region Proteins 0.000 claims description 9
- 101000711318 Vibrio alginolyticus Uncharacterized 11.6 kDa protein in scrR 3'region Proteins 0.000 claims description 9
- 101000827562 Vibrio alginolyticus Uncharacterized protein in proC 3'region Proteins 0.000 claims description 9
- 101000778915 Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633) Uncharacterized membrane protein VP2115 Proteins 0.000 claims description 9
- 239000012636 effector Substances 0.000 claims description 8
- 208000031886 HIV Infections Diseases 0.000 claims description 7
- 208000037357 HIV infectious disease Diseases 0.000 claims description 6
- 208000033519 human immunodeficiency virus infectious disease Diseases 0.000 claims description 6
- 210000003162 effector t lymphocyte Anatomy 0.000 claims description 5
- 210000003071 memory t lymphocyte Anatomy 0.000 claims description 5
- 230000003044 adaptive effect Effects 0.000 claims description 4
- 102100034353 Integrase Human genes 0.000 claims description 3
- 108010078428 env Gene Products Proteins 0.000 claims description 3
- 125000003275 alpha amino acid group Chemical group 0.000 claims 2
- 239000008194 pharmaceutical composition Substances 0.000 abstract description 18
- 229940126580 vector vaccine Drugs 0.000 abstract 1
- 241000725303 Human immunodeficiency virus Species 0.000 description 50
- 241000699670 Mus sp. Species 0.000 description 38
- 239000000427 antigen Substances 0.000 description 33
- 108091007433 antigens Proteins 0.000 description 33
- 102000036639 antigens Human genes 0.000 description 33
- 108090000765 processed proteins & peptides Proteins 0.000 description 33
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 27
- 201000010099 disease Diseases 0.000 description 19
- 230000004044 response Effects 0.000 description 17
- 102000004196 processed proteins & peptides Human genes 0.000 description 16
- 238000002474 experimental method Methods 0.000 description 14
- 230000037452 priming Effects 0.000 description 14
- 241000700605 Viruses Species 0.000 description 13
- 150000001875 compounds Chemical class 0.000 description 13
- 229920001184 polypeptide Polymers 0.000 description 13
- 230000001105 regulatory effect Effects 0.000 description 13
- 108020004414 DNA Proteins 0.000 description 12
- 102000053602 DNA Human genes 0.000 description 12
- 239000003795 chemical substances by application Substances 0.000 description 12
- 102000039446 nucleic acids Human genes 0.000 description 11
- 108020004707 nucleic acids Proteins 0.000 description 11
- 238000002965 ELISA Methods 0.000 description 10
- 230000005867 T cell response Effects 0.000 description 10
- 239000002245 particle Substances 0.000 description 10
- 102000040430 polynucleotide Human genes 0.000 description 10
- 108091033319 polynucleotide Proteins 0.000 description 10
- 239000002157 polynucleotide Substances 0.000 description 10
- 210000001519 tissue Anatomy 0.000 description 10
- 125000003729 nucleotide group Chemical group 0.000 description 9
- 229920002477 rna polymer Polymers 0.000 description 9
- 238000011282 treatment Methods 0.000 description 9
- 102000004127 Cytokines Human genes 0.000 description 8
- 108090000695 Cytokines Proteins 0.000 description 8
- 210000001744 T-lymphocyte Anatomy 0.000 description 8
- 230000005875 antibody response Effects 0.000 description 8
- 208000035475 disorder Diseases 0.000 description 8
- 239000000463 material Substances 0.000 description 8
- 239000002773 nucleotide Substances 0.000 description 8
- 208000024891 symptom Diseases 0.000 description 8
- 241001465754 Metazoa Species 0.000 description 7
- 150000001413 amino acids Chemical group 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 230000001225 therapeutic effect Effects 0.000 description 7
- 229960005486 vaccine Drugs 0.000 description 7
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 6
- LANZYLJEHLBUPR-BPUTZDHNSA-N Asn-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N LANZYLJEHLBUPR-BPUTZDHNSA-N 0.000 description 6
- 102100032912 CD44 antigen Human genes 0.000 description 6
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 6
- 101000868273 Homo sapiens CD44 antigen Proteins 0.000 description 6
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 6
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 230000027455 binding Effects 0.000 description 6
- 210000004369 blood Anatomy 0.000 description 6
- 239000008280 blood Substances 0.000 description 6
- 239000003937 drug carrier Substances 0.000 description 6
- 230000003053 immunization Effects 0.000 description 6
- 230000001939 inductive effect Effects 0.000 description 6
- 230000000069 prophylactic effect Effects 0.000 description 6
- 210000002966 serum Anatomy 0.000 description 6
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 5
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- 108010066427 N-valyltryptophan Proteins 0.000 description 5
- 108700008625 Reporter Genes Proteins 0.000 description 5
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 5
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 5
- 230000004913 activation Effects 0.000 description 5
- 229940037003 alum Drugs 0.000 description 5
- 238000012217 deletion Methods 0.000 description 5
- 230000037430 deletion Effects 0.000 description 5
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 5
- 108010034529 leucyl-lysine Proteins 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 239000000523 sample Substances 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- 239000003981 vehicle Substances 0.000 description 5
- 230000003612 virological effect Effects 0.000 description 5
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 4
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 4
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 4
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 4
- FNXOZWPPOJRBRE-XGEHTFHBSA-N Cys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CS)N)O FNXOZWPPOJRBRE-XGEHTFHBSA-N 0.000 description 4
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 4
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 4
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 4
- 241000282412 Homo Species 0.000 description 4
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 4
- 108060003951 Immunoglobulin Proteins 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 4
- JCVOHUKUYSYBAD-DCAQKATOSA-N Lys-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CS)C(=O)O JCVOHUKUYSYBAD-DCAQKATOSA-N 0.000 description 4
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 4
- UUWCIPUVJJIEEP-SRVKXCTJSA-N Phe-Asn-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N UUWCIPUVJJIEEP-SRVKXCTJSA-N 0.000 description 4
- UEHNWRNADDPYNK-DLOVCJGASA-N Phe-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N UEHNWRNADDPYNK-DLOVCJGASA-N 0.000 description 4
- MTMJNKFZDQEVSY-BZSNNMDCSA-N Pro-Val-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MTMJNKFZDQEVSY-BZSNNMDCSA-N 0.000 description 4
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 4
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 4
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 4
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 4
- PEYSVKMXSLPQRU-FJHTZYQYSA-N Trp-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O PEYSVKMXSLPQRU-FJHTZYQYSA-N 0.000 description 4
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 4
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 4
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 4
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 4
- 230000000890 antigenic effect Effects 0.000 description 4
- 230000002238 attenuated effect Effects 0.000 description 4
- 239000012472 biological sample Substances 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000009472 formulation Methods 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 230000036039 immunity Effects 0.000 description 4
- 238000002649 immunization Methods 0.000 description 4
- 230000005847 immunogenicity Effects 0.000 description 4
- 102000018358 immunoglobulin Human genes 0.000 description 4
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 4
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 238000011277 treatment modality Methods 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 3
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 3
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 3
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 3
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 3
- 238000011725 BALB/c mouse Methods 0.000 description 3
- 101710117545 C protein Proteins 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 3
- COYGBRTZEVWZBW-XKBZYTNZSA-N Gln-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O COYGBRTZEVWZBW-XKBZYTNZSA-N 0.000 description 3
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 3
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 3
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 3
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 102000001398 Granzyme Human genes 0.000 description 3
- 108060005986 Granzyme Proteins 0.000 description 3
- YXBRCTXAEYSCHS-XVYDVKMFSA-N His-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N YXBRCTXAEYSCHS-XVYDVKMFSA-N 0.000 description 3
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 3
- RSDHVTMRXSABSV-GHCJXIJMSA-N Ile-Asn-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N RSDHVTMRXSABSV-GHCJXIJMSA-N 0.000 description 3
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 3
- 108091092195 Intron Proteins 0.000 description 3
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 3
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 3
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 3
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 3
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 3
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 3
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 3
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 3
- LBNFTWKGISQVEE-AVGNSLFASA-N Met-Leu-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCSC LBNFTWKGISQVEE-AVGNSLFASA-N 0.000 description 3
- 241000699666 Mus <mouse, genus> Species 0.000 description 3
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- 206010028980 Neoplasm Diseases 0.000 description 3
- DSXPMZMSJHOKKK-HJOGWXRNSA-N Phe-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DSXPMZMSJHOKKK-HJOGWXRNSA-N 0.000 description 3
- FJLODLCIOJUDRG-PYJNHQTQSA-N Pro-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FJLODLCIOJUDRG-PYJNHQTQSA-N 0.000 description 3
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 3
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 3
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 3
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 3
- YPBYQWFZAAQMGW-XIRDDKMYSA-N Trp-Lys-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N YPBYQWFZAAQMGW-XIRDDKMYSA-N 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 239000013592 cell lysate Substances 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 230000000875 corresponding effect Effects 0.000 description 3
- 239000003085 diluting agent Substances 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 238000001990 intravenous administration Methods 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 239000000546 pharmaceutical excipient Substances 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 239000003755 preservative agent Substances 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 108010038745 tryptophylglycine Proteins 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- 238000002255 vaccination Methods 0.000 description 3
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 2
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 2
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 2
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 2
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 2
- ASCGFDYEKSRNPL-CIUDSAMLSA-N Asn-Glu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O ASCGFDYEKSRNPL-CIUDSAMLSA-N 0.000 description 2
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 2
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 2
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 2
- CPYHLXSGDBDULY-IHPCNDPISA-N Asn-Trp-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CPYHLXSGDBDULY-IHPCNDPISA-N 0.000 description 2
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 2
- ICTXFVKYAGQURS-UBHSHLNASA-N Asp-Asn-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ICTXFVKYAGQURS-UBHSHLNASA-N 0.000 description 2
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 2
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 2
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 210000001266 CD8-positive T-lymphocyte Anatomy 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- LHLSSZYQFUNWRZ-NAKRPEOUSA-N Cys-Arg-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LHLSSZYQFUNWRZ-NAKRPEOUSA-N 0.000 description 2
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 2
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 2
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 2
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 2
- GLAPJAHOPFSLKL-SRVKXCTJSA-N Gln-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N GLAPJAHOPFSLKL-SRVKXCTJSA-N 0.000 description 2
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 2
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 2
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 2
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 2
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 2
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 2
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- ZKLYPEGLWFVRGF-IUCAKERBSA-N Gly-His-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZKLYPEGLWFVRGF-IUCAKERBSA-N 0.000 description 2
- UYPPAMNTTMJHJW-KCTSRDHCSA-N Gly-Ile-Trp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UYPPAMNTTMJHJW-KCTSRDHCSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- RBOOOLVEKJHUNA-CIUDSAMLSA-N His-Cys-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O RBOOOLVEKJHUNA-CIUDSAMLSA-N 0.000 description 2
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 2
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 2
- DURWCDDDAWVPOP-JBDRJPRFSA-N Ile-Cys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N DURWCDDDAWVPOP-JBDRJPRFSA-N 0.000 description 2
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 2
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 2
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 2
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 2
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 2
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 2
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 2
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 2
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 2
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 2
- NUEHSWNAFIEBCQ-NAKRPEOUSA-N Ile-Val-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUEHSWNAFIEBCQ-NAKRPEOUSA-N 0.000 description 2
- 102000014150 Interferons Human genes 0.000 description 2
- 108010050904 Interferons Proteins 0.000 description 2
- 108010002350 Interleukin-2 Proteins 0.000 description 2
- 108010063738 Interleukins Proteins 0.000 description 2
- 102000015696 Interleukins Human genes 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 2
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 2
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 2
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 2
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 2
- IECZNARPMKQGJC-XIRDDKMYSA-N Met-Gln-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N IECZNARPMKQGJC-XIRDDKMYSA-N 0.000 description 2
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 2
- XLTSAUGGDYRFLS-UMPQAUOISA-N Met-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCSC)N)O XLTSAUGGDYRFLS-UMPQAUOISA-N 0.000 description 2
- HOTNHEUETJELDL-BPNCWPANSA-N Met-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N HOTNHEUETJELDL-BPNCWPANSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 2
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 2
- MQVFHOPCKNTHGT-MELADBBJSA-N Phe-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O MQVFHOPCKNTHGT-MELADBBJSA-N 0.000 description 2
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 2
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 2
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 2
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 2
- RJKFOVLPORLFTN-LEKSSAKUSA-N Progesterone Chemical compound C1CC2=CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H](C(=O)C)[C@@]1(C)CC2 RJKFOVLPORLFTN-LEKSSAKUSA-N 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 2
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 2
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 2
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 2
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 2
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 2
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 2
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 2
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 2
- QFCQNHITJPRQTB-IEGACIPQSA-N Thr-Lys-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O QFCQNHITJPRQTB-IEGACIPQSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- BTAJAOWZCWOHBU-HSHDSVGOSA-N Thr-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)C(C)C)C(O)=O)=CNC2=C1 BTAJAOWZCWOHBU-HSHDSVGOSA-N 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- AVYVKJMBNLPWRX-WFBYXXMGSA-N Trp-Ala-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 AVYVKJMBNLPWRX-WFBYXXMGSA-N 0.000 description 2
- YTZYHKOSHOXTHA-TUSQITKMSA-N Trp-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=3C4=CC=CC=C4NC=3)CC(C)C)C(O)=O)=CNC2=C1 YTZYHKOSHOXTHA-TUSQITKMSA-N 0.000 description 2
- KBKTUNYBNJWFRL-UBHSHLNASA-N Trp-Ser-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 KBKTUNYBNJWFRL-UBHSHLNASA-N 0.000 description 2
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 2
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 2
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 2
- UMXSDHPSMROQRB-YJRXYDGGSA-N Tyr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UMXSDHPSMROQRB-YJRXYDGGSA-N 0.000 description 2
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 2
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 2
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 2
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 2
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 2
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- 239000003070 absorption delaying agent Substances 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 230000000844 anti-bacterial effect Effects 0.000 description 2
- 239000003429 antifungal agent Substances 0.000 description 2
- 229940121375 antifungal agent Drugs 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 230000016396 cytokine production Effects 0.000 description 2
- 230000001461 cytolytic effect Effects 0.000 description 2
- 229940104302 cytosine Drugs 0.000 description 2
- 230000002939 deleterious effect Effects 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- MMXKVMNBHPAILY-UHFFFAOYSA-N ethyl laurate Chemical compound CCCCCCCCCCCC(=O)OCC MMXKVMNBHPAILY-UHFFFAOYSA-N 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 210000000987 immune system Anatomy 0.000 description 2
- 239000004615 ingredient Substances 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 229940079322 interferon Drugs 0.000 description 2
- 238000007913 intrathecal administration Methods 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 229920002521 macromolecule Polymers 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000004949 mass spectrometry Methods 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 239000011859 microparticle Substances 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 238000004806 packaging method and process Methods 0.000 description 2
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 2
- 239000002953 phosphate buffered saline Substances 0.000 description 2
- 230000002335 preservative effect Effects 0.000 description 2
- 230000002685 pulmonary effect Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 238000010186 staining Methods 0.000 description 2
- 238000007920 subcutaneous administration Methods 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 239000000375 suspending agent Substances 0.000 description 2
- 239000002562 thickening agent Substances 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 238000011200 topical administration Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 102000003390 tumor necrosis factor Human genes 0.000 description 2
- 241000701447 unidentified baculovirus Species 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- 239000000277 virosome Substances 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 1
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- FRFDXQWNDZMREB-ACZMJKKPSA-N Ala-Cys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRFDXQWNDZMREB-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- JWUZOJXDJDEQEM-ZLIFDBKOSA-N Ala-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 JWUZOJXDJDEQEM-ZLIFDBKOSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- XPBVBZPVNFIHOA-UVBJJODRSA-N Ala-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 XPBVBZPVNFIHOA-UVBJJODRSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- JVMKBJNSRZWDBO-FXQIFTODSA-N Arg-Cys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O JVMKBJNSRZWDBO-FXQIFTODSA-N 0.000 description 1
- BGDILZXXDJCKPF-CIUDSAMLSA-N Arg-Gln-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(O)=O BGDILZXXDJCKPF-CIUDSAMLSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 1
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 1
- WQSCVMQDZYTFQU-FXQIFTODSA-N Asn-Cys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WQSCVMQDZYTFQU-FXQIFTODSA-N 0.000 description 1
- HLTLEIXYIJDFOY-ZLUOBGJFSA-N Asn-Cys-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O HLTLEIXYIJDFOY-ZLUOBGJFSA-N 0.000 description 1
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 1
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 description 1
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- VITDJIPIJZAVGC-VEVYYDQMSA-N Asn-Met-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VITDJIPIJZAVGC-VEVYYDQMSA-N 0.000 description 1
- MPXTVIOEYISUQC-DHATWTDPSA-N Asn-Met-Thr-Thr Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MPXTVIOEYISUQC-DHATWTDPSA-N 0.000 description 1
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- FLJVGAFLZVBBNG-BPUTZDHNSA-N Asn-Trp-Arg Chemical compound N[C@@H](CC(=O)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O FLJVGAFLZVBBNG-BPUTZDHNSA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- WKGJGVGTEZGFSW-FXQIFTODSA-N Asp-Asn-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O WKGJGVGTEZGFSW-FXQIFTODSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- 241000416162 Astragalus gummifer Species 0.000 description 1
- 241000714230 Avian leukemia virus Species 0.000 description 1
- 230000028728 B cell mediated immunity Effects 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 241000220450 Cajanus cajan Species 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 1
- 208000003322 Coinfection Diseases 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 229920002261 Corn starch Polymers 0.000 description 1
- 102000004420 Creatine Kinase Human genes 0.000 description 1
- 108010042126 Creatine kinase Proteins 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- KIQKJXYVGSYDFS-ZLUOBGJFSA-N Cys-Asn-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KIQKJXYVGSYDFS-ZLUOBGJFSA-N 0.000 description 1
- DCXGXDGGXVZVMY-GHCJXIJMSA-N Cys-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CS DCXGXDGGXVZVMY-GHCJXIJMSA-N 0.000 description 1
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 1
- ASHTVGGFIMESRD-LKXGYXEUSA-N Cys-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)O ASHTVGGFIMESRD-LKXGYXEUSA-N 0.000 description 1
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 1
- HMWBPUDETPKSSS-DCAQKATOSA-N Cys-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCCN)C(=O)O HMWBPUDETPKSSS-DCAQKATOSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- JIVJQYNNAYFXDG-LKXGYXEUSA-N Cys-Thr-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JIVJQYNNAYFXDG-LKXGYXEUSA-N 0.000 description 1
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 1
- JTEGHEWKBCTIAL-IXOXFDKPSA-N Cys-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N)O JTEGHEWKBCTIAL-IXOXFDKPSA-N 0.000 description 1
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 1
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 1
- PXEGEYISOXISDV-XIRDDKMYSA-N Cys-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CS)=CNC2=C1 PXEGEYISOXISDV-XIRDDKMYSA-N 0.000 description 1
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- YVGGHNCTFXOJCH-UHFFFAOYSA-N DDT Chemical compound C1=CC(Cl)=CC=C1C(C(Cl)(Cl)Cl)C1=CC=C(Cl)C=C1 YVGGHNCTFXOJCH-UHFFFAOYSA-N 0.000 description 1
- -1 DNA or RNA Chemical class 0.000 description 1
- 102100036912 Desmin Human genes 0.000 description 1
- 108010044052 Desmin Proteins 0.000 description 1
- 101150005585 E3 gene Proteins 0.000 description 1
- LVGKNOAMLMIIKO-UHFFFAOYSA-N Elaidinsaeure-aethylester Natural products CCCCCCCCC=CCCCCCCCC(=O)OCC LVGKNOAMLMIIKO-UHFFFAOYSA-N 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 239000001856 Ethyl cellulose Substances 0.000 description 1
- ZZSNKZQZMQGXPY-UHFFFAOYSA-N Ethyl cellulose Chemical compound CCOCC1OC(OC)C(OCC)C(OCC)C1OC1C(O)C(O)C(OC)C(CO)O1 ZZSNKZQZMQGXPY-UHFFFAOYSA-N 0.000 description 1
- 241000282324 Felis Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 101710177291 Gag polyprotein Proteins 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- DTCCMDYODDPHBG-ACZMJKKPSA-N Gln-Ala-Cys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O DTCCMDYODDPHBG-ACZMJKKPSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- TWHDOEYLXXQYOZ-FXQIFTODSA-N Gln-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N TWHDOEYLXXQYOZ-FXQIFTODSA-N 0.000 description 1
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- GPISLLFQNHELLK-DCAQKATOSA-N Gln-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GPISLLFQNHELLK-DCAQKATOSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- MSHXWFKYXJTLEZ-CIUDSAMLSA-N Gln-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MSHXWFKYXJTLEZ-CIUDSAMLSA-N 0.000 description 1
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 1
- AQPZYBSRDRZBAG-AVGNSLFASA-N Gln-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N AQPZYBSRDRZBAG-AVGNSLFASA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 1
- RBSKVTZUFMIWFU-XEGUGMAKSA-N Gln-Trp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O RBSKVTZUFMIWFU-XEGUGMAKSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- ZCFNZTVIDMLUQC-SXNHZJKMSA-N Glu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZCFNZTVIDMLUQC-SXNHZJKMSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- VJVAQZYGLMJPTK-QEJZJMRPSA-N Glu-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VJVAQZYGLMJPTK-QEJZJMRPSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- JPAACTMBBBGAAR-HOTGVXAUSA-N Gly-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)CC(C)C)C(O)=O)=CNC2=C1 JPAACTMBBBGAAR-HOTGVXAUSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- NZOAFWHVAFJERA-OALUTQOASA-N Gly-Phe-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NZOAFWHVAFJERA-OALUTQOASA-N 0.000 description 1
- QSQXZZCGPXQBPP-BQBZGAKWSA-N Gly-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)CN)C(=O)N[C@@H](CS)C(=O)O QSQXZZCGPXQBPP-BQBZGAKWSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- 102000002068 Glycopeptides Human genes 0.000 description 1
- 108010015899 Glycopeptides Proteins 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- 229940033332 HIV-1 vaccine Drugs 0.000 description 1
- 102000001554 Hemoglobins Human genes 0.000 description 1
- 108010054147 Hemoglobins Proteins 0.000 description 1
- JBJNKUOMNZGQIM-PYJNHQTQSA-N His-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JBJNKUOMNZGQIM-PYJNHQTQSA-N 0.000 description 1
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 1
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 1
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 1
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 1
- ZNTSGDNUITWTRA-WDSOQIARSA-N His-Trp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O ZNTSGDNUITWTRA-WDSOQIARSA-N 0.000 description 1
- YKUAGFAXQRYUQW-KKUMJFAQSA-N His-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O YKUAGFAXQRYUQW-KKUMJFAQSA-N 0.000 description 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 1
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 1
- 241000713340 Human immunodeficiency virus 2 Species 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 1
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- VISRCHQHQCLODA-NAKRPEOUSA-N Ile-Pro-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N VISRCHQHQCLODA-NAKRPEOUSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- BZUOLKFQVVBTJY-SLBDDTMCSA-N Ile-Trp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BZUOLKFQVVBTJY-SLBDDTMCSA-N 0.000 description 1
- WKSHBPRUIRGWRZ-KCTSRDHCSA-N Ile-Trp-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N WKSHBPRUIRGWRZ-KCTSRDHCSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 1
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- RNYLNYTYMXACRI-VFAJRCTISA-N Leu-Thr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O RNYLNYTYMXACRI-VFAJRCTISA-N 0.000 description 1
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 1
- YWFZWQKWNDOWPA-XIRDDKMYSA-N Leu-Trp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O YWFZWQKWNDOWPA-XIRDDKMYSA-N 0.000 description 1
- ZGGVHTQAPHVMKM-IHPCNDPISA-N Leu-Trp-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N ZGGVHTQAPHVMKM-IHPCNDPISA-N 0.000 description 1
- WBRJVRXEGQIDRK-XIRDDKMYSA-N Leu-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 WBRJVRXEGQIDRK-XIRDDKMYSA-N 0.000 description 1
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- NLOZZWJNIKKYSC-WDSOQIARSA-N Lys-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 NLOZZWJNIKKYSC-WDSOQIARSA-N 0.000 description 1
- WLCYCADOWRMSAJ-CIUDSAMLSA-N Lys-Asn-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O WLCYCADOWRMSAJ-CIUDSAMLSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 1
- XTONYTDATVADQH-CIUDSAMLSA-N Lys-Cys-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XTONYTDATVADQH-CIUDSAMLSA-N 0.000 description 1
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 1
- AIPHUKOBUXJNKM-KKUMJFAQSA-N Lys-Cys-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AIPHUKOBUXJNKM-KKUMJFAQSA-N 0.000 description 1
- BYEBKXRNDLTGFW-CIUDSAMLSA-N Lys-Cys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O BYEBKXRNDLTGFW-CIUDSAMLSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- GTAXSKOXPIISBW-AVGNSLFASA-N Lys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GTAXSKOXPIISBW-AVGNSLFASA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 1
- GFWLIJDQILOEPP-HSCHXYMDSA-N Lys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N GFWLIJDQILOEPP-HSCHXYMDSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 1
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- 108010059343 MM Form Creatine Kinase Proteins 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- RMHHNLKYPOOKQN-FXQIFTODSA-N Met-Cys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O RMHHNLKYPOOKQN-FXQIFTODSA-N 0.000 description 1
- OXHSZBRPUGNMKW-DCAQKATOSA-N Met-Gln-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OXHSZBRPUGNMKW-DCAQKATOSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 1
- JYPITOUIQVSCKM-IHRRRGAJSA-N Met-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCSC)N JYPITOUIQVSCKM-IHRRRGAJSA-N 0.000 description 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 1
- ILKCLLLOGPDNIP-RCWTZXSCSA-N Met-Met-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ILKCLLLOGPDNIP-RCWTZXSCSA-N 0.000 description 1
- CNAGWYQWQDMUGC-IHRRRGAJSA-N Met-Phe-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CNAGWYQWQDMUGC-IHRRRGAJSA-N 0.000 description 1
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- WYNIRYZIFZGWQD-BPUTZDHNSA-N Met-Trp-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WYNIRYZIFZGWQD-BPUTZDHNSA-N 0.000 description 1
- RKRFGIBULDYDPF-XIRDDKMYSA-N Met-Trp-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKRFGIBULDYDPF-XIRDDKMYSA-N 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 241000713333 Mouse mammary tumor virus Species 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 102000036675 Myoglobin Human genes 0.000 description 1
- 108010062374 Myoglobin Proteins 0.000 description 1
- 102000003505 Myosin Human genes 0.000 description 1
- 108060008487 Myosin Proteins 0.000 description 1
- GXCLVBGFBYZDAG-UHFFFAOYSA-N N-[2-(1H-indol-3-yl)ethyl]-N-methylprop-2-en-1-amine Chemical compound CN(CCC1=CNC2=C1C=CC=C2)CC=C GXCLVBGFBYZDAG-UHFFFAOYSA-N 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 235000019483 Peanut oil Nutrition 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- NGNNPLJHUFCOMZ-FXQIFTODSA-N Pro-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 NGNNPLJHUFCOMZ-FXQIFTODSA-N 0.000 description 1
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- XZBYTHCRAVAXQQ-DCAQKATOSA-N Pro-Met-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XZBYTHCRAVAXQQ-DCAQKATOSA-N 0.000 description 1
- APIAILHCTSBGLU-JYJNAYRXSA-N Pro-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@@H]2CCCN2 APIAILHCTSBGLU-JYJNAYRXSA-N 0.000 description 1
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 241000714474 Rous sarcoma virus Species 0.000 description 1
- 235000019485 Safflower oil Nutrition 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 1
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 1
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- FRQRWAMUESPWMT-HSHDSVGOSA-N Thr-Trp-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N)O FRQRWAMUESPWMT-HSHDSVGOSA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- 229920001615 Tragacanth Polymers 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- CDPXXGFRDZVVGF-OYDLWJJNSA-N Trp-Arg-Trp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CDPXXGFRDZVVGF-OYDLWJJNSA-N 0.000 description 1
- DXDMNBJJEXYMLA-UBHSHLNASA-N Trp-Asn-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 DXDMNBJJEXYMLA-UBHSHLNASA-N 0.000 description 1
- PKUJMYZNJMRHEZ-XIRDDKMYSA-N Trp-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKUJMYZNJMRHEZ-XIRDDKMYSA-N 0.000 description 1
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 description 1
- DNUJCLUFRGGSDJ-YLVFBTJISA-N Trp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DNUJCLUFRGGSDJ-YLVFBTJISA-N 0.000 description 1
- LFMLXCJYCFZBKE-IHPCNDPISA-N Trp-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N LFMLXCJYCFZBKE-IHPCNDPISA-N 0.000 description 1
- KOVPHHXMHLFWPL-BPUTZDHNSA-N Trp-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CC(=O)N)C(=O)O KOVPHHXMHLFWPL-BPUTZDHNSA-N 0.000 description 1
- VRTMYQGKPQZAPO-SBCJRHGPSA-N Trp-Trp-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VRTMYQGKPQZAPO-SBCJRHGPSA-N 0.000 description 1
- SGQSAIFDESQBRA-IHPCNDPISA-N Trp-Tyr-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SGQSAIFDESQBRA-IHPCNDPISA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- RIJPHPUJRLEOAK-JYJNAYRXSA-N Tyr-Gln-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O RIJPHPUJRLEOAK-JYJNAYRXSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 1
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 1
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- XPYNXORPPVTVQK-SRVKXCTJSA-N Val-Arg-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N XPYNXORPPVTVQK-SRVKXCTJSA-N 0.000 description 1
- LMSBRIVOCYOKMU-NRPADANISA-N Val-Gln-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N LMSBRIVOCYOKMU-NRPADANISA-N 0.000 description 1
- SDSCOOZQQGUQFC-GVXVVHGQSA-N Val-His-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SDSCOOZQQGUQFC-GVXVVHGQSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- NSUUANXHLKKHQB-BZSNNMDCSA-N Val-Pro-Trp Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 NSUUANXHLKKHQB-BZSNNMDCSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- UZQJVUCHXGYFLQ-AYDHOLPZSA-N [(2s,3r,4s,5r,6r)-4-[(2s,3r,4s,5r,6r)-4-[(2r,3r,4s,5r,6r)-4-[(2s,3r,4s,5r,6r)-3,5-dihydroxy-6-(hydroxymethyl)-4-[(2s,3r,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxyoxan-2-yl]oxy-3,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-3,5-dihydroxy-6-(hy Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O)O[C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O)O[C@H]1CC[C@]2(C)[C@H]3CC=C4[C@@]([C@@]3(CC[C@H]2[C@@]1(C=O)C)C)(C)CC(O)[C@]1(CCC(CC14)(C)C)C(=O)O[C@H]1[C@@H]([C@@H](O[C@H]2[C@@H]([C@@H](O[C@H]3[C@@H]([C@@H](O[C@H]4[C@@H]([C@@H](O[C@H]5[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O5)O)[C@H](O)[C@@H](CO)O4)O)[C@H](O)[C@@H](CO)O3)O)[C@H](O)[C@@H](CO)O2)O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O UZQJVUCHXGYFLQ-AYDHOLPZSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 229940027570 adenoviral vector vaccine Drugs 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 239000000783 alginic acid Substances 0.000 description 1
- 235000010443 alginic acid Nutrition 0.000 description 1
- 229920000615 alginic acid Polymers 0.000 description 1
- 229960001126 alginic acid Drugs 0.000 description 1
- 150000004781 alginic acids Chemical class 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 159000000013 aluminium salts Chemical class 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 150000001450 anions Chemical class 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000010056 antibody-dependent cellular cytotoxicity Effects 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 230000003078 antioxidant effect Effects 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 210000003567 ascitic fluid Anatomy 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 230000006472 autoimmune response Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 239000013060 biological fluid Substances 0.000 description 1
- 210000000601 blood cell Anatomy 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 239000008366 buffered solution Substances 0.000 description 1
- 239000006172 buffering agent Substances 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 230000036755 cellular response Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 229920002301 cellulose acetate Polymers 0.000 description 1
- 239000012707 chemical precursor Substances 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 229940110456 cocoa butter Drugs 0.000 description 1
- 235000019868 cocoa butter Nutrition 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000002648 combination therapy Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 235000005687 corn oil Nutrition 0.000 description 1
- 239000002285 corn oil Substances 0.000 description 1
- 239000008120 corn starch Substances 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 235000012343 cottonseed oil Nutrition 0.000 description 1
- 239000002385 cottonseed oil Substances 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 231100000433 cytotoxic Toxicity 0.000 description 1
- 210000001151 cytotoxic T lymphocyte Anatomy 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- 210000005045 desmin Anatomy 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000007884 disintegrant Substances 0.000 description 1
- 239000002270 dispersing agent Substances 0.000 description 1
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 210000003743 erythrocyte Anatomy 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 235000019325 ethyl cellulose Nutrition 0.000 description 1
- 229920001249 ethyl cellulose Polymers 0.000 description 1
- LVGKNOAMLMIIKO-QXMHVHEDSA-N ethyl oleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC LVGKNOAMLMIIKO-QXMHVHEDSA-N 0.000 description 1
- 229940093471 ethyl oleate Drugs 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 235000013355 food flavoring agent Nutrition 0.000 description 1
- 235000003599 food sweetener Nutrition 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 108010045774 gag protein (129-135) Proteins 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 210000000609 ganglia Anatomy 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 239000003349 gelling agent Substances 0.000 description 1
- 239000003862 glucocorticoid Substances 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 150000002334 glycols Chemical class 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 239000003979 granulating agent Substances 0.000 description 1
- 239000005090 green fluorescent protein Substances 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 210000005003 heart tissue Anatomy 0.000 description 1
- 210000002443 helper t lymphocyte Anatomy 0.000 description 1
- 239000003906 humectant Substances 0.000 description 1
- 230000028996 humoral immune response Effects 0.000 description 1
- 230000004727 humoral immunity Effects 0.000 description 1
- 230000008348 humoral response Effects 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000016784 immunoglobulin production Effects 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000001361 intraarterial administration Methods 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 239000007951 isotonicity adjuster Substances 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 230000021633 leukocyte mediated immunity Effects 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 210000004880 lymph fluid Anatomy 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- VTHJTEIRLNZDEV-UHFFFAOYSA-L magnesium dihydroxide Chemical compound [OH-].[OH-].[Mg+2] VTHJTEIRLNZDEV-UHFFFAOYSA-L 0.000 description 1
- 239000000347 magnesium hydroxide Substances 0.000 description 1
- 229910001862 magnesium hydroxide Inorganic materials 0.000 description 1
- 230000003211 malignant effect Effects 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 239000004005 microsphere Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000004660 morphological change Effects 0.000 description 1
- 239000002105 nanoparticle Substances 0.000 description 1
- 239000002077 nanosphere Substances 0.000 description 1
- 238000013188 needle biopsy Methods 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 239000003921 oil Substances 0.000 description 1
- 235000019198 oils Nutrition 0.000 description 1
- 239000004006 olive oil Substances 0.000 description 1
- 235000008390 olive oil Nutrition 0.000 description 1
- 229960005030 other vaccine in atc Drugs 0.000 description 1
- 230000003071 parasitic effect Effects 0.000 description 1
- 239000000312 peanut oil Substances 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 239000008055 phosphate buffer solution Substances 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 239000004014 plasticizer Substances 0.000 description 1
- 210000004910 pleural fluid Anatomy 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920000447 polyanionic polymer Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 150000004804 polysaccharides Chemical class 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 229920001592 potato starch Polymers 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 229960003387 progesterone Drugs 0.000 description 1
- 239000000186 progesterone Substances 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 238000011321 prophylaxis Methods 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 239000013074 reference sample Substances 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 235000005713 safflower oil Nutrition 0.000 description 1
- 239000003813 safflower oil Substances 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000013207 serial dilution Methods 0.000 description 1
- 239000008159 sesame oil Substances 0.000 description 1
- 235000011803 sesame oil Nutrition 0.000 description 1
- 235000019812 sodium carboxymethyl cellulose Nutrition 0.000 description 1
- 229920001027 sodium carboxymethylcellulose Polymers 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 239000003549 soybean oil Substances 0.000 description 1
- 235000012424 soybean oil Nutrition 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 210000004988 splenocyte Anatomy 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 229940031439 squalene Drugs 0.000 description 1
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 239000000829 suppository Substances 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 239000003765 sweetening agent Substances 0.000 description 1
- 239000000454 talc Substances 0.000 description 1
- 229910052623 talc Inorganic materials 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 238000004448 titration Methods 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 239000000196 tragacanth Substances 0.000 description 1
- 235000010487 tragacanth Nutrition 0.000 description 1
- 229940116362 tragacanth Drugs 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 1
- 238000000870 ultraviolet spectroscopy Methods 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 108010043941 valyl-glutamyl-isoleucyl-asparaginyl-cysteinyl-threonyl-arginine Proteins 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 229960004854 viral vaccine Drugs 0.000 description 1
- 229940023147 viral vector vaccine Drugs 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000001993 wax Substances 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
Images
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K35/00—Medicinal preparations containing materials or reaction products thereof with undetermined constitution
- A61K35/66—Microorganisms or materials therefrom
- A61K35/76—Viruses; Subviral particles; Bacteriophages
- A61K35/761—Adenovirus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
- A61P31/18—Antivirals for RNA viruses for HIV
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
- A61K2039/53—DNA (RNA) vaccination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/10011—Adenoviridae
- C12N2710/10311—Mastadenovirus, e.g. human or simian adenoviruses
- C12N2710/10341—Use of virus, viral particle or viral elements as a vector
- C12N2710/10343—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16034—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16111—Human Immunodeficiency Virus, HIV concerning HIV env
- C12N2740/16134—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16211—Human Immunodeficiency Virus, HIV concerning HIV gagpol
- C12N2740/16234—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Virology (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Medicinal Chemistry (AREA)
- Microbiology (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Pharmacology & Pharmacy (AREA)
- Veterinary Medicine (AREA)
- General Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Epidemiology (AREA)
- Biotechnology (AREA)
- Mycology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Immunology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Tropical Medicine & Parasitology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Oncology (AREA)
- Communicable Diseases (AREA)
- AIDS & HIV (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
本发明包括产生黑猩猩来源的腺病毒血清型AdC6或AdC7载体疫苗的组合物和方法,其中早期基因E1基因组区被缺失,和其中核酸序列进一步包含含有与编码异源蛋白质的序列可操作地连接的启动子序列的表达盒,其中异源蛋白质为选自gp140和Gag的至少一种HIV蛋白质,其中gp140来自选自B、AE、BC和C的中国HIV进化枝,和其中Gag来自中国HIV进化枝B。此外,本发明涵盖为哺乳动物接种疫苗的药物组合物以及蛋白质表达系统。
Description
相关申请的交叉引用
本申请根据35U.S.C.§119(e)要求2019年4月17日提交的美国临时申请专利号62/835,108的优先权,其通过引用以其整体并入本文。
背景技术
HIV感染在世界范围内很普遍,促使人们寻求开发有效的疫苗来治疗或预防HIV感染。中国和亚洲的情况也是如此。接种疫苗被广泛认为是预防或改善传染病的发病率的最有效方法。病毒载体疫苗,诸如基于腺病毒载体的疫苗,可用于抵抗各种传染性和恶性疾病(Small and Ertl,Curr Opin Virol.2011,October 1;1(4):241–245)。
本领域需要生产用于治疗或预防HIV感染的更有效的腺病毒载体疫苗系统的方法。中国和亚洲其他国家的需求尤为迫切。本发明满足了这一需求。
发明内容
提供了包含血清型AdC6或AdC7的黑猩猩来源的腺病毒载体的核酸序列的组合物,其中早期基因E1基因组区被缺失,和其中核酸序列进一步包含表达盒,表达盒包含与编码异源蛋白质的序列可操作地连接的启动子,其中异源蛋白质是选自gp140和Gag的至少一种HIV蛋白质;
其中gp140来自选自B、AE、BC和C的中国HIV进化枝;和
其中Gag来自中国HIV进化枝B。
在一些实施方式中,表达盒位于早期基因E1基因组区中。在一些实施方式中,表达盒包含嵌合内含子和/或CMV增强子。在一些实施方式中,由ORF3、ORF4、ORF5、ORF6、和ORF7组成的早期基因E3基因组区被缺失。在进一步实施方式中,整个早期基因E3基因组区被缺失。
在一些实施方式中,启动子是组成型启动子。在进一步实施方式中,启动子是巨细胞病毒即时早期启动子(CMV)。
在一些实施方式中,核酸序列包含SEQ ID NO:6或7。
提供了包含前述实施方式中任一项的组合物的蛋白质表达系统,其中核酸序列包含SEQ ID NO:6或7。
还提供了包含前述实施方式中任一项的组合物的蛋白质表达系统,其中表达盒编码的异源蛋白质包含选自SEQ ID NO:1-5的氨基酸序列。
提供了包含血清型AdC6或AdC7的黑猩猩来源的腺病毒载体的核酸序列的组合物,其中早期基因E1基因组区被缺失,和其中核酸序列进一步包含含有与编码异源蛋白质的序列可操作地连接的组成型启动子的表达盒,其中表达盒位于早期基因E1基因组中,其中异源蛋白质是选自gp140和Gag的至少一种HIV蛋白质;
其中gp140来自选自B、AE、BC和C的中国HIV进化枝;和
其中Gag来自中国HIV进化枝B。
在一些实施方式中,核酸序列包含SEQ ID NO:6或7。
提供了包含前述实施方式中任一项的组合物的蛋白质表达系统,其中表达盒编码的异源蛋白质包含选自SEQ ID NO:1-5的氨基酸序列。
提供了引发哺乳动物中针对异源蛋白质的免疫应答的方法,方法包括向哺乳动物施用包含血清型AdC6或AdC7的黑猩猩来源的腺病毒载体的核酸序列的组合物,其中早期基因E1基因组区被缺失,和其中核酸序列进一步包含含有与编码异源蛋白质的序列可操作地连接的启动子的表达盒,其中异源蛋白质是选自gp140和Gag的至少一种HIV蛋白质;
其中gp140来自选自B、AE、BC和C的中国HIV进化枝;和
其中Gag来自中国HIV进化枝B。
在一些实施方式中,表达盒位于早期基因E1区中。在一些实施方式中,表达盒包含嵌合内含子和/或CMV增强子。
在一些实施方式中,由ORF3、ORF4、ORF5、ORF6、和ORF7组成的早期基因E3基因组区被缺失。在进一步实施方式中,整个早期基因E3基因组区被缺失。
在一些实施方式中,启动子是组成型启动子。在进一步实施方式中,启动子是巨细胞病毒即时早期启动子(CMV)。
在一些实施方式中,核酸序列包含SEQ ID NO:6或7。
提供了治疗和/或预防哺乳动物中HIV的方法,方法包括施用治疗有效量的由包含SEQ ID NO:6或7的核酸序列编码的组合物。
还提供了针对HIV感染为哺乳动物接种疫苗的方法,方法包括向哺乳动物施用治疗有效量的前述实施方式中任一项的组合物,其中组合物的施用引发哺乳动物的免疫应答。在一些实施方式中,为哺乳动物预防性施用组合物。在进一步实施方式中,为哺乳动动物治疗性施用组合物。在仍进一步实施方式中,组合物与佐剂组合施用。
提供了产生对哺乳动物中异源蛋白质的效应和记忆T细胞免疫应答的方法,方法包括以下步骤:(a)以有效引发哺乳动物中免疫应答的量向哺乳动物施用前述实施方式中任一项的组合物;(b)在第二随后的时间段施用第二有效量的前述实施方式中任一项的组合物,其中针对异源蛋白质的T记忆细胞在哺乳动物中被重新活化。在一些实施方式中,在(a)中第一施用和在(b)中第二施用的组合物包括选自gp140和Gag的相同或不同的HIV异源蛋白质。在进一步实施方式中,在(a)中第一施用和在(b)中第二施用的组合物是选自AdC6和AdC7的相同或不同血清型。在仍进一步实施方式中,在(a)中第一施用和在(b)中第二施用的组合物拥有相同或不同的HIV进化枝。
在一些实施方式中,方法进一步包括向哺乳动物施用免疫原的步骤。在一些实施方式中,免疫原包含异源蛋白质,其中异源蛋白质是选自gp140和Gag的至少一种HIV蛋白质;其中gp140来自选自B、AE、BC和C的中国HIV进化枝;和其中Gag来自中国HIV进化枝B,其中B细胞免疫应答被进一步扩大。
提供了产生对哺乳动物中异源蛋白质的适应性B细胞免疫应答的方法,方法包括以下步骤:(a)以有效引发哺乳动物中免疫应答的量向哺乳动物施用前述实施方式中任一项的组合物;(b)在第二随后的时间段施用第二有效量的前述实施方式中任一项的组合物,其中针对异源蛋白质的B记忆细胞在哺乳动物中被重新活化。在一些实施方式中,在(a)中第一施用和在(b)中第二施用的组合物包括选自gp140和Gag的相同或不同的HIV异源蛋白质。在进一步实施方式中,在(a)中第一施用和在(b)中第二施用的组合物是选自AdC6和AdC7的相同或不同血清型。在仍进一步实施方式中,在(a)中第一施用和在(b)中第二施用的组合物拥有相同或不同的HIV进化枝。
在一些实施方式中,方法进一步包括向哺乳动物施用免疫原的步骤。在一些实施方式中,免疫原包含异源蛋白质,其中异源蛋白质是选自任意来源的任何进化枝的至少一种HIV env蛋白质,其中B细胞免疫应答被进一步扩大。
在一些实施方式中,哺乳动物是人。
可以组合方面或实施方式的任意和全部特征以实现新的实施方式。
附图说明
出于说明发明的目的,在附图中描绘了发明的某些实施方式。然而,发明不限于附图中描绘的实施方式的精确布置和工具。
图1A是图解来自应答于gag肽释放细胞因子的血液的所有CD8+CD44+细胞中CD8+CD44+细胞的百分比的一系列图表。没有gag肽的背景应答被减去。线表示平均应答±SD。上方具有星号的线指示显著性差异(p<0.01)。图1B是图解来自14天前用AdC6gag或AdC7gag载体的1011个病毒颗粒(vp)免疫的小鼠的汇集血液的所有CD8+CD44+细胞中CD8+CD44+细胞的百分比的一系列图表,AdC6gag或AdC7gag载体应答于携带gag的优势免疫表位的肽产生细胞因子。没有gag肽的背景应答被减去。
图2是图解来自用AdC6gag或AdC7gag载体的1011个病毒颗粒(vp)免疫18天后个体小鼠的脾脏的测试的所有CD8+CD44+细胞中特异性CD8+CD44+细胞的百分比的一系列图表,AdC6gag或AdC7gag载体应答于gag肽产生指示的细胞因子。和反映基于布尔门控(Booleangating)计算的总应答。没有gag肽的背景应答被减去。
图3A-3B是显示来自用AdC6gag载体的1010或109vp免疫14天后汇集血液的测试的T细胞应答的一系列图表。图表布局与图1的成镜像。图3A显示CD8+T细胞应答,图3B显示CD4+T细胞应答。
图4显示了在涂覆有进化枝C、AE或BC的gp140蛋白质的平板上用所指示载体的1011vp初免后收集和测试的血清样品获得的ELISA结果。圆圈–用AdC6载体免疫的小鼠。方块–用AdC7载体免疫的小鼠。利用来自幼稚小鼠的血清获得的值被减去。线显示了中位数。nt–未测试。
图5显示了在涂覆有进化枝C、AE或BC的gp140蛋白质的平板上用指示载体的1011vp初免然后用表达相同插入物的异源载体的109vp加强(boost)后收集和测试的血清样品获得的ELISA结果。圆圈–用AdC6和AdC7载体免疫的小鼠。方块–用AdC7载体免疫然后用AdC6载体加强的小鼠。利用来自幼稚小鼠的血清获得的值被减去。线显示了中位数。nt–未测试。
图6显示了在涂覆有进化枝C、AE或BC的gp140蛋白质的平板上用AdC6载体的1011vp初免然后用表达相同插入物的AdC7载体的109vp加强并随后用明矾中进化枝C蛋白第二加强后收集和测试的血清样品获得的ELISA结果。利用来自幼稚小鼠的血清获得的值被减去。线显示了中位数。
图7显示了在涂覆有进化枝C、AE或BC的gp140蛋白质的平板上用AdC6载体的1011vp初免(圆圈)和之后用表达相同插入物的AdC7载体的109vp加强(方块)后收集和测试的血清样品获得的ELISA结果。利用来自幼稚小鼠的血清获得的值被减去。线显示了中位数。这些数据类似于图4和5中的数据,但是对2个时间点的试验同时进行以允许直接比较。
图8显示了用在涂覆有进化枝C、AE或BC的gp140蛋白质的平板上用AdC7载体的1011vp初免(圆圈)和之后用表达相同插入物的AdC6载体的109vp加强(方块)后收集和测试的血清样品获得的ELISA结果。利用来自幼稚小鼠的血清获得的值被减去。线显示了中位数。这些数据类似于图4和5中的数据,但是对2个时间点的试验同时进行以允许直接比较。
图9显示了在涂覆有进化枝C、AE或BC的gp140蛋白质的平板上用AdC6载体的1011vp初免(圆圈)然后用表达相同插入物的AdC7载体的109vp加强(方块)并随后用明矾中进化枝C蛋白质第二加强后收集和测试的血清样品获得的ELISA结果。利用来自幼稚小鼠的血清获得的值被减去。线显示了中位数。上方具有星号的线指示通过2向方差分析的显著性差异。这些数据类似于图5和6中的数据,但是对2个时间点的试验同时进行以允许直接比较。
图10显示了图6-8中所示数据的组合。
图11显示了根据用于针对测试的三种不同进化枝(C、AE和BC)进行免疫的插入物相关联的来自单个小鼠组的不同血清的吸收值。图显示了r-值。显著值通过柱上方的星号指示。
图12显示了在用汇集血液测试的AdC6gag或AdC7gag载体初免后2周(左)和使用来自个体小鼠的PBMC测试2周后的异源载体加强后的gag-特异性CD8+T细胞的频率。使用来自幼稚小鼠的PBMC作为实验的对照。结果显示了基于布尔门控计算的所有细胞因子(IFN-γ、IL-2、粒酶B和TNF-α)的和。
图13显示了在用AdC6gag初免2周后(左)和在用AdC7gag载体加强4周后gag-特异性CD8+T细胞的频率。结果显示了基于布尔门控计算的所有细胞因子(IFN-γ、IL-2、粒酶B和TNF-α)的和。
图14显示了用不同AdC6gp140载体的混合物(每种以109或1010vp给予)初免BALB/c小鼠,随后6周后用以相同剂量给予AdC7gp140载体加强,随后6周用进化枝C env蛋白加强后,针对进化枝C env作为吸附量的抗体应答。使用来自幼稚BALB/c小鼠的血清作为实验的对照。
图15显示了用不同AdC6gp140或AdC7gp140载体的混合物(每种以1010vp给予)初免ICR小鼠,随后8周后用以相同剂量给予异源载体加强后,针对进化枝C、AE和BC env作为吸附量的抗体应答。使用来自幼稚ICR小鼠的血清作为实验的对照。
具体实施方式
本发明涉及用于产生黑猩猩来源的腺病毒载体的组合物和方法,黑猩猩来源的腺病毒载体包括核酸序列和启动子序列,核酸序列包括在一些腺病毒早期基因中的缺失(即,其中早期基因E1区被缺失,和其中在一些实施方式中,来自早期基因E3的ORF3、ORF4、ORF5、ORF6、和ORF7或整个E3基因也被缺失),启动子序列与编码异源蛋白质的序列连接,在某些实施方式中,异源蛋白质包含选自gp140和Gag的HIV蛋白质;其中gp140来自选自B、AE、BC和C的中国HIV进化枝;和其中Gag来自中国HIV进化枝B。另外,本发明包括治疗和/或预防或免疫具体疾病或障碍的组合物和方法,以及在施用本发明的黑猩猩来源的腺病毒载体的哺乳动物中诱导效应和记忆T细胞和B细胞免疫应答的方法。
定义
除非另外定义,否则本文使用的所有技术和科学术语具有与本发明所属领域的普通技术人员通常理解的相同的含义。尽管与本文所述的那些相似或等同的任何方法和材料可用于测试本发明的实践中,但本文描述了优选的材料和方法。在描述和要求保护本发明时,将使用以下术语。
还应理解,本文使用的术语仅用于描述具体实施方式的目的,并不旨在是限制性的。
如本文所使用,冠词“一(a,an)”用于指代一个或多于一个(即,至少一个)该冠词的语法对象。举例来说,“一要素”意思是一个要素或多于一个要素。
如本文所使用的术语“抗体”或“Ab”是指衍生自免疫球蛋白质分子的蛋白质或多肽序列,其特异性结合抗原上的特定表位。抗体可以是衍生自天然来源或来自重组来源的完整免疫球蛋白质,并且可以是完整免疫球蛋白质的免疫反应性部分。在本发明中使用的抗体可以以多种形式存在,包括例如多克隆抗体、单克隆抗体、细胞内抗体(“胞内抗体”)、Fv,Fab和F(ab)2,以及单链抗体(scFv)和人源化抗体(Harlow et al.,1998,UsingAntibodies:A Laboratory Manual,Cold Spring Harbor Laboratory Press,NY;Harlowet al.,1989,Antibodies:A Laboratory Manual,Cold Spring Harbor,New York;Houston et al.,1988,Proc.Natl.Acad.Sci.USA 85:5879-5883;Bird et al.,1988,Science 242:423-426)。抗体可以衍生自天然来源或来自重组来源。抗体通常是免疫球蛋白质分子的四聚体。
术语“改善(ameliorating)”或“治疗”是指由于所进行的动作而减轻了与疾病相关的临床体征和/或症状。待监测的体征或症状对于熟练的临床医生来说是熟知的。
如本文所使用,当提及可测量值比如量、时间持续长度等时,术语“约”是指包括从规定值±20%或±10%,更优选地±5%,甚至更优选±1%,和仍更优选地±0.1%的变化,由此这些变化适合于进行所公开的方法。
术语“生物”或“生物样品”是指从生物体或从生物体的组分(例如细胞)获得的样品。样品可以是任何生物组织或流体。通常地,样品将是“临床样品”,其为源自患者的样品。这样的样品包括但不限于骨髓、心脏组织、痰液、血液、淋巴液、血细胞(例如白细胞)、组织或细针活检样品、尿液、腹膜液和胸膜液、或来自其的细胞。生物样品还可以包括组织切片,例如用于组织学目的获取的冷冻切片。
如本文所使用,“更大”是指高于对照至少10%或更多,例如高于对照20%、30%、40%或50%、60%、70%、80%、90%或更多,和/或高于对照1.1倍、1.2倍、1.4倍、1.6倍、1.8倍、2.0倍或更多,以及其间的任何和所有全增量和部分增量的表达水平。
如本文所使用,术语“对照”或“参照”可以互换地使用,并且指用作比较标准的值。
如本文所使用的术语“免疫原性”是指当将抗原或生物体施用到动物时,抗原或生物体引发动物中的免疫应答的先天能力。因此,“加强免疫原性”是指当将抗原或生物体施用到动物时,增加抗原或生物体引发动物中的免疫应答的能力。抗原或生物体引发免疫应答的增加的能力可以通过以下衡量:结合至抗原或生物体的更多数量的抗体、抗原或生物体的抗体的更大的多样性、对抗原或生物体特异性的更多数量的T细胞、对抗原或生物体的更大的细胞毒性或辅助T细胞应答、响应于抗原的细胞因子的更高表达等。
如本文所使用,术语“引发免疫应答”或“免疫”是指针对异源蛋白质产生B细胞和/或T细胞应答的过程。
如本文所使用,术语“活化”是指在充分的细胞表面部分连接后诱导显著的生化或形态变化的细胞状态。在T细胞的背景下,这种活化是指已被充分刺激以诱导细胞增殖的T细胞的状态。T细胞的活化还可以诱导细胞因子产生和进行调节效应子或细胞溶解效应子功能。在其他细胞的背景下,该术语推断具体物理化学过程的上调或下调。
术语“活化的T细胞”是指目前正在经历细胞分裂、细胞因子产生、进行调节效应子或细胞溶解效应子功能和/或最近经历“活化”过程的T细胞。
如本文所使用的术语“抗原”或“Ag”被定义为引起免疫应答的分子。该免疫应答可包括抗体产生或特异性免疫感受态细胞的活化,或两者。技术人员将理解,包括几乎所有的蛋白质或肽在内的任何大分子都可以用作抗原。此外,抗原可以衍生自重组DNA或基因组DNA。因此,技术人员将理解,任何DNA——其包含编码引发免疫应答的蛋白质的核苷酸序列或部分核苷酸序列——编码“抗原”,该术语如在本文所使用的。此外,本领域技术人员将理解,抗原不需要唯一地由基因的全长核苷酸序列编码。显而易见的是,本发明包括但不限于多于一种基因的部分核苷酸序列的使用,并且这些核苷酸序列以各种组合排列以引发期望的免疫应答。此外,技术人员将理解,抗原根本不需要由“基因”编码。显而易见的是,抗原可以合成产生,或者可以源自生物样品。这种生物样品可以包括但不限于组织样品、肿瘤样品、细胞或生物流体。
本文使用的“异源抗原”是指对包括或表达抗原的生物体非内源的抗原。作为实例,包含或表达病毒或肿瘤抗原的病毒疫苗载体包括异源抗原。本文所使用的术语“异源蛋白质”是指在受试者(即哺乳动物)中引起有益的免疫应答的蛋白质,不论其来源如何。
本文所使用的术语“人体免疫缺陷病毒”或“HIV”是指本领域已知的或迄今未知的任何HIV毒株或变体,其包括但不限于HIV-1和HIV-2。在本文公开的某些实施方式中示例了HIV-1。
术语“特异性结合”、“选择性结合”或“结合特异性”是指本发明的人源化抗体或结合化合物以比当结合至非靶标表位时产生的亲和力更高的亲和力结合至靶标表位的能力。在某些实施方式中,特异性结合是指以高于非靶标表位的亲和力至少10、50、100、250、500或1000倍的亲和力结合至靶标。
如本文所使用,“组合疗法”是指第一药剂与另一种药剂联合施用。“与……组合”或“与……结合”是指施用除了另一种治疗方式之外的一种治疗方式。因此,“与……组合”是指在向个体递送其他治疗方式之前、期间或之后施用一种治疗方式。这种组合被认为是单一治疗方式或方案的一部分。
“体液免疫”或“体液免疫应答”二者均指B细胞介导的免疫,并由B淋巴细胞(B细胞)产生和分泌的高度特异性抗体介导。
“预防”是指药物组合物用于针对紊乱接种疫苗的用途。
“佐剂”是指能够加强抗原的免疫原性的物质。佐剂可以是一种物质或多种物质的混合物,并通过直接作用于免疫系统或通过提供抗原的缓慢释放起作用。佐剂的实例是铝盐、聚阴离子、细菌糖肽和作为弗氏不完全佐剂的缓释剂。
“递送载体(Delivery vehicle)”是指有助于将抗原靶向特定细胞并促进抗原被免疫系统有效识别的组合物。最公知的递送载体是脂质体、病毒体、包括微球和纳米球在内的微粒、复合神经节、菌影、细菌多糖、减毒细菌、病毒样颗粒、减毒病毒和ISCOMS。。
如本文所使用,术语“表达盒”意思是能够引导异源编码序列转录和/或翻译的核酸序列。在一些实施方式中,表达盒包含与编码异源蛋白质的序列可操作地连接的启动子序列。在一些实施方式中,表达盒进一步包含与编码异源蛋白质的序列可操作地连接的至少一个调节序列。
“并入……中”或“包封在……中”是指抗原肽在递送载体,比如微粒、菌影、减毒细菌、病毒样颗粒、减毒病毒、ISCOM、脂质体和优选的病毒体内。
如本文所使用,术语“肽”、“多肽”和“蛋白质”可以互换地使用,并且指由通过肽键共价连接的氨基酸残基组成的化合物。蛋白质或肽必须包含至少两个氨基酸,并且对可构成蛋白质或肽的序列的氨基酸的最大数量没有限制。多肽包括包含通过肽键彼此连接的两个或更多个氨基酸的任何肽或蛋白质。如本文所使用,该术语是指短链和较长的链,短链在本领域中通常也被称为肽、寡肽和寡聚体,较长的链在本领域中通常被称为具有多种类型的蛋白质。“多肽”包括例如生物活性片段、基本上同源的多肽、寡肽、同源二聚体、异源二聚体、多肽的变体、修饰的多肽、衍生物、类似物、融合蛋白质等。多肽包括天然肽、重组肽、合成肽或其组合。
如本文所使用的“融合蛋白质”是指如此蛋白质,其中该蛋白质包括通过肽键或其他化学键连接在一起的两个或更多个蛋白质。蛋白质可以通过肽键或其他化学键直接连接在一起,或者在两个或更多个蛋白质之间具有一个或多个氨基酸,该一个或多个氨基酸在本文中被称为间隔区。
在本发明的上下文中,使用常见的核酸碱基的下列缩写。“A”是指腺苷,“C”是指胞嘧啶,“G”是指鸟苷,“T”是指胸苷,“U”是指尿苷。
如本文所使用的术语“RNA”被定义为核糖核酸。
“转化(transform,transforming,transformation)”在本文中用于指将分离的核酸引入生物体内部的过程。
在本发明的上下文中使用的术语“治疗(treatment)”意思是指包括疾病或紊乱的治疗性治疗以及预防或抑制措施。如本文所使用,术语“治疗(treatment)”和相关术语比如“治疗(treat)”和“治疗(treating)”是指病情或其至少一种症状的进展、严重性和/或持续时间的降低。因此,术语“治疗(treatment)”是指可使受试者受益的任何方案。治疗可以是针对现有病症,或可以是预防性的(预防性治疗)。治疗可以包括治愈、缓解或预防效果。本文提及的“治疗性”和“预防性”治疗应在其最广泛的背景下考虑。术语“治疗性”并不必然意味着治疗受试者直至完全康复。类似地,“预防性”并不必然意味着受试者最终不会感染病情。因此,例如,术语治疗包括在疾病或紊乱发作之前或之后施用药剂,从而预防或消除疾病或紊乱的所有症候。作为另一个例子,在疾病的临床表现之后施用药剂以对抗疾病的症状构成疾病的“治疗”。
当术语“等价的”关于核苷酸序列使用时,应理解为是指编码功能上等同的多肽的核苷酸序列。等价核苷酸序列将包括通过一个或多个核苷酸取代、添加或缺失区别的序列,比如等位基因变体;并且因此由于遗传密码的简并性,将包括与本文所述核酸的核苷酸序列不同的序列。
如本文中关于核酸比如DNA或RNA使用的术语“分离的”是指分别与存在于大分子的天然来源中的其他DNA或RNA分离的分子。如本文所使用,术语分离还指如此核酸或肽,当其通过重组DNA技术产生时基本上不含细胞材料、病毒材料或培养基,或当化学合成时基本上不含化学前体或其他化学品。此外,“分离的核酸”是指包括核酸片段,其不会作为片段天然出现并且不会以天然状态被发现。术语“分离的”在本文中也用于指多肽,其与其他细胞蛋白质分离并且意思是包括纯化多肽和重组多肽二者。“分离的细胞”或“分离的细胞群”是在其天然环境中不存在的细胞或细胞群。
如本文使用的“突变”是导致由其天然状态改变的DNA序列的变化。突变可以包括至少一个脱氧核糖核酸碱基比如嘌呤(腺嘌呤和/或胸腺嘧啶)和/或嘧啶(鸟嘌呤和/或胞嘧啶)的缺失和/或插入和/或复制和/或取代。突变可能会或可能不会产生在生物体的可观察特征(表型)中的可辨别的变化。
如本文所使用,术语“核酸”是指多核苷酸,比如脱氧核糖核酸(DNA),和在合适的情况下指核糖核酸(RNA)。该术语还应理解为包括由核苷酸类似物制备的RNA或DNA的类似物作为等价物,并且如适用于描述的实施方式,包括单链多核苷酸(有义或反义)和双链多核苷酸。EST、染色体、cDNA、mRNA和rRNA是可以被称为核酸的分子的代表性实例。
如本文所使用,“可操作地连接的”序列包括与感兴趣的基因邻接的表达控制序列和反式或在一定距离处起作用以控制感兴趣的基因的表达控制序列。表达控制序列包括合适的转录起始、终止、启动子和增强子序列;有效的RNA处理信号,比如剪接和多腺苷酸化(polyA)信号;稳定细胞质mRNA的序列;提高翻译效率的序列(即Kozak共有序列);加强蛋白质稳定性的序列;并且当需要时,加强编码产物分泌的序列。许多表达控制序列——包括天然的、组成型的、诱导型的和/或组织特异性的启动子——在本领域是已知的并且可用于本发明的组合物中。除了DNA表达和控制序列之外,“可操作地连接”还应该被解释为包括RNA表达和控制序列。
本文所使用的术语“启动子”被定义为由细胞合成机器或引入的合成机器识别的启动多核苷酸序列特异性转录所需的DNA序列。
如本文所使用,术语“启动子/调节序列”意思指对于表达可操作地连接至启动子/调节序列的基因产物所需的核酸序列。在一些情况下,该序列可以是核心启动子序列,和在其他情况下,该序列还可以包括对于表达基因产物所需的增强子序列和其他调节元件。例如,启动子/调节序列可以是以组织特异性方式表达基因产物的一种启动子/调节序列。
“组成型”启动子是核苷酸序列,当其与编码或指定基因产物的多核苷酸可操作地连接时,在细胞的大多数或所有生理条件下,该核苷酸序列引起基因产物在细胞中产生。
“诱导型”启动子是核苷酸序列,当其与编码或指定基因产物的多核苷酸可操作地连接时,基本上只有当细胞中存在对应于启动子的诱导物时,该核苷酸序列引起基因产物在细胞中产生。
如本文所使用,术语“药物组合物”是指本发明中有用的至少一种化合物与其他化学组分的混合物,所述化学组分比如运载体、稳定剂、稀释剂、佐剂、分散剂、悬浮剂、增稠剂和/或赋形剂。药物组合物有助于将化合物施用到生物体。本领域存在多种施用化合物的技术,其包括但不限于:静脉内、口服、气溶胶、肠胃外、眼内、肺部和外用施用。
语言“药学上可接受的运载体”包括药学上可接受的盐、药学上可接受的材料、组合物或运载体,比如液体或固体填充剂、稀释剂、赋形剂、溶剂或包封材料,其参与在受试者内或向受试者运送或运输本发明的化合物(一种或多种),使得它可以执行其期望的功能。通常地,这类化合物从一个器官或身体的一部分运送或运输到另一个器官或身体的一部分。在与制剂的其他成分相容的意义上,每种盐或运载体必须是“可接受的”,并且对受试者是无害的。可用作药学上可接受的运载体的材料的一些实例包括:糖,比如乳糖、葡萄糖和蔗糖;淀粉,比如玉米淀粉和马铃薯淀粉;纤维素及其衍生物,如羧甲基纤维素钠、乙基纤维素和醋酸纤维素;西黄蓍胶粉;麦芽;明胶;滑石;赋形剂,比如,可可脂和栓剂蜡;油,比如花生油、棉籽油、红花油、芝麻油、橄榄油、玉米油和豆油;二醇,比如丙二醇;多元醇,比如甘油、山梨糖醇、甘露醇和聚乙二醇;酯,比如油酸乙酯和月桂酸乙酯;琼脂;缓冲剂,比如氢氧化镁和氢氧化铝;藻酸;无热原水;等渗盐水;林格溶液(Ringer’s solution);乙醇;磷酸盐缓冲溶液;稀释剂;成粒剂;润滑剂;粘合剂;崩解剂;润湿剂;乳化剂;着色剂;脱模剂;涂层剂;甜味剂;调味剂;加香剂;防腐剂;抗氧化剂;增塑剂;胶凝剂;增稠剂;硬化剂;沉降剂;悬浮剂;表面活性剂;保湿剂;运载体;稳定剂;和药物制剂中使用的其他无毒相容物质,或其任何组合。如本文所使用,“药学上可接受的运载体”还包括与化合物的活性相容的并且对于受试者是生理学上可接受的任何和所有包衣、抗细菌剂和抗真菌剂,以及吸收延迟剂等。补充活性化合物也可以并入组合物中。
如本文所使用,术语“有效量”或“治疗有效量”是指从本发明的载体产生的病毒样颗粒的量,其是对于预防具体病情所需的,或者降低病情或其至少一种症状或与其相关的病症的严重性和/或改善病情或其至少一种症状或与其相关的病症。
如本文使用的“受试者”或“患者”可以是人或非人哺乳动物。非人哺乳动物包括例如家畜和宠物,比如绵羊、牛科、猪科、犬科、猫科和鼠科哺乳动物。优选地,受试者是人。
“效价”是与参考样品相比的病毒或病毒载体浓度的数值量度,其中浓度通过病毒的活性或通过测量单位体积的缓冲溶液中的病毒数量来确定。例如,通过使用软琼脂方法测量病毒的溶液(一种或多种)(通常是连续稀释)对例如HeLa细胞的感染性(参见,Graham&van der Eb(1973)Virology 52:456-467),或通过监测赋予细胞的抗性,例如由病毒或载体编码的G418抗性,或通过UV分光光度法定量病毒(参见Chardonnet&Dales(1970)Virology 40:462-477)来确定病毒原液的效价。
“载体”是包括分离的核酸并且可用于将分离的核酸递送至细胞内部的物质的组合物。本领域已知许多载体,其包括但不限于线性多核苷酸、与离子或两亲化合物相关联的多核苷酸、质粒和病毒。在本公开内容中,术语“载体”包括自主复制的病毒。
范围:贯穿本公开内容,本发明的各个方面可以以范围格式呈现。应当理解,范围形式的描述仅仅是为了方便和简洁,并且不应该被解释为对本发明范围的僵化限制。因此,范围的描述应该被认为已明确地公开了所有可能的子范围以及该范围内的单个数值。例如,范围比如从1至6的描述应当被认为明确公开了子范围,比如从1至3、从1至4、从1至5、从2至4、从2至6、从3至6等,以及在该范围内的单个数字,例如,1、2、2.7、3、4、5、5.3和6。无论该范围的广度如何,这都适用。
描述
提供了包含血清型AdC6或AdC7的黑猩猩来源的腺病毒载体的核酸序列的组合物,其中早期基因E1基因组区被缺失,和其中核酸序列进一步包含表达盒,表达盒包含与编码异源蛋白质的序列可操作地连接的启动子,其中异源蛋白质是选自gp140和Gag的至少一种HIV蛋白质;其中gp140来自选自B、AE、BC和C的中国HIV进化枝;和其中Gag来自中国HIV进化枝B。
在一些实施方式中,表达盒进一步包含与编码异源蛋白质的序列可操作地连接的至少一个调节序列。
在一些实施方式中,表达盒位于早期基因E1基因组区中。
在一些实施方式中,表达盒进一步包含嵌合内含子和/或CMV增强子。
在一些实施方式中,由ORF3、ORF4、ORF5、ORF6、和ORF7组成的早期基因E3基因组区被缺失。
在一些实施方式中,整个早期基因E3基因组区被缺失。
在进一步实施方式中,启动子是组成型启动子。在仍进一步实施方式中,启动子是巨细胞病毒即时早期启动子(CMV)。
在一些实施方式中,核酸序列包含SEQ ID NO:6或7。在一些实施方式中,核酸序列由SEQ ID NO:6或7组成。
提供了包含前述实施方式中任一项的组合物的蛋白质表达系统,其中核酸序列包含SEQ ID NO:6或7。还提供了包含前述实施方式中任一项的组合物的蛋白质表达系统,其中核酸序列由SEQ ID NO:6或7组成。还提供了包含前述实施方式中任一项的组合物的蛋白质表达系统,其中表达盒编码的异源蛋白质包含选自SEQ ID NO:1-5的氨基酸序列。
还提供了引发哺乳动物中针对异源蛋白质的免疫应答的方法,方法包括向哺乳动物施用包含血清型AdC6或AdC7的黑猩猩来源的腺病毒载体的核酸序列的组合物,其中早期基因E1基因组区被缺失,和其中核酸进一步包含含有与编码异源蛋白质的序列可操作地连接的启动子的表达盒,其中异源蛋白质是选自gp140和Gag的至少一种HIV蛋白质;其中gp140来自选自B、AE、BC和C的中国HIV进化枝;和其中Gag来自中国HIV进化枝B。
在一些实施方式中,表达盒进一步包含与编码异源蛋白质的序列可操作地连接的至少一个调节序列。
在一些实施方式中,表达盒位于早期基因E1基因组区中。
在一些实施方式中,表达盒进一步包含嵌合内含子和/或CMV增强子。
在一些实施方式中,由ORF3、ORF4、ORF5、ORF6、和ORF7组成的早期基因E3基因组区被缺失。
在一些实施方式中,整个早期基因E3基因组区被缺失。
在进一步实施方式中,启动子是组成型启动子。在仍进一步实施方式中,启动子是巨细胞病毒即时早期启动子(CMV)。
提供了治疗和/或预防哺乳动物中HIV的方法,方法包括施用治疗有效量的由包含SEQ ID NO:6或7的核酸序列编码的组合物。在一些实施方式中,核酸序列由SEQ ID NO:6或7组成。
提供了针对HIV感染为哺乳动物接种疫苗的方法,方法包括向哺乳动物施用治疗有效量的前述实施方式中任一项的组合物,其中组合物的施用引发哺乳动物的免疫应答。在一些实施方式中,为哺乳动物预防性施用组合物。在进一步实施方式中,为哺乳动动物治疗性施用组合物。在仍进一步实施方式中,组合物与佐剂组合施用。
提供了产生对哺乳动物中异源蛋白质的效应和记忆T细胞免疫应答的方法,方法包括以下步骤:(a)以有效引发哺乳动物中免疫应答的量向哺乳动物施用前述实施方式中任一项的组合物;(b)在第二随后的时间段施用第二有效量的前述实施方式中任一项的组合物,其中针对异源蛋白质的T记忆细胞在哺乳动物中被重新活化。在一些实施方式中,在(a)中第一施用和在(b)中第二施用的组合物包括选自gp140和Gag的相同或不同的HIV异源蛋白质,其中gp140来自选自B、AE、BC和C的中国HIV进化枝;和其中Gag来自中国HIV进化枝B。在进一步实施方式中,在(a)中第一施用和在(b)中第二施用的组合物是选自AdC6和AdC7的相同或不同血清型。
提供了产生对哺乳动物中异源蛋白质的适应性B细胞免疫应答的方法,方法包括以下步骤:(a)以有效引发哺乳动物中免疫应答的量向哺乳动物施用前述实施方式中任一项的组合物;(b)在第二随后的时间段施用第二有效量的前述实施方式中任一项的组合物,其中针对异源蛋白质的B记忆细胞在哺乳动物中被重新活化。
在一些实施方式中,方法进一步包括向哺乳动物施用免疫原的步骤。在进一步实施方式中,免疫原包含异源蛋白质,其中异源蛋白质是选自源于任意来源的任何进化枝的gp140的至少一种HIV蛋白质,其中B细胞免疫应答被进一步扩大。在一些实施方式中,异源蛋白质来自中国进化枝或来自非洲进化枝。在一些实施方式中,如此施用的异源蛋白质是与在前述实施方案中任一项的黑猩猩来源的腺病毒载体的核酸序列中表达的异源蛋白质相同的异源蛋白质。在一些实施方式中,如此施用的异源蛋白质是与前述方法中任一项的步骤(a)和/或步骤(b)中施用的异源蛋白质相同的异源蛋白质。在一些实施方式中,免疫原进一步包含佐剂,例如明矾。
在一些实施方式中,在步骤(a)和(b)后向哺乳动物施用免疫原。
在一些实施方式中,哺乳动物是人。
包含E1和/或E3中缺失的腺病毒载体在国际申请PCT/US2017/043315(WO 2018/026547)中公开,其以其全部并入本文。
包含使用本文公开的腺病毒载体制备的腺病毒颗粒的疫苗组合物可用于诱导哺乳动物中针对一种或多种编码的异源蛋白质或其抗原部分的免疫性。使用所公开的疫苗组合物或剂量单位可以诱导免疫性。可以使用本领域已知的合适方法来评估免疫应答,例如WO2012/02483中所公开的。
异源基因表达
在一个方面中,尽管巨细胞病毒即时早期启动子在本文中被示例为驱动HIV蛋白质表达的启动子,但本发明不应被解释为限于该启动子序列。可用于本发明的启动子序列包括诱导高水平的基因表达的任何启动子。这种启动子可以包括但不限于本文其他地方公开的那些。
在一个实施方式中,合适的启动子是即时早期巨细胞病毒(CMV)启动子序列。该启动子序列是强的组成型启动子序列,其能够驱动与其可操作地连接的任何多核苷酸序列高水平的表达。合适的启动子的另一个实例是延伸生长因子-1α(EF-1α)。然而,也可以使用其他组成型启动子序列,其包括但不限于猿猴病毒40(SV40)早期启动子、小鼠乳腺肿瘤病毒(MMTV)、人体免疫缺陷病毒(HIV)长末端重复序列(LTR)启动子、MoMuLV启动子、禽白血病病毒启动子、埃巴病毒即时早期启动子、劳斯肉瘤病毒启动子,以及人基因启动子,比如,但不限于肌动蛋白质启动子、肌球蛋白质启动子、血红蛋白质启动子和肌酸激酶启动子。此外,本发明不应限于使用组成型启动子。诱导型启动子也被认为是本发明的一部分。诱导型启动子的使用提供了分子开关,当需要这种表达时它被可操作地连接,能够启动多核苷酸序列的表达,或者当不需要表达时关闭表达。诱导型启动子的实例包括但不限于金属硫蛋白质启动子、糖皮质激素启动子、孕酮启动子和四环素启动子。
在一些实施方式中,本发明还包括使用驱动给定异源基因在一种或多种特定类型的细胞中表达的组织特异性启动子(例如,肌红蛋白质启动子、肌肉肌酸激酶启动子、结蛋白质启动子、哺乳动物肌钙蛋白质1启动子和骨骼α-作用启动子)。此外,本领域已知的任何人工合成的启动子可用于本发明中,因为这些启动子可提供用于异源基因的最佳效率和稳定性。另外,增强子序列调节载体内包含的基因的表达。通常,增强子与蛋白质因子结合以加强基因的转录。增强子可位于它调节的基因的上游或下游。增强子也可以是组织特异性的,以加强在特定细胞或组织类型中的转录。
为了评估感兴趣的异源基因的表达,待导入细胞的表达载体还可包含选择标记基因或报告基因或两者,以便于从寻求通过杂交病毒载体感染的细胞群鉴定和选择表达细胞。在其他方面中,选择标记可以在单独的DNA片段上携带并用于共感染/转染过程。选择标记和报告基因二者都可以侧接合适的调节序列以能够实现在宿主细胞中的表达。有用的选择标记包括例如抗生素抗性基因,例如新霉素抗性基因等。
报告基因用于鉴定潜在感染的细胞和评估调节序列的功能性。通常,报告基因是不存在于受体生物体或组织中或由受体生物体或组织表达并且编码多肽的基因,该多肽的表达通过一些可容易检测的性质例如酶活性来表现。合适的报告基因可以包括编码荧光素酶、β-半乳糖苷酶、氯霉素乙酰转移酶、分泌的碱性磷酸酶的基因或绿色荧光蛋白质基因(例如,Ui-Tei et al.,2000FEBS Letters 479:79-82)。
对于本领域技术人员显而易见的是,本发明不限于由本发明的腺病毒载体表达的异源基因的性质。可以使用任何合适的异源基因,其中基因的表达为哺乳动物提供益处。例如,异源基因可以是病毒蛋白质,其在哺乳动物中的表达赋予对病毒感染的免疫力。类似地,异源基因可以是细菌抗原、寄生虫抗原、真菌抗原、癌抗原,参与有害自身免疫反应的抗原,或其中针对其的免疫应答提供益处的任何其他蛋白质。
异源蛋白质
在本发明中,本发明的腺病毒载体可以编码异源蛋白质,其中异源蛋白质为选自gp140和Gag的至少一种HIV蛋白质,其中gp140来自选自B、AE、BC和C的中国HIV进化枝,和其中Gag来自中国HIV进化枝B。通常,异源蛋白质是肽片段、多肽、蛋白质或融合蛋白质。任选地,合适的异源蛋白质使得在向哺乳动物施用载体后在哺乳动物中诱导针对其的细胞介导的免疫和体液应答。
本发明的方法
本发明的载体在可用于使哺乳动物免疫疾病,和/或治疗、预防或降低哺乳动物疾病的风险的多种应用中是有用的。
因此本发明包括使哺乳动物免疫异源蛋白质的方法。方法包括向哺乳动物施用组合物,该组合物包含含有血清型AdC6或AdC7的黑猩猩来源的腺病毒载体的核酸序列,其中早期基因E1基因组区被缺失,和其中核酸序列进一步包含表达盒,表达盒包含与编码异源蛋白质的序列可操作地连接的启动子序列,其中异源蛋白质是选自gp140和Gag的至少一种HIV蛋白质,其中gp140来自选自B、AE、BC和C的中国HIV进化枝,和其中Gag来自中国HIV进化枝B,和其中异源蛋白质的表达诱导哺乳动物的免疫应答。
在一些实施方式中,表达盒进一步包含与编码异源蛋白质的序列可操作地连接的至少一个调节序列.
在一些实施方式中,表达盒位于早期基因E1基因组区中。
在一些实施方式中,表达盒进一步包含嵌合内含子和/或CMV增强子。
在一些实施方式中,由ORF3、ORF4、ORF5、ORF6、和ORF7组成的早期基因E3基因组区被缺失。
在一些实施方式中,整个早期基因E3基因组区被缺失。
在一个实施方式中,黑猩猩来源的Ad载体是AdC6。在一个实施方式中,AdC6具有Genbank登录号AY530877。在一个实施方式中,黑猩猩来源的Ad载体是AdC7。在一个实施方式中,AdC7具有Genbank登录号AY530878。
本发明进一步包括治疗有需要的哺乳动物的方法,其中方法施用治疗有效量的包含SEQ ID NO:6或7的核酸序列的黑猩猩来源的腺病毒载体编码的组合物,其中异源基因的表达对哺乳动物有益。在一个方面,本发明包括产生对哺乳动物中异源蛋白质的效应和记忆T细胞免疫应答的方法。在一些实施方式中,核酸序列由SEQ ID NO:6或7组成。在另一方面,本发明包括产生对哺乳动物中异源蛋白质的适应性B细胞免疫应答的方法。
本发明中另外包括了降低哺乳动物将发展疾病的风险的方法。方法包括向哺乳动物施用包含血清型AdC6或AdC7的黑猩猩来源的腺病毒载体的核酸序列的组合物,其中早期基因E1基因组区被缺失,和其中核酸序列进一步包含表达盒,表达盒包含与编码异源蛋白质的序列可操作地连接的启动子序列,其中异源蛋白质是选自gp140和Gag的至少一种HIV蛋白质,其中gp140来自选自B、AE、BC和C的中国HIV进化枝,和其中Gag来自中国HIV进化枝B。
在一些实施方式中,表达盒进一步包含与编码异源蛋白质的序列可操作地连接的至少一个调节序列。
在一些实施方式中,表达盒位于早期基因E1基因组区中。
在一些实施方式中,表达盒进一步包含嵌合内含子和/或CMV增强子。
在一些实施方式中,由ORF3、ORF4、ORF5、ORF6、和ORF7组成的早期基因E3基因组区被缺失。
在一些实施方式中,整个早期基因E3基因组区被缺失。
异源基因的表达诱导对哺乳动物中由此编码的异源蛋白质的免疫应答,由此降低哺乳动物将发展与异源蛋白质相关的疾病(如,HIV-1)的风险。
腺病毒载体生产
在本文的实验实施例部分和美国申请号14/190,787(美国专利号9,624,510)中详细描述了制备本发明的腺病毒载体的方法,该美国申请通过引用并入本文。通常,在本领域中已经很好地建立了腺病毒载体的生产、纯化和质量控制程序。一旦产生载体骨架,则分子克隆可用于产生包括抗原异源蛋白质的编码序列的腺病毒质粒。在一些实施方式中,可以将质粒转染到包装细胞中,包装细胞以反式提供合适的腺病毒血清型的E1。包装细胞在本领域中是熟知的,并且细胞系比如HEK293或PERC6可用于此目的。然后一旦斑块变得可见,收获病毒颗粒。然后可以感染新鲜细胞以确保腺病毒的连续复制。可以使用DNA印迹或其他方法,比如限制酶作图、测序和PCR来评估质量,以确认转基因的存在并且不存在基因重排或不期望的缺失。
包含使用本文公开的腺病毒载体制备的腺病毒颗粒的疫苗组合物可用于诱导针对编码的抗原蛋白质的免疫。疫苗可以使用标准技术配制,并且除了编码所需蛋白质的无复制能力的腺病毒载体以外,还可以包括药学上可接受的载体,比如磷酸盐缓冲盐水(PBS)或其他缓冲溶液,以及其他组分,比如抗细菌剂和抗真菌剂、等渗剂和吸收延迟剂、佐剂等。在一些实施方式中,疫苗组合物与一种或多种其他疫苗组合施用。可以提供疫苗组合物的剂量单位。这种剂量单位通常包括108至1011个腺病毒颗粒(如,108、5×108、109、5×109、1010、5×1010、1011)。在一些实施方式中,选择5×1010个病毒颗粒的剂量。具体地,该剂量(5×1010)在临床试验中最适合人类。
药物组合物和制剂
本发明的载体可以配制为药物组合物。
这种药物组合物可以是适于施用到受试者(即哺乳动物)的形式,或者药物组合物可以进一步包括一种或多种药学上可接受的运载体、一种或多种另外的成分、或这些的一些组合。如本领域所熟知的,药物组合物的各种组分可以以生理学上可接受的盐的形式存在,比如与生理学上可接受的阳离子或阴离子组合。
在一个实施方式中,可以施用可用于实践本发明的方法的药物组合物以递送106和1012VP之间的剂量。
在一个实施方式中,可用于实践本发明的方法的药物组合物可以包括佐剂。合适的非限制性实例是弗氏完全佐剂、弗氏不完全佐剂、Quil A、Detox、ISCOM或角鲨烯。
可用于本发明的方法的药物组合物可以被合适地开发用于吸入、口服、直肠、阴道、肠胃外、外用、经皮、肺、鼻内、口腔、眼内、鞘内、静脉内或另外的施用途径。其他考虑的制剂包括预计的纳米颗粒、脂质体制品、包含活性成分的重新密封的红细胞和基于免疫学的制剂。施用途径(一种或多种)对于技术人员来说是容易显而易见的,并且取决于很多因素,包括所治疗疾病的类型和严重程度、所治疗的兽或人类患者的类型和年龄等。
尽管本文提供的药物组合物的描述主要涉及适合于合乎伦理施用至人的药物组合物,但本领域技术人员应理解,这种组合物通常适合施用于各种动物。修改适合于施用至人的药物组合物以便使组合物适合于施用至各种动物是很好理解的,并且一般熟练的兽医药理学家可以仅通过普通的(如果有的话)实验来设计和进行这种修改。考虑向其施用本发明的药物组合物的受试者包括但不限于人和其他灵长类动物、哺乳动物,其包括商业上相关的哺乳动物,比如牛、猪、马、绵羊、猫和狗。
本发明的组合物可以包括按组合物的总重量计从约0.005%至2.0%的防腐剂。在暴露于环境中的污染物的情况下,防腐剂用于防止变质。
施用/给药
施用方案可以影响什么构成有效量。例如,本发明的腺病毒载体可以以单一药剂、若干分开的剂量以及交错剂量施用至受试者(即哺乳动物),其可以每天或顺序施用,或者药剂可以连续输注,或者可能是推注(bolus injection)。此外,如治疗或预防情况的紧急情况所指示的,剂量可以按比例增加或减少。
将本发明的组合物施用至受试者,优选地哺乳动物,更优选地人,可以使用已知的程序,以有效地治疗受试者的疾病的剂量和时间段进行。实现预期结果所需的组合物的有效量将变化,并且取决于因素,比如待治疗或预防的疾病,正在治疗的受试者的年龄、性别、体重、病症、一般健康状况和既往病史等,和医学领域众所周知的类似因素。在具体实施方式中,特别有利的是以剂量单位形式配制组合物以便于剂量的施用和均匀性。如本文所使用的剂量单位形式是指适合作为待治疗的受试者的单一剂量的物理离散单位;每个单位包含预定量的治疗化合物,其经计算可产生与所需药物载体相关的期望的治疗效果。本发明的剂量单位形式由(a)组合物和待表达的异源蛋白质的独特特征,以及待实现的具体治疗效果决定,并直接取决于(a)组合物和待表达的异源蛋白质的独特特征,以及待实现的具体治疗效果。
施用途径
本领域技术人员将认识到,尽管可以使用多于一种途径进行施用,但是具体途径可以提供比另一种途径更即时和更有效的反应。本发明的任何组合物的施用途径包括吸入、口服、鼻腔、直肠、肠胃外、舌下、经皮、经粘膜(例如,舌下、舌、(经)颊、(经)尿道、阴道(例如,经阴道和阴道周围)、鼻(内)和(经)直肠)、膀胱内、肺内、十二指肠内、胃内、鞘内、皮下、肌肉内、皮内、动脉内、静脉内、支气管内、吸入和外用施用。
试剂盒
在一些实施方式中,提供了用于治疗、预防或改善给定疾病、紊乱或病症或其症状的试剂盒,如本文所描述,其中试剂盒包含:a)如本文所描述的化合物或组合物;和任选地b)如本文所描述的另外的药剂或疗法。试剂盒可以进一步包括使用试剂盒治疗、预防或改善疾病、紊乱或病症的说明书或标签。在又其他实施方式中,本发明延伸至对于如本文所描述的给定疾病、紊乱或病症或其症状的试剂盒测试。例如,这种试剂盒可以包含来自PCR或其他核酸杂交技术(微阵列)的试剂或用于基于免疫学的检测技术(例如,ELISpot、ELISA)的试剂。
实施例
现在参考以下实施例描述本发明。提供这些实施例仅出于说明的目的,并且本发明决不应被解释为限于这些实施例,而应解释为涵盖由于本文提供的教导而变得明显的任何和所有变化。
无需进一步描述,使用前述描述和以下说明性实施例,认为本领域普通技术人员可以制备和利用本发明的化合物并实践所要求保护的方法。因此,以下工作实施例具体指出了本发明的优选实施方式,并且不应解释为以任何方式限制本公开内容的其余部分。
现在在以下实施例中描述实验结果。
方法:
根据Los Alamos数据库,在中国HIV-1最流行的进化枝是A/E(29.2%)、以07_B/C(18.7%)为主的不同类型的B/C(30.1%)、B(23.1%)和C(14.7%)。进行了广泛的数据库搜索,并组装了一组包膜(env)序列以诱导抗体,这些抗体可能是为中国开发综合HIV-1疫苗的候选抗体。在这些搜索中,关注可获得全长序列的较新的中国分离株。优先选择如此的Env序列,其携带169位的K和172位的V,这对结合广泛中和V2特异性抗体和其ADCC活性是重要的。对于Gag,选择含有如此表位的进化枝B,该表位对筛选实验动物中CD8+T细胞应答是重要的。
实施例1:载体构建和初始免疫原性测试
第一代载体
在E1和部分E3缺失的载体内使用没有内含子和增强子的表达盒产生表达HIV进化枝B的gag和HIV进化枝B、AE、BC和C的gp140的AdC6和AdC7。滴定载体的病毒含量。载体显示具有基因完整性并在系列培养后是基因稳定的。仅AdC7gp140BC载体诱导gp140-特异性B细胞应答。(图1A)
第二代载体
使用相同的AdC主链(但对于AdC7gp140BC)和插入物构建第二组载体,但通过在表达盒中包括内含子和增强子来改变表达盒。在拯救后,滴定载体,并建立基因完整性。发现如下所示的这些载体是免疫原性的。
对gag载体进行蛋白质印迹。第一代gag载体无法表达可检测量的gag蛋白。第二代gag载体显示出良好的表达。由于缺乏特异性抗体的Env载体给出了模棱两可的结果。质谱法可用于确定独立于抗体的表达,如通过使用第二代载体之一所确定的。
实施例2:第二代gag载体的免疫原性
用1011vp的第二代gag载体免疫BALB/c小鼠组。2周后,通过在利用携带gag优势免疫表位的肽刺激后或如上所述的假刺激后进行细胞内细胞因子染色,测试它们汇集血液中的CD8+T细胞应答(图1B)。用任一种载体免疫的小鼠均显示阳性应答。
3天后测试来自单个小鼠的脾细胞,包括干扰素(IFN)-γ、肿瘤坏死因子(TNF)-α,白介素(IL)-2和粒酶B(GrmB)的染色(图2)。接种疫苗后,小鼠对多种细胞因子表现出阳性应答。使用109和1010vp的AdC6gag载体的较低剂量重复该实验,然后以这些剂量的载体再次诱导可检测到的CD8+T细胞应答,而对于腺病毒载体而言,典型的情况是更适度的CD4+T细胞应答(图3)。
实施例3:第一代gp140载体
向ICR小鼠注射1011vp的gp140表达载体。4周后,为它们放血,并在杆状病毒来源的gp140(进化枝C)上或BSA涂覆的板上通过ELISA测试血清,并与来自幼稚小鼠(阴性对照)或来自注射已经建立的gp140载体的小鼠(阳性对照)的血清进行比较。用AdC7BC免疫的小鼠发展可检测的抗体应答(图4)。一些但不是全部免疫的小鼠发展了gp140-特异性抗体,而用其他载体免疫的小鼠没有血清转化。
实施例4:第二代gp140载体
使用带有内含子和增强子的表达盒产生表达gp140(AdC6gp140AE、AdC6gp140B、AdC6gp140C、AdC6gp140BC、AdC7gp140AE、AdC7gp140B、AdC7gp140C)的载体。滴定后,以1011vp向ICR小鼠注射载体,连同第一代AdC7gp140BC载体。4周后,在涂覆来自早期非洲HIV-1进化枝C分离株、中国HIV-1进化枝AE分离株和中国HIV-1进化枝BC分离株(后两个匹配AdC插入物的序列)的杆状病毒来源的gp140蛋白质的板上通过ELISA测试它们的血清中对gp140的抗体。即使不是所有小鼠应答,但是所有载体都诱导对3env蛋白质的抗体应答(图4)。5周后,施用109vp/小鼠的载体剂量,用表达相同插入物的异源载体初免后,小鼠被加强。4周后,在进化枝C、AE和BC gp140蛋白质上测试个体血清。如图5所示,加强后,一些无应答者变为血清阳性,这对于加强初免(如,进化枝C蛋白质上AdC6BC或所有蛋白质上AdC7AE后)后很低的应答最为有效。在一些组中,加强相对无效(图8),这可能归因于用于初免的高载体剂量和用于进行加强的100倍低剂量。
用AdC6载体初免并用AdC7载体加强的小鼠再次用在明矾中以1:1稀释的来自AIDS试剂项目(蛋白质CN54)2μg/小鼠的重组进化枝C gp140蛋白质进行加强。如图6和9所示,在加强载体初免的抗体应答方面蛋白质是非常有效的,以致在该加强后5周,AE组中除了一只小鼠外都对来自中国分离株的两种gp140表现出强大的抗体应答。为了比较,用明矾中相同蛋白免疫幼稚小鼠;这些小鼠中一些发展了gp140-特异性抗体应答,但是在载体初免的小鼠中观察到远低于这些的效价(图9)。比较了3种不同进化枝的gp140上测量的抗体效价。如图10中所示,应答取决于在其上测试他们的蛋白质不同而不同。已经对一种进化枝的gp140具有高抗体效价的小鼠不一定对其他进化枝的gp140具有高效价。出于同样的原因,在涂覆来自3中不同进化枝的gp140的板上获得的数据显示相对差的关联性(图11)。
实施例5:初免-加强方案
用AdCgag载体进行了多种初免加强方案。在第一组实验中,用AdC6gag或AdC7gag以109或1010vp进行初免并在6周后用异源载体以相同剂量给予进行加强。在后续实验中,施用gag和env载体的混合物。在初免后6周给予加强。在两个实验中,在初免后获得对gag的CD8+T细胞应答,这在加强后反常地下降。在图12中显示了第一个实验的结果。这样的结果在先前用其它载体获得,指示了CD8+T细胞在初免后高度保持了活化并因此易于在加强后再次遇到它们的抗原后而凋亡。使用1010vp的Ad6gag进行初免和1010vp的AdC7进行加强来重复实验。在后续实验中,初免和加强之间存在2个月的间隔。结果更加有希望,因为观察到gag-特异性CD8+T细胞的频率增加(图13)。然而,频率依然远在用US起源进化枝B gag初免-加强后常规观察到的那些以下,指示使用这种特定插入物,很可能需要在初免和加强之间更长的等待时间。
在BALB/c小鼠中用载体混合物进行实验,从而评估抗体应答。已经在ICR小鼠中进行了全部其他抗体实验。当测试针对进化枝AE和BC的血清时,幼稚小鼠中背景应答是非常高的。针对进化枝C的背景应答不是很高,但仍然实质上使得其几乎无法评估是否确实已经实现应答(图14)。
在ICR小鼠中测试了表达gp140的载体的混合物。以1010vp/载体向小鼠注射AdC6gP140进化枝C、B、AE和BC载体的混合物或对应AdC7载体的混合物。2周和8周后为小鼠放血,然后用异源载体(即,AdC6gp140进化枝B、C、AE、BC)进行加强,免疫小鼠用对应AdC7进行加强,反之亦然。2周后为小鼠放血。如本文其他地方所描述,通过ELISA确定对进化枝C、BC和AE的gp140的抗体。虽然在一些小鼠中观察到抗体应答,但是不如用仅表达一个进化枝的gp140的载体免疫后强。此外,在加强免疫后没有看到增加。在图15中示出了结果。
序列:
Gp140进化枝AE1:登录号,JX112804.SEQ ID NO:1
MRVKGTQMNWPNLWKWGTLILGLVIMCSASDNLWVTVYYGVPVWRDANTTLFCASDAKAHETEVHNVWATYACVPTDPNPQEIPMENVTENFNMWKNNMVEQMQEDVISLWDQSLKPCVKLTPLCVTLICTNANLTKINSTNSGPKVIGNVTDEVRNCSFNMTTLLTDKKQKVYALFYKLDIVPIDNSNSSEYRLINCNTSVIKQACPKISFDPIPIHYCTPAGYAILKCNDKNFNGTGPCKNVSSVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTNNAKTIIVHLNKAVEINCTRPSNNTRTSIRIGPGQIFYRTGDIIGDIRQAYCEINGTKWNETLRQVAKKLKEQFNNTIKFQPPSGGDLEITMLHFNCRGEFFYCNTTKLFNSTWERNETIKGGNGNGNDTIILPCRIKQIINMWQGAGQAMYAPPISGIINCVSNITGILLTRDGGNTNETAEIFRPGGGNIKDNWRSELYKYKVVQIEPLGVAPTKAKLTVQARQLLSGIVQQQSNLLRAIEAQQHMLQLTVWGIKQLQARILAVESYLKHQQFLGLWGCSNKIICTTAVPWNSSWSNKSYDEIWENMTWIEWEREIGNYTNQIYDILTKSQEQQDKNEKELLELDQWASLWNWFSITKWLW*
Gp140进化枝B:登录号,HM215399.SEQ ID NO:2
MRVKGIRKNYQHLWRWGTMLLGMLMICSAAENLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNIWATHACVPTDPNPQEVVLGNVTENFNMWKNDMVEQMHEDIISLWDQSLKPCVKLTPLCVTLNCTNLRNTNNTSSNTSNMTEGGEIKNCSFDITTSIRTKVKDYALFYELDIVAIDNTSYRLRQCNTSVITQACPKISFEPIPIHYCTPAGFAILKCNNKTFNGTGPCTNVSTVQCTHRIRPVVSTQLLLNGSLAEEEVVIRSSNFTDNAKVIIVQLKESVEINCTRPNNNTRKSIPLGPGKAWYTTGQIIGDIRQAHCNLSRAKWENTLQQITKKLREQFGNKTIIFNQSSGGDPEVVTHSFNCGGEFFYCNTSQLFNSTWYNNSTWNDTNDTTENSTITLPCRIKQIVNMWQEVGKAMYAPPIRGQIRCSSNITGLLLTRDGGKNESNTTETFRPGGGDMRDNWRSELYKYKVVKIEPLGVAPTRAKLTVQARQLLSGIVQQQRNLLRAIEAQQHLLQLTVWGIKQLQARVLAVERYLKDQQLLGIWGCSGKLICTTAVPWNVSWSNRSLSEIWDNMTWMEWEREIGNYTKQIYSLIEESQNQQEKNELELLEWDKWASLWNWFNITNWLW*
Gp140进化枝C:登录号,KF835515.SEQ ID NO:3
MRVRGTQRNYPQWWIWGILGFWMLMICNVGGNLWVTVYYGVPVWKEATTTLFCASDAKAYENEVHNVWATHACVPTDPNPQEMVLENVTENFNMWKNEMVNQMHEDVISLWDQSLKPCVKLTPLCVTLKCSNVTLKNNTVNSNETQYRKNCTFNTTTELKNRKQKVSAIFYRIDIVPLGNESSGNYRLINCNTSAITQACPKVSFDPIPIHYCTPAGYALLKCNNKTFNGTGPCNNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTNNVKTIIVHLNESVEIVCIRPGNNTRQSIRIGPGQTFYAPGEIIGNIRQAHCNINGTKWNETLQGVGKKLAEHFPNKTIKFKPSSGGDPEITTHSFNCRGEFFYCDTSGLFNSTYNSTYVPNGTESKPNITIQCRIKQIINMWQEVGRAMYAPPIKGSITCKSNITGLLLVRDGGANTTEEIFRPGGGDMRDNWRSELYKYKVVEIKPLGIAPTEAKLTVQARQLLSGIVQQQNNLLKAIEAQQHMLQLTVWGIKQLQTRVLAIERYLKDQQLLGIWGCSGKLICTTAVPWNSSWSNKTQDEIWKNMTWMQWDREINNYTNTIYSLLEESQNQQEKNEKDLLALDSWKNLWNWFDISNWLW*
Gp140进化枝BC:登录号,KC492738.SEQ ID NO:4
MRVMGIRRNCQHLWRWGIMLLGMLMICSVVGNLWVTVYYGVPVWKEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEMVLENVTENFNMWKNEMVNQMQEDVISLWDQSLKPCVKLTPLCVTLKCKNVSSNSTETPKLRGNSSETYKDEEMKNCSFNATTILRDKKQEVYALFYKLDIAPLLLNSSENSSAYYSLINCNTSAITQACPKVSFDPIPIHYCTPAGYAILKCNDKKFNGTGPCSNVSTVQCTHGIKPVVSTQLLLNGSLAEGEVIIRSKNLTDNAKTIIVQLNRSVEIVCTRPNNNTRKSIRIGPGQTFYATGDIIGDIRQAHCNISEDMWNETLHWVSRKLAEHFPNRTINFTSSSGGDLEIATHSFNCRGEFFYCNTSRLFNGTYMFNGTRGNSSSNSTITIPCRIKQIINMWQQVGRAMYAPPIEGNLTCRSNITGLLLVRDGGDNTNKTEIFRPQGGDMRDNWRSELYKYKVVEIKPLGIAPTTAKLTVQARQLLSGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQTRVLAIERYLKDQQLLGIWGCSGKLICTTAVPWNSSWSNKTQDEIWNNLTWMQWDKEISNYTDTIYKLLEDSQNQQERNEKDLLALDSWKNLWSWFDITNWLW*
HIVgag进化枝B:登录号,JF932500.SEQ ID NO:5
MGARASVLSGGELDRWEKIRLRPGGKKKYRLKHVVWASRELERFAVNPGLLETSEGCRQILEQLQPSLQTGSEELRSLYNTIAVLYCVHQKIEIKDTKEALDKIEEEQNKSKKKAQQAAADTGNNSQVSQNYPIVRNLQGQMVHQPLSPRTLNAWVKVVEEKAFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAEWDRLHPPQAGPIAPGQIREPRGSDIAGTTSNLQEQIAWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSILDIKQGPKEPFRDYVDRFYKTLRAEQASQDVKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPSHKARILAEAMSQVTNSASVMMQRGNFRNQRKPVKCFNCGKEGHIAKNCRAPRKKGCWKCGKEGHQMKDCTERQANFLGKIWPSHKGRPGNFLQSRPEPTAPPEESFRFGEETTTPSQKQEQIDKELYPLASLKSLFGNDPSSQ*
1,C6 020CMV-HIVgp140 AE1.SEQ ID NO:6
catcatcaataatatacctcaaacttttggtgcgcgttaatatgcaaatgagctgtttgaatttggggagggaggaaggtgattggctgcgggagcggcgaccgttaggggcggggcgggtgacgttttgatgacgtggctatgaggcggagccggtttgcaagttctcgtgggaaaagtgacgtcaaacgaggtgtggtttgaacacggaaatactcaattttcccgcgctctctgacaggaaatgaggtgtttctgggcggatgcaagtgaaaacgggccattttcgcgcgaaaactgaatgaggaagtgaaaatctgagtaatttcgcgtttatggcagggaggagtatttgccgagggccgagtagactttgaccgattacgtgggggtttcgattaccgtatttttcacctaaatttccgcgtacggtgtcaaagtccggtgtttttacgtacgatatcatttccccgaaagtgccacctgaccgtaactataacggtcctaaggtagcgaaagctcagatctcccgatcccctatggtgcactctcagtacaatctgctctgatgccgcatagttaagccagtatctgctccctgcttgtgtgttggaggtcgctgagtagtgcgcgagcaaaatttaagctacaacaaggcaaggcttgaccgacaattgcatgaagaatctgcttagggttaggcgttttgcgctgcttcgcgatgtacgggccagatatacgcgttgacattgattattgactagttattaatagtaatcaattacggggtcattagttcatagcccatatatggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatagggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctcgtttagtgaaccgtcagatcactagaagctttattgcggtagtttatcacagttaaattgctaacgcagtcagtgcttctgacacaacagtctcgaacttaagctgcagaagttggtcgtgaggcactgggcaggtaagtatcaaggttacaagacaggtttaaggagaccaatagaaactgggcttgtcgagacagagaagactcttgcgtttctgataggcacctattggtcttactgacatccactttgcctttctctccacaggtgtccactcccagttcaattacagctcttaaaaggctagagtacttaatacgactcactataggctagcatgagagtgaaggggacacagatgaattggccaaacttgtggaaatgggggactttgatccttgggttggtgatcatgtgtagtgcctcagacaacttgtgggttacagtttattatggagttcctgtgtggagagatgcaaataccaccctattttgtgcatcagatgccaaagcacatgagacagaagtgcacaatgtctgggccacatatgcctgtgtacccacagatcccaacccacaagaaatacccatggaaaatgtgacagaaaattttaacatgtggaaaaataacatggtagagcaaatgcaggaggatgtaatcagtttatgggatcaaagtctaaagccatgtgtaaagttaactcctctctgcgttactttaatttgtaccaatgctaacttgaccaagatcaacagtaccaatagcgggcctaaagtaataggaaatgtaacagatgaagtaagaaactgttcttttaatatgaccacattactaacagataagaagcaaaaggtttatgcacttttttataagcttgatatagtaccaattgataatagtaatagtagtgagtatagattaataaattgtaatacttcagtcattaagcaggcttgtccaaagatatcctttgatccaattcctatacattattgtactccagctggttatgcgattttaaaatgtaatgataagaatttcaatgggacagggccatgtaaaaatgtcagctcagtacagtgcacacatggaattaagccagtggtctcaactcaattactgttaaatggcagtctagcagaagaagagataataatcagatctgaaaatctcacaaacaatgccaaaaccataatagtgcaccttaataaggctgtagaaatcaattgtaccagaccctccaacaatacaagaacaagtataagaataggaccaggacaaatattttatagaacaggagacataataggagatataagacaagcatattgtgaaattaatggaacaaaatggaatgaaactttaagacaggtagcaaaaaaattaaaagagcaatttaataacacaataaaattccagccaccctcaggaggagatctagaaattacaatgcttcattttaattgtagaggggaatttttctattgcaatacaacaaaactgttcaatagtacttgggaaagaaatgagaccataaaagggggtaatggcaatggcaatgacactatcatacttccatgcaggataaagcaaatcataaacatgtggcaaggagcaggacaagcaatgtatgctcctcccatcagtggaataattaactgtgtatcaaatattacaggaatactattgacaagagatggtggtaatactaatgaaactgccgagatcttcagacctggaggaggaaatataaaggacaattggagaagtgaattatataaatataaagtagtacaaattgaaccactaggagtagcacccaccaaggcaaagctgacggtacaggccagacaattattgtctggtatagtgcaacagcaaagcaatttgctgagggctatagaggcgcagcagcatatgttgcaactcacagtctggggcattaaacagctccaggcaagaatcctggctgtggaaagctacctaaagcatcaacagttcctaggactttggggctgctctaacaaaattatctgcaccactgctgtaccctggaattcctcttggagtaataaatcttatgatgagatttgggaaaatatgacatggatagaatgggagagagaaattggcaattacacaaaccaaatatatgatatacttacaaaatcgcaggaacagcaggacaaaaatgaaaaggaactgttggaattggatcaatgggcaagtctgtggaattggtttagcataacaaaatggctgtggtaatgtacaagtaaagcggccgccactgtgctggatgatccgagctcggtacctctagagtcgacccgggcggccaaaccgctgatcagcctcgactgtgccttctagttgccagccatctgttgtttgcccctcccccgtgccttccttgaccctggaaggtgccactcccactgtcctttcctaataaaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctggggggtggggtggggcaggacagcaagggggaggattgggaagacaatagcaggcatgctggggatgcggtgggctctatggcttctgaggcggaaagaaccagcagatctgcagatctgaattcatctatgtcgggtgcggagaaagaggtaatgaaatggcattatgggtattatgggtctgcattaatgaatcggtcagatatcgacatatgctggccaccgtgcatgtggcctcgcacccccgcaagacatggcccgagttcgagcacaacgtcatgacccgctgcaatgtgcacctgggctcccgccgaggcatgttcatgccctaccagtgcaacatgcaatttgtgaaggtgctgctggagcccgatgccatgtccagagtgagcctgacgggggtgtttgacatgaatgtggagctgtggaaaattctgagatatgatgaatccaagaccaggtgccgggcctgcgaatgcggaggcaagcacgccaggcttcagcccgtgtgtgtggaggtgacggaggacctgcgacccgatcatttggtgttgtcctgcaacgggacggagttcggctccagcggggaagaatctgactagagtgagtagtgtttggggctgggtgtgagcctgcatgaggggcagaatgactaaaatctgtggttttctgtgtgttgcagcagcatgagcggaagcgcctcctttgagggaggggtattcagcccttatctgacggggcgtctcccctcctgggcgggagtgcgtcagaatgtgatgggatccacggtggacggccggcccgtgcagcccgcgaactcttcaaccctgacctacgcgaccctgagctcctcgtccgtggacgcagctgccgccgcagctgctgcttccgccgccagcgccgtgcgcggaatggccctgggcgccggctactacagctctctggtggccaactcgagttccaccaataatcccgccagcctgaacgaggagaagctgctgctgctgatggcccagctcgaggccctgacccagcgcctgggcgagctgacccagcaggtggctcagctgcaggcggagacgcgggccgcggttgccacggtgaaaaccaaataaaaaatgaatcaataaataaacggagacggttgttgattttaacacagagtcttgaatctttatttgatttttcgcgcgcggtaggccctggaccaccggtctcgatcattgagcacccggtggatcttttccaggacccggtagaggtgggcttggatgttgaggtacatgggcatgagcccgtcccgggggtggaggtagctccattgcagggcctcgtgctcggggatggtgttgtaaatcacccagtcatagcaggggcgcagggcgtggtgctgcacgatgtccttgaggaggagactgatggccacgggcagccccttggtgtaggtgttgacgaacctgttgagctgggagggatgcatgcggggggagatgagatgcatcttggcctggatcttgagattggcgatgttcccgcccagatcccgccgggggttcatgttgtgcaggaccaccagcacggtgtatccggtgcacttggggaatttgtcatgcaacttggaagggaaggcgtgaaagaatttggagacgcccttgtgaccgcccaggttttccatgcactcatccatgatgatggcgatgggcccgtgggcggcggcctgggcaaagacgtttcgggggtcggacacatcgtagttgtggtcctgggtgagctcgtcataggccattttaatgaatttggggcggagggtgcccgactgggggacgaaggtgccctcgatcccgggggcgtagttgccctcgcagatctgcatctcccaggccttgagctcggagggggggatcatgtccacctgcggggcgatgaaaaaaacggtttccggggcgggggagatgagctgggccgaaagcaggttccggagcagctgggacttgccgcaaccggtggggccgtagatgaccccgatgaccggctgcaggtggtagttgagggagagacagctgccgtcctcgcggaggaggggggccacctcgttcatcatctcgcgcacatgcatgttctcgcgcacgagttccgccaggaggcgctcgccccccagcgagaggagctcttgcagcgaggcgaagtttttcagcggcttgagtccgtcggccatgggcattttggagagggtctgttgcaagagttccagacggtcccagagctcggtgatgtgctctagggcatctcgatccagcagacctcctcgtttcgcgggttggggcgactgcgggagtagggcaccaggcgatgggcgtccagcgaggccagggtccggtccttccagggccgcagggtccgcgtcagcgtggtctccgtcacggtgaaggggtgcgcgccgggctgggcgcttgcgagggtgcgcttcaggctcatccggctggtcgagaaccgctcccggtcggcgccctgcgcgtcggccaggtagcaattgagcatgagttcgtagttgagcgcctcggccgcgtggcccttggcgcggagcttacctttggaagtgtgtccgcagacgggacagaggagggacttgagggcgtagagcttgggggcgaggaagacggactcgggggcgtaggcgtccgcgccgcagctggcgcagacggtctcgcactccacgagccaggtgaggtcggggcggttggggtcaaaaacgaggtttcctccgtgctttttgatgcgtttcttacctctggtctccatgagctcgtgtccccgctgggtgacaaagaggctgtccgtgtccccgtagaccgactttatgggccggtcctcgagcggggtgccgcggtcctcgtcgtagaggaaccccgcccactccgagacgaaggcccgggtccaggccagcacgaaggaggccacgtgggaggggtagcggtcgttgtccaccagcgggtccaccttctccagggtatgcaagcacatgtccccctcgtccacatccaggaaggtgattggcttgtaagtgtaggccacgtgaccgggggtcccggccgggggggtataaaagggggcgggcccctgctcgtcctcactgtcttccggatcgctgtccaggagcgccagctgttggggtaggtattccctctcgaaggcgggcatgacctcggcactcaggttgtcagtttctagaaacgaggaggatttgatattgacggtgccgttggagacgcctttcatgagcccctcgtccatttggtcagaaaagacgatctttttgttgtcgagcttggtggcgaaggagccgtagagggcgttggagagcagcttggcgatggagcgcatggtctggttcttttccttgtcggcgcgctccttggcggcgatgttgagctgcacgtactcgcgcgccacgcacttccattcggggaagacggtggtgagctcgtcgggcacgattctgacccgccagccgcggttgtgcagggtgatgaggtccacgctggtggccacctcgccgcgcaggggctcgttggtccagcagaggcgcccgcccttgcgcgagcagaaggggggcagcgggtccagcatgagctcgtcgggggggtcggcgtccacggtgaagatgccgggcaggagctcggggtcgaagtagctgatgcaggtgcccagattgtccagcgccgcttgccagtcgcgcacggccagcgcgcgctcgtaggggctgaggggcgtgccccagggcatggggtgcgtgagcgcggaggcgtacatgccgcagatgtcgtagacgtagaggggctcctcgaggacgccgatgtaggtggggtagcagcgccccccgcggatgctggcgcgcacgtagtcgtacagctcgtgcgagggcgcgaggagccccgtgccgaggttggagcgttgcggcttttcggcgcggtagacgatctggcggaagatggcgtgggagttggaggagatggtgggcctttggaagatgttgaagtgggcgtggggcaggccgaccgagtccctgatgaagtgggcgtaggagtcctgcagcttggcgacgagctcggcggtgacgaggacgtccagggcgcagtagtcgagggtctcttggatgatgtcatacttgagctggcccttctgcttccacagctcgcggttgagaaggaactcttcgcggtccttccagtactcttcgagggggaacccgtcctgatcggcacggtaagagcccaccatgtagaactggttgacggccttgtaggcgcagcagcccttctccacggggagggcgtaagcttgcgcggccttgcgcagggaggtgtgggtgagggcgaaggtgtcgcgcaccatgaccttgaggaactggtgcttgaagtcgaggtcgtcgcagccgccctgctcccagagttggaagtccgtgcgcttcttgtaggcggggttaggcaaagcgaaagtaacatcgttgaagaggatcttgcccgcgcggggcatgaagttgcgagtgatgcggaaaggctggggcacctcggcccggttgttgatgacctgggcggcgaggacgatctcgtcgaagccgttgatgttgtgcccgacgatgtagagttccacgaatcgcgggcggcccttgacgtggggcagcttcttgagctcgtcgtaggtgagctcggcggggtcgctgagcccgtgctgctcgagggcccagtcggcgacgtgggggttggcgctgaggaaggaagtccagagatccacggccagggcggtctgcaagcggtcccggtactgacggaactgttggcccacggccattttttcgggggtgacgcagtagaaggtgcgggggtcgccgtgccagcggtcccacttgagctggagggcgaggtcgtgggcgagctcgacgagcggcgggtccccggagagtttcatgaccagcatgaaggggacgagctgcttgccgaaggaccccatccaggtgtaggtttccacatcgtaggtgaggaagagcctttcggtgcgaggatgcgagccgatggggaagaactggatctcctgccaccagttggaggaatggctgttgatgtgatggaagtagaaatgccgacggcgcgccgagcactcgtgcttgtgtttatacaagcgtccgcagtgctcgcaacgctgcacgggatgcacgtgctgcacgagctgtacctgggttcctttggcgaggaatttcagtgggcagtggagcgctggcggctgcatctcgtgctgtactacgtcttggccatcggcgtggccatcgtctgcctcgatggtggtcatgctgacgagcccgcgcgggaggcaggtccagacctcggctcggacgggtcggagagcgaggacgagggcgcgcaggccggagctgtccagggtcctgagacgctgcggagtcaggtcagtgggcagcggcggcgcgcggttgacttgcaggagcttttccagggcgcgcgggaggtccagatggtacttgatctccacggcgccgttggtggctacgtccacggcttgcagggtgccgtgcccctggggcgccaccaccgtgccccgtttcttcttgggcgctgcttccatgtcggtcagaagcggcggcgaggacgcgcgccgggcggcaggggcggctcggggcccggaggcaggggcggcaggggcacgtcggcgccgcgcgcgggcaggttctggtactgcgcccggagaagactggcgtgagcgacgacgcgacggttgacgtcctggatctgacgcctctgggtgaaggccacgggacccgtgagtttgaacctgaaagagagttcgacagaatcaatctcggtatcgttgacggcggcctgccgcaggatctcttgcacgtcgcccgagttgtcctggtaggcgatctcggtcatgaactgctcgatctcctcctcctgaaggtctccgcggccggcgcgctcgacggtggccgcgaggtcgttggagatgcggcccatgagctgcgagaaggcgttcatgccggcctcgttccagacgcggctgtagaccacggctccgtcggggtcgcgcgcgcgcatgaccacctgggcgaggttgagctcgacgtggcgcgtgaagaccgcgtagttgcagaggcgctggtagaggtagttgagcgtggtggcgatgtgctcggtgacgaagaagtacatgatccagcggcggagcggcatctcgctgacgtcgcccagggcttccaagcgttccatggcctcgtagaagtccacggcgaagttgaaaaactgggagttgcgcgccgagacggtcaactcctcctccagaagacggatgagctcggcgatggtggcgcgcacctcgcgctcgaaggccccggggggctcctcttccatctcctcctcttcctcctccactaacatctcttctacttcctcctcaggaggcggtggcgggggaggggccctgcgtcgccggcggcgcacgggcagacggtcgatgaagcgctcgatggtctccccgcgccggcgacgcatggtctcggtgacggcgcgcccgtcctcgcggggccgcagcatgaagacgccgccgcgcatctccaggtggccgccgggggggtctccgttgggcagggagagggcgctgacgatgcatcttatcaattgacccgtagggactccgcgcaaggacctgagcgtctcgagatccacgggatccgaaaaccgctgaacgaaggcttcgagccagtcgcagtcgcaaggtaggctgagcccggtttcttgttcttcgggtatttggtcgggaggcgggcgggcgatgctgctggtgatgaagttgaagtaggcggtcctgagacggcggatggtggcgaggagcaccaggtccttgggcccggcttgctggatgcgcagacggtcggccatgccccaggcgtggtcctgacacctggcgaggtccttgtagtagtcctgcatgagccgctccacgggcacctcctcctcgcccgcgcggccgtgcatgcgcgtgagcccgaacccgcgctgcggctggacgagcgccaggtcggcgacgacgcgctcggtgaggatggcctgctggatctgggtgagggtggtctggaagtcgtcgaagtcgacgaagcggtggtaggctccggtgttgatggtgtaggagcagttggccatgacggaccagttgacggtctggtggccgggtcgcacgagctcgtggtacttgaggcgcgagtaggcgcgcgtgtcgaagatgtagtcgttgcaggcgcgcacgaggtactggtatccgacgaggaagtgcggcggcggctggcggtagagcggccatcgctcggtggcgggggcgccgggcgcgaggtcctcgagcatgaggcggtggtagccgtagatgtacctggacatccaggtgatgccggcggcggtggtggaggcgcgcgggaactcgcggacgcggttccagatgttgcgcagcggcaggaagtagttcatggtggccgcggtctggcccgtgaggcgcgcgcagtcgtggatgctctagacatacgggcaaaaacgaaagcggtcagcggctcgactccgtggcctggaggctaagcgaacgggttgggctgcgcgtgtaccccggttcgaatctcgaatcaggctggagccgcagctaacgtggtactggcactcccgtctcgacccaagcctgctaacgaaacctccaggatacggaggcgggtcgttttttggccttggtcgctggtcatgaaaaactagtaagcgcggaaagcggccgcccgcgatggctcgctgccgtagtctggagaaagaatcgccagggttgcgttgcggtgtgccccggttcgagcctcagcgctcggcgccggccggattccgcggctaacgtgggcgtggctgccccgtcgtttccaagaccccttagccagccgacttctccagttacggagcgagcccctctttttttttcttgtgtttttgccagatgcatcccgtactgcggcagatgcgcccccaccctccaccacaaccgcccctaccgcagcagcagcaacagccggcgcttctgcccccgccccagcagcagccagccactaccgcggcggccgccgtgagcggagccggcgttcagtatgacctggccttggaagagggcgaggggctggcgcggctgggggcgtcgtcgccggagcggcacccgcgcgtgcagatgaaaagggacgctcgcgaggcctacgtgcccaagcagaacctgttcagagacaggagcggcgaggagcccgaggagatgcgcgcctcccgcttccacgcggggcgggagctgcggcgcggcctggaccgaaagcgggtgctgagggacgaggatttcgaggcggacgagctgacggggatcagccccgcgcgcgcgcacgtggccgcggccaacctggtcacggcgtacgagcagaccgtgaaggaggagagcaacttccaaaaatccttcaacaaccacgtgcgcacgctgatcgcgcgcgaggaggtgaccctgggcctgatgcacctgtgggacctgctggaggccatcgtgcagaaccccacgagcaagccgctgacggcgcagctgtttctggtggtgcagcacagtcgggacaacgagacgttcagggaggcgctgctgaatatcaccgagcccgagggccgctggctcctggacctggtgaacattttgcagagcatcgtggtgcaggagcgcgggctgccgctgtccgagaagctggcggccatcaacttctcggtgctgagtctgggcaagtactacgctaggaagatctacaagaccccgtacgtgcccatagacaaggaggtgaagatcgacgggttttacatgcgcatgaccctgaaagtgctgaccctgagcgacgatctgggggtgtaccgcaacgacaggatgcaccgcgcggtgagcgccagccgccggcgcgagctgagcgaccaggagctgatgcacagcctgcagcgggccctgaccggggccgggaccgagggggagagctactttgacatgggcgcggacctgcgctggcagcccagccgccgggccttggaagctgccggcggttccccctacgtggaggaggtggacgatgaggaggaggagggcgagtacctggaagactgatggcgcgaccgtatttttgctagatgcagcaacagccaccgccgccgcctcctgatcccgcgatgcgggcggcgctgcagagccagccgtccggcattaactcctcggacgattggacccaggccatgcaacgcatcatggcgctgacgacccgcaatcccgaagcctttagacagcagcctcaggccaaccggctctcggccatcctggaggccgtggtgccctcgcgctcgaaccccacgcacgagaaggtgctggccatcgtgaacgcgctggtggagaacaaggccatccgcggtgacgaggccgggctggtgtacaacgcgctgctggagcgcgtggcccgctacaacagcaccaacgtgcagacgaacctggaccgcatggtgaccgacgtgcgcgaggcggtgtcgcagcgcgagcggttccaccgcgagtcgaacctgggctccatggtggcgctgaacgccttcctgagcacgcagcccgccaacgtgccccggggccaggaggactacaccaacttcatcagcgcgctgcggctgatggtggccgaggtgccccagagcgaggtgtaccagtcggggccggactacttcttccagaccagtcgccagggcttgcagaccgtgaacctgagccaggctttcaagaacttgcagggactgtggggcgtgcaggccccggtcggggaccgcgcgacggtgtcgagcctgctgacgccgaactcgcgcctgctgctgctgctggtggcgcccttcacggacagcggcagcgtgagccgcgactcgtacctgggctacctgcttaacctgtaccgcgaggccatcggacaggcgcacgtggacgagcagacctaccaggagatcacccacgtgagccgcgcgctgggccaggaggacccgggcaacctggaggccaccctgaacttcctgctgaccaaccggtcgcagaagatcccgccccagtacgcgctgagcaccgaggaggagcgcatcctgcgctacgtgcagcagagcgtggggctgttcctgatgcaggagggggccacgcccagcgcggcgctcgacatgaccgcgcgcaacatggagcccagcatgtacgcccgcaaccgcccgttcatcaataagctgatggactacttgcatcgggcggccgccatgaactcggactactttaccaacgccatcttgaacccgcactggctcccgccgcccgggttctacacgggcgagtacgacatgcccgaccccaacgacgggttcctgtgggacgacgtggacagcagcgtgttctcgccgcgtccaggaaccaatgccgtgtggaagaaagagggcggggaccggcggccgtcctcggcgctgtccggtcgcgcgggtgctgccgcggcggtgcccgaggccgccagccccttcccgagcctgcccttttcgctgaacagcgtgcgcagcagcgagctgggtcggctgacgcgaccgcgcctgctgggcgaggaggagtacctgaacgactccttgttgaggcccgagcgcgagaagaacttccccaataacgggatagagagcctggtggacaagatgagccgctggaagacgtacgcgcacgagcacagggacgagccccgagctagcagcgcaggcacccgtagacgccagcggcacgacaggcagcggggactggtgtgggacgatgaggattccgccgacgacagcagcgtgttggacttgggtgggagtggtggtaacccgttcgctcacctgcgcccccgtatcgggcgcctgatgtaagaatctgaaaaaataaaagacggtactcaccaaggccatggcgaccagcgtgcgttcttctctgttgtttgtagtagtatgatgaggcgcgtgtacccggagggtcctcctccctcgtacgagagcgtgatgcagcaggcggtggcggcggcgatgcagcccccgctggaggcgccttacgtgcccccgcggtacctggcgcctacggaggggcggaacagcattcgttactcggagctggcacccttgtacgataccacccggttgtacctggtggacaacaagtcggcagacatcgcctcgctgaactaccagaacgaccacagcaacttcctgaccaccgtggtgcagaacaacgatttcacccccacggaggccagcacccagaccatcaactttgacgagcgctcgcggtggggcggccagctgaaaaccatcatgcacaccaacatgcccaacgtgaacgagttcatgtacagcaacaagttcaaggcgcgggtgatggtctcgcgcaagacccccaacggggtggatgatgattatgatggtagtcaggacgagctgacctacgagtgggtggagtttgagctgcccgagggcaacttctcggtgaccatgaccatcgatctgatgaacaacgccatcatcgacaactacttggcggtggggcggcagaacggggtgctggagagcgacatcggcgtgaagttcgacacgcgcaacttccggctgggctgggaccccgtgaccgagctggtgatgccgggcgtgtacaccaacgaggccttccaccccgacatcgtcctgctgcccggctgcggcgtggacttcaccgagagccgcctcagcaacctgctgggcatccgcaagcggcagcccttccaggagggcttccagatcctgtacgaggacctggaggggggcaacatccccgcgctcttggatgtcgaagcctacgagaaaagcaaggaggatagcaccgccgcggcgaccgcagccgtggccaccgcctctaccgaggtgcggggcgataattttgctagcgctgcggcagcggccgaggcggctgaaaccgaaagtaagatagtcatccagccggtggagaaggacagcaaggacaggagctacaacgtgctcgcggacaagaaaaacaccgcctaccgcagctggtacctggcctacaactacggcgaccccgagaagggcgtgcgctcctggacgctgctcaccacctcggacgtcacctgcggcgtggagcaagtctactggtcgctgcccgacatgatgcaagacccggtcaccttccgctccacgcgtcaagttagcaactacccggtggtgggcgccgagctcctgcccgtctactccaagagcttcttcaacgagcaggccgtctactcgcagcagctgcgcgccttcacctcgctcacgcacgtcttcaaccgcttccccgagaaccagatcctcgtccgcccgcccgcgcccaccattaccaccgtcagtgaaaacgttcctgctctcacagatcacgggaccctgccgctgcgcagcagtatccggggagtccagcgcgtgaccgtcactgacgccagacgccgcacctgcccctacgtctacaaggccctgggcgtagtcgcgccgcgcgtcctctcgagccgcaccttctaaaaaatgtccattctcatctcgcccagtaataacaccggttggggcctgcgcgcgcccagcaagatgtacggaggcgctcgccaacgctccacgcaacaccccgtgcgcgtgcgcgggcacttccgcgctccctggggcgccctcaagggccgcgtgcgctcgcgcaccaccgtcgacgacgtgatcgaccaggtggtggccgacgcgcgcaactacacgcccgccgccgcgcccgtctccaccgtggacgccgtcatcgacagcgtggtggccgacgcgcgccggtacgcccgcaccaagagccggcggcggcgcatcgcccggcggcaccggagcacccccgccatgcgcgcggcgcgagccttgctgcgcagggccaggcgcacgggacgcagggccatgctcagggcggccagacgcgcggcctccggcagcagcagcgccggcaggacccgcagacgcgcggccacggcggcggcggcggccatcgccagcatgtcccgcccgcggcgcggcaacgtgtactgggtgcgcgacgccgccaccggtgtgcgcgtgcccgtgcgcacccgcccccctcgcacttgaagatgctgacttcgcgatgttgatgtgtcccagcggcgaggaggatgtccaagcgcaaatacaaggaagagatgctccaggtcatcgcgcctgagatctacggccccgcggcggcggtgaaggaggaaagaaagccccgcaaactgaagcgggtcaaaaaggacaaaaaggaggaggaagatgacggactggtggagtttgtgcgcgagttcgccccccggcggcgcgtgcagtggcgcgggcggaaagtgaaaccggtgctgcggcccggcaccacggtggtcttcacgcccggcgagcgttccggctccgcctccaagcgctcctacgacgaggtgtacggggacgaggacatcctcgagcaggcggtcgagcgtctgggcgagtttgcgtacggcaagcgcagccgccccgcgcccttgaaagaggaggcggtgtccatcccgctggaccacggcaaccccacgccgagcctgaagccggtgaccctgcagcaggtgctaccgagcgcggcgccgcgccggggcttcaagcgcgagggcggcgaggatctgtacccgaccatgcagctgatggtgcccaagcgccagaagctggaggacgtgctggagcacatgaaggtggaccccgaggtgcagcccgaggtcaaggtgcggcccatcaagcaggtggccccgggcctgggcgtgcagaccgtggacatcaagatccccacggagcccatggaaacgcagaccgagcccgtgaagcccagcaccagcaccatggaggtgcagacggatccctggatgccagcaccagcttccaccagcactcgccgaagacgcaagtacggcgcggccagcctgctgatgcccaactacgcgctgcatccttccatcatccccacgccgggctaccgcggcacgcgcttctaccgcggctacaccagcagccgccgccgcaagaccaccacccgccgccgtcgtcgcagccgccgcagcagcaccgcgacttccgccttggtgcggagagtgtatcgcagcgggcgcgagcctctgaccctgccgcgcgcgcgctaccacccgagcatcgccatttaactaccgcctcctacttgcagatatggccctcacatgccgcctccgcgtccccattacgggctaccgaggaagaaagccgcgccgtagaaggctgacggggaacgggctgcgtcgccatcaccaccggcggcggcgcgccatcagcaagcggttggggggaggcttcctgcccgcgctgatccccatcatcgccgcggcgatcggggcgatccccggcatagcttccgtggcggtgcaggcctctcagcgccactgagacacaaaaaagcatggatttgtaataaaaaaaaaaatggactgacgctcctggtcctgtgatgtgtgtttttagatggaagacatcaatttttcgtccctggcaccgcgacacggcacgcggccgtttatgggcacctggagcgacatcggcaacagccaactgaacgggggcgccttcaattggagcagtctctggagcgggcttaagaatttcgggtccacgctcaaaacctatggcaacaaggcgtggaacagcagcacagggcaggcgctgagggaaaagctgaaagaacagaacttccagcagaaggtggttgatggcctggcctcaggcatcaacggggtggttgacctggccaaccaggccgtgcagaaacagatcaacagccgcctggacgcggtcccgcccgcggggtccgtggagatgccccaggtggaggaggagctgcctcccctggacaagcgcggcgacaagcgaccgcgtcccgacgcggaggagacgctgctgacgcacacggacgagccgcccccgtacgaggaggcggtgaaactgggcctgcccaccacgcggcccgtggcgcctctggccaccggagtgctgaaacccagcagcagccagcccgcgaccctggacttgcctccgcctcgcccctccacagtggctaagcccctgccgccggtggccgtcgcgtcgcgcgccccccgaggccgcccccaggcgaactggcagagcactctgaacagcatcgtgggtctgggagtgcagagtgtgaagcgccgccgctgctattaaaagacactgtagcgcttaacttgcttgtctgtgtgtatatgtatgtccgccgaccagaaggaggagtgtgaagaggcgcgtcgccgagttgcaagatggccaccccatcgatgctgccccagtgggcgtacatgcacatcgccggacaggacgcttcggagtacctgagtccgggtctggtgcagttcgcccgcgccacagacacctacttcagtctggggaacaagtttaggaaccccacggtggcgcccacgcacgatgtgaccaccgaccgcagccagcggctgacgctgcgcttcgtgcccgtggaccgcgaggacaacacctactcgtacaaagtgcgctacacgctggccgtgggcgacaaccgcgtgctggacatggccagcacctactttgacatccgcggcgtgctggaccggggccctagcttcaaaccctactctggcaccgcctacaacagcctagctcccaagggagctcccaattccagccagtgggagcaagcaaaaacaggcaatgggggaactatggaaacacacacatatggtgtggccccaatgggcggagagaatattacaaaagatggtcttcaaattggaactgacgttacagcgaatcagaataaaccaatttatgccgacaaaacatttcaaccagaaccgcaagtaggagaagaaaattggcaagaaactgaaaacttttatggcggtagagctcttaaaaaagacacaaacatgaaaccttgctatggctcctatgctagacccaccaatgaaaaaggaggtcaagctaaacttaaagttggagatgatggagttccaaccaaagaattcgacatagacctggctttctttgatactcccggtggcaccgtgaacggtcaagacgagtataaagcagacattgtcatgtataccgaaaacacgtatttggaaactccagacacgcatgtggtatacaaaccaggcaaggatgatgcaagttctgaaattaacctggttcagcagtctatgcccaacagacccaactacattgggttcagggacaactttatcggtcttatgtactacaacagcactggcaatatgggtgtgcttgctggtcaggcctcccagctgaatgctgtggttgatttgcaagacagaaacaccgagctgtcctaccagctcttgcttgactctttgggtgacagaacccggtatttcagtatgtggaaccaggcggtggacagttatgaccccgatgtgcgcatcatcgaaaaccatggtgtggaggatgaattgccaaactattgcttccccttggacggctctggcactaacgccgcataccaaggtgtgaaagtaaaagatggtcaagatggtgatgttgagagtgaatgggaaaatgacgatactgttgcagctcgaaatcaattatgtaaaggtaacattttcgccatggagattaatctccaggctaacctgtggagaagtttcctctactcgaacgtggccctgtacctgcccgactcctacaagtacacgccgaccaacgtcacgctgccgaccaacaccaacacctacgattacatgaatggcagagtgacacctccctcgctggtagacgcctacctcaacatcggggcgcgctggtcgctggaccccatggacaacgtcaaccccttcaaccaccaccgcaacgcgggcctgcgctaccgctccatgctcctgggcaacgggcgctacgtgcccttccacatccaggtgccccaaaagtttttcgccatcaagagcctcctgctcctgcccgggtcctacacctacgagtggaacttccgcaaggacgtcaacatgatcctgcagagctccctaggcaacgacctgcgcacggacggggcctccatcgccttcaccagcatcaacctctacgccaccttcttccccatggcgcacaacaccgcctccacgctcgaggccatgctgcgcaacgacaccaacgaccagtccttcaacgactacctctcggcggccaacatgctctaccccatcccggccaacgccaccaacgtgcccatctccatcccctcgcgcaactgggccgccttccgcggatggtccttcacgcgcctgaagacccgcgagacgccctcgctcggctccgggttcgacccctacttcgtctactcgggctccatcccctacctagacggcaccttctacctcaaccacaccttcaagaaggtctccatcaccttcgactcctccgtcagctggcccggcaacgaccgcctcctgacgcccaacgagttcgaaatcaagcgcaccgtcgacggagagggatacaacgtggcccagtgcaacatgaccaaggactggttcctggtccagatgctggcccactacaacatcggctaccagggcttctacgtgcccgagggctacaaggaccgcatgtactccttcttccgcaacttccagcccatgagccgccaggtcgtggacgaggtcaactacaaggactaccaggccgtcaccctggcctaccagcacaacaactcgggcttcgtcggctacctcgcgcccaccatgcgccagggccagccctaccccgccaactacccctacccgctcatcggcaagagcgccgtcgccagcgtcacccagaaaaagttcctctgcgaccgggtcatgtggcgcatccccttctccagcaacttcatgtccatgggcgcgctcaccgacctcggccagaacatgctctacgccaactccgcccacgcgctagacatgaatttcgaagtcgaccccatggatgagtccacccttctctatgttgtcttcgaagtcttcgacgtcgtccgagtgcaccagccccaccgcggcgtcatcgaagccgtctacctgcgcacgcccttctcggccggcaacgccaccacctaagccgctcttgcttcttgcaagatgacggcgggctccggcgagcaggagctcagggccatcctccgcgacctgggctgcgggccctgcttcctgggcaccttcgacaagcgcttccctggattcatggccccgcacaagctggcctgcgccatcgtgaacacggccggccgcgagaccgggggcgagcactggctggccttcgcctggaacccgcgctcccacacatgctacctcttcgaccccttcgggttctcggacgagcgcctcaagcagatctaccagttcgagtacgagggcctgctgcgtcgcagcgccctggccaccgaggaccgctgcgtcaccctggaaaagtccacccagaccgtgcagggtccgcgctcggccgcctgcgggctcttctgctgcatgttcctgcacgccttcgtgcactggcccgaccgccccatggacaagaaccccaccatgaacttactgacgggggtgcccaacggcatgctccagtcgccccaggtggaacccaccctgcgccgcaaccaggaagcgctctaccgcttcctcaatgcccactccgcctactttcgctcccaccgcgcgcgcatcgagaaggccaccgccttcgaccgcatgaatcaagacatgtaaaaaaccggtgtgtgtatgtgaatgctttattcataataaacagcacatgtttatgccaccttctctgaggctctgactttatttagaaatcgaaggggttctgccggctctcggcatggcccgcgggcagggatacgttgcggaactggtacttgggcagccacttgaactcggggatcagcagcttgggcacggggaggtcggggaacgagtcgctccacagcttgcgcgtgagttgcagggcgcccagcaggtcgggcgcggagatcttgaaatcgcagttgggacccgcgttctgcgcgcgagagttgcggtacacggggttgcagcactggaacaccatcagggccgggtgcttcacgcttgccagcaccgtcgcgtcggtgatgccctccacgtccagatcctcggcgttggccatcccgaagggggtcatcttgcaggtctgccgccccatgctgggcacgcagccgggcttgtggttgcaatcgcagtgcagggggatcagcatcatctgggcctgctcggagctcatgcccgggtacatggccttcatgaaagcctccagctggcggaaggcctgctgcgccttgccgccctcggtgaagaagaccccgcaggacttgctagagaactggttggtggcgcagccggcgtcgtgcacgcagcagcgcgcgtcgttgttggccagctgcaccacgctgcgcccccagcggttctgggtgatcttggcccggttggggttctccttcagcgcgcgctgcccgttctcgctcgccacatccatctcgatagtgtgctccttctggatcatcacggtcccgtgcaggcaccgcagcttgccctcggcttcggtgcagccgtgcagccacagcgcgcagccggtgcactcccagttcttgtgggcgatctgggagtgcgagtgcacgaagccctgcaggaagcggcccatcatcgcggtcagggtcttgttgctggtgaaggtcagcgggatgccgcggtgctcctcgttcacatacaggtggcagatgcggcggtacacctcgccctgctcgggcatcagctggaaggcggacttcaggtcgctctccacgcggtaccggtccatcagcagcgtcatcacttccatgcccttctcccaggccgaaacgatcggcaggctcagggggttcttcaccgccattgtcatcttagtcgccgccgccgaggtcagggggtcgttctcgtccagggtctcaaacactcgcttgccgtccttctcgatgatgcgcacggggggaaagctgaagcccacggccgccagctcctcctcggcctgcctttcgtcctcgctgtcctggctgatgtcttgcaaaggcacatgcttggtcttgcggggtttctttttgggcggcagaggcggcggcgatgtgctgggagagcgcgagttctcgttcaccacgactatttcttcttcttggccgtcgtccgagaccacgcggcggtaggcatgcctcttctggggcagaggcggaggcgacgggctctcgcggttcggcgggcggctggcagagccccttccgcgttcgggggtgcgctcctggcggcgctgctctgactgacttcctccgcggccggccattgtgttctcctagggagcaacaacaagcatggagactcagccatcgtcgccaacatcgccatctgcccccgccgccaccgccgacgagaaccagcagcagaatgaaagcttaaccgccccgccgcccagccccacctccgacgccgcggccccagacatgcaagagatggaggaatccatcgagattgacctgggctacgtgacgcccgcggagcacgaggaggagctggcagcgcgcttttcagccccggaagagaaccaccaagagcagccagagcaggaagcagagaacgagcagaaccaggctgggcacgagcatggcgactacctgagcggggcagaggacgtgctcatcaagcatctggcccgccaatgcatcatcgtcaaggacgcgctgctcgaccgcgccgaggtgcccctcagcgtggcggagctcagccgcgcctacgagcgcaacctcttctcgccgcgcgtgccccccaagcgccagcccaacggcacctgtgagcccaacccgcgcctcaacttctacccggtcttcgcggtgcccgaggccctggccacctaccacctctttttcaagaaccaaaggatccccgtctcctgccgcgccaaccgcacccgcgccgacgccctgctcaacctgggccccggcgcccgcctacctgatatcacctccttggaagaggttcccaagatcttcgagggtctgggcagcgacgagactcgggccgcgaacgctctgcaaggaagcggagaggagcatgagcaccacagcgccctggtggagttggaaggcgacaacgcgcgcctggcggtcctcaagcgcacggtcgagctgacccacttcgcctacccggcgctcaacctgccccccaaggtcatgagcgccgtcatggaccaggtgctcatcaagcgcgcctcgcccctctcggaggaggagatgcaggaccccgagagttcggacgagggcaagcccgtggtcagcgacgagcagctggcgcgctggctgggagcgagtagcaccccccagagcctggaagagcggcgcaagctcatgatggccgtggtcctggtgaccgtggagctggagtgtctgcgccgcttctttgccgacgcggagaccctgcgcaaggtcgaggagaacctgcactacctcttcaggcacgggttcgtgcgccaggcctgcaagatctccaacgtggagctgaccaacctggtctcctacatgggcatcctgcacgagaaccgcctggggcaaaacgtgctgcacaccaccctgcgcggggaggcccgccgcgactacatccgcgactgcgtctacctgtacctctgccacacctggcagacgggcatgggcgtgtggcagcagtgcctggaggagcagaacctgaaagagctctgcaagctcctgcagaagaacctcaaggccctgtggaccgggttcgacgagcgtaccaccgcctcggacctggccgacctcatcttccccgagcgcctgcggctgacgctgcgcaacgggctgcccgactttatgagccaaagcatgttgcaaaactttcgctctttcatcctcgaacgctccgggatcctgcccgccacctgctccgcgctgccctcggacttcgtgccgctgaccttccgcgagtgccccccgccgctctggagccactgctacttgctgcgcctggccaactacctggcctaccactcggacgtgatcgaggacgtcagcggcgagggtctgctggagtgccactgccgctgcaacctctgcacgccgcaccgctccctggcctgcaacccccagctgctgagcgagacccagatcatcggcaccttcgagttgcaaggccccggcgacggcgagggcaaggggggtctgaaactcaccccggggctgtggacctcggcctacttgcgcaagttcgtgcccgaggactaccatcccttcgagatcaggttctacgaggaccaatcccagccgcccaaggccgagctgtcggcctgcgtcatcacccagggggccatcctggcccaattgcaagccatccagaaatcccgccaagaatttctgctgaaaaagggccacggggtctacttggacccccagaccggagaggagctcaaccccagcttcccccaggatgccccgaggaagcagcaagaagctgaaagtggagctgccgccgccggaggatttggaggaagactgggagagcagtcaggcagaggaggaggagatggaagactgggacagcactcaggcagaggaggacagcctgcaagacagtctggaggaggaagacgaggtggaggaggcagaggaagaagcagccgccgccagaccgtcgtcctcggcggagaaagcaagcagcacggataccatctccgctccgggtcggggtcgcggcggccgggcccacagtaggtgggacgagaccgggcgcttcccgaaccccaccacccagaccggtaagaaggagcggcagggatacaagtcctggcgggggcacaaaaacgccatcgtctcctgcttgcaagcctgcgggggcaacatctccttcacccggcgctacctgctcttccaccgcggggtgaacttcccccgcaacatcttgcattactaccgtcacctccacagcccctactactgtttccaagaagaggcagaaacccagcagcagcagaaaaccagcggcagcagcagctagaaaatccacagcggcggcaggtggactgaggatcgcggcgaacgagccggcgcagacccgggagctgaggaaccggatctttcccaccctctatgccatcttccagcagagtcgggggcaggagcaggaactgaaagtcaagaaccgttctctgcgctcgctcacccgcagttgtctgtatcacaagagcgaagaccaacttcagcgcactctcgaggacgccgaggctctcttcaacaagtactgcgcgctcactcttaaagagtagcccgcgcccgcccacacacggaaaaaggcgggaattacgtcaccacctgcgcccttcgcccgaccatcatgagcaaagagattcccacgccttacatgtggagctaccagccccagatgggcctggccgccggcgccgcccaggactactccacccgcatgaactggctcagtgccgggcccgcgatgatctcacgggtgaatgacatccgcgcccaccgaaaccagatactcctagaacagtcagcgatcaccgccacgccccgccatcaccttaatccgcgtaattggcccgccgccctggtgtaccaggaaattccccagcccacgaccgtactacttccgcgagacgcccaggccgaagtccagctgactaactcaggtgtccagctggccggcggcgccgccctgtgtcgtcaccgccccgctcagggtataaagcggctggtgatccgaggcagaggcacacagctcaacgacgaggtggtgagctcttcgctgggtctgcgacctgacggagtcttccaactcgccggatcggggagatcttccttcacgcctcgtcaggccgtcctgactttggagagttcgtcctcgcagccccgctcgggcggcatcggcactctccagttcgtggaggagttcactccctcggtctacttcaaccccttctccggctcccccggccactacccggacgagttcatcccgaacttcgacgccatcagcgagtcggtggacggctacgattgaatgtcccatggtggcgcagctgacctagctcggcttcgacacctggaccactgccgccgcttccgctgcttcgctcgggatctcgccgagtttgcctactttgagctgcccgaggagcaccctcagggcccagcccacggagtgcggatcatcgtcgaagggggcctcgactcccacctgcttcggatcttcagccagcgaccgatcctggtcgagcgcgaacaaggacagacccttcttactttgtactgcatctgcaaccaccccggcctgcatgaaagtctttgttgtctgctgtgtactgagtataataaaagctgagatcagcgactactccggactcgattgtggtgttcctgctatcaaccggtccctgttcttcaccgggaacgagaccgagctccagctccagtgtaagccccacaagaagtacctcacctggctgttccagggctccccgatcgccgttgtcaaccactgcgacaacgacggagtcctgctgagcggccctgccaaccttactttttccacccgcagaagcaagctccagctcttccaacccttcctccccgggacctatcagtgcgtctcaggaccctgccatcacaccttccacctgatcccgaataccacagcgccgctccccgctactaacaaccaaactacccaccaacgccaccgtcgcgacctttcctctgaatctaataccactaccggaggtggcttctgctgttagtgctcccccgtcccgtcgacccccggtcccccactcagtcccccgaggaggttcgcaaatgcaaattccaagaaccctggaaattcctcaaatgctaccgccaaaaatcagacatgcatcccagctggatcatgatcattgggatcgtgaacattctggcctgcaccctcatctcctttgtgatttacccctgctttgactttggttggaactcgccagaggcgctctatctcccgcctgaacctgacacaccaccacagcagcaacctcaggcacacgcactaccaccaccacagcctaggccacaatacatgcccatattagactatgaggccgagccacagcgacccatgctccccgctattagttacttcaatctaaccggcggagatgactgacccactggccaataacaacgtcaacgaccttctcctggacatggacggccgcgcctcggagcagcgactcgcccaacttcgcattcgtcagcagcaggagagagccgtcaaggagctgcaggacggcatagccatccaccagtgcaagagaggcatcttctgcctggtgaaacaggccaagatctcctacgaggtcacccagaccgaccatcgcctctcctacgagctcctgcagcagcgccagaagttcacctgcctggtcggagtcaaccccatcgtcatcacccagcagtcgggcgataccaaggggtgcatccactgctcctgcgactcccccgactgcgtccacactctgatcaagaccctctgcggcctccgcgacctcctccccatgaactaatcacccccttatccagtgaaataaagatcatattgatgatgatttaaataaaaaaaataatcatttgatttgaaataaagatacaatcatattgatgatttgagtttaacaaaaataaagaatcacttacttgaaatctgataccaggtctctgtccatgttttctgccaacaccacctcactcccctcttcccagctctggtactgcaggccccggcgggctgcaaacttcctccacacgctgaaggggatgtcaaattcctcctgtccctcaatcttcattttatcttctatcagatgtccaaaaagcgcgtccgggtggatgatgacttcgaccccgtctacccctacgatgcagacaacgcaccgaccgtgcccttcatcaacccccccttcgtctcttcagatggattccaagagaagcccctgggggtgttgtccctgcgactggctgaccccgtcaccaccaagaacggggaaatcaccctcaagctgggagagggggtggacctcgactcgtcgggaaaactcatctccaacacggccaccaaggccgccgcccctctcagtatttcaaacaacaccatttcccttaaaactgctgcccctttctacaacaacaatggaactttaagcctcaatgtctccacaccattagcagtatttcccacatttaacactttaggcataagtcttggaaacggtcttcagacttcaaataagttgttgactgtacaactaactcatcctcttacattcagctcaaatagcatcacagtaaaaacagacaaagggctatatattaactccagtggaaacagaggacttgaggctaatataagcctaaaaagaggactagtttttgacggtaatgctattgcaacatatattggaaatggcttagactatggatcttatgatagtgatggaaaaacaagacccgtaattaccaaaattggagcaggattaaattttgatgctaacaaagcaatagctgtcaaactaggcacaggtttaagttttgactccgctggtgccttgacagctggaaacaaacaggatgacaagctaacactttggactacccctgacccaagccctaattgtcaattactttcagacagagatgccaaatttactctctgtcttacaaaatgcggtagtcaaatactaggcactgtggcagtggcggctgttactgtaggatcagcactaaatccaattaatgacacagtcaaaagcgccatagttttccttagatttgattccgatggtgtactcatgtcaaactcatcaatggtaggtgattactggaactttagggagggacagaccactcaaagtgtagcctatacaaatgctgtgggattcatgccaaatataggtgcatatccaaaaacccaaagtaaaacacctaaaaatagcatagtcagtcaggtatatttaactggagaaactactatgccaatgacactaaccataactttcaatggcactgatgaaaaagacacaaccccagttagcacctactctatgacttttacatggcagtggactggagactataaggacaaaaatattacctttgctaccaactcattctctttttcctacatcgcccaggaataatcccacccagcaagccaaccccttttcccaccacctttgtctatatggaaactctgaaacagaaaaataaagttcaagtgttttattgaatcaacagttttacaggactcgagcagttatttttcctccaccctcccaggacatggaatacaccaccctctccccccgcacagccttgaacatctgaatgccattggtgatggacatgcttttggtctccacgttccacacagtttcagagcgagccagtctcggatcggtcagggagatgaaaccctccgggcactcccgcatctgcacctcacagctcaacagctgaggattgtcctcggtggtcgggatcacggttatctggaagaagcagaagagcggcggtgggaatcatagtccgcgaacgggatcggccggtggtgtcgcatcaggccccgcagcagtcgctgccgccgccgctccgtcaagctgctgctcagggggttcgggtccagggactccctcagcatgatgcccacggccctcagcatcagtcgtctggtgcggcgggcgcagcagcgcatgcgaatctcgctcaggtcactgcagtacgtgcaacacaggaccaccaggttgttcaacagtccatagttcaacacgctccagccgaaactcatcgcgggaaggatgctacccacgtggccgtcgtaccagatcctcaggtaaatcaagtggcgctccctccagaagacgctgcccatgtacatgatctccttgggcatgtggcggttcaccacctcccggtaccacatcaccctctggttgaacatgcagccccggatgatcctgcggaaccacagggccagcaccgccccgcccgccatgcagcgaagagaccccggatcccggcaatgacaatggaggacccaccgctcgtacccgtggatcatctgggagctgaacaagtctatgttggcacagcacaggcatatgctcatgcatctcttcagcactctcagctcctcgggggtcaaaaccatatcccagggcacggggaactcttgcaggacagcgaaccccgcagaacagggcaatcctcgcacataacttacattgtgcatggacagggtatcgcaatcaggcagcaccgggtgatcctccaccagagaagcgcgggtctcggtctcctcacagcgtggtaagggggccggccgatacgggtgatggcgggacgcggctgatcgtgttctcgaccgtgtcatgatgcagttgctttcggacattttcgtacttgctgtagcagaacctggtccgggcgctgcacaccgatcgccggcggcggtctcggcgcttggaacgctcggtgttaaagttgtaaaacagccactctctcagaccgtgcagcagatctagggcctcaggagtgatgaagatcccatcatgcctgatagctctgatcacatcgaccaccgtggaatgggccaggcccagccagatgatgcaattttgttgggtttcggtgacggcgggggagggaagaacaggaagaaccatgattaacttttaatccaaacggtctcggagcacttcaaaatgaaggtcacggagatggcacctctcgcccccgctgtgttggtggaaaataacagccaggtcaaaggtgatacggttctcgagatgttccacggtggcttccagcaaagcctccacgcgcacatccagaaacaagacaatagcgaaagcgggagggttctctaattcctcaaccatcatgttacactcctgcaccatccccagataattttcatttttccagccttgaatgattcgaactagttcctgaggtaaatccaagccagccatgataaaaagctcgcgcagagcaccctccaccggcattcttaagcacaccctcataattccaagatattctgctcctggttcacctgcagcagattgacaagcggaatatcaaaatctctgccgcgatccctgagctcctccctcagcaataactgtaagtactctttcatatcgtctccgaaatttttagccataggacccccaggaataagagaagggcaagccacattacagataaaccgaagtcccccccagtgagcattgccaaatgtaagattgaaataagcatgctggctagacccggtgatatcttccagataactggacagaaaatcgggtaagcaatttttaagaaaatcaacaaaagaaaaatcttccaggtgcacgtttagggcctcgggaacaacgatggagtaagtgcaaggggtgcgttccagcatggttagttagctgatctgtaaaaaaacaaaaaataaaacattaaaccatgctagcctggcgaacaggtgggtaaatcgttctctccagcaccaggcaggccacggggtctccggcgcgaccctcgtaaaaattgtcgctatgattgaaaaccatcacagagagacgttcccggtggccggcgtgaatgattcgagaagaagcatacacccccggaacattggagtccgtgagtgaaaaaaagcggccgaggaagcaatgaggcactacaacgctcactctcaagtccagcaaagcgatgccatgcggatgaagcacaaaattttcaggtgcgtaaaaaatgtaattactcccctcctgcacaggcagcgaagctcccgatccctccagatacacatacaaagcctcagcgtccatagcttaccgagcggcagcagcagcggcacacaacaggcgcaagagtcagagaaaagactgagctctaacctgtccgcccgctctctgctcaatatatagccccagatctacactgacgtaaaggccaaagtctaaaaatacccgccaaataatcacacacgcccagcacacgcccagaaaccggtgacacactcagaaaaatacgcgcacttcctcaaacggccaaactgccgtcatttccgggttcccacgctacgtcatcaaaacacgactttcaaattccgtcgaccgttaaaaacatcacccgccccgcccctaacggtcgccgctcccgcagccaatcaccttcctccctccccaaattcaaacagctcatttgcatattaacgcgcaccaaaagtttgaggtatattattgatgatg
2,C7 010 CMV-HIVgp140 AE1.SEQ ID NO:7
catcatcaataatatacctcaaacttttggtgcgcgttaatatgcaaatgagctgtttgaatttggggagggaggaaggtgattggccgagagacgggcgaccgttaggggcggggcgggtgacgttttgatgacgtggccgtgaggcggagccggtttgcaagttctcgtgggaaaagtgacgtcaaacgaggtgtggtttgaacacggaaatactcaattttcccgcgctctctgacaggaaatgaggtgtttctgggcggatgcaagtgaaaacgggccattttcgcgcgaaaactgaatgaggaagtgaaaatctgagtaatttcgcgtttatggcagggaggagtatttgccgagggccgagtagactttgaccgattacgtgggggtttcgattaccgtatttttcacctaaatttccgcgtacggtgtcaaagtccggtgtttttacgtacgatatcatttccccgaaagtgccacctgaccgtaactataacggtcctaaggtagcgaaagctcagatctcccgatcccctatggtgcactctcagtacaatctgctctgatgccgcatagttaagccagtatctgctccctgcttgtgtgttggaggtcgctgagtagtgcgcgagcaaaatttaagctacaacaaggcaaggcttgaccgacaattgcatgaagaatctgcttagggttaggcgttttgcgctgcttcgcgatgtacgggccagatatacgcgttgacattgattattgactagttattaatagtaatcaattacggggtcattagttcatagcccatatatggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatagggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctcgtttagtgaaccgtcagatcactagaagctttattgcggtagtttatcacagttaaattgctaacgcagtcagtgcttctgacacaacagtctcgaacttaagctgcagaagttggtcgtgaggcactgggcaggtaagtatcaaggttacaagacaggtttaaggagaccaatagaaactgggcttgtcgagacagagaagactcttgcgtttctgataggcacctattggtcttactgacatccactttgcctttctctccacaggtgtccactcccagttcaattacagctcttaaaaggctagagtacttaatacgactcactataggctagcatgagagtgaaggggacacagatgaattggccaaacttgtggaaatgggggactttgatccttgggttggtgatcatgtgtagtgcctcagacaacttgtgggttacagtttattatggagttcctgtgtggagagatgcaaataccaccctattttgtgcatcagatgccaaagcacatgagacagaagtgcacaatgtctgggccacatatgcctgtgtacccacagatcccaacccacaagaaatacccatggaaaatgtgacagaaaattttaacatgtggaaaaataacatggtagagcaaatgcaggaggatgtaatcagtttatgggatcaaagtctaaagccatgtgtaaagttaactcctctctgcgttactttaatttgtaccaatgctaacttgaccaagatcaacagtaccaatagcgggcctaaagtaataggaaatgtaacagatgaagtaagaaactgttcttttaatatgaccacattactaacagataagaagcaaaaggtttatgcacttttttataagcttgatatagtaccaattgataatagtaatagtagtgagtatagattaataaattgtaatacttcagtcattaagcaggcttgtccaaagatatcctttgatccaattcctatacattattgtactccagctggttatgcgattttaaaatgtaatgataagaatttcaatgggacagggccatgtaaaaatgtcagctcagtacagtgcacacatggaattaagccagtggtctcaactcaattactgttaaatggcagtctagcagaagaagagataataatcagatctgaaaatctcacaaacaatgccaaaaccataatagtgcaccttaataaggctgtagaaatcaattgtaccagaccctccaacaatacaagaacaagtataagaataggaccaggacaaatattttatagaacaggagacataataggagatataagacaagcatattgtgaaattaatggaacaaaatggaatgaaactttaagacaggtagcaaaaaaattaaaagagcaatttaataacacaataaaattccagccaccctcaggaggagatctagaaattacaatgcttcattttaattgtagaggggaatttttctattgcaatacaacaaaactgttcaatagtacttgggaaagaaatgagaccataaaagggggtaatggcaatggcaatgacactatcatacttccatgcaggataaagcaaatcataaacatgtggcaaggagcaggacaagcaatgtatgctcctcccatcagtggaataattaactgtgtatcaaatattacaggaatactattgacaagagatggtggtaatactaatgaaactgccgagatcttcagacctggaggaggaaatataaaggacaattggagaagtgaattatataaatataaagtagtacaaattgaaccactaggagtagcacccaccaaggcaaagctgacggtacaggccagacaattattgtctggtatagtgcaacagcaaagcaatttgctgagggctatagaggcgcagcagcatatgttgcaactcacagtctggggcattaaacagctccaggcaagaatcctggctgtggaaagctacctaaagcatcaacagttcctaggactttggggctgctctaacaaaattatctgcaccactgctgtaccctggaattcctcttggagtaataaatcttatgatgagatttgggaaaatatgacatggatagaatgggagagagaaattggcaattacacaaaccaaatatatgatatacttacaaaatcgcaggaacagcaggacaaaaatgaaaaggaactgttggaattggatcaatgggcaagtctgtggaattggtttagcataacaaaatggctgtggtaatgtacaagtaaagcggccgccactgtgctggatgatccgagctcggtacctctagagtcgacccgggcggccaaaccgctgatcagcctcgactgtgccttctagttgccagccatctgttgtttgcccctcccccgtgccttccttgaccctggaaggtgccactcccactgtcctttcctaataaaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctggggggtggggtggggcaggacagcaagggggaggattgggaagacaatagcaggcatgctggggatgcggtgggctctatggcttctgaggcggaaagaaccagcagatctgcagatctgaattcatctatgtcgggtgcggagaaagaggtaatgaaatggcattatgggtattatgggtctgcattaatgaatcggccagatatcgatatgctggccaccgtgcatgtgacctcgcacccccgcaagacatggcccgagttcgagcacaacgtcatgacccgatgcaatgtgcacctggggtcccgccgaggcatgttcatgccctaccagtgcaacatgcaatttgtgaaggtgctgctggagcccgatgccatgtccagagtgagcctgacgggggtgtttgacatgaatgtggagctgtggaaaattctgagatatgatgaatccaagaccaggtgccgggcctgcgaatgcggaggcaagcacgccaggcttcagcccgtgtgtgtggaggtgacggaggacctgcgacccgatcatttggtgttgtcctgcaacgggacggagttcggctccagcggggaagaatctgactagagtgagtagtgtttgggggaggtggagggcttgtatgaggggcagaatgactaaaatctgtgtttttctgtgtgttgcagcagcatgagcggaagcgcctcctttgagggaggggtattcagcccttatctgacggggcgtctcccctcctgggcgggagtgcgtcagaatgtgatgggatccacggtggacggccggcccgtgcagcccgcgaactcttcaaccctgacctacgcgaccctgagctcctcgtccgtggacgcagctgccgccgcagctgctgcttccgccgccagcgccgtgcgcggaatggccctgggcgccggctactacagctctctggtggccaactcgacttccaccaataatcccgccagcctgaacgaggagaagctgctgctgctgatggcccagctcgaggccctgacccagcgcctgggcgagctgacccagcaggtggctcagctgcaggcggagacgcgggccgcggttgccacggtgaaaaccaaataaaaaatgaatcaataaataaacggagacggttgttgattttaacacagagtcttgaatctttatttgatttttcgcgcgcggtaggccctggaccaccggtctcgatcattgagcacccggtggattttttccaggacccggtagaggtgggcttggatgttgaggtacatgggcatgagcccgtcccgggggtggaggtagctccattgcagggcctcgtgctcgggggtggtgttgtaaatcacccagtcatagcaggggcgcagggcgtggtgctgcacgatgtccttgaggaggagactgatggccacgggcagccccttggtgtaggtgttgacgaacctgttgagctgggagggatgcatgcggggggagatgagatgcatcttggcctggatcttgagattggcgatgttcccgcccagatcccgccgggggttcatgttgtgcaggaccaccagcacggtgtatccggcgcacttggggaatttgtcatgcaacttggaagggaaggcgtgaaagaatttggagacgcccttgtgaccgcccaggttttccatgcactcatccatgatgatggcgatgggcccgtgggcggcggcctgggcaaagacgtttcgggggtcggacacatcgtagttgtggtcctgggtgagctcgtcataggccattttaatgaatttggggcggagggtgcccgactgggggacgaaggtgccctcgatcccgggggcgtagttgccctcgcagatctgcatctcccaggccttgagctcggagggggggatcatgtccacctgcggggcgatgaaaaaaacggtttccggggcgggggagatgagctgggccgaaagcaggttccggagcagctgggacttgccgcagccggtggggccgtagatgaccccgatgaccggctgcaggtggtagttgagggagagacagctgccgtcctcgcggaggaggggggccacctcgttcatcatctcgcgcacatgcatgttctcgcgcacgagttccgccaggaggcgctcgccccccagcgagaggagctcttgcagcgaggcgaagtttttcagcggcttgagyccgtcggccatgggcattttggagagggtctgttgcaagagttccagacggtcccagagctcggtgatgtgctctagggcatctcgatccagcagacctcctcgtttcgcgggttggggcgactgcgggagtagggcaccaggcgatgggcgtccagcgaggccagggtccggtccttccagggtcgcagggtccgcgtcagcgtggtctccgtcacggtgaaggggtgcgcgccgggctgggcgcttgcgagggtgcgcttcaggctcatccggctggtcgagaaccgctcccggtcggcgccctgcgcgtcggccaggtagcaattgagcatgagttcgtagttgagcgcctcggccgcgtggcccttggcgcggagcttacctttggaagtgtgtccgcagacgggacagaggagggacttgagggcgtagagcttgggggcgaggaagacggactcgggggcgtaggcgtccgcgccgcagctggcgcagacggtctcgcactccacgagccaggtgaggtcgggccggttggggtcaaaaacgaggtttcctccgtgctttttgatgcgtttcttacctctggtctccatgagctcgtgtccccgctgggtgacaaagaggctgtccgtgtccccgtagaccgactttatgggccggtcctcgagcggggtgccgcggtcctcgtcgtagaggaaccccgcccactccgagacgaaggcccgggtccaggccagcacgaaggaggccacgtgggaggggtagcggtcgttgtccaccagcgggtccaccttctccagggtatgcaagcacatgtccccctcgtccacatccaggaaggtgattggcttgtaagtgtaggccacgtgaccgggggtcccggccgggggggtataaaagggggcgggcccctgctcgtcctcactgtcttccggatcgctgtccaggagcgccagctgttggggtaggtattccctctcgaaggctggcataacctcggcactcaggttgtcagtttctagaaacgaggaggatttgatattgacggtgccgttggagacgcctttcatgagcccctcgtccatctggtcagaaaagacgatctttttgttgtcgagcttggtggcgaaggagccgtagagggcgttggagaggagcttggcgatggagcgcatggtctggttcttttccttgtcggcgcgctccttggcggcgatgttgagctgcacgtactcgcgcgccacgcacttccattcggggaagacggtggtgagctcgtcgggcacgattctgacccgccagccgcggttgtgcagggtgatgaggtccacgctggtggccacctcgccgcgcaggggctcgttggtccagcagaggcgcccgcccttgcgcgagcagaaggggggcagcgggtccagcatgagctcgtcgggggggtcggcgtccacggtgaagatgccgggcagaagctcggggtcgaagtagctgatgcaggtgtccagatcgtccagcgccgcttgccagtcgcgcacggccagcgcgcgctcgtaggggctgaggggcgtgccccagggcatggggtgcgtgagcgcggaggcgtacatgccgcagatgtcgtagacgtagaggggctcctcgaggacgccgatgtaggtggggtagcagcgccccccgcggatgctggcgcgcacgtagtcgtacagctcgtgcgagggcgcgaggagccccgtgccgaggttggagcgttgcggcttttcggcgcggtagacgatctggcggaagatggcgtgggagttggaggagatggtgggcctctggaagatgttgaagtgggcgtggggcaggccgaccgagtccctgatgaagtgggcgtaggagtcctgcagcttggcgacgagctcggcggtgacgaggacgtccagggcgcagtagtcgagggtctcttggatgatgtcgtacttgagctggcccttctgcttccacagctcgcggttgagaaggaactcttcgcggtccttccagtactcttcgagggggaacccgtcctgatcggcacggtaagagcccaccatgtagaactggttgacggccttgtaggcgcagcagcccttctccacggggagggcgtaagcttgtgcggccttgcgcagggaggtgtgggtgagggcgaaggtgtcgcgcaccatgaccttgaggaactggtgcttgaagtcgaggtcgtcgcagccgccctgctcccagagctggaagtccgtgcgcttcttgtaggcggggttgggcaaagcgaaagtaacatcgttgaagaggatcttgcccgcgcggggcatgaagttgcgagtgatgcggaaaggctggggcacctcggcccggttgttgatgacctgggcggcgaggacgatctcgtcgaagccgttgatgttgtgcccgacgatgtagagttccacgaatcgcgggcggcccttaacgtggggcagcttcttgagctcgtcgtaggtgagctcggcggggtcgctgagcccgtgctgctcgagggcccagtcggcgacgtgggggttggcgctgaggaaggaagtccagagatccacggccagggcggtctgcaagcggtcccggtactgacggaactgctggcccacggccattttttcgggggtgacgcagtagaaggtgcgggggtcgccgtgccagcggtcccacttgagctggagggcgaggtcgtgggcgagctcgacgagcggcgggtccccggagagtttcatgaccagcatgaaggggacgagctgcttgccgaaggaccccatccaggtgtaggtttccacatcgtaggtgaggaagagcctttcggtgcgaggatgcgagccgatggggaagaactggatctcctgccaccagttggaggaatggctgttgatgtgatggaagtagaaatgccgacggcgcgccgagcactcgtgcttgtgtttatacaagcgtccgcagtgctcgcaacgctgcacgggatgcacgtgctgcacgagctgtacctgggttcctttgacgaggaatttcagtgggcagtggagcgctggcggctgcatctggtgctgtactacgtcctggccatcggcgtggccatcgtctgcctcgatggtggtcatgctgacgagcccgcgcgggaggcaggtccagacttcggctcggacgggtcggagagcgaggacgagggcgcgcaggccggagctgtccagggtcctgagacgctgcggagtcaggtcagtgggcagcggcggcgcgcggttgacttgcaggagcttttccagggcgcgcgggaggtccagatggtacttgatctccacggcgccgttggtggcgacgtccacggcttgcagggtcccgtgcccctggggcgccaccaccgtgccccgtttcttcttgggcgctgcttccatgccggtcagaagcggcggcgaggacgcgcgccgggcggcaggggcggctcgggacccggaggcaggggcggcaggggcacgtcggcgccgcgcgcgggcaggttctggtactgcgcccggagaagactggcgtgagcgacgacgcgacggttgacgtcctggatctgacgcctctgggtgaaggccacgggacccgtgagtttgaacctgaaagagagttcgacagaatcaatctcggtatcgttgacggcggcctgccgcaggatctcttgcacgtcgcccgagttgtcctggtaggcgatctcggtcatgaactgctcgatctcctcctcctgaaggtctccgcggccggcgcgctcgacggtggccgcgaggtcgttggagatgcggcccatgagctgcgagaaggcgttcatgccggcctcgttccagacgcggctgtagaccacggctccgtcggggtcgcgcgcgcgcatgaccacctgggcgaggttgagctcgacgtggcgcgtgaagaccgcgtagttgcagaggcgctggtagaggtagttgagcgtggtggcgatgtgctcggtgacgaagaagtacatgatccagcggcggagcggcatctcgctgacgtcgcccagggcttccaagcgctccatggcctcgtagaagtccacggcgaagttgaaaaactgggagttgcgcgccgagacggtcaactcctcctccagaagacggatgagctcagcgatggtggcgcgcacctcgcgctcgaaggccccggggggctcctcttcttccatctcttcctcctccactaacatctcttctacttcctcctcaggaggcggcggcgggggaggggccctgcgtcgccggcggcgcacgggcagacggtcgatgaagcgctcgatggtctccccgcgccggcgacgcatggtctcggtgacggcgcgcccgtcctcgcggggccgcagcgtgaagacgccgccgcgcatctccaggtggccgccgggggggtctccgttgggcagggagagggcgctgacgatgcatcttatcaattggcccgtagggactccgcgcaaggacctgagcgtctcgagatccacgggatccgaaaaccgctgaacgaaggcttcgagccagtcgcagtcgcaaggtaggctgagcccggtttcttgttcttcggggatttcgggaggcgggcgggcgatgctgctggtgatgaagttgaagtaggcggtcctgagacggcggatggtggcgaggagcaccaggtccttgggcccggcttgctggatgcgcagacggtcggccatgccccaggcgtggtcctgacacctggcgaggtccttgtagtagtcctgcatgagccgctccacgggcacctcctcctcgcccgcgcggccgtgcatgcgcgtgagcccgaacccgcgctggggctggacgagcgccaggtcggcgacgacgcgctcggcgaggatggcctgctgtatctgggtgagggtggtctggaagtcgtcgaagtcgacgaagcggtggtaggctccggtgttgatggtataggagcagttggccatgacggaccagttgacggtctggtggccgggtcgcacgagctcgtggtacttgaggcgcgagtaggcgcgcgtgtcgaagatgtagtcgttgcaggtgcgcacgaggtactggtatccgacgaggaagtgcggcggcggctggcggtagagcggccatcgctcggtggcgggggcgccgggcgcgaggtcctcgagcatgaggcggtggtagccgtagatgtacctggacatccaggtgatgccggcggcggtggtggaggcgcgcgggaactcgcggacgcggttccagatgttgcgcagcggcaggaagtagttcatggtggccgcggtctggcccgtgaggcgcgcgcagtcgtggatgctctagacatacgggcaaaaacgaaagcggtcagcggctcgactccgtggcctggaggctaagcgaacgggttgggctgcgcgtgtaccccggttcgaatctcgaatcaggctggagccgcagctaacgtggtactggcactcccgtctcgacccaagcctgctaacgaaacctccaggatacggaggcgggtcgttttttggccttggtcgctggtcatgaaaaactagtaagcgcggaaagcgaccgcccgcgatggctcgctgccgtagtctggagaaagaatcgccagggttgcgttgcggtgtgccccggttcgagcctcagcgctcggcgccggccggattccgcggctaacgtgggcgtggctgccccgtcgtttccaagaccccttagccagccgacttctccagttacggagcgagcccctctttttcttgtgtttttgccagatgcatcccgtactgcggcagatgcgcccccaccctccacctcaaccgcccctaccgccgcagcagcagcaacagccggcgcttctgcccccgccccagcagcagccagccactaccgcggcggccgccgtgagcggagccggcgttcagtatgacctggccttggaagagggcgaggggctggcgcggctgggggcgtcgtcgccggagcggcacccgcgcgtgcagatgaaaagggacgctcgcgaggcctacgtgcccaagcagaacctgttcagagacaggagcggcgaggagcccgaggagatgcgcgcctcccgcttccacgcggggcgggagctgcggcgcggcctggaccgaaagcgggtgctgagggacgaggatttcgaggcggacgagctgacggggatcagccccgcgcgcgcgcacgtggccgcggccaacctggtcacggcgtacgagcagaccgtgaaggaggagagcaacttccaaaaatccttcaacaaccacgtgcgcacgctgatcgcgcgcgaggaggtgaccctgggcctgatgcacctgtgggacctgctggaggccatcgtgcagaaccccacgagcaagccgctgacggcgcagctgtttctggtggtgcagcacagtcgggacaacgagacgttcagggaggcgctgctgaatatcaccgagcccgagggccgctggctcctggacctggtgaacattctgcagagcatcgtggtgcaggagcgcgggctgccgctgtccgagaagctggcggctatcaacttctcggtgctgagcctgggcaagtactacgctaggaagatctacaagaccccgtacgtgcccatagacaaggaggtgaagatcgacgggttttacatgcgcatgaccctgaaagtgctgaccctgagcgacgatctgggggtgtaccgcaacgacaggatgcaccgcgcggtgagcgccagccgccggcgcgagctgagcgaccaggagctgatgcacagcctgcagcgggccctgaccggggccgggaccgagggggagagctactttgacatgggcgcggacctgcgctggcagcccagccgccgggccttggaagctgccggcggttccccctacgtggaggaggtggacgatgaggaggaggagggcgagtacctggaagactgatggcgcgaccgtatttttgctagatgcagcaacagccaccgcctcctgatcccgcgatgcgggcggcgctgcagagccagccgtccggcattaactcctcggacgattggacccaggccatgcaacgcatcatggcgctgacgacccgcaatcccgaagcctttagacagcagcctcaggccaaccggctctcggccatcctggaggccgtggtgccctcgcgctcgaaccccacgcacgagaaggtgctggccatcgtgaacgcgctggtggagaacaaggccatccgcggcgacgaggccgggctggtgtacaacgcgctgctggagcgcgtggcccgctacaacagcaccaacgtgcagacgaacctggaccgcatggtgaccgacgtgcgcgaggcggtgtcgcagcgcgagcggttccaccgcgagtcgaacctgggctccatggtggcgctgaacgccttcctgagcacgcagcccgccaacgtgccccggggccaggaggactacaccaacttcatcagcgcgctgcggctgatggtggccgaggtgccccagagcgaggtgtaccagtcggggccggactacttcttccagaccagtcgccagggcttgcagaccgtgaacctgagccaggctttcaagaacttgcagggactgtggggcgtgcaggccccggtcggggaccgcgcgacggtgtcgagcctgctgacgccgaactcgcgcctgctgctgctgctggtggcgcccttcacggacagcggcagcgtgagccgcgactcgtacctgggctacctgcttaacctgtaccgcgaggccatcgggcaggcgcacgtggacgagcagacctaccaggagatcacccacgtgagccgcgcgctgggccaggaggacccgggcaacctggaggccaccctgaacttcctgctgaccaaccggtcgcagaagatcccgccccagtacgcgctgagcaccgaggaggagcgcatcctgcgctacgtgcagcagagcgtggggctgttcctgatgcaggagggggccacgcccagcgccgcgctcgacatgaccgcgcgcaacatggagcccagcatgtacgctcgcaaccgcccgttcatcaataagctgatggactacttgcatcgggcggccgccatgaactcggactactttaccaacgccatcttgaacccgcactggctcccgccgcccgggttctacacgggcgagtacgacatgcccgaccccaacgacgggttcctgtgggacgacgtggacagcagcgtgttctcgccgcgccccgccaccaccgtgtggaagaaagagggcggggaccggcggccgtcctcggcgctgtccggtcgcgcgggtgctgccgcggcggtgcctgaggccgccagccccttcccgagcctgcccttttcgctgaacagcgtgcgcagcagcgagctgggtcggctgacgcggccgcgcctgctgggcgaggaggagtacctgaacgactccttgttgaggcccgagcgcgagaagaacttccccaataacgggatagagagcctggtggacaagatgagccgctggaagacgtacgcgcacgagcacagggacgagccccgagctagcagcagcgcaggcacccgtagacgccagcgacacgacaggcagcggggtctggtgtgggacgatgaggattccgccgacgacagcagcgtgttggacttgggtgggagtggtggtggtaacccgttcgctcacttgcgcccccgtatcgggcgcctgatgtaagaatctgaaaaaataaaaaacggtactcaccaaggccatggcgaccagcgtgcgttcttctctgttgtttgtagtagtatgatgaggcgcgtgtacccggagggtcctcctccctcgtacgagagcgtgatgcagcaggcggtggcggcggcgatgcagcccccgctggaggcgccttacgtgcccccgcggtacctggcgcctacggaggggcggaacagcattcgttactcggagctggcacccttgtacgataccacccggttgtacctggtggacaacaagtcggcggacatcgcctcgctgaactaccagaacgaccacagcaacttcctgaccaccgtggtgcagaacaacgatttcacccccacggaggccagcacccagaccatcaactttgacgagcgctcgcggtggggcggccagctgaaaaccatcatgcacaccaacatgcccaacgtgaacgagttcatgtacagcaacaagttcaaggcgcgggtgatggtctcgcgcaagacccccaatggggtcgcggtggatgagaattatgatggtagtcaggacgagctgacttacgagtgggtggagtttgagctgcccgagggcaacttctcggtgaccatgaccatcgatctgatgaacaacgccatcatcgacaactacttggcggtggggcgtcagaacggggtgctggagagcgacatcggcgtgaagttcgacacgcgcaacttccggctgggctgggaccccgtgaccgagctggtgatgccgggcgtgtacaccaacgaggccttccaccccgacatcgtcctgctgcccggctgcggcgtggacttcaccgagagccgcctcagcaacctgctgggcatccgcaagcggcagcccttccaggagggcttccagatcctgtacgaggacctggaggggggcaacatccccgcgctcttggatgtcgaagcctatgagaaaagcaaggaggaggccgccgcagcggcgaccgcagccgtggccaccgcctctaccgaggtgcggggcgataattttgctagcgccgcggcagtggccgaggcggctgaaaccgaaagtaagatagtcatccagccggtggagaaggacagcaaggacaggagctacaacgtgctcgcggacaagaaaaacaccgcctaccgcagctggtacctggcctacaactacggcgaccccgagaagggcgtgcgctcctggacgctgctcaccacctcggacgtcacctgcggcgtggagcaagtctactggtcgctgcccgacatgatgcaagacccggtcaccttccgctccacgcgtcaagttagcaactacccggtggtgggcgccgagctcctgcccgtctactccaagagcttcttcaacgagcaggccgtctactcgcagcagctgcgcgccttcacctcgctcacgcacgtcttcaaccgcttccccgagaaccagatcctcgtccgcccgcccgcgcccaccattaccaccgtcagtgaaaacgttcctgctctcacagatcacgggaccctgccgctgcgcagcagtatccggggagtccagcgcgtgaccgtcactgacgccagacgccgcacctgcccctacgtctacaaggccctgggcgtagtcgcgccgcgcgtcctctcgagccgcaccttctaaaaaatgtccattctcatctcgcccagtaataacaccggttggggcctgcgcgcgcccagcaagatgtacggaggcgctcgccaacgctccacgcaacaccccgtgcgcgtgcgcgggcacttccgcgctccctggggcgccctcaagggccgcgtgcgctcgcgcaccaccgtcgacgacgtgatcgaccaggtggtggccgacgcgcgcaactacacgcccgccgccgcgcccgcctccaccgtggacgccgtcatcgacagcgtggtggccgatgcgcgccggtacgcccgcgccaagagccggcggcggcgcatcgcccggcggcaccggagcacccccgccatgcgcgcggcgcgagccttgctgcgcagggccaggcgcacgggacgcagggccatgctcagggcggccagacgcgcggcctccggcagcagcagcgccggcaggacccgcagacgcgcggccacggcggcggcggcggccatcgccagcatgtcccgcccgcggcgcggcaacgtgtactgggtgcgcgacgccgccaccggtgtgcgcgtgcccgtgcgcacccgcccccctcgcacttgaagatgctgacttcgcgatgttgatgtgtcccagcggcgaggaggatgtccaagcgcaaatacaaggaagagatgctccaggtcatcgcgcctgagatctacggccccgcggtgaaggaggaaagaaagccccgcaaactgaagcgggtcaaaaaggacaaaaaggaggaggaagatgtggacggactggtggagtttgtgcgcgagttcgccccccggcggcgcgtgcagtggcgcgggcggaaagtgaaaccggtgctgcggcccggcaccacggtggtcttcacgcccggcgagcgttccggctccgcctccaagcgctcctacgacgaggtgtacggggacgaggacatcctcgagcaggcggtcgagcgtctgggcgagtttgcttacggcaagcgcagccgccccgcgcccttgaaagaggaggcggtgtccatcccgctggaccacggcaaccccacgccgagcctgaagccggtgaccctgcagcaggtgctgccgagcgcggcgccgcgccggggcttcaagcgcgagggcggcgaggatctgtacccgaccatgcagctgatggtgcccaagcgccagaagctggaggacgtgctggagcacatgaaggtggaccccgaggtgcagcccgaggtcaaggtgcggcccatcaagcaggtggccccgggcctgggcgtgcagaccgtggacatcaagatccccacggagcccatggaaacgcagaccgagcccgtgaagcccagcaccagcaccatggaggtgcagacggatccctggatgccggcgccggcttccaccactcgccgaagacgcaagtacggcgcggccagcctgctgatgcccaactacgcgctgcatccttccatcatccccacgccgggctaccgcggcacgcgcttctaccgcggctacaccagcagccgccgcaagaccaccacccgccgccgccgtcgtcgcacccgccgcagcagcaccgcgacttccgccgccgccctggtgcggagagtgtaccgcagcgggcgcgagcctctgaccctgccgcgcgcgcgctaccacccgagcatcgccatttaactctgccgtcgcctcctacttgcagatatggccctcacatgccgcctccgcgtccccattacgggctaccgaggaagaaagccgcgccgtagaaggctgacggggaacgggctgcgtcgccatcaccaccggcggcggcgcgccatcagcaagcggttggggggaggcttcctgcccgcgctgatccccatcatcgccgcggcgatcggggcgatccccggcatagcttccgtggcggtgcaggcctctcagcgccactgagacacagcttggaaaatttgtaataaaaaaatggactgacgctcctggtcctgtgatgtgtgtttttagatggaagacatcaatttttcgtccctggcaccgcgacacggcacgcggccgtttatgggcacctggagcgacatcggcaacagccaactgaacgggggcgccttcaattggagcagtctctggagcgggcttaagaatttcgggtccacgctcaaaacctatggcaacaaggcgtggaacagcagcacagggcaggcgctgagggaaaagctgaaagagcagaacttccagcagaaggtggtcgatggcctggcctcgggcatcaacggggtggtggacctggccaaccaggccgtgcagaaacagatcaacagccgcctggacgcggtcccgcccgcggggtccgtggagatgccccaggtggaggaggagctgcctcccctggacaagcgcggcgacaagcgaccgcgtcccgacgcggaggagacgctgctgacgcacacggacgagccgcccccgtacgaggaggcggtgaaactgggtctgcccaccacgcggcccgtggcgcctctggccaccggggtgctgaaacccagcagcagcagccagcccgcgaccctggacttgcctccgcctgcttcccgcccctccacagtggctaagcccctgccgccggtggccgtcgcgtcgcgcgccccccgaggccgcccccaggcgaactggcagagcactctgaacagcatcgtgggtctgggagtgcagagtgtgaagcgccgccgctgctattaaaagacactgtagcgcttaacttgcttgtctgtgtgtatatgtatgtccgccgaccagaaggaggaagaggcgcgtcgccgagttgcaagatggccaccccatcgatgctgccccagtgggcgtacatgcacatcgccggacaggacgcttcggagtacctgagtccgggtctggtgcagttcgcccgcgccacagacacctacttcagtctggggaacaagtttaggaaccccacggtggcgcccacgcacgatgtgaccaccgaccgcagccagcggctgacgctgcgcttcgtgcccgtggaccgcgaggacaacacctactcgtacaaagtgcgctacacgctggccgtgggcgacaaccgcgtgctggacatggccagcacctactttgacatccgcggcgtgctggatcgggggcccagcttcaaaccctactccggcaccgcctacaacagcctggctcccaagggagcgcccaacacttgccagtggacatataaagctggtgatactgatacagaaaaaacctatacatatggaaatgcacctgtgcaaggcattagcattacaaaggatggtattcaacttggaactgacagcgatggtcaggcaatctatgcagacgaaacttatcaaccagagcctcaagtgggtgatgctgaatggcatgacatcactggtactgatgaaaaatatggaggcagagctcttaagcctgacaccaaaatgaagccttgctatggttcttttgccaagcctaccaataaagaaggaggccaggcaaatgtgaaaaccgaaacaggcggtaccaaagaatatgacattgacatggcattcttcgataatcgaagtgcagctgccgccggcctagccccagaaattgttttgtatactgagaatgtggatctggaaactccagatacccatattgtatacaaggcaggtacagatgacagtagctcttctatcaatttgggtcagcagtccatgcccaacagacccaactacattggcttcagagacaactttatcggtctgatgtactacaacagcactggcaatatgggtgtactggctggacaggcctcccagctgaatgctgtggtggacttgcaggacagaaacaccgaactgtcctaccagctcttgcttgactctctgggtgacagaaccaggtatttcagtatgtggaatcaggcggtggacagttatgaccccgatgtgcgcattattgaaaatcacggtgtggaggatgaacttcctaactattgcttccccctggatgctgtgggtagaactgatacttaccagggaattaaggccaatggtgataatcaaaccacctggaccaaagatgatactgttaatgatgctaatgaattgggcaagggcaatcctttcgccatggagatcaacatccaggccaacctgtggcggaacttcctctacgcgaacgtggcgctgtacctgcccgactcctacaagtacacgccggccaacatcacgctgcccaccaacaccaacacctacgattacatgaacggccgcgtggtggcgccctcgctggtggacgcctacatcaacatcggggcgcgctggtcgctggaccccatggacaacgtcaaccccttcaaccaccaccgcaacgcgggcctgcgataccgctccatgctcctgggcaacgggcgctacgtgcccttccacatccaggtgccccaaaagtttttcgccatcaagagcctcctgctcctgcccgggtcctacacctacgagtggaacttccgcaaggacgtcaacatgatcctgcagagctccctcggcaacgacctgcgcacggacggggcctccatcgccttcaccagcatcaacctctacgccaccttcttccccatggcgcacaacaccgcctccacgctcgaggccatgctgcgcaacgacaccaacgaccagtccttcaacgactacctctcggcggccaacatgctctaccccatcccggccaacgccaccaacgtgcccatctccatcccctcgcgcaactgggccgccttccgcggctggtccttcacgcgcctcaagacccgcgagacgccctcgctcggctccgggttcgacccctacttcgtctactcgggctccatcccctacctcgacggcaccttctacctcaaccacaccttcaagaaggtctccatcaccttcgactcctccgtcagctggcccggcaacgaccgcctcctgacgcccaacgagttcgaaatcaagcgcaccgtcgacggagaggggtacaacgtggcccagtgcaacatgaccaaggactggttcctggtccagatgctggcccactacaacatcggctaccagggcttctacgtgcccgagggctacaaggaccgcatgtactccttcttccgcaacttccagcccatgagccgccaggtcgtggacgaggtcaactacaaggactaccaggccgtcaccctggcctaccagcacaacaactcgggcttcgtcggctacctcgcgcccaccatgcgccagggccagccctaccccgccaactacccctacccgctcatcggcaagagcgccgtcgccagcgtcacccagaaaaagttcctctgcgaccgggtcatgtggcgcatccccttctccagcaacttcatgtccatgggcgcgctcaccgacctcggccagaacatgctctacgccaactccgcccacgcgctagacatgaatttcgaagtcgaccccatggatgagtccacccttctctatgttgtcttcgaagtcttcgacgtcgtccgagtgcaccagccccaccgcggcgtcatcgaggccgtctacctgcgcacgcccttctcggccggcaacgccaccacctaagcctcttgcttcttgcaagatgacggcctgcgcgggctccggcgagcaggagctcagggccatcctccgcgacctgggctgcgggccctgcttcctgggcaccttcgacaagcgcttcccgggattcatggccccgcacaagctggcctgcgccatcgtcaacacggccggccgcgagaccgggggcgagcactggctggccttcgcctggaacccgcgctcccacacctgctacctcttcgaccccttcgggttctcggacgagcgcctcaagcagatctaccagttcgagtacgagggcctgctgcgtcgcagcgccctggccaccgaggaccgctgcgtcaccctggaaaagtccacccagaccgtgcagggtccgcgctcggccgcctgcgggctcttctgctgcatgttcctgcacgccttcgtgcactggcccgaccgccccatggacaagaaccccaccatgaacttgctgacgggggtgcccaacggcatgctccagtcgccccaggtggaacccaccctgcgccgcaaccaggaggcgctctaccgcttcctcaacgcccactccgcctactttcgctcccaccgcgcgcgcatcgagaaggccaccgccttcgaccgcatgaatcaagacatgtaatccggtgtgtgtatgtgaatgctttattcatcataataaacagcacatgtttatgccaccttctctgaggctctgactttatttagaaatcgaaggggttctgccggctctcggcatggcccgcgggcagggatacgttgcggaactggtacttgggcagccacttgaactcggggatcagcagcttcggcacggggaggtcggggaacgagtcgctccacagcttgcgcgtgagttgcagggcgcccagcaggtcgggcgcggagatcttgaaatcgcagttgggacccgcgttctgcgcgcgagagttacggtacacggggttgcagcactggaacaccatcagggccgggtgcttcacgctcgccagcaccgtcgcgtcggtgatgccctccacgtccagatcctcggcgttggccatcccgaagggggtcatcttgcaggtctgccgccccatgctgggcacgcagccgggcttgtggttgcaatcgcagtgcagggggatcagcatcatctgggcctgctcggagctcatgcccgggtacatggccttcatgaaagcctccagctggcggaaggcctgctgcgccttgccgccctcggtgaagaagaccccgcaggacttgctagagaactggttggtggcgcagccagcgtcgtgcacgcagcagcgcgcgtcgttgttggccagctgcaccacgctgcgcccccagcggttctgggtgatcttggcccggtcggggttctccttcagcgcgcgctgcccgttctcgctcgccacatccatctcgatcgtgtgctccttctggatcatcacggtcccgtgcaggcaccgcagcttgccctcggcctcggtgcacccgtgcagccacagcgcgcagccggtgctctcccagttcttgtgggcgatctgggagtgcgagtgcacgaagccctgcaggaagcggcccatcatcgtggtcagggtcttgttgctggtgaaggtcagcggaatgccgcggtgctcctcgttcacatacaggtggcagatacggcggtacacctcgccctgctcgggcatcagctggaaggcggacttcaggtcgctctccacgcggtaccggtccatcagcagcgtcatcacttccatgcccttctcccaggccgaaacgatcggcaggctcagggggttcttcaccgttgtcatcttagtcgccgccgccgaagtcagggggtcgttctcgtccagggtctcaaacactcgcttgccgtccttctcggtgatgcgcacggggggaaagctgaagcccacggccgccagctcctcctcggcctgcctttcgtcctcgctgtcctggctgatgtcttgcaaaggcacatgcttggtcttgcggggtttctttttgggcggcagaggcggcggcggagacgtgctgggcgagcgcgagttctcgctcaccacgactatttcttctccttggccgtcgtccgagaccacgcggcggtaggcatgcctcttctggggcagaggcggaggcgacgggctctcgcggttcggcgggcggctggcagagccccttccgcgttcgggggtgcgctcctggcggcgctgctctgactgacttcctccgcggccggccattgtgttctcctagggagcaagcatggagactcagccatcgtcgccaacatcgccatctgcccccgccgccgccgacgagaaccagcagcagcagaatgaaagcttaaccgccccgccgcccagccccacctccgacgccgcagccccagacatgcaagagatggaggaatccatcgagattgacctgggctacgtgacgcccgcggagcacgaggaggagctggcagcgcgcttttcagccccggaagagaaccaccaagagcagccagagcaggaagcagagagcgagcagaaccaggctgggctcgagcatggcgactacctgagcggggcagaggacgtgctcatcaagcatctggcccgccaatgcatcatcgtcaaggacgcgctgctcgaccgcgccgaggtgcccctcagcgtggcggagctcagccgcgcctacgagcgcaacctcttctcgccgcgcgtgccccccaagcgccagcccaacggcacctgcgagcccaacccgcgcctcaacttctacccggtcttcgcggtgcccgaggccctggccacctaccacctctttttcaagaaccaaaggatccccgtctcctgccgcgccaaccgcacccgcgccgacgccctgctcaacctgggccccggcgcccgcctacctgatatcgcctccttggaagaggttcccaagatcttcgagggtctgggcagcgacgagactcgggccgcgaacgctctgcaaggaagcggagaggagcatgagcaccacagcgccctggtggagttggaaggcgacaacgcgcgcctggcggtcctcaagcgcacggtcgagctgacccacttcgcctacccggcgctcaacctgccccccaaggtcatgagcgccgtcatggaccaggtgctcatcaagcgcgcctcgcccctctcggaggaggagatgcaggaccccgagagctcggacgagggcaagcccgtggtcagcgacgagcagctggcgcgctggctgggagcgagtagcaccccccagagcctggaagagcggcgcaagctcatgatggccgtggtcctggtgaccgtggagctggagtgtctgcgccgcttcttcgccgacgcggagaccctgcgcaaggtcgaggagaacctgcactacctcttcagacacgggttcgtgcgccaggcctgcaagatctccaacgtggagctgaccaacctggtctcctacatgggcatcctgcacgagaaccgcctggggcagaacgtgctgcacaccaccctgcgcggggaggcccgccgcgactacatccgcgactgcgtctacctgtacctctgccacacctggcagacgggcatgggcgtgtggcagcagtgcctggaggagcagaacctgaaagagctctgcaagctcctgcagaagaacctcaaggccctgtggaccgggttcgacgagcgcaccaccgccgcggacctggccgacctcatcttccccgagcgcctgcggctgacgctgcgcaacgggctgcccgactttatgagccaaagcatgttgcaaaactttcgctctttcatcctcgaacgctccgggatcctgcccgccacctgctccgcgctgccctcggacttcgtgccgctgaccttccgcgagtgccccccgccgctctggagccactgctacctgctgcgcctggccaactacctggcctaccactcggacgtgatcgaggacgtcagcggcgagggcctgctcgagtgccactgccgctgcaacctctgcacgccgcaccgctccctggcctgcaacccccagctgctgagcgagacccagatcatcggcaccttcgagttgcaaggccccggcgagggcaaggggggtctgaaactcaccccggggctgtggacctcggcctacttgcgcaagttcgtgcccgaggactaccatcccttcgagatcaggttctacgaggaccaatcccagccgcccaaggccgagctgtcggcctgcgtcatcacccagggggccatcctggcccaattgcaagccatccagaaatcccgccaagaatttctgctgaaaaagggccacggggtctacttggacccccagaccggagaggagctcaaccccagcttcccccaggatgccccgaggaagcagcaagaagctgaaagtggagctgccgccgccgccggaggatttggaggaagactgggagagcagtcaggcagaggaggaggagatggaagactgggacagcactcaggcagaggaggacagcctgcaagacagtctggaggaggaagacgaggtggaggaggcagaggaagaagcagccgccgccagaccgtcgtcctcggcggaggaggagaaagcaagcagcacggataccatctccgctccgggtcggggtcgcggcggccgggcccacagtagatgggacgagaccgggcgcttcccgaaccccaccacccagaccggtaagaaggagcggcagggatacaagtcctggcgggggcacaaaaacgccatcgtctcctgcttgcaagcctgcgggggcaacatctccttcacccggcgctacctgctcttccaccgcggggtgaacttcccccgcaacatcttgcattactaccgtcacctccacagcccctactactgtttccaagaagaggcagaaacccagcagcagcagcagcagcagaaaaccagcggcagcagctagaaaatccacagcggcggcaggtggactgaggatcgcggcgaacgagccggcgcagacccgggagctgaggaaccggatctttcccaccctctatgccatcttccagcagagtcgggggcaagagcaggaactgaaagtcaagaaccgttctctgcgctcgctcacccgcagttgtctgtatcacaagagcgaagaccaacttcagcgcactctcgaggacgccgaggctctcttcaacaagtactgcgcgctcactcttaaagagtagcccgcgcccgcccacacacggaaaaaggcgggaattacgtcaccacctgcgcccttcgcccgaccatcatcatgagcaaagagattcccacgccttacatgtggagctaccagccccagatgggcctggccgccggcgccgcccaggactactccacccgcatgaactggctcagtgccgggcccgcgatgatctcacgggtgaatgacatccgcgcccaccgaaaccagatactcctagaacagtcagcgatcaccgccacgccccgccatcaccttaatccgcgtaattggcccgccgccctggtgtaccaggaaattccccagcccacgaccgtactacttccgcgagacgcccaggccgaagtccagctgactaactcaggtgtccagctggccggcggcgccgccctgtgtcgtcaccgccccgctcagggtataaagcggctggtgatccgaggcagaggcacacagctcaacgacgaggtggtgagctcttcgctgggtctgcgacctgacggagtcttccaactcgccggatcggggagatcttccttcacgcctcgtcaggccgtcctgactttggagagttcgtcctcgcagccccgctcgggtggcatcggcactctccagttcgtggaggagttcactccctcggtctacttcaaccccttctccggctcccccggccactacccggacgagttcatcccgaacttcgacgccatcagcgagtcggtggacggctacgattgaatgtcccatggtggcgcggctgacctagctcggcttcgacacctggaccactgccgccgcttccgctgcttcgctcgggatctcgccgagtttgcctactttgagctgcccgaggagcaccctcagggcccggcccacggagtgcggatcgtcgtcgaagggggtctcgactcccacctgcttcggatcttcagccagcgtccgatcctggccgagcgcgagcaaggacagacccttctgaccctgtactgcatctgcaaccaccccggcctgcatgaaagtctttgttgtctgctgtgtactgagtataataaaagctgagatcagcgactactccggacttccgtgtgttcctgctatcaaccagtccctgttcttcaccgggaacgagaccgagctccagctccagtgtaagccccacaagaagtacctcacctggctgttccagggctctccgatcgccgttgtcaaccactgcgacaacgacggagtcctgctgagcggccctgccaaccttactttttccacccgcagaagcaagctccagctcttccaacccttcctccccgggacctatcagtgcgtctcgggaccctgccatcacaccttccacctgatcccgaataccacagcgtcgctccccgctactaacaaccaaactacccaccaacgccaccgtcgcgaccgcggacatgtacagagctcgagaagtactaggccacaatacatgcccatattagactatgaggccgagccacagcgacccatgctccccgctattagttacttcaatctaaccggcggagatgactgacccactggccaacaacaacgtcaacgaccttctcctggacatggacggccgcgcctcggagcagcgactcgcccaacttcgcattcgccagcagcaggagagagccgtcaaggagctgcaggacggcatagccatccaccagtgcaagaaaggcatcttctgcctggtgaaacaggccaagatctcctacgaggtcaccccgaccgaccatcgcctctcctacgagctcctgcagcagcgccagaagttcacctgcctggtcggagtcaaccccatcgtcatcacccagcagtcgggcgataccaaggggtgcatccactgctcctgcgactcccccgactgcgtccacactctgatcaagaccctctgcggcctccgcgacctcctccccatgaactaatcacccccttatccagtgaaataaatatcatattgatgatgatttaaataaaaaataatcatttgatttgaaataaagatacaatcatattgatgatttgagttttaaaaaataaagaatcacttacttgaaatctgataccaggtctctgtccatgttttctgccaacaccacctcactcccctcttcccagctctggtactgcagaccccggcgggctgcaaacttcctccacacgctgaaggggatgtcaaattcctcctgtccctcaatcttcattttatcttctatcagatgtccaaaaagcgcgtccgggtggatgatgacttcgaccccgtctacccctacgatgcagacaacgcaccgaccgtgcccttcatcaacccccccttcgtctcttcagatggattccaagagaagcccctgggggtgctgtccctgcgactggctgaccccgtcaccaccaagaacggggaaatcaccctcaagctgggagagggggtggacctcgactcctcgggaaaactcatctccaacacggccaccaaggccgccgcccctctcagtttttccaacaacaccatttcccttaacatggatacccctctttataccaaagatggaaaattatccttacaagtttctccaccgttaaacatattaaaatcaaccattctgaacacattagctgtagcttatggatcaggtttaggactgagtggtggcactgctcttgcagtacagttggcctctccactcacttttgatgaaaaaggaaatattaaaattaacctagccagtggtccattaacagttgatgcaagtcgacttagtatcaactgcaaaagaggggtcactgtcactacctcaggagatgcaattgaaagcaacataagctggcctaaaggtataagatttgaaggtaatggcatagctgcaaacattggcagaggattggaatttggaaccactagtacagagactgatgtcacagatgcatacccaattcaagttaaattgggtactggccttacctttgacagtacaggcgccattgttgcttggaacaaagaggatgataaacttacattatggaccacagccgacccctcgccaaattgcaaaatatactctgaaaaagatgccaaactcacactttgcttgacaaagtgtggaagtcaaattctgggtactgtgactgtattggcagtgaataatggaagtctcaacccaatcacaaacacagtaagcactgcactcgtctccctcaagtttgatgcaagtggagttttgctaagcagctccacattagacaaagaatattggaacttcagaaagggagatgttacacctgctgagccctatactaatgctataggttttatgcctaacataaaggcctatcctaaaaacacatctgcagcttcaaaaagccatattgtcagtcaagtttatctcaatggggatgaggccaaaccactgatgctgattattacttttaatgaaactgaggatgcaacttgcacctacagtatcacttttcaatggaaatgggatagtactaagtacacaggtgaaacacttgctaccagctccttcaccttctcctacatcgcccaagaatgaacactgtatcccaccctgcatgccaacccttcccaccccactctgtctatggaaaaaactctgaagcacaaaataaaataaagttcaagtgttttattgattcaacagttttacaggattcgagcagttatttttcctccaccctcccaggacatggaatacaccaccctctccccccgcacagccttgaacatctgaatgccattggtgatggacatgcttttggtctccacgttccacacagtttcagagcgagccagtctcgggtcggtcagggagatgaaaccctccgggcactcccgcatctgcacctcacagctcaacagctgaggattgtcctcggtggtcgggatcacggttatctggaagaagcagaagagcggcggtgggaatcatagtccgcgaacgggatcggccggtggtgtcgcatcaggccccgcagcagtcgctgccgccgccgctccgtcaagctgctgctcagggggtccgggtccagggactccctcagcatgatgcccacggccctcagcatcagtcgtctggtgcggcgggcgcagcagcgcatgcggatctcgctcaggtcgctgcagtacgtgcaacacaggaccaccaggttgttcaacagtccatagttcaacacgctccagccgaaactcatcgcgggaaggatgctacccacgtggccgtcgtaccagatcctcaggtaaatcaagtggcgctccctccagaacacgctgcccacgtacatgatctccttgggcatgtggcggttcaccacctcccggtaccacatcaccctctggttgaacatgcagccccggatgatcctgcggaaccacagggccagcaccgccccgcccgccatgcagcgaagagaccccgggtcccggcaatggcaatggaggacccaccgctcgtacccgtggatcatctgggagctgaacaagtctatgttggcacagcacaggcatatgctcatgcatctcttcagcactctcagctcctcgggggtcaaaaccatatcccagggcacggggaactcttgcaggacagcgaaccccgcagaacagggcaatcctcgcacataacttacattgtgcatggacagggtatcgcaatcaggcagcaccgggtgatcctccaccagagaagcgcgggtctcggtctcctcacagcgtggtaagggggccggccgatacgggtgatggcgggacgcggctgatcgtgttcgcgaccgtgtcatgatgcagttgctttcggacattttcgtacttgctgtagcagaacctggtccgggcgctgcacaccgatcgccggcggcggtcccggcgcttggaacgctcggtgttgaaattgtaaaacagccactctctcagaccgtgcagcagatctagggcctcaggagtgatgaagatcccatcatgcctgatagctctgatcacatcgaccaccgtggaatgggccagacccagccagatgatgcaattttgttgggtttcggtgacggcgggggagggaagaacaggaagaaccatgattaacttttaatccaaacggtctcggagcacttcaaaatgaaggtcgcggagatggcacctctcgcccccgctgtgttggtggaaaataacagccaggtcaaaggtgatacggttctcgagatgttccacggtggcttccagcaaagcctccacgcgcacatccagaaacaagacaatagcgaaagcgggagggttctctaattcctcaatcatcatgttacactcctgcaccatccccagataattttcatttttccagccttgaatgattcgaactagttcctgaggtaaatccaagccagccatgataaagagctcgcgcagagcgccctccaccggcattcttaagcacaccctcataattccaagatattctgctcctggttcacctgcagcagattgacaagcggaatatcaaaatctctgccgcgatccctaagctcctccctcagcaataactgtaagtactctttcatatcctctccgaaatttttagccataggaccaccaggaataagattagggcaagccacagtacagataaaccgaagtcctccccagtgagcattgccaaatgcaagactgctataagcatgctggctagacccggtgatatcttccagataactggacagaaaatcacccaggcaatttttaagaaaatcaacaaaagaaaaatcctccaggtgcacgtttagagcctcgggaacaacgatgaagtaaatgcaagcggtgcgttccagcatggttagttagctgatctgtaaaaaacaaaaaataaaacattaaaccatgctagcctggcgaacaggtgggtaaatcgttctctccagcaccaggcaggccacggggtctccggcgcgaccctcgtaaaaattgtcgctatgattgaaaaccatcacagagagacgttcccggtggccggcgtgaatgattcgacaagatgaatacacccccggaacattggcgtccgcgagtgaaaaaaagcgcccgaggaagcaataaggcactacaatgctcagtctcaagtccagcaaagcgatgccatgcggatgaagcacaaaatcctcaggtgcgtacaaaatgtaattactcccctcctgcacaggcagcgaagcccccgatccctccagatacacatacaaagcctcagcgtccatagcttaccgagcagcagcacacaacaggcgcaagagtcagagaaaggctgagctctaacctgtccacccgctctctgctcaatatatagcccagatctacactgacgtaaaggccaaagtctaaaaatacccgccaaataatcacacacgcccagcacacgcccagaaaccggtgacacactcaaaaaaatacgcgcacttcctcaaacgcccaaactgccgtcatttccgggttcccacgctacgtcatcggaattcgactttcaaattccgtcgaccgttaaaaacgtcacccgccccgcccctaacggtcgcccgtctctcggccaatcaccttcctccctccccaaattcaaacagctcatttgcatattaacgcgcaccaaaagtttgaggtatattattgatgatg
实施例6:基因稳定性和蛋白质表达
基因稳定性:如通过限制酶消化分析纯化的病毒DNA,然后在HEK 293细胞上连续12次传代进行凝胶电泳所确定的,所有载体都是基因稳定的。
蛋白质表达:用不同载体将HEK 293细胞感染48小时。制备细胞裂解物。通过SDS-PAGE分离蛋白质。从凝胶上切下约110-160kD大小的跨越蛋白质的条带,洗脱蛋白质并通过质谱法进行分析。结果(如下所示)表明,对于每种细胞裂解物,可以检测到源自gp140的肽。
从两个腺病毒载体血清型表达的3种不同的HIV进化枝的gp140内的细胞裂解物中检测到的序列用下划线标记并以粗体显示。结果表明,AdC7载体最有可能仅表达低水平的进化枝B gp140蛋白质。
Gp140进化枝BC:登录号,KC492738(SEQ ID
NO:4)
AdC6
gp140
BC
AdC7 gp140 BC
Gp140进化枝B:登录号,HM215399(SEQ
ID
NO:2)
AdC6
GP140
B
AdC7
GP140
B
Gp140进化枝C:登录号,KF835515(SEQ ID NO:3)
AdC6
GP140
C
AdC7
GP140
C
Gp140进化枝BC:登录号,KC492738(SEQ ID
NO:4)AdC6 GP140 BC
AdC7
GP140
BC
参考文献:
Carnathan DG,Wetzel KS,Yu J,Lee ST,Johnson BA,Paiardini M,Yan J,Morrow MP,Sardesai NY,Weiner DB,Ertl HC,Silvestri G.Activated CD4+CCR5+Tcells in the rectum predict increased SIV acquisition in SIVGag/Tat-vaccinated rhesus macaques.Proc Natl Acad Sci U S A.2015Jan 13;112(2):518-23。
Tuyishime S,Haut LH,Kurupati RK,Billingsley JM,Carnathan D,GangaharaS,Styles TM,Xiang Z,Li Y,Zopfs M,Liu Q,Zhou X,Lewis MG,Amara RR,Bosinger S,Silvestri G,Ertl HCJ.Correlates of Protection Against SIVmac251 Infection inRhesus Macaques Immunized With Chimpanzee-Derived AdenovirusVectors.EBioMedicine.2018May;31:25-35。
Cervasi B,Carnathan DG,Sheehan KM,Micci L,Paiardini M,Kurupati R,Tuyishime S,Zhou XY,Else JG,Ratcliffe SJ,Ertl HC,Silvestri G.Immunologicaland virological analyses of rhesus macaques immunized with chimpanzeeadenoviruses expressing the simian immunodeficiency virus Gag/Tat fusionprotein and challenged intrarectally with repeated low doses of SIVmac.JVirol.2013Sep;87(17):9420-30。
Lasaro MO,Haut LH,Zhou X,Xiang Z,Zhou D,Li Y,Giles-Davis W,Li H,Engram JC,Dimenna LJ,Bian A,Sazanovich M,Parzych EM,Kurupati R,Small JC,WuTL,Leskowitz RM,Klatt NR,Brenchley JM,Garber DA,Lewis M,Ratcliffe SJ,BettsMR,Silvestri G,Ertl HC.Vaccine-induced T cells provide partial protectionagainst high-dose rectal SIVmac239challenge of rhesus macaques.MolTher.2011Feb;19(2):417-26。
Tatsis N,Lasaro MO,Lin SW,Haut LH,Xiang ZQ,Zhou D,Dimenna L,Li H,BianA,Abdulla S,Li Y,Giles-Davis W,Engram J,Ratcliffe SJ,Silvestri G,Ertl HC,Betts MR.Adenovirus vector-induced immune responses in nonhuman primates:responses to prime boost regimens.J Immunol.2009May 15;182(10):6587-99。
其他实施方式
本文引用的每个专利、专利申请和出版物的公开内容均通过引用以其整体并入本文。
虽然已经参考具体实施方式公开了本发明,但是显而易见的是,在不背离本发明的真实精神和范围的情况下,本领域的其他技术人员可以设想本发明的其他实施方式和变型。所附权利要求旨在被解释为包括所有这样的实施方式和等同变型。
序列表
<110> 威斯达研究所
旷宇博捷医药科技有限公司
H·C·J·艾尔特
周向阳
罗小平
<120> 用于HIV疫苗应用的复制缺陷型腺病毒载体
<130> 368530-7021WO1(00098)
<150> 美国临时专利申请号62/835,108
<151> 2019-04-17
<160> 7
<170> PatentIn version 3.5
<210> 1
<211> 643
<212> PRT
<213> 人工序列
<220>
<223> 表达的蛋白
<220>
<223> Gp140进化枝AE1
<400> 1
Met Arg Val Lys Gly Thr Gln Met Asn Trp Pro Asn Leu Trp Lys Trp
1 5 10 15
Gly Thr Leu Ile Leu Gly Leu Val Ile Met Cys Ser Ala Ser Asp Asn
20 25 30
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Arg Asp Ala Asn
35 40 45
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala His Glu Thr Glu Val
50 55 60
His Asn Val Trp Ala Thr Tyr Ala Cys Val Pro Thr Asp Pro Asn Pro
65 70 75 80
Gln Glu Ile Pro Met Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys
85 90 95
Asn Asn Met Val Glu Gln Met Gln Glu Asp Val Ile Ser Leu Trp Asp
100 105 110
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
115 120 125
Ile Cys Thr Asn Ala Asn Leu Thr Lys Ile Asn Ser Thr Asn Ser Gly
130 135 140
Pro Lys Val Ile Gly Asn Val Thr Asp Glu Val Arg Asn Cys Ser Phe
145 150 155 160
Asn Met Thr Thr Leu Leu Thr Asp Lys Lys Gln Lys Val Tyr Ala Leu
165 170 175
Phe Tyr Lys Leu Asp Ile Val Pro Ile Asp Asn Ser Asn Ser Ser Glu
180 185 190
Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Lys Gln Ala Cys Pro
195 200 205
Lys Ile Ser Phe Asp Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly
210 215 220
Tyr Ala Ile Leu Lys Cys Asn Asp Lys Asn Phe Asn Gly Thr Gly Pro
225 230 235 240
Cys Lys Asn Val Ser Ser Val Gln Cys Thr His Gly Ile Lys Pro Val
245 250 255
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile
260 265 270
Ile Ile Arg Ser Glu Asn Leu Thr Asn Asn Ala Lys Thr Ile Ile Val
275 280 285
His Leu Asn Lys Ala Val Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn
290 295 300
Thr Arg Thr Ser Ile Arg Ile Gly Pro Gly Gln Ile Phe Tyr Arg Thr
305 310 315 320
Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala Tyr Cys Glu Ile Asn Gly
325 330 335
Thr Lys Trp Asn Glu Thr Leu Arg Gln Val Ala Lys Lys Leu Lys Glu
340 345 350
Gln Phe Asn Asn Thr Ile Lys Phe Gln Pro Pro Ser Gly Gly Asp Leu
355 360 365
Glu Ile Thr Met Leu His Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys
370 375 380
Asn Thr Thr Lys Leu Phe Asn Ser Thr Trp Glu Arg Asn Glu Thr Ile
385 390 395 400
Lys Gly Gly Asn Gly Asn Gly Asn Asp Thr Ile Ile Leu Pro Cys Arg
405 410 415
Ile Lys Gln Ile Ile Asn Met Trp Gln Gly Ala Gly Gln Ala Met Tyr
420 425 430
Ala Pro Pro Ile Ser Gly Ile Ile Asn Cys Val Ser Asn Ile Thr Gly
435 440 445
Ile Leu Leu Thr Arg Asp Gly Gly Asn Thr Asn Glu Thr Ala Glu Ile
450 455 460
Phe Arg Pro Gly Gly Gly Asn Ile Lys Asp Asn Trp Arg Ser Glu Leu
465 470 475 480
Tyr Lys Tyr Lys Val Val Gln Ile Glu Pro Leu Gly Val Ala Pro Thr
485 490 495
Lys Ala Lys Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile Val
500 505 510
Gln Gln Gln Ser Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His Met
515 520 525
Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Ile Leu
530 535 540
Ala Val Glu Ser Tyr Leu Lys His Gln Gln Phe Leu Gly Leu Trp Gly
545 550 555 560
Cys Ser Asn Lys Ile Ile Cys Thr Thr Ala Val Pro Trp Asn Ser Ser
565 570 575
Trp Ser Asn Lys Ser Tyr Asp Glu Ile Trp Glu Asn Met Thr Trp Ile
580 585 590
Glu Trp Glu Arg Glu Ile Gly Asn Tyr Thr Asn Gln Ile Tyr Asp Ile
595 600 605
Leu Thr Lys Ser Gln Glu Gln Gln Asp Lys Asn Glu Lys Glu Leu Leu
610 615 620
Glu Leu Asp Gln Trp Ala Ser Leu Trp Asn Trp Phe Ser Ile Thr Lys
625 630 635 640
Trp Leu Trp
<210> 2
<211> 639
<212> PRT
<213> 人工序列
<220>
<223> 表达的蛋白
<220>
<223> Gp140进化枝B HM215399
<400> 2
Met Arg Val Lys Gly Ile Arg Lys Asn Tyr Gln His Leu Trp Arg Trp
1 5 10 15
Gly Thr Met Leu Leu Gly Met Leu Met Ile Cys Ser Ala Ala Glu Asn
20 25 30
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr
35 40 45
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
50 55 60
His Asn Ile Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65 70 75 80
Gln Glu Val Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys
85 90 95
Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
100 105 110
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
115 120 125
Asn Cys Thr Asn Leu Arg Asn Thr Asn Asn Thr Ser Ser Asn Thr Ser
130 135 140
Asn Met Thr Glu Gly Gly Glu Ile Lys Asn Cys Ser Phe Asp Ile Thr
145 150 155 160
Thr Ser Ile Arg Thr Lys Val Lys Asp Tyr Ala Leu Phe Tyr Glu Leu
165 170 175
Asp Ile Val Ala Ile Asp Asn Thr Ser Tyr Arg Leu Arg Gln Cys Asn
180 185 190
Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile
195 200 205
Pro Ile His Tyr Cys Thr Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn
210 215 220
Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr Asn Val Ser Thr Val
225 230 235 240
Gln Cys Thr His Arg Ile Arg Pro Val Val Ser Thr Gln Leu Leu Leu
245 250 255
Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Ser Asn Phe
260 265 270
Thr Asp Asn Ala Lys Val Ile Ile Val Gln Leu Lys Glu Ser Val Glu
275 280 285
Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Pro Leu
290 295 300
Gly Pro Gly Lys Ala Trp Tyr Thr Thr Gly Gln Ile Ile Gly Asp Ile
305 310 315 320
Arg Gln Ala His Cys Asn Leu Ser Arg Ala Lys Trp Glu Asn Thr Leu
325 330 335
Gln Gln Ile Thr Lys Lys Leu Arg Glu Gln Phe Gly Asn Lys Thr Ile
340 345 350
Ile Phe Asn Gln Ser Ser Gly Gly Asp Pro Glu Val Val Thr His Ser
355 360 365
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Gln Leu Phe
370 375 380
Asn Ser Thr Trp Tyr Asn Asn Ser Thr Trp Asn Asp Thr Asn Asp Thr
385 390 395 400
Thr Glu Asn Ser Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Val
405 410 415
Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg
420 425 430
Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
435 440 445
Asp Gly Gly Lys Asn Glu Ser Asn Thr Thr Glu Thr Phe Arg Pro Gly
450 455 460
Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys
465 470 475 480
Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Arg Ala Lys Leu
485 490 495
Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Arg
500 505 510
Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His Leu Leu Gln Leu Thr
515 520 525
Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg
530 535 540
Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys
545 550 555 560
Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Val Ser Trp Ser Asn Arg
565 570 575
Ser Leu Ser Glu Ile Trp Asp Asn Met Thr Trp Met Glu Trp Glu Arg
580 585 590
Glu Ile Gly Asn Tyr Thr Lys Gln Ile Tyr Ser Leu Ile Glu Glu Ser
595 600 605
Gln Asn Gln Gln Glu Lys Asn Glu Leu Glu Leu Leu Glu Trp Asp Lys
610 615 620
Trp Ala Ser Leu Trp Asn Trp Phe Asn Ile Thr Asn Trp Leu Trp
625 630 635
<210> 3
<211> 634
<212> PRT
<213> 人工序列
<220>
<223> 表达的蛋白
<220>
<223> Gp140进化枝C
<400> 3
Met Arg Val Arg Gly Thr Gln Arg Asn Tyr Pro Gln Trp Trp Ile Trp
1 5 10 15
Gly Ile Leu Gly Phe Trp Met Leu Met Ile Cys Asn Val Gly Gly Asn
20 25 30
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr
35 40 45
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Asn Glu Val
50 55 60
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65 70 75 80
Gln Glu Met Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys
85 90 95
Asn Glu Met Val Asn Gln Met His Glu Asp Val Ile Ser Leu Trp Asp
100 105 110
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
115 120 125
Lys Cys Ser Asn Val Thr Leu Lys Asn Asn Thr Val Asn Ser Asn Glu
130 135 140
Thr Gln Tyr Arg Lys Asn Cys Thr Phe Asn Thr Thr Thr Glu Leu Lys
145 150 155 160
Asn Arg Lys Gln Lys Val Ser Ala Ile Phe Tyr Arg Ile Asp Ile Val
165 170 175
Pro Leu Gly Asn Glu Ser Ser Gly Asn Tyr Arg Leu Ile Asn Cys Asn
180 185 190
Thr Ser Ala Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp Pro Ile
195 200 205
Pro Ile His Tyr Cys Thr Pro Ala Gly Tyr Ala Leu Leu Lys Cys Asn
210 215 220
Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Asn Asn Val Ser Thr Val
225 230 235 240
Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu
245 250 255
Asn Gly Ser Leu Ala Glu Glu Glu Ile Ile Ile Arg Ser Glu Asn Leu
260 265 270
Thr Asn Asn Val Lys Thr Ile Ile Val His Leu Asn Glu Ser Val Glu
275 280 285
Ile Val Cys Ile Arg Pro Gly Asn Asn Thr Arg Gln Ser Ile Arg Ile
290 295 300
Gly Pro Gly Gln Thr Phe Tyr Ala Pro Gly Glu Ile Ile Gly Asn Ile
305 310 315 320
Arg Gln Ala His Cys Asn Ile Asn Gly Thr Lys Trp Asn Glu Thr Leu
325 330 335
Gln Gly Val Gly Lys Lys Leu Ala Glu His Phe Pro Asn Lys Thr Ile
340 345 350
Lys Phe Lys Pro Ser Ser Gly Gly Asp Pro Glu Ile Thr Thr His Ser
355 360 365
Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asp Thr Ser Gly Leu Phe
370 375 380
Asn Ser Thr Tyr Asn Ser Thr Tyr Val Pro Asn Gly Thr Glu Ser Lys
385 390 395 400
Pro Asn Ile Thr Ile Gln Cys Arg Ile Lys Gln Ile Ile Asn Met Trp
405 410 415
Gln Glu Val Gly Arg Ala Met Tyr Ala Pro Pro Ile Lys Gly Ser Ile
420 425 430
Thr Cys Lys Ser Asn Ile Thr Gly Leu Leu Leu Val Arg Asp Gly Gly
435 440 445
Ala Asn Thr Thr Glu Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg
450 455 460
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Glu Ile Lys
465 470 475 480
Pro Leu Gly Ile Ala Pro Thr Glu Ala Lys Leu Thr Val Gln Ala Arg
485 490 495
Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Lys Ala
500 505 510
Ile Glu Ala Gln Gln His Met Leu Gln Leu Thr Val Trp Gly Ile Lys
515 520 525
Gln Leu Gln Thr Arg Val Leu Ala Ile Glu Arg Tyr Leu Lys Asp Gln
530 535 540
Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr
545 550 555 560
Ala Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Thr Gln Asp Glu Ile
565 570 575
Trp Lys Asn Met Thr Trp Met Gln Trp Asp Arg Glu Ile Asn Asn Tyr
580 585 590
Thr Asn Thr Ile Tyr Ser Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu
595 600 605
Lys Asn Glu Lys Asp Leu Leu Ala Leu Asp Ser Trp Lys Asn Leu Trp
610 615 620
Asn Trp Phe Asp Ile Ser Asn Trp Leu Trp
625 630
<210> 4
<211> 647
<212> PRT
<213> 人工序列
<220>
<223> 表达的蛋白
<220>
<223> Gp140进化枝BC
<400> 4
Met Arg Val Met Gly Ile Arg Arg Asn Cys Gln His Leu Trp Arg Trp
1 5 10 15
Gly Ile Met Leu Leu Gly Met Leu Met Ile Cys Ser Val Val Gly Asn
20 25 30
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr
35 40 45
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
50 55 60
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65 70 75 80
Gln Glu Met Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys
85 90 95
Asn Glu Met Val Asn Gln Met Gln Glu Asp Val Ile Ser Leu Trp Asp
100 105 110
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
115 120 125
Lys Cys Lys Asn Val Ser Ser Asn Ser Thr Glu Thr Pro Lys Leu Arg
130 135 140
Gly Asn Ser Ser Glu Thr Tyr Lys Asp Glu Glu Met Lys Asn Cys Ser
145 150 155 160
Phe Asn Ala Thr Thr Ile Leu Arg Asp Lys Lys Gln Glu Val Tyr Ala
165 170 175
Leu Phe Tyr Lys Leu Asp Ile Ala Pro Leu Leu Leu Asn Ser Ser Glu
180 185 190
Asn Ser Ser Ala Tyr Tyr Ser Leu Ile Asn Cys Asn Thr Ser Ala Ile
195 200 205
Thr Gln Ala Cys Pro Lys Val Ser Phe Asp Pro Ile Pro Ile His Tyr
210 215 220
Cys Thr Pro Ala Gly Tyr Ala Ile Leu Lys Cys Asn Asp Lys Lys Phe
225 230 235 240
Asn Gly Thr Gly Pro Cys Ser Asn Val Ser Thr Val Gln Cys Thr His
245 250 255
Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu
260 265 270
Ala Glu Gly Glu Val Ile Ile Arg Ser Lys Asn Leu Thr Asp Asn Ala
275 280 285
Lys Thr Ile Ile Val Gln Leu Asn Arg Ser Val Glu Ile Val Cys Thr
290 295 300
Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Arg Ile Gly Pro Gly Gln
305 310 315 320
Thr Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala His
325 330 335
Cys Asn Ile Ser Glu Asp Met Trp Asn Glu Thr Leu His Trp Val Ser
340 345 350
Arg Lys Leu Ala Glu His Phe Pro Asn Arg Thr Ile Asn Phe Thr Ser
355 360 365
Ser Ser Gly Gly Asp Leu Glu Ile Ala Thr His Ser Phe Asn Cys Arg
370 375 380
Gly Glu Phe Phe Tyr Cys Asn Thr Ser Arg Leu Phe Asn Gly Thr Tyr
385 390 395 400
Met Phe Asn Gly Thr Arg Gly Asn Ser Ser Ser Asn Ser Thr Ile Thr
405 410 415
Ile Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Gln Val Gly
420 425 430
Arg Ala Met Tyr Ala Pro Pro Ile Glu Gly Asn Leu Thr Cys Arg Ser
435 440 445
Asn Ile Thr Gly Leu Leu Leu Val Arg Asp Gly Gly Asp Asn Thr Asn
450 455 460
Lys Thr Glu Ile Phe Arg Pro Gln Gly Gly Asp Met Arg Asp Asn Trp
465 470 475 480
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Glu Ile Lys Pro Leu Gly
485 490 495
Ile Ala Pro Thr Thr Ala Lys Leu Thr Val Gln Ala Arg Gln Leu Leu
500 505 510
Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Ile Glu Ala
515 520 525
Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln
530 535 540
Thr Arg Val Leu Ala Ile Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu
545 550 555 560
Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro
565 570 575
Trp Asn Ser Ser Trp Ser Asn Lys Thr Gln Asp Glu Ile Trp Asn Asn
580 585 590
Leu Thr Trp Met Gln Trp Asp Lys Glu Ile Ser Asn Tyr Thr Asp Thr
595 600 605
Ile Tyr Lys Leu Leu Glu Asp Ser Gln Asn Gln Gln Glu Arg Asn Glu
610 615 620
Lys Asp Leu Leu Ala Leu Asp Ser Trp Lys Asn Leu Trp Ser Trp Phe
625 630 635 640
Asp Ile Thr Asn Trp Leu Trp
645
<210> 5
<211> 500
<212> PRT
<213> 人工序列
<220>
<223> 表达的蛋白
<220>
<223> Gp140进化枝B JF932500
<400> 5
Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp
1 5 10 15
Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Arg Leu Lys
20 25 30
His Val Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro
35 40 45
Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gln Ile Leu Glu Gln Leu
50 55 60
Gln Pro Ser Leu Gln Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn
65 70 75 80
Thr Ile Ala Val Leu Tyr Cys Val His Gln Lys Ile Glu Ile Lys Asp
85 90 95
Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Ser Lys
100 105 110
Lys Lys Ala Gln Gln Ala Ala Ala Asp Thr Gly Asn Asn Ser Gln Val
115 120 125
Ser Gln Asn Tyr Pro Ile Val Arg Asn Leu Gln Gly Gln Met Val His
130 135 140
Gln Pro Leu Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu
145 150 155 160
Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Ser Ala Leu Ser
165 170 175
Glu Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly
180 185 190
Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Thr Ile Asn Glu Glu
195 200 205
Ala Ala Glu Trp Asp Arg Leu His Pro Pro Gln Ala Gly Pro Ile Ala
210 215 220
Pro Gly Gln Ile Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr
225 230 235 240
Ser Asn Leu Gln Glu Gln Ile Ala Trp Met Thr Asn Asn Pro Pro Ile
245 250 255
Pro Val Gly Glu Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys
260 265 270
Ile Val Arg Met Tyr Ser Pro Thr Ser Ile Leu Asp Ile Lys Gln Gly
275 280 285
Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu
290 295 300
Arg Ala Glu Gln Ala Ser Gln Asp Val Lys Asn Trp Met Thr Glu Thr
305 310 315 320
Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Lys Ala
325 330 335
Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly
340 345 350
Val Gly Gly Pro Ser His Lys Ala Arg Ile Leu Ala Glu Ala Met Ser
355 360 365
Gln Val Thr Asn Ser Ala Ser Val Met Met Gln Arg Gly Asn Phe Arg
370 375 380
Asn Gln Arg Lys Pro Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His
385 390 395 400
Ile Ala Lys Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys
405 410 415
Gly Lys Glu Gly His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn
420 425 430
Phe Leu Gly Lys Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe
435 440 445
Leu Gln Ser Arg Pro Glu Pro Thr Ala Pro Pro Glu Glu Ser Phe Arg
450 455 460
Phe Gly Glu Glu Thr Thr Thr Pro Ser Gln Lys Gln Glu Gln Ile Asp
465 470 475 480
Lys Glu Leu Tyr Pro Leu Ala Ser Leu Lys Ser Leu Phe Gly Asn Asp
485 490 495
Pro Ser Ser Gln
500
<210> 6
<211> 34406
<212> DNA
<213> 人工序列
<220>
<223> 运载体
<220>
<223> C6 020 CMV-HIV gp140 AE1
<400> 6
catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60
atttggggag ggaggaaggt gattggctgc gggagcggcg accgttaggg gcggggcggg 120
tgacgttttg atgacgtggc tatgaggcgg agccggtttg caagttctcg tgggaaaagt 180
gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240
ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300
aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360
ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420
ttccgcgtac ggtgtcaaag tccggtgttt ttacgtacga tatcatttcc ccgaaagtgc 480
cacctgaccg taactataac ggtcctaagg tagcgaaagc tcagatctcc cgatccccta 540
tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagta tctgctccct 600
gcttgtgtgt tggaggtcgc tgagtagtgc gcgagcaaaa tttaagctac aacaaggcaa 660
ggcttgaccg acaattgcat gaagaatctg cttagggtta ggcgttttgc gctgcttcgc 720
gatgtacggg ccagatatac gcgttgacat tgattattga ctagttatta atagtaatca 780
attacggggt cattagttca tagcccatat atggagttcc gcgttacata acttacggta 840
aatggcccgc ctggctgacc gcccaacgac ccccgcccat tgacgtcaat aatgacgtat 900
gttcccatag taacgccaat agggactttc cattgacgtc aatgggtgga gtatttacgg 960
taaactgccc acttggcagt acatcaagtg tatcatatgc caagtacgcc ccctattgac 1020
gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt acatgacctt atgggacttt 1080
cctacttggc agtacatcta cgtattagtc atcgctatta ccatggtgat gcggttttgg 1140
cagtacatca atgggcgtgg atagcggttt gactcacggg gatttccaag tctccacccc 1200
attgacgtca atgggagttt gttttggcac caaaatcaac gggactttcc aaaatgtcgt 1260
aacaactccg ccccattgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata 1320
agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac 1380
agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcaga 1440
agttggtcgt gaggcactgg gcaggtaagt atcaaggtta caagacaggt ttaaggagac 1500
caatagaaac tgggcttgtc gagacagaga agactcttgc gtttctgata ggcacctatt 1560
ggtcttactg acatccactt tgcctttctc tccacaggtg tccactccca gttcaattac 1620
agctcttaaa aggctagagt acttaatacg actcactata ggctagcatg agagtgaagg 1680
ggacacagat gaattggcca aacttgtgga aatgggggac tttgatcctt gggttggtga 1740
tcatgtgtag tgcctcagac aacttgtggg ttacagttta ttatggagtt cctgtgtgga 1800
gagatgcaaa taccacccta ttttgtgcat cagatgccaa agcacatgag acagaagtgc 1860
acaatgtctg ggccacatat gcctgtgtac ccacagatcc caacccacaa gaaataccca 1920
tggaaaatgt gacagaaaat tttaacatgt ggaaaaataa catggtagag caaatgcagg 1980
aggatgtaat cagtttatgg gatcaaagtc taaagccatg tgtaaagtta actcctctct 2040
gcgttacttt aatttgtacc aatgctaact tgaccaagat caacagtacc aatagcgggc 2100
ctaaagtaat aggaaatgta acagatgaag taagaaactg ttcttttaat atgaccacat 2160
tactaacaga taagaagcaa aaggtttatg cactttttta taagcttgat atagtaccaa 2220
ttgataatag taatagtagt gagtatagat taataaattg taatacttca gtcattaagc 2280
aggcttgtcc aaagatatcc tttgatccaa ttcctataca ttattgtact ccagctggtt 2340
atgcgatttt aaaatgtaat gataagaatt tcaatgggac agggccatgt aaaaatgtca 2400
gctcagtaca gtgcacacat ggaattaagc cagtggtctc aactcaatta ctgttaaatg 2460
gcagtctagc agaagaagag ataataatca gatctgaaaa tctcacaaac aatgccaaaa 2520
ccataatagt gcaccttaat aaggctgtag aaatcaattg taccagaccc tccaacaata 2580
caagaacaag tataagaata ggaccaggac aaatatttta tagaacagga gacataatag 2640
gagatataag acaagcatat tgtgaaatta atggaacaaa atggaatgaa actttaagac 2700
aggtagcaaa aaaattaaaa gagcaattta ataacacaat aaaattccag ccaccctcag 2760
gaggagatct agaaattaca atgcttcatt ttaattgtag aggggaattt ttctattgca 2820
atacaacaaa actgttcaat agtacttggg aaagaaatga gaccataaaa gggggtaatg 2880
gcaatggcaa tgacactatc atacttccat gcaggataaa gcaaatcata aacatgtggc 2940
aaggagcagg acaagcaatg tatgctcctc ccatcagtgg aataattaac tgtgtatcaa 3000
atattacagg aatactattg acaagagatg gtggtaatac taatgaaact gccgagatct 3060
tcagacctgg aggaggaaat ataaaggaca attggagaag tgaattatat aaatataaag 3120
tagtacaaat tgaaccacta ggagtagcac ccaccaaggc aaagctgacg gtacaggcca 3180
gacaattatt gtctggtata gtgcaacagc aaagcaattt gctgagggct atagaggcgc 3240
agcagcatat gttgcaactc acagtctggg gcattaaaca gctccaggca agaatcctgg 3300
ctgtggaaag ctacctaaag catcaacagt tcctaggact ttggggctgc tctaacaaaa 3360
ttatctgcac cactgctgta ccctggaatt cctcttggag taataaatct tatgatgaga 3420
tttgggaaaa tatgacatgg atagaatggg agagagaaat tggcaattac acaaaccaaa 3480
tatatgatat acttacaaaa tcgcaggaac agcaggacaa aaatgaaaag gaactgttgg 3540
aattggatca atgggcaagt ctgtggaatt ggtttagcat aacaaaatgg ctgtggtaat 3600
gtacaagtaa agcggccgcc actgtgctgg atgatccgag ctcggtacct ctagagtcga 3660
cccgggcggc caaaccgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt 3720
gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc 3780
taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt 3840
ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggat 3900
gcggtgggct ctatggcttc tgaggcggaa agaaccagca gatctgcaga tctgaattca 3960
tctatgtcgg gtgcggagaa agaggtaatg aaatggcatt atgggtatta tgggtctgca 4020
ttaatgaatc ggtcagatat cgacatatgc tggccaccgt gcatgtggcc tcgcaccccc 4080
gcaagacatg gcccgagttc gagcacaacg tcatgacccg ctgcaatgtg cacctgggct 4140
cccgccgagg catgttcatg ccctaccagt gcaacatgca atttgtgaag gtgctgctgg 4200
agcccgatgc catgtccaga gtgagcctga cgggggtgtt tgacatgaat gtggagctgt 4260
ggaaaattct gagatatgat gaatccaaga ccaggtgccg ggcctgcgaa tgcggaggca 4320
agcacgccag gcttcagccc gtgtgtgtgg aggtgacgga ggacctgcga cccgatcatt 4380
tggtgttgtc ctgcaacggg acggagttcg gctccagcgg ggaagaatct gactagagtg 4440
agtagtgttt ggggctgggt gtgagcctgc atgaggggca gaatgactaa aatctgtggt 4500
tttctgtgtg ttgcagcagc atgagcggaa gcgcctcctt tgagggaggg gtattcagcc 4560
cttatctgac ggggcgtctc ccctcctggg cgggagtgcg tcagaatgtg atgggatcca 4620
cggtggacgg ccggcccgtg cagcccgcga actcttcaac cctgacctac gcgaccctga 4680
gctcctcgtc cgtggacgca gctgccgccg cagctgctgc ttccgccgcc agcgccgtgc 4740
gcggaatggc cctgggcgcc ggctactaca gctctctggt ggccaactcg agttccacca 4800
ataatcccgc cagcctgaac gaggagaagc tgctgctgct gatggcccag ctcgaggccc 4860
tgacccagcg cctgggcgag ctgacccagc aggtggctca gctgcaggcg gagacgcggg 4920
ccgcggttgc cacggtgaaa accaaataaa aaatgaatca ataaataaac ggagacggtt 4980
gttgatttta acacagagtc ttgaatcttt atttgatttt tcgcgcgcgg taggccctgg 5040
accaccggtc tcgatcattg agcacccggt ggatcttttc caggacccgg tagaggtggg 5100
cttggatgtt gaggtacatg ggcatgagcc cgtcccgggg gtggaggtag ctccattgca 5160
gggcctcgtg ctcggggatg gtgttgtaaa tcacccagtc atagcagggg cgcagggcgt 5220
ggtgctgcac gatgtccttg aggaggagac tgatggccac gggcagcccc ttggtgtagg 5280
tgttgacgaa cctgttgagc tgggagggat gcatgcgggg ggagatgaga tgcatcttgg 5340
cctggatctt gagattggcg atgttcccgc ccagatcccg ccgggggttc atgttgtgca 5400
ggaccaccag cacggtgtat ccggtgcact tggggaattt gtcatgcaac ttggaaggga 5460
aggcgtgaaa gaatttggag acgcccttgt gaccgcccag gttttccatg cactcatcca 5520
tgatgatggc gatgggcccg tgggcggcgg cctgggcaaa gacgtttcgg gggtcggaca 5580
catcgtagtt gtggtcctgg gtgagctcgt cataggccat tttaatgaat ttggggcgga 5640
gggtgcccga ctgggggacg aaggtgccct cgatcccggg ggcgtagttg ccctcgcaga 5700
tctgcatctc ccaggccttg agctcggagg gggggatcat gtccacctgc ggggcgatga 5760
aaaaaacggt ttccggggcg ggggagatga gctgggccga aagcaggttc cggagcagct 5820
gggacttgcc gcaaccggtg gggccgtaga tgaccccgat gaccggctgc aggtggtagt 5880
tgagggagag acagctgccg tcctcgcgga ggaggggggc cacctcgttc atcatctcgc 5940
gcacatgcat gttctcgcgc acgagttccg ccaggaggcg ctcgcccccc agcgagagga 6000
gctcttgcag cgaggcgaag tttttcagcg gcttgagtcc gtcggccatg ggcattttgg 6060
agagggtctg ttgcaagagt tccagacggt cccagagctc ggtgatgtgc tctagggcat 6120
ctcgatccag cagacctcct cgtttcgcgg gttggggcga ctgcgggagt agggcaccag 6180
gcgatgggcg tccagcgagg ccagggtccg gtccttccag ggccgcaggg tccgcgtcag 6240
cgtggtctcc gtcacggtga aggggtgcgc gccgggctgg gcgcttgcga gggtgcgctt 6300
caggctcatc cggctggtcg agaaccgctc ccggtcggcg ccctgcgcgt cggccaggta 6360
gcaattgagc atgagttcgt agttgagcgc ctcggccgcg tggcccttgg cgcggagctt 6420
acctttggaa gtgtgtccgc agacgggaca gaggagggac ttgagggcgt agagcttggg 6480
ggcgaggaag acggactcgg gggcgtaggc gtccgcgccg cagctggcgc agacggtctc 6540
gcactccacg agccaggtga ggtcggggcg gttggggtca aaaacgaggt ttcctccgtg 6600
ctttttgatg cgtttcttac ctctggtctc catgagctcg tgtccccgct gggtgacaaa 6660
gaggctgtcc gtgtccccgt agaccgactt tatgggccgg tcctcgagcg gggtgccgcg 6720
gtcctcgtcg tagaggaacc ccgcccactc cgagacgaag gcccgggtcc aggccagcac 6780
gaaggaggcc acgtgggagg ggtagcggtc gttgtccacc agcgggtcca ccttctccag 6840
ggtatgcaag cacatgtccc cctcgtccac atccaggaag gtgattggct tgtaagtgta 6900
ggccacgtga ccgggggtcc cggccggggg ggtataaaag ggggcgggcc cctgctcgtc 6960
ctcactgtct tccggatcgc tgtccaggag cgccagctgt tggggtaggt attccctctc 7020
gaaggcgggc atgacctcgg cactcaggtt gtcagtttct agaaacgagg aggatttgat 7080
attgacggtg ccgttggaga cgcctttcat gagcccctcg tccatttggt cagaaaagac 7140
gatctttttg ttgtcgagct tggtggcgaa ggagccgtag agggcgttgg agagcagctt 7200
ggcgatggag cgcatggtct ggttcttttc cttgtcggcg cgctccttgg cggcgatgtt 7260
gagctgcacg tactcgcgcg ccacgcactt ccattcgggg aagacggtgg tgagctcgtc 7320
gggcacgatt ctgacccgcc agccgcggtt gtgcagggtg atgaggtcca cgctggtggc 7380
cacctcgccg cgcaggggct cgttggtcca gcagaggcgc ccgcccttgc gcgagcagaa 7440
ggggggcagc gggtccagca tgagctcgtc gggggggtcg gcgtccacgg tgaagatgcc 7500
gggcaggagc tcggggtcga agtagctgat gcaggtgccc agattgtcca gcgccgcttg 7560
ccagtcgcgc acggccagcg cgcgctcgta ggggctgagg ggcgtgcccc agggcatggg 7620
gtgcgtgagc gcggaggcgt acatgccgca gatgtcgtag acgtagaggg gctcctcgag 7680
gacgccgatg taggtggggt agcagcgccc cccgcggatg ctggcgcgca cgtagtcgta 7740
cagctcgtgc gagggcgcga ggagccccgt gccgaggttg gagcgttgcg gcttttcggc 7800
gcggtagacg atctggcgga agatggcgtg ggagttggag gagatggtgg gcctttggaa 7860
gatgttgaag tgggcgtggg gcaggccgac cgagtccctg atgaagtggg cgtaggagtc 7920
ctgcagcttg gcgacgagct cggcggtgac gaggacgtcc agggcgcagt agtcgagggt 7980
ctcttggatg atgtcatact tgagctggcc cttctgcttc cacagctcgc ggttgagaag 8040
gaactcttcg cggtccttcc agtactcttc gagggggaac ccgtcctgat cggcacggta 8100
agagcccacc atgtagaact ggttgacggc cttgtaggcg cagcagccct tctccacggg 8160
gagggcgtaa gcttgcgcgg ccttgcgcag ggaggtgtgg gtgagggcga aggtgtcgcg 8220
caccatgacc ttgaggaact ggtgcttgaa gtcgaggtcg tcgcagccgc cctgctccca 8280
gagttggaag tccgtgcgct tcttgtaggc ggggttaggc aaagcgaaag taacatcgtt 8340
gaagaggatc ttgcccgcgc ggggcatgaa gttgcgagtg atgcggaaag gctggggcac 8400
ctcggcccgg ttgttgatga cctgggcggc gaggacgatc tcgtcgaagc cgttgatgtt 8460
gtgcccgacg atgtagagtt ccacgaatcg cgggcggccc ttgacgtggg gcagcttctt 8520
gagctcgtcg taggtgagct cggcggggtc gctgagcccg tgctgctcga gggcccagtc 8580
ggcgacgtgg gggttggcgc tgaggaagga agtccagaga tccacggcca gggcggtctg 8640
caagcggtcc cggtactgac ggaactgttg gcccacggcc attttttcgg gggtgacgca 8700
gtagaaggtg cgggggtcgc cgtgccagcg gtcccacttg agctggaggg cgaggtcgtg 8760
ggcgagctcg acgagcggcg ggtccccgga gagtttcatg accagcatga aggggacgag 8820
ctgcttgccg aaggacccca tccaggtgta ggtttccaca tcgtaggtga ggaagagcct 8880
ttcggtgcga ggatgcgagc cgatggggaa gaactggatc tcctgccacc agttggagga 8940
atggctgttg atgtgatgga agtagaaatg ccgacggcgc gccgagcact cgtgcttgtg 9000
tttatacaag cgtccgcagt gctcgcaacg ctgcacggga tgcacgtgct gcacgagctg 9060
tacctgggtt cctttggcga ggaatttcag tgggcagtgg agcgctggcg gctgcatctc 9120
gtgctgtact acgtcttggc catcggcgtg gccatcgtct gcctcgatgg tggtcatgct 9180
gacgagcccg cgcgggaggc aggtccagac ctcggctcgg acgggtcgga gagcgaggac 9240
gagggcgcgc aggccggagc tgtccagggt cctgagacgc tgcggagtca ggtcagtggg 9300
cagcggcggc gcgcggttga cttgcaggag cttttccagg gcgcgcggga ggtccagatg 9360
gtacttgatc tccacggcgc cgttggtggc tacgtccacg gcttgcaggg tgccgtgccc 9420
ctggggcgcc accaccgtgc cccgtttctt cttgggcgct gcttccatgt cggtcagaag 9480
cggcggcgag gacgcgcgcc gggcggcagg ggcggctcgg ggcccggagg caggggcggc 9540
aggggcacgt cggcgccgcg cgcgggcagg ttctggtact gcgcccggag aagactggcg 9600
tgagcgacga cgcgacggtt gacgtcctgg atctgacgcc tctgggtgaa ggccacggga 9660
cccgtgagtt tgaacctgaa agagagttcg acagaatcaa tctcggtatc gttgacggcg 9720
gcctgccgca ggatctcttg cacgtcgccc gagttgtcct ggtaggcgat ctcggtcatg 9780
aactgctcga tctcctcctc ctgaaggtct ccgcggccgg cgcgctcgac ggtggccgcg 9840
aggtcgttgg agatgcggcc catgagctgc gagaaggcgt tcatgccggc ctcgttccag 9900
acgcggctgt agaccacggc tccgtcgggg tcgcgcgcgc gcatgaccac ctgggcgagg 9960
ttgagctcga cgtggcgcgt gaagaccgcg tagttgcaga ggcgctggta gaggtagttg 10020
agcgtggtgg cgatgtgctc ggtgacgaag aagtacatga tccagcggcg gagcggcatc 10080
tcgctgacgt cgcccagggc ttccaagcgt tccatggcct cgtagaagtc cacggcgaag 10140
ttgaaaaact gggagttgcg cgccgagacg gtcaactcct cctccagaag acggatgagc 10200
tcggcgatgg tggcgcgcac ctcgcgctcg aaggccccgg ggggctcctc ttccatctcc 10260
tcctcttcct cctccactaa catctcttct acttcctcct caggaggcgg tggcggggga 10320
ggggccctgc gtcgccggcg gcgcacgggc agacggtcga tgaagcgctc gatggtctcc 10380
ccgcgccggc gacgcatggt ctcggtgacg gcgcgcccgt cctcgcgggg ccgcagcatg 10440
aagacgccgc cgcgcatctc caggtggccg ccgggggggt ctccgttggg cagggagagg 10500
gcgctgacga tgcatcttat caattgaccc gtagggactc cgcgcaagga cctgagcgtc 10560
tcgagatcca cgggatccga aaaccgctga acgaaggctt cgagccagtc gcagtcgcaa 10620
ggtaggctga gcccggtttc ttgttcttcg ggtatttggt cgggaggcgg gcgggcgatg 10680
ctgctggtga tgaagttgaa gtaggcggtc ctgagacggc ggatggtggc gaggagcacc 10740
aggtccttgg gcccggcttg ctggatgcgc agacggtcgg ccatgcccca ggcgtggtcc 10800
tgacacctgg cgaggtcctt gtagtagtcc tgcatgagcc gctccacggg cacctcctcc 10860
tcgcccgcgc ggccgtgcat gcgcgtgagc ccgaacccgc gctgcggctg gacgagcgcc 10920
aggtcggcga cgacgcgctc ggtgaggatg gcctgctgga tctgggtgag ggtggtctgg 10980
aagtcgtcga agtcgacgaa gcggtggtag gctccggtgt tgatggtgta ggagcagttg 11040
gccatgacgg accagttgac ggtctggtgg ccgggtcgca cgagctcgtg gtacttgagg 11100
cgcgagtagg cgcgcgtgtc gaagatgtag tcgttgcagg cgcgcacgag gtactggtat 11160
ccgacgagga agtgcggcgg cggctggcgg tagagcggcc atcgctcggt ggcgggggcg 11220
ccgggcgcga ggtcctcgag catgaggcgg tggtagccgt agatgtacct ggacatccag 11280
gtgatgccgg cggcggtggt ggaggcgcgc gggaactcgc ggacgcggtt ccagatgttg 11340
cgcagcggca ggaagtagtt catggtggcc gcggtctggc ccgtgaggcg cgcgcagtcg 11400
tggatgctct agacatacgg gcaaaaacga aagcggtcag cggctcgact ccgtggcctg 11460
gaggctaagc gaacgggttg ggctgcgcgt gtaccccggt tcgaatctcg aatcaggctg 11520
gagccgcagc taacgtggta ctggcactcc cgtctcgacc caagcctgct aacgaaacct 11580
ccaggatacg gaggcgggtc gttttttggc cttggtcgct ggtcatgaaa aactagtaag 11640
cgcggaaagc ggccgcccgc gatggctcgc tgccgtagtc tggagaaaga atcgccaggg 11700
ttgcgttgcg gtgtgccccg gttcgagcct cagcgctcgg cgccggccgg attccgcggc 11760
taacgtgggc gtggctgccc cgtcgtttcc aagacccctt agccagccga cttctccagt 11820
tacggagcga gcccctcttt ttttttcttg tgtttttgcc agatgcatcc cgtactgcgg 11880
cagatgcgcc cccaccctcc accacaaccg cccctaccgc agcagcagca acagccggcg 11940
cttctgcccc cgccccagca gcagccagcc actaccgcgg cggccgccgt gagcggagcc 12000
ggcgttcagt atgacctggc cttggaagag ggcgaggggc tggcgcggct gggggcgtcg 12060
tcgccggagc ggcacccgcg cgtgcagatg aaaagggacg ctcgcgaggc ctacgtgccc 12120
aagcagaacc tgttcagaga caggagcggc gaggagcccg aggagatgcg cgcctcccgc 12180
ttccacgcgg ggcgggagct gcggcgcggc ctggaccgaa agcgggtgct gagggacgag 12240
gatttcgagg cggacgagct gacggggatc agccccgcgc gcgcgcacgt ggccgcggcc 12300
aacctggtca cggcgtacga gcagaccgtg aaggaggaga gcaacttcca aaaatccttc 12360
aacaaccacg tgcgcacgct gatcgcgcgc gaggaggtga ccctgggcct gatgcacctg 12420
tgggacctgc tggaggccat cgtgcagaac cccacgagca agccgctgac ggcgcagctg 12480
tttctggtgg tgcagcacag tcgggacaac gagacgttca gggaggcgct gctgaatatc 12540
accgagcccg agggccgctg gctcctggac ctggtgaaca ttttgcagag catcgtggtg 12600
caggagcgcg ggctgccgct gtccgagaag ctggcggcca tcaacttctc ggtgctgagt 12660
ctgggcaagt actacgctag gaagatctac aagaccccgt acgtgcccat agacaaggag 12720
gtgaagatcg acgggtttta catgcgcatg accctgaaag tgctgaccct gagcgacgat 12780
ctgggggtgt accgcaacga caggatgcac cgcgcggtga gcgccagccg ccggcgcgag 12840
ctgagcgacc aggagctgat gcacagcctg cagcgggccc tgaccggggc cgggaccgag 12900
ggggagagct actttgacat gggcgcggac ctgcgctggc agcccagccg ccgggccttg 12960
gaagctgccg gcggttcccc ctacgtggag gaggtggacg atgaggagga ggagggcgag 13020
tacctggaag actgatggcg cgaccgtatt tttgctagat gcagcaacag ccaccgccgc 13080
cgcctcctga tcccgcgatg cgggcggcgc tgcagagcca gccgtccggc attaactcct 13140
cggacgattg gacccaggcc atgcaacgca tcatggcgct gacgacccgc aatcccgaag 13200
cctttagaca gcagcctcag gccaaccggc tctcggccat cctggaggcc gtggtgccct 13260
cgcgctcgaa ccccacgcac gagaaggtgc tggccatcgt gaacgcgctg gtggagaaca 13320
aggccatccg cggtgacgag gccgggctgg tgtacaacgc gctgctggag cgcgtggccc 13380
gctacaacag caccaacgtg cagacgaacc tggaccgcat ggtgaccgac gtgcgcgagg 13440
cggtgtcgca gcgcgagcgg ttccaccgcg agtcgaacct gggctccatg gtggcgctga 13500
acgccttcct gagcacgcag cccgccaacg tgccccgggg ccaggaggac tacaccaact 13560
tcatcagcgc gctgcggctg atggtggccg aggtgcccca gagcgaggtg taccagtcgg 13620
ggccggacta cttcttccag accagtcgcc agggcttgca gaccgtgaac ctgagccagg 13680
ctttcaagaa cttgcaggga ctgtggggcg tgcaggcccc ggtcggggac cgcgcgacgg 13740
tgtcgagcct gctgacgccg aactcgcgcc tgctgctgct gctggtggcg cccttcacgg 13800
acagcggcag cgtgagccgc gactcgtacc tgggctacct gcttaacctg taccgcgagg 13860
ccatcggaca ggcgcacgtg gacgagcaga cctaccagga gatcacccac gtgagccgcg 13920
cgctgggcca ggaggacccg ggcaacctgg aggccaccct gaacttcctg ctgaccaacc 13980
ggtcgcagaa gatcccgccc cagtacgcgc tgagcaccga ggaggagcgc atcctgcgct 14040
acgtgcagca gagcgtgggg ctgttcctga tgcaggaggg ggccacgccc agcgcggcgc 14100
tcgacatgac cgcgcgcaac atggagccca gcatgtacgc ccgcaaccgc ccgttcatca 14160
ataagctgat ggactacttg catcgggcgg ccgccatgaa ctcggactac tttaccaacg 14220
ccatcttgaa cccgcactgg ctcccgccgc ccgggttcta cacgggcgag tacgacatgc 14280
ccgaccccaa cgacgggttc ctgtgggacg acgtggacag cagcgtgttc tcgccgcgtc 14340
caggaaccaa tgccgtgtgg aagaaagagg gcggggaccg gcggccgtcc tcggcgctgt 14400
ccggtcgcgc gggtgctgcc gcggcggtgc ccgaggccgc cagccccttc ccgagcctgc 14460
ccttttcgct gaacagcgtg cgcagcagcg agctgggtcg gctgacgcga ccgcgcctgc 14520
tgggcgagga ggagtacctg aacgactcct tgttgaggcc cgagcgcgag aagaacttcc 14580
ccaataacgg gatagagagc ctggtggaca agatgagccg ctggaagacg tacgcgcacg 14640
agcacaggga cgagccccga gctagcagcg caggcacccg tagacgccag cggcacgaca 14700
ggcagcgggg actggtgtgg gacgatgagg attccgccga cgacagcagc gtgttggact 14760
tgggtgggag tggtggtaac ccgttcgctc acctgcgccc ccgtatcggg cgcctgatgt 14820
aagaatctga aaaaataaaa gacggtactc accaaggcca tggcgaccag cgtgcgttct 14880
tctctgttgt ttgtagtagt atgatgaggc gcgtgtaccc ggagggtcct cctccctcgt 14940
acgagagcgt gatgcagcag gcggtggcgg cggcgatgca gcccccgctg gaggcgcctt 15000
acgtgccccc gcggtacctg gcgcctacgg aggggcggaa cagcattcgt tactcggagc 15060
tggcaccctt gtacgatacc acccggttgt acctggtgga caacaagtcg gcagacatcg 15120
cctcgctgaa ctaccagaac gaccacagca acttcctgac caccgtggtg cagaacaacg 15180
atttcacccc cacggaggcc agcacccaga ccatcaactt tgacgagcgc tcgcggtggg 15240
gcggccagct gaaaaccatc atgcacacca acatgcccaa cgtgaacgag ttcatgtaca 15300
gcaacaagtt caaggcgcgg gtgatggtct cgcgcaagac ccccaacggg gtggatgatg 15360
attatgatgg tagtcaggac gagctgacct acgagtgggt ggagtttgag ctgcccgagg 15420
gcaacttctc ggtgaccatg accatcgatc tgatgaacaa cgccatcatc gacaactact 15480
tggcggtggg gcggcagaac ggggtgctgg agagcgacat cggcgtgaag ttcgacacgc 15540
gcaacttccg gctgggctgg gaccccgtga ccgagctggt gatgccgggc gtgtacacca 15600
acgaggcctt ccaccccgac atcgtcctgc tgcccggctg cggcgtggac ttcaccgaga 15660
gccgcctcag caacctgctg ggcatccgca agcggcagcc cttccaggag ggcttccaga 15720
tcctgtacga ggacctggag gggggcaaca tccccgcgct cttggatgtc gaagcctacg 15780
agaaaagcaa ggaggatagc accgccgcgg cgaccgcagc cgtggccacc gcctctaccg 15840
aggtgcgggg cgataatttt gctagcgctg cggcagcggc cgaggcggct gaaaccgaaa 15900
gtaagatagt catccagccg gtggagaagg acagcaagga caggagctac aacgtgctcg 15960
cggacaagaa aaacaccgcc taccgcagct ggtacctggc ctacaactac ggcgaccccg 16020
agaagggcgt gcgctcctgg acgctgctca ccacctcgga cgtcacctgc ggcgtggagc 16080
aagtctactg gtcgctgccc gacatgatgc aagacccggt caccttccgc tccacgcgtc 16140
aagttagcaa ctacccggtg gtgggcgccg agctcctgcc cgtctactcc aagagcttct 16200
tcaacgagca ggccgtctac tcgcagcagc tgcgcgcctt cacctcgctc acgcacgtct 16260
tcaaccgctt ccccgagaac cagatcctcg tccgcccgcc cgcgcccacc attaccaccg 16320
tcagtgaaaa cgttcctgct ctcacagatc acgggaccct gccgctgcgc agcagtatcc 16380
ggggagtcca gcgcgtgacc gtcactgacg ccagacgccg cacctgcccc tacgtctaca 16440
aggccctggg cgtagtcgcg ccgcgcgtcc tctcgagccg caccttctaa aaaatgtcca 16500
ttctcatctc gcccagtaat aacaccggtt ggggcctgcg cgcgcccagc aagatgtacg 16560
gaggcgctcg ccaacgctcc acgcaacacc ccgtgcgcgt gcgcgggcac ttccgcgctc 16620
cctggggcgc cctcaagggc cgcgtgcgct cgcgcaccac cgtcgacgac gtgatcgacc 16680
aggtggtggc cgacgcgcgc aactacacgc ccgccgccgc gcccgtctcc accgtggacg 16740
ccgtcatcga cagcgtggtg gccgacgcgc gccggtacgc ccgcaccaag agccggcggc 16800
ggcgcatcgc ccggcggcac cggagcaccc ccgccatgcg cgcggcgcga gccttgctgc 16860
gcagggccag gcgcacggga cgcagggcca tgctcagggc ggccagacgc gcggcctccg 16920
gcagcagcag cgccggcagg acccgcagac gcgcggccac ggcggcggcg gcggccatcg 16980
ccagcatgtc ccgcccgcgg cgcggcaacg tgtactgggt gcgcgacgcc gccaccggtg 17040
tgcgcgtgcc cgtgcgcacc cgcccccctc gcacttgaag atgctgactt cgcgatgttg 17100
atgtgtccca gcggcgagga ggatgtccaa gcgcaaatac aaggaagaga tgctccaggt 17160
catcgcgcct gagatctacg gccccgcggc ggcggtgaag gaggaaagaa agccccgcaa 17220
actgaagcgg gtcaaaaagg acaaaaagga ggaggaagat gacggactgg tggagtttgt 17280
gcgcgagttc gccccccggc ggcgcgtgca gtggcgcggg cggaaagtga aaccggtgct 17340
gcggcccggc accacggtgg tcttcacgcc cggcgagcgt tccggctccg cctccaagcg 17400
ctcctacgac gaggtgtacg gggacgagga catcctcgag caggcggtcg agcgtctggg 17460
cgagtttgcg tacggcaagc gcagccgccc cgcgcccttg aaagaggagg cggtgtccat 17520
cccgctggac cacggcaacc ccacgccgag cctgaagccg gtgaccctgc agcaggtgct 17580
accgagcgcg gcgccgcgcc ggggcttcaa gcgcgagggc ggcgaggatc tgtacccgac 17640
catgcagctg atggtgccca agcgccagaa gctggaggac gtgctggagc acatgaaggt 17700
ggaccccgag gtgcagcccg aggtcaaggt gcggcccatc aagcaggtgg ccccgggcct 17760
gggcgtgcag accgtggaca tcaagatccc cacggagccc atggaaacgc agaccgagcc 17820
cgtgaagccc agcaccagca ccatggaggt gcagacggat ccctggatgc cagcaccagc 17880
ttccaccagc actcgccgaa gacgcaagta cggcgcggcc agcctgctga tgcccaacta 17940
cgcgctgcat ccttccatca tccccacgcc gggctaccgc ggcacgcgct tctaccgcgg 18000
ctacaccagc agccgccgcc gcaagaccac cacccgccgc cgtcgtcgca gccgccgcag 18060
cagcaccgcg acttccgcct tggtgcggag agtgtatcgc agcgggcgcg agcctctgac 18120
cctgccgcgc gcgcgctacc acccgagcat cgccatttaa ctaccgcctc ctacttgcag 18180
atatggccct cacatgccgc ctccgcgtcc ccattacggg ctaccgagga agaaagccgc 18240
gccgtagaag gctgacgggg aacgggctgc gtcgccatca ccaccggcgg cggcgcgcca 18300
tcagcaagcg gttgggggga ggcttcctgc ccgcgctgat ccccatcatc gccgcggcga 18360
tcggggcgat ccccggcata gcttccgtgg cggtgcaggc ctctcagcgc cactgagaca 18420
caaaaaagca tggatttgta ataaaaaaaa aaatggactg acgctcctgg tcctgtgatg 18480
tgtgttttta gatggaagac atcaattttt cgtccctggc accgcgacac ggcacgcggc 18540
cgtttatggg cacctggagc gacatcggca acagccaact gaacgggggc gccttcaatt 18600
ggagcagtct ctggagcggg cttaagaatt tcgggtccac gctcaaaacc tatggcaaca 18660
aggcgtggaa cagcagcaca gggcaggcgc tgagggaaaa gctgaaagaa cagaacttcc 18720
agcagaaggt ggttgatggc ctggcctcag gcatcaacgg ggtggttgac ctggccaacc 18780
aggccgtgca gaaacagatc aacagccgcc tggacgcggt cccgcccgcg gggtccgtgg 18840
agatgcccca ggtggaggag gagctgcctc ccctggacaa gcgcggcgac aagcgaccgc 18900
gtcccgacgc ggaggagacg ctgctgacgc acacggacga gccgcccccg tacgaggagg 18960
cggtgaaact gggcctgccc accacgcggc ccgtggcgcc tctggccacc ggagtgctga 19020
aacccagcag cagccagccc gcgaccctgg acttgcctcc gcctcgcccc tccacagtgg 19080
ctaagcccct gccgccggtg gccgtcgcgt cgcgcgcccc ccgaggccgc ccccaggcga 19140
actggcagag cactctgaac agcatcgtgg gtctgggagt gcagagtgtg aagcgccgcc 19200
gctgctatta aaagacactg tagcgcttaa cttgcttgtc tgtgtgtata tgtatgtccg 19260
ccgaccagaa ggaggagtgt gaagaggcgc gtcgccgagt tgcaagatgg ccaccccatc 19320
gatgctgccc cagtgggcgt acatgcacat cgccggacag gacgcttcgg agtacctgag 19380
tccgggtctg gtgcagttcg cccgcgccac agacacctac ttcagtctgg ggaacaagtt 19440
taggaacccc acggtggcgc ccacgcacga tgtgaccacc gaccgcagcc agcggctgac 19500
gctgcgcttc gtgcccgtgg accgcgagga caacacctac tcgtacaaag tgcgctacac 19560
gctggccgtg ggcgacaacc gcgtgctgga catggccagc acctactttg acatccgcgg 19620
cgtgctggac cggggcccta gcttcaaacc ctactctggc accgcctaca acagcctagc 19680
tcccaaggga gctcccaatt ccagccagtg ggagcaagca aaaacaggca atgggggaac 19740
tatggaaaca cacacatatg gtgtggcccc aatgggcgga gagaatatta caaaagatgg 19800
tcttcaaatt ggaactgacg ttacagcgaa tcagaataaa ccaatttatg ccgacaaaac 19860
atttcaacca gaaccgcaag taggagaaga aaattggcaa gaaactgaaa acttttatgg 19920
cggtagagct cttaaaaaag acacaaacat gaaaccttgc tatggctcct atgctagacc 19980
caccaatgaa aaaggaggtc aagctaaact taaagttgga gatgatggag ttccaaccaa 20040
agaattcgac atagacctgg ctttctttga tactcccggt ggcaccgtga acggtcaaga 20100
cgagtataaa gcagacattg tcatgtatac cgaaaacacg tatttggaaa ctccagacac 20160
gcatgtggta tacaaaccag gcaaggatga tgcaagttct gaaattaacc tggttcagca 20220
gtctatgccc aacagaccca actacattgg gttcagggac aactttatcg gtcttatgta 20280
ctacaacagc actggcaata tgggtgtgct tgctggtcag gcctcccagc tgaatgctgt 20340
ggttgatttg caagacagaa acaccgagct gtcctaccag ctcttgcttg actctttggg 20400
tgacagaacc cggtatttca gtatgtggaa ccaggcggtg gacagttatg accccgatgt 20460
gcgcatcatc gaaaaccatg gtgtggagga tgaattgcca aactattgct tccccttgga 20520
cggctctggc actaacgccg cataccaagg tgtgaaagta aaagatggtc aagatggtga 20580
tgttgagagt gaatgggaaa atgacgatac tgttgcagct cgaaatcaat tatgtaaagg 20640
taacattttc gccatggaga ttaatctcca ggctaacctg tggagaagtt tcctctactc 20700
gaacgtggcc ctgtacctgc ccgactccta caagtacacg ccgaccaacg tcacgctgcc 20760
gaccaacacc aacacctacg attacatgaa tggcagagtg acacctccct cgctggtaga 20820
cgcctacctc aacatcgggg cgcgctggtc gctggacccc atggacaacg tcaacccctt 20880
caaccaccac cgcaacgcgg gcctgcgcta ccgctccatg ctcctgggca acgggcgcta 20940
cgtgcccttc cacatccagg tgccccaaaa gtttttcgcc atcaagagcc tcctgctcct 21000
gcccgggtcc tacacctacg agtggaactt ccgcaaggac gtcaacatga tcctgcagag 21060
ctccctaggc aacgacctgc gcacggacgg ggcctccatc gccttcacca gcatcaacct 21120
ctacgccacc ttcttcccca tggcgcacaa caccgcctcc acgctcgagg ccatgctgcg 21180
caacgacacc aacgaccagt ccttcaacga ctacctctcg gcggccaaca tgctctaccc 21240
catcccggcc aacgccacca acgtgcccat ctccatcccc tcgcgcaact gggccgcctt 21300
ccgcggatgg tccttcacgc gcctgaagac ccgcgagacg ccctcgctcg gctccgggtt 21360
cgacccctac ttcgtctact cgggctccat cccctaccta gacggcacct tctacctcaa 21420
ccacaccttc aagaaggtct ccatcacctt cgactcctcc gtcagctggc ccggcaacga 21480
ccgcctcctg acgcccaacg agttcgaaat caagcgcacc gtcgacggag agggatacaa 21540
cgtggcccag tgcaacatga ccaaggactg gttcctggtc cagatgctgg cccactacaa 21600
catcggctac cagggcttct acgtgcccga gggctacaag gaccgcatgt actccttctt 21660
ccgcaacttc cagcccatga gccgccaggt cgtggacgag gtcaactaca aggactacca 21720
ggccgtcacc ctggcctacc agcacaacaa ctcgggcttc gtcggctacc tcgcgcccac 21780
catgcgccag ggccagccct accccgccaa ctacccctac ccgctcatcg gcaagagcgc 21840
cgtcgccagc gtcacccaga aaaagttcct ctgcgaccgg gtcatgtggc gcatcccctt 21900
ctccagcaac ttcatgtcca tgggcgcgct caccgacctc ggccagaaca tgctctacgc 21960
caactccgcc cacgcgctag acatgaattt cgaagtcgac cccatggatg agtccaccct 22020
tctctatgtt gtcttcgaag tcttcgacgt cgtccgagtg caccagcccc accgcggcgt 22080
catcgaagcc gtctacctgc gcacgccctt ctcggccggc aacgccacca cctaagccgc 22140
tcttgcttct tgcaagatga cggcgggctc cggcgagcag gagctcaggg ccatcctccg 22200
cgacctgggc tgcgggccct gcttcctggg caccttcgac aagcgcttcc ctggattcat 22260
ggccccgcac aagctggcct gcgccatcgt gaacacggcc ggccgcgaga ccgggggcga 22320
gcactggctg gccttcgcct ggaacccgcg ctcccacaca tgctacctct tcgacccctt 22380
cgggttctcg gacgagcgcc tcaagcagat ctaccagttc gagtacgagg gcctgctgcg 22440
tcgcagcgcc ctggccaccg aggaccgctg cgtcaccctg gaaaagtcca cccagaccgt 22500
gcagggtccg cgctcggccg cctgcgggct cttctgctgc atgttcctgc acgccttcgt 22560
gcactggccc gaccgcccca tggacaagaa ccccaccatg aacttactga cgggggtgcc 22620
caacggcatg ctccagtcgc cccaggtgga acccaccctg cgccgcaacc aggaagcgct 22680
ctaccgcttc ctcaatgccc actccgccta ctttcgctcc caccgcgcgc gcatcgagaa 22740
ggccaccgcc ttcgaccgca tgaatcaaga catgtaaaaa accggtgtgt gtatgtgaat 22800
gctttattca taataaacag cacatgttta tgccaccttc tctgaggctc tgactttatt 22860
tagaaatcga aggggttctg ccggctctcg gcatggcccg cgggcaggga tacgttgcgg 22920
aactggtact tgggcagcca cttgaactcg gggatcagca gcttgggcac ggggaggtcg 22980
gggaacgagt cgctccacag cttgcgcgtg agttgcaggg cgcccagcag gtcgggcgcg 23040
gagatcttga aatcgcagtt gggacccgcg ttctgcgcgc gagagttgcg gtacacgggg 23100
ttgcagcact ggaacaccat cagggccggg tgcttcacgc ttgccagcac cgtcgcgtcg 23160
gtgatgccct ccacgtccag atcctcggcg ttggccatcc cgaagggggt catcttgcag 23220
gtctgccgcc ccatgctggg cacgcagccg ggcttgtggt tgcaatcgca gtgcaggggg 23280
atcagcatca tctgggcctg ctcggagctc atgcccgggt acatggcctt catgaaagcc 23340
tccagctggc ggaaggcctg ctgcgccttg ccgccctcgg tgaagaagac cccgcaggac 23400
ttgctagaga actggttggt ggcgcagccg gcgtcgtgca cgcagcagcg cgcgtcgttg 23460
ttggccagct gcaccacgct gcgcccccag cggttctggg tgatcttggc ccggttgggg 23520
ttctccttca gcgcgcgctg cccgttctcg ctcgccacat ccatctcgat agtgtgctcc 23580
ttctggatca tcacggtccc gtgcaggcac cgcagcttgc cctcggcttc ggtgcagccg 23640
tgcagccaca gcgcgcagcc ggtgcactcc cagttcttgt gggcgatctg ggagtgcgag 23700
tgcacgaagc cctgcaggaa gcggcccatc atcgcggtca gggtcttgtt gctggtgaag 23760
gtcagcggga tgccgcggtg ctcctcgttc acatacaggt ggcagatgcg gcggtacacc 23820
tcgccctgct cgggcatcag ctggaaggcg gacttcaggt cgctctccac gcggtaccgg 23880
tccatcagca gcgtcatcac ttccatgccc ttctcccagg ccgaaacgat cggcaggctc 23940
agggggttct tcaccgccat tgtcatctta gtcgccgccg ccgaggtcag ggggtcgttc 24000
tcgtccaggg tctcaaacac tcgcttgccg tccttctcga tgatgcgcac ggggggaaag 24060
ctgaagccca cggccgccag ctcctcctcg gcctgccttt cgtcctcgct gtcctggctg 24120
atgtcttgca aaggcacatg cttggtcttg cggggtttct ttttgggcgg cagaggcggc 24180
ggcgatgtgc tgggagagcg cgagttctcg ttcaccacga ctatttcttc ttcttggccg 24240
tcgtccgaga ccacgcggcg gtaggcatgc ctcttctggg gcagaggcgg aggcgacggg 24300
ctctcgcggt tcggcgggcg gctggcagag ccccttccgc gttcgggggt gcgctcctgg 24360
cggcgctgct ctgactgact tcctccgcgg ccggccattg tgttctccta gggagcaaca 24420
acaagcatgg agactcagcc atcgtcgcca acatcgccat ctgcccccgc cgccaccgcc 24480
gacgagaacc agcagcagaa tgaaagctta accgccccgc cgcccagccc cacctccgac 24540
gccgcggccc cagacatgca agagatggag gaatccatcg agattgacct gggctacgtg 24600
acgcccgcgg agcacgagga ggagctggca gcgcgctttt cagccccgga agagaaccac 24660
caagagcagc cagagcagga agcagagaac gagcagaacc aggctgggca cgagcatggc 24720
gactacctga gcggggcaga ggacgtgctc atcaagcatc tggcccgcca atgcatcatc 24780
gtcaaggacg cgctgctcga ccgcgccgag gtgcccctca gcgtggcgga gctcagccgc 24840
gcctacgagc gcaacctctt ctcgccgcgc gtgcccccca agcgccagcc caacggcacc 24900
tgtgagccca acccgcgcct caacttctac ccggtcttcg cggtgcccga ggccctggcc 24960
acctaccacc tctttttcaa gaaccaaagg atccccgtct cctgccgcgc caaccgcacc 25020
cgcgccgacg ccctgctcaa cctgggcccc ggcgcccgcc tacctgatat cacctccttg 25080
gaagaggttc ccaagatctt cgagggtctg ggcagcgacg agactcgggc cgcgaacgct 25140
ctgcaaggaa gcggagagga gcatgagcac cacagcgccc tggtggagtt ggaaggcgac 25200
aacgcgcgcc tggcggtcct caagcgcacg gtcgagctga cccacttcgc ctacccggcg 25260
ctcaacctgc cccccaaggt catgagcgcc gtcatggacc aggtgctcat caagcgcgcc 25320
tcgcccctct cggaggagga gatgcaggac cccgagagtt cggacgaggg caagcccgtg 25380
gtcagcgacg agcagctggc gcgctggctg ggagcgagta gcacccccca gagcctggaa 25440
gagcggcgca agctcatgat ggccgtggtc ctggtgaccg tggagctgga gtgtctgcgc 25500
cgcttctttg ccgacgcgga gaccctgcgc aaggtcgagg agaacctgca ctacctcttc 25560
aggcacgggt tcgtgcgcca ggcctgcaag atctccaacg tggagctgac caacctggtc 25620
tcctacatgg gcatcctgca cgagaaccgc ctggggcaaa acgtgctgca caccaccctg 25680
cgcggggagg cccgccgcga ctacatccgc gactgcgtct acctgtacct ctgccacacc 25740
tggcagacgg gcatgggcgt gtggcagcag tgcctggagg agcagaacct gaaagagctc 25800
tgcaagctcc tgcagaagaa cctcaaggcc ctgtggaccg ggttcgacga gcgtaccacc 25860
gcctcggacc tggccgacct catcttcccc gagcgcctgc ggctgacgct gcgcaacggg 25920
ctgcccgact ttatgagcca aagcatgttg caaaactttc gctctttcat cctcgaacgc 25980
tccgggatcc tgcccgccac ctgctccgcg ctgccctcgg acttcgtgcc gctgaccttc 26040
cgcgagtgcc ccccgccgct ctggagccac tgctacttgc tgcgcctggc caactacctg 26100
gcctaccact cggacgtgat cgaggacgtc agcggcgagg gtctgctgga gtgccactgc 26160
cgctgcaacc tctgcacgcc gcaccgctcc ctggcctgca acccccagct gctgagcgag 26220
acccagatca tcggcacctt cgagttgcaa ggccccggcg acggcgaggg caaggggggt 26280
ctgaaactca ccccggggct gtggacctcg gcctacttgc gcaagttcgt gcccgaggac 26340
taccatccct tcgagatcag gttctacgag gaccaatccc agccgcccaa ggccgagctg 26400
tcggcctgcg tcatcaccca gggggccatc ctggcccaat tgcaagccat ccagaaatcc 26460
cgccaagaat ttctgctgaa aaagggccac ggggtctact tggaccccca gaccggagag 26520
gagctcaacc ccagcttccc ccaggatgcc ccgaggaagc agcaagaagc tgaaagtgga 26580
gctgccgccg ccggaggatt tggaggaaga ctgggagagc agtcaggcag aggaggagga 26640
gatggaagac tgggacagca ctcaggcaga ggaggacagc ctgcaagaca gtctggagga 26700
ggaagacgag gtggaggagg cagaggaaga agcagccgcc gccagaccgt cgtcctcggc 26760
ggagaaagca agcagcacgg ataccatctc cgctccgggt cggggtcgcg gcggccgggc 26820
ccacagtagg tgggacgaga ccgggcgctt cccgaacccc accacccaga ccggtaagaa 26880
ggagcggcag ggatacaagt cctggcgggg gcacaaaaac gccatcgtct cctgcttgca 26940
agcctgcggg ggcaacatct ccttcacccg gcgctacctg ctcttccacc gcggggtgaa 27000
cttcccccgc aacatcttgc attactaccg tcacctccac agcccctact actgtttcca 27060
agaagaggca gaaacccagc agcagcagaa aaccagcggc agcagcagct agaaaatcca 27120
cagcggcggc aggtggactg aggatcgcgg cgaacgagcc ggcgcagacc cgggagctga 27180
ggaaccggat ctttcccacc ctctatgcca tcttccagca gagtcggggg caggagcagg 27240
aactgaaagt caagaaccgt tctctgcgct cgctcacccg cagttgtctg tatcacaaga 27300
gcgaagacca acttcagcgc actctcgagg acgccgaggc tctcttcaac aagtactgcg 27360
cgctcactct taaagagtag cccgcgcccg cccacacacg gaaaaaggcg ggaattacgt 27420
caccacctgc gcccttcgcc cgaccatcat gagcaaagag attcccacgc cttacatgtg 27480
gagctaccag ccccagatgg gcctggccgc cggcgccgcc caggactact ccacccgcat 27540
gaactggctc agtgccgggc ccgcgatgat ctcacgggtg aatgacatcc gcgcccaccg 27600
aaaccagata ctcctagaac agtcagcgat caccgccacg ccccgccatc accttaatcc 27660
gcgtaattgg cccgccgccc tggtgtacca ggaaattccc cagcccacga ccgtactact 27720
tccgcgagac gcccaggccg aagtccagct gactaactca ggtgtccagc tggccggcgg 27780
cgccgccctg tgtcgtcacc gccccgctca gggtataaag cggctggtga tccgaggcag 27840
aggcacacag ctcaacgacg aggtggtgag ctcttcgctg ggtctgcgac ctgacggagt 27900
cttccaactc gccggatcgg ggagatcttc cttcacgcct cgtcaggccg tcctgacttt 27960
ggagagttcg tcctcgcagc cccgctcggg cggcatcggc actctccagt tcgtggagga 28020
gttcactccc tcggtctact tcaacccctt ctccggctcc cccggccact acccggacga 28080
gttcatcccg aacttcgacg ccatcagcga gtcggtggac ggctacgatt gaatgtccca 28140
tggtggcgca gctgacctag ctcggcttcg acacctggac cactgccgcc gcttccgctg 28200
cttcgctcgg gatctcgccg agtttgccta ctttgagctg cccgaggagc accctcaggg 28260
cccagcccac ggagtgcgga tcatcgtcga agggggcctc gactcccacc tgcttcggat 28320
cttcagccag cgaccgatcc tggtcgagcg cgaacaagga cagacccttc ttactttgta 28380
ctgcatctgc aaccaccccg gcctgcatga aagtctttgt tgtctgctgt gtactgagta 28440
taataaaagc tgagatcagc gactactccg gactcgattg tggtgttcct gctatcaacc 28500
ggtccctgtt cttcaccggg aacgagaccg agctccagct ccagtgtaag ccccacaaga 28560
agtacctcac ctggctgttc cagggctccc cgatcgccgt tgtcaaccac tgcgacaacg 28620
acggagtcct gctgagcggc cctgccaacc ttactttttc cacccgcaga agcaagctcc 28680
agctcttcca acccttcctc cccgggacct atcagtgcgt ctcaggaccc tgccatcaca 28740
ccttccacct gatcccgaat accacagcgc cgctccccgc tactaacaac caaactaccc 28800
accaacgcca ccgtcgcgac ctttcctctg aatctaatac cactaccgga ggtggcttct 28860
gctgttagtg ctcccccgtc ccgtcgaccc ccggtccccc actcagtccc ccgaggaggt 28920
tcgcaaatgc aaattccaag aaccctggaa attcctcaaa tgctaccgcc aaaaatcaga 28980
catgcatccc agctggatca tgatcattgg gatcgtgaac attctggcct gcaccctcat 29040
ctcctttgtg atttacccct gctttgactt tggttggaac tcgccagagg cgctctatct 29100
cccgcctgaa cctgacacac caccacagca gcaacctcag gcacacgcac taccaccacc 29160
acagcctagg ccacaataca tgcccatatt agactatgag gccgagccac agcgacccat 29220
gctccccgct attagttact tcaatctaac cggcggagat gactgaccca ctggccaata 29280
acaacgtcaa cgaccttctc ctggacatgg acggccgcgc ctcggagcag cgactcgccc 29340
aacttcgcat tcgtcagcag caggagagag ccgtcaagga gctgcaggac ggcatagcca 29400
tccaccagtg caagagaggc atcttctgcc tggtgaaaca ggccaagatc tcctacgagg 29460
tcacccagac cgaccatcgc ctctcctacg agctcctgca gcagcgccag aagttcacct 29520
gcctggtcgg agtcaacccc atcgtcatca cccagcagtc gggcgatacc aaggggtgca 29580
tccactgctc ctgcgactcc cccgactgcg tccacactct gatcaagacc ctctgcggcc 29640
tccgcgacct cctccccatg aactaatcac ccccttatcc agtgaaataa agatcatatt 29700
gatgatgatt taaataaaaa aaataatcat ttgatttgaa ataaagatac aatcatattg 29760
atgatttgag tttaacaaaa ataaagaatc acttacttga aatctgatac caggtctctg 29820
tccatgtttt ctgccaacac cacctcactc ccctcttccc agctctggta ctgcaggccc 29880
cggcgggctg caaacttcct ccacacgctg aaggggatgt caaattcctc ctgtccctca 29940
atcttcattt tatcttctat cagatgtcca aaaagcgcgt ccgggtggat gatgacttcg 30000
accccgtcta cccctacgat gcagacaacg caccgaccgt gcccttcatc aaccccccct 30060
tcgtctcttc agatggattc caagagaagc ccctgggggt gttgtccctg cgactggctg 30120
accccgtcac caccaagaac ggggaaatca ccctcaagct gggagagggg gtggacctcg 30180
actcgtcggg aaaactcatc tccaacacgg ccaccaaggc cgccgcccct ctcagtattt 30240
caaacaacac catttccctt aaaactgctg cccctttcta caacaacaat ggaactttaa 30300
gcctcaatgt ctccacacca ttagcagtat ttcccacatt taacacttta ggcataagtc 30360
ttggaaacgg tcttcagact tcaaataagt tgttgactgt acaactaact catcctctta 30420
cattcagctc aaatagcatc acagtaaaaa cagacaaagg gctatatatt aactccagtg 30480
gaaacagagg acttgaggct aatataagcc taaaaagagg actagttttt gacggtaatg 30540
ctattgcaac atatattgga aatggcttag actatggatc ttatgatagt gatggaaaaa 30600
caagacccgt aattaccaaa attggagcag gattaaattt tgatgctaac aaagcaatag 30660
ctgtcaaact aggcacaggt ttaagttttg actccgctgg tgccttgaca gctggaaaca 30720
aacaggatga caagctaaca ctttggacta cccctgaccc aagccctaat tgtcaattac 30780
tttcagacag agatgccaaa tttactctct gtcttacaaa atgcggtagt caaatactag 30840
gcactgtggc agtggcggct gttactgtag gatcagcact aaatccaatt aatgacacag 30900
tcaaaagcgc catagttttc cttagatttg attccgatgg tgtactcatg tcaaactcat 30960
caatggtagg tgattactgg aactttaggg agggacagac cactcaaagt gtagcctata 31020
caaatgctgt gggattcatg ccaaatatag gtgcatatcc aaaaacccaa agtaaaacac 31080
ctaaaaatag catagtcagt caggtatatt taactggaga aactactatg ccaatgacac 31140
taaccataac tttcaatggc actgatgaaa aagacacaac cccagttagc acctactcta 31200
tgacttttac atggcagtgg actggagact ataaggacaa aaatattacc tttgctacca 31260
actcattctc tttttcctac atcgcccagg aataatccca cccagcaagc caaccccttt 31320
tcccaccacc tttgtctata tggaaactct gaaacagaaa aataaagttc aagtgtttta 31380
ttgaatcaac agttttacag gactcgagca gttatttttc ctccaccctc ccaggacatg 31440
gaatacacca ccctctcccc ccgcacagcc ttgaacatct gaatgccatt ggtgatggac 31500
atgcttttgg tctccacgtt ccacacagtt tcagagcgag ccagtctcgg atcggtcagg 31560
gagatgaaac cctccgggca ctcccgcatc tgcacctcac agctcaacag ctgaggattg 31620
tcctcggtgg tcgggatcac ggttatctgg aagaagcaga agagcggcgg tgggaatcat 31680
agtccgcgaa cgggatcggc cggtggtgtc gcatcaggcc ccgcagcagt cgctgccgcc 31740
gccgctccgt caagctgctg ctcagggggt tcgggtccag ggactccctc agcatgatgc 31800
ccacggccct cagcatcagt cgtctggtgc ggcgggcgca gcagcgcatg cgaatctcgc 31860
tcaggtcact gcagtacgtg caacacagga ccaccaggtt gttcaacagt ccatagttca 31920
acacgctcca gccgaaactc atcgcgggaa ggatgctacc cacgtggccg tcgtaccaga 31980
tcctcaggta aatcaagtgg cgctccctcc agaagacgct gcccatgtac atgatctcct 32040
tgggcatgtg gcggttcacc acctcccggt accacatcac cctctggttg aacatgcagc 32100
cccggatgat cctgcggaac cacagggcca gcaccgcccc gcccgccatg cagcgaagag 32160
accccggatc ccggcaatga caatggagga cccaccgctc gtacccgtgg atcatctggg 32220
agctgaacaa gtctatgttg gcacagcaca ggcatatgct catgcatctc ttcagcactc 32280
tcagctcctc gggggtcaaa accatatccc agggcacggg gaactcttgc aggacagcga 32340
accccgcaga acagggcaat cctcgcacat aacttacatt gtgcatggac agggtatcgc 32400
aatcaggcag caccgggtga tcctccacca gagaagcgcg ggtctcggtc tcctcacagc 32460
gtggtaaggg ggccggccga tacgggtgat ggcgggacgc ggctgatcgt gttctcgacc 32520
gtgtcatgat gcagttgctt tcggacattt tcgtacttgc tgtagcagaa cctggtccgg 32580
gcgctgcaca ccgatcgccg gcggcggtct cggcgcttgg aacgctcggt gttaaagttg 32640
taaaacagcc actctctcag accgtgcagc agatctaggg cctcaggagt gatgaagatc 32700
ccatcatgcc tgatagctct gatcacatcg accaccgtgg aatgggccag gcccagccag 32760
atgatgcaat tttgttgggt ttcggtgacg gcgggggagg gaagaacagg aagaaccatg 32820
attaactttt aatccaaacg gtctcggagc acttcaaaat gaaggtcacg gagatggcac 32880
ctctcgcccc cgctgtgttg gtggaaaata acagccaggt caaaggtgat acggttctcg 32940
agatgttcca cggtggcttc cagcaaagcc tccacgcgca catccagaaa caagacaata 33000
gcgaaagcgg gagggttctc taattcctca accatcatgt tacactcctg caccatcccc 33060
agataatttt catttttcca gccttgaatg attcgaacta gttcctgagg taaatccaag 33120
ccagccatga taaaaagctc gcgcagagca ccctccaccg gcattcttaa gcacaccctc 33180
ataattccaa gatattctgc tcctggttca cctgcagcag attgacaagc ggaatatcaa 33240
aatctctgcc gcgatccctg agctcctccc tcagcaataa ctgtaagtac tctttcatat 33300
cgtctccgaa atttttagcc ataggacccc caggaataag agaagggcaa gccacattac 33360
agataaaccg aagtcccccc cagtgagcat tgccaaatgt aagattgaaa taagcatgct 33420
ggctagaccc ggtgatatct tccagataac tggacagaaa atcgggtaag caatttttaa 33480
gaaaatcaac aaaagaaaaa tcttccaggt gcacgtttag ggcctcggga acaacgatgg 33540
agtaagtgca aggggtgcgt tccagcatgg ttagttagct gatctgtaaa aaaacaaaaa 33600
ataaaacatt aaaccatgct agcctggcga acaggtgggt aaatcgttct ctccagcacc 33660
aggcaggcca cggggtctcc ggcgcgaccc tcgtaaaaat tgtcgctatg attgaaaacc 33720
atcacagaga gacgttcccg gtggccggcg tgaatgattc gagaagaagc atacaccccc 33780
ggaacattgg agtccgtgag tgaaaaaaag cggccgagga agcaatgagg cactacaacg 33840
ctcactctca agtccagcaa agcgatgcca tgcggatgaa gcacaaaatt ttcaggtgcg 33900
taaaaaatgt aattactccc ctcctgcaca ggcagcgaag ctcccgatcc ctccagatac 33960
acatacaaag cctcagcgtc catagcttac cgagcggcag cagcagcggc acacaacagg 34020
cgcaagagtc agagaaaaga ctgagctcta acctgtccgc ccgctctctg ctcaatatat 34080
agccccagat ctacactgac gtaaaggcca aagtctaaaa atacccgcca aataatcaca 34140
cacgcccagc acacgcccag aaaccggtga cacactcaga aaaatacgcg cacttcctca 34200
aacggccaaa ctgccgtcat ttccgggttc ccacgctacg tcatcaaaac acgactttca 34260
aattccgtcg accgttaaaa acatcacccg ccccgcccct aacggtcgcc gctcccgcag 34320
ccaatcacct tcctccctcc ccaaattcaa acagctcatt tgcatattaa cgcgcaccaa 34380
aagtttgagg tatattattg atgatg 34406
<210> 7
<211> 34058
<212> DNA
<213> 人工序列
<220>
<223> 运载体
<220>
<223> C7 010 CMV-HIV gp140 AE1
<400> 7
catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60
atttggggag ggaggaaggt gattggccga gagacgggcg accgttaggg gcggggcggg 120
tgacgttttg atgacgtggc cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180
gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240
ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300
aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360
ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420
ttccgcgtac ggtgtcaaag tccggtgttt ttacgtacga tatcatttcc ccgaaagtgc 480
cacctgaccg taactataac ggtcctaagg tagcgaaagc tcagatctcc cgatccccta 540
tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagta tctgctccct 600
gcttgtgtgt tggaggtcgc tgagtagtgc gcgagcaaaa tttaagctac aacaaggcaa 660
ggcttgaccg acaattgcat gaagaatctg cttagggtta ggcgttttgc gctgcttcgc 720
gatgtacggg ccagatatac gcgttgacat tgattattga ctagttatta atagtaatca 780
attacggggt cattagttca tagcccatat atggagttcc gcgttacata acttacggta 840
aatggcccgc ctggctgacc gcccaacgac ccccgcccat tgacgtcaat aatgacgtat 900
gttcccatag taacgccaat agggactttc cattgacgtc aatgggtgga gtatttacgg 960
taaactgccc acttggcagt acatcaagtg tatcatatgc caagtacgcc ccctattgac 1020
gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt acatgacctt atgggacttt 1080
cctacttggc agtacatcta cgtattagtc atcgctatta ccatggtgat gcggttttgg 1140
cagtacatca atgggcgtgg atagcggttt gactcacggg gatttccaag tctccacccc 1200
attgacgtca atgggagttt gttttggcac caaaatcaac gggactttcc aaaatgtcgt 1260
aacaactccg ccccattgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata 1320
agcagagctc gtttagtgaa ccgtcagatc actagaagct ttattgcggt agtttatcac 1380
agttaaattg ctaacgcagt cagtgcttct gacacaacag tctcgaactt aagctgcaga 1440
agttggtcgt gaggcactgg gcaggtaagt atcaaggtta caagacaggt ttaaggagac 1500
caatagaaac tgggcttgtc gagacagaga agactcttgc gtttctgata ggcacctatt 1560
ggtcttactg acatccactt tgcctttctc tccacaggtg tccactccca gttcaattac 1620
agctcttaaa aggctagagt acttaatacg actcactata ggctagcatg agagtgaagg 1680
ggacacagat gaattggcca aacttgtgga aatgggggac tttgatcctt gggttggtga 1740
tcatgtgtag tgcctcagac aacttgtggg ttacagttta ttatggagtt cctgtgtgga 1800
gagatgcaaa taccacccta ttttgtgcat cagatgccaa agcacatgag acagaagtgc 1860
acaatgtctg ggccacatat gcctgtgtac ccacagatcc caacccacaa gaaataccca 1920
tggaaaatgt gacagaaaat tttaacatgt ggaaaaataa catggtagag caaatgcagg 1980
aggatgtaat cagtttatgg gatcaaagtc taaagccatg tgtaaagtta actcctctct 2040
gcgttacttt aatttgtacc aatgctaact tgaccaagat caacagtacc aatagcgggc 2100
ctaaagtaat aggaaatgta acagatgaag taagaaactg ttcttttaat atgaccacat 2160
tactaacaga taagaagcaa aaggtttatg cactttttta taagcttgat atagtaccaa 2220
ttgataatag taatagtagt gagtatagat taataaattg taatacttca gtcattaagc 2280
aggcttgtcc aaagatatcc tttgatccaa ttcctataca ttattgtact ccagctggtt 2340
atgcgatttt aaaatgtaat gataagaatt tcaatgggac agggccatgt aaaaatgtca 2400
gctcagtaca gtgcacacat ggaattaagc cagtggtctc aactcaatta ctgttaaatg 2460
gcagtctagc agaagaagag ataataatca gatctgaaaa tctcacaaac aatgccaaaa 2520
ccataatagt gcaccttaat aaggctgtag aaatcaattg taccagaccc tccaacaata 2580
caagaacaag tataagaata ggaccaggac aaatatttta tagaacagga gacataatag 2640
gagatataag acaagcatat tgtgaaatta atggaacaaa atggaatgaa actttaagac 2700
aggtagcaaa aaaattaaaa gagcaattta ataacacaat aaaattccag ccaccctcag 2760
gaggagatct agaaattaca atgcttcatt ttaattgtag aggggaattt ttctattgca 2820
atacaacaaa actgttcaat agtacttggg aaagaaatga gaccataaaa gggggtaatg 2880
gcaatggcaa tgacactatc atacttccat gcaggataaa gcaaatcata aacatgtggc 2940
aaggagcagg acaagcaatg tatgctcctc ccatcagtgg aataattaac tgtgtatcaa 3000
atattacagg aatactattg acaagagatg gtggtaatac taatgaaact gccgagatct 3060
tcagacctgg aggaggaaat ataaaggaca attggagaag tgaattatat aaatataaag 3120
tagtacaaat tgaaccacta ggagtagcac ccaccaaggc aaagctgacg gtacaggcca 3180
gacaattatt gtctggtata gtgcaacagc aaagcaattt gctgagggct atagaggcgc 3240
agcagcatat gttgcaactc acagtctggg gcattaaaca gctccaggca agaatcctgg 3300
ctgtggaaag ctacctaaag catcaacagt tcctaggact ttggggctgc tctaacaaaa 3360
ttatctgcac cactgctgta ccctggaatt cctcttggag taataaatct tatgatgaga 3420
tttgggaaaa tatgacatgg atagaatggg agagagaaat tggcaattac acaaaccaaa 3480
tatatgatat acttacaaaa tcgcaggaac agcaggacaa aaatgaaaag gaactgttgg 3540
aattggatca atgggcaagt ctgtggaatt ggtttagcat aacaaaatgg ctgtggtaat 3600
gtacaagtaa agcggccgcc actgtgctgg atgatccgag ctcggtacct ctagagtcga 3660
cccgggcggc caaaccgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt 3720
gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc 3780
taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt 3840
ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggat 3900
gcggtgggct ctatggcttc tgaggcggaa agaaccagca gatctgcaga tctgaattca 3960
tctatgtcgg gtgcggagaa agaggtaatg aaatggcatt atgggtatta tgggtctgca 4020
ttaatgaatc ggccagatat cgatatgctg gccaccgtgc atgtgacctc gcacccccgc 4080
aagacatggc ccgagttcga gcacaacgtc atgacccgat gcaatgtgca cctggggtcc 4140
cgccgaggca tgttcatgcc ctaccagtgc aacatgcaat ttgtgaaggt gctgctggag 4200
cccgatgcca tgtccagagt gagcctgacg ggggtgtttg acatgaatgt ggagctgtgg 4260
aaaattctga gatatgatga atccaagacc aggtgccggg cctgcgaatg cggaggcaag 4320
cacgccaggc ttcagcccgt gtgtgtggag gtgacggagg acctgcgacc cgatcatttg 4380
gtgttgtcct gcaacgggac ggagttcggc tccagcgggg aagaatctga ctagagtgag 4440
tagtgtttgg gggaggtgga gggcttgtat gaggggcaga atgactaaaa tctgtgtttt 4500
tctgtgtgtt gcagcagcat gagcggaagc gcctcctttg agggaggggt attcagccct 4560
tatctgacgg ggcgtctccc ctcctgggcg ggagtgcgtc agaatgtgat gggatccacg 4620
gtggacggcc ggcccgtgca gcccgcgaac tcttcaaccc tgacctacgc gaccctgagc 4680
tcctcgtccg tggacgcagc tgccgccgca gctgctgctt ccgccgccag cgccgtgcgc 4740
ggaatggccc tgggcgccgg ctactacagc tctctggtgg ccaactcgac ttccaccaat 4800
aatcccgcca gcctgaacga ggagaagctg ctgctgctga tggcccagct cgaggccctg 4860
acccagcgcc tgggcgagct gacccagcag gtggctcagc tgcaggcgga gacgcgggcc 4920
gcggttgcca cggtgaaaac caaataaaaa atgaatcaat aaataaacgg agacggttgt 4980
tgattttaac acagagtctt gaatctttat ttgatttttc gcgcgcggta ggccctggac 5040
caccggtctc gatcattgag cacccggtgg attttttcca ggacccggta gaggtgggct 5100
tggatgttga ggtacatggg catgagcccg tcccgggggt ggaggtagct ccattgcagg 5160
gcctcgtgct cgggggtggt gttgtaaatc acccagtcat agcaggggcg cagggcgtgg 5220
tgctgcacga tgtccttgag gaggagactg atggccacgg gcagcccctt ggtgtaggtg 5280
ttgacgaacc tgttgagctg ggagggatgc atgcgggggg agatgagatg catcttggcc 5340
tggatcttga gattggcgat gttcccgccc agatcccgcc gggggttcat gttgtgcagg 5400
accaccagca cggtgtatcc ggcgcacttg gggaatttgt catgcaactt ggaagggaag 5460
gcgtgaaaga atttggagac gcccttgtga ccgcccaggt tttccatgca ctcatccatg 5520
atgatggcga tgggcccgtg ggcggcggcc tgggcaaaga cgtttcgggg gtcggacaca 5580
tcgtagttgt ggtcctgggt gagctcgtca taggccattt taatgaattt ggggcggagg 5640
gtgcccgact gggggacgaa ggtgccctcg atcccggggg cgtagttgcc ctcgcagatc 5700
tgcatctccc aggccttgag ctcggagggg gggatcatgt ccacctgcgg ggcgatgaaa 5760
aaaacggttt ccggggcggg ggagatgagc tgggccgaaa gcaggttccg gagcagctgg 5820
gacttgccgc agccggtggg gccgtagatg accccgatga ccggctgcag gtggtagttg 5880
agggagagac agctgccgtc ctcgcggagg aggggggcca cctcgttcat catctcgcgc 5940
acatgcatgt tctcgcgcac gagttccgcc aggaggcgct cgccccccag cgagaggagc 6000
tcttgcagcg aggcgaagtt tttcagcggc ttgagyccgt cggccatggg cattttggag 6060
agggtctgtt gcaagagttc cagacggtcc cagagctcgg tgatgtgctc tagggcatct 6120
cgatccagca gacctcctcg tttcgcgggt tggggcgact gcgggagtag ggcaccaggc 6180
gatgggcgtc cagcgaggcc agggtccggt ccttccaggg tcgcagggtc cgcgtcagcg 6240
tggtctccgt cacggtgaag gggtgcgcgc cgggctgggc gcttgcgagg gtgcgcttca 6300
ggctcatccg gctggtcgag aaccgctccc ggtcggcgcc ctgcgcgtcg gccaggtagc 6360
aattgagcat gagttcgtag ttgagcgcct cggccgcgtg gcccttggcg cggagcttac 6420
ctttggaagt gtgtccgcag acgggacaga ggagggactt gagggcgtag agcttggggg 6480
cgaggaagac ggactcgggg gcgtaggcgt ccgcgccgca gctggcgcag acggtctcgc 6540
actccacgag ccaggtgagg tcgggccggt tggggtcaaa aacgaggttt cctccgtgct 6600
ttttgatgcg tttcttacct ctggtctcca tgagctcgtg tccccgctgg gtgacaaaga 6660
ggctgtccgt gtccccgtag accgacttta tgggccggtc ctcgagcggg gtgccgcggt 6720
cctcgtcgta gaggaacccc gcccactccg agacgaaggc ccgggtccag gccagcacga 6780
aggaggccac gtgggagggg tagcggtcgt tgtccaccag cgggtccacc ttctccaggg 6840
tatgcaagca catgtccccc tcgtccacat ccaggaaggt gattggcttg taagtgtagg 6900
ccacgtgacc gggggtcccg gccggggggg tataaaaggg ggcgggcccc tgctcgtcct 6960
cactgtcttc cggatcgctg tccaggagcg ccagctgttg gggtaggtat tccctctcga 7020
aggctggcat aacctcggca ctcaggttgt cagtttctag aaacgaggag gatttgatat 7080
tgacggtgcc gttggagacg cctttcatga gcccctcgtc catctggtca gaaaagacga 7140
tctttttgtt gtcgagcttg gtggcgaagg agccgtagag ggcgttggag aggagcttgg 7200
cgatggagcg catggtctgg ttcttttcct tgtcggcgcg ctccttggcg gcgatgttga 7260
gctgcacgta ctcgcgcgcc acgcacttcc attcggggaa gacggtggtg agctcgtcgg 7320
gcacgattct gacccgccag ccgcggttgt gcagggtgat gaggtccacg ctggtggcca 7380
cctcgccgcg caggggctcg ttggtccagc agaggcgccc gcccttgcgc gagcagaagg 7440
ggggcagcgg gtccagcatg agctcgtcgg gggggtcggc gtccacggtg aagatgccgg 7500
gcagaagctc ggggtcgaag tagctgatgc aggtgtccag atcgtccagc gccgcttgcc 7560
agtcgcgcac ggccagcgcg cgctcgtagg ggctgagggg cgtgccccag ggcatggggt 7620
gcgtgagcgc ggaggcgtac atgccgcaga tgtcgtagac gtagaggggc tcctcgagga 7680
cgccgatgta ggtggggtag cagcgccccc cgcggatgct ggcgcgcacg tagtcgtaca 7740
gctcgtgcga gggcgcgagg agccccgtgc cgaggttgga gcgttgcggc ttttcggcgc 7800
ggtagacgat ctggcggaag atggcgtggg agttggagga gatggtgggc ctctggaaga 7860
tgttgaagtg ggcgtggggc aggccgaccg agtccctgat gaagtgggcg taggagtcct 7920
gcagcttggc gacgagctcg gcggtgacga ggacgtccag ggcgcagtag tcgagggtct 7980
cttggatgat gtcgtacttg agctggccct tctgcttcca cagctcgcgg ttgagaagga 8040
actcttcgcg gtccttccag tactcttcga gggggaaccc gtcctgatcg gcacggtaag 8100
agcccaccat gtagaactgg ttgacggcct tgtaggcgca gcagcccttc tccacgggga 8160
gggcgtaagc ttgtgcggcc ttgcgcaggg aggtgtgggt gagggcgaag gtgtcgcgca 8220
ccatgacctt gaggaactgg tgcttgaagt cgaggtcgtc gcagccgccc tgctcccaga 8280
gctggaagtc cgtgcgcttc ttgtaggcgg ggttgggcaa agcgaaagta acatcgttga 8340
agaggatctt gcccgcgcgg ggcatgaagt tgcgagtgat gcggaaaggc tggggcacct 8400
cggcccggtt gttgatgacc tgggcggcga ggacgatctc gtcgaagccg ttgatgttgt 8460
gcccgacgat gtagagttcc acgaatcgcg ggcggccctt aacgtggggc agcttcttga 8520
gctcgtcgta ggtgagctcg gcggggtcgc tgagcccgtg ctgctcgagg gcccagtcgg 8580
cgacgtgggg gttggcgctg aggaaggaag tccagagatc cacggccagg gcggtctgca 8640
agcggtcccg gtactgacgg aactgctggc ccacggccat tttttcgggg gtgacgcagt 8700
agaaggtgcg ggggtcgccg tgccagcggt cccacttgag ctggagggcg aggtcgtggg 8760
cgagctcgac gagcggcggg tccccggaga gtttcatgac cagcatgaag gggacgagct 8820
gcttgccgaa ggaccccatc caggtgtagg tttccacatc gtaggtgagg aagagccttt 8880
cggtgcgagg atgcgagccg atggggaaga actggatctc ctgccaccag ttggaggaat 8940
ggctgttgat gtgatggaag tagaaatgcc gacggcgcgc cgagcactcg tgcttgtgtt 9000
tatacaagcg tccgcagtgc tcgcaacgct gcacgggatg cacgtgctgc acgagctgta 9060
cctgggttcc tttgacgagg aatttcagtg ggcagtggag cgctggcggc tgcatctggt 9120
gctgtactac gtcctggcca tcggcgtggc catcgtctgc ctcgatggtg gtcatgctga 9180
cgagcccgcg cgggaggcag gtccagactt cggctcggac gggtcggaga gcgaggacga 9240
gggcgcgcag gccggagctg tccagggtcc tgagacgctg cggagtcagg tcagtgggca 9300
gcggcggcgc gcggttgact tgcaggagct tttccagggc gcgcgggagg tccagatggt 9360
acttgatctc cacggcgccg ttggtggcga cgtccacggc ttgcagggtc ccgtgcccct 9420
ggggcgccac caccgtgccc cgtttcttct tgggcgctgc ttccatgccg gtcagaagcg 9480
gcggcgagga cgcgcgccgg gcggcagggg cggctcggga cccggaggca ggggcggcag 9540
gggcacgtcg gcgccgcgcg cgggcaggtt ctggtactgc gcccggagaa gactggcgtg 9600
agcgacgacg cgacggttga cgtcctggat ctgacgcctc tgggtgaagg ccacgggacc 9660
cgtgagtttg aacctgaaag agagttcgac agaatcaatc tcggtatcgt tgacggcggc 9720
ctgccgcagg atctcttgca cgtcgcccga gttgtcctgg taggcgatct cggtcatgaa 9780
ctgctcgatc tcctcctcct gaaggtctcc gcggccggcg cgctcgacgg tggccgcgag 9840
gtcgttggag atgcggccca tgagctgcga gaaggcgttc atgccggcct cgttccagac 9900
gcggctgtag accacggctc cgtcggggtc gcgcgcgcgc atgaccacct gggcgaggtt 9960
gagctcgacg tggcgcgtga agaccgcgta gttgcagagg cgctggtaga ggtagttgag 10020
cgtggtggcg atgtgctcgg tgacgaagaa gtacatgatc cagcggcgga gcggcatctc 10080
gctgacgtcg cccagggctt ccaagcgctc catggcctcg tagaagtcca cggcgaagtt 10140
gaaaaactgg gagttgcgcg ccgagacggt caactcctcc tccagaagac ggatgagctc 10200
agcgatggtg gcgcgcacct cgcgctcgaa ggccccgggg ggctcctctt cttccatctc 10260
ttcctcctcc actaacatct cttctacttc ctcctcagga ggcggcggcg ggggaggggc 10320
cctgcgtcgc cggcggcgca cgggcagacg gtcgatgaag cgctcgatgg tctccccgcg 10380
ccggcgacgc atggtctcgg tgacggcgcg cccgtcctcg cggggccgca gcgtgaagac 10440
gccgccgcgc atctccaggt ggccgccggg ggggtctccg ttgggcaggg agagggcgct 10500
gacgatgcat cttatcaatt ggcccgtagg gactccgcgc aaggacctga gcgtctcgag 10560
atccacggga tccgaaaacc gctgaacgaa ggcttcgagc cagtcgcagt cgcaaggtag 10620
gctgagcccg gtttcttgtt cttcggggat ttcgggaggc gggcgggcga tgctgctggt 10680
gatgaagttg aagtaggcgg tcctgagacg gcggatggtg gcgaggagca ccaggtcctt 10740
gggcccggct tgctggatgc gcagacggtc ggccatgccc caggcgtggt cctgacacct 10800
ggcgaggtcc ttgtagtagt cctgcatgag ccgctccacg ggcacctcct cctcgcccgc 10860
gcggccgtgc atgcgcgtga gcccgaaccc gcgctggggc tggacgagcg ccaggtcggc 10920
gacgacgcgc tcggcgagga tggcctgctg tatctgggtg agggtggtct ggaagtcgtc 10980
gaagtcgacg aagcggtggt aggctccggt gttgatggta taggagcagt tggccatgac 11040
ggaccagttg acggtctggt ggccgggtcg cacgagctcg tggtacttga ggcgcgagta 11100
ggcgcgcgtg tcgaagatgt agtcgttgca ggtgcgcacg aggtactggt atccgacgag 11160
gaagtgcggc ggcggctggc ggtagagcgg ccatcgctcg gtggcggggg cgccgggcgc 11220
gaggtcctcg agcatgaggc ggtggtagcc gtagatgtac ctggacatcc aggtgatgcc 11280
ggcggcggtg gtggaggcgc gcgggaactc gcggacgcgg ttccagatgt tgcgcagcgg 11340
caggaagtag ttcatggtgg ccgcggtctg gcccgtgagg cgcgcgcagt cgtggatgct 11400
ctagacatac gggcaaaaac gaaagcggtc agcggctcga ctccgtggcc tggaggctaa 11460
gcgaacgggt tgggctgcgc gtgtaccccg gttcgaatct cgaatcaggc tggagccgca 11520
gctaacgtgg tactggcact cccgtctcga cccaagcctg ctaacgaaac ctccaggata 11580
cggaggcggg tcgttttttg gccttggtcg ctggtcatga aaaactagta agcgcggaaa 11640
gcgaccgccc gcgatggctc gctgccgtag tctggagaaa gaatcgccag ggttgcgttg 11700
cggtgtgccc cggttcgagc ctcagcgctc ggcgccggcc ggattccgcg gctaacgtgg 11760
gcgtggctgc cccgtcgttt ccaagacccc ttagccagcc gacttctcca gttacggagc 11820
gagcccctct ttttcttgtg tttttgccag atgcatcccg tactgcggca gatgcgcccc 11880
caccctccac ctcaaccgcc cctaccgccg cagcagcagc aacagccggc gcttctgccc 11940
ccgccccagc agcagccagc cactaccgcg gcggccgccg tgagcggagc cggcgttcag 12000
tatgacctgg ccttggaaga gggcgagggg ctggcgcggc tgggggcgtc gtcgccggag 12060
cggcacccgc gcgtgcagat gaaaagggac gctcgcgagg cctacgtgcc caagcagaac 12120
ctgttcagag acaggagcgg cgaggagccc gaggagatgc gcgcctcccg cttccacgcg 12180
gggcgggagc tgcggcgcgg cctggaccga aagcgggtgc tgagggacga ggatttcgag 12240
gcggacgagc tgacggggat cagccccgcg cgcgcgcacg tggccgcggc caacctggtc 12300
acggcgtacg agcagaccgt gaaggaggag agcaacttcc aaaaatcctt caacaaccac 12360
gtgcgcacgc tgatcgcgcg cgaggaggtg accctgggcc tgatgcacct gtgggacctg 12420
ctggaggcca tcgtgcagaa ccccacgagc aagccgctga cggcgcagct gtttctggtg 12480
gtgcagcaca gtcgggacaa cgagacgttc agggaggcgc tgctgaatat caccgagccc 12540
gagggccgct ggctcctgga cctggtgaac attctgcaga gcatcgtggt gcaggagcgc 12600
gggctgccgc tgtccgagaa gctggcggct atcaacttct cggtgctgag cctgggcaag 12660
tactacgcta ggaagatcta caagaccccg tacgtgccca tagacaagga ggtgaagatc 12720
gacgggtttt acatgcgcat gaccctgaaa gtgctgaccc tgagcgacga tctgggggtg 12780
taccgcaacg acaggatgca ccgcgcggtg agcgccagcc gccggcgcga gctgagcgac 12840
caggagctga tgcacagcct gcagcgggcc ctgaccgggg ccgggaccga gggggagagc 12900
tactttgaca tgggcgcgga cctgcgctgg cagcccagcc gccgggcctt ggaagctgcc 12960
ggcggttccc cctacgtgga ggaggtggac gatgaggagg aggagggcga gtacctggaa 13020
gactgatggc gcgaccgtat ttttgctaga tgcagcaaca gccaccgcct cctgatcccg 13080
cgatgcgggc ggcgctgcag agccagccgt ccggcattaa ctcctcggac gattggaccc 13140
aggccatgca acgcatcatg gcgctgacga cccgcaatcc cgaagccttt agacagcagc 13200
ctcaggccaa ccggctctcg gccatcctgg aggccgtggt gccctcgcgc tcgaacccca 13260
cgcacgagaa ggtgctggcc atcgtgaacg cgctggtgga gaacaaggcc atccgcggcg 13320
acgaggccgg gctggtgtac aacgcgctgc tggagcgcgt ggcccgctac aacagcacca 13380
acgtgcagac gaacctggac cgcatggtga ccgacgtgcg cgaggcggtg tcgcagcgcg 13440
agcggttcca ccgcgagtcg aacctgggct ccatggtggc gctgaacgcc ttcctgagca 13500
cgcagcccgc caacgtgccc cggggccagg aggactacac caacttcatc agcgcgctgc 13560
ggctgatggt ggccgaggtg ccccagagcg aggtgtacca gtcggggccg gactacttct 13620
tccagaccag tcgccagggc ttgcagaccg tgaacctgag ccaggctttc aagaacttgc 13680
agggactgtg gggcgtgcag gccccggtcg gggaccgcgc gacggtgtcg agcctgctga 13740
cgccgaactc gcgcctgctg ctgctgctgg tggcgccctt cacggacagc ggcagcgtga 13800
gccgcgactc gtacctgggc tacctgctta acctgtaccg cgaggccatc gggcaggcgc 13860
acgtggacga gcagacctac caggagatca cccacgtgag ccgcgcgctg ggccaggagg 13920
acccgggcaa cctggaggcc accctgaact tcctgctgac caaccggtcg cagaagatcc 13980
cgccccagta cgcgctgagc accgaggagg agcgcatcct gcgctacgtg cagcagagcg 14040
tggggctgtt cctgatgcag gagggggcca cgcccagcgc cgcgctcgac atgaccgcgc 14100
gcaacatgga gcccagcatg tacgctcgca accgcccgtt catcaataag ctgatggact 14160
acttgcatcg ggcggccgcc atgaactcgg actactttac caacgccatc ttgaacccgc 14220
actggctccc gccgcccggg ttctacacgg gcgagtacga catgcccgac cccaacgacg 14280
ggttcctgtg ggacgacgtg gacagcagcg tgttctcgcc gcgccccgcc accaccgtgt 14340
ggaagaaaga gggcggggac cggcggccgt cctcggcgct gtccggtcgc gcgggtgctg 14400
ccgcggcggt gcctgaggcc gccagcccct tcccgagcct gcccttttcg ctgaacagcg 14460
tgcgcagcag cgagctgggt cggctgacgc ggccgcgcct gctgggcgag gaggagtacc 14520
tgaacgactc cttgttgagg cccgagcgcg agaagaactt ccccaataac gggatagaga 14580
gcctggtgga caagatgagc cgctggaaga cgtacgcgca cgagcacagg gacgagcccc 14640
gagctagcag cagcgcaggc acccgtagac gccagcgaca cgacaggcag cggggtctgg 14700
tgtgggacga tgaggattcc gccgacgaca gcagcgtgtt ggacttgggt gggagtggtg 14760
gtggtaaccc gttcgctcac ttgcgccccc gtatcgggcg cctgatgtaa gaatctgaaa 14820
aaataaaaaa cggtactcac caaggccatg gcgaccagcg tgcgttcttc tctgttgttt 14880
gtagtagtat gatgaggcgc gtgtacccgg agggtcctcc tccctcgtac gagagcgtga 14940
tgcagcaggc ggtggcggcg gcgatgcagc ccccgctgga ggcgccttac gtgcccccgc 15000
ggtacctggc gcctacggag gggcggaaca gcattcgtta ctcggagctg gcacccttgt 15060
acgataccac ccggttgtac ctggtggaca acaagtcggc ggacatcgcc tcgctgaact 15120
accagaacga ccacagcaac ttcctgacca ccgtggtgca gaacaacgat ttcaccccca 15180
cggaggccag cacccagacc atcaactttg acgagcgctc gcggtggggc ggccagctga 15240
aaaccatcat gcacaccaac atgcccaacg tgaacgagtt catgtacagc aacaagttca 15300
aggcgcgggt gatggtctcg cgcaagaccc ccaatggggt cgcggtggat gagaattatg 15360
atggtagtca ggacgagctg acttacgagt gggtggagtt tgagctgccc gagggcaact 15420
tctcggtgac catgaccatc gatctgatga acaacgccat catcgacaac tacttggcgg 15480
tggggcgtca gaacggggtg ctggagagcg acatcggcgt gaagttcgac acgcgcaact 15540
tccggctggg ctgggacccc gtgaccgagc tggtgatgcc gggcgtgtac accaacgagg 15600
ccttccaccc cgacatcgtc ctgctgcccg gctgcggcgt ggacttcacc gagagccgcc 15660
tcagcaacct gctgggcatc cgcaagcggc agcccttcca ggagggcttc cagatcctgt 15720
acgaggacct ggaggggggc aacatccccg cgctcttgga tgtcgaagcc tatgagaaaa 15780
gcaaggagga ggccgccgca gcggcgaccg cagccgtggc caccgcctct accgaggtgc 15840
ggggcgataa ttttgctagc gccgcggcag tggccgaggc ggctgaaacc gaaagtaaga 15900
tagtcatcca gccggtggag aaggacagca aggacaggag ctacaacgtg ctcgcggaca 15960
agaaaaacac cgcctaccgc agctggtacc tggcctacaa ctacggcgac cccgagaagg 16020
gcgtgcgctc ctggacgctg ctcaccacct cggacgtcac ctgcggcgtg gagcaagtct 16080
actggtcgct gcccgacatg atgcaagacc cggtcacctt ccgctccacg cgtcaagtta 16140
gcaactaccc ggtggtgggc gccgagctcc tgcccgtcta ctccaagagc ttcttcaacg 16200
agcaggccgt ctactcgcag cagctgcgcg ccttcacctc gctcacgcac gtcttcaacc 16260
gcttccccga gaaccagatc ctcgtccgcc cgcccgcgcc caccattacc accgtcagtg 16320
aaaacgttcc tgctctcaca gatcacggga ccctgccgct gcgcagcagt atccggggag 16380
tccagcgcgt gaccgtcact gacgccagac gccgcacctg cccctacgtc tacaaggccc 16440
tgggcgtagt cgcgccgcgc gtcctctcga gccgcacctt ctaaaaaatg tccattctca 16500
tctcgcccag taataacacc ggttggggcc tgcgcgcgcc cagcaagatg tacggaggcg 16560
ctcgccaacg ctccacgcaa caccccgtgc gcgtgcgcgg gcacttccgc gctccctggg 16620
gcgccctcaa gggccgcgtg cgctcgcgca ccaccgtcga cgacgtgatc gaccaggtgg 16680
tggccgacgc gcgcaactac acgcccgccg ccgcgcccgc ctccaccgtg gacgccgtca 16740
tcgacagcgt ggtggccgat gcgcgccggt acgcccgcgc caagagccgg cggcggcgca 16800
tcgcccggcg gcaccggagc acccccgcca tgcgcgcggc gcgagccttg ctgcgcaggg 16860
ccaggcgcac gggacgcagg gccatgctca gggcggccag acgcgcggcc tccggcagca 16920
gcagcgccgg caggacccgc agacgcgcgg ccacggcggc ggcggcggcc atcgccagca 16980
tgtcccgccc gcggcgcggc aacgtgtact gggtgcgcga cgccgccacc ggtgtgcgcg 17040
tgcccgtgcg cacccgcccc cctcgcactt gaagatgctg acttcgcgat gttgatgtgt 17100
cccagcggcg aggaggatgt ccaagcgcaa atacaaggaa gagatgctcc aggtcatcgc 17160
gcctgagatc tacggccccg cggtgaagga ggaaagaaag ccccgcaaac tgaagcgggt 17220
caaaaaggac aaaaaggagg aggaagatgt ggacggactg gtggagtttg tgcgcgagtt 17280
cgccccccgg cggcgcgtgc agtggcgcgg gcggaaagtg aaaccggtgc tgcggcccgg 17340
caccacggtg gtcttcacgc ccggcgagcg ttccggctcc gcctccaagc gctcctacga 17400
cgaggtgtac ggggacgagg acatcctcga gcaggcggtc gagcgtctgg gcgagtttgc 17460
ttacggcaag cgcagccgcc ccgcgccctt gaaagaggag gcggtgtcca tcccgctgga 17520
ccacggcaac cccacgccga gcctgaagcc ggtgaccctg cagcaggtgc tgccgagcgc 17580
ggcgccgcgc cggggcttca agcgcgaggg cggcgaggat ctgtacccga ccatgcagct 17640
gatggtgccc aagcgccaga agctggagga cgtgctggag cacatgaagg tggaccccga 17700
ggtgcagccc gaggtcaagg tgcggcccat caagcaggtg gccccgggcc tgggcgtgca 17760
gaccgtggac atcaagatcc ccacggagcc catggaaacg cagaccgagc ccgtgaagcc 17820
cagcaccagc accatggagg tgcagacgga tccctggatg ccggcgccgg cttccaccac 17880
tcgccgaaga cgcaagtacg gcgcggccag cctgctgatg cccaactacg cgctgcatcc 17940
ttccatcatc cccacgccgg gctaccgcgg cacgcgcttc taccgcggct acaccagcag 18000
ccgccgcaag accaccaccc gccgccgccg tcgtcgcacc cgccgcagca gcaccgcgac 18060
ttccgccgcc gccctggtgc ggagagtgta ccgcagcggg cgcgagcctc tgaccctgcc 18120
gcgcgcgcgc taccacccga gcatcgccat ttaactctgc cgtcgcctcc tacttgcaga 18180
tatggccctc acatgccgcc tccgcgtccc cattacgggc taccgaggaa gaaagccgcg 18240
ccgtagaagg ctgacgggga acgggctgcg tcgccatcac caccggcggc ggcgcgccat 18300
cagcaagcgg ttggggggag gcttcctgcc cgcgctgatc cccatcatcg ccgcggcgat 18360
cggggcgatc cccggcatag cttccgtggc ggtgcaggcc tctcagcgcc actgagacac 18420
agcttggaaa atttgtaata aaaaaatgga ctgacgctcc tggtcctgtg atgtgtgttt 18480
ttagatggaa gacatcaatt tttcgtccct ggcaccgcga cacggcacgc ggccgtttat 18540
gggcacctgg agcgacatcg gcaacagcca actgaacggg ggcgccttca attggagcag 18600
tctctggagc gggcttaaga atttcgggtc cacgctcaaa acctatggca acaaggcgtg 18660
gaacagcagc acagggcagg cgctgaggga aaagctgaaa gagcagaact tccagcagaa 18720
ggtggtcgat ggcctggcct cgggcatcaa cggggtggtg gacctggcca accaggccgt 18780
gcagaaacag atcaacagcc gcctggacgc ggtcccgccc gcggggtccg tggagatgcc 18840
ccaggtggag gaggagctgc ctcccctgga caagcgcggc gacaagcgac cgcgtcccga 18900
cgcggaggag acgctgctga cgcacacgga cgagccgccc ccgtacgagg aggcggtgaa 18960
actgggtctg cccaccacgc ggcccgtggc gcctctggcc accggggtgc tgaaacccag 19020
cagcagcagc cagcccgcga ccctggactt gcctccgcct gcttcccgcc cctccacagt 19080
ggctaagccc ctgccgccgg tggccgtcgc gtcgcgcgcc ccccgaggcc gcccccaggc 19140
gaactggcag agcactctga acagcatcgt gggtctggga gtgcagagtg tgaagcgccg 19200
ccgctgctat taaaagacac tgtagcgctt aacttgcttg tctgtgtgta tatgtatgtc 19260
cgccgaccag aaggaggaag aggcgcgtcg ccgagttgca agatggccac cccatcgatg 19320
ctgccccagt gggcgtacat gcacatcgcc ggacaggacg cttcggagta cctgagtccg 19380
ggtctggtgc agttcgcccg cgccacagac acctacttca gtctggggaa caagtttagg 19440
aaccccacgg tggcgcccac gcacgatgtg accaccgacc gcagccagcg gctgacgctg 19500
cgcttcgtgc ccgtggaccg cgaggacaac acctactcgt acaaagtgcg ctacacgctg 19560
gccgtgggcg acaaccgcgt gctggacatg gccagcacct actttgacat ccgcggcgtg 19620
ctggatcggg ggcccagctt caaaccctac tccggcaccg cctacaacag cctggctccc 19680
aagggagcgc ccaacacttg ccagtggaca tataaagctg gtgatactga tacagaaaaa 19740
acctatacat atggaaatgc acctgtgcaa ggcattagca ttacaaagga tggtattcaa 19800
cttggaactg acagcgatgg tcaggcaatc tatgcagacg aaacttatca accagagcct 19860
caagtgggtg atgctgaatg gcatgacatc actggtactg atgaaaaata tggaggcaga 19920
gctcttaagc ctgacaccaa aatgaagcct tgctatggtt cttttgccaa gcctaccaat 19980
aaagaaggag gccaggcaaa tgtgaaaacc gaaacaggcg gtaccaaaga atatgacatt 20040
gacatggcat tcttcgataa tcgaagtgca gctgccgccg gcctagcccc agaaattgtt 20100
ttgtatactg agaatgtgga tctggaaact ccagataccc atattgtata caaggcaggt 20160
acagatgaca gtagctcttc tatcaatttg ggtcagcagt ccatgcccaa cagacccaac 20220
tacattggct tcagagacaa ctttatcggt ctgatgtact acaacagcac tggcaatatg 20280
ggtgtactgg ctggacaggc ctcccagctg aatgctgtgg tggacttgca ggacagaaac 20340
accgaactgt cctaccagct cttgcttgac tctctgggtg acagaaccag gtatttcagt 20400
atgtggaatc aggcggtgga cagttatgac cccgatgtgc gcattattga aaatcacggt 20460
gtggaggatg aacttcctaa ctattgcttc cccctggatg ctgtgggtag aactgatact 20520
taccagggaa ttaaggccaa tggtgataat caaaccacct ggaccaaaga tgatactgtt 20580
aatgatgcta atgaattggg caagggcaat cctttcgcca tggagatcaa catccaggcc 20640
aacctgtggc ggaacttcct ctacgcgaac gtggcgctgt acctgcccga ctcctacaag 20700
tacacgccgg ccaacatcac gctgcccacc aacaccaaca cctacgatta catgaacggc 20760
cgcgtggtgg cgccctcgct ggtggacgcc tacatcaaca tcggggcgcg ctggtcgctg 20820
gaccccatgg acaacgtcaa ccccttcaac caccaccgca acgcgggcct gcgataccgc 20880
tccatgctcc tgggcaacgg gcgctacgtg cccttccaca tccaggtgcc ccaaaagttt 20940
ttcgccatca agagcctcct gctcctgccc gggtcctaca cctacgagtg gaacttccgc 21000
aaggacgtca acatgatcct gcagagctcc ctcggcaacg acctgcgcac ggacggggcc 21060
tccatcgcct tcaccagcat caacctctac gccaccttct tccccatggc gcacaacacc 21120
gcctccacgc tcgaggccat gctgcgcaac gacaccaacg accagtcctt caacgactac 21180
ctctcggcgg ccaacatgct ctaccccatc ccggccaacg ccaccaacgt gcccatctcc 21240
atcccctcgc gcaactgggc cgccttccgc ggctggtcct tcacgcgcct caagacccgc 21300
gagacgccct cgctcggctc cgggttcgac ccctacttcg tctactcggg ctccatcccc 21360
tacctcgacg gcaccttcta cctcaaccac accttcaaga aggtctccat caccttcgac 21420
tcctccgtca gctggcccgg caacgaccgc ctcctgacgc ccaacgagtt cgaaatcaag 21480
cgcaccgtcg acggagaggg gtacaacgtg gcccagtgca acatgaccaa ggactggttc 21540
ctggtccaga tgctggccca ctacaacatc ggctaccagg gcttctacgt gcccgagggc 21600
tacaaggacc gcatgtactc cttcttccgc aacttccagc ccatgagccg ccaggtcgtg 21660
gacgaggtca actacaagga ctaccaggcc gtcaccctgg cctaccagca caacaactcg 21720
ggcttcgtcg gctacctcgc gcccaccatg cgccagggcc agccctaccc cgccaactac 21780
ccctacccgc tcatcggcaa gagcgccgtc gccagcgtca cccagaaaaa gttcctctgc 21840
gaccgggtca tgtggcgcat ccccttctcc agcaacttca tgtccatggg cgcgctcacc 21900
gacctcggcc agaacatgct ctacgccaac tccgcccacg cgctagacat gaatttcgaa 21960
gtcgacccca tggatgagtc cacccttctc tatgttgtct tcgaagtctt cgacgtcgtc 22020
cgagtgcacc agccccaccg cggcgtcatc gaggccgtct acctgcgcac gcccttctcg 22080
gccggcaacg ccaccaccta agcctcttgc ttcttgcaag atgacggcct gcgcgggctc 22140
cggcgagcag gagctcaggg ccatcctccg cgacctgggc tgcgggccct gcttcctggg 22200
caccttcgac aagcgcttcc cgggattcat ggccccgcac aagctggcct gcgccatcgt 22260
caacacggcc ggccgcgaga ccgggggcga gcactggctg gccttcgcct ggaacccgcg 22320
ctcccacacc tgctacctct tcgacccctt cgggttctcg gacgagcgcc tcaagcagat 22380
ctaccagttc gagtacgagg gcctgctgcg tcgcagcgcc ctggccaccg aggaccgctg 22440
cgtcaccctg gaaaagtcca cccagaccgt gcagggtccg cgctcggccg cctgcgggct 22500
cttctgctgc atgttcctgc acgccttcgt gcactggccc gaccgcccca tggacaagaa 22560
ccccaccatg aacttgctga cgggggtgcc caacggcatg ctccagtcgc cccaggtgga 22620
acccaccctg cgccgcaacc aggaggcgct ctaccgcttc ctcaacgccc actccgccta 22680
ctttcgctcc caccgcgcgc gcatcgagaa ggccaccgcc ttcgaccgca tgaatcaaga 22740
catgtaatcc ggtgtgtgta tgtgaatgct ttattcatca taataaacag cacatgttta 22800
tgccaccttc tctgaggctc tgactttatt tagaaatcga aggggttctg ccggctctcg 22860
gcatggcccg cgggcaggga tacgttgcgg aactggtact tgggcagcca cttgaactcg 22920
gggatcagca gcttcggcac ggggaggtcg gggaacgagt cgctccacag cttgcgcgtg 22980
agttgcaggg cgcccagcag gtcgggcgcg gagatcttga aatcgcagtt gggacccgcg 23040
ttctgcgcgc gagagttacg gtacacgggg ttgcagcact ggaacaccat cagggccggg 23100
tgcttcacgc tcgccagcac cgtcgcgtcg gtgatgccct ccacgtccag atcctcggcg 23160
ttggccatcc cgaagggggt catcttgcag gtctgccgcc ccatgctggg cacgcagccg 23220
ggcttgtggt tgcaatcgca gtgcaggggg atcagcatca tctgggcctg ctcggagctc 23280
atgcccgggt acatggcctt catgaaagcc tccagctggc ggaaggcctg ctgcgccttg 23340
ccgccctcgg tgaagaagac cccgcaggac ttgctagaga actggttggt ggcgcagcca 23400
gcgtcgtgca cgcagcagcg cgcgtcgttg ttggccagct gcaccacgct gcgcccccag 23460
cggttctggg tgatcttggc ccggtcgggg ttctccttca gcgcgcgctg cccgttctcg 23520
ctcgccacat ccatctcgat cgtgtgctcc ttctggatca tcacggtccc gtgcaggcac 23580
cgcagcttgc cctcggcctc ggtgcacccg tgcagccaca gcgcgcagcc ggtgctctcc 23640
cagttcttgt gggcgatctg ggagtgcgag tgcacgaagc cctgcaggaa gcggcccatc 23700
atcgtggtca gggtcttgtt gctggtgaag gtcagcggaa tgccgcggtg ctcctcgttc 23760
acatacaggt ggcagatacg gcggtacacc tcgccctgct cgggcatcag ctggaaggcg 23820
gacttcaggt cgctctccac gcggtaccgg tccatcagca gcgtcatcac ttccatgccc 23880
ttctcccagg ccgaaacgat cggcaggctc agggggttct tcaccgttgt catcttagtc 23940
gccgccgccg aagtcagggg gtcgttctcg tccagggtct caaacactcg cttgccgtcc 24000
ttctcggtga tgcgcacggg gggaaagctg aagcccacgg ccgccagctc ctcctcggcc 24060
tgcctttcgt cctcgctgtc ctggctgatg tcttgcaaag gcacatgctt ggtcttgcgg 24120
ggtttctttt tgggcggcag aggcggcggc ggagacgtgc tgggcgagcg cgagttctcg 24180
ctcaccacga ctatttcttc tccttggccg tcgtccgaga ccacgcggcg gtaggcatgc 24240
ctcttctggg gcagaggcgg aggcgacggg ctctcgcggt tcggcgggcg gctggcagag 24300
ccccttccgc gttcgggggt gcgctcctgg cggcgctgct ctgactgact tcctccgcgg 24360
ccggccattg tgttctccta gggagcaagc atggagactc agccatcgtc gccaacatcg 24420
ccatctgccc ccgccgccgc cgacgagaac cagcagcagc agaatgaaag cttaaccgcc 24480
ccgccgccca gccccacctc cgacgccgca gccccagaca tgcaagagat ggaggaatcc 24540
atcgagattg acctgggcta cgtgacgccc gcggagcacg aggaggagct ggcagcgcgc 24600
ttttcagccc cggaagagaa ccaccaagag cagccagagc aggaagcaga gagcgagcag 24660
aaccaggctg ggctcgagca tggcgactac ctgagcgggg cagaggacgt gctcatcaag 24720
catctggccc gccaatgcat catcgtcaag gacgcgctgc tcgaccgcgc cgaggtgccc 24780
ctcagcgtgg cggagctcag ccgcgcctac gagcgcaacc tcttctcgcc gcgcgtgccc 24840
cccaagcgcc agcccaacgg cacctgcgag cccaacccgc gcctcaactt ctacccggtc 24900
ttcgcggtgc ccgaggccct ggccacctac cacctctttt tcaagaacca aaggatcccc 24960
gtctcctgcc gcgccaaccg cacccgcgcc gacgccctgc tcaacctggg ccccggcgcc 25020
cgcctacctg atatcgcctc cttggaagag gttcccaaga tcttcgaggg tctgggcagc 25080
gacgagactc gggccgcgaa cgctctgcaa ggaagcggag aggagcatga gcaccacagc 25140
gccctggtgg agttggaagg cgacaacgcg cgcctggcgg tcctcaagcg cacggtcgag 25200
ctgacccact tcgcctaccc ggcgctcaac ctgcccccca aggtcatgag cgccgtcatg 25260
gaccaggtgc tcatcaagcg cgcctcgccc ctctcggagg aggagatgca ggaccccgag 25320
agctcggacg agggcaagcc cgtggtcagc gacgagcagc tggcgcgctg gctgggagcg 25380
agtagcaccc cccagagcct ggaagagcgg cgcaagctca tgatggccgt ggtcctggtg 25440
accgtggagc tggagtgtct gcgccgcttc ttcgccgacg cggagaccct gcgcaaggtc 25500
gaggagaacc tgcactacct cttcagacac gggttcgtgc gccaggcctg caagatctcc 25560
aacgtggagc tgaccaacct ggtctcctac atgggcatcc tgcacgagaa ccgcctgggg 25620
cagaacgtgc tgcacaccac cctgcgcggg gaggcccgcc gcgactacat ccgcgactgc 25680
gtctacctgt acctctgcca cacctggcag acgggcatgg gcgtgtggca gcagtgcctg 25740
gaggagcaga acctgaaaga gctctgcaag ctcctgcaga agaacctcaa ggccctgtgg 25800
accgggttcg acgagcgcac caccgccgcg gacctggccg acctcatctt ccccgagcgc 25860
ctgcggctga cgctgcgcaa cgggctgccc gactttatga gccaaagcat gttgcaaaac 25920
tttcgctctt tcatcctcga acgctccggg atcctgcccg ccacctgctc cgcgctgccc 25980
tcggacttcg tgccgctgac cttccgcgag tgccccccgc cgctctggag ccactgctac 26040
ctgctgcgcc tggccaacta cctggcctac cactcggacg tgatcgagga cgtcagcggc 26100
gagggcctgc tcgagtgcca ctgccgctgc aacctctgca cgccgcaccg ctccctggcc 26160
tgcaaccccc agctgctgag cgagacccag atcatcggca ccttcgagtt gcaaggcccc 26220
ggcgagggca aggggggtct gaaactcacc ccggggctgt ggacctcggc ctacttgcgc 26280
aagttcgtgc ccgaggacta ccatcccttc gagatcaggt tctacgagga ccaatcccag 26340
ccgcccaagg ccgagctgtc ggcctgcgtc atcacccagg gggccatcct ggcccaattg 26400
caagccatcc agaaatcccg ccaagaattt ctgctgaaaa agggccacgg ggtctacttg 26460
gacccccaga ccggagagga gctcaacccc agcttccccc aggatgcccc gaggaagcag 26520
caagaagctg aaagtggagc tgccgccgcc gccggaggat ttggaggaag actgggagag 26580
cagtcaggca gaggaggagg agatggaaga ctgggacagc actcaggcag aggaggacag 26640
cctgcaagac agtctggagg aggaagacga ggtggaggag gcagaggaag aagcagccgc 26700
cgccagaccg tcgtcctcgg cggaggagga gaaagcaagc agcacggata ccatctccgc 26760
tccgggtcgg ggtcgcggcg gccgggccca cagtagatgg gacgagaccg ggcgcttccc 26820
gaaccccacc acccagaccg gtaagaagga gcggcaggga tacaagtcct ggcgggggca 26880
caaaaacgcc atcgtctcct gcttgcaagc ctgcgggggc aacatctcct tcacccggcg 26940
ctacctgctc ttccaccgcg gggtgaactt cccccgcaac atcttgcatt actaccgtca 27000
cctccacagc ccctactact gtttccaaga agaggcagaa acccagcagc agcagcagca 27060
gcagaaaacc agcggcagca gctagaaaat ccacagcggc ggcaggtgga ctgaggatcg 27120
cggcgaacga gccggcgcag acccgggagc tgaggaaccg gatctttccc accctctatg 27180
ccatcttcca gcagagtcgg gggcaagagc aggaactgaa agtcaagaac cgttctctgc 27240
gctcgctcac ccgcagttgt ctgtatcaca agagcgaaga ccaacttcag cgcactctcg 27300
aggacgccga ggctctcttc aacaagtact gcgcgctcac tcttaaagag tagcccgcgc 27360
ccgcccacac acggaaaaag gcgggaatta cgtcaccacc tgcgcccttc gcccgaccat 27420
catcatgagc aaagagattc ccacgcctta catgtggagc taccagcccc agatgggcct 27480
ggccgccggc gccgcccagg actactccac ccgcatgaac tggctcagtg ccgggcccgc 27540
gatgatctca cgggtgaatg acatccgcgc ccaccgaaac cagatactcc tagaacagtc 27600
agcgatcacc gccacgcccc gccatcacct taatccgcgt aattggcccg ccgccctggt 27660
gtaccaggaa attccccagc ccacgaccgt actacttccg cgagacgccc aggccgaagt 27720
ccagctgact aactcaggtg tccagctggc cggcggcgcc gccctgtgtc gtcaccgccc 27780
cgctcagggt ataaagcggc tggtgatccg aggcagaggc acacagctca acgacgaggt 27840
ggtgagctct tcgctgggtc tgcgacctga cggagtcttc caactcgccg gatcggggag 27900
atcttccttc acgcctcgtc aggccgtcct gactttggag agttcgtcct cgcagccccg 27960
ctcgggtggc atcggcactc tccagttcgt ggaggagttc actccctcgg tctacttcaa 28020
ccccttctcc ggctcccccg gccactaccc ggacgagttc atcccgaact tcgacgccat 28080
cagcgagtcg gtggacggct acgattgaat gtcccatggt ggcgcggctg acctagctcg 28140
gcttcgacac ctggaccact gccgccgctt ccgctgcttc gctcgggatc tcgccgagtt 28200
tgcctacttt gagctgcccg aggagcaccc tcagggcccg gcccacggag tgcggatcgt 28260
cgtcgaaggg ggtctcgact cccacctgct tcggatcttc agccagcgtc cgatcctggc 28320
cgagcgcgag caaggacaga cccttctgac cctgtactgc atctgcaacc accccggcct 28380
gcatgaaagt ctttgttgtc tgctgtgtac tgagtataat aaaagctgag atcagcgact 28440
actccggact tccgtgtgtt cctgctatca accagtccct gttcttcacc gggaacgaga 28500
ccgagctcca gctccagtgt aagccccaca agaagtacct cacctggctg ttccagggct 28560
ctccgatcgc cgttgtcaac cactgcgaca acgacggagt cctgctgagc ggccctgcca 28620
accttacttt ttccacccgc agaagcaagc tccagctctt ccaacccttc ctccccggga 28680
cctatcagtg cgtctcggga ccctgccatc acaccttcca cctgatcccg aataccacag 28740
cgtcgctccc cgctactaac aaccaaacta cccaccaacg ccaccgtcgc gaccgcggac 28800
atgtacagag ctcgagaagt actaggccac aatacatgcc catattagac tatgaggccg 28860
agccacagcg acccatgctc cccgctatta gttacttcaa tctaaccggc ggagatgact 28920
gacccactgg ccaacaacaa cgtcaacgac cttctcctgg acatggacgg ccgcgcctcg 28980
gagcagcgac tcgcccaact tcgcattcgc cagcagcagg agagagccgt caaggagctg 29040
caggacggca tagccatcca ccagtgcaag aaaggcatct tctgcctggt gaaacaggcc 29100
aagatctcct acgaggtcac cccgaccgac catcgcctct cctacgagct cctgcagcag 29160
cgccagaagt tcacctgcct ggtcggagtc aaccccatcg tcatcaccca gcagtcgggc 29220
gataccaagg ggtgcatcca ctgctcctgc gactcccccg actgcgtcca cactctgatc 29280
aagaccctct gcggcctccg cgacctcctc cccatgaact aatcaccccc ttatccagtg 29340
aaataaatat catattgatg atgatttaaa taaaaaataa tcatttgatt tgaaataaag 29400
atacaatcat attgatgatt tgagttttaa aaaataaaga atcacttact tgaaatctga 29460
taccaggtct ctgtccatgt tttctgccaa caccacctca ctcccctctt cccagctctg 29520
gtactgcaga ccccggcggg ctgcaaactt cctccacacg ctgaagggga tgtcaaattc 29580
ctcctgtccc tcaatcttca ttttatcttc tatcagatgt ccaaaaagcg cgtccgggtg 29640
gatgatgact tcgaccccgt ctacccctac gatgcagaca acgcaccgac cgtgcccttc 29700
atcaaccccc ccttcgtctc ttcagatgga ttccaagaga agcccctggg ggtgctgtcc 29760
ctgcgactgg ctgaccccgt caccaccaag aacggggaaa tcaccctcaa gctgggagag 29820
ggggtggacc tcgactcctc gggaaaactc atctccaaca cggccaccaa ggccgccgcc 29880
cctctcagtt tttccaacaa caccatttcc cttaacatgg atacccctct ttataccaaa 29940
gatggaaaat tatccttaca agtttctcca ccgttaaaca tattaaaatc aaccattctg 30000
aacacattag ctgtagctta tggatcaggt ttaggactga gtggtggcac tgctcttgca 30060
gtacagttgg cctctccact cacttttgat gaaaaaggaa atattaaaat taacctagcc 30120
agtggtccat taacagttga tgcaagtcga cttagtatca actgcaaaag aggggtcact 30180
gtcactacct caggagatgc aattgaaagc aacataagct ggcctaaagg tataagattt 30240
gaaggtaatg gcatagctgc aaacattggc agaggattgg aatttggaac cactagtaca 30300
gagactgatg tcacagatgc atacccaatt caagttaaat tgggtactgg ccttaccttt 30360
gacagtacag gcgccattgt tgcttggaac aaagaggatg ataaacttac attatggacc 30420
acagccgacc cctcgccaaa ttgcaaaata tactctgaaa aagatgccaa actcacactt 30480
tgcttgacaa agtgtggaag tcaaattctg ggtactgtga ctgtattggc agtgaataat 30540
ggaagtctca acccaatcac aaacacagta agcactgcac tcgtctccct caagtttgat 30600
gcaagtggag ttttgctaag cagctccaca ttagacaaag aatattggaa cttcagaaag 30660
ggagatgtta cacctgctga gccctatact aatgctatag gttttatgcc taacataaag 30720
gcctatccta aaaacacatc tgcagcttca aaaagccata ttgtcagtca agtttatctc 30780
aatggggatg aggccaaacc actgatgctg attattactt ttaatgaaac tgaggatgca 30840
acttgcacct acagtatcac ttttcaatgg aaatgggata gtactaagta cacaggtgaa 30900
acacttgcta ccagctcctt caccttctcc tacatcgccc aagaatgaac actgtatccc 30960
accctgcatg ccaacccttc ccaccccact ctgtctatgg aaaaaactct gaagcacaaa 31020
ataaaataaa gttcaagtgt tttattgatt caacagtttt acaggattcg agcagttatt 31080
tttcctccac cctcccagga catggaatac accaccctct ccccccgcac agccttgaac 31140
atctgaatgc cattggtgat ggacatgctt ttggtctcca cgttccacac agtttcagag 31200
cgagccagtc tcgggtcggt cagggagatg aaaccctccg ggcactcccg catctgcacc 31260
tcacagctca acagctgagg attgtcctcg gtggtcggga tcacggttat ctggaagaag 31320
cagaagagcg gcggtgggaa tcatagtccg cgaacgggat cggccggtgg tgtcgcatca 31380
ggccccgcag cagtcgctgc cgccgccgct ccgtcaagct gctgctcagg gggtccgggt 31440
ccagggactc cctcagcatg atgcccacgg ccctcagcat cagtcgtctg gtgcggcggg 31500
cgcagcagcg catgcggatc tcgctcaggt cgctgcagta cgtgcaacac aggaccacca 31560
ggttgttcaa cagtccatag ttcaacacgc tccagccgaa actcatcgcg ggaaggatgc 31620
tacccacgtg gccgtcgtac cagatcctca ggtaaatcaa gtggcgctcc ctccagaaca 31680
cgctgcccac gtacatgatc tccttgggca tgtggcggtt caccacctcc cggtaccaca 31740
tcaccctctg gttgaacatg cagccccgga tgatcctgcg gaaccacagg gccagcaccg 31800
ccccgcccgc catgcagcga agagaccccg ggtcccggca atggcaatgg aggacccacc 31860
gctcgtaccc gtggatcatc tgggagctga acaagtctat gttggcacag cacaggcata 31920
tgctcatgca tctcttcagc actctcagct cctcgggggt caaaaccata tcccagggca 31980
cggggaactc ttgcaggaca gcgaaccccg cagaacaggg caatcctcgc acataactta 32040
cattgtgcat ggacagggta tcgcaatcag gcagcaccgg gtgatcctcc accagagaag 32100
cgcgggtctc ggtctcctca cagcgtggta agggggccgg ccgatacggg tgatggcggg 32160
acgcggctga tcgtgttcgc gaccgtgtca tgatgcagtt gctttcggac attttcgtac 32220
ttgctgtagc agaacctggt ccgggcgctg cacaccgatc gccggcggcg gtcccggcgc 32280
ttggaacgct cggtgttgaa attgtaaaac agccactctc tcagaccgtg cagcagatct 32340
agggcctcag gagtgatgaa gatcccatca tgcctgatag ctctgatcac atcgaccacc 32400
gtggaatggg ccagacccag ccagatgatg caattttgtt gggtttcggt gacggcgggg 32460
gagggaagaa caggaagaac catgattaac ttttaatcca aacggtctcg gagcacttca 32520
aaatgaaggt cgcggagatg gcacctctcg cccccgctgt gttggtggaa aataacagcc 32580
aggtcaaagg tgatacggtt ctcgagatgt tccacggtgg cttccagcaa agcctccacg 32640
cgcacatcca gaaacaagac aatagcgaaa gcgggagggt tctctaattc ctcaatcatc 32700
atgttacact cctgcaccat ccccagataa ttttcatttt tccagccttg aatgattcga 32760
actagttcct gaggtaaatc caagccagcc atgataaaga gctcgcgcag agcgccctcc 32820
accggcattc ttaagcacac cctcataatt ccaagatatt ctgctcctgg ttcacctgca 32880
gcagattgac aagcggaata tcaaaatctc tgccgcgatc cctaagctcc tccctcagca 32940
ataactgtaa gtactctttc atatcctctc cgaaattttt agccatagga ccaccaggaa 33000
taagattagg gcaagccaca gtacagataa accgaagtcc tccccagtga gcattgccaa 33060
atgcaagact gctataagca tgctggctag acccggtgat atcttccaga taactggaca 33120
gaaaatcacc caggcaattt ttaagaaaat caacaaaaga aaaatcctcc aggtgcacgt 33180
ttagagcctc gggaacaacg atgaagtaaa tgcaagcggt gcgttccagc atggttagtt 33240
agctgatctg taaaaaacaa aaaataaaac attaaaccat gctagcctgg cgaacaggtg 33300
ggtaaatcgt tctctccagc accaggcagg ccacggggtc tccggcgcga ccctcgtaaa 33360
aattgtcgct atgattgaaa accatcacag agagacgttc ccggtggccg gcgtgaatga 33420
ttcgacaaga tgaatacacc cccggaacat tggcgtccgc gagtgaaaaa aagcgcccga 33480
ggaagcaata aggcactaca atgctcagtc tcaagtccag caaagcgatg ccatgcggat 33540
gaagcacaaa atcctcaggt gcgtacaaaa tgtaattact cccctcctgc acaggcagcg 33600
aagcccccga tccctccaga tacacataca aagcctcagc gtccatagct taccgagcag 33660
cagcacacaa caggcgcaag agtcagagaa aggctgagct ctaacctgtc cacccgctct 33720
ctgctcaata tatagcccag atctacactg acgtaaaggc caaagtctaa aaatacccgc 33780
caaataatca cacacgccca gcacacgccc agaaaccggt gacacactca aaaaaatacg 33840
cgcacttcct caaacgccca aactgccgtc atttccgggt tcccacgcta cgtcatcgga 33900
attcgacttt caaattccgt cgaccgttaa aaacgtcacc cgccccgccc ctaacggtcg 33960
cccgtctctc ggccaatcac cttcctccct ccccaaattc aaacagctca tttgcatatt 34020
aacgcgcacc aaaagtttga ggtatattat tgatgatg 34058
Claims (39)
1.包含血清型AdC6或AdC7的黑猩猩来源的腺病毒载体的核酸序列的组合物,其中早期基因E1基因组区被缺失,和其中所述核酸序列进一步包含含有与编码异源蛋白质的序列可操作地连接的启动子序列的表达盒,其中所述异源蛋白质为选自gp140和Gag的至少一种HIV蛋白质;
其中gp140来自选自B、AE、BC和C的中国HIV进化枝;和
其中Gag来自中国HIV进化枝B。
2.根据权利要求1所述的组合物,其中所述表达盒位于所述早期基因E1基因组区中。
3.根据权利要求1所述的组合物,其中所述表达盒包含嵌合内含子和/或CMV增强子。
4.根据权利要求1-3中任一项所述的组合物,其中由ORF3、ORF4、ORF5、ORF6、和ORF7组成的早期基因E3基因组区被缺失。
5.根据权利要求1-3中任一项所述的组合物,其中整个早期基因E3基因组区被缺失。
6.根据权利要求1所述的组合物,其中所述启动子是组成型启动子。
7.根据权利要求1所述的组合物,其中所述启动子是巨细胞病毒即时早期启动子(CMV)。
8.根据权利要求1所述的组合物,其中所述核酸序列包含SEQ ID NO:6或7。
9.包含根据权利要求1所述的组合物的蛋白质表达系统,其中所述核酸序列包含SEQID NO:6或7。
10.包含根据权利要求1-7中任一项所述的组合物的蛋白质表达系统,其中所述表达盒编码的所述异源蛋白质包含选自SEQ ID NO:1-5的氨基酸序列。
11.包含血清型AdC6或AdC7的黑猩猩来源的腺病毒载体的核酸序列的组合物,其中早期基因E1基因组区被缺失,和其中所述核酸序列进一步包含含有与异源蛋白质的序列可操作地连接的组成型启动子的表达盒,其中所述表达盒位于所述早期基因E1基因组区中,其中所述异源蛋白质为选自gp140和Gag的至少一种HIV蛋白质;
其中gp140来自选自B、AE、BC和C的中国HIV进化枝;和
其中Gag来自中国HIV进化枝B。
12.根据权利要求所述的组合物11,其中所述核酸序列包含SEQ ID NO:6或7。
13.包含根据权利要求11-12中任一项所述的组合物的蛋白质表达系统,其中所述表达盒编码的所述异源蛋白质包含选自SEQ ID NO:1-5的氨基酸序列。
14.引发哺乳动物中针对异源蛋白质的免疫应答的方法,所述方法包括向所述哺乳动物施用包含血清型AdC6或AdC7的黑猩猩来源的腺病毒载体的核酸序列的组合物,其中早期基因E1基因组区被缺失,和其中所述核酸序列进一步包含含有与编码异源蛋白质的序列可操作地连接的启动子的表达盒,其中所述异源蛋白质为选自gp140和Gag的至少一种HIV蛋白质;
其中gp140来自选自B、AE、BC和C的中国HIV进化枝;和
其中Gag来自中国HIV进化枝B。
15.根据权利要求14所述的方法,其中所述表达盒位于所述早期基因E1区中。
16.根据权利要求14所述的方法,其中所述表达盒包含嵌合内含子和/或CMV增强子。
17.根据权利要求14-16中任一项所述的方法,其中由ORF3、ORF4、ORF5、ORF6、和ORF7组成的早期基因E3基因组区被缺失。
18.根据权利要求14-16中任一项所述的组合物,其中整个早期基因E3基因组区被缺失。
19.根据权利要求14-18中任一项所述的方法,其中所述启动子是组成型启动子。
20.根据权利要求14-18中任一项所述的方法,其中所述启动子是巨细胞病毒即时早期启动子(CMV)。
21.根据权利要求14-18中任一项所述的方法,其中所述核酸序列包含SEQ ID NO:6或7。
22.治疗和/或预防哺乳动物中HIV的方法,所述方法包括施用治疗有效量的由包含SEQID NO:6或7的核酸序列编码的组合物。
23.针对HIV感染为哺乳动物接种疫苗的方法,所述方法包括向所述哺乳动物施用治疗有效量的根据权利要求1所述的组合物,其中所述组合物的施用引发所述哺乳动物的免疫应答。
24.根据权利要求23所述的方法,其中为所述哺乳动物预防性施用所述组合物。
25.根据权利要求23所述的方法,其中为所述哺乳动物治疗性施用所述组合物。
26.根据权利要求23所述的方法,其中所述组合物与佐剂组合施用。
27.产生对哺乳动物中异源蛋白质的效应和记忆T细胞免疫应答的方法,所述方法包括以下步骤:(a)以有效引发哺乳动物中免疫应答的量向所述哺乳动物施用根据权利要求1所述的组合物;(b)在第二随后的时间段施用第二有效量的根据权利要求1所述的组合物,其中针对所述异源蛋白质的T记忆细胞在所述哺乳动物中被重新活化。
28.根据权利要求27所述的方法,其中在(a)中第一施用和在(b)中第二施用的所述组合物包括选自gp140和Gag的相同或不同的HIV异源蛋白质。
29.根据权利要求27所述的方法,其中在(a)中第一施用和在(b)中第二施用的所述组合物是选自AdC6和AdC7的相同或不同血清型。
30.根据权利要求27所述的方法,其中在(a)中第一施用和在(b)中第二施用的所述组合物具有相同或不同的HIV进化枝。
31.根据权利要求27所述的方法,进一步包括向所述哺乳动物施用免疫原的步骤。
32.根据权利要求31所述的方法,其中所述免疫原包含异源蛋白质,其中所述异源蛋白质为选自gp140和Gag的至少一种HIV蛋白质;其中gp140来自选自B、AE、BC和C的中国HIV进化枝;和其中Gag来自中国HIV进化枝B,其中B细胞免疫应答被进一步扩大。
33.产生对哺乳动物中异源蛋白质的适应性B细胞免疫应答的方法,所述方法包括以下步骤:(a)以有效引发哺乳动物中免疫应答的量向所述哺乳动物施用根据权利要求1所述的组合物;(b)在第二随后的时间段施用第二有效量的根据权利要求1所述的组合物,其中针对异源蛋白质的B记忆细胞在哺乳动物中被重新活化。
34.根据权利要求33所述的方法,其中在(a)中第一施用和在(b)中第二施用的所述组合物包括选自gp140和Gag的相同或不同的HIV异源蛋白质。
35.根据权利要求33所述的方法,其中在(a)中第一施用和在(b)中第二施用的组合物具有选自AdC6和AdC7的相同或不同血清型。
36.根据权利要求33所述的方法,其中在(a)中第一施用和在(b)中第二施用的所述组合物具有相同或不同的HIV进化枝。
37.根据权利要求33所述的方法,进一步包括向所述哺乳动物施用免疫原的步骤。
38.根据权利要求37所述的方法,其中所述免疫原包含异源蛋白质,其中所述异源蛋白质是选自任意来源的任何进化枝的至少一种HIV env蛋白质,其中所述B细胞免疫应答被进一步扩大。
39.根据权利要求14、22、23、27和33中任一项所述的方法,其中所述哺乳动物是人。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962835108P | 2019-04-17 | 2019-04-17 | |
US62/835,108 | 2019-04-17 | ||
PCT/US2019/054301 WO2020214203A1 (en) | 2019-04-17 | 2019-10-02 | Replication deficient adenoviral vectors for hiv vaccine applications |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114269363A true CN114269363A (zh) | 2022-04-01 |
Family
ID=72836976
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201980097540.7A Pending CN114269363A (zh) | 2019-04-17 | 2019-10-02 | 用于hiv疫苗应用的复制缺陷型腺病毒载体 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20220211835A1 (zh) |
CN (1) | CN114269363A (zh) |
WO (1) | WO2020214203A1 (zh) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103589738A (zh) * | 2013-11-12 | 2014-02-19 | 中国疾病预防控制中心病毒病预防控制所 | HIV-1中国流行株CRF01_AE env基因的改造 |
CN105106971A (zh) * | 2007-03-02 | 2015-12-02 | 葛兰素史密丝克莱恩生物有限公司 | 疫苗组合物及其在刺激免疫反应中的用途 |
CN106999571A (zh) * | 2014-09-26 | 2017-08-01 | 贝斯以色列护理医疗中心有限公司 | 诱导针对人免疫缺陷病毒感染的保护性免疫性的方法和组合物 |
WO2018026547A1 (en) * | 2016-08-01 | 2018-02-08 | The Wistar Institute Of Anatomy And Biology | Compositions and methods of replication deficient adenoviral vectors for vaccine applications |
CN108368157A (zh) * | 2015-12-15 | 2018-08-03 | 扬森疫苗与预防公司 | 人类免疫缺陷病毒抗原、载体、组合物、及其使用方法 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3108899A1 (en) * | 2001-11-21 | 2016-12-28 | The Trustees of the University of Pennsylvania | Simian adenovirus adsv1 nucleic acid and amino acid sequences, vectors containing same, and methods of use |
WO2005033269A2 (en) * | 2003-06-18 | 2005-04-14 | The Wistar Institute | Methods for inducing an immune response via oral administration of an adenovirus |
WO2013074501A1 (en) * | 2011-11-14 | 2013-05-23 | Crucell Holland B.V. | Heterologous prime-boost immunization using measles virus-based vaccines |
US9624510B2 (en) * | 2013-03-01 | 2017-04-18 | The Wistar Institute | Adenoviral vectors comprising partial deletions of E3 |
-
2019
- 2019-10-02 US US17/604,329 patent/US20220211835A1/en active Pending
- 2019-10-02 CN CN201980097540.7A patent/CN114269363A/zh active Pending
- 2019-10-02 WO PCT/US2019/054301 patent/WO2020214203A1/en active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105106971A (zh) * | 2007-03-02 | 2015-12-02 | 葛兰素史密丝克莱恩生物有限公司 | 疫苗组合物及其在刺激免疫反应中的用途 |
CN103589738A (zh) * | 2013-11-12 | 2014-02-19 | 中国疾病预防控制中心病毒病预防控制所 | HIV-1中国流行株CRF01_AE env基因的改造 |
CN106999571A (zh) * | 2014-09-26 | 2017-08-01 | 贝斯以色列护理医疗中心有限公司 | 诱导针对人免疫缺陷病毒感染的保护性免疫性的方法和组合物 |
CN108368157A (zh) * | 2015-12-15 | 2018-08-03 | 扬森疫苗与预防公司 | 人类免疫缺陷病毒抗原、载体、组合物、及其使用方法 |
WO2018026547A1 (en) * | 2016-08-01 | 2018-02-08 | The Wistar Institute Of Anatomy And Biology | Compositions and methods of replication deficient adenoviral vectors for vaccine applications |
Non-Patent Citations (6)
Title |
---|
佚名: "GenBank: HM215399.1", 《NCBI》, 7 November 2014 (2014-11-07), pages 1 - 3 * |
佚名: "GenBank: JF932500.1", 《NCBI》, 4 December 2012 (2012-12-04), pages 1 - 5 * |
佚名: "GenBank: JX112804.1", 《NCBI》, 12 September 2013 (2013-09-12), pages 1 - 5 * |
佚名: "GenBank: KC492738.1", 《NCBI》, 23 December 2013 (2013-12-23), pages 1 - 5 * |
佚名: "GenBank: KF835515.1", 《NCBI》, 8 December 2013 (2013-12-08), pages 1 - 5 * |
袁瑞: "HIV-1亚型及其耐药突变的分子流行病学研究", 《全国优秀硕士学位论文(电子期刊)医药卫生科技辑》, no. 2017, 15 March 2017 (2017-03-15), pages 1 - 101 * |
Also Published As
Publication number | Publication date |
---|---|
WO2020214203A1 (en) | 2020-10-22 |
US20220211835A1 (en) | 2022-07-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2019271972B2 (en) | Adenovirus polynucleotides and polypeptides | |
AU2019204982B2 (en) | Recombinant HCMV and RhCMV Vectors and Uses Thereof | |
CN109790548B (zh) | 腺病毒载体 | |
KR102582561B1 (ko) | 비인간 대형 유인원 아데노바이러스 핵산-서열 및 아미노산-서열, 이를 포함하는 벡터 및 그의 용도 | |
KR101761425B1 (ko) | 시미안 아데노바이러스 핵산- 및 아미노산-서열, 이를 포함하는 벡터 및 이의 용도 | |
BE1023916B1 (fr) | Nouvel adenovirus | |
KR102608590B1 (ko) | 종양 살상 바이러스 및 이의 사용방법 | |
KR102205348B1 (ko) | 외인성 항원을 포함하는 인간 시토메갈로바이러스 | |
KR101668163B1 (ko) | Cmv용 백신으로서의 조건부 복제 시토메갈로바이러스 | |
JP2024073576A (ja) | 改変アデノウイルス | |
KR20220041844A (ko) | Hiv 항원 및 mhc 복합체 | |
JP2023145678A (ja) | エプスタインバールウイルス抗原構築物 | |
KR20200066349A (ko) | 복제 가능 아데노바이러스 벡터 | |
CN113897388A (zh) | 一种新型黑猩猩腺病毒载体及其构建方法和应用 | |
KR20200083540A (ko) | 시토메갈로바이러스의 안정한 제형 | |
CN114269363A (zh) | 用于hiv疫苗应用的复制缺陷型腺病毒载体 | |
CN111065408A (zh) | 免疫原性组合物 | |
RU2800361C2 (ru) | Стабильные составы цитомегаловируса | |
NL2023464B1 (en) | Oncolytic Non-human adenoviruses and uses thereof | |
CN113088530A (zh) | 一种基于黑猩猩ChAd63型腺病毒的表达载体及其构建方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |