KR20230069157A - Recombinant adeno-associated virus (rAAV) encoding GJB2 and uses thereof - Google Patents
Recombinant adeno-associated virus (rAAV) encoding GJB2 and uses thereof Download PDFInfo
- Publication number
- KR20230069157A KR20230069157A KR1020237012321A KR20237012321A KR20230069157A KR 20230069157 A KR20230069157 A KR 20230069157A KR 1020237012321 A KR1020237012321 A KR 1020237012321A KR 20237012321 A KR20237012321 A KR 20237012321A KR 20230069157 A KR20230069157 A KR 20230069157A
- Authority
- KR
- South Korea
- Prior art keywords
- gjb2
- nucleic acid
- cells
- isolated nucleic
- seq
- Prior art date
Links
- 101000954092 Homo sapiens Gap junction beta-2 protein Proteins 0.000 title claims abstract description 145
- 102100037156 Gap junction beta-2 protein Human genes 0.000 title claims abstract description 35
- 241000702421 Dependoparvovirus Species 0.000 title claims description 23
- 102000055974 Connexin 26 Human genes 0.000 claims abstract description 457
- 108010069156 Connexin 26 Proteins 0.000 claims abstract description 457
- 210000004027 cell Anatomy 0.000 claims abstract description 304
- 239000002773 nucleotide Substances 0.000 claims abstract description 244
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 244
- 230000014509 gene expression Effects 0.000 claims abstract description 195
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 193
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 163
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 163
- 108020003589 5' Untranslated Regions Proteins 0.000 claims abstract description 95
- 239000003623 enhancer Substances 0.000 claims abstract description 93
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 77
- 238000000034 method Methods 0.000 claims abstract description 73
- 210000003027 ear inner Anatomy 0.000 claims abstract description 61
- 108020005345 3' Untranslated Regions Proteins 0.000 claims abstract description 56
- 230000008093 supporting effect Effects 0.000 claims abstract description 48
- 210000002985 organ of corti Anatomy 0.000 claims abstract description 45
- 208000016354 hearing loss disease Diseases 0.000 claims abstract description 40
- 206010011878 Deafness Diseases 0.000 claims abstract description 38
- 239000000203 mixture Substances 0.000 claims abstract description 38
- 230000001105 regulatory effect Effects 0.000 claims abstract description 38
- 208000031514 autosomal recessive nonsyndromic hearing loss 1A Diseases 0.000 claims abstract description 34
- 231100000888 hearing loss Toxicity 0.000 claims abstract description 30
- 230000010370 hearing loss Effects 0.000 claims abstract description 30
- 210000000630 fibrocyte Anatomy 0.000 claims abstract description 29
- 239000013598 vector Substances 0.000 claims description 191
- 101150034593 Gjb2 gene Proteins 0.000 claims description 110
- 102000048085 human GJB2 Human genes 0.000 claims description 108
- 241000282414 Homo sapiens Species 0.000 claims description 70
- 210000003477 cochlea Anatomy 0.000 claims description 68
- 230000035772 mutation Effects 0.000 claims description 53
- 108090000565 Capsid Proteins Proteins 0.000 claims description 51
- 102100023321 Ceruloplasmin Human genes 0.000 claims description 51
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 39
- 239000013607 AAV vector Substances 0.000 claims description 31
- 210000001608 connective tissue cell Anatomy 0.000 claims description 25
- 101100276179 Homo sapiens GJB2 gene Proteins 0.000 claims description 23
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 23
- 108010006025 bovine growth hormone Proteins 0.000 claims description 22
- 210000000234 capsid Anatomy 0.000 claims description 21
- 238000002347 injection Methods 0.000 claims description 21
- 239000007924 injection Substances 0.000 claims description 21
- 239000008194 pharmaceutical composition Substances 0.000 claims description 19
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 18
- 241000702423 Adeno-associated virus - 2 Species 0.000 claims description 17
- 201000010099 disease Diseases 0.000 claims description 16
- 239000012528 membrane Substances 0.000 claims description 16
- 210000004379 membrane Anatomy 0.000 claims description 16
- 238000012217 deletion Methods 0.000 claims description 15
- 230000037430 deletion Effects 0.000 claims description 15
- 230000002792 vascular Effects 0.000 claims description 14
- 241000124008 Mammalia Species 0.000 claims description 13
- 238000011282 treatment Methods 0.000 claims description 12
- 210000002205 spiral ligament of cochlea Anatomy 0.000 claims description 11
- 239000013603 viral vector Substances 0.000 claims description 11
- 241001655883 Adeno-associated virus - 1 Species 0.000 claims description 10
- 230000001720 vestibular Effects 0.000 claims description 10
- 241001634120 Adeno-associated virus - 5 Species 0.000 claims description 8
- 210000000270 basal cell Anatomy 0.000 claims description 8
- 210000000238 cell of claudius Anatomy 0.000 claims description 8
- 230000001124 posttranscriptional effect Effects 0.000 claims description 8
- 238000011144 upstream manufacturing Methods 0.000 claims description 7
- 241000972680 Adeno-associated virus - 6 Species 0.000 claims description 6
- 241001164823 Adeno-associated virus - 7 Species 0.000 claims description 6
- 241001164825 Adeno-associated virus - 8 Species 0.000 claims description 6
- 241001492404 Woodchuck hepatitis virus Species 0.000 claims description 6
- 239000002775 capsule Substances 0.000 claims description 6
- 239000003937 drug carrier Substances 0.000 claims description 6
- 238000003780 insertion Methods 0.000 claims description 5
- 230000037431 insertion Effects 0.000 claims description 5
- 210000002480 semicircular canal Anatomy 0.000 claims description 5
- 230000010415 tropism Effects 0.000 claims description 5
- 241000202702 Adeno-associated virus - 3 Species 0.000 claims description 4
- 241000580270 Adeno-associated virus - 4 Species 0.000 claims description 4
- 108091026898 Leader sequence (mRNA) Proteins 0.000 claims description 4
- 108020004485 Nonsense Codon Proteins 0.000 claims description 4
- 210000000988 bone and bone Anatomy 0.000 claims description 4
- 230000037434 nonsense mutation Effects 0.000 claims description 4
- 239000012634 fragment Substances 0.000 claims description 3
- 210000000262 cochlear duct Anatomy 0.000 claims description 2
- 239000013600 plasmid vector Substances 0.000 claims description 2
- 210000005077 saccule Anatomy 0.000 claims description 2
- 210000001605 scala vestibuli Anatomy 0.000 claims description 2
- 210000000645 stria vascularis Anatomy 0.000 claims description 2
- 210000000116 strial intermediate cell Anatomy 0.000 claims description 2
- 108091093126 WHP Posttrascriptional Response Element Proteins 0.000 claims 3
- 238000002955 isolation Methods 0.000 claims 1
- 230000013707 sensory perception of sound Effects 0.000 abstract description 29
- 102000004169 proteins and genes Human genes 0.000 abstract description 23
- 231100000895 deafness Toxicity 0.000 abstract description 8
- 108020004414 DNA Proteins 0.000 description 127
- 108091026890 Coding region Proteins 0.000 description 54
- 210000002768 hair cell Anatomy 0.000 description 54
- 101100276182 Mus musculus Gjb2 gene Proteins 0.000 description 50
- 108700019146 Transgenes Proteins 0.000 description 39
- 210000001519 tissue Anatomy 0.000 description 32
- 208000002267 Anti-neutrophil cytoplasmic antibody-associated vasculitis Diseases 0.000 description 29
- 241000699670 Mus sp. Species 0.000 description 28
- 230000006870 function Effects 0.000 description 26
- 241000282567 Macaca fascicularis Species 0.000 description 24
- 210000003976 gap junction Anatomy 0.000 description 24
- 241000699666 Mus <mouse, genus> Species 0.000 description 23
- 210000002569 neuron Anatomy 0.000 description 22
- 230000027455 binding Effects 0.000 description 20
- 108091070501 miRNA Proteins 0.000 description 18
- 239000000047 product Substances 0.000 description 18
- -1 miR-219-2-3p Proteins 0.000 description 16
- 101000834253 Gallus gallus Actin, cytoplasmic 1 Proteins 0.000 description 15
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 description 15
- 210000000067 inner hair cell Anatomy 0.000 description 14
- 201000006790 nonsyndromic deafness Diseases 0.000 description 14
- 102000010970 Connexin Human genes 0.000 description 13
- 108050001175 Connexin Proteins 0.000 description 13
- 230000002068 genetic effect Effects 0.000 description 13
- 239000002679 microRNA Substances 0.000 description 13
- 239000000243 solution Substances 0.000 description 13
- 238000001415 gene therapy Methods 0.000 description 12
- 239000013608 rAAV vector Substances 0.000 description 11
- 235000002639 sodium chloride Nutrition 0.000 description 11
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- 150000001413 amino acids Chemical class 0.000 description 10
- 239000003795 chemical substances by application Substances 0.000 description 10
- 230000001939 inductive effect Effects 0.000 description 10
- 210000001323 spiral ganglion Anatomy 0.000 description 10
- 238000006467 substitution reaction Methods 0.000 description 10
- 241000700605 Viruses Species 0.000 description 9
- 238000002372 labelling Methods 0.000 description 9
- 108091056924 miR-124 stem-loop Proteins 0.000 description 9
- 239000004094 surface-active agent Substances 0.000 description 9
- 108020004705 Codon Proteins 0.000 description 8
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 8
- KPKZJLCSROULON-QKGLWVMZSA-N Phalloidin Chemical compound N1C(=O)[C@@H]([C@@H](O)C)NC(=O)[C@H](C)NC(=O)[C@H](C[C@@](C)(O)CO)NC(=O)[C@H](C2)NC(=O)[C@H](C)NC(=O)[C@@H]3C[C@H](O)CN3C(=O)[C@@H]1CSC1=C2C2=CC=CC=C2N1 KPKZJLCSROULON-QKGLWVMZSA-N 0.000 description 8
- 101100069419 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GRE2 gene Proteins 0.000 description 8
- 101100069420 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GRE3 gene Proteins 0.000 description 8
- 239000000872 buffer Substances 0.000 description 8
- 238000009472 formulation Methods 0.000 description 8
- 210000003990 interdental cell Anatomy 0.000 description 8
- 239000007788 liquid Substances 0.000 description 8
- 238000004519 manufacturing process Methods 0.000 description 8
- 239000000463 material Substances 0.000 description 8
- 238000013518 transcription Methods 0.000 description 8
- 230000035897 transcription Effects 0.000 description 8
- 102000007469 Actins Human genes 0.000 description 7
- 108010085238 Actins Proteins 0.000 description 7
- 241001465754 Metazoa Species 0.000 description 7
- 101100069417 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GRE1 gene Proteins 0.000 description 7
- 210000004556 brain Anatomy 0.000 description 7
- 238000011161 development Methods 0.000 description 7
- 230000018109 developmental process Effects 0.000 description 7
- 208000035475 disorder Diseases 0.000 description 7
- 239000011780 sodium chloride Substances 0.000 description 7
- 230000008685 targeting Effects 0.000 description 7
- 230000001225 therapeutic effect Effects 0.000 description 7
- 238000010361 transduction Methods 0.000 description 7
- 230000026683 transduction Effects 0.000 description 7
- 210000002845 virion Anatomy 0.000 description 7
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- 108091023040 Transcription factor Proteins 0.000 description 6
- 102000040945 Transcription factor Human genes 0.000 description 6
- 238000007792 addition Methods 0.000 description 6
- 230000000295 complement effect Effects 0.000 description 6
- 238000010166 immunofluorescence Methods 0.000 description 6
- 230000000670 limiting effect Effects 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 238000003752 polymerase chain reaction Methods 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 231100000419 toxicity Toxicity 0.000 description 6
- 230000001988 toxicity Effects 0.000 description 6
- 230000002103 transcriptional effect Effects 0.000 description 6
- 230000003612 virological effect Effects 0.000 description 6
- 108010077544 Chromatin Proteins 0.000 description 5
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 5
- 108091092195 Intron Proteins 0.000 description 5
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 5
- 210000003030 auditory receptor cell Anatomy 0.000 description 5
- 210000003169 central nervous system Anatomy 0.000 description 5
- 210000003483 chromatin Anatomy 0.000 description 5
- 230000001086 cytosolic effect Effects 0.000 description 5
- 239000006185 dispersion Substances 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 210000000981 epithelium Anatomy 0.000 description 5
- 238000010348 incorporation Methods 0.000 description 5
- 239000004615 ingredient Substances 0.000 description 5
- 238000011813 knockout mouse model Methods 0.000 description 5
- 239000002502 liposome Substances 0.000 description 5
- 238000010369 molecular cloning Methods 0.000 description 5
- 239000002245 particle Substances 0.000 description 5
- 230000008488 polyadenylation Effects 0.000 description 5
- 239000000843 powder Substances 0.000 description 5
- 108090000765 processed proteins & peptides Proteins 0.000 description 5
- 150000003839 salts Chemical class 0.000 description 5
- 230000001953 sensory effect Effects 0.000 description 5
- 239000002904 solvent Substances 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 238000011200 topical administration Methods 0.000 description 5
- 238000012546 transfer Methods 0.000 description 5
- 239000003981 vehicle Substances 0.000 description 5
- 241000649045 Adeno-associated virus 10 Species 0.000 description 4
- 108091062157 Cis-regulatory element Proteins 0.000 description 4
- 108091028066 Mir-126 Proteins 0.000 description 4
- 108010009047 Myosin VIIa Proteins 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 108010009711 Phalloidine Proteins 0.000 description 4
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 4
- 102100031835 Unconventional myosin-VIIa Human genes 0.000 description 4
- 239000004480 active ingredient Substances 0.000 description 4
- 230000030833 cell death Effects 0.000 description 4
- OSASVXMJTNOKOY-UHFFFAOYSA-N chlorobutanol Chemical compound CC(C)(O)C(Cl)(Cl)Cl OSASVXMJTNOKOY-UHFFFAOYSA-N 0.000 description 4
- 238000003776 cleavage reaction Methods 0.000 description 4
- 210000000805 cytoplasm Anatomy 0.000 description 4
- 239000003814 drug Substances 0.000 description 4
- 210000002919 epithelial cell Anatomy 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 210000002950 fibroblast Anatomy 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 108091040501 miR-129 stem-loop Proteins 0.000 description 4
- 108091045757 miR-129-3 stem-loop Proteins 0.000 description 4
- 108091090758 miR-129-4 stem-loop Proteins 0.000 description 4
- 108091065139 miR-129-5 stem-loop Proteins 0.000 description 4
- 108091047467 miR-136 stem-loop Proteins 0.000 description 4
- 108091079658 miR-142-1 stem-loop Proteins 0.000 description 4
- 108091071830 miR-142-2 stem-loop Proteins 0.000 description 4
- 108091032985 miR-382 Proteins 0.000 description 4
- 108091050135 miR-382 stem-loop Proteins 0.000 description 4
- 238000009126 molecular therapy Methods 0.000 description 4
- 238000005457 optimization Methods 0.000 description 4
- 238000009256 replacement therapy Methods 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 230000007017 scission Effects 0.000 description 4
- 238000003860 storage Methods 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 241000023308 Acca Species 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- 108700028369 Alleles Proteins 0.000 description 3
- 241000283707 Capra Species 0.000 description 3
- 241000282693 Cercopithecidae Species 0.000 description 3
- 230000005788 Cochlea function Effects 0.000 description 3
- 241000701022 Cytomegalovirus Species 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 108091029865 Exogenous DNA Proteins 0.000 description 3
- 108010010803 Gelatin Proteins 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 description 3
- 241001494479 Pecora Species 0.000 description 3
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 3
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 3
- 108091027981 Response element Proteins 0.000 description 3
- 241000714474 Rous sarcoma virus Species 0.000 description 3
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 241000282898 Sus scrofa Species 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000033228 biological regulation Effects 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 239000000969 carrier Substances 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 238000004891 communication Methods 0.000 description 3
- 108091036078 conserved sequence Proteins 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 3
- 239000002612 dispersion medium Substances 0.000 description 3
- 239000008273 gelatin Substances 0.000 description 3
- 229920000159 gelatin Polymers 0.000 description 3
- 235000019322 gelatine Nutrition 0.000 description 3
- 235000011852 gelatine desserts Nutrition 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 235000011187 glycerol Nutrition 0.000 description 3
- 230000036541 health Effects 0.000 description 3
- 210000002570 interstitial cell Anatomy 0.000 description 3
- 150000002632 lipids Chemical class 0.000 description 3
- 238000012423 maintenance Methods 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 244000005700 microbiome Species 0.000 description 3
- 239000002088 nanocapsule Substances 0.000 description 3
- 239000002105 nanoparticle Substances 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 238000004806 packaging method and process Methods 0.000 description 3
- 239000000546 pharmaceutical excipient Substances 0.000 description 3
- 239000002953 phosphate buffered saline Substances 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 239000003755 preservative agent Substances 0.000 description 3
- 102000004196 processed proteins & peptides Human genes 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 210000000130 stem cell Anatomy 0.000 description 3
- 238000007920 subcutaneous administration Methods 0.000 description 3
- 230000004083 survival effect Effects 0.000 description 3
- 230000000946 synaptic effect Effects 0.000 description 3
- 231100000331 toxic Toxicity 0.000 description 3
- 230000002588 toxic effect Effects 0.000 description 3
- 238000011269 treatment regimen Methods 0.000 description 3
- 108010038745 tryptophylglycine Proteins 0.000 description 3
- 241000701161 unidentified adenovirus Species 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 241000649046 Adeno-associated virus 11 Species 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- MAEQBGQTDWDSJQ-LSJOCFKGSA-N Ala-Met-His Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MAEQBGQTDWDSJQ-LSJOCFKGSA-N 0.000 description 2
- 201000004384 Alopecia Diseases 0.000 description 2
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 241000282836 Camelus dromedarius Species 0.000 description 2
- 101150044789 Cap gene Proteins 0.000 description 2
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 2
- 206010010356 Congenital anomaly Diseases 0.000 description 2
- 239000012591 Dulbecco’s Phosphate Buffered Saline Substances 0.000 description 2
- 108060006698 EGF receptor Proteins 0.000 description 2
- 241000283073 Equus caballus Species 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 2
- 206010064571 Gene mutation Diseases 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 2
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 108010033040 Histones Proteins 0.000 description 2
- 101000878605 Homo sapiens Low affinity immunoglobulin epsilon Fc receptor Proteins 0.000 description 2
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 2
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 2
- MSASLZGZQAXVFP-PEDHHIEDSA-N Ile-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N MSASLZGZQAXVFP-PEDHHIEDSA-N 0.000 description 2
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 2
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 2
- 102100038007 Low affinity immunoglobulin epsilon Fc receptor Human genes 0.000 description 2
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- 108091007685 MIR541 Proteins 0.000 description 2
- 108091007700 MIR543 Proteins 0.000 description 2
- MNNKPHGAPRUKMW-BPUTZDHNSA-N Met-Asp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 MNNKPHGAPRUKMW-BPUTZDHNSA-N 0.000 description 2
- 108091046841 MiR-150 Proteins 0.000 description 2
- 108091028076 Mir-127 Proteins 0.000 description 2
- 108091027966 Mir-137 Proteins 0.000 description 2
- 108091027766 Mir-143 Proteins 0.000 description 2
- 108091062140 Mir-223 Proteins 0.000 description 2
- 108091061758 Mir-433 Proteins 0.000 description 2
- 108091027559 Mir-96 microRNA Proteins 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- 108091092724 Noncoding DNA Proteins 0.000 description 2
- 241000009328 Perro Species 0.000 description 2
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 2
- QRUOLOPKCOEZKU-HJWJTTGWSA-N Phe-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N QRUOLOPKCOEZKU-HJWJTTGWSA-N 0.000 description 2
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 2
- 241000288906 Primates Species 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 2
- ZTHYODDOHIVTJV-UHFFFAOYSA-N Propyl gallate Chemical compound CCCOC(=O)C1=CC(O)=C(O)C(O)=C1 ZTHYODDOHIVTJV-UHFFFAOYSA-N 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 108091081021 Sense strand Proteins 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- RAHZWNYVWXNFOC-UHFFFAOYSA-N Sulphur dioxide Chemical compound O=S=O RAHZWNYVWXNFOC-UHFFFAOYSA-N 0.000 description 2
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- 108091036066 Three prime untranslated region Proteins 0.000 description 2
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 2
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 2
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 2
- PFMAFMPJJSHNDW-ZKWXMUAHSA-N Val-Cys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PFMAFMPJJSHNDW-ZKWXMUAHSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 230000002776 aggregation Effects 0.000 description 2
- 238000004220 aggregation Methods 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 230000000844 anti-bacterial effect Effects 0.000 description 2
- 239000003429 antifungal agent Substances 0.000 description 2
- 229940121375 antifungal agent Drugs 0.000 description 2
- 230000001640 apoptogenic effect Effects 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 239000012472 biological sample Substances 0.000 description 2
- 210000004204 blood vessel Anatomy 0.000 description 2
- 239000006172 buffering agent Substances 0.000 description 2
- 239000011575 calcium Substances 0.000 description 2
- 230000006727 cell loss Effects 0.000 description 2
- 230000009391 cell specific gene expression Effects 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 229960004926 chlorobutanol Drugs 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 210000002808 connective tissue Anatomy 0.000 description 2
- 238000011109 contamination Methods 0.000 description 2
- ALEXXDVDDISNDU-JZYPGELDSA-N cortisol 21-acetate Chemical compound C1CC2=CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@@](C(=O)COC(=O)C)(O)[C@@]1(C)C[C@@H]2O ALEXXDVDDISNDU-JZYPGELDSA-N 0.000 description 2
- 230000002939 deleterious effect Effects 0.000 description 2
- 239000003085 diluting agent Substances 0.000 description 2
- 210000003060 endolymph Anatomy 0.000 description 2
- BEFDCLMNVWHSGT-UHFFFAOYSA-N ethenylcyclopentane Chemical compound C=CC1CCCC1 BEFDCLMNVWHSGT-UHFFFAOYSA-N 0.000 description 2
- CBOQJANXLMLOSS-UHFFFAOYSA-N ethyl vanillin Chemical compound CCOC1=CC(C=O)=CC=C1O CBOQJANXLMLOSS-UHFFFAOYSA-N 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 238000002073 fluorescence micrograph Methods 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 230000003676 hair loss Effects 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 206010021198 ichthyosis Diseases 0.000 description 2
- 238000002991 immunohistochemical analysis Methods 0.000 description 2
- 230000002055 immunohistochemical effect Effects 0.000 description 2
- 230000001771 impaired effect Effects 0.000 description 2
- 238000000126 in silico method Methods 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000004941 influx Effects 0.000 description 2
- 238000001802 infusion Methods 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 229940102223 injectable solution Drugs 0.000 description 2
- 238000007918 intramuscular administration Methods 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 230000002427 irreversible effect Effects 0.000 description 2
- 231100000225 lethality Toxicity 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 230000004777 loss-of-function mutation Effects 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 230000028161 membrane depolarization Effects 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 108091058688 miR-141 stem-loop Proteins 0.000 description 2
- 108091032320 miR-146 stem-loop Proteins 0.000 description 2
- 108091024530 miR-146a stem-loop Proteins 0.000 description 2
- 108091059964 miR-154 stem-loop Proteins 0.000 description 2
- 108091023796 miR-182 stem-loop Proteins 0.000 description 2
- 108091029500 miR-183 stem-loop Proteins 0.000 description 2
- 108091074450 miR-200c stem-loop Proteins 0.000 description 2
- 108091040861 miR-300 stem-loop Proteins 0.000 description 2
- 108091062225 miR-323 stem-loop Proteins 0.000 description 2
- 108091089005 miR-329 stem-loop Proteins 0.000 description 2
- 108091065201 miR-341 stem-loop Proteins 0.000 description 2
- 108091057188 miR-369 stem-loop Proteins 0.000 description 2
- 108091087125 miR-376a stem-loop Proteins 0.000 description 2
- 108091073138 miR-376a-3 stem-loop Proteins 0.000 description 2
- 108091079007 miR-376b stem-loop Proteins 0.000 description 2
- 108091071616 miR-376c stem-loop Proteins 0.000 description 2
- 108091079015 miR-379 Proteins 0.000 description 2
- 108091086215 miR-379 stem-loop Proteins 0.000 description 2
- 108091029369 miR-410 stem-loop Proteins 0.000 description 2
- 108091023805 miR-411 stem-loop Proteins 0.000 description 2
- 108091048162 miR-434 stem-loop Proteins 0.000 description 2
- 108091037327 miR-449 stem-loop Proteins 0.000 description 2
- 108091040525 miR-449a stem-loop Proteins 0.000 description 2
- 108091031190 miR-495 stem-loop Proteins 0.000 description 2
- 108091023526 miR-541 stem-loop Proteins 0.000 description 2
- 108091076271 miR-543 stem-loop Proteins 0.000 description 2
- 108091057017 miR-551b stem-loop Proteins 0.000 description 2
- 108091086713 miR-96 stem-loop Proteins 0.000 description 2
- 108091070961 miR-96-3 stem-loop Proteins 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000001537 neural effect Effects 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 229960003742 phenol Drugs 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 108010079892 phosphoglycerol kinase Proteins 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 239000011148 porous material Substances 0.000 description 2
- 230000002035 prolonged effect Effects 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 239000012474 protein marker Substances 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 230000007115 recruitment Effects 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 230000014493 regulation of gene expression Effects 0.000 description 2
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 102200145331 rs35887622 Human genes 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 239000004334 sorbic acid Substances 0.000 description 2
- 235000010199 sorbic acid Nutrition 0.000 description 2
- 229940075582 sorbic acid Drugs 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 238000010186 staining Methods 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- GETQZCLCWQTVFV-UHFFFAOYSA-N trimethylamine Chemical compound CN(C)C GETQZCLCWQTVFV-UHFFFAOYSA-N 0.000 description 2
- 238000010246 ultrastructural analysis Methods 0.000 description 2
- 210000005166 vasculature Anatomy 0.000 description 2
- QBYIENPQHBMVBV-HFEGYEGKSA-N (2R)-2-hydroxy-2-phenylacetic acid Chemical compound O[C@@H](C(O)=O)c1ccccc1.O[C@@H](C(O)=O)c1ccccc1 QBYIENPQHBMVBV-HFEGYEGKSA-N 0.000 description 1
- IIZPXYDJLKNOIY-JXPKJXOSSA-N 1-palmitoyl-2-arachidonoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCC\C=C/C\C=C/C\C=C/C\C=C/CCCCC IIZPXYDJLKNOIY-JXPKJXOSSA-N 0.000 description 1
- CHHHXKFHOYLYRE-UHFFFAOYSA-M 2,4-Hexadienoic acid, potassium salt (1:1), (2E,4E)- Chemical compound [K+].CC=CC=CC([O-])=O CHHHXKFHOYLYRE-UHFFFAOYSA-M 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- WXNZTHHGJRFXKQ-UHFFFAOYSA-N 4-chlorophenol Chemical compound OC1=CC=C(Cl)C=C1 WXNZTHHGJRFXKQ-UHFFFAOYSA-N 0.000 description 1
- WOVKYSAHUYNSMH-RRKCRQDMSA-N 5-bromodeoxyuridine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(Br)=C1 WOVKYSAHUYNSMH-RRKCRQDMSA-N 0.000 description 1
- 241000649044 Adeno-associated virus 9 Species 0.000 description 1
- 206010067484 Adverse reaction Diseases 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- GRPHQEMIFDPKOE-HGNGGELXSA-N Ala-His-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GRPHQEMIFDPKOE-HGNGGELXSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 1
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 1
- JQFJNGVSGOUQDH-XIRDDKMYSA-N Arg-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JQFJNGVSGOUQDH-XIRDDKMYSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- FSPQNLYOFCXUCE-BPUTZDHNSA-N Arg-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FSPQNLYOFCXUCE-BPUTZDHNSA-N 0.000 description 1
- BFDDUDQCPJWQRQ-IHRRRGAJSA-N Arg-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O BFDDUDQCPJWQRQ-IHRRRGAJSA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- NTXNUXPCNRDMAF-WFBYXXMGSA-N Asn-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC(N)=O)C)C(O)=O)=CNC2=C1 NTXNUXPCNRDMAF-WFBYXXMGSA-N 0.000 description 1
- AYZAWXAPBAYCHO-CIUDSAMLSA-N Asn-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N AYZAWXAPBAYCHO-CIUDSAMLSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- MOHUTCNYQLMARY-GUBZILKMSA-N Asn-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MOHUTCNYQLMARY-GUBZILKMSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- DAYDURRBMDCCFL-AAEUAGOBSA-N Asn-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N DAYDURRBMDCCFL-AAEUAGOBSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- LRCIOEVFVGXZKB-BZSNNMDCSA-N Asn-Tyr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LRCIOEVFVGXZKB-BZSNNMDCSA-N 0.000 description 1
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 208000017234 Bone cyst Diseases 0.000 description 1
- 101150044301 CRYL1 gene Proteins 0.000 description 1
- 101100507655 Canis lupus familiaris HSPA1 gene Proteins 0.000 description 1
- 241000700199 Cavia porcellus Species 0.000 description 1
- 101710093463 Clarin-1 Proteins 0.000 description 1
- 102100031060 Clarin-1 Human genes 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 206010010904 Convulsion Diseases 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- HRJLVSQKBLZHSR-ZLUOBGJFSA-N Cys-Asn-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O HRJLVSQKBLZHSR-ZLUOBGJFSA-N 0.000 description 1
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 1
- MFMDKTLJCUBQIC-MXAVVETBSA-N Cys-Phe-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MFMDKTLJCUBQIC-MXAVVETBSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 230000008301 DNA looping mechanism Effects 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 206010011891 Deafness neurosensory Diseases 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- FEWJPZIEWOKRBE-JCYAYHJZSA-N Dextrotartaric acid Chemical compound OC(=O)[C@H](O)[C@@H](O)C(O)=O FEWJPZIEWOKRBE-JCYAYHJZSA-N 0.000 description 1
- 101150083557 Ear gene Proteins 0.000 description 1
- UPEZCKBFRMILAV-JNEQICEOSA-N Ecdysone Natural products O=C1[C@H]2[C@@](C)([C@@H]3C([C@@]4(O)[C@@](C)([C@H]([C@H]([C@@H](O)CCC(O)(C)C)C)CC4)CC3)=C1)C[C@H](O)[C@H](O)C2 UPEZCKBFRMILAV-JNEQICEOSA-N 0.000 description 1
- 101710120810 Elongation factor 1-alpha 1 Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000283074 Equus asinus Species 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 1
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- GPISLLFQNHELLK-DCAQKATOSA-N Gln-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GPISLLFQNHELLK-DCAQKATOSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 1
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 1
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- 102000018932 HSP70 Heat-Shock Proteins Human genes 0.000 description 1
- 108010027992 HSP70 Heat-Shock Proteins Proteins 0.000 description 1
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 1
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 1
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 1
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 1
- 102000006947 Histones Human genes 0.000 description 1
- 101000992973 Homo sapiens Clarin-1 Proteins 0.000 description 1
- 241000700588 Human alphaherpesvirus 1 Species 0.000 description 1
- 206010020649 Hyperkeratosis Diseases 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 1
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 1
- 208000001126 Keratosis Diseases 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- PIHFVNPEAHFNLN-KKUMJFAQSA-N Leu-Cys-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N PIHFVNPEAHFNLN-KKUMJFAQSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- WPIKRJDRQVFRHP-TUSQITKMSA-N Leu-Trp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O WPIKRJDRQVFRHP-TUSQITKMSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- VWJFOUBDZIUXGA-AVGNSLFASA-N Lys-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N VWJFOUBDZIUXGA-AVGNSLFASA-N 0.000 description 1
- 241000282553 Macaca Species 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- IHITVQKJXQQGLJ-LPEHRKFASA-N Met-Asn-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N IHITVQKJXQQGLJ-LPEHRKFASA-N 0.000 description 1
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 1
- RMLWDZINJUDMEB-IHRRRGAJSA-N Met-Tyr-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RMLWDZINJUDMEB-IHRRRGAJSA-N 0.000 description 1
- MUDYEFAKNSTFAI-JYJNAYRXSA-N Met-Tyr-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O MUDYEFAKNSTFAI-JYJNAYRXSA-N 0.000 description 1
- OTKQHDPECKUDSB-SZMVWBNQSA-N Met-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OTKQHDPECKUDSB-SZMVWBNQSA-N 0.000 description 1
- 108700011259 MicroRNAs Proteins 0.000 description 1
- 241000713333 Mouse mammary tumor virus Species 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 206010060860 Neurological symptom Diseases 0.000 description 1
- 208000007256 Nevus Diseases 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 235000019483 Peanut oil Nutrition 0.000 description 1
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 1
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 1
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 1
- HQCSLJFGZYOXHW-KKUMJFAQSA-N Phe-His-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O)N HQCSLJFGZYOXHW-KKUMJFAQSA-N 0.000 description 1
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 1
- DEZCWWXTRAKZKJ-UFYCRDLUSA-N Phe-Phe-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DEZCWWXTRAKZKJ-UFYCRDLUSA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 1
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 229920001214 Polysorbate 60 Polymers 0.000 description 1
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 1
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 1
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 1
- AWJGUZSYVIVZGP-YUMQZZPRSA-N Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 AWJGUZSYVIVZGP-YUMQZZPRSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- IWYDHOAUDWTVEP-UHFFFAOYSA-N R-2-phenyl-2-hydroxyacetic acid Natural products OC(=O)C(O)C1=CC=CC=C1 IWYDHOAUDWTVEP-UHFFFAOYSA-N 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 208000009966 Sensorineural Hearing Loss Diseases 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- ZGFRMNZZTOVBOU-CIUDSAMLSA-N Ser-Met-Gln Chemical compound N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)O ZGFRMNZZTOVBOU-CIUDSAMLSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- 241000714229 Simian retrovirus 1 Species 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- FEWJPZIEWOKRBE-UHFFFAOYSA-N Tartaric acid Natural products [H+].[H+].[O-]C(=O)C(O)C(O)C([O-])=O FEWJPZIEWOKRBE-UHFFFAOYSA-N 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 1
- IWAVRIPRTCJAQO-HSHDSVGOSA-N Thr-Pro-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IWAVRIPRTCJAQO-HSHDSVGOSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 108010012306 Tn5 transposase Proteins 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- XZLHHHYSWIYXHD-XIRDDKMYSA-N Trp-Gln-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XZLHHHYSWIYXHD-XIRDDKMYSA-N 0.000 description 1
- YXONONCLMLHWJX-SZMVWBNQSA-N Trp-Glu-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 YXONONCLMLHWJX-SZMVWBNQSA-N 0.000 description 1
- YRSOERSDNRSCBC-XIRDDKMYSA-N Trp-His-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CS)C(=O)O)N YRSOERSDNRSCBC-XIRDDKMYSA-N 0.000 description 1
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- KXIQQAWIPDDVOE-BPUTZDHNSA-N Trp-Pro-Cys Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O KXIQQAWIPDDVOE-BPUTZDHNSA-N 0.000 description 1
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 description 1
- GQYPNFIFJRNDPY-ONUFPDRFSA-N Trp-Trp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC=3C4=CC=CC=C4NC=3)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 GQYPNFIFJRNDPY-ONUFPDRFSA-N 0.000 description 1
- UUZYQOUJTORBQO-ZVZYQTTQSA-N Trp-Val-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 UUZYQOUJTORBQO-ZVZYQTTQSA-N 0.000 description 1
- SDNVRAKIJVKAGS-LKTVYLICSA-N Tyr-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N SDNVRAKIJVKAGS-LKTVYLICSA-N 0.000 description 1
- HKIUVWMZYFBIHG-KKUMJFAQSA-N Tyr-Arg-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O HKIUVWMZYFBIHG-KKUMJFAQSA-N 0.000 description 1
- CYDVHRFXDMDMGX-KKUMJFAQSA-N Tyr-Asn-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O CYDVHRFXDMDMGX-KKUMJFAQSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- ZAGPDPNPWYPEIR-SRVKXCTJSA-N Tyr-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ZAGPDPNPWYPEIR-SRVKXCTJSA-N 0.000 description 1
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 1
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- SJLVYVZBFDTRCG-DCAQKATOSA-N Val-Lys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N SJLVYVZBFDTRCG-DCAQKATOSA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- ZEBRMWPTJNHXAJ-JYJNAYRXSA-N Val-Phe-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N ZEBRMWPTJNHXAJ-JYJNAYRXSA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- 239000003070 absorption delaying agent Substances 0.000 description 1
- 235000011054 acetic acid Nutrition 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 230000001464 adherent effect Effects 0.000 description 1
- 230000006838 adverse reaction Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 235000010419 agar Nutrition 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 230000000172 allergic effect Effects 0.000 description 1
- UPEZCKBFRMILAV-UHFFFAOYSA-N alpha-Ecdysone Natural products C1C(O)C(O)CC2(C)C(CCC3(C(C(C(O)CCC(C)(C)O)C)CCC33O)C)C3=CC(=O)C21 UPEZCKBFRMILAV-UHFFFAOYSA-N 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 239000000908 ammonium hydroxide Substances 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 208000010668 atopic eczema Diseases 0.000 description 1
- 208000025341 autosomal recessive disease Diseases 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 210000002469 basement membrane Anatomy 0.000 description 1
- 238000003339 best practice Methods 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 210000000133 brain stem Anatomy 0.000 description 1
- 239000007975 buffered saline Substances 0.000 description 1
- 230000003139 buffering effect Effects 0.000 description 1
- AXCZMVOFGPJBDE-UHFFFAOYSA-L calcium dihydroxide Chemical compound [OH-].[OH-].[Ca+2] AXCZMVOFGPJBDE-UHFFFAOYSA-L 0.000 description 1
- 239000000920 calcium hydroxide Substances 0.000 description 1
- 229910001861 calcium hydroxide Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 239000001569 carbon dioxide Substances 0.000 description 1
- 229910002092 carbon dioxide Inorganic materials 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 230000011712 cell development Effects 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000004098 cellular respiration Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 239000000084 colloidal system Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000012938 design process Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- UREBDLICKHMUKA-CXSFZGCWSA-N dexamethasone Chemical compound C1CC2=CC(=O)C=C[C@]2(C)[C@]2(F)[C@@H]1[C@@H]1C[C@@H](C)[C@@](C(=O)CO)(O)[C@@]1(C)C[C@@H]2O UREBDLICKHMUKA-CXSFZGCWSA-N 0.000 description 1
- 229960003957 dexamethasone Drugs 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- UGMCXQCYOVCMTB-UHFFFAOYSA-K dihydroxy(stearato)aluminium Chemical compound CCCCCCCCCCCCCCCCCC(=O)O[Al](O)O UGMCXQCYOVCMTB-UHFFFAOYSA-K 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- 239000002270 dispersing agent Substances 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003221 ear drop Substances 0.000 description 1
- 229940047652 ear drops Drugs 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- UPEZCKBFRMILAV-JMZLNJERSA-N ecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@H]([C@H](O)CCC(C)(C)O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 UPEZCKBFRMILAV-JMZLNJERSA-N 0.000 description 1
- 230000002500 effect on skin Effects 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- 229940073505 ethyl vanillin Drugs 0.000 description 1
- 210000001808 exosome Anatomy 0.000 description 1
- 229960004887 ferric hydroxide Drugs 0.000 description 1
- 239000004088 foaming agent Substances 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 229960005150 glycerol Drugs 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 210000003128 head Anatomy 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 230000003284 homeostatic effect Effects 0.000 description 1
- 102000053164 human CLRN1 Human genes 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 235000011167 hydrochloric acid Nutrition 0.000 description 1
- 230000002480 immunoprotective effect Effects 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 239000007972 injectable composition Substances 0.000 description 1
- 150000007529 inorganic bases Chemical class 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 230000002601 intratumoral effect Effects 0.000 description 1
- IEECXTSVVFWGSE-UHFFFAOYSA-M iron(3+);oxygen(2-);hydroxide Chemical compound [OH-].[O-2].[Fe+3] IEECXTSVVFWGSE-UHFFFAOYSA-M 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- JJWLVOIRVHMVIS-UHFFFAOYSA-N isopropylamine Chemical compound CC(C)N JJWLVOIRVHMVIS-UHFFFAOYSA-N 0.000 description 1
- 239000007951 isotonicity adjuster Substances 0.000 description 1
- 206010023332 keratitis Diseases 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 239000000787 lecithin Substances 0.000 description 1
- 235000010445 lecithin Nutrition 0.000 description 1
- 229940067606 lecithin Drugs 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 230000007257 malfunction Effects 0.000 description 1
- 229960002510 mandelic acid Drugs 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 108091041042 miR-18 stem-loop Proteins 0.000 description 1
- 108091062221 miR-18a stem-loop Proteins 0.000 description 1
- 108091076732 miR-99a stem-loop Proteins 0.000 description 1
- 108091064318 miR-99a-1 stem-loop Proteins 0.000 description 1
- 108091086202 miR-99a-2 stem-loop Proteins 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 239000004005 microsphere Substances 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 239000002077 nanosphere Substances 0.000 description 1
- 210000001577 neostriatum Anatomy 0.000 description 1
- 230000003767 neural control Effects 0.000 description 1
- 210000004498 neuroglial cell Anatomy 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 231100000956 nontoxicity Toxicity 0.000 description 1
- 239000003921 oil Substances 0.000 description 1
- 235000019198 oils Nutrition 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 210000000046 organ of corti supporting cell Anatomy 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 150000007530 organic bases Chemical class 0.000 description 1
- 235000006408 oxalic acid Nutrition 0.000 description 1
- 229940090668 parachlorophenol Drugs 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000000312 peanut oil Substances 0.000 description 1
- 239000001814 pectin Substances 0.000 description 1
- 235000010987 pectin Nutrition 0.000 description 1
- 229920001277 pectin Polymers 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 235000011007 phosphoric acid Nutrition 0.000 description 1
- PJNZPQUBCPKICU-UHFFFAOYSA-N phosphoric acid;potassium Chemical compound [K].OP(O)(O)=O PJNZPQUBCPKICU-UHFFFAOYSA-N 0.000 description 1
- 150000003016 phosphoric acids Chemical class 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 229920000771 poly (alkylcyanoacrylate) Polymers 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 239000001103 potassium chloride Substances 0.000 description 1
- 235000011164 potassium chloride Nutrition 0.000 description 1
- 239000004302 potassium sorbate Substances 0.000 description 1
- 235000010241 potassium sorbate Nutrition 0.000 description 1
- 229940069338 potassium sorbate Drugs 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- MFDFERRIHVXMIY-UHFFFAOYSA-N procaine Chemical compound CCN(CC)CCOC(=O)C1=CC=C(N)C=C1 MFDFERRIHVXMIY-UHFFFAOYSA-N 0.000 description 1
- 229960004919 procaine Drugs 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 239000000473 propyl gallate Substances 0.000 description 1
- 229940075579 propyl gallate Drugs 0.000 description 1
- 235000010388 propyl gallate Nutrition 0.000 description 1
- 101150066583 rep gene Proteins 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 210000001525 retina Anatomy 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 102220005903 rs80338939 Human genes 0.000 description 1
- 210000001079 scala tympani Anatomy 0.000 description 1
- 238000005204 segregation Methods 0.000 description 1
- 231100000879 sensorineural hearing loss Toxicity 0.000 description 1
- 208000023573 sensorineural hearing loss disease Diseases 0.000 description 1
- 210000002265 sensory receptor cell Anatomy 0.000 description 1
- 102000027509 sensory receptors Human genes 0.000 description 1
- 108091008691 sensory receptors Proteins 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 239000008159 sesame oil Substances 0.000 description 1
- 235000011803 sesame oil Nutrition 0.000 description 1
- 208000017520 skin disease Diseases 0.000 description 1
- 208000028528 solitary bone cyst Diseases 0.000 description 1
- 230000000392 somatic effect Effects 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 229940044609 sulfur dioxide Drugs 0.000 description 1
- 235000010269 sulphur dioxide Nutrition 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 210000000225 synapse Anatomy 0.000 description 1
- 238000007910 systemic administration Methods 0.000 description 1
- 239000011975 tartaric acid Substances 0.000 description 1
- 235000002906 tartaric acid Nutrition 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- RTKIYNMVFMVABJ-UHFFFAOYSA-L thimerosal Chemical compound [Na+].CC[Hg]SC1=CC=CC=C1C([O-])=O RTKIYNMVFMVABJ-UHFFFAOYSA-L 0.000 description 1
- 229940033663 thimerosal Drugs 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 239000012929 tonicity agent Substances 0.000 description 1
- 230000037426 transcriptional repression Effects 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 238000011820 transgenic animal model Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 239000011882 ultra-fine particle Substances 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 238000001291 vacuum drying Methods 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 239000008158 vegetable oil Substances 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/005—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P27/00—Drugs for disorders of the senses
- A61P27/16—Otologicals
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2750/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
- C12N2750/00011—Details
- C12N2750/14011—Parvoviridae
- C12N2750/14111—Dependovirus, e.g. adenoassociated viruses
- C12N2750/14141—Use of virus, viral particle or viral elements as a vector
- C12N2750/14143—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/008—Vector systems having a special element relevant for transcription cell type or tissue specific enhancer/promoter combination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/48—Vector systems having a special element relevant for transcription regulating transport or export of RNA, e.g. RRE, PRE, WPRE, CTE
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/50—Vector systems having a special element relevant for transcription regulating RNA stability, not being an intron, e.g. poly A signal
Abstract
본 개시내용은, 적어도 부분적으로, 간극 연접 베타 2 (GJB2) 단백질을 GJB2를 정상적으로 발현하는 내이 세포 (예를 들어, 섬유세포 및 코르티 기관 및 근처 영역의 지지 세포)에 전달함으로써 비-증후군성 청각 상실 및 난청 (DFNB1)을 치료하기 위한 조성물 (예를 들어, 단리된 핵산 및 rAAV) 및 방법에 관한 것이다. 본 개시내용의 단리된 핵산은 간극 연접 베타 2 (GJB2) 유전자 조절 요소 (GRE) (예를 들어, GJB2 인핸서, GJB2 프로모터, GJB2 5' UTR, 및/또는 GJB2 3' UTR), 및 GJB2 단백질을 코딩하는 뉴클레오티드 서열을 포함하는 발현 카세트를 포함한다.The present disclosure provides, at least in part, non-syndromic hearing by delivering gap junction beta 2 (GJB2) protein to inner ear cells that normally express GJB2 (eg, fibrocytes and supporting cells of the organ of Corti and nearby areas). Compositions (eg, isolated nucleic acids and rAAV) and methods for treating hearing loss and deafness (DFNB1). Isolated nucleic acids of the present disclosure may comprise a gap junction beta 2 (GJB2) gene regulatory element (GRE) (e.g., a GJB2 enhancer, a GJB2 promoter, a GJB2 5' UTR, and/or a GJB2 3' UTR), and a GJB2 protein. An expression cassette comprising a nucleotide sequence that encodes.
Description
관련 출원related application
본 출원은 35 U.S.C. § 119(e) 하에 2021년 3월 16일에 출원된 미국 가출원, U.S.S.N. 63/161,619, 및 2020년 9월 14일에 출원된 미국 가출원, U.S.S.N. 63/078,233을 우선권 주장하며, 이들 각각은 본원에 참조로 포함된다.This application claims under 35 U.S.C. § 119(e), the U.S. provisional application filed on March 16, 2021, U.S.S.N. 63/161,619, and the U.S. provisional application filed on September 14, 2020, U.S.S.N. 63/078,233, each of which is incorporated herein by reference.
연방 정부 지원 연구federally funded research
본 발명은 국립 보건원에 의해 수여된 DA048787 하의 정부 지원으로 수행되었다. 정부는 본 발명에 특정 권리를 갖는다.This invention was made with government support under DA048787 awarded by the National Institutes of Health. The government has certain rights in this invention.
배경기술background art
내이에서의 간극 연접 베타 2 (GJB2) 발현의 상실은 열성, 경도 내지 극심한 감각신경성 청각 장애를 특징으로 하는, 비증후군성 청각 상실 및 난청 (DFNB1)으로 명명되는 장애의 근본을 이룬다. 이들 환자 중 다수는 극심한 청각 상실을 갖고 태어나며, 이는 아마 출생시에도 비가역적일 것이다. 3분의 2는 출생시 약간의 잔류 청각을 갖고, 이들 중 대부분은 다음 수년에 걸쳐 청각을 상실한다. 따라서, 이들 환자는 DFNB1의 치료를 위한 잠재적 후보이다. GJB2의 이전의 유전자 대체 요법은, GJB2 유전자의 유전자 부가가 세포 생존 및 간극 연접 네트워크를 구제하였음에도 불구하고 청각을 구제하지는 못했다. 청각 구제를 위한 효과적인 GJB2 유전자 대체 요법은 개발되지 않았다.Loss of gap junction beta 2 (GJB2) expression in the inner ear underlies a disorder termed non-syndromic hearing loss and deafness (DFNB1), characterized by recessive, mild to severe sensorineural hearing impairment. Many of these patients are born with severe hearing loss, which is probably irreversible even at birth. Two-thirds have some residual hearing at birth, and most of these lose hearing over the next few years. Thus, these patients are potential candidates for treatment of DFNB1. Previous gene replacement therapy of GJB2 did not rescue hearing, although genetic addition of the GJB2 gene rescued cell survival and gap junction networks. No effective GJB2 gene replacement therapy for hearing rescue has been developed.
요약summary
본 개시내용은 적어도 부분적으로, 간극 연접 베타 2 (GJB2) 유전자 조절 요소 (GRE), 및 GJB2 단백질을 코딩하는 뉴클레오티드 서열을 포함하는 발현 카세트를 포함하는 단리된 핵산에 관한 것이다. 일부 실시양태에서, 발현 카세트는 프로모터 (예를 들어, GJB2 프로모터)를 추가로 포함한다. 일부 실시양태에서, 발현 카세트에는 2개의 아데노-연관 바이러스 (AAV) 역전된 말단 반복부 (ITR)가 플랭킹된다. 단리된 핵산 내의 천연 GJB2 조절 요소 (GRE)의 존재는 독성이고 청각을 손상시키는 내이에서의 혼재성 GJB2 유전자 발현을 방지한다. 따라서, 일부 실시양태에서, 본원에 기재된 단리된 핵산은 GJB2 유전자를 정상적으로 발현하는 내이 세포 (예를 들어, 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포)에서는 GJB2 단백질을 발현할 수 있지만, GJB2를 정상적으로 발현하지 않는 세포 (예를 들어, 유모 세포 및 나선 신경절 뉴런)에서는 그렇지 않다.The present disclosure relates, at least in part, to an isolated nucleic acid comprising a gap junction beta 2 (GJB2) gene regulatory element (GRE) and an expression cassette comprising a nucleotide sequence encoding a GJB2 protein. In some embodiments, the expression cassette further comprises a promoter (eg, GJB2 promoter). In some embodiments, the expression cassette is flanked by two adeno-associated virus (AAV) inverted terminal repeats (ITRs). The presence of a native GJB2 regulatory element (GRE) in the isolated nucleic acid prevents coexistent GJB2 gene expression in the auris interna that is toxic and impairs hearing. Thus, in some embodiments, the isolated nucleic acids described herein are capable of expressing the GJB2 protein in inner ear cells that normally express the GJB2 gene (e.g., connective tissue cells of the cochlea and supporting cells of the organ of Corti and nearby regions); , but not in cells that do not normally express GJB2 (eg, hair cells and spiral ganglion neurons).
일부 측면에서, 본 개시내용은 발현 카세트를 포함하는 단리된 핵산을 제공하며, 여기서 발현 카세트는 간극 연접 베타 2 (GJB2) 유전자 조절 요소 (GRE), 및 GJB2 단백질을 코딩하는 뉴클레오티드 서열을 포함한다.In some aspects, the disclosure provides an isolated nucleic acid comprising an expression cassette, wherein the expression cassette comprises a gap junction beta 2 (GJB2) gene regulatory element (GRE), and a nucleotide sequence encoding a GJB2 protein.
일부 실시양태에서, GJB2 단백질은 인간 GJB2 단백질이다. 일부 실시양태에서, GJB2 단백질은 서열식별번호(SEQ ID NO): 1에 대해 적어도 80% 동일한 아미노산 서열을 포함한다. 일부 실시양태에서, 인간 GJB2 단백질을 코딩하는 뉴클레오티드 서열은 서열식별번호: 2에 대해 적어도 80% 동일한 뉴클레오티드 서열을 포함한다.In some embodiments, the GJB2 protein is a human GJB2 protein. In some embodiments, the GJB2 protein comprises an amino acid sequence that is at least 80% identical to SEQ ID NO:1. In some embodiments, the nucleotide sequence encoding human GJB2 protein comprises a nucleotide sequence that is at least 80% identical to SEQ ID NO:2.
일부 실시양태에서, 발현 카세트는 GJB2 단백질을 코딩하는 뉴클레오티드 서열에 작동가능하게 연결된 프로모터를 추가로 포함한다. 일부 실시양태에서, 프로모터는 인간 GJB2 프로모터이다. 일부 실시양태에서, 프로모터는 인간 GJB2 프로모터의 500개의 뉴클레오티드를 포함한다. 일부 실시양태에서, 프로모터는 서열식별번호: 5에 대해 적어도 80% 동일한 핵산 서열을 포함한다. 일부 실시양태에서, 프로모터는 서열식별번호: 102에 대해 적어도 80% 동일한 핵산 서열을 포함한다. 일부 실시양태에서, 프로모터는 서열식별번호: 102에 대해 100% 동일한 핵산 서열을 포함한다.In some embodiments, the expression cassette further comprises a promoter operably linked to the nucleotide sequence encoding the GJB2 protein. In some embodiments, the promoter is the human GJB2 promoter. In some embodiments, the promoter comprises 500 nucleotides of a human GJB2 promoter. In some embodiments, the promoter comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:5. In some embodiments, the promoter comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:102. In some embodiments, the promoter comprises a nucleic acid sequence that is 100% identical to SEQ ID NO:102.
일부 실시양태에서, 프로모터는 인간 GJB2 기저 프로모터이다. 일부 실시양태에서, 인간 GJB2 기저 프로모터는 서열식별번호: 47에 대해 적어도 80% 동일한 핵산 서열을 포함한다.In some embodiments, the promoter is a human GJB2 basal promoter. In some embodiments, the human GJB2 basal promoter comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:47.
일부 실시양태에서, 발현 카세트는 5' UTR을 코딩하는 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, 5' UTR은 프로모터와 GJB2 단백질을 코딩하는 뉴클레오티드 서열 사이에 위치한다. 일부 실시양태에서, 5' UTR은 인간 GJB2 유전자 5' UTR의 약 300개의 뉴클레오티드를 포함한다. 일부 실시양태에서, 프로모터 및 5' UTR은 서열식별번호: 30에 대해 적어도 80% 동일한 뉴클레오티드 서열을 포함한다.In some embodiments, an expression cassette comprises a nucleotide sequence encoding a 5' UTR. In some embodiments, a 5' UTR is located between the promoter and the nucleotide sequence encoding the GJB2 protein. In some embodiments, the 5' UTR comprises about 300 nucleotides of the human GJB2 gene 5' UTR. In some embodiments, the promoter and 5' UTR comprise a nucleotide sequence that is at least 80% identical to SEQ ID NO:30.
일부 실시양태에서, GJB2 유전자 조절 요소는 인핸서를 포함한다. 일부 실시양태에서, 인핸서는 프로모터의 5'에 위치한다. 일부 실시양태에서, 인핸서는 정상적으로 GJB2 유전자의 대략 200 kb 상류 또는 하류 내에 존재한다. 일부 실시양태에서, 인핸서는 통상적으로 GJB2 유전자의 대략 95 kb 내에 존재한다. 일부 실시양태에서, GJB2 GRE는 1개 이상의 인핸서를 포함한다. 일부 실시양태에서, 1개 이상의 인핸서는 동일한 인핸서 또는 상이한 인핸서이다. 일부 실시양태에서, 인핸서는 서열식별번호: 6 내지 29 중 어느 하나에 제시된 뉴클레오티드 서열 또는 그의 단편에 대해 적어도 80% 동일한 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, 인핸서는 서열식별번호: 37-46 및 55-60 중 임의의 것에 제시된 GJB2 인핸서에 대해 적어도 80% 동일한 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, 인핸서는 서열식별번호: 42에 대해 적어도 80% 동일한 뉴클레오티드 서열을 포함한다.In some embodiments, a GJB2 gene regulatory element comprises an enhancer. In some embodiments, an enhancer is located 5' to a promoter. In some embodiments, the enhancer is normally within approximately 200 kb upstream or downstream of the GJB2 gene. In some embodiments, the enhancer is typically within approximately 95 kb of the GJB2 gene. In some embodiments, the GJB2 GRE includes one or more enhancers. In some embodiments, one or more enhancers are the same enhancer or different enhancers. In some embodiments, the enhancer comprises a nucleotide sequence that is at least 80% identical to the nucleotide sequence set forth in any one of SEQ ID NOs: 6-29 or a fragment thereof. In some embodiments, the enhancer comprises a nucleotide sequence that is at least 80% identical to the GJB2 enhancer set forth in any of SEQ ID NOs: 37-46 and 55-60. In some embodiments, the enhancer comprises a nucleotide sequence that is at least 80% identical to SEQ ID NO:42.
일부 측면에서, 본 개시내용은 또한 간극 연접 베타 2 (GJB2) 프로모터, 및 GJB2 단백질을 코딩하는 뉴클레오티드 서열을 포함하는 발현 카세트를 포함하는 단리된 핵산을 제공한다.In some aspects, the disclosure also provides an isolated nucleic acid comprising an expression cassette comprising a gap junction beta 2 (GJB2) promoter and a nucleotide sequence encoding a GJB2 protein.
일부 실시양태에서, GJB2 프로모터는 서열식별번호: 102에 대해 적어도 80% 동일한 핵산 서열을 포함한다. 일부 실시양태에서, GJB2 프로모터는 서열식별번호: 102에 대해 100% 동일한 핵산 서열을 포함한다.In some embodiments, the GJB2 promoter comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:102. In some embodiments, the GJB2 promoter comprises a nucleic acid sequence that is 100% identical to SEQ ID NO:102.
일부 실시양태에서, 발현 카세트는 5' UTR을 추가로 포함한다. 일부 실시양태에서, 5' UTR은 서열식별번호: 103에 대해 적어도 80% 동일한 제1 핵산 서열; 및/또는 서열식별번호: 104에 대해 적어도 80% 동일한 제2 핵산 서열을 포함한다. 일부 실시양태에서, 발현 카세트는 5' UTR을 추가로 포함한다. 일부 실시양태에서, 5' UTR은 서열식별번호: 103에 대해 100% 동일한 제1 핵산 서열; 및/또는 서열식별번호: 104에 대해 100% 동일한 제2 핵산 서열을 포함한다.In some embodiments, the expression cassette further comprises a 5' UTR. In some embodiments, a 5' UTR comprises a first nucleic acid sequence that is at least 80% identical to SEQ ID NO: 103; and/or a second nucleic acid sequence that is at least 80% identical to SEQ ID NO:104. In some embodiments, the expression cassette further comprises a 5' UTR. In some embodiments, a 5' UTR is a first nucleic acid sequence that is 100% identical to SEQ ID NO: 103; and/or a second nucleic acid sequence that is 100% identical to SEQ ID NO:104.
일부 실시양태에서, 단리된 핵산은 서열식별번호: 105에 대해 적어도 80% 동일한 핵산 서열을 포함한다. 일부 실시양태에서, 단리된 핵산은 서열식별번호: 105에 대해 100% 동일한 핵산 서열을 포함한다.In some embodiments, the isolated nucleic acid comprises a nucleic acid sequence that is at least 80% identical to SEQ ID NO:105. In some embodiments, the isolated nucleic acid comprises a nucleic acid sequence that is 100% identical to SEQ ID NO:105.
일부 실시양태에서, 단리된 핵산은 GJB2 유전자를 정상적으로 발현하는 세포에서 GJB2를 발현할 수 있다. 일부 실시양태에서, 단리된 핵산은 와우 결합 조직 세포 및 코르티 기관의 지지 세포에서 GJB2를 발현할 수 있다. 일부 실시양태에서, 코르티 기관의 지지 세포는 기둥 세포, 다이터 세포, 헨센 세포, 클라우디우스 세포, 내부 지골 세포 및 경계 세포이다. 일부 실시양태에서, 와우 결합 조직 세포는 혈관조 중간(strial intermediate) 세포, 측벽 및 상혈관조 부위(suprastrial zone)의 섬유세포, 혈관선조(stria vascularis)의 기저 세포, 나선 인대에서의 섬유세포, 나선판가장자리에서의 섬유세포, 전정계(scala vestibuli)에 대면하는 미로골낭(bony otic capsule)을 라이닝하는 중간엽 세포, 및 가장자리상부 암색(supralimbal dark) 세포이다.In some embodiments, the isolated nucleic acid is capable of expressing GJB2 in cells that normally express the GJB2 gene. In some embodiments, the isolated nucleic acid is capable of expressing GJB2 in cochlear connective tissue cells and supporting cells of the organ of Corti. In some embodiments, the supporting cells of the organ of Corti are pillar cells, diter cells, Hensen cells, Claudius cells, internal phalanx cells, and border cells. In some embodiments, the cochlear connective tissue cells are strial intermediate cells, fibrocytes of the lateral wall and suprastrial zone, basal cells of stria vascularis, fibrocytes in spiral ligaments, Fibrous cells at the edge of the spiral plate, mesenchymal cells lining the bony otic capsule facing the scala vestibuli, and supralimbal dark cells.
일부 실시양태에서, 발현 카세트에는 2개의 아데노-연관 바이러스 역전된 말단 반복부 (ITR)가 플랭킹된다. 일부 실시양태에서, AAV ITR은 AAV1 ITR, AAV2 ITR, AAV3 ITR, AAV4 ITR, AAV5 ITR, 및 AAV6 ITR로 이루어진 군으로부터 선택된 혈청형으로부터의 것이다. 일부 실시양태에서, AAV ITR은 AAV2 ITR이다.In some embodiments, the expression cassette is flanked by two adeno-associated virus inverted terminal repeats (ITRs). In some embodiments, the AAV ITR is from a serotype selected from the group consisting of AAV1 ITR, AAV2 ITR, AAV3 ITR, AAV4 ITR, AAV5 ITR, and AAV6 ITR. In some embodiments, the AAV ITR is an AAV2 ITR.
일부 실시양태에서, 발현 카세트는 서열식별번호: 106에 대해 적어도 80% 동일한 뉴클레오티드 서열을 갖는 5' ITR; 및/또는 서열식별번호: 107에 대해 적어도 80% 동일한 뉴클레오티드 서열을 갖는 3' ITR을 포함한다. 일부 실시양태에서, 발현 카세트는 서열식별번호: 106에 대해 100% 동일한 뉴클레오티드 서열을 갖는 5' ITR; 및/또는 서열식별번호: 107에 대해 100% 동일한 뉴클레오티드 서열을 갖는 3' ITR을 포함한다.In some embodiments, an expression cassette comprises a 5' ITR having a nucleotide sequence that is at least 80% identical to SEQ ID NO: 106; and/or a 3' ITR having a nucleotide sequence that is at least 80% identical to SEQ ID NO: 107. In some embodiments, an expression cassette comprises a 5' ITR having a nucleotide sequence that is 100% identical to SEQ ID NO: 106; and/or a 3' ITR with 100% identical nucleotide sequence to SEQ ID NO: 107.
일부 실시양태에서, 발현 카세트는 GJB2 단백질을 코딩하는 뉴클레오티드 서열의 3'에 우드척 간염 바이러스 (WHP) 전사후 조절 요소 (WPRE)를 추가로 포함한다.In some embodiments, the expression cassette further comprises a Woodchuck Hepatitis Virus (WHP) post-transcriptional regulatory element (WPRE) 3' to the nucleotide sequence encoding the GJB2 protein.
일부 실시양태에서, WPRE는 서열식별번호: 108에 대해 적어도 80% 동일한 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, WPRE는 서열식별번호: 108에 대해 100% 동일한 뉴클레오티드 서열을 포함한다.In some embodiments, the WPRE comprises a nucleotide sequence that is at least 80% identical to SEQ ID NO:108. In some embodiments, the WPRE comprises a nucleotide sequence that is 100% identical to SEQ ID NO:108.
일부 실시양태에서, 발현 카세트는 WPRE의 5'에 위치하는 3' UTR을 코딩하는 뉴클레오티드 서열을 추가로 포함한다. 일부 실시양태에서, 3' UTR은 GJB2 엑손 2 3' UTR이다. 일부 실시양태에서, GJB2 엑손 2 3' UTR은 서열식별번호: 32에 대해 적어도 80% 동일한 뉴클레오티드 서열을 포함한다.In some embodiments, the expression cassette further comprises a nucleotide sequence encoding a 3' UTR located 5' of the WPRE. In some embodiments, the 3' UTR is the
일부 실시양태에서, 발현 카세트는 3' UTR에 위치하는 1개 이상의 miRNA 결합 부위를 추가로 포함한다. 일부 실시양태에서, miRNA 결합 부위는 뉴런-연관 miRNA 결합 부위이다. 일부 실시양태에서, 뉴런-연관 miRNA는 miR-124, miR-127, miR-129, miR-129*, miR-136, miR-136*, miR-137, miR-154, miR-300-3p, miR-323, miR-329, miR-341, miR-369-5p, miR-376a, miR-376b-3p, miR-376c, miR-379, miR-382, miR-382*, miR-410, miR-411, miR-433, miR-434, miR-495, miR-541, miR-543*, miR-551b, miR-143, miR-449a, miR-219-2-3p, miR-126, miR-126*, miR-141, miR-142-3p, miR-142-5p, miR-146a, miR-150, miR-200c 및 miR-223으로부터 선택된다. 일부 실시양태에서, 뉴런-연관 miRNA는 miR-124이다. 일부 실시양태에서, miRNA 결합 부위는 와우 유모 세포-연관 miRNA 결합 부위이다. 일부 실시양태에서, 와우 유모 세포-연관 miRNA 결합 부위는 miR-124, miR-96, miR-182, 및 miR-183으로부터 선택된다.In some embodiments, the expression cassette further comprises one or more miRNA binding sites located in the 3' UTR. In some embodiments, the miRNA binding site is a neuron-associated miRNA binding site. In some embodiments, the neuron-associated miRNA is miR-124, miR-127, miR-129, miR-129*, miR-136, miR-136*, miR-137, miR-154, miR-300-3p, miR-323, miR-329, miR-341, miR-369-5p, miR-376a, miR-376b-3p, miR-376c, miR-379, miR-382, miR-382*, miR-410, miR -411, miR-433, miR-434, miR-495, miR-541, miR-543*, miR-551b, miR-143, miR-449a, miR-219-2-3p, miR-126, miR- 126*, miR-141, miR-142-3p, miR-142-5p, miR-146a, miR-150, miR-200c and miR-223. In some embodiments, the neuron-associated miRNA is miR-124. In some embodiments, the miRNA binding site is a cochlear hair cell-associated miRNA binding site. In some embodiments, the cochlear hair cell-associated miRNA binding site is selected from miR-124, miR-96, miR-182, and miR-183.
일부 실시양태에서, 발현 카세트는 폴리 A 신호를 추가로 포함한다. 일부 실시양태에서, 폴리 A 신호는 소 성장 호르몬 폴리 A 신호이다.In some embodiments, the expression cassette further comprises a poly A signal. In some embodiments the poly A signal is a bovine growth hormone poly A signal.
일부 실시양태에서, 폴리 A 신호는 서열식별번호: 109에 대해 적어도 80% 동일한 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, 폴리 A 신호는 서열식별번호: 109에 대해 100% 동일한 뉴클레오티드 서열을 포함한다.In some embodiments, the poly A signal comprises a nucleotide sequence that is at least 80% identical to SEQ ID NO:109. In some embodiments, the poly A signal comprises a nucleotide sequence that is 100% identical to SEQ ID NO:109.
일부 측면에서, 본 개시내용은 또한 서열식별번호: 110 또는 111에 대해 100% 동일한 뉴클레오티드 서열을 포함하는 단리된 핵산을 제공한다. 일부 측면에서, 본 개시내용은 또한 서열식별번호: 110 또는 111에 대해 적어도 80% 동일한 뉴클레오티드 서열을 포함하는 단리된 핵산을 제공한다.In some aspects, the disclosure also provides an isolated nucleic acid comprising a nucleotide sequence that is 100% identical to SEQ ID NO: 110 or 111. In some aspects, the disclosure also provides an isolated nucleic acid comprising a nucleotide sequence that is at least 80% identical to SEQ ID NO: 110 or 111.
일부 측면에서, 본 개시내용은 또한 본원에 기재된 바와 같은 단리된 핵산을 포함하는 벡터를 제공한다. 일부 실시양태에서, 벡터는 플라스미드 또는 바이러스 벡터이다. 일부 실시양태에서, 바이러스 벡터는 AAV 벡터이다.In some aspects, the disclosure also provides vectors comprising an isolated nucleic acid as described herein. In some embodiments, the vector is a plasmid or viral vector. In some embodiments, the viral vector is an AAV vector.
일부 측면에서, 본 개시내용은 또한 5'에서 3'으로: (a) AAV 5' ITR; (b) GJB2 프로모터, 또는 그의 기저 GJB2 프로모터 서열; (c) GJB2 5' UTR (예를 들어, GJB2 엑손 1 5' UTR); (d) GJB2 단백질을 코딩하는 뉴클레오티드 서열; (e) GJB2 3' UTR (예를 들어, GJB2 엑손 2 3' UTR) (임의로 GJB2 3' UTR은 1개 이상의 miR-124 결합 부위를 포함함); (f) 소 성장 호르몬 폴리 A 신호; 및 (g) AAV 3' ITR을 포함하는 벡터를 제공한다.In some aspects, the disclosure also provides a 5' to 3': (a) AAV 5' ITR; (b) a GJB2 promoter, or an underlying GJB2 promoter sequence thereof; (c) GJB2 5'UTR (eg,
일부 측면에서, 본 개시내용은 또한 5'에서 3'으로: (a) AAV 5' ITR; (b) GJB2 인핸서; (c) GJB2 프로모터, 또는 그의 기저 GJB2 프로모터 서열; (d) GJB2 5' UTR (예를 들어, GJB2 엑손 1 5' UTR); (e) GJB2 단백질을 코딩하는 뉴클레오티드 서열; (f) GJB2 3' UTR (예를 들어, GJB2 엑손 2 3' UTR) (임의로 GJB2 3' UTR은 1개 이상의 miR-124 결합 부위를 포함함); (g) 소 성장 호르몬 폴리 A 신호; 및 (h) AAV 3' ITR을 포함하는 벡터를 제공한다.In some aspects, the disclosure also provides a 5' to 3': (a) AAV 5' ITR; (b) a GJB2 enhancer; (c) a GJB2 promoter, or an underlying GJB2 promoter sequence thereof; (d) GJB2 5'UTR (eg,
일부 실시양태에서, 벡터는 서열식별번호: 36, 48-62 및 61-83 중 어느 하나에 대해 적어도 80% 동일한 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, 벡터는 AAV 벡터이다. 일부 실시양태에서, 벡터는 GJB2를 정상적으로 발현하는 세포에서 GJB2 유전자를 발현할 수 있다.In some embodiments, the vector comprises a nucleotide sequence that is at least 80% identical to any one of SEQ ID NOs: 36, 48-62, and 61-83. In some embodiments, the vector is an AAV vector. In some embodiments, the vector is capable of expressing the GJB2 gene in cells that normally express GJB2.
일부 측면에서, 본 개시내용은 또한 (i) 캡시드 단백질; 및 (ii) 본원에 기재된 단리된 핵산을 포함하는 재조합 아데노-연관 바이러스 (rAAV)를 제공한다.In some aspects, the disclosure also relates to (i) a capsid protein; and (ii) a recombinant adeno-associated virus (rAAV) comprising an isolated nucleic acid described herein.
일부 측면에서, 본 개시내용은 또한 (i) 캡시드 단백질; 및 (ii) (a) AAV 5' ITR (예를 들어, GJB2 엑손 1 5' UTR); (b) GJB2 프로모터, 또는 그의 기저 GJB2 프로모터 서열; (c) GJB2 5' UTR (예를 들어, GJB2 엑손 2 3' UTR) (임의로, GJB2 엑손 2 3' UTR은 1개 이상의 miR-124 결합 부위를 포함함); (d) GJB2 단백질을 코딩하는 뉴클레오티드 서열; (e) GJB2 3' UTR; (f) 소 성장 호르몬 폴리 A 신호; 및 (g) AAV 3' ITR을 포함하는 단리된 핵산을 포함하는 재조합 아데노-연관 바이러스 (rAAV)를 제공한다.In some aspects, the disclosure also relates to (i) a capsid protein; and (ii) (a) AAV 5' ITR (eg,
일부 측면에서, 본 개시내용은 또한 (i) 캡시드 단백질; 및 (ii) (a) AAV 5' ITR; (b) GJB2 인핸서; (c) GJB2 프로모터, 또는 그의 기저 GJB2 프로모터 서열; (d) GJB2 5' UTR (예를 들어, GJB2 엑손 1 5' UTR); (e) GJB2 단백질을 코딩하는 뉴클레오티드 서열; (f) GJB2 3' UTR (예를 들어, GJB2 엑손 2 3' UTR) (임의로 GJB2 엑손 2 3' UTR은 1개 이상의 miR-124 결합 부위를 포함함); (g) 소 성장 호르몬 폴리 A 신호; 및 (h) AAV 3' ITR을 포함하는 단리된 핵산을 포함하는 재조합 아데노-연관 바이러스 (rAAV)를 제공한다.In some aspects, the disclosure also relates to (i) a capsid protein; and (ii) (a) AAV 5' ITR; (b) a GJB2 enhancer; (c) a GJB2 promoter, or an underlying GJB2 promoter sequence thereof; (d) GJB2 5'UTR (eg,
일부 실시양태에서, rAAV는 GJB2 유전자를 정상적으로 발현하는 와우 세포의 하위세트에 대한 향성을 갖는다. 일부 실시양태에서, rAAV는 내이의 세포에 대해 향성을 갖는다.In some embodiments, the rAAV has tropism for a subset of cochlear cells that normally express the GJB2 gene. In some embodiments, the rAAV is tropic for cells of the inner ear.
일부 실시양태에서, 캡시드 단백질은 AAV1 캡시드 단백질, AAV2 캡시드 단백질, AAV5 캡시드 단백질, AAV7 캡시드 단백질, AAV8 캡시드 단백질, AAV9 캡시드 단백질, AAV-S 캡시드 단백질 또는 그의 변이체이다. 일부 실시양태에서, AAV 캡시드는 AAV9.PHP.B, AAV9.PHP.eB, 또는 AAV-S이다. 일부 실시양태에서, AAV 캡시드 단백질은 AAV-S이다.In some embodiments, the capsid protein is an AAV1 capsid protein, an AAV2 capsid protein, an AAV5 capsid protein, an AAV7 capsid protein, an AAV8 capsid protein, an AAV9 capsid protein, an AAV-S capsid protein, or a variant thereof. In some embodiments, the AAV capsid is AAV9.PHP.B, AAV9.PHP.eB, or AAV-S. In some embodiments, the AAV capsid protein is AAV-S.
일부 측면에서, 본 개시내용은 본원에 기재된 바와 같은 단리된 핵산, 벡터 또는 rAAV를 포함하는 숙주 세포를 제공한다.In some aspects, the disclosure provides a host cell comprising an isolated nucleic acid, vector or rAAV as described herein.
일부 측면에서, 본 개시내용은 본원에 기재된 바와 같은 단리된 핵산, 벡터, rAAV 또는 숙주 세포를 포함하는 제약 조성물을 제공한다. 일부 실시양태에서, 제약 조성물은 제약상 허용되는 담체를 추가로 포함한다.In some aspects, the disclosure provides a pharmaceutical composition comprising an isolated nucleic acid, vector, rAAV or host cell as described herein. In some embodiments, the pharmaceutical composition further comprises a pharmaceutically acceptable carrier.
일부 측면에서, 본 개시내용은 대상체에게 유효량의 본원에 기재된 바와 같은 단리된 핵산, 벡터, rAAV, 숙주 세포 또는 제약 조성물을 투여하는 것을 포함하는, 대상체에서 GJB2 유전자를 정상적으로 발현하는 세포에서 GJB2를 특이적으로 발현하는 방법을 제공한다.In some aspects, the disclosure provides specificity for GJB2 in cells that normally express the GJB2 gene in a subject, comprising administering to the subject an effective amount of an isolated nucleic acid, vector, rAAV, host cell, or pharmaceutical composition as described herein. It provides a way to express it as an enemy.
일부 측면에서, 본 개시내용은 대상체에게 유효량의 본원에 기재된 바와 같은 단리된 핵산, 벡터, rAAV, 숙주 세포 또는 제약 조성물을 투여하는 것을 포함하는, 대상체에서 비-증후군성 청각 상실 및 난청 (DFNB1)을 치료하는 방법을 제공한다.In some aspects, the disclosure provides treatment for non-syndromic deafness and deafness (DFNB1) in a subject, comprising administering to the subject an effective amount of an isolated nucleic acid, vector, rAAV, host cell, or pharmaceutical composition as described herein. provides a way to treat
GJB2-연관 질환의 치료를 필요로 하는 대상체에게 유효량의 본원에 기재된 바와 같은 단리된 핵산, 벡터, rAAV, 숙주 세포 또는 제약 조성물을 투여하는 것을 포함하는, 상기 대상체에서 GJB2-연관 질환을 치료하는 방법.A method of treating a GJB2-associated disease in a subject in need thereof comprising administering to the subject an effective amount of an isolated nucleic acid, vector, rAAV, host cell or pharmaceutical composition as described herein. .
일부 실시양태에서, 대상체는 포유동물이다. 일부 실시양태에서, 포유동물은 인간이다. 일부 실시양태에서, 포유동물은 비-인간 포유동물이다. 일부 실시양태에서, 비-인간 포유동물은 마우스, 래트 또는 비-인간 영장류이다.In some embodiments, the subject is a mammal. In some embodiments, the mammal is a human. In some embodiments, the mammal is a non-human mammal. In some embodiments, the non-human mammal is a mouse, rat, or non-human primate.
일부 실시양태에서, 청각 상실은 GJB2 유전자에서의 돌연변이와 연관된다. 일부 실시양태에서, GJB2 유전자에서의 돌연변이는 점 돌연변이, 미스센스 돌연변이, 넌센스 돌연변이, 스플라이스-변경 돌연변이, 동의 돌연변이, 결실, 삽입 또는 그의 조합이다. 일부 실시양태에서, 대상체는 인간이고; 돌연변이는 표 2 (하기)에 열거된 돌연변이 또는 그의 조합이다. 일부 실시양태에서, 돌연변이는 NM_004004.6 c.101T>C (GRCh37/hg19 Chr13:20763620A>G) 또는 c.35delG (GRCh37/hg19 chr13:20763685AC>A)이다.In some embodiments, hearing loss is associated with a mutation in the GJB2 gene. In some embodiments, the mutation in the GJB2 gene is a point mutation, missense mutation, nonsense mutation, splice-altering mutation, synonymous mutation, deletion, insertion, or combination thereof. In some embodiments, the subject is a human; The mutation is a mutation or combination thereof listed in Table 2 (below). In some embodiments, the mutation is NM_004004.6 c.101T>C (GRCh37/hg19 Chr13:20763620A>G) or c.35delG (GRCh37/hg19 chr13:20763685AC>A).
일부 실시양태에서, 투여는 와우 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포에서 GJB2 단백질의 발현을 일으킨다. 일부 실시양태에서, 코르티 기관의 지지 세포는 기둥 세포, 다이터 세포, 헨센 세포, 클라우디우스 세포, 내부 지골 세포, 및 경계 세포이다. 일부 실시양태에서, 결합 조직 세포는 혈관조 중간 세포, 측벽 및 상혈관조 부위의 섬유세포, 혈관선조의 기저 세포, 나선 인대에서의 섬유세포, 나선판가장자리에서의 섬유세포, 전정계에 대면하는 미로골낭을 라이닝하는 중간엽 세포, 및 가장자리상부 암색 세포이다.In some embodiments, the administration results in expression of the GJB2 protein in cochlear connective tissue cells and supporting cells of the organ of Corti and proximal region. In some embodiments, the supporting cells of the organ of Corti are pillar cells, diter cells, Hensen cells, Claudius cells, internal phalanx cells, and border cells. In some embodiments, the connective tissue cells are vascular interstitial cells, fibrocytes in the region of the lateral wall and supravascular striatum, basal cells in the vascular progenitors, fibrocytes in the spiral ligament, fibrocytes in the margins of the spiral plate, and those facing the vestibular system. mesenchymal cells lining the labyrinth bone cyst, and supermarginal dark cells.
일부 실시양태에서, 투여는 주사를 통한 것이다. 일부 실시양태에서, 주사는 와우의 정원창 막을 통해, 와우의 중간계(scala media) 내로, 와우의 고실계(scala tympani) 내로, 와우의 전정계 내로, 내이의 반고리관 내로, 또는 내이의 구형낭(saccule) 또는 난형낭(utricle) 내로 이루어진다.In some embodiments, administration is via injection. In some embodiments, the injection is via the round window membrane of the cochlea, into the scala media of the cochlea, into the scala tympani of the cochlea, into the vestibular system of the cochlea, into the semicircular canals of the inner ear, or into the saccule of the inner ear. ) or in the utricle.
본 발명의 하나 이상의 실시양태의 세부사항은 하기 설명에 제시된다. 본 발명의 다른 특색 또는 이점은 하기 도면 및 특정 실시양태의 상세한 설명 및 또한 첨부된 청구범위로부터 명백할 것이다.The details of one or more embodiments of the invention are set forth in the description below. Other features or advantages of the present invention will be apparent from the following drawings and detailed description of specific embodiments and also from the appended claims.
본 명세서에 포함되고 그의 일부를 구성하는 첨부 도면은 특정 실시양태를 예시하고, 서면 설명과 함께 본원에 개시된 조성물 및 방법의 특정 측면의 비제한적 예를 제공하는 역할을 한다.
도 1a-1c는 GJB2의 구조 및 발현 분포, 및 GJB2 발현의 상실이 환자에게 어떻게 영향을 미치는지를 보여준다. 도 1a는 GJB2 헤미채널의 구조를 나타낸다. 각각 4개의 막횡단 나선을 갖는 GJB2 단백질의 6개의 서브유닛은 막의 면에서 조립되어 큰 중심 포어를 형성한다. 인접한 세포로부터의 GJB2 헤미채널은 결합하여 한 세포의 세포질로부터 다른 세포의 세포질로의 채널을 생성한다. 간극 연접은 연접 플라크에 패킹된 수백개 또는 수천개의 채널에 의해 형성된다. 도 1b-1c는 GJB2가 발현되는 섬유세포 및 상피 세포의 네트워크 (도 1b), 및 GJB2가 발현되지 않는 내유모 및 외유모 세포 (도 1c)를 보여준다. 도 1d는 출생시 약간의 잔류 청각을 갖는 GJB2 돌연변이(들)를 보유하는 다수의 환자가 다음 3-6년에 걸쳐 추가의 청각 상실을 나타낸다는 것을 보여준다. 치료를 위한 윈도우는 출생 후 1-5년 동안 존재하며, 미국에서 0-5세의 병에 걸린 어린이 ~10,000명이 치료받을 수 있다.
도 2a-2b는 정원창 막 (RWM)을 통한 직접 주사에 의한 와우로의 바이러스 벡터의 전달, 및 주사한 마우스의 청각에 대한 Gjb2의 혼재성 발현의 유해 효과를 보여준다. 도 2a는 정원창 막 (RWM) 주사를 예시하는 카툰이다. 도 2b는 내이에서의 Gjb2의 혼재성 발현이 야생형 마우스에서 청각을 손상시켰다는 것을 보여준다.
도 3a-3n은 GJB2 유전자를 자연적으로 발현하는 와우 세포의 하위세트에서 GJB2 발현에 결정적인 시스-조절 요소 (예를 들어, 인핸서)의 확인을 보여준다. 도 3a-3b는 GJB2-연관 난청을 갖는 특정 환자가 GJB2 코딩 서열 돌연변이와 트랜스로 발생하는 상류 결실을 갖는다는 것을 보여주며, 이는 일부 환자가 시스-조절 요소에 돌연변이(들)를 보유한다는 것을 시사하고, CRYL1 유전자 옆의 영역은 이러한 시스-조절 요소의 확인에 특히 중요하다. 도 3c (상단)는 마우스 Gjb2 유전자 영역에서 ~300 kb에 걸친, 발생 단계 P2, P5 및 P8의 마우스 와우로부터의 ATAC-Seq의 UCSC 게놈 브라우저 뷰에서의 유전자 조절 요소 (GRE)의 확인을 나타낸다. 음영 영역은 추정 GRE를 함유하는 영역을 표시한다. X-축은 마우스 게놈 내의 chr14 상의 게놈 영역이다. Y-축은 게놈 내의 특이적 영역에 정렬되는 ATAC-Seq로부터의 판독물의 수이다. 밝은 음영은 판독물 파일업이 풍부한 전사상 활성 영역의 특징인 오픈 염색질의 영역을 나타내며, 이는 이들 영역에서의 보다 높은 활성을 시사한다. 영역 A 및 B는 마우스 Gjb2 자체 내의 전사상 활성 서열을 표시한다. 영역 C-M은 시스-조절 네트워크의 일부일 수 있는 Gjb2 주변에서 전사상 활성인 영역이다. 도 3c (하단)는 특이적 GRE (어두운 하이라이트)로서 검출된 밝은 음영 영역 내 및 주변의 전사상 활성 영역을 보여준다. GRE는 마우스에서 처음으로 확인되었음을 주목한다. 인간 GJB2 GRE는 마우스 GRE를 모델링함으로써 인 실리코로 확인하였다. 인간 GJB2 GRE를 후속 실험에서 시험하였다. 도 3d-3e는 GJB2 프로모터 및/또는 인핸서의 혼입이 있거나 없는 다양한 벡터 설계를 보여준다. 이들 벡터를 마우스 내이에서 시험하였다. GJB2 인핸서 벡터인 C15 벡터는 500 bp의 인간 GJB2 프로모터, 인간 GJB2 5' UTR에 이어 GFP에 대한 코딩 서열 및 인간 GJB2 3' UTR, 및 ATAC-seq에 의해 확인된 마우스 서열과 일치하는 3개의 인간 GJB2 인핸서를 잇는다. 벡터 c20-23은 마우스에서 Gjb2의 혼재성 발현의 독성을 시험하도록 구축되었다. 벡터 c20은 2 x 109 게놈 카피를 초과하는 용량에서 치사성이었다. 도 3f는 측벽 (상단)으로부터 치간 세포 (하단)까지의 마우스 와우의 분절을 나타낸다. AAV9-PHP.B-C15 벡터로 형질도입되고 Gjb2 인핸서 하에 GFP 마커 유전자를 발현하는 세포는 좌측 패널에 제시된다. GJB2를 정상적으로 발현하는 세포는 중간 패널에 제시된다. 우측 패널에서, IHC 및 OHC (표시됨)는 또한 액틴을 형광 팔로이딘으로 표지함으로써 확인된다. c15 구축물에 의해 유도된 GFP의 발현 패턴은 GJB2에 대한 동일한 항체를 사용하는 문헌 [Kikuchi et al., 1995]에 보고된 천연 Gjb2 발현과 일치한다. 특히, c15는 유모 세포에서 GFP 발현을 유도하지 않는다. 도 3g는 구축물 c20에 의해 유도되는 내유모 세포에서의 Gjb2의 발현을 보여준다. 주사하지 않은 마우스 와우에서의 코르티 기관 (외유모 세포 및 내유모 세포 포함)의 3D 재구성이 상단 패널에 제시된다. 지지 세포에서의 GJB2-함유 간극 연접을 GJB2 단백질에 대한 항체로 표지하였다. 유모 세포는 간극 연접을 이루지 않는다. 혼재성 프로모터를 갖는 벡터 c20은 내유모 세포 및 다른 세포 유형에서 GJB2 발현을 유도한다 (하단 패널 참조). 도 3h는 혼재성 Gjb2 발현이 야생형 마우스에서 청각을 손상시키지만, 표적화된 발현은 Gjb2 녹아웃 마우스에서 청각을 구제함을 보여준다. 그러나, ATAC-Seq로부터의 예비 결과에 기초하여 GJB2 프로모터/인핸서를 포함하는 C70 구축물은 15-20 dB만큼 청각을 구제할 수 있었고, 야생형에서 청각을 손상시키지 않았다. 도 3i-3l은 HA 태그를 갖거나 갖지 않는 마우스 GJB2 또는 인간 GJB2를 코딩하는 c70 벡터 플라스미드의 지도를 보여준다. 도 3m은 HA 태그를 갖거나 갖지 않는 마우스 GJB2 또는 인간 GJB2를 코딩하는 벡터 c.70의 개략도를 보여준다. 도 3n은 생성되고 시험된 추가의 벡터를 보여준다.
도 4는 CBA 프로모터를 갖는 eGFP를 코딩하는 AAV-S가 신생 마우스 및 어린 NHP 와우 둘 다에서 유모 세포, 지지 세포, 및 측벽의 세포를 효율적으로 형질도입한다는 것을 보여준다.
도 5a-5v는 각각 확인된 GJB2 GRE 1, 2, 3, 4, 5, 7, 8 및 9를 포함하는 AAV 벡터의 벡터 지도를 보여준다. 벡터는 5'에서 3'으로 5' ITR, 인간 GJB2 GRE, GJB2 기저 프로모터, GJB2 엑손 1 5' UTR, eGFR을 코딩하는 뉴클레오티드 서열, 및 GJB2 엑손 2 3' UTR을 포함한다. 도 5a는 인간 GJB2 GRE1을 포함하고 인간 GJB2를 코딩하는 벡터 c.81.1을 보여주고; 도 5b는 인간 GJB2 GRE1을 포함하고 마우스 GJB2를 코딩하는 벡터 c.81.1을 보여주고; 도 5c는 인간 GJB2 GRE2를 포함하고 eGFP를 코딩하는 벡터 c.81.2를 보여주고; 도 5d는 인간 GJB2 GRE2를 포함하고 인간 GJB2를 코딩하는 벡터 c.81.2를 보여주고; 도 5e는 인간 GJB2 GRE2를 포함하고 마우스 GJB2를 코딩하는 벡터 c.81.2를 보여주고; 도 5f는 인간 GJB2 GRE3을 포함하고 eGFP를 코딩하는 벡터 c.81.3을 보여주고; 도 5g는 인간 GJB2 GRE3을 포함하고 인간 GJB2를 코딩하는 벡터 c.81.3을 보여주고; 도 5h는 인간 GJB2 GRE3을 포함하고 마우스 GJB2를 코딩하는 벡터 c.81.3을 보여주고; 도 5i는 인간 GJB2 GRE4를 포함하고 인간 GJB2를 코딩하는 벡터 c.81.4를 보여주고; 도 5j는 인간 GJB2 GRE4를 포함하고 마우스 GJB2를 코딩하는 벡터 c.81.4를 보여주고; 도 5k는 인간 GJB2 GRE5를 포함하고 eGFP를 코딩하는 벡터 c.81.5를 보여주고; 도 5l은 인간 GJB2 GRE5를 포함하고 인간 GJB2를 코딩하는 벡터 c.81.5를 보여주고; 도 5m은 인간 GJB2 GRE5를 포함하고 마우스 GJB2를 코딩하는 벡터 c.81.5를 보여주고; 도 5n은 인간 GJB2 GRE7을 포함하고 eGFP를 코딩하는 벡터 c.81.7을 보여주고; 도 5o는 인간 GJB2 GRE7을 포함하고 인간 GJB2를 코딩하는 벡터 c.81.7을 보여주고; 도 5p는 인간 GJB2 GRE7을 포함하고 마우스 GJB2를 코딩하는 벡터 c.81.7을 보여주고; 도 5q는 인간 GJB2 GRE8을 포함하고 인간 GJB2를 코딩하는 벡터 c.81.8을 보여주고; 도 5r은 인간 GJB2 GRE8을 포함하고 마우스 GJB2를 코딩하는 벡터 c.81.8을 보여주고; 도 5s는 인간 GJB2 GRE9를 포함하고 eGFP를 코딩하는 벡터 c.81.9를 보여주고; 도 5t는 인간 GJB2 GRE9를 포함하고 인간 GJB2를 코딩하는 벡터 c.81.9를 보여주고; 도 5u는 인간 GJB2 GRE9를 포함하고 마우스 GJB2를 코딩하는 벡터 c.81.9를 보여준다. 도 5v는 상기 기재된 바와 같은 eGFP, 마우스 GJB2 및 인간 GJB2를 코딩하는 c81.2, c81.3, c81.5, c81.7 및 c81.9의 개략도를 보여준다.
도 6a-6d는 코르티 기관의 세포에서의 벡터 c81.5에 의한 GFP 발현을 보여준다. 도 6a는 코르티 기관 내의 및 내측의 다양한 지지 세포를 포함하는 GFP 발현 세포의 형광 영상을 보여준다. 도 6b는 코르티 기관의 영역에서의 내인성 GJB2의 항체 표지를 보여준다. Gjb2 발현은 외인성 GFP의 발현과 크게 중복되었다. 도 6c는 유모 세포의 부동섬모를 나타낸 액틴의 제3 염색을 포함한 도 6a 및 6b의 오버레이이다. 유모 세포에서 GFP는 발현되지 않았다. 도 6d는 GFP 및 유모 세포에 대한 단백질 마커 MYO7A의 동결 절편 면역형광 영상을 보여준다. GFP는 코르티 기관에서 다양한 지지 세포에서 발현되었지만, 유모 세포에서 발현된 MYO7A 발현과 중복되지 않았다.
도 7a-7e는 와우의 측벽에서의 벡터 81.5에 의한 GFP 발현 패턴을 보여준다. 도 7a는 측벽의 섬유세포를 포함하는 세포에서의 GFP 발현을 보여준다. 도 7b는 측벽의 영역에서의 내인성 Gjb2의 항체 표지를 보여준다. GJB2 발현은 외인성 GFP와 크게 중복된다. 도 7c는 도 7a 및 7b의 오버레이 영상이다. GFP는 Gjb2를 발현하는 세포에서 발현되었다는 점에 주목한다. 도 7d-7e는 코르티 기관의 지지 세포 및 측벽의 섬유세포에서의 GFP (도 7d) 및 GJB2 (도 7e)의 동결 절편 면역형광을 보여준다.
본 명세서에 포함되고 그의 일부를 구성하는 첨부 도면은 특정 실시양태를 예시하고, 서면 설명과 함께 본원에 개시된 조성물 및 방법의 특정 측면의 비제한적 예를 제공하는 역할을 한다.The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate certain embodiments and together with the written description serve to provide non-limiting examples of certain aspects of the compositions and methods disclosed herein.
1A-1C show the structure and expression distribution of GJB2 and how loss of GJB2 expression affects patients. Figure 1a shows the structure of the GJB2 hemichannel. The six subunits of the GJB2 protein, each with four transmembrane helices, assemble at the face of the membrane to form a large central pore. GJB2 hemichannels from adjacent cells bind to create channels from the cytoplasm of one cell to the cytoplasm of another cell. Gap junctions are formed by hundreds or thousands of channels packed in synaptic plaques. 1b-1c shows networks of fibroblasts and epithelial cells expressing GJB2 ( FIG. 1b ), and inner and outer hair cells without GJB2 expression ( FIG. 1c ). 1D shows that many patients carrying GJB2 mutation(s) with some residual hearing at birth develop additional hearing loss over the next 3-6 years. The window for treatment exists for 1-5 years after birth, and ~10,000 affected children aged 0-5 years in the United States can be treated.
2A-2B show the delivery of viral vectors to the cochlea by direct injection through the round window membrane (RWM), and the deleterious effects of mixed expression of Gjb2 on the hearing of the injected mice. 2A is a cartoon illustrating round window membrane (RWM) injection. 2B shows that mixed expression of Gjb2 in the inner ear impaired hearing in wild-type mice.
3A-3N show the identification of cis-regulatory elements (eg, enhancers) critical for GJB2 expression in a subset of cochlear cells that naturally express the GJB2 gene. 3A-3B show that certain patients with GJB2-associated hearing loss have GJB2 coding sequence mutations and upstream deletions that occur in trans, suggesting that some patients carry mutation(s) in cis-regulatory elements. and the region flanking the CRYL1 gene is particularly important for the identification of these cis-regulatory elements. 3C (top) shows the identification of gene regulatory elements (GREs) in the UCSC Genome Browser View of ATAC-Seqs from mouse cochleas at developmental stages P2, P5 and P8, spanning ~300 kb in the mouse Gjb2 gene region. Shaded regions indicate regions containing putative GREs. The X-axis is the genomic region on chr14 in the mouse genome. The Y-axis is the number of reads from ATAC-Seq that align to specific regions in the genome. Light shading indicates regions of open chromatin that are characteristic of regions of transcriptional activity enriched in read pile-up, suggesting higher activity in these regions. Regions A and B represent transcriptionally active sequences within mouse Gjb2 itself. The region CM is a transcriptionally active region around Gjb2 that may be part of a cis-regulatory network. 3C (bottom) shows transcriptional active regions within and around light shaded regions detected as specific GREs (dark highlights). Note that GRE was first identified in mice. The human GJB2 GRE was confirmed in silico by modeling the mouse GRE. Human GJB2 GRE was tested in subsequent experiments. Figures 3D-3E show various vector designs with and without incorporation of the GJB2 promoter and/or enhancer. These vectors were tested in the mouse inner ear. The C15 vector, a GJB2 enhancer vector, contains a 500 bp human GJB2 promoter, a human GJB2 5' UTR followed by a coding sequence for GFP and a human GJB2 3' UTR, and three human GJB2 identical to the mouse sequence identified by ATAC-seq. splicing enhancers Vector c20-23 was constructed to test the toxicity of mixed expression of Gjb2 in mice. Vector c20 was lethal at doses exceeding 2×10 9 genome copies. Figure 3f shows the segment of the mouse cochlea from the lateral wall (top) to the interdental cells (bottom). Cells transduced with the AAV9-PHP.B-C15 vector and expressing the GFP marker gene under the Gjb2 enhancer are shown in the left panel. Cells normally expressing GJB2 are shown in the middle panel. In the right panel, IHC and OHC (indicated) are also identified by labeling actin with fluorescent phalloidin. The expression pattern of GFP induced by the c15 construct is consistent with native Gjb2 expression reported by Kikuchi et al., 1995 using the same antibody against GJB2. In particular, c15 does not induce GFP expression in hair cells. 3G shows the expression of Gjb2 in inner hair cells induced by construct c20. A 3D reconstruction of the organ of Corti (including outer and inner hair cells) in an uninjected mouse cochlea is shown in the top panel. GJB2-containing gap junctions in feeder cells were labeled with an antibody against the GJB2 protein. Hair cells do not form gap junctions. Vector c20 with a hybrid promoter drives GJB2 expression in inner hair cells and other cell types (see lower panel). 3H shows that mixed Gjb2 expression impairs hearing in wild-type mice, but targeted expression rescues hearing in Gjb2 knockout mice. However, based on preliminary results from ATAC-Seq, the C70 construct containing the GJB2 promoter/enhancer was able to rescue hearing by 15-20 dB and did not impair hearing in wild type. 3I-3L show maps of c70 vector plasmids encoding mouse GJB2 or human GJB2 with or without an HA tag. 3M shows a schematic of vector c.70 encoding mouse GJB2 or human GJB2 with or without an HA tag. Figure 3n shows additional vectors generated and tested.
4 shows that AAV-S encoding eGFP with a CBA promoter efficiently transduces hair cells, feeder cells, and cells of the lateral wall in both neonatal mice and young NHP cochleas.
5A-5V show vector maps of AAV vectors comprising identified
6A-6D show GFP expression by the vector c81.5 in cells of the organ of Corti. 6A shows fluorescence images of GFP expressing cells, including various supporting cells within and medial to the organ of Corti. 6B shows antibody labeling of endogenous GJB2 in the region of the organ of Corti. Gjb2 expression largely overlapped with that of exogenous GFP. 6C is an overlay of FIGS. 6A and 6B including a third staining of actin showing stereocilia of hair cells. GFP was not expressed in hair cells. 6D shows frozen section immunofluorescence images of GFP and MYO7A, a protein marker for hair cells. GFP was expressed on various supporting cells in the organ of Corti, but did not overlap with MYO7A expression expressed on hair cells.
7A-7E show the pattern of GFP expression by vector 81.5 in the lateral wall of the cochlea. Figure 7a shows GFP expression in cells containing fibroblasts of the lateral wall. Figure 7b shows antibody labeling of endogenous Gjb2 in the region of the lateral wall. GJB2 expression largely overlaps with exogenous GFP. 7c is an overlay image of FIGS. 7a and 7b. Note that GFP was expressed in cells expressing Gjb2. 7D-7E show frozen section immunofluorescence of GFP (FIG. 7D) and GJB2 (FIG. 7E) in the supporting cells of the organ of Corti and the fibrocytes of the lateral wall.
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate certain embodiments and together with the written description serve to provide non-limiting examples of certain aspects of the compositions and methods disclosed herein.
상세한 설명details
본 개시내용은 적어도 부분적으로, 간극 연접 베타 2 (GJB2) 유전자 조절 요소 (GRE), 및 GJB2 단백질을 코딩하는 뉴클레오티드 서열을 포함하는 발현 카세트를 포함하는 단리된 핵산에 관한 것이다. 일부 실시양태에서, 발현 카세트는 프로모터 (예를 들어, GJB2 프로모터)를 추가로 포함한다. 일부 실시양태에서, 발현 카세트에는 2개의 아데노-연관 바이러스 (AAV) 역전된 말단 반복부 (ITR)가 플랭킹된다. 단리된 핵산 내의 천연 GJB2 조절 요소 (GRE)의 존재는 독성이고 청각을 손상시키는 내이에서의 혼재성 GJB2 유전자 발현을 방지한다. 따라서, 일부 실시양태에서, 본원에 기재된 단리된 핵산은 GJB2 유전자를 정상적으로 발현하는 내이 세포 (예를 들어, 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포)에서는 GJB2 단백질을 발현할 수 있지만, GJB2 유전자를 정상적으로 발현하지 않는 세포 (예를 들어, 유모 세포 및 나선 신경절 뉴런)에서는 그렇지 않다.The present disclosure relates, at least in part, to an isolated nucleic acid comprising a gap junction beta 2 (GJB2) gene regulatory element (GRE) and an expression cassette comprising a nucleotide sequence encoding a GJB2 protein. In some embodiments, the expression cassette further comprises a promoter (eg, GJB2 promoter). In some embodiments, the expression cassette is flanked by two adeno-associated virus (AAV) inverted terminal repeats (ITRs). The presence of a native GJB2 regulatory element (GRE) in the isolated nucleic acid prevents coexistent GJB2 gene expression in the auris interna that is toxic and impairs hearing. Thus, in some embodiments, the isolated nucleic acids described herein are capable of expressing the GJB2 protein in inner ear cells that normally express the GJB2 gene (e.g., connective tissue cells of the cochlea and supporting cells of the organ of Corti and nearby regions); , but not in cells that do not normally express the GJB2 gene (eg, hair cells and spiral ganglion neurons).
I. 단리된 핵산I. Isolated Nucleic Acids
일부 측면에서, 본 개시내용은 특정 상염색체 열성 유전 질환, 예를 들어 비-증후군성 청각 상실 (DFNB1)을 치료하기 위한 조성물 및 방법에 관한 것이다. DFNB1은 GJB2 유전자에서의 돌연변이에 의해 유발된다. GJB2 유전자는 코넥신 26으로도 공지된 GJB2 단백질을 코딩한다. 코넥신 26은 코넥신 단백질 패밀리의 구성원이다. GJB2 단백질은 간극 연접으로 불리는 클러스터에 채널을 형성하며, 이는 내이 내의 세포를 포함한 이웃 세포 사이의 소통을 가능하게 한다. GJB2 유전자에서의 돌연변이는 간극 연접의 구조를 제거하거나 변화시키고, 청각에 필요한 세포의 기능 또는 생존에 영향을 미친다. 유전자 대체 요법 (예를 들어, 재조합 아데노-연관 바이러스 (rAAV)에 의한 유전자 요법)은 GJB2 유전자 코딩 서열의 작은 크기 (700 bp 미만)로 인해 매력적이다. 그러나, 현재 이용가능한 유전자 요법을 사용한 내이에서의 GJB2 발현의 회복은 청각의 회복으로 이어지지 않는다.In some aspects, the present disclosure relates to compositions and methods for treating certain autosomal recessive inherited disorders, such as non-syndromic hearing loss (DFNB1). DFNB1 is caused by a mutation in the GJB2 gene. The GJB2 gene encodes the GJB2 protein, also known as connexin 26. Connexin 26 is a member of the connexin protein family. The GJB2 protein forms channels in clusters called gap junctions, which allow communication between neighboring cells, including cells within the inner ear. Mutations in the GJB2 gene eliminate or change the structure of the gap junction and affect the function or survival of cells required for hearing. Gene replacement therapy (eg, gene therapy with recombinant adeno-associated virus (rAAV)) is attractive due to the small size (less than 700 bp) of the GJB2 gene coding sequence. However, restoration of GJB2 expression in the inner ear using currently available gene therapies does not lead to restoration of hearing.
따라서, 본 개시내용은, 부분적으로, 성공적인 GJB2 유전자 요법이 GJB2 단백질을 정상적으로 발현하는 세포 (예를 들어, 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포)에서 GJB2 발현을 필요로 하고 다른 세포 (예를 들어, 유모 세포 및 나선 신경절 뉴런)에서는 그렇지 않다는 놀라운 발견에 기초한다. 감각 세포를 제외하고, 와우 내의 대부분의 세포는 간극 연접을 통해 연결되고, 이들 간극 연접은 와우 기능에서 결정적인 역할을 하는 것으로 보인다. GJB2 단백질은 와우에서 대부분의 세포 부류를 연결하는 간극 연접에서 발생한다. 2개의 독립적인 세포 시스템이 존재하며, 이들은 상호연결 간극 연접에 의해 한정된다. 제1 시스템인 상피 세포 간극 연접 시스템은 주로 모든 코르티 기관의 지지 세포 (예를 들어, 내부 및 외부 고랑(sulcus)의 상피 세포, 및 치간 세포)로 구성되고, 또한 나선판가장자리에서의 치간 세포 및 나선 인대에서의 뿌리 세포를 포함한다. 내이에서, 코르티 기관으로 명명되는 와우의 감각 영역은 다양한 지지 세포에 의해 둘러싸인 1개 열의 내유모 세포 (IHC) 및 3 내지 4개 열의 외유모 세포 (OHC)를 포함한다. 지지 세포는 내이 감각 상피의 발생, 기능 및 유지에서 결정적인 역할을 한다. 상피의 내강 표면과만 접촉하는 유모 세포와 달리, 지지 세포는 기저층(basal lamina)으로부터 내강까지 상피의 전체 깊이에 걸쳐있다. 지지 세포는 밀착 및 부착 연접에 의해 서로 및 유모 세포에 연결되고; 이들은 간극 연접에 의해 다른 지지 세포와 직접 소통한다 (예를 들어, 문헌 [Wan et al., Inner ear supporting cells: Rethinking the silent majority, Semin Cell Dev Biol. 2013 May; 24(5): 448-459]). 코르티 기관에 대한 지지 세포의 비제한적 예는 기둥 세포, 다이터 세포, 헨센 세포, 클라우디우스 세포, 내부 지골 세포 및 경계 세포를 포함한다. 제2 시스템인 결합 조직 세포 간극 연접 시스템은, 혈관조 중간 세포, 측벽 및 상혈관조 부위의 섬유세포, 혈관선조의 기저 세포, 나선 인대에서의 섬유세포, 나선판가장자리에서의 섬유세포, 전정계에 대면하는 미로골낭을 라이닝하는 중간엽 세포, 및 가장자리상부 암색 세포를 포함한다. 일부 실시양태에서, 와우에서, GJB2는 코르티 기관 및 근처 영역의 지지 세포 (예를 들어, 기둥 세포, 다이터 세포, 헨센 세포, 클라우디우스 세포, 내부 지골 세포; 및 경계 세포), 및 혈관조 중간 세포, 측벽 및 상혈관조 부위의 섬유세포, 혈관선조의 기저 세포, 나선 인대에서의 섬유세포, 나선판가장자리에서의 섬유세포, 전정계에 대면하는 미로골낭을 라이닝하는 중간엽 세포, 및 가장자리상부 암색 세포를 포함하는 결합 조직 시스템에서 정상적으로 발현된다 (예를 들어, 문헌 [Kikuchi et al. (1995) Gap junctions in the rat cochlea: immunohistochemical and ultrastructural analysis. Anat Embryol (Berl) 191:101-118; and Kikuchi et al., Gap junction systems in the mammalian cochlea, Brain Res Brain Res Rev. 2000 Apr;32(1):163-6. doi: 10.1016/s0165-0173(99)00076-4] 참조).Thus, the present disclosure provides, in part, that successful GJB2 gene therapy requires GJB2 expression in cells that normally express the GJB2 protein (e.g., connective tissue cells of the cochlea and supporting cells of the organ of Corti and nearby regions) and other It is based on the surprising finding that this is not the case in cells (eg, hair cells and spiral ganglion neurons). Except for sensory cells, most cells within the cochlea are connected via gap junctions, and these gap junctions appear to play a crucial role in cochlear function. The GJB2 protein occurs at gap junctions that connect most cell classes in the cochlea. There are two independent cellular systems, which are bounded by interconnecting gap junctions. The first system, the epithelial gap junction system, consists primarily of the supporting cells of all organs of Corti (e.g. epithelial cells of the inner and outer sulcus, and interdental cells), but also interdental cells at the margins of the spiral plate and Root cells in the spiral ligament. In the inner ear, the sensory area of the cochlea, termed the organ of Corti, contains one row of inner hair cells (IHC) and three to four rows of outer hair cells (OHC) surrounded by various supporting cells. Supporting cells play a critical role in the development, function and maintenance of the inner ear sensory epithelium. Unlike hair cells, which only contact the luminal surface of the epithelium, supporting cells span the entire depth of the epithelium from the basal lamina to the lumen. Supporting cells are connected to each other and to hair cells by tight and adherent synapses; They communicate directly with other supporting cells by gap junctions (see, e.g., Wan et al., Inner ear supporting cells: Rethinking the silent majority, Semin Cell Dev Biol. 2013 May; 24(5): 448-459 ]). Non-limiting examples of supporting cells for the organ of Corti include pillar cells, diter cells, Hensen cells, Claudius cells, inner phalanx cells and border cells. The second system, the connective tissue gap junction system, is composed of vascular interstitial cells, fibrocytes in the lateral wall and supravascular acinar regions, basal cells in the vascular progenitors, fibrocytes in the spiral ligament, fibrocytes at the margin of the spiral plate, and vestibular system. mesenchymal cells lining the labyrinth bone capsule facing the , and dark cells at the marginal upper part. In some embodiments, in the cochlea, GJB2 is expressed in supporting cells (e.g., pillar cells, diter cells, Hensen cells, claudius cells, internal phalanx cells; and border cells), and vascular interstitial cells of the organ of Corti and nearby regions. , fibrocytes in the lateral wall and supravascular region, basal cells in the vascular progenitors, fibrocytes in the spiral ligament, fibrocytes at the margin of the spiral plate, mesenchymal cells lining the labyrinth bone capsule facing the vestibular system, and supermarginal dark color It is normally expressed in connective tissue systems including cells (see, eg, Kikuchi et al. (1995) Gap junctions in the rat cochlea: immunohistochemical and ultrastructural analysis. Anat Embryol (Berl) 191:101-118; and Kikuchi et al., Gap junction systems in the mammalian cochlea, Brain Res Brain Res Rev. 2000 Apr;32(1):163-6. doi: 10.1016/s0165-0173(99)00076-4).
GJB2 발현은 와우 기능에 중요하다. 예를 들어, 형질도입 채널을 통해 유모 세포에 진입하고 기저 K+ 채널을 통해 방출되는 K+은 상피계에 의해 코르티 기관으로부터 셔틀링되고, 세포질계에 의해 혈관조로 운반되며, 여기서 이는 다시 내림프로 펌핑된다. 또한, GJB2는, 비록 유모 세포가 Gjb2를 발현하지 않더라도, 내이에서 GJB2 단백질이 결여된 마우스가 출생후 제30일 (P30)까지 감소된 와우내 전위 및 유모 세포 및 지지 세포의 극심한 아폽토시스 손실을 갖기 때문에, 와우의 발생에서 역할을 한다 (문헌 [Cohen-Salmon et al., 2002; Wang et al., 2009; Sun et al., 2009; Crispino et al., 2011; Johnson et al., 2017]). Gjb2가 P6 후에 결실되면, 표현형은 훨씬 더 경미하다 (Chang et al., 2015). 그러나, GJB2 단백질에 대한 장기적인 요건이 남아있다: 유모 세포 손실은 결실에도 불구하고 P14만큼 늦게 수개월 후에 발생한다 (Ma et al., 2020). 어떠한 특정한 이론에 얽매이는 것을 원하지는 않지만, K+의 셔틀링에서의 GJB2의 기능은 와우의 발생에서의 그의 역할과 관련될 수 있다: K+가 간극 연접 네트워크에 의해 유모 세포로부터 멀리 운반되지 않는 경우에, K+ 축적은 유모 세포를 탈분극시켜 Ca2+ 유입 및 궁극적인 세포 사멸을 유발할 수 있다. 간극 연접 네트워크는 또한 글루코스 및 영양소를 혈관으로부터 감각 상피로 수송하는 데 요구될 수 있고, 그의 부재는 세포 사멸로 이어질 수 있다.GJB2 expression is important for cochlear function. For example, K + entering hair cells via transduction channels and released via basal K + channels is shuttled from the Organ of Corti by the epithelial system and transported by the cytoplasmic system to the vasculature, where it is returned to the endolymph. pumped up In addition, GJB2 is such that even though hair cells do not express Gjb2, mice lacking GJB2 protein in the inner ear have reduced cochlear potential and profound apoptotic loss of hair and supporting cells by postnatal day 30 (P30). Because of this, it plays a role in the development of the cochlea (Cohen-Salmon et al., 2002; Wang et al., 2009; Sun et al., 2009; Crispino et al., 2011; Johnson et al., 2017) . When Gjb2 is deleted after P6, the phenotype is much milder (Chang et al., 2015). However, a long-term requirement for the GJB2 protein remains: hair cell loss occurs after several months as late as P14 despite deletion (Ma et al., 2020). Without wishing to be bound by any particular theory, GJB2's function in the shuttling of K + may be related to its role in the development of the cochlea: where K + is not transported away from the hair cell by the gap junction network. , K + accumulation can depolarize hair cells, causing Ca 2+ influx and eventual cell death. Gap junction networks may also be required to transport glucose and nutrients from blood vessels to the sensory epithelium, the absence of which may lead to cell death.
일부 실시양태에서, 본 개시내용은 발현 카세트에 플랭킹된 2개의 아데노-연관 바이러스 (AAV) 역전된 말단 반복부 (ITR)를 포함하는 단리된 핵산을 제공하며, 여기서 발현 카세트는 GJB2 유전자 조절 요소 (GRE)를 코딩하는 뉴클레오티드 서열에 작동가능하게 연결된 프로모터 (예를 들어, 인간 GJB2 프로모터), 및 간극 연접 베타 2 (GJB2) 단백질을 코딩하는 뉴클레오티드 서열을 포함한다. 단리된 핵산 내의 천연 GJB2 유전자 조절 요소 및/또는 조직/세포-특이적 프로모터의 혼입은 이를 정상적으로 발현하는 세포 (예를 들어, 와우의 결합 조직 세포 (섬유세포 포함) 및 코르티 기관 및 근처 영역의 지지 세포)에서 GJB2 유전자의 발현을 용이하게 한다. 본원에 사용된 발현 카세트는 벡터 및 그의 조절 서열을 갖는 세포에 의해 발현될 단백질 코딩 서열을 포함하는 벡터 DNA의 성분을 지칭한다. 일단 표적 세포에 전달되면, 발현 카세트는 세포의 기구가 RNA 및/또는 단백질(들) (예를 들어, GJB2 단백질)을 제조하도록 지시한다.In some embodiments, the present disclosure provides an isolated nucleic acid comprising two adeno-associated virus (AAV) inverted terminal repeats (ITRs) flanked by an expression cassette, wherein the expression cassette comprises a GJB2 gene regulatory element. (GRE) (eg, human GJB2 promoter), and a nucleotide sequence encoding gap junction beta 2 (GJB2) protein. Incorporation of native GJB2 gene regulatory elements and/or tissue/cell-specific promoters within the isolated nucleic acid can be used in cells that normally express it (e.g., connective tissue cells of the cochlea (including fibrocytes) and support of the organ of Corti and nearby regions). cells) to facilitate the expression of the GJB2 gene. Expression cassette, as used herein, refers to a component of a vector DNA that contains a protein coding sequence to be expressed by a cell having the vector and its regulatory sequences. Once delivered to the target cell, the expression cassette directs the cell's machinery to produce RNA and/or protein(s) (eg, the GJB2 protein).
"핵산" 서열은 DNA 또는 RNA 서열을 지칭한다. 일부 실시양태에서, 본 개시내용의 단백질 및 핵산은 단리된다. 본원에 사용된 용어 "단리된"은 인공적으로 생산된 것을 의미한다. 핵산과 관련하여 본원에 사용된 용어 "단리된"은 (i) 예를 들어 폴리머라제 연쇄 반응 (PCR)에 의해 시험관내 증폭되거나; (ii) 클로닝에 의해 재조합적으로 생산되거나; (iii) 예를 들어 절단 및 겔 분리에 의해 정제되거나; 또는 (iv) 예를 들어 화학적 합성에 의해 합성되는 것을 의미한다. 단리된 핵산은 관련 기술분야에 널리 공지된 재조합 DNA 기술에 의해 용이하게 조작가능한 것이다. 따라서, 5' 및 3' 제한 부위가 공지되어 있거나 또는 폴리머라제 연쇄 반응 (PCR) 프라이머 서열이 개시된 벡터에 함유된 뉴클레오티드 서열은 단리된 것으로 간주되지만, 그의 천연 숙주에서 그의 천연 상태로 존재하는 핵산 서열은 그렇지 않다. 단리된 핵산은 실질적으로 정제될 수 있지만, 그럴 필요는 없다. 예를 들어, 클로닝 또는 발현 벡터 내에서 단리된 핵산은 그것이 존재하는 세포 내에 이 물질을 단지 작은 백분율로만 포함할 수 있다는 점에서 순수하지 않다. 그러나, 이러한 핵산은 관련 기술분야의 통상의 기술자에게 공지된 표준 기술에 의해 용이하게 조작가능하기 때문에 그 용어가 본원에 사용된 바와 같이 단리된다. 단백질 또는 펩티드와 관련하여 본원에 사용된 용어 "단리된"은 단백질 또는 펩티드가 그의 자연 환경으로부터 단리되거나 또는 인공적으로 생산된 (예를 들어, 화학적 합성, 재조합 DNA 기술 등에 의함) 것을 지칭한다.A “nucleic acid” sequence refers to a DNA or RNA sequence. In some embodiments, proteins and nucleic acids of the present disclosure are isolated. As used herein, the term "isolated" means artificially produced. As used herein, the term “isolated” in reference to a nucleic acid means (i) amplified in vitro, for example by polymerase chain reaction (PCR); (ii) produced recombinantly by cloning; (iii) purified, for example by cleavage and gel separation; or (iv) synthesized, for example by chemical synthesis. Isolated nucleic acids are readily manipulable by recombinant DNA techniques well known in the art. Thus, nucleotide sequences contained in vectors for which 5' and 3' restriction sites are known or for which polymerase chain reaction (PCR) primer sequences are disclosed are considered isolated, but nucleic acid sequences that exist in their native state in their natural host. is not Isolated nucleic acids may, but need not be substantially purified. For example, a nucleic acid isolated within a cloning or expression vector is not pure in that it may contain only a small percentage of this material within the cells in which it resides. However, such nucleic acids are readily manipulable by standard techniques known to those skilled in the art, and as such the term is isolated as used herein. As used herein, the term "isolated" in reference to a protein or peptide refers to the protein or peptide being isolated from its natural environment or produced artificially (eg, by chemical synthesis, recombinant DNA techniques, etc.).
일부 실시양태에서, GJB2 단백질은 인간 GJB2 단백질이다. 일부 실시양태에서, 인간 GJB2 단백질은 서열식별번호: 1에 대해 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 동일한 아미노산 서열을 포함한다.In some embodiments, the GJB2 protein is a human GJB2 protein. In some embodiments, the human GJB2 protein is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% relative to SEQ ID NO:1 %, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical amino acid sequences.
예시적인 인간 GJB2 단백질 서열은 서열식별번호: 1에 제시된다.An exemplary human GJB2 protein sequence is set forth in SEQ ID NO:1.
일부 실시양태에서, 단리된 핵산의 발현 카세트는 서열식별번호 1에 제시된 아미노산 서열을 갖는 인간 GJB2 단백질을 코딩한다. 일부 실시양태에서, 인간 GJB2 단백질을 코딩하는 뉴클레오티드 서열은 서열식별번호: 2에 대해 적어도 50%, 적어도 60%, 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다.In some embodiments, the expression cassette of the isolated nucleic acid encodes a human GJB2 protein having the amino acid sequence set forth in SEQ ID NO:1. In some embodiments, the nucleotide sequence encoding the human GJB2 protein is at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least relative to SEQ ID NO:2 at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical nucleotide sequences.
인간 GJB2 단백질을 코딩하는 예시적인 뉴클레오티드 서열은 서열식별번호: 2에 제시된다.An exemplary nucleotide sequence encoding human GJB2 protein is set forth in SEQ ID NO:2.
일부 실시양태에서, GJB2 단백질은 마우스 GJB2 단백질이다. 일부 실시양태에서, 마우스 GJB2 단백질은 서열식별번호: 3에 대해 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 동일한 아미노산 서열을 포함한다.In some embodiments, the GJB2 protein is a mouse GJB2 protein. In some embodiments, the mouse GJB2 protein is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% relative to SEQ ID NO:3 %, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical amino acid sequences.
예시적인 마우스 GJB2 단백질 서열은 서열식별번호: 3에 제시된다.An exemplary mouse GJB2 protein sequence is set forth in SEQ ID NO:3.
일부 실시양태에서, 단리된 핵산은 서열식별번호 3에 제시된 아미노산 서열을 갖는 마우스 GJB2 단백질을 코딩하는 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, 마우스 GJB2 단백질을 코딩하는 뉴클레오티드 서열은 서열식별번호: 4에 대해 적어도 50%, 적어도 60%, 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다.In some embodiments, the isolated nucleic acid comprises a nucleotide sequence encoding a mouse GJB2 protein having the amino acid sequence set forth in SEQ ID NO:3. In some embodiments, the nucleotide sequence encoding the mouse GJB2 protein is at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least relative to SEQ ID NO:4 at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical nucleotide sequences.
마우스 GJB2 단백질을 코딩하는 예시적인 뉴클레오티드 서열은 서열식별번호: 4에 제시된다.An exemplary nucleotide sequence encoding the mouse GJB2 protein is set forth in SEQ ID NO:4.
일부 실시양태에서, GJB2 단백질을 코딩하는 뉴클레오티드 서열은 숙주 (예를 들어, 인간)에서의 발현을 위해 코돈 최적화된다. 본원에 기재된 "코돈 최적화"는 코돈을 목적하는 세포에서 최대 단백질 발현 효율을 증가시키는 것으로 공지된 코돈으로 변경시키는 설계 과정을 지칭한다. 일부 대안에서, 코돈 최적화가 기재되며, 여기서 코돈 최적화는 높은 단백질 수율을 위해 최적화된 합성 유전자 전사체를 생성하기 위해 관련 기술분야의 통상의 기술자에게 공지된 알고리즘을 사용함으로써 수행될 수 있다. 코돈 최적화를 위한 알고리즘을 함유하는 프로그램은 관련 기술분야의 통상의 기술자에게 공지되어 있다. 프로그램은, 예를 들어 옵티멈진(OptimumGene)™, 진GPS(GeneGPS)® 알고리즘 등을 포함할 수 있다. 추가로, 합성 코돈 최적화된 서열은, 예를 들어 인테그레이티드 DNA 테크놀로지스(Integrated DNA Technologies) 및 다른 상업적으로 입수가능한 DNA 서열분석 서비스로부터 상업적으로 수득될 수 있다.In some embodiments, the nucleotide sequence encoding the GJB2 protein is codon optimized for expression in a host (eg, human). "Codon optimization" as described herein refers to a design process in which codons are altered to codons known to increase the efficiency of maximal protein expression in a desired cell. In some alternatives, codon optimization is described, where codon optimization can be performed using algorithms known to those skilled in the art to generate synthetic gene transcripts optimized for high protein yield. Programs containing algorithms for codon optimization are known to those skilled in the art. Programs may include, for example, OptimumGene™, GeneGPS® algorithms, and the like. Additionally, synthetic codon optimized sequences can be obtained commercially from, for example, Integrated DNA Technologies and other commercially available DNA sequencing services.
본원에 사용된 용어 "서열 동일성"은 서열을 정렬하고 필요한 경우에 갭을 도입하여 최대 퍼센트 동일성을 달성한 후의, 참조 서열, 예를 들어 본원에 개시된 GJB2 단백질 및 그의 코딩 서열의 아미노산 (또는 핵산) 잔기와 동일한 후보 서열의 아미노산 (또는 핵산) 잔기의 백분율을 지칭한다 (예를 들어, 갭은 최적 정렬을 위해 후보 및 참조 서열 중 하나 또는 둘 다에 도입될 수 있고, 비-상동 서열은 비교 목적을 위해 무시될 수 있음). 아미노산 서열 또는 핵산 코딩 서열의 변경은 참조 서열의 잔기의 결실, 부가 또는 치환에 의해 수득될 수 있다. 퍼센트 동일성을 결정하기 위한 목적의 정렬은 관련 기술분야의 기술 내에 있는 다양한 방식으로, 예를 들어 공중 이용가능한 컴퓨터 소프트웨어, 예컨대 BLAST, BLAST-2, BLAST-P, BLAST-N, BLAST-X, WU-BLAST-2, ALIGN, ALIGN-2, CLUSTAL, 또는 메갈라인 (DNASTAR) 소프트웨어를 사용하여 달성될 수 있다. 관련 기술분야의 통상의 기술자는 비교되는 서열의 전장에 걸쳐 최대 정렬을 달성하는 데 필요한 임의의 알고리즘을 포함한, 정렬을 측정하기 위한 적절한 파라미터를 결정할 수 있다. 예를 들어, 소정의 참조 서열에 대한, 이와의 또는 이에 대항한 소정의 후보 서열 (대안적으로, 소정의 참조 서열에 대한, 이와의 또는 이에 대항한 특정 퍼센트 아미노산 (또는 핵산) 서열 동일성을 갖거나 또는 이를 포함하는 소정의 후보 서열로서 표현될 수 있음)의 퍼센트 아미노산 (또는 핵산) 서열 동일성은 하기와 같이 계산되며:As used herein, the term "sequence identity" refers to the amino acids (or nucleic acids) of a reference sequence, e.g., the GJB2 protein and its coding sequence disclosed herein, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent identity. Refers to the percentage of amino acid (or nucleic acid) residues of a candidate sequence that are identical to residues (e.g., gaps may be introduced in one or both of the candidate and reference sequences for optimal alignment, and non-homologous sequences are used for comparison purposes may be overridden for Alterations in the amino acid sequence or nucleic acid coding sequence may be obtained by deletion, addition or substitution of residues in the reference sequence. Alignment for the purpose of determining percent identity can be performed in a variety of ways that are within the skill of the art, for example by using publicly available computer software such as BLAST, BLAST-2, BLAST-P, BLAST-N, BLAST-X, WU - can be achieved using BLAST-2, ALIGN, ALIGN-2, CLUSTAL, or Megaline (DNASTAR) software. One skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms necessary to achieve maximal alignment over the full length of the sequences being compared. For example, a given candidate sequence to, with or against a given reference sequence (alternatively, with a certain percent amino acid (or nucleic acid) sequence identity to, with or against a given reference sequence). The percent amino acid (or nucleic acid) sequence identity of a given candidate sequence that contains or contains the same is calculated as:
100 x (A/B의 분율)100 x (fraction of A/B)
여기서 A는 후보 서열 및 참조 서열의 정렬에서 동일한 것으로 점수화된 아미노산 (또는 핵산) 잔기의 수이고, B는 참조 서열 내의 아미노산 (또는 핵산) 잔기의 총 수이다. 특히, 후보 서열과의 비교를 위해 정렬된 참조 서열은 후보 서열이 후보 서열의 전장 또는 후보 서열의 인접 아미노산 (또는 핵산) 잔기의 선택된 부분에 걸쳐 예를 들어 50% 내지 100% 동일성을 나타낸다는 것을 보여줄 수 있다. 비교 목적을 위해 정렬된 후보 서열의 길이는 참조 서열의 길이의 적어도 30%, 예를 들어 적어도 40%, 예를 들어 적어도 50%, 60%, 70%, 80%, 90%, 95%, 98%, 99%, 또는 100%이다. 후보 서열 내의 위치가 참조 서열 (예를 들어, GJB2 아미노산 서열, 코딩 서열, GJB2 유전자 조절 요소 (GRE)에 대한 뉴클레오티드 서열, 또는 본원에 기재된 임의의 다른 서열) 내의 상응하는 위치와 동일한 아미노산 (또는 핵산) 잔기에 의해 점유되는 경우에, 분자는 그 위치에서 동일하다.where A is the number of amino acid (or nucleic acid) residues scored identical in the alignment of the candidate sequence and the reference sequence, and B is the total number of amino acid (or nucleic acid) residues in the reference sequence. In particular, a reference sequence aligned for comparison with a candidate sequence indicates that the candidate sequence exhibits, for example, 50% to 100% identity over the full length of the candidate sequence or over a selected portion of contiguous amino acid (or nucleic acid) residues of the candidate sequence. can show The length of a candidate sequence aligned for comparison purposes is at least 30%, such as at least 40%, such as at least 50%, 60%, 70%, 80%, 90%, 95%, 98% of the length of the reference sequence. %, 99%, or 100%. an amino acid (or nucleic acid) whose position in the candidate sequence is identical to the corresponding position in a reference sequence (e.g., a GJB2 amino acid sequence, a coding sequence, a nucleotide sequence for a GJB2 gene regulatory element (GRE), or any other sequence described herein) ) residue, the molecules are identical at that position.
본원에 기재된 단리된 핵산 서열의 발현 카세트 (예를 들어, GJB2 단백질을 코딩하는 뉴클레오티드 서열을 포함하는 단리된 핵산의 발현 카세트)는 코딩 서열 (예를 들어, GJB2 단백질 코딩 서열)에 작동가능하게 연결된 프로모터를 추가로 포함할 수 있다. "프로모터"는 유전자의 전사를 개시하는 데 필요한, 세포의 합성 기구 또는 도입된 합성 기구에 의해 인식되는 DNA 서열을 지칭한다. "작동가능하게 연결된", "제어 하에 있는", 또는 "전사 제어 하에 있는"이라는 어구는 프로모터가 RNA 폴리머라제 개시 및 유전자의 발현을 제어하는 데에 있어서 핵산과 관련하여 올바른 위치 및 배향으로 존재함을 의미한다. 프로모터는 구성적 프로모터, 유도성 프로모터, 또는 조직-특이적 프로모터일 수 있다.An expression cassette of an isolated nucleic acid sequence described herein (eg, an expression cassette of an isolated nucleic acid comprising a nucleotide sequence encoding a GJB2 protein) is operably linked to a coding sequence (eg, a GJB2 protein coding sequence). A promoter may additionally be included. "Promoter" refers to a DNA sequence recognized by a cell's synthetic machinery or introduced synthetic machinery, which is necessary to initiate transcription of a gene. The phrase “operably linked,” “under control,” or “under transcriptional control” means that the promoter is in the correct position and orientation with respect to the nucleic acid to control RNA polymerase initiation and expression of the gene. means A promoter can be a constitutive promoter, an inducible promoter, or a tissue-specific promoter.
일부 실시양태에서, 프로모터는 조직/세포-특이적 프로모터이다. 본원에 사용된 조직/세포 특이적 프로모터는 오직 특정 세포 유형에서만 활성을 갖는 프로모터를 지칭한다. 일부 실시양태에서, 본원에 기재된 단리된 핵산에서 사용되는 프로모터는 GJB2 유전자를 정상적으로 발현하는 와우 세포에서 활성을 가진다. 본원에 기재된 단리된 핵산에서 조직/세포-특이적 프로모터의 사용은 원치않는 트랜스진 (예를 들어, GJB2 유전자) 발현을 제한할 뿐만 아니라 지속적인 트랜스진 발현을 용이하게 할 수 있다. 일부 실시양태에서, 단리된 핵산의 발현 카세트는 조직/세포 특이적 프로모터를 포함한다. 일부 실시양태에서, 단리된 핵산의 발현 카세트는 GJB2 프로모터 (예를 들어, 세포 특이적 GJB2 발현이 요구되는 임의의 종에 대한 GJB2 프로모터)를 포함한다. 일부 실시양태에서, 단리된 핵산의 발현 카세트는 인간 GJB2 프로모터를 포함한다. 일부 실시양태에서, 단리된 핵산의 발현 카세트는 인간 GJB2 프로모터의 임의의 연속적인 뉴클레오티드의 적어도 300 bp (예를 들어, 300 bp, 400 bp, 500 bp, 600 bp, 700 bp, 또는 그 초과)를 포함한다. 일부 실시양태에서, 단리된 핵산의 발현 카세트는 인간 GJB2 프로모터의 500 bp 연속 뉴클레오티드를 갖는 프로모터를 포함한다. 일부 실시양태에서, 단리된 핵산의 발현 카세트는 서열식별번호: 5에 대해 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 동일한 뉴클레오티드 서열을 갖는 프로모터를 포함한다. 인간 GJB2 프로모터의 500 bp의 예시적인 뉴클레오티드 서열은 서열식별번호: 5에 제시된다.In some embodiments, the promoter is a tissue/cell-specific promoter. A tissue/cell specific promoter as used herein refers to a promoter that is active only in certain cell types. In some embodiments, the promoter used in the isolated nucleic acids described herein is active in cochlear cells that normally express the GJB2 gene. The use of tissue/cell-specific promoters in the isolated nucleic acids described herein can facilitate constitutive transgene expression as well as restrict unwanted transgene (eg, GJB2 gene) expression. In some embodiments, an expression cassette of an isolated nucleic acid includes a tissue/cell specific promoter. In some embodiments, the expression cassette of the isolated nucleic acid includes a GJB2 promoter (eg, a GJB2 promoter for any species in which cell specific GJB2 expression is desired). In some embodiments, the expression cassette of the isolated nucleic acid comprises a human GJB2 promoter. In some embodiments, an expression cassette of an isolated nucleic acid comprises at least 300 bp (e.g., 300 bp, 400 bp, 500 bp, 600 bp, 700 bp, or more) of any contiguous nucleotides of a human GJB2 promoter. include In some embodiments, the expression cassette of the isolated nucleic acid comprises a promoter having 500 bp contiguous nucleotides of a human GJB2 promoter. In some embodiments, an expression cassette of an isolated nucleic acid is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93% relative to SEQ ID NO:5 , a promoter that has a nucleotide sequence that is at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical. An exemplary nucleotide sequence of 500 bp of the human GJB2 promoter is set forth in SEQ ID NO:5.
일부 실시양태에서, 단리된 핵산의 발현 카세트는 GJB2 기저 프로모터 (예를 들어, 인간 GJB2 기저 프로모터)를 포함한다. GJB2 기저 프로모터는 상이한 종 (예를 들어, 인간 및 마우스)에 걸쳐 고도로 보존된 GJB2 유전자의 프로모터 영역이다. GJB2 기저 프로모터는, 예를 들어 문헌 [Tu, Z. J., and Kiang, D. T. (1998). Mapping and characterization of the basal promoter of the human connexin26 gene. Biochim. Biophys. Acta 1443, 169-181; Kiang, D. T., Jin, N., Tu, Z. J., and Lin, H. H. (1997). Upstream genomic sequence of the human connexin26 gene. Gene 199, 165-171; and Castillo et al., DFNB1 Non-syndromic Hearing Impairment: Diversity of Mutations and Associated Phenotypes, Front. Mol. Neurosci., 22 December 2017]에 이전에 기재되었고, 이들 각각은 본원에 참조로 포함된다. 일부 실시양태에서, 단리된 핵산의 발현 카세트는 서열식별번호: 47에 대해 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 동일한 뉴클레오티드 서열을 갖는 GJB2 기저 프로모터를 포함한다. 인간 GJB2 기저 프로모터의 예시적인 뉴클레오티드 서열은 서열식별번호: 47에 제시된다.In some embodiments, the expression cassette of the isolated nucleic acid comprises a GJB2 basal promoter (eg, a human GJB2 basal promoter). The GJB2 basal promoter is the promoter region of the GJB2 gene that is highly conserved across different species (eg, human and mouse). The GJB2 basal promoter is described, for example, in Tu, Z. J., and Kiang, D. T. (1998). Mapping and characterization of the basal promoter of the human connexin26 gene. Biochim. Biophys. Acta 1443, 169-181; Kiang, D. T., Jin, N., Tu, Z. J., and Lin, H. H. (1997). Upstream genomic sequence of the human connexin26 gene. Gene 199, 165-171; and Castillo et al., DFNB1 Non-syndromic Hearing Impairment: Diversity of Mutations and Associated Phenotypes, Front. Mol. Neurosci., 22 December 2017, each of which is incorporated herein by reference. In some embodiments, an expression cassette of an isolated nucleic acid is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93% relative to SEQ ID NO:47 , a GJB2 base promoter that has a nucleotide sequence that is at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical. An exemplary nucleotide sequence of the human GJB2 basal promoter is set forth in SEQ ID NO:47.
구성적 프로모터의 예는, 비제한적으로, 레트로바이러스 라우스 육종 바이러스 (RSV) 긴 말단 반복부 (LTR) 프로모터 (임의로 RSV 인핸서를 가짐), 시토메갈로바이러스 (CMV) 프로모터 (임의로 CMV 인핸서를 가짐) (예를 들어, 문헌 [Boshart et al., Cell, 41:521-530 (1985)] 참조), 원숭이 공포형성 바이러스 40 (SV40) 프로모터, 디히드로폴레이트 리덕타제 프로모터, β-액틴 프로모터, 포스포글리세롤 키나제 (PGK) 프로모터, 및 신장 인자 1-알파 1 (EF1α) 프로모터를 포함한다. 일부 실시양태에서, 프로모터는 닭 베타-액틴 (CBA) 프로모터이다. 일부 실시양태에서, 프로모터는 증진된 닭 β-액틴 프로모터이다. 일부 실시양태에서, 프로모터는 U6 프로모터이다. CBA 프로모터는 모든 세포 유형에서 구성적으로 활성이기 때문에, 본원에 기재된 단리된 핵산에서 CBA 프로모터를 사용하는 것은 GJB2 단백질을 정상적으로 발현하지 않는 세포 (예를 들어, 와우의 유모 세포)를 포함한 모든 세포 유형에서 GJB2 단백질의 혼재성 발현을 유도한다. 따라서, 일부 실시양태에서, CBA 프로모터는 본원에 기재된 단리된 핵산에 사용되지 않는다.Examples of constitutive promoters include, but are not limited to, the retroviral Rous Sarcoma Virus (RSV) long terminal repeat (LTR) promoter (optionally with an RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with a CMV enhancer) ( See, eg, Boshart et al., Cell, 41:521-530 (1985)), monkey vacuolization virus 40 (SV40) promoter, dihydrofolate reductase promoter, β-actin promoter, phospho glycerol kinase (PGK) promoter, and elongation factor 1-alpha 1 (EF1α) promoter. In some embodiments, the promoter is the chicken beta-actin (CBA) promoter. In some embodiments, the promoter is the enhanced chicken β-actin promoter. In some embodiments, the promoter is a U6 promoter. Because the CBA promoter is constitutively active in all cell types, use of the CBA promoter in the isolated nucleic acids described herein can be used in all cell types, including cells that do not normally express the GJB2 protein (eg, hair cells of the cochlea). induces heterogeneous expression of the GJB2 protein. Thus, in some embodiments, a CBA promoter is not used in the isolated nucleic acids described herein.
유도성 프로모터는 유전자 발현의 조절을 가능하게 하고, 외인성으로 공급된 화합물, 환경 인자, 예컨대 온도, 또는 특정 생리학적 상태, 예를 들어 급성기, 세포의 특정한 분화 상태의 존재에 의해, 또는 오직 복제 세포에서만 조절될 수 있다. 유도성 프로모터 및 유도성 시스템은 인비트로젠(Invitrogen), 클론테크(Clontech) 및 아리아드(Ariad)를 포함하나 이에 제한되지는 않는 다양한 상업적 공급원으로부터 입수가능하다. 다수의 다른 프로모터가 기재되어 있고, 관련 기술분야의 통상의 기술자에 의해 용이하게 선택될 수 있다. 외인성으로 공급되는 프로모터에 의해 조절되는 유도성 프로모터의 예는 아연-유도성 양 메탈로티오닌 (MT) 프로모터, 덱사메타손 (Dex)-유도성 마우스 유방 종양 바이러스 (MMTV) 프로모터, T7 폴리머라제 프로모터 시스템 (WO 98/10088); 엑디손 곤충 프로모터 (No et al., Proc. Natl. Acad. Sci. USA, 93:3346-3351 (1996)), 테트라시클린-억제성 시스템 (Gossen et al., Proc. Natl. Acad. Sci. USA, 89:5547-5551 (1992)), 테트라시클린-유도성 시스템 (Gossen et al., Science, 268:1766-1769 (1995), 또한 문헌 [Harvey et al., Curr. Opin. Chem. Biol., 2:512-518 (1998)] 참조), RU486-유도성 시스템 (Wang et al., Nat. Biotech., 15:239-243 (1997) 및 Wang et al., Gene Ther., 4:432-441 (1997)) 및 라파마이신-유도성 시스템 (Magari et al., J. Clin. Invest., 100:2865-2872 (1997))을 포함한다.Inducible promoters allow for the regulation of gene expression, by the presence of exogenously supplied compounds, environmental factors such as temperature, or specific physiological states such as acute phase, specific differentiation states of cells, or only replicating cells can be adjusted only in Inducible promoters and inducible systems are available from a variety of commercial sources including, but not limited to, Invitrogen, Clontech and Ariad. A number of other promoters have been described and can be readily selected by those skilled in the art. Examples of inducible promoters regulated by exogenously supplied promoters include the zinc-inducible sheep metallothioneine (MT) promoter, the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter, the T7 polymerase promoter system (WO 98/10088); Ecdysone insect promoter (No et al., Proc. Natl. Acad. Sci. USA, 93:3346-3351 (1996)), tetracycline-repressor system (Gossen et al., Proc. Natl. Acad. Sci. USA, 89:5547-5551 (1992)), tetracycline-derived systems (Gossen et al., Science, 268:1766-1769 (1995), see also Harvey et al., Curr. Opin. Chem. Biol., 2:512-518 (1998)), the RU486-inducible system (Wang et al., Nat. Biotech., 15:239-243 (1997) and Wang et al., Gene Ther., 4:432-441 (1997)) and rapamycin-inducible systems (Magari et al., J. Clin. Invest., 100:2865-2872 (1997)).
일부 실시양태에서, 단리된 핵산은 유전자 조절 요소 (GRE) (예를 들어, GJB2 GRE)를 포함한다. 본원에 사용된 유전자 조절 요소는 유전자 발현의 조절에 수반되는 다양한 DNA 서열을 지칭한다. 예를 들어, GRE는 DNA, 세포 단백질 (예를 들어, 히스톤), 및 전사 인자를 수반하는 상호작용에 의존하여 유전자 발현을 조절할 수 있다.In some embodiments, an isolated nucleic acid comprises a genetic regulatory element (GRE) (eg, GJB2 GRE). Gene regulatory elements as used herein refer to various DNA sequences involved in the regulation of gene expression. For example, GREs can regulate gene expression depending on interactions involving DNA, cellular proteins (eg, histones), and transcription factors.
일부 실시양태에서, 단리된 핵산은 시스-조절 요소 (예를 들어, GJB2 유전자에 대한 시스-조절 요소)인 유전자 조절 요소를 포함한다. 시스-조절 요소는 이웃 유전자의 전사를 조절하는 비-코딩 DNA의 영역이다. 시스-조절 요소는 이들이 조절하는 유전자 부근에서 발견된다. 시스-조절 요소는 전형적으로 전사 인자에 결합함으로써 유전자 전사를 조절한다. 일부 실시양태에서, 유전자 조절 요소는 세포-특이적 유전자 발현 능력 (예를 들어, 세포 특이적 GJB2 유전자 발현)을 부여한다. 일부 실시양태에서, 유전자 조절 요소는 GJB2 유전자와 연관된 시스-조절 요소이다.In some embodiments, the isolated nucleic acid comprises a gene regulatory element that is a cis-regulatory element (eg, a cis-regulatory element for the GJB2 gene). Cis-regulatory elements are regions of non-coding DNA that regulate the transcription of neighboring genes. Cis-regulatory elements are found in the vicinity of the genes they regulate. Cis-regulatory elements typically regulate gene transcription by binding to transcription factors. In some embodiments, a genetic regulatory element confers cell-specific gene expression capability (eg, cell-specific GJB2 gene expression). In some embodiments, the genetic regulatory element is a cis-regulatory element associated with the GJB2 gene.
일부 실시양태에서, GJB2 유전자의 시스-조절 요소는 인핸서이다. 본원에 사용된 인핸서는 부위-특이적 전사 인자와 상호작용하여 세포-유형 특이적 방식으로 유전자 발현을 조절할 수 있는, 프로모터에 비해 전사 개시 부위에 대해 더 원위에 위치하는 DNA 서열을 지칭한다. 인핸서는 세포에서 전사 인자의 집합에 결합함으로써 세포-특이적 유전자 발현 조절을 부여하며, 이는 다양한 메카니즘, 예를 들어 번역후 히스톤 변형을 촉매하는 후성적 효소의 동원, 및 DNA 루핑을 촉진하는 보조인자의 동원을 통한 전사 활성화 또는 억제로 이어진다. 인핸서는 이들이 조절하는 유전자 부근에서, 또는 그의 표적 유전자로부터 수백 킬로베이스의 거리에서 확인될 수 있다. 복수의 인핸서가 유전자 발현을 조절하기 위해 상가적으로 및 중복적으로 작용할 수 있다 (예를 들어, 문헌 [Doane et al., Regulatory elements in molecular networks, Wiley Interdiscip Rev Syst Biol Med. 2017 May; 9(3)]). 일부 실시양태에서, 본원에 기재된 인핸서는 게놈 GJB2 유전자 발현을 조절할 수 있는 인핸서이다. 일부 실시양태에서, GJB2 인핸서는 GJB2 유전자의 전사상 활성 서열에서 확인된다. 본원에 사용된 전사상 활성 서열은 서열이 노출되어 전사 인자의 결합 및 전사가 일어나게 하도록 DNA가 개방 염색질 입체형태로 존재하는 염색체 내의 DNA의 영역을 지칭한다. 일부 실시양태에서, GJB2 인핸서는 게놈 GJB2 유전자의 대략 1000 kb 이내 (예를 들어, GJB2 유전자의 1000 kb 이내, 900 kb 이내, 800 kb 이내, 700 kb 이내, 600 kb 이내, 500 kb 이내, 450 kb 이내, 400 kb 이내, 350 kb 이내, 300 kb 이내, 250 kb 이내, 200 kb 이내, 150 kb 이내, 100 kb 이내, 95 kb 이내, 90 kb 이내, 85 kb 이내, 85 kb 이내, 80 kb 이내, 75 kb 이내, 70 kb 이내, 65 kb 이내, 60 kb 이내, 55 kb 이내, 50 kb 이내, 45 kb 이내, 40 kb 이내, 35 kb 이내, 30 kb 이내, 25 kb 이내, 20 kb 이내, 15 kb 이내, 10 kb 이내, 또는 그 미만의 상류 또는 하류)에서 확인된다. 일부 실시양태에서, GJB2 인핸서는 GJB2 유전자의 대략 200 kb 이내에서 확인된다. 일부 실시양태에서, GJB2 인핸서는 GJB2 유전자의 대략 95 kb 이내 (예를 들어, 도 3c에 열거된 영역 C-M)에서 확인된다. 일부 실시양태에서, GJB2 인핸서는 표 1에 열거된 GJB2 유전자 근처의 DNA 서열의 영역 (도 3c) 내에 있다.In some embodiments, the cis-regulatory element of the GJB2 gene is an enhancer. Enhancer, as used herein, refers to a DNA sequence located more distal to the transcription initiation site than a promoter, capable of interacting with site-specific transcription factors to regulate gene expression in a cell-type specific manner. Enhancers confer cell-specific gene expression regulation by binding to a set of transcription factors in cells, which can be achieved through a variety of mechanisms, including the recruitment of epigenetic enzymes that catalyze post-translational histone modifications, and cofactors that promote DNA looping. leading to transcriptional activation or repression through the recruitment of Enhancers can be identified in the vicinity of the genes they regulate, or at a distance of several hundred kilobases from their target genes. Multiple enhancers can act additively and redundantly to regulate gene expression (see, e.g., Doane et al., Regulatory elements in molecular networks, Wiley Interdiscip Rev Syst Biol Med. 2017 May; 9( 3)]). In some embodiments, an enhancer described herein is an enhancer capable of modulating genomic GJB2 gene expression. In some embodiments, a GJB2 enhancer is identified in the transcriptionally active sequence of the GJB2 gene. Transcriptionally active sequence, as used herein, refers to a region of DNA within a chromosome where the DNA exists in an open chromatin conformation such that the sequence is exposed to allow binding of transcription factors and transcription to occur. In some embodiments, the GJB2 enhancer is within approximately 1000 kb of the genomic GJB2 gene (e.g., within 1000 kb, within 900 kb, within 800 kb, within 700 kb, within 600 kb, within 500 kb, 450 kb of the GJB2 gene) within, within 400 kb, within 350 kb, within 300 kb, within 250 kb, within 200 kb, within 150 kb, within 100 kb, within 95 kb, within 90 kb, within 85 kb, within 85 kb, within 80 kb, Within 75 kb, within 70 kb, within 65 kb, within 60 kb, within 55 kb, within 50 kb, within 45 kb, within 40 kb, within 35 kb, within 30 kb, within 25 kb, within 20 kb, within 15 kb within, upstream or downstream within 10 kb, or less). In some embodiments, a GJB2 enhancer is identified within approximately 200 kb of the GJB2 gene. In some embodiments, a GJB2 enhancer is identified within approximately 95 kb of the GJB2 gene (eg, region C-M listed in FIG. 3C). In some embodiments, the GJB2 enhancer is within a region of DNA sequence proximal to the GJB2 gene listed in Table 1 (FIG. 3C).
표 1. GJB2 인핸서를 포함하는 인간 및 마우스 DNA 영역.Table 1. Human and mouse DNA regions containing GJB2 enhancers.
일부 실시양태에서, GJB2 GRE (예를 들어, GJB2 인핸서) 서열은 표 2에 열거된 영역 서열로부터 확인될 수 있다. 일부 실시양태에서, GJB2 GRE (예를 들어, GJB2 인핸서)는 표 2에 기재된 임의의 영역 서열 (예를 들어, 인간 GJB2 영역 A-M 또는 마우스 Gjb2 영역 A-M)에서 적어도 200개, 적어도 250개, 적어도 300개, 적어도 350개, 적어도 400개, 적어도 450개, 적어도 500개, 적어도 550개, 적어도 600개, 적어도 650개, 적어도 700개, 적어도 750개, 적어도 800개, 적어도 850개, 적어도 900개, 적어도 1000개, 적어도 1100개, 적어도 1200개, 적어도 1300개, 적어도 1400개, 적어도 1500개, 적어도 1600개, 적어도 1700개, 적어도 1800개, 적어도 1900개, 적어도 2000개, 적어도 2100개, 적어도 2200개, 적어도 2300개, 적어도 2400개, 적어도 2500개, 적어도 2600개, 적어도 2700개, 적어도 2800개, 적어도 2800개, 적어도 2900개, 적어도 3000개, 적어도 3100개, 적어도 3200개, 적어도 3300개, 적어도 3400개, 적어도 3500개, 적어도 3600개, 적어도 3700개, 적어도 3800개, 적어도 3900개, 적어도 4000개, 적어도 4100개, 적어도 4200개, 적어도 4200개, 적어도 4400개, 적어도 4500개, 적어도 4600개, 적어도 4700개, 적어도 4800개, 적어도 4900개, 적어도 5000개, 또는 그 초과의 연속 뉴클레오티드를 포함한다. 일부 실시양태에서, GJB2 GRE (예를 들어, GJB2 인핸서)는 GJB2 유전자의 전사상 활성 영역 (예를 들어, 영역 A 및/또는 B)으로 확인된다. 일부 실시양태에서, GJB2 GRE (예를 들어, GJB2 인핸서)는 영역 A 및/또는 B 내에 적어도 200개, 적어도 250개, 적어도 300개, 적어도 350개, 적어도 400개, 적어도 450개, 적어도 500개, 적어도 550개, 적어도 600개, 적어도 650개, 적어도 700개, 적어도 750개, 적어도 800개, 적어도 850개, 적어도 900개, 적어도 1000개, 적어도 1100개, 적어도 1200개, 적어도 1300개, 적어도 1400개, 적어도 1500개, 적어도 1600개, 적어도 1700개, 적어도 1800개, 적어도 1900개, 적어도 2000개, 적어도 2100개, 적어도 2200개, 적어도 2300개, 적어도 2400개, 적어도 2500개, 적어도 2600개, 적어도 2700개, 적어도 2800개, 적어도 2800개, 적어도 2900개, 적어도 3000개, 적어도 3100개, 적어도 3200개, 적어도 3300개, 적어도 3400개, 적어도 3500개, 적어도 3600개, 적어도 3700개, 적어도 3800개, 적어도 3900개, 적어도 4000개, 적어도 4100개, 적어도 4200개, 적어도 4200개, 적어도 4400개, 적어도 4500개, 적어도 4600개, 적어도 4700개, 적어도 4800개, 적어도 4900개, 적어도 5000개, 또는 그 초과의 연속 뉴클레오티드를 포함한다. 일부 실시양태에서, GJB2 GRE (예를 들어, GJB2 인핸서)는 영역 C-M 내에 적어도 200개, 적어도 250개, 적어도 300개, 적어도 350개, 적어도 400개, 적어도 450개, 적어도 500개, 적어도 550개, 적어도 600개, 적어도 650개, 적어도 700개, 적어도 750개, 적어도 800개, 적어도 850개, 적어도 900개, 적어도 1000개, 적어도 1100개, 적어도 1200개, 적어도 1300개, 적어도 1400개, 적어도 1500개, 적어도 1600개, 적어도 1700개, 적어도 1800개, 적어도 1900개, 적어도 2000개, 적어도 2100개, 적어도 2200개, 적어도 2300개, 적어도 2400개, 적어도 2500개, 적어도 2600개, 적어도 2700개, 적어도 2800개, 적어도 2800개, 적어도 2900개, 적어도 3000개, 적어도 3100개, 적어도 3200개, 적어도 3300개, 적어도 3400개, 적어도 3500개, 적어도 3600개, 적어도 3700개, 적어도 3800개, 적어도 3900개, 적어도 4000개, 적어도 4100개, 적어도 4200개, 적어도 4200개, 적어도 4400개, 적어도 4500개, 적어도 4600개, 적어도 4700개, 적어도 4800개, 적어도 4900개, 적어도 5000개, 또는 그 초과의 연속적인 뉴클레오티드를 포함한다. 일부 실시양태에서, GJB2 GRE (예를 들어, GJB2 인핸서)는 표 3에 열거된 영역 중의 뉴클레오티드 서열을 포함한다.In some embodiments, a GJB2 GRE (eg, GJB2 enhancer) sequence can be identified from the region sequences listed in Table 2. In some embodiments, the GJB2 GRE (eg, GJB2 enhancer) is at least 200, at least 250, at least 300 in any region sequence set forth in Table 2 (eg, human GJB2 region A-M or mouse Gjb2 region A-M). at least 350, at least 400, at least 450, at least 500, at least 550, at least 600, at least 650, at least 700, at least 750, at least 800, at least 850, at least 900; At least 1000, at least 1100, at least 1200, at least 1300, at least 1400, at least 1500, at least 1600, at least 1700, at least 1800, at least 1900, at least 2000, at least 2100, at least 2200 at least 2300, at least 2400, at least 2500, at least 2600, at least 2700, at least 2800, at least 2800, at least 2900, at least 3000, at least 3100, at least 3200, at least 3300; At least 3400, at least 3500, at least 3600, at least 3700, at least 3800, at least 3900, at least 4000, at least 4100, at least 4200, at least 4200, at least 4400, at least 4500, at least 4600 , at least 4700, at least 4800, at least 4900, at least 5000, or more contiguous nucleotides. In some embodiments, a GJB2 GRE (eg, a GJB2 enhancer) is identified in a transcriptionally active region (eg, region A and/or B) of the GJB2 gene. In some embodiments, the GJB2 GRE (eg, GJB2 enhancer) is at least 200, at least 250, at least 300, at least 350, at least 400, at least 450, at least 500 in region A and/or B , at least 550, at least 600, at least 650, at least 700, at least 750, at least 800, at least 850, at least 900, at least 1000, at least 1100, at least 1200, at least 1300, at least 1400, at least 1500, at least 1600, at least 1700, at least 1800, at least 1900, at least 2000, at least 2100, at least 2200, at least 2300, at least 2400, at least 2500, at least 2600 , at least 2700, at least 2800, at least 2800, at least 2900, at least 3000, at least 3100, at least 3200, at least 3300, at least 3400, at least 3500, at least 3600, at least 3700, at least 3800, at least 3900, at least 4000, at least 4100, at least 4200, at least 4200, at least 4400, at least 4500, at least 4600, at least 4700, at least 4800, at least 4900, at least 5000 , or more contiguous nucleotides. In some embodiments, the GJB2 GRE (eg, GJB2 enhancer) is at least 200, at least 250, at least 300, at least 350, at least 400, at least 450, at least 500, at least 550 within regions C-M. , at least 600, at least 650, at least 700, at least 750, at least 800, at least 850, at least 900, at least 1000, at least 1100, at least 1200, at least 1300, at least 1400, at least 1500, at least 1600, at least 1700, at least 1800, at least 1900, at least 2000, at least 2100, at least 2200, at least 2300, at least 2400, at least 2500, at least 2600, at least 2700 , at least 2800, at least 2800, at least 2900, at least 3000, at least 3100, at least 3200, at least 3300, at least 3400, at least 3500, at least 3600, at least 3700, at least 3800, at least 3900, at least 4000, at least 4100, at least 4200, at least 4200, at least 4400, at least 4500, at least 4600, at least 4700, at least 4800, at least 4900, at least 5000, or more contains consecutive nucleotides of In some embodiments, a GJB2 GRE (eg, a GJB2 enhancer) comprises a nucleotide sequence in a region listed in Table 3.
일부 실시양태에서, GJB2 GRE (예를 들어, GJB2 인핸서)는 게놈 내의 GJB2 코딩 서열의 센스 가닥 상에 위치한다. 일부 실시양태에서, GJB2 GRE (예를 들어, GJB2 인핸서)는 게놈 내의 GJB2 코딩 서열의 역 상보체 가닥 상에 위치한다. 본원에 기재된 바와 같은 인핸서 서열을 사용하여 벡터를 설계할 때 적절한 서열 (예를 들어, 센스 가닥 상의 GRE 서열, 또는 역 상보체 가닥 상의 GRE 서열)을 선택하는 것은 관련 기술분야의 기술 내에 있다.In some embodiments, a GJB2 GRE (eg, a GJB2 enhancer) is located on the sense strand of a GJB2 coding sequence in a genome. In some embodiments, a GJB2 GRE (eg, a GJB2 enhancer) is located on the reverse complement strand of a GJB2 coding sequence in a genome. It is within the skill of the art to select an appropriate sequence (eg, a GRE sequence on the sense strand, or a GRE sequence on the reverse complement strand) when designing a vector using an enhancer sequence as described herein.
일부 실시양태에서, GJB2 GRE (예를 들어, GJB2 인핸서)는 적어도 200개, 적어도 250개, 적어도 300개, 적어도 350개, 적어도 400개, 적어도 450개, 적어도 500개, 적어도 550개, 적어도 600개, 적어도 650개, 적어도 700개, 적어도 750개, 적어도 800개, 적어도 850개, 적어도 900개, 적어도 1000개, 적어도 1100개, 적어도 1200개, 적어도 1300개, 적어도 1400개, 적어도 1500개, 적어도 1600개, 적어도 1700개, 적어도 1800개, 적어도 1900개, 적어도 2000개, 적어도 2100개, 적어도 2200개, 적어도 2300개, 적어도 2400개, 적어도 2500개, 적어도 2600개, 적어도 2700개, 적어도 2800개, 적어도 2800개, 적어도 2900개, 적어도 3000개, 적어도 3100개, 적어도 3200개, 적어도 3300개, 적어도 3400개, 적어도 3500개, 적어도 3600개, 적어도 3700개, 적어도 3800개, 적어도 3900개, 적어도 4000개, 적어도 4100개, 적어도 4200개, 적어도 4200개, 적어도 4400개, 적어도 4500개, 적어도 4600개, 적어도 4700개, 적어도 4800개, 적어도 4900개, 적어도 5000개, 또는 그 초과의 뉴클레오티드를 포함한다. 일부 실시양태에서, GJB2 GRE (예를 들어, GJB2 인핸서)는 200-500개의 뉴클레오티드 또는 그 사이의 임의의 수의 뉴클레오티드, 300-600개의 뉴클레오티드 또는 그 사이의 임의의 수의 뉴클레오티드, 400-700개의 뉴클레오티드 또는 그 사이의 임의의 수의 뉴클레오티드, 500-800개의 뉴클레오티드 또는 그 사이의 임의의 수의 뉴클레오티드, 600-900개의 뉴클레오티드 또는 그 사이의 임의의 수의 뉴클레오티드, 700-1000개의 뉴클레오티드 또는 그 사이의 임의의 수의 뉴클레오티드, 1000-1500개의 뉴클레오티드 또는 그 사이의 임의의 수의 뉴클레오티드, 1500-2000개의 뉴클레오티드 또는 그 사이의 임의의 수의 뉴클레오티드를 포함한다. 일부 실시양태에서, GJB2 GRE (예를 들어, GJB2 인핸서)는 700개의 뉴클레오티드를 포함한다.In some embodiments, a GJB2 GRE (e.g., a GJB2 enhancer) is at least 200, at least 250, at least 300, at least 350, at least 400, at least 450, at least 500, at least 550, at least 600 at least 650, at least 700, at least 750, at least 800, at least 850, at least 900, at least 1000, at least 1100, at least 1200, at least 1300, at least 1400, at least 1500, At least 1600, at least 1700, at least 1800, at least 1900, at least 2000, at least 2100, at least 2200, at least 2300, at least 2400, at least 2500, at least 2600, at least 2700, at least 2800 at least 2800, at least 2900, at least 3000, at least 3100, at least 3200, at least 3300, at least 3400, at least 3500, at least 3600, at least 3700, at least 3800, at least 3900; at least 4000, at least 4100, at least 4200, at least 4200, at least 4400, at least 4500, at least 4600, at least 4700, at least 4800, at least 4900, at least 5000, or more nucleotides include In some embodiments, a GJB2 GRE (e.g., a GJB2 enhancer) is 200-500 nucleotides or any number of nucleotides in between, 300-600 nucleotides or any number of nucleotides in between, 400-700 nucleotides in between. nucleotides or any number of nucleotides in between, 500-800 nucleotides or any number of nucleotides in between, 600-900 nucleotides or any number of nucleotides in between, 700-1000 nucleotides or any number in between Any number of nucleotides, 1000-1500 nucleotides or any number of nucleotides in between, 1500-2000 nucleotides or any number in between. In some embodiments, a GJB2 GRE (eg, a GJB2 enhancer) comprises 700 nucleotides.
일부 실시양태에서, GJB2 GRE는 인간 GJB2 인핸서이다. 일부 실시양태에서, GJB2 GRE (예를 들어, 인간 GJB2 인핸서)는 표 3에 열거된 바와 같은 GRE 서열 중 어느 하나에 대해 적어도 60%, 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다.In some embodiments, the GJB2 GRE is a human GJB2 enhancer. In some embodiments, a GJB2 GRE (e.g., a human GJB2 enhancer) is at least 60%, at least 70%, at least 75%, at least 80%, at least 85% relative to any one of the GRE sequences as listed in Table 3. , at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical nucleotide sequences .
표 3: 인간 GJB2 인핸서 서열Table 3: Human GJB2 enhancer sequences
일부 실시양태에서, GJB2 GRE는 비-인간 영장류 (예를 들어, 시노몰구스 마카크) GJB2 인핸서이다. 일부 실시양태에서, GJB2 GRE (예를 들어, 시노몰구스 마카크 GJB2 인핸서)는 표 4에 열거된 바와 같은 GRE 서열 중 어느 하나에 대해 적어도 60%, 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 동일한 뉴클레오티드 서열을 포함한다.In some embodiments, the GJB2 GRE is a non-human primate (eg, Cynomolgus macaque) GJB2 enhancer. In some embodiments, a GJB2 GRE (e.g., a Cynomolgus macaque GJB2 enhancer) is at least 60%, at least 70%, at least 75%, at least 80% relative to any one of the GRE sequences as listed in Table 4. , at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical nucleotides contains sequence.
표 4: 시노몰구스 마카크 GJB2 (mfGJB2) 인핸서 서열Table 4: Cynomolgus macaque GJB2 (mfGJB2) enhancer sequences
일부 실시양태에서, 인간 GJB2 GRE는 mfGJB2 GRE와 상동성을 공유한다. 일부 실시양태에서, 인간 GJB2 GRE는 표 5에 제시된 바와 같은 mfGJB2 GRE에 상응한다:In some embodiments, the human GJB2 GRE shares homology with the mfGJB2 GRE. In some embodiments, the human GJB2 GRE corresponds to the mfGJB2 GRE as shown in Table 5:
표 5: 인간 GJB2 GRE와 mfGJB2 GRE 사이의 상동성Table 5: Homology between human GJB2 GRE and mfGJB2 GRE
일부 실시양태에서, 단리된 핵산은 1개 이상 (예를 들어, 1, 2, 3, 4, 5, 6, 7, 9개 또는 그 초과)의 인핸서 (예를 들어, GJB2 인핸서)를 포함한다. 일부 실시양태에서, 단리된 핵산은 1개 초과의 인핸서를 포함하고, 1개 초과의 인핸서는 동일한 인핸서 또는 상이한 인핸서이다. 일부 실시양태에서, GJB2 GRE는 프로모터의 5'에 위치한다. 다른 실시양태에서, GJB2 GRE는 프로모터의 3'에 위치한다. 일부 실시양태에서, 단리된 핵산 중 GJB2 인핸서(들)의 존재는 단리된 핵산에 의해 코딩되는 GJB2 단백질의 세포-유형 특이적 발현을 촉진시킨다. 일부 실시양태에서, GJB2 유전자를 정상적으로 발현하는 세포 (예를 들어, 섬유세포 및 코르티 기관 및 근처 영역의 지지 세포)는 GJB2 인핸서에 의해 조절되는 GJB2 발현을 활성화시키는 전사 네트워크를 갖지만, GJB2를 정상적으로 발현하지 않는 세포 (예를 들어, 유모 세포 및 나선 신경절 뉴런)에서는 그렇지 않다.In some embodiments, an isolated nucleic acid comprises one or more (eg, 1, 2, 3, 4, 5, 6, 7, 9 or more) enhancers (eg, the GJB2 enhancer). . In some embodiments, an isolated nucleic acid comprises more than one enhancer, and the more than one enhancer is the same enhancer or a different enhancer. In some embodiments, the GJB2 GRE is located 5' to the promoter. In another embodiment, the GJB2 GRE is located 3' of the promoter. In some embodiments, the presence of the GJB2 enhancer(s) in the isolated nucleic acid promotes cell-type specific expression of the GJB2 protein encoded by the isolated nucleic acid. In some embodiments, cells that normally express the GJB2 gene (eg, fibrocytes and support cells of the organ of Corti and nearby regions) have a transcriptional network that activates GJB2 expression regulated by the GJB2 enhancer, but normally express GJB2. This is not the case in cells that do not (e.g., hair cells and spiral ganglion neurons).
일부 실시양태에서, 단리된 핵산의 발현 카세트는 5' UTR을 추가로 포함한다. 일부 실시양태에서, 5' UTR은 게놈 GJB2 유전자의 천연 5' UTR이다. 5' 비번역 영역 (5' UTR) (리더 서열 또는 리더 RNA로도 공지됨)은 개시 코돈의 바로 상류에 있는 mRNA의 영역이다. 5' UTR은 하류 유전자 (예를 들어, GJB2 유전자)의 전사 및 번역 조절 둘 다에서 중요한 역할을 한다. 일부 실시양태에서, GJB2 5' UTR을 코딩하는 뉴클레오티드 서열을 포함하는 단리된 핵산은 또한 세포-특이적 방식으로 (예를 들어, 이를 정상적으로 발현하는 세포에서 GJB2를 발현함) GJB2를 발현할 수 있다. 일부 실시양태에서, GJB2 5' UTR을 코딩하는 뉴클레오티드 서열은 전장 인간 GJB2 유전자 5' UTR을 코딩하는 뉴클레오티드 서열의 부분을 포함한다. 일부 실시양태에서, 5' UTR은 인간 GJB2 유전자 엑손 1 5' UTR이다. 일부 실시양태에서, 5' UTR을 코딩하는 뉴클레오티드 서열은 천연 전장 5' UTR (예를 들어, 인간 GJB2 유전자 엑손 1 5' UTR)의 적어도 100개의 연속 뉴클레오티드, 적어도 200개의 연속 뉴클레오티드, 적어도 300개의 연속 뉴클레오티드, 적어도 400개의 연속 뉴클레오티드, 적어도 500개의 연속 뉴클레오티드, 적어도 600개의 연속 뉴클레오티드, 적어도 700개의 연속 뉴클레오티드, 적어도 800개의 연속 뉴클레오티드, 적어도 900개의 연속 뉴클레오티드, 적어도 1000개의 연속 뉴클레오티드, 또는 그 초과를 포함한다. 일부 실시양태에서, 발현 카세트는 인간 GJB2 유전자 5' UTR (예를 들어, 인간 GJB2 엑손 1 5' UTR)을 코딩하는 뉴클레오티드 서열에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 5' UTR을 코딩하는 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, 발현 카세트는 서열식별번호: 53에 제시된 인간 GJB2 유전자 5' UTR (예를 들어, 인간 GJB2 유전자 엑손 1 5' UTR)의 연속적인 300 bp를 코딩하는 뉴클레오티드 서열에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 5' UTR을 코딩하는 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, 인간 GJB2 유전자 엑손 1 5' UTR의 300 bp를 코딩하는 예시적인 뉴클레오티드 서열은 서열식별번호: 53에 기재된 뉴클레오티드 서열을 갖는다.In some embodiments, the expression cassette of the isolated nucleic acid further comprises a 5' UTR. In some embodiments, the 5' UTR is the native 5' UTR of the genomic GJB2 gene. The 5' untranslated region (5' UTR) (also known as leader sequence or leader RNA) is the region of mRNA immediately upstream of the initiation codon. The 5' UTR plays an important role in both transcriptional and translational regulation of downstream genes (eg, the GJB2 gene). In some embodiments, an isolated nucleic acid comprising a nucleotide sequence encoding a GJB2 5' UTR may also express GJB2 in a cell-specific manner (eg, express GJB2 in a cell that normally expresses it) . In some embodiments, the nucleotide sequence encoding the GJB2 5' UTR comprises a portion of the nucleotide sequence encoding the full-length human GJB2 gene 5' UTR. In some embodiments, the 5' UTR is the human
일부 실시양태에서, 세포 특이적 GJB2 발현은 기저 프로모터 및 GJB2 5' UTR 또는 그의 부분 (기저 프로모터/5' UTR)을 코딩하는 뉴클레오티드 서열의 혼입에 의해 달성된다. 일부 실시양태에서, 발현 카세트 (예를 들어, GJB2 발현 카세트)는 5' UTR을 코딩하는 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, 단리된 핵산은 1개 이상의 GJB2 GRE (예를 들어, GJB2 인핸서)를 코딩하는 추가의 뉴클레오티드 서열을 추가로 포함할 수 있다. GJB2 GRE를 코딩하는 뉴클레오티드 서열 및 기저 프로모터/5' UTR을 코딩하는 뉴클레오티드 서열은 임의의 순서로 배치될 수 있다. 일부 실시양태에서, GJB2 GRE를 코딩하는 뉴클레오티드 서열은 기저 프로모터/5' UTR을 코딩하는 뉴클레오티드 서열의 5'에 위치한다. 일부 실시양태에서, GJB2 기저 프로모터/5' UTR을 코딩하는 뉴클레오티드 서열을 포함하는 단리된 핵산은 또한 세포-특이적 방식으로 (예를 들어, 이를 정상적으로 발현하는 세포에서 GJB2를 발현함) GJB2를 발현할 수 있다. 일부 실시양태에서, 기저 프로모터/5' UTR을 코딩하는 뉴클레오티드 서열은 전장 인간 GJB2 유전자 5' UTR을 코딩하는 뉴클레오티드 서열의 부분을 포함한다. 일부 실시양태에서, 5' UTR은 천연 전장 5' UTR (예를 들어, GJB2 5' UTR)의 적어도 100개의 연속 뉴클레오티드, 적어도 200개의 연속 뉴클레오티드, 적어도 300개의 연속 뉴클레오티드, 적어도 400개의 연속 뉴클레오티드, 적어도 500개의 연속 뉴클레오티드, 적어도 600개의 연속 뉴클레오티드, 적어도 700개의 연속 뉴클레오티드, 적어도 800개의 연속 뉴클레오티드, 적어도 900개의 연속 뉴클레오티드, 적어도 1000개의 연속 뉴클레오티드, 또는 그 초과를 포함한다. 일부 실시양태에서, 5' UTR은 인간 GJB2 유전자 엑손 1 5' UTR이다. 일부 실시양태에서, 발현 카세트는 기저 프로모터 및 약 300 bp의 인간 GJB2 유전자 5' UTR (예를 들어, 인간 GJB2 유전자 엑손 1 5' UTR) (서열식별번호: 30)을 코딩하는 뉴클레오티드 서열에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 기저 프로모터/5' UTR을 코딩하는 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, 인간 GJB2 유전자 기저 프로모터/엑손 1 5' UTR의 300 bp를 코딩하는 예시적인 뉴클레오티드 서열은 서열식별번호: 30에 제시된 뉴클레오티드 서열을 갖는다.In some embodiments, cell specific GJB2 expression is achieved by incorporation of a nucleotide sequence encoding a basal promoter and a GJB2 5'UTR or portion thereof (basal promoter/5'UTR). In some embodiments, an expression cassette (eg, a GJB2 expression cassette) comprises a nucleotide sequence encoding a 5' UTR. In some embodiments, an isolated nucleic acid may further comprise additional nucleotide sequences encoding one or more GJB2 GREs (eg, GJB2 enhancers). The nucleotide sequence encoding the GJB2 GRE and the nucleotide sequence encoding the basal promoter/5'UTR can be placed in any order. In some embodiments, the nucleotide sequence encoding the GJB2 GRE is located 5' to the nucleotide sequence encoding the basal promoter/5' UTR. In some embodiments, an isolated nucleic acid comprising a nucleotide sequence encoding a GJB2 basal promoter/5'UTR also expresses GJB2 in a cell-specific manner (eg, expresses GJB2 in a cell that normally expresses it) can do. In some embodiments, the nucleotide sequence encoding the basal promoter/5' UTR comprises a portion of the nucleotide sequence encoding the full-length human GJB2 gene 5' UTR. In some embodiments, a 5' UTR is at least 100 contiguous nucleotides, at least 200 contiguous nucleotides, at least 300 contiguous nucleotides, at least 400 contiguous nucleotides, at least 500 contiguous nucleotides, at least 600 contiguous nucleotides, at least 700 contiguous nucleotides, at least 800 contiguous nucleotides, at least 900 contiguous nucleotides, at least 1000 contiguous nucleotides, or more. In some embodiments, the 5' UTR is the human
일부 실시양태에서, 발현 카세트 (예를 들어, GJB2 발현 카세트) 내의 기저 프로모터/5' UTR (예를 들어, 인간 GJB2 기저 프로모터/엑손 1 5' UTR)을 코딩하는 뉴클레오티드 서열은 인트론 또는 그의 부분을 추가로 포함한다. 일부 실시양태에서, 단리된 핵산의 발현 카세트 (예를 들어, GJB2 발현 카세트)는 GJB2 유전자의 인트론 1의 보존된 서열을 추가로 포함한다. 일부 실시양태에서, 인트론 (예를 들어, 인간 GJB2 인트론 1)을 코딩하는 뉴클레오티드 서열은 서열식별번호: 54에 대해 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는다. GJB2 인트론 1의 보존된 서열을 코딩하는 예시적인 뉴클레오티드 서열은 서열식별번호: 54에 제시된다.In some embodiments, the nucleotide sequence encoding the basal promoter/5'UTR (eg, human GJB2 basal promoter/
일부 실시양태에서, 기저 프로모터/5' UTR/인트론을 코딩하는 뉴클레오티드 서열은 서열식별번호: 31에 대해 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는다. 인간 GJB2 기저 프로모터/5' UTR/인트론 1의 보존된 서열을 코딩하는 예시적인 뉴클레오티드 서열은 서열식별번호: 31에 제시된다.In some embodiments, the nucleotide sequence encoding the basal promoter/5'UTR/intron is at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, relative to SEQ ID NO: 31 have at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence encoding the conserved sequence of human GJB2 basal promoter / 5'UTR /
일부 실시양태에서, 발현 카세트 (예를 들어, GJB2 발현 카세트)는 인간 GJB2 유전자의 근위 프로모터를 코딩하는 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, 인간 GJB2 유전자의 근위 프로모터는 서열식별번호: 102에 제시된 뉴클레오티드 서열에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는다. 일부 실시양태에서, 인간 GJB2 유전자 근위 프로모터를 코딩하는 예시적인 뉴클레오티드 서열은 서열식별번호: 102에 제시된 뉴클레오티드 서열을 갖는다. 일부 실시양태에서, 발현 카세트 (예를 들어, GJB2 발현 카세트)는 서열식별번호: 102를 포함한다.In some embodiments, an expression cassette (eg, a GJB2 expression cassette) comprises a nucleotide sequence encoding a proximal promoter of a human GJB2 gene. In some embodiments, the proximal promoter of the human GJB2 gene is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92% relative to the nucleotide sequence set forth in SEQ ID NO: 102 , at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. In some embodiments, an exemplary nucleotide sequence encoding a promoter proximal to the human GJB2 gene has the nucleotide sequence set forth in SEQ ID NO:102. In some embodiments, an expression cassette (eg, a GJB2 expression cassette) comprises SEQ ID NO:102.
일부 실시양태에서, 발현 카세트 (예를 들어, GJB2 발현 카세트)는 인간 GJB2 유전자의 5' UTR을 코딩하는 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, 인간 GJB2 유전자의 5' UTR은 서열식별번호: 103 또는 CC에 제시된 뉴클레오티드 서열에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는다. 일부 실시양태에서, 인간 GJB2 유전자 5' UTR을 코딩하는 예시적인 뉴클레오티드 서열은 서열식별번호: 103 또는 CC에 제시된 뉴클레오티드 서열을 갖는다. 일부 실시양태에서, 인간 GJB2 유전자 5' UTR을 코딩하는 예시적인 뉴클레오티드 서열은 서열식별번호: 103 및 서열식별번호: 104를 포함하는 뉴클레오티드 서열을 갖는다. 일부 실시양태에서, 발현 카세트 (예를 들어, GJB2 발현 카세트)는 서열식별번호: 103을 포함한다.In some embodiments, an expression cassette (eg, a GJB2 expression cassette) comprises a nucleotide sequence encoding the 5' UTR of a human GJB2 gene. In some embodiments, the 5' UTR of the human GJB2 gene is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, relative to the nucleotide sequence set forth in SEQ ID NO: 103 or CC; at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. In some embodiments, an exemplary nucleotide sequence encoding the human GJB2 gene 5' UTR has the nucleotide sequence set forth in SEQ ID NO: 103 or CC. In some embodiments, an exemplary nucleotide sequence encoding a human GJB2 gene 5' UTR has a nucleotide sequence comprising SEQ ID NO: 103 and SEQ ID NO: 104. In some embodiments, an expression cassette (eg, a GJB2 expression cassette) comprises SEQ ID NO:103.
일부 실시양태에서, 발현 카세트 (예를 들어, GJB2 발현 카세트)는 서열식별번호: 104를 포함한다.In some embodiments, an expression cassette (eg, a GJB2 expression cassette) comprises SEQ ID NO:104.
일부 실시양태에서, 발현 카세트 (예를 들어, GJB2 발현 카세트)는 인간 GJB2 유전자의 근위 프로모터 및 5' UTR을 코딩하는 뉴클레오티드 서열을 포함한다. 일부 실시양태에서, 인간 GJB2 유전자의 근위 프로모터 및 5' UTR은 서열식별번호: 105에 제시된 뉴클레오티드 서열에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는다. 일부 실시양태에서, 인간 GJB2 유전자 근위 프로모터 및 5' UTR을 코딩하는 예시적인 뉴클레오티드 서열은 서열식별번호: 105에 제시된 뉴클레오티드 서열을 갖는다. 일부 실시양태에서, 발현 카세트 (예를 들어, GJB2 발현 카세트)는 서열식별번호: 105를 포함한다.In some embodiments, an expression cassette (eg, a GJB2 expression cassette) comprises a nucleotide sequence encoding a proximal promoter and a 5' UTR of a human GJB2 gene. In some embodiments, the proximal promoter and 5' UTR of the human GJB2 gene are at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91% relative to the nucleotide sequence set forth in SEQ ID NO: 105 , at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. In some embodiments, an exemplary nucleotide sequence encoding a human GJB2 gene proximal promoter and 5' UTR has the nucleotide sequence set forth in SEQ ID NO: 105. In some embodiments, an expression cassette (eg, a GJB2 expression cassette) comprises SEQ ID NO:105.
본원에 기재된 단리된 핵산은 또한 바람직하게는 프로모터/인핸서 서열과 단백질 코딩 서열 (예를 들어, GJB2 단백질을 코딩하는 뉴클레오티드 서열) 사이에 위치하는 인공 인트론을 함유할 수 있다. 일부 실시양태에서, 인트론은 합성 또는 인공 (예를 들어, 이종) 인트론이다. 합성 인트론의 예는 SV-40으로부터 유래된 인트론 서열 (SV-40 T 인트론 서열로 지칭됨) 및 닭 베타-액틴 유전자로부터 유래된 인트론 서열을 포함한다. 일부 실시양태에서, 본 개시내용에 의해 기재된 트랜스진은 1개 이상 (1, 2, 3, 4, 5개, 또는 그 초과)의 인공 인트론을 포함한다. 일부 실시양태에서, 1개 이상의 인공 인트론은 프로모터와 GJB2 단백질을 코딩하는 뉴클레오티드 서열 사이에 위치한다.The isolated nucleic acids described herein may also contain artificial introns, preferably located between the promoter/enhancer sequence and the protein coding sequence (eg, the nucleotide sequence encoding the GJB2 protein). In some embodiments, an intron is a synthetic or artificial (eg, heterologous) intron. Examples of synthetic introns include an intron sequence derived from SV-40 (referred to as the SV-40 T intron sequence) and an intron sequence derived from the chicken beta-actin gene. In some embodiments, a transgene described by this disclosure comprises one or more (1, 2, 3, 4, 5, or more) artificial introns. In some embodiments, one or more artificial introns are located between the promoter and the nucleotide sequence encoding the GJB2 protein.
일부 실시양태에서, 발현 카세트 (예를 들어, GJB2)는 GJB2 단백질을 코딩하는 뉴클레오티드 서열의 3'에 위치하는 3' UTR을 코딩하는 뉴클레오티드 서열을 추가로 포함한다. 일부 실시양태에서, 3' UTR은 GJB2 유전자 3' UTR이다. 일부 실시양태에서, 3'UTR은 GJB2 유전자 엑손 2 3' UTR이다. 일부 실시양태에서, 3' UTR을 코딩하는 뉴클레오티드 서열은 서열식별번호: 32에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는다. GJB2 유전자 엑손 2 3' UTR을 코딩하는 예시적인 뉴클레오티드 서열은 서열식별번호: 32에 제시된다.In some embodiments, the expression cassette (eg, GJB2) further comprises a nucleotide sequence encoding a 3' UTR located 3' to the nucleotide sequence encoding the GJB2 protein. In some embodiments, the 3' UTR is the GJB2 gene 3' UTR. In some embodiments, the 3'UTR is the
일부 실시양태에서, 단리된 핵산의 발현 카세트는 세포 유형 (예를 들어, 유모 세포 또는 나선상 신경절 뉴런)에서 트랜스진 발현 (예를 들어, GJB2 발현)을 제한하거나 감소시키는 탈표적화제를 포함한다. 일부 실시양태에서, 1개 이상의 miRNA 결합 부위의 발현 내로의 혼입은 세포-유형 특이적 방식으로 (예를 들어, 유모 세포 또는 나선 신경절 뉴런에서) 트랜스진 발현의 탈표적화를 가능하게 한다. 일부 실시양태에서, 1개 이상의 miRNA 결합 부위는 3' UTR (예를 들어, 단리된 핵산의 발현 카세트의 GJB2 엑손 2 3' UTR)에 위치한다.In some embodiments, the expression cassette of the isolated nucleic acid comprises a detargeting agent that limits or reduces transgene expression (eg, GJB2 expression) in a cell type (eg, hair cell or spiral ganglion neuron). In some embodiments, incorporation of one or more miRNA binding sites into expression allows off-targeting of transgene expression in a cell-type specific manner (eg, in hair cells or spiral ganglion neurons). In some embodiments, the one or more miRNA binding sites are located in the 3' UTR (eg, the
일부 실시양태에서, 발현 카세트는 GJB2를 정상적으로 발현하지 않는 세포 (예를 들어, 유모 세포 또는 나선 신경절 뉴런)로부터 GJB2의 발현을 탈표적화하는 1개 이상 (예를 들어, 1, 2, 3, 4, 5개 또는 그 초과)의 miRNA 결합 부위를 포함한다. 일부 실시양태에서, 단리된 핵산의 발현 카세트는 뉴런 세포 (예를 들어, 나선상 신경절 뉴런)를 탈표적화하기 위한 1개 이상의 miR 결합 부위, 예를 들어 문헌 [Jovicic et al., Comprehensive Expression Analyses of Neural Cell-Type-Specific miRNAs Identify New Determinants of the Specification and Maintenance of Neuronal Phenotypes, J Neurosci. 2013 Mar 20; 33(12): 5127-5137] (이는 본원에 참조로 포함됨)에 기재된 바와 같은 뉴런 풍부화된 miR에 대한 결합 부위를 포함한다. 뉴런 풍부화된 miR의 비제한적 예는 miR-124, miR-127, miR-129, miR-129*, miR-136, miR-136*, miR-137, miR-154, miR-300-3p, miR-323, miR-329, miR-341, miR-369-5p, miR-376a, miR-376b-3p, miR-376c, miR-379, miR-382, miR-382*, miR-410, miR-411, miR-433, miR-434, miR-495, miR-541, miR-543*, miR-551b, miR-143, miR-449a, miR-219-2-3p, miR-126, miR-126*, miR-141, miR-142-3p, miR-142-5p, miR-146a, miR-150, miR-200c, 또는 miR-223을 포함한다. 일부 실시양태에서, 단리된 핵산의 발현 카세트는 유모 세포 (예를 들어, 내유모 또는 외유모 세포)를 탈표적화하기 위한 1개 이상의 miR 결합 부위, 예를 들어 문헌 [Li et al., MicroRNAs in hair cell development and deafness, Curr Opin Otolaryngol Head Neck Surg. 2010 Oct; 18(5): 459-465] (이는 본원에 참조로 포함됨)에 기재된 바와 같은 유모 세포 풍부화된 miR에 대한 결합 부위를 포함한다. 뉴런 풍부화된 miR의 비제한적인 예는 miR-96, miR-182, miR-183, miR-18a, 또는 miR-99a를 포함한다. 일부 실시양태에서, 발현 카세트의 GJB2 엑손 2 3' UTR은 뉴런 세포 및 유모 세포를 탈표적화하기 위한 1개 이상의 miR 결합 부위를 포함한다. 일부 실시양태에서, 발현 카세트의 GJB2 엑손 2 3' UTR은 miR-124에 대한 1개 이상의 miR 결합 부위를 포함한다.In some embodiments, the expression cassette comprises one or more (e.g., 1, 2, 3, 4 , 5 or more) miRNA binding sites. In some embodiments, the expression cassette of the isolated nucleic acid has one or more miR binding sites for detargeting neuronal cells (eg, spiral ganglion neurons), such as those described in Jovicic et al., Comprehensive Expression Analyzes of Neural Cell-Type-Specific miRNAs Identify New Determinants of the Specification and Maintenance of Neuronal Phenotypes, J Neurosci. 2013
본 개시내용의 측면은 본원에 기재된 바와 같은 단리된 핵산을 포함하는 유전자 요법 벡터에 관한 것이다. 유전자 요법 벡터는 바이러스 벡터 (예를 들어, 렌티바이러스 벡터, 아데노-연관 바이러스 벡터, 아데노바이러스 (Ad) 벡터 등), 플라스미드, 폐쇄형-말단 DNA (예를 들어, ceDNA), 지질/DNA 나노입자 등일 수 있다. 일부 실시양태에서, 유전자 요법은 바이러스 벡터이다. 일부 실시양태에서, 단백질 (예를 들어, GJB2 단백질)을 코딩하는 발현 카세트에는 1개 이상의 바이러스 복제 서열, 예를 들어 렌티바이러스 긴 말단 반복부 (LTR) 또는 아데노-연관 바이러스 (AAV) 역전된 말단 반복부 (ITR)가 플랭킹된다.Aspects of the present disclosure relate to gene therapy vectors comprising an isolated nucleic acid as described herein. Gene therapy vectors include viral vectors (eg, lentiviral vectors, adeno-associated viral vectors, adenovirus (Ad) vectors, etc.), plasmids, closed-end DNA (eg, ceDNA), lipid/DNA nanoparticles etc. In some embodiments, the gene therapy is a viral vector. In some embodiments, an expression cassette encoding a protein (eg, a GJB2 protein) contains one or more viral replication sequences, such as lentiviral long terminal repeats (LTRs) or adeno-associated virus (AAV) inverted ends. The repeats (ITR) are flanked.
본 개시내용의 단리된 핵산은 재조합 아데노-연관 바이러스 (AAV) 벡터 (rAAV 벡터)일 수 있다. 일부 실시양태에서, 본 개시내용에 의해 기재된 바와 같은 단리된 핵산은 2개의 아데노-연관 바이러스 (AAV) 역전된 말단 반복부 (ITR) 서열, 또는 그의 변이체를 포함한다. 단리된 핵산 (예를 들어, 재조합 AAV 벡터)은 캡시드 단백질 내로 패키징되고, 대상체에게 투여되고/거나 선택된 표적 세포에 전달될 수 있다. "재조합 AAV (rAAV) 벡터"는 전형적으로, 최소한 발현 카세트 (예를 들어, GJB2에 대한 발현 카세트), 및 5' 및 3' AAV 역전된 말단 반복부 (ITR)로 구성된다. 단리된 핵산은 또한 예를 들어 5' 및 3' 비번역 영역 (UTR)을 코딩하는 영역, 및/또는 발현 제어 서열 (예를 들어, 폴리-A 테일)을 포함할 수 있다.An isolated nucleic acid of the present disclosure may be a recombinant adeno-associated virus (AAV) vector (rAAV vector). In some embodiments, an isolated nucleic acid as described by this disclosure comprises two adeno-associated virus (AAV) inverted terminal repeat (ITR) sequences, or variants thereof. Isolated nucleic acids (eg, recombinant AAV vectors) can be packaged into capsid proteins, administered to a subject, and/or delivered to selected target cells. A "recombinant AAV (rAAV) vector" typically consists of a minimal expression cassette (eg, an expression cassette for GJB2), and 5' and 3' AAV inverted terminal repeats (ITRs). An isolated nucleic acid may also include, for example, regions encoding 5' and 3' untranslated regions (UTRs), and/or expression control sequences (eg, poly-A tails).
일반적으로, ITR 서열은 길이가 약 145 bp 이다. 바람직하게는, ITR을 코딩하는 실질적으로 전체 서열이 단리된 핵산에 사용되지만, 이들 서열의 어느 정도의 사소한 변형이 허용된다. 이들 ITR 서열을 변형시키는 능력은 관련 기술분야의 기술 범위 내에 있다 (예를 들어, 문헌 [Sambrook et al., Molecular Cloning. A Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory, New York (1989); 및 K. Fisher et al., J Virol., 70:520 532 (1996)]과 같은 문헌 참조). 본 발명에서 사용되는 이러한 분자의 예는 뉴클레오티드 서열 GJB2 단백질 및 GJB2 유전자 조절 요소 (GRE)를 포함하는 발현 카세트에 5' 및 3' AAV ITR 서열이 플랭킹되어 있는, GJB2 단백질을 코딩하는 발현 카세트를 포함하는 단리된 핵산이다. AAV ITR 서열은 현재 확인된 포유동물 AAV 유형을 포함한 임의의 공지된 AAV로부터 수득될 수 있다. 일부 실시양태에서, 단리된 핵산 (예를 들어, rAAV 벡터)은 AAV1, AAV2, AAV5, AAV6, AAV6.2, AAV7, AAV8, AAV9, AAV10, AAV11 및 그의 변이체로부터 선택된 혈청형을 갖는 적어도 1개의 ITR을 포함한다. 일부 실시양태에서, 단리된 핵산은 AAV2 ITR을 코딩하는 영역 (예를 들어, 제1 영역)을 포함한다.Generally, ITR sequences are about 145 bp in length. Preferably, substantially the entire sequence encoding the ITR is used in the isolated nucleic acid, although some minor variations of these sequences are permitted. The ability to modify these ITR sequences is within the skill of the art (eg, Sambrook et al., Molecular Cloning. A Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory, New York (1989); and K. Fisher et al., J Virol., 70:520 532 (1996)). An example of such a molecule used in the present invention is an expression cassette encoding the GJB2 protein, flanked by 5' and 3' AAV ITR sequences, to an expression cassette comprising the nucleotide sequence GJB2 protein and a GJB2 gene regulatory element (GRE). It is an isolated nucleic acid comprising AAV ITR sequences can be obtained from any known AAV, including currently identified mammalian AAV types. In some embodiments, the isolated nucleic acid (e.g., rAAV vector) is at least one having a serotype selected from AAV1, AAV2, AAV5, AAV6, AAV6.2, AAV7, AAV8, AAV9, AAV10, AAV11 and variants thereof. Include ITRs. In some embodiments, an isolated nucleic acid comprises a region (eg, a first region) encoding an AAV2 ITR.
일부 실시양태에서, 단리된 핵산은 제2 AAV ITR을 포함하는 영역 (예를 들어, 제2 영역, 제3 영역, 제4 영역 등)을 추가로 포함한다. 일부 실시양태에서, 제2 AAV ITR은 AAV1, AAV2, AAV5, AAV6, AAV6.2, AAV7, AAV8, AAV9, AAV10, AAV11, 및 그의 변이체로부터 선택된 혈청형을 갖는다. 일부 실시양태에서, 제2 AAV ITR은 AAV2 ITR이다. 일부 실시양태에서, 제2 ITR은 기능적 말단 분해 부위 (TRS)가 결여된 돌연변이체 ITR이다. 용어 "말단 분해 부위가 결여된"은 AAV ITR이 ITR의 말단 분해 부위 (TRS)의 기능을 제거하는 돌연변이 (예를 들어, 센스 돌연변이, 예컨대 비-동의 돌연변이, 또는 미스센스 돌연변이)를 포함하는 것, 또는 말단절단된 AAV ITR이 기능적 TRS를 코딩하는 핵산 서열이 결여된 것 (예를 들어, ΔTRS ITR, 또는 ΔITR)을 지칭할 수 있다. 어떠한 특정한 이론에 얽매이는 것을 원하지는 않지만, 기능적 TRS가 결여된 ITR을 포함하는 rAAV 벡터는, 예를 들어 문헌 [McCarthy (2008) Molecular Therapy 16(10):1648-1656]에 기재된 바와 같이 자기-상보적 rAAV 벡터를 생산한다. 일부 실시양태에서, 단리된 핵산은 5' AAV2 ITR 및 3' AAV2 ITR을 포함한다.In some embodiments, the isolated nucleic acid further comprises a region comprising a second AAV ITR (eg, a second region, a third region, a fourth region, etc.). In some embodiments, the second AAV ITR has a serotype selected from AAV1, AAV2, AAV5, AAV6, AAV6.2, AAV7, AAV8, AAV9, AAV10, AAV11, and variants thereof. In some embodiments, the second AAV ITR is an AAV2 ITR. In some embodiments, the second ITR is a mutant ITR lacking a functional terminal cleavage site (TRS). The term "lacking a terminal cleavage site" means that an AAV ITR includes mutations (e.g., sense mutations, such as non-synonymous mutations, or missense mutations) that eliminate the function of the terminal cleavage site (TRS) of the ITR. , or a truncated AAV ITR lacking a nucleic acid sequence encoding a functional TRS (eg, ΔTRS ITR, or ΔITR). Without wishing to be bound by any particular theory, rAAV vectors comprising an ITR lacking a functional TRS are self-complementary as described, for example, in McCarthy (2008) Molecular Therapy 16(10):1648-1656. Produce an enemy rAAV vector. In some embodiments, an isolated nucleic acid comprises a 5' AAV2 ITR and a 3' AAV2 ITR.
예시적인 5' AAV2 ITR 뉴클레오티드 서열은 서열식별번호: 34에 제시된다.An exemplary 5' AAV2 ITR nucleotide sequence is set forth in SEQ ID NO:34.
예시적인 5' ITR 뉴클레오티드 서열은 서열식별번호: 106에 제시된다.An exemplary 5' ITR nucleotide sequence is set forth in SEQ ID NO: 106.
예시적인 3' AAV2 ITR 뉴클레오티드 서열은 서열식별번호: 35에 제시된다.An exemplary 3' AAV2 ITR nucleotide sequence is set forth in SEQ ID NO:35.
예시적인 3' ITR 뉴클레오티드 서열은 서열식별번호: 107에 제시된다.An exemplary 3' ITR nucleotide sequence is set forth in SEQ ID NO: 107.
일부 실시양태에서, 본원에 기재된 단리된 핵산 (예를 들어, rAAV 벡터)은 서열식별번호: 34 또는 106에 대해 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100%의 서열 동일성을 갖는 5' ITR 서열을 포함한다.In some embodiments, an isolated nucleic acid described herein (eg, a rAAV vector) is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least relative to SEQ ID NO: 34 or 106 at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity; .
일부 실시양태에서, 본원에 기재된 단리된 핵산 (예를 들어, rAAV 벡터)은 서열식별번호: 35 또는 107에 대해 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100%의 서열 동일성을 갖는 3' ITR 서열을 포함한다.In some embodiments, an isolated nucleic acid (eg, a rAAV vector) described herein is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least relative to SEQ ID NO: 35 or 107 at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity; .
일부 실시양태에서, 본원에 기재된 단리된 핵산 (예를 들어, rAAV 벡터)은 전사후 반응 요소를 포함한다. 본원에 사용된 용어 "전사후 반응 요소"는 전사될 때 유전자의 발현을 증진시키는 3차 구조를 채택하는 핵산 서열을 지칭한다. 전사후 조절 요소의 예는 우드척 간염 바이러스 전사후 조절 요소 (WPRE), 마우스 RNA 수송 요소 (RTE), 원숭이 레트로바이러스 유형 1 (SRV-1)의 구성적 수송 요소 (CTE), 메이슨-화이자 원숭이 바이러스 (MPMV)로부터의 CTE, 및 인간 열 쇼크 단백질 70의 5' 비번역 영역 (Hsp70 5' UTR)을 포함하나 이에 제한되지는 않는다. 일부 실시양태에서, 단리된 핵산 (예를 들어, rAAV 벡터)은 우드척 간염 바이러스 전사후 조절 요소 (WPRE)를 포함한다.In some embodiments, an isolated nucleic acid described herein (eg, a rAAV vector) comprises a post-transcriptional response element. As used herein, the term “post-transcriptional response element” refers to a nucleic acid sequence that, when transcribed, adopts a tertiary structure that enhances the expression of a gene. Examples of post-transcriptional regulatory elements include Woodchuck Hepatitis Virus post-transcriptional regulatory element (WPRE), mouse RNA transport element (RTE), constitutive transport element (CTE) of monkey retrovirus type 1 (SRV-1), Mason-Pfizer monkey CTE from virus (MPMV), and the 5' untranslated region of human heat shock protein 70 (Hsp70 5' UTR). In some embodiments, an isolated nucleic acid (eg, a rAAV vector) comprises a Woodchuck hepatitis virus post-transcriptional regulatory element (WPRE).
일부 실시양태에서, 본원에 기재된 단리된 핵산 (예를 들어, rAAV 벡터)은 서열식별번호: 108에 대해 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100%의 서열 동일성을 갖는 전사 후 반응 요소를 포함한다. 예시적인 전사후 반응 요소는 서열식별번호: 108에 제시된다.In some embodiments, an isolated nucleic acid described herein (eg, a rAAV vector) is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91% relative to SEQ ID NO: 108 , at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary post-transcriptional response element is set forth in SEQ ID NO: 108.
일부 실시양태에서, 벡터는 벡터로 형질감염되거나 본 개시내용에 의해 생산된 바이러스로 감염된 세포에서 그의 전사, 번역 및/또는 발현을 허용하는 방식으로 GJB2 코딩 서열의 요소와 작동가능하게 연결된 통상적인 제어 요소를 추가로 포함한다. 발현 제어 서열은 적절한 전사 개시, 종결; 효율적인 RNA 프로세싱 신호, 예컨대 스플라이싱 및 폴리아데닐화 (폴리A) 신호; 세포질 mRNA를 안정화시키는 서열; 번역 효율을 증진시키는 서열 (예를 들어, 코작 컨센서스 서열); 단백질 안정성을 증진시키는 서열을 포함한다. 폴리아데닐화 서열은 일반적으로 코딩 서열 다음에 및 임의로 3' AAV ITR 서열 전에 삽입된다. 본 개시내용에 유용한 rAAV 구축물은 또한 바람직하게는 프로모터/인핸서 서열과 트랜스진 사이에 위치하는 인트론을 함유할 수 있다.In some embodiments, the vector is a conventional control operably linked element of the GJB2 coding sequence in a manner permitting its transcription, translation and/or expression in cells transfected with the vector or infected with the virus produced by the present disclosure. contains additional elements. Expression control sequences include appropriate transcriptional initiation, termination; efficient RNA processing signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize cytoplasmic mRNA; sequences that enhance translational efficiency (eg, Kozak consensus sequence); Contains sequences that enhance protein stability. The polyadenylation sequence is usually inserted after the coding sequence and optionally before the 3' AAV ITR sequence. rAAV constructs useful in the present disclosure may also contain introns, preferably located between the promoter/enhancer sequence and the transgene.
일부 실시양태에서, 본원에 기재된 단리된 핵산 (예를 들어, rAAV 벡터)은 서열식별번호: 109에 대해 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100%의 서열 동일성을 갖는 폴리아데닐화 신호 서열을 포함한다. 예시적인 폴리아데닐화 신호 서열은 서열식별번호: 109에 제시된다.In some embodiments, an isolated nucleic acid described herein (eg, a rAAV vector) is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91% relative to SEQ ID NO: 109 , a polyadenylation signal sequence having at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary polyadenylation signal sequence is set forth in SEQ ID NO: 109.
일부 실시양태에서, 본원에 기재된 AAV 벡터는 GJB2 근위 프로모터 (예를 들어, 서열식별번호: 102), GJB2 5' UTR (예를 들어, 서열식별번호: 103 및 CC), GJB2 유전자 산물을 코딩하는 뉴클레오티드 서열 (예를 들어, 서열식별번호: 2), GJB2 3' UTR (예를 들어, 서열식별번호: 32), WPRE (예를 들어, 서열식별번호: 108), 및 소 성장 호르몬 폴리 A 신호 (예를 들어, 서열식별번호: 109)를 포함한다. 일부 실시양태에서, 본원에 기재된 AAV 벡터는 서열식별번호: 110에 대해 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100%의 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 예시적인 AAV 벡터 서열은 서열식별번호: 110에 제시된다.In some embodiments, an AAV vector described herein encodes a GJB2 proximal promoter (eg, SEQ ID NO: 102), a GJB2 5' UTR (eg, SEQ ID NO: 103 and CC), a GJB2 gene product. nucleotide sequence (eg, SEQ ID NO: 2), GJB2 3' UTR (eg, SEQ ID NO: 32), WPRE (eg, SEQ ID NO: 108), and bovine growth hormone poly A signal (eg SEQ ID NO: 109). In some embodiments, an AAV vector described herein is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, relative to SEQ ID NO: 110; nucleotide sequences having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary AAV vector sequence is set forth in SEQ ID NO: 110.
일부 실시양태에서, 본원에 기재된 AAV 벡터는 5' ITR (예를 들어, 서열식별번호: 106), GJB2 근위 프로모터 (예를 들어, 서열식별번호: 102), GJB2 5' UTR (예를 들어, 서열식별번호: 103 및 CC), GJB2 유전자 산물을 코딩하는 뉴클레오티드 서열 (예를 들어, 서열식별번호: 2), GJB2 3' UTR (예를 들어, 서열식별번호: 32), WPRE (예를 들어, 서열식별번호: 108), 소 성장 호르몬 폴리 A 신호 (예를 들어, 서열식별번호: 109), 및 3' ITR (예를 들어, 서열식별번호: 107)을 포함한다. 일부 실시양태에서, 본원에 기재된 AAV 벡터는 서열식별번호: 111에 대해 적어도 70%, 적어도 75%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100%의 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 예시적인 AAV 벡터 서열은 서열식별번호: 111에 제시된다.In some embodiments, an AAV vector described herein comprises a 5' ITR (eg, SEQ ID NO: 106), a GJB2 proximal promoter (eg, SEQ ID NO: 102), a GJB2 5' UTR (eg, SEQ ID NO: 106). SEQ ID NO: 103 and CC), nucleotide sequence encoding the GJB2 gene product (eg SEQ ID NO: 2), GJB2 3 'UTR (eg SEQ ID NO: 32), WPRE (eg SEQ ID NO: 32) , SEQ ID NO: 108), bovine growth hormone poly A signal (eg SEQ ID NO: 109), and 3' ITR (eg SEQ ID NO: 107). In some embodiments, an AAV vector described herein is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, nucleotide sequences having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary AAV vector sequence is set forth in SEQ ID NO: 111.
일부 실시양태에서, 본원에 기재된 AAV 벡터는 5' ITR, GJB2 기저 프로모터, 5' UTR (예를 들어, GJB2 엑손 1 5' UTR), 코작 서열, 유전자 산물 (예를 들어, GJB2 또는 GFP)을 코딩하는 뉴클레오티드 서열, 임의적인 HA 태그, 3' UTR (예를 들어, GJB2 엑손 2 3' UTR), WPRE, 소 성장 호르몬 폴리 A 신호, 및 3' ITR을 포함한다 (예를 들어, 벡터 c70). 일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 36에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. HA 태그를 갖는 마우스 GJB2 단백질을 코딩하는 벡터 c70에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 36에 제시된다 (마우스 GJB2 코딩 서열은 볼드체로 표시함; HA 태그는 밑줄표시함).In some embodiments, an AAV vector described herein contains a 5' ITR, a GJB2 basal promoter, a 5' UTR (eg,
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 61에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. HA 태그를 갖는 인간 GJB2 단백질을 코딩하는 벡터 c70에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 61에 제시된다 (인간 GJB2 코딩 서열은 볼드체로 표시함; HA 태그는 밑줄표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c70 encoding human GJB2 protein with an HA tag is set forth in SEQ ID NO: 61 (human GJB2 coding sequence in bold; HA tag underlined).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 62에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. HA 태그를 갖는 마우스 GJB2 단백질을 코딩하는 벡터 c70에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 62에 제시된다 (마우스 GJB2 코딩 서열은 볼드체로 표시함; HA 태그 없음).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c70 encoding mouse GJB2 protein with an HA tag is set forth in SEQ ID NO: 62 (mouse GJB2 coding sequence in bold; no HA tag).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 63에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. HA 태그를 갖는 마우스 GJB2 단백질을 코딩하는 벡터 c70에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 63에 제시된다 (인간 GJB2 코딩 서열은 볼드체로 표시함; HA 태그 없음).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, relative to SEQ ID NO: 63; a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c70 encoding mouse GJB2 protein with an HA tag is set forth in SEQ ID NO: 63 (human GJB2 coding sequence in bold; no HA tag).
일부 실시양태에서, 본원에 기재된 AAV 벡터는 AAV 5' ITR, GJB2 GRE 인핸서 (hGJB2 GRE1), GJB2 기저 프로모터, GJB2 엑손 1 5' UTR, 코작 서열, 유전자 산물 (예를 들어, GJB2 또는 GFP)을 코딩하는 뉴클레오티드 서열, GJB2 엑손 2 3' UTR, WPRE, 소 성장 호르몬 폴리 A 신호, 및 AAV 3' ITR을 포함한다 (예를 들어, 벡터 c81.1).In some embodiments, an AAV vector described herein comprises an AAV 5' ITR, a GJB2 GRE enhancer (hGJB2 GRE1), a GJB2 basal promoter, a
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 64에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. eGFP를 코딩하는 벡터 c81.1에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 64에 제시된다 (hGJB2 GRE1은 밑줄표시함; eGFP 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c81.1 encoding eGFP is shown in SEQ ID NO: 64 (hGJB2 GRE1 underlined; eGFP coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 65에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 인간 GJB2를 코딩하는 벡터 c81.1에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 65에 제시된다 (hGJB2 GRE1은 밑줄표시함; 인간 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c81.1 encoding human GJB2 is shown in SEQ ID NO: 65 (hGJB2 GRE1 underlined; human GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 66에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 마우스 GJB2를 코딩하는 벡터 c81.1에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 66에 제시된다 (hGJB2 GRE1은 밑줄표시함; 마우스 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c81.1 encoding mouse GJB2 is shown in SEQ ID NO: 66 (hGJB2 GRE1 underlined; mouse GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AAV 벡터는 AAV 5' ITR, GJB2 GRE 인핸서 (hGJB2 GRE2), GJB2 기저 프로모터, GJB2 엑손 1 5' UTR, 코작 서열, 유전자 산물 (예를 들어, GJB2 또는 GFP)을 코딩하는 뉴클레오티드 서열, GJB2 엑손 2 3' UTR, WPRE, 소 성장 호르몬 폴리 A 신호, 및 AAV 3' ITR을 포함한다 (예를 들어, 벡터 c81.2).In some embodiments, an AAV vector described herein comprises an AAV 5' ITR, a GJB2 GRE enhancer (hGJB2 GRE2), a GJB2 basal promoter, a
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 48에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. eGFP를 코딩하는 벡터 c81.2에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 48에 제시된다 (hGJB2 GRE2는 밑줄표시함; eGFP 코딩 서열은 볼드체로 표시).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c81.2 encoding eGFP is shown in SEQ ID NO: 48 (hGJB2 GRE2 underlined; eGFP coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 67에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 인간 GJB2를 코딩하는 벡터 c81.2에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 67에 제시된다 (hGJB2 GRE2는 밑줄표시함; 인간 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c81.2 encoding human GJB2 is shown in SEQ ID NO: 67 (hGJB2 GRE2 underlined; human GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 68에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 마우스 GJB2를 코딩하는 벡터 c81.2에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 68에 제시된다 (hGJB2 GRE2는 밑줄표시함; 마우스 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c81.2 encoding mouse GJB2 is shown in SEQ ID NO: 68 (hGJB2 GRE2 underlined; mouse GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AAV 벡터는 AAV 5' ITR, GJB2 GRE 인핸서 (hGJB2 GRE3), GJB2 기저 프로모터, GJB2 엑손 1 5' UTR, 코작 서열, 유전자 산물 (예를 들어, GJB2 또는 GFP)을 코딩하는 뉴클레오티드 서열, GJB2 엑손 2 3' UTR, WPRE, 소 성장 호르몬 폴리 A 신호, 및 AAV 3' ITR을 포함한다 (예를 들어, 벡터 c.81.3).In some embodiments, an AAV vector described herein comprises an AAV 5' ITR, a GJB2 GRE enhancer (hGJB2 GRE3), a GJB2 basal promoter, a
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 49에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.3에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 49에 제시된다 (hGJB2 GRE3은 밑줄표시함; eGFP 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.3 is shown in SEQ ID NO: 49 (hGJB2 GRE3 underlined; eGFP coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 70에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.3에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 70에 제시된다 (hGJB2 GRE3은 밑줄표시함; 인간 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.3 is shown in SEQ ID NO: 70 (hGJB2 GRE3 underlined; human GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 71에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.3에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 71에 제시된다 (hGJB2 GRE3은 밑줄표시함; 마우스 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.3 is shown in SEQ ID NO: 71 (hGJB2 GRE3 underlined; mouse GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AAV 벡터는 AAV 5' ITR, GJB2 GRE 인핸서 (hGJB2 GRE4), GJB2 기저 프로모터, GJB2 엑손 1 5' UTR, 코작 서열, 유전자 산물 (예를 들어, GJB2 또는 GFP)을 코딩하는 뉴클레오티드 서열, GJB2 엑손 2 3' UTR, WPRE, 소 성장 호르몬 폴리 A 신호, 및 AAV 3' ITR을 포함한다 (예를 들어, 벡터 c.81.4).In some embodiments, an AAV vector described herein comprises an AAV 5' ITR, a GJB2 GRE enhancer (hGJB2 GRE4), a GJB2 basal promoter, a
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 72에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.4에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 72에 제시된다 (hGJB2 GRE4는 밑줄표시함; eGFP 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.4 is shown in SEQ ID NO: 72 (hGJB2 GRE4 underlined; eGFP coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 73에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.4에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 73에 제시된다 (hGJB2 GRE4는 밑줄표시함; 인간 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.4 is shown in SEQ ID NO: 73 (hGJB2 GRE4 underlined; human GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 74에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.4에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 74에 제시된다 (hGJB2 GRE4는 밑줄표시함; 마우스 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.4 is shown in SEQ ID NO: 74 (hGJB2 GRE4 underlined; mouse GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AAV 벡터는 AAV 5' ITR, GJB2 GRE 인핸서 (hGJB2 GRE5), GJB2 기저 프로모터, GJB2 엑손 1 5' UTR, 코작 서열, 유전자 산물 (예를 들어, GJB2 또는 GFP)을 코딩하는 뉴클레오티드 서열, GJB2 엑손 2 3' UTR, WPRE, 소 성장 호르몬 폴리 A 신호, 및 AAV 3' ITR을 포함한다 (예를 들어, 벡터 c.81.5).In some embodiments, an AAV vector described herein comprises an AAV 5' ITR, a GJB2 GRE enhancer (hGJB2 GRE5), a GJB2 basal promoter, a
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 50에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.5에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 50에 제시된다 (hGJB2 GRE5는 밑줄표시함; eGFP 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.5 is shown in SEQ ID NO: 50 (hGJB2 GRE5 underlined; eGFP coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 75에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.5에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 75에 제시된다 (hGJB2 GRE5는 밑줄표시함; 인간 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.5 is shown in SEQ ID NO: 75 (hGJB2 GRE5 underlined; human GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 76에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.5에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 76에 제시된다 (hGJB2 GRE5는 밑줄표시함; 마우스 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.5 is shown in SEQ ID NO: 76 (hGJB2 GRE5 underlined; mouse GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AAV 벡터는 AAV 5' ITR, GJB2 GRE 인핸서 (hGJB2 GRE7), GJB2 기저 프로모터, GJB2 엑손 1 5' UTR, 코작 서열, 유전자 산물 (예를 들어, GJB2 또는 GFP)을 코딩하는 뉴클레오티드 서열, GJB2 엑손 2 3' UTR, WPRE, 소 성장 호르몬 폴리 A 신호, 및 AAV 3' ITR을 포함한다 (예를 들어, 벡터 c.81.7).In some embodiments, an AAV vector described herein comprises an AAV 5' ITR, a GJB2 GRE enhancer (hGJB2 GRE7), a GJB2 basal promoter, a
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 51에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.7에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 51에 제시된다 (hGJB2 GRE7은 밑줄표시함; eGFP 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.7 is shown in SEQ ID NO: 51 (hGJB2 GRE7 underlined; eGFP coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 77에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.7에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 77에 제시된다 (hGJB2 GRE7은 밑줄표시함; 인간 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.7 is shown in SEQ ID NO: 77 (hGJB2 GRE7 underlined; human GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 78에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.7에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 78에 제시된다 (hGJB2 GRE7은 밑줄표시함; 마우스 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.7 is shown in SEQ ID NO: 78 (hGJB2 GRE7 underlined; mouse GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AAV 벡터는 AAV 5' ITR, GJB2 GRE 인핸서 (hGJB2 GRE8), GJB2 기저 프로모터, GJB2 엑손 1 5' UTR, 코작 서열, 유전자 산물 (예를 들어, GJB2 또는 GFP)을 코딩하는 뉴클레오티드 서열, GJB2 엑손 2 3' UTR, WPRE, 소 성장 호르몬 폴리 A 신호, 및 AAV 3' ITR을 포함한다 (예를 들어, 벡터 c.81.8).In some embodiments, an AAV vector described herein comprises an AAV 5' ITR, a GJB2 GRE enhancer (hGJB2 GRE8), a GJB2 basal promoter, a
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 79에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.8에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 79에 제시된다 (hGJB2 GRE8은 밑줄표시함; eGFP 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.8 is shown in SEQ ID NO: 79 (hGJB2 GRE8 underlined; eGFP coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 80에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.8에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 80에 제시된다 (hGJB2 GRE8은 밑줄표시함; 인간 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.8 is shown in SEQ ID NO: 80 (hGJB2 GRE8 underlined; human GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 81에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.8에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 81에 제시된다 (hGJB2 GRE8은 밑줄표시함; 마우스 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.8 is shown in SEQ ID NO: 81 (hGJB2 GRE8 underlined; mouse GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AAV 벡터는 AAV 5' ITR, GJB2 GRE 인핸서 (hGJB2 GRE9), GJB2 기저 프로모터, GJB2 엑손 1 5' UTR, 코작 서열, 유전자 산물 (예를 들어, GJB2 또는 GFP)을 코딩하는 뉴클레오티드 서열, GJB2 엑손 2 3' UTR, WPRE, 소 성장 호르몬 폴리 A 신호, 및 AAV 3' ITR을 포함한다 (예를 들어, 벡터 c.81.9).In some embodiments, an AAV vector described herein comprises an AAV 5' ITR, a GJB2 GRE enhancer (hGJB2 GRE9), a GJB2 basal promoter, a
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 52에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.9에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 52에 제시된다 (hGJB2 GRE9는 밑줄표시함; eGFP 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.9 is shown in SEQ ID NO: 52 (hGJB2 GRE9 underlined; eGFP coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 82에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.9에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 82에 제시된다 (hGJB2 GRE9는 밑줄표시함; 인간 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, relative to SEQ ID NO: 82; a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.9 is shown in SEQ ID NO: 82 (hGJB2 GRE9 underlined; human GJB2 coding sequence in bold).
일부 실시양태에서, 본원에 기재된 AVV 벡터는 서열식별번호: 83에 대해 적어도 60%, 적어도 70%, 적어도 80%, 적어도 85%, 적어도 90%, 적어도 91%, 적어도 92%, 적어도 93%, 적어도 94%, 적어도 95%, 적어도 96%, 적어도 97%, 적어도 98%, 적어도 99%, 또는 100% 서열 동일성을 갖는 뉴클레오티드 서열을 포함한다. 벡터 c.81.9에 대한 예시적인 뉴클레오티드 서열은 서열식별번호: 83에 제시된다 (hGJB2 GRE9는 밑줄표시함; 마우스 GJB2 코딩 서열은 볼드체로 표시함).In some embodiments, an AVV vector described herein is at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 93%, a nucleotide sequence having at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity. An exemplary nucleotide sequence for vector c.81.9 is shown in SEQ ID NO: 83 (hGJB2 GRE9 underlined; mouse GJB2 coding sequence in bold).
II. 재조합 아데노-연관 바이러스 (rAAV)II. Recombinant adeno-associated virus (rAAV)
일부 측면에서, 본 개시내용은 단리된 AAV를 제공한다. AAV와 관련하여 본원에 사용된 용어 "단리된"은 AAV가 인공적으로 생산, 조작 또는 수득된 것을 지칭한다. 단리된 AAV는 재조합 방법을 사용하여 생산될 수 있다. 이러한 AAV는 본원에서 "재조합 AAV"로 지칭된다. 재조합 AAV (rAAV)는 바람직하게는 조직-특이적 표적화 능력을 가지며, 따라서 rAAV의 트랜스진이 1개 이상의 미리 결정된 조직(들)에 특이적으로 전달될 것이다. AAV 캡시드는 이들 조직-특이적 표적화 능력을 결정하는 데 중요한 요소이다. 따라서, 표적화되는 조직에 적절한 캡시드를 갖는 rAAV가 선택될 수 있다.In some aspects, the present disclosure provides isolated AAV. The term “isolated,” as used herein with reference to AAV, refers to AAV produced, engineered, or obtained artificially. Isolated AAV can be produced using recombinant methods. Such AAVs are referred to herein as “recombinant AAVs”. Recombinant AAV (rAAV) preferably has tissue-specific targeting capabilities, such that the transgene of the rAAV will be specifically delivered to one or more pre-determined tissue(s). AAV capsids are an important factor in determining these tissue-specific targeting abilities. Thus, rAAVs with capsids appropriate for the tissue being targeted can be selected.
목적하는 캡시드 단백질을 갖는 재조합 AAV를 수득하는 방법은 관련 기술분야에 공지되어 있다 (예를 들어, 본원에 참조로 포함된 US 2003/0138772 참조). 전형적으로, 방법은 AAV 캡시드 단백질을 코딩하는 핵산 서열; 기능적 rep 유전자; AAV 역전된 말단 반복부 (ITR) 및 발현 카세트 (예를 들어, GJB2 발현 카세트)로 구성된 재조합 AAV 벡터; 및 재조합 AAV 벡터가 AAV 캡시드 내로 패키징되도록 아데노바이러스로부터의 E2b 및 E4 전사체를 발현하는 헬퍼 플라스미드를 함유하는 숙주 세포를 배양하는 것을 수반한다. 일부 실시양태에서, 캡시드 단백질은 AAV의 cap 유전자에 의해 코딩되는 구조 단백질이다. AAV는 3개의 캡시드 단백질, 비리온 단백질 1 내지 3 (VP1, VP2 및 VP3으로 명명됨)을 포함하며, 이들 모두는 대안적 스플라이싱을 통해 단일 cap 유전자로부터 전사된다. 일부 실시양태에서, VP1, VP2 및 VP3의 분자량은 각각 약 87 kDa, 약 72 kDa, 및 약 62 kDa이다. 일부 실시양태에서, 번역 시, 캡시드 단백질은 바이러스 게놈 주위에 구형 60량체 단백질 쉘을 형성한다. 일부 실시양태에서, 캡시드 단백질의 기능은 바이러스 게놈을 보호하고/거나, 게놈을 전달하고/거나, 숙주와 상호작용하는 것이다. 일부 측면에서, 캡시드 단백질은 바이러스 게놈을 조직 특이적 방식으로 숙주에게 (예를 들어, 내이 내의 세포에) 전달한다.Methods of obtaining recombinant AAV having the desired capsid protein are known in the art (see, eg, US 2003/0138772, incorporated herein by reference). Typically, the method comprises a nucleic acid sequence encoding an AAV capsid protein; functional rep gene; a recombinant AAV vector consisting of an AAV inverted terminal repeat (ITR) and an expression cassette (eg, the GJB2 expression cassette); and culturing a host cell containing a helper plasmid expressing the E2b and E4 transcripts from adenovirus such that the recombinant AAV vector is packaged into an AAV capsid. In some embodiments, the capsid protein is a structural protein encoded by the cap gene of AAV. AAV contains three capsid proteins,
본 개시내용은 부분적으로 특정 AAV 혈청형 캡시드가 트랜스진 (예를 들어, GJB2 유전자)을 귀 (예를 들어, 내이 내의 세포)에 전달할 수 있다는 발견에 기초한다. 일부 실시양태에서, AAV 캡시드 단백질은 AAV9.PHP.B, AAV9.PHP.eB, exoAAV, Anc80, AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAVrh8, AAV9, AAV10, AAVrh10, 및 AAV-S로 이루어진 군으로부터 선택된 AAV 혈청형의 것이다. AAV2.7m8은 와우 유모 세포 및 지지 세포 및 망막을 표적화하는 트랜스진을 전달할 수 있다. AAV2.7m8은 내이로의 우수한 형질도입을 나타낸다 (Isgrig et al., "AAV2.7m8 is a powerful viral vector for inner ear gene therapy," Nature Communications volume 10, Article number: 427 (2019)). 일부 실시양태에서, 캡시드 단백질은 AAV 혈청형 9 (AAV9)의 것이다. 일부 실시양태에서, AAV 캡시드 단백질은 AAV9로부터 유래된 혈청형 (예를 들어, AAV9 캡시드 변이체), 예를 들어 AAV9.PHP.B의 것이다. 일부 실시양태에서, AAV9 캡시드 변이체는 AAV9.PHP.B이다. 일부 실시양태에서, AAV9 캡시드 변이체는 AAV-S이다. AAV-S는 원래 중추 신경계 (CNS)를 표적화하기 위해 개발된 AAV9 캡시드 단백질 변이체이다 (문헌 [Hanlon et al., Selection of an Efficient AAV Vector for Robust CNS Transgene Expression, Molecular Therapy Method & Clinical Development, vol. 15,pp. 320-332, December 13, 2019], 및 PCT/US2020/025720, 이들은 본원에 참조로 포함됨). 놀랍게도, AAV-S는 외유모 세포 (OHC), 내유모 세포 (IHC), 지지 세포 (예를 들어, 경계 세포, 내부 지골 세포, 내부 기둥 세포, 외부 기둥 세포, 다이터 세포, 헨센 세포, 또는 클라우디우스 세포), 나선 신경절 뉴런, 나선판가장자리 세포 (예를 들어, 신경교 세포 또는 치간 세포), 외부 고랑 세포, 측벽, 혈관선조 (예를 들어, 기저 세포 및 중간 세포), 내부 고랑, 나선 인대 (예를 들어, 섬유세포), 또는 전정계의 세포를 포함하나 이에 제한되지는 않는, 내이 세포에 대해 우수한 형질도입 효율을 나타냈다 (예를 들어, 본원에 참조로 포함된 문헌 [Hanlon et al., AAV-S: A novel AAV vector selected in brain transduces the inner ear with high efficiency, Molecular Therapy Vol 18 No 4S1, April 28, 2020, Abstract 151] 참조). 일부 실시양태에서, AAV 캡시드는 AAV-S이다. AAV-S에 대한 예시적인 아미노산 서열은 서열식별번호: 33에 제시된다. 일부 실시양태에서, AAV 캡시드는 엑소AAV이다. 엑소AAV는 엑소솜-연관 AAV를 지칭한다. 엑소AAV 캡시드 단백질은 AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAVrh8, AAV9, AAV10, AAVrh10, 및 AAV.PHP.B로 이루어진 군으로부터 선택될 수 있다. 일부 예에서, 엑소AAV는 엑소AAV1 또는 엑소AAV9이다.The present disclosure is based in part on the discovery that certain AAV serotype capsids can deliver transgenes (eg, the GJB2 gene) to the ear (eg, cells within the inner ear). In some embodiments, the AAV capsid protein is AAV9.PHP.B, AAV9.PHP.eB, exoAAV, Anc80, AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAVrh8, AAV9, AAV10, AAVrh10, and of an AAV serotype selected from the group consisting of AAV-S. AAV2.7m8 can deliver transgenes targeting cochlear hair cells and supporting cells and the retina. AAV2.7m8 exhibits excellent transduction into the inner ear (Isgrig et al., "AAV2.7m8 is a powerful viral vector for inner ear gene therapy,"
AAV-S에 대한 예시적인 아미노산 서열은 서열식별번호: 33에 제시된다.An exemplary amino acid sequence for AAV-S is set forth in SEQ ID NO:33.
관련 기술분야의 통상의 기술자는 또한 캡시드 단백질의 기능적 등가 변이체 또는 상동체를 제공하도록 보존적 아미노산 치환이 이루어질 수 있음을 알 것이다. 일부 측면에서, 본 개시내용은 보존적 아미노산 치환을 생성하는 서열 변경을 포함한다. 본원에 사용된 보존적 아미노산 치환은 아미노산 치환이 이루어지는 단백질의 상대 전하 또는 크기 특징을 변경시키지 않는 아미노산 치환을 지칭한다. 변이체는 관련 기술분야의 통상의 기술자에게 공지된 폴리펩티드 서열을 변경시키는 방법에 따라 제조될 수 있으며, 예컨대 이러한 방법을 편찬한 참고문헌, 예를 들어, 문헌 [Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989, 또는 Current Protocols in Molecular Biology, F.M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York]에서 발견된다. 아미노산의 보존적 치환은 하기 군 내의 아미노산 중에서 이루어진 치환을 포함한다: (a) M, I, L, V; (b) F, Y, W; (c) K, R, H; (d) A, G; (e) S, T; (f) Q, N; 및 (g) E, D. 따라서, 본원에 기재된 단백질 및 폴리펩티드의 아미노산 서열 (예를 들어, GJB2 단백질 서열)에 보존적 아미노산 치환이 이루어질 수 있다.One skilled in the art will also appreciate that conservative amino acid substitutions may be made to provide functionally equivalent variants or homologues of capsid proteins. In some aspects, the disclosure includes sequence alterations that create conservative amino acid substitutions. Conservative amino acid substitutions, as used herein, refer to amino acid substitutions that do not alter the relative charge or size characteristics of the protein in which the amino acid substitution is made. Variants can be prepared according to methods for altering polypeptide sequences known to those skilled in the art, such as references that compile such methods, such as Molecular Cloning: A Laboratory Manual, J. Sambrook. , et al., eds., Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989, or Current Protocols in Molecular Biology, F.M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York]. Conservative substitutions of amino acids include substitutions made among amino acids within the following groups: (a) M, I, L, V; (b) F, Y, W; (c) K, R, H; (d) A, G; (e) S, T; (f) Q, N; and (g) E, D. Thus, conservative amino acid substitutions may be made to the amino acid sequences of the proteins and polypeptides described herein (eg, the GJB2 protein sequence).
일부 실시양태에서, rAAV는 단일 가닥 AAV (ssAAV)이다. 본원에 사용된 ssAAV는 별개의 가닥 상에 트랜스진 발현 카세트의 코딩 서열 및 상보적 서열을 갖고 별개의 바이러스 캡시드에 패키징된 rAAV를 지칭한다. 일부 실시양태에서, rAAV는 자기-상보적 AAV (scAAV)이다. 본원에 사용된 scAAV는 AAV 게놈의 단일 가닥 상에 존재하는 트랜스진 발현 카세트의 코딩 및 상보적 서열 둘 다를 갖는 rAAV를 지칭한다. scAAV의 코딩 영역은 분자내 이중-가닥 DNA 주형을 형성하도록 설계되었다. 감염 시, 제2 가닥의 세포 매개 합성을 기다리기 보다는, scAAV의 2개의 상보적 절반은 회합되어 즉시 복제 및 전사를 위해 준비된 1개의 이중 가닥 DNA (dsDNA) 단위를 형성할 것이다.In some embodiments, rAAV is single-stranded AAV (ssAAV). As used herein, ssAAV refers to rAAV that has the coding sequence of the transgene expression cassette and complementary sequences on separate strands and is packaged in separate viral capsids. In some embodiments, the rAAV is self-complementary AAV (scAAV). scAAV, as used herein, refers to rAAV that has both the coding and complementary sequences of a transgene expression cassette present on a single strand of the AAV genome. The coding region of scAAV is designed to form an intramolecular double-stranded DNA template. Upon infection, rather than waiting for cell-mediated synthesis of the second strand, the two complementary halves of scAAV will associate to form one double-stranded DNA (dsDNA) unit that is ready for immediate replication and transcription.
일부 실시양태에서, 본원에 제공된 바와 같은 rAAV는 트랜스진 (예를 들어, GJB2)을 포유동물에게 전달할 수 있다. 일부 예에서, 포유동물은 인간 또는 비-인간 포유동물, 예컨대 마우스, 래트 또는 비-인간 영장류 (예를 들어, 시노몰구스 원숭이), 고양이, 개, 돼지, 말, 당나귀, 낙타, 양 또는 염소일 수 있다. 특정 실시양태에서, 포유동물은 인간이다.In some embodiments, a rAAV as provided herein is capable of delivering a transgene (eg, GJB2) to a mammal. In some instances, the mammal is a human or non-human mammal, such as a mouse, rat or non-human primate (eg, cynomolgus monkey), cat, dog, pig, horse, donkey, camel, sheep or goat. can be In certain embodiments, the mammal is a human.
일부 실시양태에서, 본원에 제공된 바와 같은 rAAV는 트랜스진 (예를 들어, GJB2)을 귀에 전달할 수 있다. 일부 경우에, 본원에 제공된 바와 같은 rAAV는 트랜스진 (예를 들어, GJB2)을 내이 내의 세포 (예를 들어, 와우, 구형낭, 난형낭 및 반고리관)에 전달할 수 있다. 표적 세포의 비제한적 예는 외유모 세포 (OHC), 내유모 세포 (IHC), 나선상 신경절 뉴런, 혈관선조의 세포, 내부 고랑의 세포, 나선 인대의 세포, 전정계의 세포, 코르티 기관 지지 세포 (예를 들어, 내부 및 외부 고랑의 상피 세포, 및 치간 세포), 나선판가장자리에서의 치간 세포, 나선 인대에서의 뿌리 세포, 기둥 세포, 다이터 세포, 헨센 세포, 클라우디우스 세포, 내부 지골 세포; 및 경계 세포, 혈관조 중간 세포, 측벽 및 상혈관조 부위의 섬유세포, 혈관선조의 기저 세포, 나선 인대에서의 섬유세포, 나선판가장자리에서의 섬유세포, 전정계에 대면하는 미로골낭을 라이닝하는 중간엽 세포, 및 가장자리상부 암색 세포이다. 일부 실시양태에서, 내이에 대한 향성을 갖는 AAV 캡시드 (예를 들어, AAV-S 또는 AAV-PHP.B) 및 본원에 기재된 단리된 핵산 (예를 들어, GJB2 유전자 조절 요소의 제어 하에 GJB2 발현을 유도하는 단리된 핵산)의 조합은 GJB2 유전자 대체 요법에서 이를 정상적으로 발현하는 세포로 GJB2 발현을 제한하고, 혼재성 GJB2 발현과 연관된 독성 (예를 들어, 유모 세포 및/또는 중추 신경계 (CNS)에서 발현되는 GJB2와 연관된 독성)을 감소시킨다는 점에서 우수하다.In some embodiments, a rAAV as provided herein is capable of delivering a transgene (eg, GJB2) to the ear. In some cases, rAAVs as provided herein are capable of delivering a transgene (eg, GJB2) to cells within the inner ear (eg, cochlea, sac, ovoid, and semicircular canals). Non-limiting examples of target cells include outer hair cells (OHC), inner hair cells (IHC), spiral ganglion neurons, cells of the vascular progenitors, cells of the inner sulcus, cells of the spiral ligament, cells of the vestibular system, organ of Corti supporting cells ( eg, epithelial cells of the inner and outer grooves, and interdental cells), interdental cells at the edge of the spiral plate, root cells in the spiral ligament, column cells, diter cells, Hensen cells, Claudius cells, internal phalanx cells; And border cells, vascular intermediate cells, fibrous cells in the lateral wall and supravascular vascular region, basal cells in vascular progenitors, fibrous cells in spiral ligaments, fibrous cells at the edge of the spiral plate, lining the labyrinth bone sac facing the vestibular system. mesenchymal cells, and supermarginal dark cells. In some embodiments, an AAV capsid (e.g., AAV-S or AAV-PHP.B) that has tropism for the inner ear and an isolated nucleic acid described herein (e.g., that expresses GJB2 under the control of a GJB2 gene regulatory element Inducing GJB2 gene replacement therapy to limit GJB2 expression to cells that normally express it, and to limit the toxicities associated with confluent GJB2 expression (e.g., expression in hair cells and/or central nervous system (CNS)). It is excellent in that it reduces GJB2-associated toxicity).
AAV 캡시드에 rAAV 벡터를 패키징하기 위해 숙주 세포에서 배양될 성분은 숙주 세포에 트랜스로 제공될 수 있다. 대안적으로, 임의의 하나 이상의 필요한 성분 (예를 들어, 재조합 AAV 벡터, rep 서열, cap 서열, 및/또는 헬퍼 기능)은 관련 기술분야의 통상의 기술자에게 공지된 방법을 사용하여 1개 이상의 필요한 성분을 함유하도록 조작된 안정한 숙주 세포에 의해 제공될 수 있다. 가장 적합하게는, 이러한 안정한 숙주 세포는 유도성 프로모터의 제어 하에 필요한 성분(들)을 함유할 것이다. 그러나, 필요한 성분(들)은 구성적 프로모터의 제어 하에 있을 수 있다. 적합한 유도성 및 구성적 프로모터의 예는 트랜스진과 함께 사용하기에 적합한 조절 요소의 논의에서 본원에 제공된다. 또 다른 대안에서, 선택된 안정한 숙주 세포는 구성적 프로모터의 제어 하에 선택된 성분(들) 및 1개 이상의 유도성 프로모터의 제어 하에 다른 선택된 성분(들)을 함유할 수 있다. 예를 들어, 293 세포 (구성적 프로모터의 제어 하에 E1 헬퍼 기능을 함유함)로부터 유래되지만, 유도성 프로모터의 제어 하에 rep 및/또는 cap 단백질을 함유하는 안정한 숙주 세포가 생성될 수 있다. 또 다른 안정한 숙주 세포가 관련 기술분야의 통상의 기술자에 의해 생성될 수 있다.Components to be cultured in a host cell to package the rAAV vector into an AAV capsid may be provided to the host cell in trans. Alternatively, any one or more required components (e.g., recombinant AAV vectors, rep sequences, cap sequences, and/or helper functions) may be combined with one or more required components using methods known to those skilled in the art. It can be provided by stable host cells engineered to contain the components. Most suitably, such stable host cells will contain the necessary component(s) under the control of an inducible promoter. However, the necessary component(s) may be under the control of a constitutive promoter. Examples of suitable inducible and constitutive promoters are provided herein in the discussion of regulatory elements suitable for use with transgenes. In another alternative, the selected stable host cell may contain the selected component(s) under the control of a constitutive promoter and other selected component(s) under the control of one or more inducible promoters. For example, stable host cells can be generated that are derived from 293 cells (which contain E1 helper functions under the control of a constitutive promoter) but contain the rep and/or cap proteins under the control of an inducible promoter. Other stable host cells can be generated by those skilled in the art.
일부 실시양태에서, 본 개시내용은 단백질 (예를 들어, GJB2 단백질)을 코딩하는 코딩 서열을 포함하는 핵산을 함유하는 숙주 세포에 관한 것이다. 일부 실시양태에서, 숙주 세포는 포유동물 세포 (예를 들어, 인간 세포), 효모 세포, 박테리아 세포, 곤충 세포, 식물 세포 또는 진균 세포이다.In some embodiments, the present disclosure relates to a host cell containing a nucleic acid comprising a coding sequence encoding a protein (eg, a GJB2 protein). In some embodiments, the host cell is a mammalian cell (eg, a human cell), a yeast cell, a bacterial cell, an insect cell, a plant cell, or a fungal cell.
본 개시내용의 rAAV를 생산하는 데 필요한 재조합 AAV 벡터, rep 서열, cap 서열 및 헬퍼 기능은 임의의 적절한 유전 요소 (예를 들어, 벡터)를 사용하여 패키징 숙주 세포에 전달될 수 있다. 선택된 유전 요소는 본원에 기재되고 관련 기술분야에 공지된 것을 포함한 임의의 적합한 방법에 의해 전달될 수 있다. 본 개시내용의 임의의 실시양태를 구축하는 데 사용되는 방법은 핵산 조작의 통상의 기술자에게 공지되어 있고, 유전자 조작, 재조합 조작, 및 합성 기술을 포함한다. 예를 들어, 문헌 [Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, N.Y.]을 참조한다. 유사하게, rAAV 비리온을 생성하는 방법은 관련 기술분야에 공지되어 있고, 적합한 방법의 선택은 본 개시내용에 대한 제한이 아니다. 예를 들어, 문헌 [K. Fisher et al., J. Virol., 70:520-532 (1993)] 및 미국 특허 번호 5,478,745를 참조하고, 이들 각각은 본원에 참조로 포함된다.Recombinant AAV vectors, rep sequences, cap sequences, and helper functions necessary to produce the rAAV of the present disclosure can be delivered to packaging host cells using any suitable genetic elements (eg, vectors). The selected genetic element may be delivered by any suitable method, including those described herein and known in the art. Methods used to construct any embodiment of the present disclosure are known to those skilled in the art of nucleic acid engineering, and include genetic engineering, recombinant engineering, and synthetic techniques. See, eg, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, N.Y. Similarly, methods for generating rAAV virions are known in the art, and selection of a suitable method is not a limitation of the present disclosure. See, for example, K. Fisher et al., J. Virol., 70:520-532 (1993)] and US Patent No. 5,478,745, each of which is incorporated herein by reference.
일부 실시양태에서, 삼중 형질감염 방법 (본원에 참조로 포함된 미국 특허 번호 6,001,650에 상세하게 기재됨)을 사용하여 재조합 AAV가 생산될 수 있다. 전형적으로, 재조합 AAV는 숙주 세포를 AAV 입자 내로 패키징될 재조합 AAV 벡터 (트랜스진을 포함함), AAV 헬퍼 기능 벡터, 및 보조 기능 벡터로 형질감염시킴으로써 생산된다. AAV 헬퍼 기능 벡터는 생산적 AAV 복제 및 캡시드화를 위해 트랜스로 기능하는 "AAV 헬퍼 기능" 서열 (예를 들어, rep 및 cap)을 코딩한다. 바람직하게는, AAV 헬퍼 기능 벡터는 임의의 검출가능한 야생형 AAV 비리온 (예를 들어, 기능적 rep 및 cap 유전자를 함유하는 AAV 비리온)을 생성하지 않으면서 효율적인 AAV 벡터 생산을 뒷받침한다. 본 개시내용에서 사용하기에 적합한 벡터의 비제한적 예는 미국 특허 번호 6,001,650에 기재된 pHLP19 및 미국 특허 번호 6,156,303에 기재된 pRep6cap6 벡터를 포함하며, 이들 둘 다는 본원에 참조로 포함된다. 보조 기능 벡터는 AAV가 복제에 의존하는 비-AAV 유래 바이러스 및/또는 세포 기능 (즉, "보조 기능")을 위한 뉴클레오티드 서열을 코딩한다. 보조 기능은 AAV 유전자 전사의 활성화, 단계 특이적 AAV mRNA 스플라이싱, AAV DNA 복제, cap 발현 산물의 합성, 및 AAV 캡시드 어셈블리에 수반되는 모이어티를 포함하나 이에 제한되지는 않는, AAV 복제에 요구되는 기능을 포함한다. 바이러스-기반 보조 기능은 임의의 공지된 헬퍼 바이러스, 예컨대 아데노바이러스, 헤르페스바이러스 (단순 헤르페스 바이러스 유형-1 이외의 것), 및 백시니아 바이러스로부터 유래될 수 있다.In some embodiments, recombinant AAV may be produced using a triple transfection method (described in detail in US Pat. No. 6,001,650, incorporated herein by reference). Typically, recombinant AAV is produced by transfecting a host cell with a recombinant AAV vector (including transgene) to be packaged into AAV particles, an AAV helper function vector, and an accessory function vector. AAV helper function vectors encode "AAV helper function" sequences (eg, rep and cap) that function in trans for productive AAV replication and encapsidation. Preferably, AAV helper functional vectors support efficient AAV vector production without producing any detectable wild-type AAV virions (eg, AAV virions containing functional rep and cap genes). Non-limiting examples of vectors suitable for use in the present disclosure include the pHLP19 described in U.S. Patent No. 6,001,650 and the pRep6cap6 vector described in U.S. Patent No. 6,156,303, both of which are incorporated herein by reference. An accessory function vector encodes nucleotide sequences for non-AAV derived viral and/or cellular functions (ie, “auxiliary functions”) on which AAV depends for replication. Auxiliary functions are required for AAV replication, including but not limited to, activation of AAV gene transcription, step-specific AAV mRNA splicing, AAV DNA replication, synthesis of cap expression products, and moieties involved in AAV capsid assembly includes the function to be Virus-based auxiliary functions may be derived from any known helper virus, such as adenovirus, herpesvirus (other than herpes simplex virus type-1), and vaccinia virus.
일부 측면에서, 본 개시내용은 형질감염된 숙주 세포를 제공한다. 용어 "형질감염"은 세포에 의한 외래 DNA의 흡수를 지칭하는 데 사용되고, 세포는 외인성 DNA가 세포 막 내부에 도입된 경우에 "형질감염"된 것이다. 다수의 형질감염 기술이 일반적으로 관련 기술분야에 공지되어 있다. 예를 들어, 문헌 [Graham et al. (1973) Virology, 52:456, Sambrook et al. (1989) Molecular Cloning, a Laboratory Manual, Cold Spring Harbor Laboratories, New York, Davis et al. (1986) Basic Methods in Molecular Biology, Elsevier, and Chu et al. (1981) Gene 13:197]을 참조한다. 이러한 기술을 사용하여 1개 이상의 외인성 핵산을 적합한 숙주 세포 내로 도입할 수 있다.In some aspects, the present disclosure provides transfected host cells. The term "transfection" is used to refer to uptake of foreign DNA by a cell, and a cell is "transfected" when the exogenous DNA is introduced inside the cell membrane. A number of transfection techniques are generally known in the art. See, eg, Graham et al. (1973) Virology, 52:456, Sambrook et al. (1989) Molecular Cloning, a Laboratory Manual, Cold Spring Harbor Laboratories, New York, Davis et al. (1986) Basic Methods in Molecular Biology, Elsevier, and Chu et al. (1981) Gene 13:197. These techniques can be used to introduce one or more exogenous nucleic acids into a suitable host cell.
"숙주 세포"는 관심 물질을 보유하거나 보유할 수 있는 임의의 세포를 지칭한다. 종종 숙주 세포는 포유동물 세포이다. 숙주 세포는 AAV 헬퍼 구축물, AAV 플라스미드, 보조 기능 벡터, 또는 재조합 AAV의 생산과 연관된 다른 전달 DNA의 수용자로서 사용될 수 있다. 상기 용어는 형질감염된 원래 세포의 자손을 포함한다. 따라서, 본원에 사용된 "숙주 세포"는 외인성 DNA 서열로 형질감염된 세포를 지칭할 수 있다. 단일 모 세포의 자손은 자연적, 우발적, 또는 고의적 돌연변이 또는 조작으로 인해, 원래 모체와 형태에 있어서 또는 게놈 또는 전체 DNA 상보체에 있어서 반드시 완전히 동일하지는 않을 수 있는 것으로 이해된다."Host cell" refers to any cell that carries or is capable of carrying a substance of interest. Often the host cell is a mammalian cell. The host cell can be used as a recipient of AAV helper constructs, AAV plasmids, helper function vectors, or other transfer DNA involved in the production of recombinant AAV. The term includes the progeny of the original transfected cell. Thus, a “host cell” as used herein may refer to a cell that has been transfected with an exogenous DNA sequence. It is understood that the progeny of a single parental cell may not necessarily be completely identical in morphology or in genome or total DNA complement to the original parent, due to natural, accidental, or deliberate mutation or manipulation.
본원에 사용된 용어 "세포주"는 시험관내에서 연속적 또는 연장된 성장 및 분열이 가능한 세포의 집단을 지칭한다. 종종, 세포주는 단일 전구 세포로부터 유래된 클론 집단이다. 이러한 클론 집단의 저장 또는 전달 동안 핵형에서 자발적이거나 유도된 변화가 발생할 수 있다는 것이 관련 기술분야에 추가로 공지되어 있다. 따라서, 언급된 세포주로부터 유래된 세포는 조상 세포 또는 배양물과 정확하게 동일하지 않을 수 있고, 언급된 세포주는 이러한 변이체를 포함한다.As used herein, the term “cell line” refers to a population of cells capable of continuous or prolonged growth and division in vitro. Often, a cell line is a clonal population derived from a single progenitor cell. It is further known in the art that spontaneous or induced changes in karyotype can occur during storage or transfer of such clonal populations. Thus, cells derived from a referenced cell line may not be exactly identical to the progenitor cell or culture, and the referenced cell line includes such variants.
본원에 사용된 용어 "재조합 세포"는 외인성 DNA 절편, 예컨대 생물학적 활성 폴리펩티드 (예를 들어, GJB2 단백질)의 전사를 유도하는 DNA 절편이 도입된 세포를 지칭한다.As used herein, the term "recombinant cell" refers to a cell into which an exogenous DNA segment has been introduced, such as a DNA segment that directs the transcription of a biologically active polypeptide (eg, GJB2 protein).
본원에 사용된 용어 "벡터"는 적절한 제어 요소와 회합되는 경우에 복제될 수 있고 세포 사이에 유전자 서열을 전달할 수 있는 임의의 유전 요소, 예컨대 플라스미드, 파지, 트랜스포손, 코스미드, 염색체, 인공 염색체, 바이러스, 비리온 등을 포함한다. 따라서, 상기 용어는 클로닝 및 발현 비히클, 뿐만 아니라 바이러스 벡터를 포함한다. 일부 실시양태에서, 유용한 벡터는 전사될 핵산 절편이 프로모터의 전사 제어 하에 위치하는 벡터인 것으로 고려된다. 용어 "발현 벡터 또는 구축물"은 핵산 코딩 서열의 일부 또는 전부가 전사될 수 있는 핵산을 함유하는 임의의 유형의 유전적 구축물을 의미한다.As used herein, the term “vector” refers to any genetic element, such as a plasmid, phage, transposon, cosmid, chromosome, artificial chromosome, capable of replicating and transferring genetic sequences between cells when associated with appropriate control elements. , viruses, virions, etc. Thus, the term includes cloning and expression vehicles as well as viral vectors. In some embodiments, useful vectors are considered to be those in which the nucleic acid segments to be transcribed are placed under the transcriptional control of a promoter. The term “expression vector or construct” refers to any type of genetic construct containing a nucleic acid from which part or all of a nucleic acid coding sequence can be transcribed.
본 개시내용의 rAAV를 생산하기 위해 목적하는 AAV 캡시드에 재조합 벡터를 패키징하는 상기 방법은 제한적인 것으로 의도되지 않고, 다른 적합한 방법은 관련 기술분야의 통상의 기술자에게 명백할 것이다.The above method of packaging a recombinant vector into a desired AAV capsid to produce the rAAV of the present disclosure is not intended to be limiting, and other suitable methods will be apparent to those skilled in the art.
본 개시내용은 트랜스진 (예를 들어, GJB2)을 발현하기 위한 벡터 (예를 들어, AAV 벡터)를 포함하는 rAAV를 제공하며, 이러한 벡터는 AAV LTR (예를 들어, AAV2 LTR) 및 프로모터 (예를 들어, 인간 GJB2 프로모터 또는 그의 단편)에 작동가능하게 연결된 프로모터를 포함하는 발현 카세트를 포함한다. 또한, 벡터는 특정 조절 요소 (예를 들어, GJB2 인핸서, GJB2 유전자의 5' 및 3' UTR, WPRE, 및 폴리아데닐화 부위)를 추가로 포함할 수 있다. 또한, rAAV는 캡시드 단백질 (예를 들어, AAV9.PHP.B 캡시드 또는 AAV-S 캡시드)을 포함할 수 있다. 이러한 rAAV는 트랜스진 (예를 들어, GJB2)을 표적 조직 (예를 들어, 내이에서 GJB2를 정상적으로 발현하는 세포)에 전달할 수 있다. 일부 실시양태에서, 이러한 rAAV는 트랜스진 (예를 들어, GJB2)을 표적 조직 내의 특이적 세포, 예를 들어 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포 내로 전달할 수 있다.The present disclosure provides a rAAV comprising a vector (eg, an AAV vector) for expressing a transgene (eg, GJB2), such a vector comprising an AAV LTR (eg, an AAV2 LTR) and a promoter ( eg, an expression cassette comprising a promoter operably linked to the human GJB2 promoter or a fragment thereof). In addition, the vector may further include specific regulatory elements (eg, GJB2 enhancer, 5' and 3' UTRs of the GJB2 gene, WPRE, and polyadenylation sites). In addition, rAAV can include capsid proteins (eg, AAV9.PHP.B capsid or AAV-S capsid). Such rAAVs can deliver a transgene (eg, GJB2) to a target tissue (eg, cells that normally express GJB2 in the inner ear). In some embodiments, such rAAVs are capable of delivering a transgene (eg, GJB2) into specific cells within a target tissue, such as connective tissue cells of the cochlea and supporting cells of the organ of Corti and nearby regions.
III. 제약 조성물III. pharmaceutical composition
rAAV는 관련 기술분야에 공지된 임의의 적절한 방법에 따라 조성물로 대상체에게 전달될 수 있다. 바람직하게는 생리학상 상용성인 담체 중에 (즉, 조성물 중에) 현탁된 rAAV는 대상체, 예를 들어 숙주 동물, 환자, 실험 동물에게 투여될 수 있다. 일부 실시양태에서, 대상체는 포유동물이다. 일부 예에서, 포유동물은 인간이다. 다른 실시양태에서, 포유동물은 비-인간 포유동물, 예컨대 인간, 마우스, 래트, 고양이, 개, 양, 토끼, 말, 소, 염소, 돼지, 기니 피그, 햄스터, 닭, 칠면조, 또는 비-인간 영장류 (예를 들어, 시노몰구스 원숭이)일 수 있다. 대상체는 임의의 발달 단계 및 임의의 성별의 것일 수 있다.rAAV can be delivered to a subject in a composition according to any suitable method known in the art. The rAAV, preferably suspended in a physiologically compatible carrier (i.e., in a composition), can be administered to a subject, eg, a host animal, patient, laboratory animal. In some embodiments, the subject is a mammal. In some instances, the mammal is a human. In other embodiments, the mammal is a non-human mammal, such as a human, mouse, rat, cat, dog, sheep, rabbit, horse, cow, goat, pig, guinea pig, hamster, chicken, turkey, or non-human primates (eg, cynomolgus monkeys). A subject can be of any developmental stage and of any gender.
rAAV는 임의의 관심 기관 또는 조직에 전달될 수 있다. 일부 실시양태에서, rAAV는 내이에 전달된다. 포유동물 대상체로의 rAAV의 전달은, 예를 들어 귀로의 주사에 의한 것일 수 있다. 일부 실시양태에서, 주사는 내이의 정원창 막을 통해 귀에, 와우의 중간계 내로, 와우의 전정계 내로, 내이의 반고리관 내로, 또는 내이의 구형낭 또는 난형낭 내로 이루어진다. 일부 실시양태에서, rAAV는 국소 투여 (예를 들어, 점이제)에 의해 귀에 전달된다. 일부 실시양태에서, 주사는 국소 투여가 아니다. 투여 방법의 조합 (예를 들어, 내이의 정원창 막을 통한 국소 투여 및 주사)이 또한 사용될 수 있다.rAAV can be delivered to any organ or tissue of interest. In some embodiments, rAAV is delivered to the inner ear. Delivery of rAAV to a mammalian subject can be, for example, by injection into the ear. In some embodiments, the injection is through the round window membrane of the inner ear into the ear, into the middle system of the cochlea, into the vestibular system of the cochlea, into the semicircular canals of the inner ear, or into the sac or ovoid sac of the inner ear. In some embodiments, rAAV is delivered to the ear by topical administration (eg, ear drops). In some embodiments, injection is not topical administration. A combination of methods of administration (eg, topical administration and injection through the round window membrane of the inner ear) may also be used.
본 개시내용의 조성물은 본원에 기재된 rAAV를 단독으로, 또는 1종 이상의 다른 바이러스 (예를 들어, 1개 이상의 상이한 트랜스진을 코딩하는 제2 rAAV)와 조합하여 포함할 수 있다. 일부 실시양태에서, 조성물은 각각 1개 이상의 상이한 트랜스진을 갖는 1, 2, 3, 4, 5, 6, 7, 8, 9, 10종 또는 그 초과의 상이한 rAAV를 포함한다.A composition of the present disclosure may comprise a rAAV described herein alone or in combination with one or more other viruses (eg, a second rAAV encoding one or more different transgenes). In some embodiments, the composition comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more different rAAVs, each having at least one different transgene.
일부 실시양태에서, 조성물은 제약상 허용되는 담체를 추가로 포함한다. 적합한 담체는 rAAV가 지시되는 적응증의 관점에서 관련 기술분야의 통상의 기술자에 의해 용이하게 선택될 수 있다. "허용되는"은 담체가 조성물의 rAAV 또는 단리된 핵산과 상용성이어야 하고 (바람직하게는, 활성 성분을 안정화시킬 수 있어야 하고), 치료될 대상체에게 유해하지 않아야 함을 의미한다. 일부 실시양태에서, 제약상 허용되는 담체/부형제는 투여 방식과 상용성이다. 완충제를 포함한 제약상 허용되는 부형제 (담체)는 관련 기술분야에 널리 공지되어 있다. 예를 들어, 문헌 [Remington: The Science and Practice of Pharmacy 20th Ed. (2000) Lippincott Williams and Wilkins, Ed. K. E. Hoover]을 참조한다. 예를 들어, 하나의 허용되는 담체는 염수를 포함하며, 이는 다양한 완충 용액과 함께 제제화될 수 있다 (예를 들어, 포스페이트 완충 염수). 다른 예시적인 담체는 멸균 염수, 락토스, 수크로스, 인산칼슘, 젤라틴, 덱스트란, 한천, 펙틴, 땅콩 오일, 참깨 오일 및 물을 포함한다. 담체의 선택은 본 개시내용의 제한이 아니다.In some embodiments, the composition further comprises a pharmaceutically acceptable carrier. Suitable carriers can be readily selected by those skilled in the art in view of the indication for which rAAV is indicated. "Acceptable" means that the carrier must be compatible with the rAAV or isolated nucleic acid of the composition (preferably capable of stabilizing the active ingredient) and must not be detrimental to the subject being treated. In some embodiments, the pharmaceutically acceptable carrier/excipient is compatible with the mode of administration. Pharmaceutically acceptable excipients (carriers), including buffers, are well known in the art. See, eg, Remington: The Science and Practice of Pharmacy 20th Ed. (2000) Lippincott Williams and Wilkins, Ed. K. E. Hoover]. For example, one acceptable carrier includes saline, which may be formulated with various buffering solutions (eg, phosphate buffered saline). Other exemplary carriers include sterile saline, lactose, sucrose, calcium phosphate, gelatin, dextran, agar, pectin, peanut oil, sesame oil and water. The choice of carrier is not a limitation of the present disclosure.
본원에 개시된 rAAV 함유 제약 조성물은 적합한 완충제를 추가로 포함할 수 있다. 완충제는 또 다른 산 또는 염기의 첨가 후에 용액의 pH를 선택된 값에 가깝게 유지하는 데 사용되는 약산 또는 약염기이다. 일부 예에서, 본원에 개시된 완충제는 이산화탄소 농도 (예를 들어, 세포 호흡에 의해 생성됨)의 변화에도 불구하고 생리학적 pH를 유지할 수 있는 완충제일 수 있다. 예시적인 완충제는 HEPES (4-(2-히드록시에틸)-1-피페라진에탄술폰산) 완충제, 둘베코 포스페이트-완충 염수 (DPBS) 완충제 또는 포스페이트-완충 염수 (PBS) 완충제를 포함하나 이에 제한되지는 않는다. 이러한 완충제는 인산수소이나트륨 및 염화나트륨, 또는 인산이수소칼륨 및 염화칼륨을 포함할 수 있다.The rAAV-containing pharmaceutical compositions disclosed herein may further include suitable buffering agents. A buffer is a weak acid or base used to maintain the pH of a solution close to a selected value after the addition of another acid or base. In some examples, a buffer disclosed herein may be a buffer capable of maintaining physiological pH despite changes in carbon dioxide concentration (eg, produced by cellular respiration). Exemplary buffers include, but are not limited to, HEPES (4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid) buffer, Dulbecco's phosphate-buffered saline (DPBS) buffer, or phosphate-buffered saline (PBS) buffer. does not Such buffering agents may include disodium hydrogen phosphate and sodium chloride, or potassium dihydrogen phosphate and potassium chloride.
임의로, 본 개시내용의 조성물은 rAAV 및 담체(들) 이외에도 다른 제약 성분, 예컨대 보존제 또는 화학적 안정화제를 함유할 수 있다. 적합한 예시적인 보존제는 클로로부탄올, 소르브산칼륨, 소르브산, 이산화황, 프로필 갈레이트, 파라벤, 에틸 바닐린, 글리세린, 페놀, 및 파라클로로페놀을 포함한다. 적합한 화학적 안정화제는 젤라틴 및 알부민을 포함한다.Optionally, the compositions of the present disclosure may contain other pharmaceutical ingredients, such as preservatives or chemical stabilizers, in addition to the rAAV and carrier(s). Suitable exemplary preservatives include chlorobutanol, potassium sorbate, sorbic acid, sulfur dioxide, propyl gallate, parabens, ethyl vanillin, glycerin, phenol, and parachlorophenol. Suitable chemical stabilizers include gelatin and albumin.
본원에 기재된 rAAV 함유 제약 조성물은 1종 이상의 적합한 표면-활성제, 예컨대 계면활성제를 포함한다. 계면활성제는 두 액체 사이, 기체와 액체 사이, 또는 액체와 고체 사이의 표면 장력 (또는 계면 장력)을 낮추는 화합물이다. 계면활성제는 세제, 습윤제, 유화제, 발포제 및 분산제로서 작용할 수 있다. 적합한 계면활성제는 특히 비-이온성 작용제, 예컨대 폴리옥시에틸렌소르비탄 (예를 들어, 트윈(Tween)™ 20, 40, 60, 80 또는 85) 및 다른 소르비탄 (예를 들어, 스팬(Span)™ 20, 40, 60, 80 또는 85)을 포함한다. 표면 활성제를 갖는 조성물은 편리하게는 0.05 내지 5%의 표면-활성제를 포함할 것이고, 0.1 내지 2.5%일 수 있다. 필요한 경우에, 다른 성분, 예를 들어 만니톨 또는 다른 제약상 허용되는 비히클이 첨가될 수 있다는 것이 이해될 것이다.The rAAV-containing pharmaceutical compositions described herein include one or more suitable surface-active agents, such as surfactants. A surfactant is a compound that lowers the surface tension (or interfacial tension) between two liquids, between a gas and a liquid, or between a liquid and a solid. Surfactants can act as detergents, wetting agents, emulsifying agents, foaming agents and dispersing agents. Suitable surfactants are in particular non-ionic agents such as polyoxyethylenesorbitan (eg
rAAV는 목적하는 조직의 세포 (예를 들어, 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포)를 형질감염시키고 과도한 유해 효과 없이 충분한 수준의 유전자 전달 및 발현을 제공하기에 충분한 양으로 투여된다. 제약상 허용되는 투여 경로의 예는 선택된 기관 (예를 들어, 귀) 또는 조직으로의 직접 전달, 정맥내, 근육내, 피하, 피내, 종양내, 및 다른 비경구 투여 경로를 포함하나 이에 제한되지는 않는다. 투여 경로는 원하는 경우에 조합될 수 있다.rAAV is administered in an amount sufficient to transfect cells of the tissue of interest (e.g., connective tissue cells of the cochlea and support cells of the organ of Corti and nearby regions) and provide sufficient levels of gene transfer and expression without undue deleterious effects. do. Examples of pharmaceutically acceptable routes of administration include, but are not limited to, direct delivery to a selected organ (eg, ear) or tissue, intravenous, intramuscular, subcutaneous, intradermal, intratumoral, and other parenteral routes of administration. does not Routes of administration can be combined if desired.
특정한 "치료 효과"를 달성하는 데 요구되는 rAAV 비리온의 용량, 예를 들어 체중 킬로그램당 바이러스 게놈 카피 (GC/kg 또는 VG/kg)의 용량의 단위는 rAAV 비리온 투여의 경로, 치료 효과를 달성하는 데 요구되는 유전자 또는 RNA 발현의 수준, 치료될 특정 질환 또는 장애, 및 유전자 또는 rAAV 생성물의 안정성을 포함하나 이에 제한되지는 않는 여러 인자에 기초하여 달라질 것이다. 관련 기술분야의 통상의 기술자는 상기 언급된 인자, 뿐만 아니라 다른 인자에 기초하여 특정한 질환 또는 장애 (예를 들어, 비증후군성 청각 상실 및 난청, 또는 임의의 GJB2-연관 장애)를 갖는 환자를 치료하기 위한 rAAV 비리온 용량 범위를 용이하게 결정할 수 있다.The dose of rAAV virion required to achieve a particular "therapeutic effect", e.g., the dose of viral genome copies per kilogram of body weight (GC/kg or VG/kg), determines the route of rAAV virion administration, the therapeutic effect. It will vary based on several factors including, but not limited to, the level of gene or RNA expression required to achieve, the particular disease or disorder being treated, and the stability of the gene or rAAV product. One skilled in the art would treat a patient with a particular disease or disorder (e.g., non-syndromic deafness and hearing loss, or any GJB2-associated disorder) based on the factors mentioned above, as well as other factors. rAAV virion dose ranges for
rAAV의 유효량은 동물 (예를 들어, 마우스, 래트, 비-인간 영장류 또는 인간)을 감염시키거나 또는 목적하는 조직 또는 세포 (예를 들어, 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포)를 표적화하기에 충분한 양이다. 유효량은 주로 대상체의 종, 연령, 체중, 건강, 및 표적화될 조직과 같은 인자에 따라 좌우될 것이며, 따라서 동물 및 조직 사이에서 달라질 수 있다. 예를 들어, rAAV의 유효량은 일반적으로 약 109 내지 1016 게놈 카피를 함유하는 용액 약 1 ml 내지 약 100 ml의 범위이다. 일부 경우에, 약 1011 내지 1013 rAAV 게놈 카피의 투여량이 적절하다. 특정 실시양태에서, 109 rAAV 게놈 카피는 내이 조직 (예를 들어, 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포)을 표적화하는 데 효과적이다. 일부 실시양태에서, 109 rAAV 게놈 카피보다 더 농축된 용량은 대상체의 귀에 투여될 때 독성이다. 일부 실시양태에서, 유효량은 다중 용량의 rAAV에 의해 생성된다.An effective amount of rAAV can infect an animal (e.g., mouse, rat, non-human primate, or human) or target tissue or cells (e.g., connective tissue cells of the cochlea and supporting cells of the organ of Corti and nearby regions). ) in an amount sufficient to target. The effective amount will depend primarily on factors such as the species, age, weight, health, and tissue to be targeted of the subject, and may therefore vary between animals and tissues. For example, an effective amount of rAAV generally ranges from about 1 ml to about 100 ml of a solution containing about 10 9 to 10 16 genome copies. In some cases, a dosage of about 10 11 to 10 13 rAAV genome copies is appropriate. In certain embodiments, 10 9 rAAV genome copies are effective in targeting inner ear tissues (eg, connective tissue cells of the cochlea and supporting cells of the organ of Corti and nearby regions). In some embodiments, doses more concentrated than 10 9 rAAV genome copies are toxic when administered to the ear of a subject. In some embodiments, the effective amount is produced by multiple doses of rAAV.
일부 실시양태에서, rAAV의 용량은 대상체에게 1일에 1회 이하 (예를 들어, 24-시간 기간)로 투여된다. 일부 실시양태에서, rAAV의 용량은 대상체에게 2, 3, 4, 5, 6 또는 7일에 1회 이하로 투여된다. 일부 실시양태에서, rAAV의 용량은 대상체에게 1주에 1회 이하 (예를 들어, 7 역일)로 투여된다. 일부 실시양태에서, rAAV의 용량은 대상체에게 격주 이하 (예를 들어, 2-주 기간에 1회)로 투여된다. 일부 실시양태에서, rAAV의 용량은 대상체에게 1개월에 1회 이하 (예를 들어, 30 역일에 1회)로 투여된다. 일부 실시양태에서, rAAV의 용량은 대상체에게 6개월 당 1회 이하로 투여된다. 일부 실시양태에서, rAAV의 용량은 대상체에게 1년에 1회 이하 (예를 들어, 365일 또는 윤년에는 366일)로 투여된다. 일부 실시양태에서, rAAV의 용량은 대상체에게 일생에 1회 투여된다.In some embodiments, the dose of rAAV is administered to the subject no more than once per day (eg, over a 24-hour period). In some embodiments, the dose of rAAV is administered to the subject no more than once every 2, 3, 4, 5, 6, or 7 days. In some embodiments, the dose of rAAV is administered to the subject no more than once per week (eg, 7 calendar days). In some embodiments, the dose of rAAV is administered to the subject every other week or less (eg, once in a 2-week period). In some embodiments, the dose of rAAV is administered to the subject no more than once per month (eg, once every 30 calendar days). In some embodiments, the dose of rAAV is administered to the subject no more than once every 6 months. In some embodiments, the dose of rAAV is administered to the subject no more than once per year (eg, 365 days or 366 days in a leap year). In some embodiments, the dose of rAAV is administered to the subject once per lifetime.
일부 실시양태에서, rAAV 조성물은, 특히 높은 rAAV 농도가 존재하는 경우에 (예를 들어, ~1013 GC/ml 또는 그 초과), 조성물 중 AAV 입자의 응집을 감소시키도록 제제화된다. 예를 들어 계면활성제의 첨가, pH 조정, 염 농도 조정 등을 포함하여, 응집을 감소시키기 위한 적절한 방법이 사용될 수 있다 (예를 들어, 그의 내용이 본원에 참조로 포함되는 문헌 [Wright et al., Molecular Therapy (2005) 12, 171-178] 참조).In some embodiments, the rAAV composition is formulated to reduce aggregation of AAV particles in the composition, particularly when high rAAV concentrations are present (eg, -10 13 GC/ml or greater). Appropriate methods for reducing aggregation may be used, including, for example, addition of surfactants, adjustment of pH, adjustment of salt concentration, etc. (see, for example, Wright et al. , Molecular Therapy (2005) 12, 171-178).
제약상 허용되는 부형제 및 담체 용액의 제제화는, 다양한 치료 요법에서 본원에 기재된 특정한 조성물을 사용하기 위한 적합한 투여 및 치료 요법의 개발과 마찬가지로, 관련 기술분야의 통상의 기술자에게 널리 공지되어 있다. 용해도, 생체이용률, 생물학적 반감기, 투여 경로, 제품 보관 수명, 뿐만 아니라 다른 약리학적 고려사항과 같은 인자가 이러한 제약 제제를 제조하는 관련 기술분야의 통상의 기술자에 의해 고려될 것이고, 따라서 다양한 투여량 및 치료 요법이 바람직할 수 있다.The formulation of pharmaceutically acceptable excipient and carrier solutions is well known to those skilled in the art, as is the development of suitable administration and treatment regimens for use of the particular compositions described herein in a variety of treatment regimens. Factors such as solubility, bioavailability, biological half-life, route of administration, product shelf life, as well as other pharmacological considerations will be taken into account by those skilled in the art of preparing such pharmaceutical formulations, and thus various dosages and A treatment regimen may be desirable.
일부 실시양태에서, 본원에 개시된 적합하게 제제화된 제약 조성물 중의 rAAV는 표적 조직에 직접, 예를 들어 내이 조직 (예를 들어, 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포)에 직접 전달된다. 그러나, 특정 상황에서, rAAV-기반 치료 구축물을 또 다른 경로를 통해, 예를 들어 피하로, 비경구로, 정맥내로, 근육내로, 척수강내로, 경구로 또는 복강내로 개별적으로 또는 추가로 전달하는 것이 바람직할 수 있다. 일부 실시양태에서, 미국 특허 번호 5,543,158; 5,641,515 및 5,399,363 (각각 그 전문이 본원에 구체적으로 참조로 포함됨)에 기재된 바와 같은 투여 양식을 사용하여 rAAV를 전달할 수 있다.In some embodiments, the rAAV in a suitably formulated pharmaceutical composition disclosed herein is delivered directly to a target tissue, e.g., to inner ear tissue (e.g., connective tissue cells of the cochlea and supporting cells of the organ of Corti and nearby regions) do. However, in certain circumstances, it is desirable to individually or additionally deliver the rAAV-based therapeutic construct via another route, for example subcutaneously, parenterally, intravenously, intramuscularly, intrathecally, orally or intraperitoneally. can do. In some embodiments, U.S. Patent Nos. 5,543,158; 5,641,515 and 5,399,363, each specifically incorporated herein by reference in its entirety, may be used to deliver rAAV.
주사가능한 용도에 적합한 제약 형태는 멸균 수용액 또는 분산액, 및 멸균 주사가능한 용액 또는 분산액의 즉석 제조를 위한 멸균 분말을 포함한다. 분산액은 또한 글리세롤, 액체 폴리에틸렌 글리콜, 및 그의 혼합물 중에서 및 오일 중에서 제조될 수 있다. 통상적인 저장 및 사용 조건 하에, 이들 제제는 미생물의 성장을 방지하기 위해 보존제를 함유한다. 많은 경우에, 형태는 멸균성이다. 이는 제조 및 저장 조건 하에 안정해야 하고, 미생물, 예컨대 박테리아, 진균 및 다른 바이러스에 의한 오염을 방지하기 위해 보존되어야 한다. 담체는, 예를 들어 물, 에탄올, 폴리올 (예를 들어, 글리세롤, 프로필렌 글리콜, 및 액체 폴리에틸렌 글리콜 등), 그의 적합한 혼합물, 및/또는 식물성 오일을 함유하는 용매 또는 분산 매질일 수 있다. 적합한 유동성은, 예를 들어 코팅, 예컨대 레시틴의 사용에 의해, 분산액의 경우에 요구되는 입자 크기의 유지에 의해 및 계면활성제의 사용에 의해 유지될 수 있다. 미생물에 의한 오염의 방지는 다양한 항박테리아제 및 항진균제, 예를 들어 파라벤, 클로로부탄올, 페놀, 소르브산, 티메로살 등에 의해 달성될 수 있다. 많은 경우에, 등장화제, 예를 들어 당 또는 염 (예를 들어, 염화나트륨)을 포함하는 것이 바람직할 것이다. 주사가능한 조성물의 연장된 흡수는 흡수를 지연시키는 작용제, 예를 들어 알루미늄 모노스테아레이트 및 젤라틴을 조성물에 사용함으로써 달성될 수 있다.Pharmaceutical forms suitable for injectable use include sterile aqueous solutions or dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersion. Dispersions can also be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof, and in oils. Under normal conditions of storage and use, these formulations contain preservatives to prevent the growth of microorganisms. In many cases, the form is sterile. It must be stable under the conditions of manufacture and storage and must be preserved to prevent contamination by microorganisms such as bacteria, fungi and other viruses. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyols (eg, glycerol, propylene glycol, liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable oils. Adequate fluidity can be maintained, for example, by the use of coatings such as lecithin, by maintenance of the required particle size in the case of dispersions, and by the use of surfactants. Prevention of contamination by microorganisms can be achieved by various antibacterial and antifungal agents, such as parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases it will be desirable to include a tonicity agent such as a sugar or salt (eg sodium chloride). Prolonged absorption of the injectable compositions can be brought about by using in the composition an agent that delays absorption, for example, aluminum monostearate and gelatin.
주사가능한 수용액의 투여를 위해, 예를 들어 용액은 필요한 경우에 적합하게 완충될 수 있고, 액체 희석제는 먼저 충분한 염수 또는 글루코스로 등장성이 될 수 있다. 이들 특정한 수용액은 정맥내 투여, 근육내 투여, 피하 투여, 복강내 투여, 및 내이의 정원창 막을 통한 주사에 특히 적합하다. 이와 관련하여, 적합한 멸균 수성 매질이 사용될 수 있다. 예를 들어, 1회 투여량을 1 ml의 등장성 NaCl 용액에 용해시키고, 1000 ml의 피하주입액에 첨가하거나 또는 제안된 주입 부위에 주사할 수 있다 (예를 들어, 문헌 [Remington's Pharmaceutical Sciences 15th Edition, pages 1035-1038 and 1570-1580] 참조). 숙주의 상태에 따라 투여량의 일부 변경이 필연적으로 발생할 것이다. 투여를 담당하는 사람은 어떠한 경우라도 개별 대상체/숙주에 대한 적절한 용량을 결정할 것이다.For administration of an aqueous injectable solution, for example, the solution may be suitably buffered if necessary and the liquid diluent first rendered isotonic with sufficient saline or glucose. These particular aqueous solutions are particularly suitable for intravenous administration, intramuscular administration, subcutaneous administration, intraperitoneal administration, and injection through the round window membrane of the inner ear. In this regard, any suitable sterile aqueous medium may be used. For example, a single dose can be dissolved in 1 ml of isotonic NaCl solution and added to 1000 ml of subcutaneous infusion or injected at the proposed site of infusion (see, e.g., Remington's Pharmaceutical Sciences 15th Edition, pages 1035-1038 and 1570-1580). Some variation in dosage will inevitably occur depending on the condition of the host. The person responsible for administration will determine the appropriate dose for the individual subject/host in any case.
멸균 주사가능한 용액은 활성 rAAV를 필요한 양으로 적절한 용매 중에 필요에 따라 본원에 기재된 다양한 다른 성분과 함께 도입한 후, 여과 멸균에 의해 제조된다. 일반적으로, 분산액은 다양한 멸균된 활성 성분을 기본 분산 매질 및 상기 열거된 것들로부터의 필요한 다른 성분을 함유하는 멸균 비히클 내로 혼입시킴으로써 제조된다. 멸균 주사가능한 용액의 제조를 위한 멸균 분말의 경우에, 바람직한 제조 방법은 그의 이전에 멸균-여과된 용액으로부터 활성 성분 플러스 임의의 추가의 목적하는 성분의 분말을 생성하는 진공-건조 및 동결-건조 기술이다.Sterile injectable solutions are prepared by incorporating the active rAAV in the required amount in an appropriate solvent with various other ingredients described herein as required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the various sterilized active ingredients into a sterile vehicle that contains a basic dispersion medium and the required other ingredients from those enumerated above. In the case of a sterile powder for the preparation of a sterile injectable solution, the preferred methods of preparation are vacuum-drying and freeze-drying techniques which produce a powder of the active ingredient plus any additional desired ingredient from its previously sterile-filtered solution. am.
본원에 개시된 rAAV 조성물은 또한 중성 또는 염 형태로 제제화될 수 있다. 제약상 허용되는 염은 염산 또는 인산, 또는 유기 산, 예컨대 아세트산, 옥살산, 타르타르산, 만델산 등을 포함하나 이에 제한되지는 않는다. 유리 카르복실 기로 형성된 염은 또한 무기 염기, 예컨대 예를 들어 수산화나트륨, 수산화칼륨, 수산화암모늄, 수산화칼슘 또는 수산화제2철, 및 유기 염기, 예컨대 이소프로필아민, 트리메틸아민, 히스티딘, 프로카인 등으로부터 유래될 수 있다. 제제화 시, 용액은 투여 제제와 상용성인 방식으로 및 치료상 유효한 양으로 투여될 것이다. 제제는 다양한 투여 형태, 예컨대 주사액, 약물-방출 캡슐 등으로 용이하게 투여된다.The rAAV compositions disclosed herein may also be formulated in neutral or salt form. Pharmaceutically acceptable salts include, but are not limited to, hydrochloric or phosphoric acids, or organic acids such as acetic acid, oxalic acid, tartaric acid, mandelic acid, and the like. Salts formed with free carboxyl groups are also derived from inorganic bases such as, for example, sodium hydroxide, potassium hydroxide, ammonium hydroxide, calcium hydroxide or ferric hydroxide, and organic bases such as isopropylamine, trimethylamine, histidine, procaine, and the like. It can be. When formulated, the solution will be administered in a manner compatible with the dosage formulation and in a therapeutically effective amount. The formulation is readily administered in a variety of dosage forms, such as injectable solutions, drug-releasing capsules, and the like.
본원에 사용된 "담체"는 임의의 및 모든 용매, 분산 매질, 비히클, 용매, 코팅, 희석제, 항박테리아제 및 항진균제, 등장화제 및 흡수 지연제, 완충제, 담체 용액, 현탁액, 콜로이드 등을 포함한다. 제약 활성 물질을 위한 이러한 매질 및 작용제의 사용은 관련 기술분야에 널리 공지되어 있다. 보충 활성 성분이 또한 조성물에 혼입될 수 있다. 어구 "제약상 허용되는"은 분자 물질 및 조성물이 숙주에게 투여되었을 때 알레르기 반응 또는 유사한 불리한 반응을 일으키지 않는 것을 지칭한다.As used herein, "carrier" includes any and all solvents, dispersion media, vehicles, solvents, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. . The use of such media and agents for pharmaceutical active substances is well known in the art. Supplementary active ingredients may also be incorporated into the compositions. The phrase “pharmaceutically acceptable” refers to molecular substances and compositions that do not cause allergic or similar adverse reactions when administered to a host.
전달 비히클, 예컨대 리포솜, 나노캡슐, 마이크로입자, 마이크로구체, 지질 입자, 소포 등이 본 개시내용의 조성물을 적합한 숙주 세포 내로 도입하는 데 사용될 수 있다. 특히, rAAV 벡터 전달된 트랜스진은 지질 입자, 리포솜, 소포, 나노구체, 나노입자 등에 캡슐화된 전달을 위해 제제화될 수 있다.Delivery vehicles such as liposomes, nanocapsules, microparticles, microspheres, lipid particles, vesicles, and the like can be used to introduce compositions of the present disclosure into suitable host cells. In particular, rAAV vector-delivered transgenes can be formulated for delivery encapsulated in lipid particles, liposomes, vesicles, nanospheres, nanoparticles, and the like.
이러한 제제는 본원에 개시된 핵산 또는 rAAV 구축물의 제약상 허용되는 제제의 도입에 바람직할 수 있다. 리포솜의 형성 및 사용은 일반적으로 관련 기술분야의 통상의 기술자에게 공지되어 있다. 최근에, 개선된 혈청 안정성 및 순환 반감기를 갖는 리포솜이 개발되었다 (미국 특허 번호 5,741,516, 이는 본원에 참조로 포함됨). 추가로, 잠재적 약물 담체로서의 리포솜 및 리포솜-유사 제제의 다양한 방법이 기재되어 있다 (미국 특허 번호 5,567,434; 5,552,157; 5,565,213; 5,738,868 및 5,795,587 (이들 각각은 본원에 참조로 포함됨)).Such formulations may be desirable for incorporation of pharmaceutically acceptable formulations of the nucleic acids or rAAV constructs disclosed herein. The formation and use of liposomes is generally known to those skilled in the art. Recently, liposomes with improved serum stability and circulating half-life have been developed (U.S. Patent No. 5,741,516, incorporated herein by reference). Additionally, various methods of liposomes and liposome-like formulations as potential drug carriers have been described (US Pat. Nos. 5,567,434; 5,552,157; 5,565,213; 5,738,868 and 5,795,587, each of which is incorporated herein by reference).
대안적으로, rAAV의 나노캡슐 제제가 사용될 수 있다. 나노캡슐은 일반적으로 물질을 안정하고 재현가능한 방식으로 포획할 수 있다. 세포내 중합체 과부하로 인한 부작용을 피하기 위해, 이러한 초미립자 (대략 0.1 μm 크기)는 생체내에서 분해될 수 있는 중합체를 사용하여 설계되어야 한다. 이들 요건을 충족시키는 생분해성 폴리알킬-시아노아크릴레이트 나노입자가 사용을 위해 고려된다.Alternatively, nanocapsule formulations of rAAV may be used. Nanocapsules can generally entrap substances in a stable and reproducible manner. To avoid side effects due to intracellular polymer overload, these ultrafine particles (approximately 0.1 μm in size) should be designed using polymers that can be degraded in vivo. Biodegradable polyalkyl-cyanoacrylate nanoparticles that meet these requirements are contemplated for use.
IV. 치료 용도IV. therapeutic use
본 개시내용은 또한 청각 상실을 치료하기 위해 대상체의 귀에서 트랜스진 (예를 들어, GJB2)을 정상적으로 발현하는 세포 (예를 들어, 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포)에 트랜스진 (예를 들어, GJB2)을 (예를 들어, 본원에 기재된 단리된 핵산, 벡터, rAAV, 숙주 세포 또는 제약 조성물에 의해) 전달하는 방법을 제공한다. 일부 측면에서, 본 개시내용은 대상체의 귀에서 트랜스진 (예를 들어, GJB2)을 정상적으로 발현하는 세포 (예를 들어, 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포)에 트랜스진 (예를 들어, GJB2)을 (예를 들어, 본원에 기재된 단리된 핵산, 벡터, rAAV, 숙주 세포 또는 제약 조성물에 의해) 전달함으로써 대상체에서 GJB2 연관 질환 (예를 들어, 비-증후군성 청각 상실 및 난청 (DFNB1))을 치료하는 방법을 제공한다. 일부 측면에서, 본 개시내용은 대상체의 귀에서 트랜스진 (예를 들어, GJB2)을 정상적으로 발현하는 세포 (예를 들어, 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포)에 트랜스진 (예를 들어, GJB2)을 (예를 들어, 본원에 기재된 단리된 핵산, 벡터, rAAV, 숙주 세포, 또는 제약 조성물에 의해) 전달함으로써, 내이 지지 세포에서의 표적화된 GJB2 발현 및/또는 뉴런 및/또는 와우 유모 세포에서의 GJB2를 탈표적화하는 방법을 제공한다. 일부 실시양태에서, 내이 지지 세포에서의 표적화된 GJB2 발현 및/또는 뉴런 및/또는 와우 유모 세포에서의 GJB2의 탈표적화는 본원에 기재된 GJB2 연관 질환을 치료하도록 설계된다. 일부 실시양태에서, 대상체는 포유동물이다. 일부 예에서, 대상체는 인간이다. 다른 실시양태에서, 대상체는 비-인간 포유동물, 예컨대 마우스, 래트, 소, 염소, 돼지, 낙타 또는 비-인간 영장류 (예를 들어, 시노몰구스 원숭이)이다.The present disclosure also relates to cells that normally express a transgene (eg, GJB2) in the ear of a subject (eg, connective tissue cells of the cochlea and supporting cells of the organ of Corti and nearby areas) to treat hearing loss. Methods of delivering a transgene (eg, GJB2) (eg, by an isolated nucleic acid, vector, rAAV, host cell, or pharmaceutical composition described herein) are provided. In some aspects, the present disclosure provides cells that normally express a transgene (eg, GJB2) in the ear of a subject (eg, connective tissue cells of the cochlea and supporting cells of the organ of Corti and nearby areas) transgene ( GJB2 associated disease (eg, non-syndromic hearing loss and A method for treating hearing loss (DFNB1)) is provided. In some aspects, the present disclosure provides cells that normally express a transgene (eg, GJB2) in the ear of a subject (eg, connective tissue cells of the cochlea and supporting cells of the organ of Corti and nearby areas) transgene ( eg, GJB2) (eg, by an isolated nucleic acid, vector, rAAV, host cell, or pharmaceutical composition described herein), thereby targeting GJB2 expression in inner ear supporting cells and/or neurons and/or or detargeting GJB2 in cochlear hair cells. In some embodiments, targeted GJB2 expression in inner ear supporting cells and/or detargeting of GJB2 in neurons and/or cochlear hair cells is designed to treat a GJB2-associated disease described herein. In some embodiments, the subject is a mammal. In some examples, the subject is a human. In other embodiments, the subject is a non-human mammal, such as a mouse, rat, cow, goat, pig, camel, or non-human primate (eg, a cynomolgus monkey).
일부 실시양태에서, 대상체는 청각 상실을 갖거나 갖는 것으로 의심된다. 특정 실시양태에서, 대상체는 비-증후군성 청각 상실 및 난청 (DFNB1)을 갖는 것으로 진단된다. 특정 실시양태에서, 청각 상실은 GJB2 유전자에서의 돌연변이와 연관된다. 일부 실시양태에서, GJB2 유전자의 돌연변이는 점 돌연변이, 미스센스 돌연변이, 넌센스 돌연변이, 결실, 삽입 또는 그의 조합이다. GJB2 유전자에서의 돌연변이의 비제한적 예는 표 2에 제시된다. 본원에 사용된 돌연변이는 서열, 예를 들어 핵산 또는 아미노산 서열 내의 잔기의 또 다른 잔기로의 치환, 또는 서열 내의 1개 이상의 잔기의 결실 또는 삽입을 지칭한다. 돌연변이는 전형적으로 원래 잔기를 확인한 다음, 서열 내에서의 이러한 잔기의 위치 및 새로 치환된 잔기의 실체를 확인함으로써 본원에 기재된다.In some embodiments, the subject has or is suspected of having a hearing loss. In certain embodiments, the subject is diagnosed as having non-syndromic deafness and hearing loss (DFNB1). In certain embodiments, hearing loss is associated with mutations in the GJB2 gene. In some embodiments, the mutation of the GJB2 gene is a point mutation, missense mutation, nonsense mutation, deletion, insertion, or combination thereof. Non-limiting examples of mutations in the GJB2 gene are shown in Table 2. Mutation, as used herein, refers to the substitution of a residue in a sequence, eg, a nucleic acid or amino acid sequence, with another residue, or the deletion or insertion of one or more residues in a sequence. Mutations are typically described herein by identifying the original residue, then identifying the location of that residue in the sequence and the identity of the newly substituted residue.
표 2: GJB2 유전자에서의 예시적인 돌연변이 (뉴클레오티드 번호는 NM_004004.6의 ATG에서 시작함).Table 2: Exemplary mutations in the GJB2 gene (nucleotide numbers start at ATG of NM_004004.6).
본 개시내용의 측면은 유전자 요법 (예를 들어, GJB2 단백질을 코딩하는 rAAV)을 사용하여 기능적 유전자 산물 (예를 들어, GJB2 단백질)을, 유전자 산물의 부재 또는 기능부전을 초래하는 관련 유전자 (예를 들어, GJB2) 내의 적어도 하나의 대립유전자에서의 1개 이상의 돌연변이를 포함하는 표적 세포 (예를 들어, GJB2를 정상적으로 발현하는 세포, 예컨대 섬유세포 및 코르티 기관 및 근처 영역의 지지 세포)에 전달함으로써 청각 상실 (예를 들어, DFNB1)을 치료하는 방법에 관한 것이다.Aspects of the present disclosure include the use of gene therapy (eg, a rAAV encoding a GJB2 protein) to generate a functional gene product (eg, a GJB2 protein) in the presence of a related gene (eg, a gene product that results in the absence or malfunction of the gene product). For example, by delivering to a target cell (eg, cells that normally express GJB2, such as fibrocytes and support cells of the organ of Corti and nearby regions) comprising one or more mutations in at least one allele in GJB2) A method of treating hearing loss (eg, DFNB1).
본 발명의 측면은 대상체에게 전달되는 경우에 청각 상실 (예를 들어, DFNB1)을 치료하는 데 효과적인 특정 단백질-코딩 트랜스진 (예를 들어, GJB2)에 관한 것이다. 일부 실시양태에서, 대상체는 청각 상실을 갖거나 갖는 것으로 의심된다. 일부 실시양태에서, 청각 상실은 GJB2 유전자에서의 돌연변이와 연관된다. 일부 실시양태에서, 청각 상실은 표 2 (상기됨)에 열거된 GJB2 유전자에서의 돌연변이와 연관된다. 일부 실시양태에서, 대상체는 DFNB1로 진단된다.Aspects of the invention relate to certain protein-encoding transgenes (eg, GJB2) effective for treating hearing loss (eg, DFNB1) when delivered to a subject. In some embodiments, the subject has or is suspected of having a hearing loss. In some embodiments, hearing loss is associated with a mutation in the GJB2 gene. In some embodiments, the hearing loss is associated with mutations in the GJB2 gene listed in Table 2 (above). In some embodiments, the subject is diagnosed with DFNB1.
따라서, 본 개시내용에 의해 기재된 방법 및 조성물은, 일부 실시양태에서, GJB2 유전자에서의 1개 이상의 돌연변이 또는 결실과 연관된 DFNB1의 치료에 유용하다.Thus, the methods and compositions described by this disclosure are useful, in some embodiments, for the treatment of DFNB1 associated with one or more mutations or deletions in the GJB2 gene.
트랜스진 (예를 들어, GJB2)을 대상체에게 전달하는 방법이 본 개시내용에 의해 제공된다. 상기 방법은 전형적으로 대상체에게 GJB2 단백질을 코딩하는 단리된 핵산, 또는 GJB2를 발현하기 위한 핵산을 포함하는 rAAV의 유효량을 투여하는 것을 수반한다.Methods of delivering a transgene (eg, GJB2) to a subject are provided by the present disclosure. The methods typically involve administering to the subject an effective amount of an isolated nucleic acid encoding a GJB2 protein, or a rAAV comprising a nucleic acid for expressing GJB2.
일부 실시양태에서, GJB2 돌연변이는 점 돌연변이, 미스센스 돌연변이, 넌센스 돌연변이, 삽입 및 결실이나, 이에 제한되지는 않는다. 일부 실시양태에서, DFNB1과 연관된 GJB2 유전자 돌연변이는 표 2에서의 돌연변이를 포함하나 이에 제한되지는 않는다. 일부 실시양태에서, GJB2 유전자에서의 돌연변이는 c.101T>C이다. 일부 실시양태에서, GJB2 유전자에서의 돌연변이는 35DelG이다. 대상체 (예를 들어, GJB2 유전자의 결실 또는 돌연변이와 연관된 DFNB1을 갖거나 갖는 것으로 의심되는 대상체)에서의 GJB2 돌연변이는 관련 기술분야에 공지된 임의의 방법에 의해 대상체로부터 수득된 샘플 (예를 들어, DNA 샘플, RNA 샘플, 혈액 샘플, 또는 다른 생물학적 샘플)로부터 확인될 수 있다. 예를 들어, 일부 실시양태에서, 핵산 (예를 들어, DNA, RNA 또는 그의 조합)은 대상체로부터 수득된 생물학적 샘플로부터 추출되고, 핵산 서열분석은 GJB2 유전자에서의 돌연변이를 확인하기 위해 수행된다. 일부 실시양태에서, GJB2 유전자에서의 돌연변이는, 예를 들어 GJB2 단백질 발현 (예를 들어, 웨스턴 블롯에 의함) 또는 기능을 정량화함으로써 (예를 들어, 구조, 기능 등의 분석에 의함), 또는 DNA를 직접 서열분석하고 수득된 서열을 대조군 DNA 서열 (예를 들어, 야생형 GJB2 DNA 서열)과 비교함으로써 간접적으로 검출된다.In some embodiments, GJB2 mutations include, but are not limited to, point mutations, missense mutations, nonsense mutations, insertions and deletions. In some embodiments, GJB2 gene mutations associated with DFNB1 include, but are not limited to, those in Table 2. In some embodiments, the mutation in the GJB2 gene is c.101T>C. In some embodiments, the mutation in the GJB2 gene is 35DelG. A GJB2 mutation in a subject (eg, a subject suspected of having or having DFNB1 associated with a deletion or mutation of the GJB2 gene) can be determined in a sample obtained from the subject by any method known in the art (eg, DNA samples, RNA samples, blood samples, or other biological samples). For example, in some embodiments, nucleic acids (eg, DNA, RNA, or combinations thereof) are extracted from a biological sample obtained from a subject, and nucleic acid sequencing is performed to identify mutations in the GJB2 gene. In some embodiments, mutations in the GJB2 gene are determined, for example, by quantifying GJB2 protein expression (eg, by Western blot) or function (eg, by analysis of structure, function, etc.), or DNA is detected indirectly by direct sequencing and comparing the resulting sequence to a control DNA sequence (eg, wild-type GJB2 DNA sequence).
일부 측면에서, 본 개시내용은 DFNB1을 갖거나 갖는 것으로 의심되는 대상체에게 치료 유효량의 단리된 핵산, 또는 트랜스진 (예를 들어, GJB2)을 코딩하는 rAAV를 투여하는 단계를 포함하는, DFNB1의 치료를 필요로 하는 대상체에서 DFNB1을 치료하는 방법을 제공한다. 일부 실시양태에서, 트랜스진 (예를 들어, GJB2)을 코딩하는 rAAV는 본 개시내용에 의해 기재된 바와 같이, 내이의 정원창 막에의 주사를 통해 주사된다. 일부 측면에서, 본 개시내용은 요법에서의 의약의 제조에 사용하기 위한, 단리된 핵산 또는 트랜스진 (예를 들어, GJB2)을 코딩하는 rAAV, 또는 그의 제약 조성물을 제공한다. 일부 측면에서, 본 개시내용은 GJB2 유전자와 연관된 청각 상실 및/또는 난청을 치료하기 위한 의약의 제조에 사용하기 위한 단리된 핵산 또는 트랜스진 (예를 들어, GJB2)을 코딩하는 rAAV, 또는 그의 제약 조성물을 제공한다. 일부 측면에서, 본 개시내용은 비-증후군성 난청 및/또는 청각 상실 (DFNB1)을 치료하기 위한 의약의 제조에 사용하기 위한, 단리된 핵산 또는 트랜스진 (예를 들어, GJB2)을 코딩하는 rAAV, 또는 그의 제약 조성물을 제공한다.In some aspects, the present disclosure provides treatment of DFNB1 comprising administering to a subject having or suspected of having DFNB1 a therapeutically effective amount of an isolated nucleic acid, or rAAV encoding a transgene (eg, GJB2). A method of treating DFNB1 in a subject in need thereof is provided. In some embodiments, an rAAV encoding a transgene (eg, GJB2) is injected via injection into the round window membrane of the inner ear, as described by the present disclosure. In some aspects, the present disclosure provides an isolated nucleic acid or rAAV encoding a transgene (eg, GJB2), or a pharmaceutical composition thereof, for use in the manufacture of a medicament in therapy. In some aspects, the present disclosure provides an isolated nucleic acid or rAAV encoding a transgene (e.g., GJB2) for use in the manufacture of a medicament for treating hearing loss and/or deafness associated with the GJB2 gene, or a pharmaceutical thereof. composition is provided. In some aspects, the present disclosure provides an isolated nucleic acid or rAAV encoding a transgene (eg, GJB2) for use in the manufacture of a medicament for treating non-syndromic deafness and/or hearing loss (DFNB1). , or a pharmaceutical composition thereof.
물질의 "유효량"은 목적하는 효과를 생성하기에 충분한 양이다. 일부 실시양태에서, 단리된 핵산 (예를 들어, GJB2 단백질을 코딩하는 트랜스진을 포함하는 단리된 핵산)의 유효량은 대상체의 표적 조직의 충분한 수의 표적 세포를 형질감염시키기에 (또는 rAAV 매개 전달의 맥락에서 감염시키기에) 충분한 양이다. 일부 실시양태에서, 표적 조직은 와우 (예를 들어, 본원에 기재된 바와 같은 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포)이다. 일부 실시양태에서, 단리된 핵산 (예를 들어, rAAV를 통해 전달될 수 있는 것)의 유효량은, 예를 들어 관심 유전자 또는 단백질 (예를 들어, GJB2 단백질)의 발현을 증가 또는 보충하거나, 대상체에서 질환의 1개 이상의 증상 (예를 들어, DFNB1의 증상 또는 징후)을 개선시키는 등의 치료 이익을 갖는 데 충분한 양일 수 있다. 유효량은 다양한 인자, 예컨대, 예를 들어 대상체의 종, 연령, 체중, 건강, 및 표적화될 조직에 따라 좌우될 것이고, 따라서 본 개시내용의 다른 곳에 기재된 바와 같은 대상체 및 조직 사이에서 달라질 수 있다. 일부 실시양태에서, rAAV의 유효량은 안정한 체세포 트랜스제닉 동물 모델을 생산하기에 충분한 양일 수 있다.An “effective amount” of a substance is an amount sufficient to produce the desired effect. In some embodiments, an effective amount of an isolated nucleic acid (eg, an isolated nucleic acid comprising a transgene encoding a GJB2 protein) is sufficient to transfect (or rAAV mediated delivery) a sufficient number of target cells of a target tissue of a subject. in an amount sufficient to infect). In some embodiments, the target tissue is the cochlea (eg, connective tissue cells of the cochlea and supporting cells of the organ of Corti and nearby regions, as described herein). In some embodiments, an effective amount of an isolated nucleic acid (eg, one that can be delivered via rAAV) increases or supplements expression of, eg, a gene or protein of interest (eg, a GJB2 protein), or a subject in an amount sufficient to have therapeutic benefit, such as ameliorating one or more symptoms of a disease (eg, symptoms or signs of DFNB1). The effective amount will depend on various factors, such as, for example, the species, age, weight, health, and tissue to be targeted of the subject, and may therefore vary between subjects and tissues as described elsewhere in this disclosure. In some embodiments, an effective amount of rAAV may be an amount sufficient to produce a stable somatic transgenic animal model.
유효량은 또한 사용된 rAAV에 따라 달라질 수 있다. 본 발명은 부분적으로, 특정한 혈청형 (예를 들어, AAV9.PHP.B 또는 AAV-S)을 갖는 캡시드 단백질을 포함하는 rAAV가 상이한 혈청형을 갖는 캡시드 단백질을 포함하는 rAAV보다 와우 (예를 들어, 내유모 세포, 외유모 세포) 조직의 보다 효율적인 형질도입을 매개한다는 인식에 기초한다.An effective amount may also vary depending on the rAAV used. The present invention relates in part to the fact that rAAVs comprising capsid proteins having a particular serotype (e.g., AAV9.PHP.B or AAV-S) are superior to rAAVs comprising capsid proteins having different serotypes (e.g., AAV9.PHP.B or AAV-S). , inner hair cells, outer hair cells) mediate more efficient transduction of tissues.
특정 실시양태에서, rAAV의 유효량은 kg 당 1010, 1011, 1012, 1013, 또는 1014 게놈 카피이다. 특정 실시양태에서, rAAV의 유효량은 대상체 당 1010, 1011, 1012, 1013, 1014, 또는 1015 게놈 카피이다.In certain embodiments, an effective amount of rAAV is 10 10 , 10 11 , 10 12 , 10 13 , or 10 14 genome copies per kg. In certain embodiments, an effective amount of rAAV is 10 10 , 10 11 , 10 12 , 10 13 , 10 14 , or 10 15 genome copies per subject.
유효량은 또한 투여 방식에 따라 좌우될 수 있다. 예를 들어, 내이의 정원창 막을 통한 주사에 의해 와우 (예를 들어, 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포) 조직을 표적화하는 것은, 일부 경우에, 또 다른 방법 (예를 들어, 전신 투여, 국소 투여)에 의해 와우 (예를 들어, 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포) 조직을 표적화하는 것과 상이한 (예를 들어, 더 높거나 더 낮은) 용량을 필요로 할 수 있다. 따라서, 일부 실시양태에서, 주사는 내이의 정원창 막을 통한 주사이다. 일부 실시양태에서, 투여는 국소 투여 (예를 들어, 귀에 대한 국소 투여)이다. 일부 실시양태에서, 주사는 후방 반고리관 주사이다. 일부 경우에, 다중 용량의 rAAV가 투여된다.An effective amount may also depend on the mode of administration. For example, targeting tissue of the cochlea (e.g., connective tissue cells of the cochlea and supporting cells of the organ of Corti and nearby regions) by injection through the round window membrane of the inner ear is, in some cases, another method (e.g., , systemic administration, local administration) requires different (eg, higher or lower) doses to target tissues of the cochlea (eg, connective tissue cells of the cochlea and supporting cells of the organ of Corti and nearby areas) can be done with Thus, in some embodiments, the injection is through the round window membrane of the inner ear. In some embodiments, the administration is topical administration (eg, topical administration to the ear). In some embodiments, the injection is a posterior semicircular canal injection. In some cases, multiple doses of rAAV are administered.
어떠한 특정한 이론에 얽매이는 것을 원하지는 않지만, 본원에 기재된 rAAV에 의한 와우 세포 (예를 들어, 본원에 기재된 바와 같은 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포)의 효율적인 형질도입은 유전성 청각 상실 (예를 들어, DFNB1)을 갖는 대상체의 치료에 유용할 수 있다. 일부 실시양태에서, 본원에 기재된 조성물 및 방법은 다른 GJB2-연관 질환을 치료하는 데 유용할 수 있다. 본원에 사용된 GJB2-연관 질환은 GJB2 돌연변이 (예를 들어, 기능 상실 돌연변이)에 의해 유발된 상태 및/또는 장애를 지칭한다. 비제한적 GJB2-연관 질환은 난청, 상염색체 열성 1A, 난청, 상염색체 우성 3A, DFNB1, 각막염-어린선-난청 (KID), 어린선, 히스트릭스-유사-난청 (HID), 수장족저 각피증-난청 (PPK), 한공각화증성 에크린구멍 및 진피관 모반, 포빙켈(Vohwinkel), 바트-펌프리, 비통상적 점막피부-난청을 포함한다 (예를 들어, 문헌 [Srinivas et al., Human diseases associated with connexin mutations, Biochimica et Biophysica Acta (BBA) - Biomembranes,Volume 1860, Issue 1, January 2018, Pages 192-201; Lossa et al., GJB2 Gene Mutations in Syndromic Skin Diseases with Sensorineural Hearing Loss, Curr Genomics. 2011 Nov; 12(7): 475-785] 참조).Without wishing to be bound by any particular theory, efficient transduction of cochlear cells (e.g., connective tissue cells of the cochlea as described herein and supporting cells of the organ of Corti and nearby regions) by the rAAVs described herein may It can be useful in the treatment of a subject with a loss (eg, DFNB1). In some embodiments, the compositions and methods described herein may be useful for treating other GJB2-associated diseases. A GJB2-associated disease, as used herein, refers to conditions and/or disorders caused by GJB2 mutations (eg, loss-of-function mutations). Non-limiting GJB2-associated disorders include deafness, autosomal recessive 1A, deafness, autosomal dominant 3A, DFNB1, keratitis-ichthyosis-deafness (KID), ichthyosis, histrix-like-deafness (HID), palmar plantar keratoderma. -includes deafness (PPK), porkeratosis eccrine pits and dermal ductal nevi, Vohwinkel's, Bart-Pumpuri, and atypical mucocutaneous-deafness (see, e.g., Srinivas et al., Human diseases associated with connexin mutations, Biochimica et Biophysica Acta (BBA) - Biomembranes, Volume 1860,
따라서, 유전성 청각 상실을 치료하기 위한 방법 및 조성물이 또한 본원에 제공된다. 일부 측면에서, 본 개시내용은 유전성 청각 상실을 갖거나 또는 갖는 것으로 의심되는 대상체에게 유효량의 rAAV를 투여하는 것을 포함하는, 유전성 청각 상실 (예를 들어, DFNB1) 또는 본원에 기재된 임의의 다른 GJB2-연관 질환을 치료하는 방법을 제공하며, 여기서 rAAV는 (i) AAV9.PHP.B, 또는 AAV-S의 혈청형을 갖는 캡시드 단백질, 및 (ii) 발현 카세트에 플랭킹된 2개의 아데노-연관 바이러스 (AAV) 역전된 말단 반복부 (ITR)를 포함하는 단리된 핵산을 포함하고, 여기서 발현 카세트는 GJB2 유전자 조절 요소 (GRE)를 코딩하는 뉴클레오티드 서열, 및 간극 연접 베타 2 (GJB2) 단백질을 코딩하는 뉴클레오티드 서열에 작동가능하게 연결된 프로모터를 포함한다.Accordingly, methods and compositions for treating hereditary hearing loss are also provided herein. In some aspects, the present disclosure provides treatment of a subject having, or suspected of having, a genetic hearing loss (eg, DFNB1) or any other GJB2- described herein, comprising administering an effective amount of rAAV. A method of treating an associated disease is provided, wherein the rAAV comprises (i) a capsid protein having a serotype of AAV9.PHP.B, or AAV-S, and (ii) two adeno-associated viruses flanking an expression cassette. (AAV) an inverted terminal repeat (ITR), wherein the expression cassette comprises a nucleotide sequence encoding a GJB2 gene regulatory element (GRE), and a gap junction beta 2 (GJB2) protein encoding A promoter operably linked to the nucleotide sequence.
일부 실시양태에서, rAAV (예를 들어, GJB2를 코딩하는 rAAV)는 1일령, 10일령, 1개월령, 3개월령, 6개월령, 1년령, 2년령, 3년령, 5년령, 6년령, 7년령, 8년령, 9년령, 10년령, 11년령, 12년령, 13년령, 14년령, 15년령, 16년령, 17년령, 18년령 이상의 나이의 환자 (예를 들어, DFNB1을 갖는 환자)에게 투여될 수 있다. 일부 실시양태에서, 환자는 유아, 소아 또는 성인이다. 일부 실시양태에서, GJB2-연관 질환 (예를 들어, DFNB1)의 치료 윈도우는 정상적으로 출생시부터 취학전까지의 연령 (예를 들어, 출생시부터 1세, 1 내지 2세, 2-3세, 3-4세, 4-5세, 또는 5-6세)이다. 일부 실시양태에서, rAAV (예를 들어, GJB2를 코딩하는 rAAV)는 환자 (예를 들어, DFNB1을 갖는 환자)에게 일생 동안 1회, 10년마다, 5년마다, 2년마다, 매년, 6개월마다, 3개월마다, 매월, 2주마다, 또는 매주 투여된다. 다른 실시양태에서, rAAV (예를 들어, GJB2를 코딩하는 rAAV)의 투여는 GJB2-연관 질환 (예를 들어, DFNB1)에 대한 다른 공지된 치료 방법과 조합하여 환자 (예를 들어, DFNB1을 갖는 환자)에게 투여된다.In some embodiments, an rAAV (eg, an rAAV encoding GJB2) is 1 day old, 10 days old, 1 month old, 3 months old, 6 months old, 1 year old, 2 years old, 3 years old, 5 years old, 6 years old, 7 years old , 8 years of age, 9 years of age, 10 years of age, 11 years of age, 12 years of age, 13 years of age, 14 years of age, 15 years of age, 16 years of age, 17 years of age, 18 years of age or older (eg, patients with DFNB1) can In some embodiments, the patient is an infant, child, or adult. In some embodiments, the treatment window for a GJB2-associated disease (eg, DFNB1) normally ranges from birth to preschool age (eg, from birth to 1 year, 1 to 2 years, 2-3 years, 3 years -4 years old, 4-5 years old, or 5-6 years old). In some embodiments, an rAAV (eg, an rAAV encoding GJB2) is administered to a patient (eg, a patient with DFNB1) once, every 10 years, every 5 years, every 2 years, every year, 6 years Monthly, every 3 months, monthly, every 2 weeks, or weekly administration. In other embodiments, administration of a rAAV (eg, an rAAV encoding GJB2) is administered to a patient (eg, having DFNB1) in combination with other known methods of treatment for a GJB2-associated disease (eg, DFNB1). administered to the patient).
V. 키트 및 관련 조성물V. Kits and Related Compositions
본원에 기재된 작용제는, 일부 실시양태에서, 치료 또는 연구 용도에서의 그의 사용을 용이하게 하기 위해 제약 또는 연구 키트로 조립될 수 있다. 키트는 본 개시내용의 성분 (예를 들어, 핵산, rAAV)을 수용하는 1개 이상의 용기 및 사용 지침서를 포함할 수 있다. 구체적으로, 이러한 키트는 본원에 기재된 1종 이상의 작용제를, 이들 작용제의 의도된 용도 및 적절한 용도를 기재하는 지침서와 함께 포함할 수 있다. 특정 실시양태에서, 키트 내의 작용제는 작용제의 특정한 적용 및 투여 방법에 적합한 제약 제제 및 투여량일 수 있다. 연구 목적을 위한 키트는 다양한 실험을 수행하기 위한 적절한 농도 또는 양의 성분을 함유할 수 있다.Agents described herein, in some embodiments, can be assembled into pharmaceutical or research kits to facilitate their use in therapeutic or research applications. A kit may include one or more containers containing components of the present disclosure (eg, nucleic acids, rAAV) and instructions for use. Specifically, such kits may include one or more agents described herein, along with instructions describing the intended and appropriate uses of these agents. In certain embodiments, the agent in the kit may be a pharmaceutical formulation and dosage suitable for the particular application and method of administration of the agent. Kits for research purposes may contain components in appropriate concentrations or amounts to perform various experiments.
일부 실시양태에서, 본 개시내용은 본원에 기재된 바와 같은 rAAV를 투여하기 위한 키트에 관한 것이다. 일부 실시양태에서, 키트는 rAAV를 수용하는 용기, 및 수용소로부터 rAAV를 추출하기 위한 장치 (예를 들어, 시린지)를 포함한다. 일부 실시양태에서, 수용소으로부터 rAAV를 추출하기 위한 장치는 또한 투여 (예를 들어, 주사)에 사용된다.In some embodiments, the disclosure relates to kits for administering rAAV as described herein. In some embodiments, a kit includes a container that contains rAAV, and a device (eg, a syringe) for extracting rAAV from the reservoir. In some embodiments, the device for extracting rAAV from the reservoir is also used for administration (eg, injection).
일부 실시양태에서, 본 개시내용은 단백질을 코딩하는 트랜스진 (예를 들어, GJB2)을 포함하는 단리된 핵산을 수용하는 용기를 포함하는, rAAV를 생산하기 위한 키트에 관한 것이다. 일부 실시양태에서, 키트는 AAV 캡시드 단백질, 예를 들면 AAV.PHP.B 캡시드 단백질 또는 AAV-S 캡시드 단백질을 코딩하는 단리된 핵산을 수용하는 용기를 추가로 포함한다. 일부 실시양태에서, 키트는 rep/cap 유전자를 코딩하는 벡터, 및 rAAV를 생산하기 위한 숙주를 추가로 포함한다.In some embodiments, the disclosure relates to a kit for producing rAAV comprising a container containing an isolated nucleic acid comprising a transgene encoding a protein (eg, GJB2). In some embodiments, the kit further comprises a container containing an isolated nucleic acid encoding an AAV capsid protein, eg, an AAV.PHP.B capsid protein or an AAV-S capsid protein. In some embodiments, the kit further comprises a vector encoding the rep/cap genes, and a host for producing the rAAV.
일부 실시양태에서, 본 개시내용은 청각 상실 (예를 들어, DFNB1)을 치료하기 위한 키트에 관한 것이다. 일부 실시양태에서, 키트는 유전자 요법 (예를 들어, 본원에 기재된 rAAV)을 사용하여 기능성을 표적 세포 (예를 들어, 본원에 기재된 바와 같은 와우의 결합 조직 세포 및 코르티 기관 및 근처 영역의 지지 세포)에 전달하기 위한 것이다 (예를 들어, DFNB1).In some embodiments, the disclosure relates to kits for treating hearing loss (eg, DFNB1). In some embodiments, the kit uses gene therapy (e.g., a rAAV described herein) to target cells (e.g., connective tissue cells of the cochlea and support cells of the organ of Corti and nearby regions as described herein) ) (eg, DFNB1).
키트는 본원에 기재된 방법이 연구자에 의한 사용에 용이하게 하도록 설계될 수 있고, 여러 상이한 형태를 취할 수 있다. 키트의 각각의 조성물은, 적용가능한 경우에, 액체 형태 (예를 들어, 용액) 또는 고체 형태 (예를 들어, 건조 분말)로 제공될 수 있다. 특정 경우에, 일부 조성물은, 예를 들어 키트 내에 제공될 수 있거나 제공되지 않을 수 있는 적합한 용매 또는 다른 매질 (예를 들어, 물 또는 세포 배양 배지)의 첨가에 의해 (예를 들어, 활성 형태로) 구성가능하거나 또는 달리 가공가능할 수 있다. 본원에 사용된 "지침서"는 지침 및/또는 홍보의 구성요소를 포함할 수 있고, 전형적으로 패키징 상의 또는 패키징과 결합된 서면 지침서를 포함한다. 지침서는 또한 사용자가 지침서가 키트와 결합되어야 함을 명백하게 인식하도록 하는 임의의 방식으로 제공되는 임의의 구두 또는 전자 지침서, 예를 들어 시청각 자료 (예를 들어, 비디오테이프, DVD, CD-ROM, 다운로드가능한 파일에 대한 웹사이트 링크 등), 인터넷 및/또는 웹-기반 통신 등을 포함할 수 있다. 서면 지침서는 제약 또는 생물학적 제품의 제조, 사용 또는 판매를 규제하는 정부 기관에 의해 규정된 형태일 수 있으며, 이 지침서는 또한 동물 투여를 위한 제조, 사용 또는 판매의 기관에 의한 승인을 반영할 수 있다.Kits can be designed to facilitate use by researchers of the methods described herein, and can take many different forms. Each composition of the kit may be provided in liquid form (eg, a solution) or solid form (eg, a dry powder), where applicable. In certain instances, some compositions may or may not be provided (e.g., in active form) by addition of a suitable solvent or other medium (e.g., water or cell culture medium), which may or may not be provided in a kit. ) may be configurable or otherwise machinable. As used herein, “instructions” may include elements of instructions and/or publicity, and typically include written instructions on or associated with packaging. Instructions may also include any oral or electronic instructional material, eg audiovisual material (eg videotape, DVD, CD-ROM, downloadable website links to possible files, etc.), Internet and/or web-based communications, and the like. Written instructions may be in the form prescribed by a government agency regulating the manufacture, use, or sale of pharmaceutical or biological products, and these instructions may also reflect approval by the agency of manufacture, use, or sale for veterinary administration. .
키트는 1개 이상의 용기에 본원에 기재된 임의의 1종 이상의 성분을 함유할 수 있다. 예로서, 한 실시양태에서, 키트는 키트의 1종 이상의 성분을 혼합하고/거나 샘플을 단리 및 혼합하고 대상체에게 적용하는 것에 대한 지침서를 포함할 수 있다. 키트는 본원에 기재된 rAAV를 수용하는 용기를 포함할 수 있다. rAAV는 액체, 겔 또는 고체 (분말)의 형태일 수 있다. rAAV는 멸균 제조되고, 시린지 내에 패키징되고, 냉장 수송될 수 있다. 대안적으로, rAAV는 저장을 위해 바이알 또는 다른 용기에 수용될 수 있다. 제2 용기는 멸균 제조된 다른 작용제를 가질 수 있다. 대안적으로, 키트는 사전혼합되고 시린지, 바이알, 튜브 또는 다른 용기로 수송된 rAAV를 포함할 수 있다.A kit may contain any one or more components described herein in one or more containers. By way of example, in one embodiment, a kit may include instructions for mixing one or more components of the kit and/or isolating and mixing a sample and applying to a subject. A kit may include a container containing a rAAV described herein. rAAV can be in the form of a liquid, gel or solid (powder). rAAV can be prepared sterile, packaged in syringes, and shipped refrigerated. Alternatively, rAAV may be housed in vials or other containers for storage. The second container may have the other agent prepared sterilely. Alternatively, the kit may include rAAV premixed and shipped in a syringe, vial, tube, or other container.
VI. 일반적 기술VI. general skills
본 발명의 실시는, 달리 나타내지 않는 한, 관련 기술분야의 기술 내에 있는 분자 생물학 (재조합 기술 포함), 미생물학, 세포 생물학, 생화학 및 면역학의 통상적인 기술을 사용할 것이다. 문헌 [Molecular Cloning: A Laboratory Manual, second edition (Sambrook, et al., 1989) Cold Spring Harbor Press; Oligonucleotide Synthesis (M. J. Gait, ed., 1984); Methods in Molecular Biology, Humana Press; Cell Biology: A Laboratory Notebook (J. E. Cellis, ed., 1998) Academic Press; Animal Cell Culture (R. I. Freshney, ed., 1987); Introduction to Cell and Tissue Culture (J. P. Mather and P. E. Roberts, 1998) Plenum Press; Cell and Tissue Culture: Laboratory Procedures (A. Doyle, J. B. Griffiths, and D. G. Newell, eds., 1993-8) J. Wiley and Sons; Methods in Enzymology (Academic Press, Inc.); Handbook of Experimental Immunology (D. M. Weir and C. C. Blackwell, eds.); Gene Transfer Vectors for Mammalian Cells (J. M. Miller and M. P. Calos, eds., 1987); Current Protocols in Molecular Biology (F. M. Ausubel, et al., eds., 1987); PCR: The Polymerase Chain Reaction, (Mullis, et al., eds., 1994); Current Protocols in Immunology (J. E. Coligan et al., eds., 1991); Short Protocols in Molecular Biology (Wiley and Sons, 1999); Immunobiology (C. A. Janeway and P. Travers, 1997); Antibodies (P. Finch, 1997); Antibodies: a practical approach (D. Catty., ed., IRL Press, 1988-1989); Monoclonal antibodies: a practical approach (P. Shepherd and C. Dean, eds., Oxford University Press, 2000); Using antibodies: a laboratory manual (E. Harlow and D. Lane (Cold Spring Harbor Laboratory Press, 1999)); The Antibodies (M. Zanetti and J. D. Capra, eds., Harwood Academic Publishers, 1995)].The practice of the present invention will, unless otherwise indicated, employ conventional techniques of molecular biology (including recombinant techniques), microbiology, cell biology, biochemistry and immunology, which are within the skill of the relevant art. Molecular Cloning: A Laboratory Manual, second edition (Sambrook, et al., 1989) Cold Spring Harbor Press; Oligonucleotide Synthesis (M. J. Gait, ed., 1984); Methods in Molecular Biology, Humana Press; Cell Biology: A Laboratory Notebook (J. E. Cellis, ed., 1998) Academic Press; Animal Cell Culture (R. I. Freshney, ed., 1987); Introduction to Cell and Tissue Culture (J. P. Mather and P. E. Roberts, 1998) Plenum Press; Cell and Tissue Culture: Laboratory Procedures (A. Doyle, J. B. Griffiths, and D. G. Newell, eds., 1993-8) J. Wiley and Sons; Methods in Enzymology (Academic Press, Inc.); Handbook of Experimental Immunology (D. M. Weir and C. C. Blackwell, eds.); Gene Transfer Vectors for Mammalian Cells (J. M. Miller and M. P. Calos, eds., 1987); Current Protocols in Molecular Biology (F. M. Ausubel, et al., eds., 1987); PCR: The Polymerase Chain Reaction, (Mullis, et al., eds., 1994); Current Protocols in Immunology (J. E. Coligan et al., eds., 1991); Short Protocols in Molecular Biology (Wiley and Sons, 1999); Immunobiology (C. A. Janeway and P. Travers, 1997); Antibodies (P. Finch, 1997); Antibodies: a practical approach (D. Catty., ed., IRL Press, 1988-1989); Monoclonal antibodies: a practical approach (P. Shepherd and C. Dean, eds., Oxford University Press, 2000); Using antibodies: a laboratory manual (E. Harlow and D. Lane (Cold Spring Harbor Laboratory Press, 1999)); The Antibodies (M. Zanetti and J. D. Capra, eds., Harwood Academic Publishers, 1995)].
추가의 상술 없이, 관련 기술분야의 통상의 기술자는 본 개시내용에 기초하여 본 발명을 그의 최대 정도로 이용할 수 있는 것으로 여겨진다. 따라서, 하기 구체적 실시양태는 단지 예시적이며, 어떠한 방식으로도 나머지 개시내용을 제한하지 않는 것으로 해석되어야 한다. 본원에 인용된 모든 간행물은 본원에 언급된 목적 또는 대상을 위해 참조로 포함된다.Without further elaboration, it is believed that a person skilled in the relevant art can utilize the present invention to its fullest extent based on this disclosure. Accordingly, the specific embodiments that follow are to be construed as illustrative only and not limiting the remainder of the disclosure in any way. All publications cited herein are incorporated by reference for the purpose or subject matter mentioned herein.
본 발명의 예시적인 실시양태는 하기 실시예에 의해 보다 상세히 기재될 것이다. 이들 실시양태는 본 발명의 예시이며, 관련 기술분야의 통상의 기술자는 예시적인 실시양태로 제한되지 않음을 인식할 것이다.Exemplary embodiments of the present invention will be described in more detail by means of the following examples. These embodiments are illustrative of the present invention, and those skilled in the art will recognize that they are not limited to the illustrative embodiments.
실시예Example
유전적 기원의 청각 장애는 1,000명의 출생 중 약 1명에서 발생하며; 대부분은 상염색체 열성 및 비증후군성이다. 70종 초과의 상이한 난청 유전자가 확인되었지만, 중증 내지 극심한 상염색체 열성 비증후군성 청각 상실의 모든 사례의 거의 절반은, 헤미채널을 형성하는 6개의 서브유닛을 함유하는 간극-연접 단백질 코넥신26을 코딩하는, 단지 1개의 유전자 GJB2에서의 돌연변이로부터 발생한다. 각각의 서브유닛은 4개의 막횡단 나선을 갖고, 이는 막의 면에서 조립되어 큰 중심 포어를 형성한다 (도 1a). 인접한 세포로부터의 GJB2 헤미채널은 결합하여 한 세포의 세포질로부터 다른 세포의 세포질로의 채널을 생성한다. 간극 연접은 연접 플라크에 패킹된 수백개 또는 수천개의 채널에 의해 형성된다.Hearing impairment of genetic origin occurs in about 1 in 1,000 births; Most are autosomal recessive and non-syndromic. Although more than 70 different hearing loss genes have been identified, nearly half of all cases of severe to severe autosomal recessive non-syndromic hearing loss involve the gap-junction protein connexin 26, which contains six subunits that form a hemichannel. It arises from a mutation in only one gene, GJB2, which encodes. Each subunit has four transmembrane helices, which assemble at the face of the membrane to form a large central pore (FIG. 1A). GJB2 hemichannels from adjacent cells bind to create channels from the cytoplasm of one cell to the cytoplasm of another cell. Gap junctions are formed by hundreds or thousands of channels packed in synaptic plaques.
와우에서, GJB2는 2개의 세포 군에서 발현된다: 코르티 기관의 지지 세포, 내부 및 외부 고랑의 상피 세포, 및 치간 세포를 포함하는 상피계; 및 측벽 및 상혈관조 부위의 섬유세포, 혈관선조의 기저 세포, 및 가장자리상부 암색 세포를 포함하는 세포질계 (예를 들어, 문헌 [Kikuchi et al., (1995) Gap junctions in the rat cochlea: immunohistochemical and ultrastructural analysis. Anat Embryol (Berl) 191:101-118] 참조). 이는 유모 세포에서 발현되지 않는다. 와우에서, 상피계는 주로 유사분열후이다. 대조적으로, 세포질 시스템의 섬유모세포는 천천히 전환되지만, BrdU 표지화로 일부 세포 분열이 관찰된다 (Lang et al., 2002; Li et al., 2017). 와우 및 섬유세포/코르티 지지 세포 네트워크의 구조가 도 1a-1b에서 제시된다.In the cochlea, GJB2 is expressed in two cell populations: the epithelial system, which includes supporting cells of the organ of Corti, epithelial cells of the inner and outer sulci, and interdental cells; and the cytoplasmic system including fibrocytes of the lateral wall and supravascular regions, basal cells of the vascular progenitors, and supramarginal dark cells (see, e.g., Kikuchi et al., (1995) Gap junctions in the rat cochlea: immunohistochemical and ultrastructural analysis (see Anat Embryol (Berl) 191:101-118). It is not expressed in hair cells. In the cochlea, the epithelial system is predominantly post-mitotic. In contrast, fibroblasts in the cytoplasmic system convert slowly, but some cell division is observed with BrdU labeling (Lang et al., 2002; Li et al., 2017). The structure of the cochlea and fibrocyte/Corti support cell network is shown in FIGS. 1A-1B.
GJB2 발현은 와우 기능에 중요하다. 예를 들어, 형질도입 채널을 통해 유모 세포에 진입하고 기저 K+ 채널을 통해 방출되는 K+은 상피계에 의해 코르티 기관으로부터 셔틀링되고, 세포질계에 의해 혈관조로 운반되며, 여기서 이는 다시 내림프로 펌핑된다. 또한, GJB2는, 비록 유모 세포가 Gjb2를 발현하지 않더라도, 내이에서 GJB2가 결여된 마우스가 P30까지 감소된 와우내 전위 및 유모 세포 및 지지 세포의 극심한 아폽토시스 손실을 갖기 때문에, 와우의 발생에서 역할을 한다 (문헌 [Cohen-Salmon et al., 2002; Wang et al., 2009; Sun et al., 2009; Crispino et al., 2011; Johnson et al., 2017]). Gjb2가 P6 후에 결실되면, 표현형은 훨씬 더 경미하다 (Chang et al., 2015). 그러나, GJB2에 대한 장기적인 요건이 남아있다: 유모 세포 손실은 결실에도 불구하고 P14만큼 늦게 수개월 후에 발생한다 (Ma et al., 2020). 본원에 기재된 이론에 얽매이는 것을 원하지는 않지만, K+의 셔틀링에서의 GJB2의 기능은 와우의 발생에서의 그의 역할과 관련될 수 있다: K+가 간극 연접 네트워크에 의해 유모 세포로부터 멀리 운반되지 않는 경우에, K+ 축적은 유모 세포를 탈분극시켜 Ca2+ 유입 및 궁극적인 세포 사멸을 유발할 수 있다. 간극 연접 네트워크는 또한 글루코스 및 영양소를 혈관으로부터 감각 상피로 수송하는 데 요구될 수 있고, 그의 부재는 세포 사멸로 이어질 수 있다 (Chang et al., 2008; Mammano, 2019).GJB2 expression is important for cochlear function. For example, K + entering hair cells via transduction channels and released via basal K + channels is shuttled from the Organ of Corti by the epithelial system and transported by the cytoplasmic system to the vasculature, where it is returned to the endolymph. pumped up In addition, GJB2 plays a role in the development of cochlea, as mice lacking GJB2 in the inner ear have reduced intra-cochlear potential and severe apoptotic loss of hair cells and supporting cells by P30, even though hair cells do not express Gjb2. (Cohen-Salmon et al., 2002; Wang et al., 2009; Sun et al., 2009; Crispino et al., 2011; Johnson et al., 2017). When Gjb2 is deleted after P6, the phenotype is much milder (Chang et al., 2015). However, a long-term requirement for GJB2 remains: hair cell loss occurs after several months as late as P14 despite deletion (Ma et al., 2020). Without wishing to be bound by the theory described herein, GJB2's function in the shuttling of K + may be related to its role in the development of the cochlea: K + is not transported away from hair cells by the gap junction network. In some cases, K + accumulation can depolarize hair cells, causing Ca 2+ influx and eventual cell death. The gap junction network may also be required to transport glucose and nutrients from blood vessels to the sensory epithelium, and its absence can lead to cell death (Chang et al., 2008; Mammano, 2019).
GJB2 발현의 상실은 열성, 경도 내지 극심한 감각신경성 청각 장애를 특징으로 하는, 비증후군성 청각 상실 및 난청 (DFNB1)으로 불리는 장애의 근본을 이룬다 (Kelsell et al., 1997; Kenna et al., 2010). 이후 100개 초과의 돌연변이가 환자에서 기재되었지만, 환자의 거의 60%가 단일 염기 결실 (35delG)을 갖고 프레임시프트 및 정지로 이어진다 (Kenna et al., 2010). 미국에서만, 매년 원인 유전자인 GJB2에 2개의 돌연변이가 있는 약 3,500명의 소아가 태어난다 (Kelsell et al., 1997; Zelante et al., 1997; Azaiez et al.,2018). 다수는 극심한 청각 상실을 갖고 태어나며, 이는 아마 출생시에도 비가역적일 것이다. 3분의 2는 출생시 약간의 잔류 청각을 갖고, 이들 중 대부분은 다음 수년에 걸쳐 청각을 상실하며, 이는 치료적 개입을 위한 윈도우가 존재함을 시사한다 (Kenna et al., 2010). 따라서, DFNB1의 치료를 위한 잠재적 후보인 5-10,000명의 취학전-연령의 어린이가 존재한다 (도 1d).Loss of GJB2 expression underlies a disorder called non-syndromic deafness and hearing loss (DFNB1), characterized by recessive, mild to severe sensorineural hearing impairment (Kelsell et al., 1997; Kenna et al., 2010). ). More than 100 mutations have since been described in patients, but nearly 60% of patients have a single base deletion (35delG) leading to frameshifts and arrests (Kenna et al., 2010). In the United States alone, approximately 3,500 children with two mutations in the causative gene, GJB2, are born each year (Kelsell et al., 1997; Zelante et al., 1997; Azaiez et al., 2018). Many are born with severe hearing loss, which is probably irreversible even at birth. Two-thirds have some residual hearing at birth, and most of these lose hearing over the next few years, suggesting that a window for therapeutic intervention exists (Kenna et al., 2010). Thus, there are 5-10,000 preschool-aged children who are potential candidates for treatment of DFNB1 ( FIG. 1D ).
와우가 외과적으로 접근가능하고 비교적 면역보호된 환경이기 때문에, 바이러스 벡터를 사용하는 유전자 요법이 매력적인 접근법이다. GJB2 코딩 서열은 작고 (~680 bp), AAV 벡터에 용이하게 맞을 것이다. AAV가 게놈 내로 삽입되지 않고 분열하는 세포에서 희석되지만, 대부분의 와우 세포는 분열하지 않고, AAV는 수십년 이상 동안 발현을 유도할 수 있다. GJB2의 코딩 서열을 보유하는 rAAV의 주사는 정상적으로 정원창 막 (RWM)을 통해 주사된다 (도 2a). 그러나, 유전자 요법의 이전의 시험은 GJB2의 유전자 부가가 세포 생존 및 간극 연접 네트워크를 구제하였음에도 불구하고 청각을 구제하지 못했다.Because the cochlea is a surgically accessible and relatively immunoprotective environment, gene therapy using viral vectors is an attractive approach. The GJB2 coding sequence is small (~680 bp) and will fit easily into an AAV vector. Although AAV does not integrate into the genome and is diluted in dividing cells, most cochlear cells do not divide, and AAV can induce expression for decades or more. Injections of rAAV carrying the coding sequence of GJB2 are normally injected through the round window membrane (RWM) (Fig. 2a). However, previous trials of gene therapy did not rescue hearing, although the genetic addition of GJB2 rescued cell survival and gap junction networks.
놀랍게도, 와우에서의 GJB2의 무차별한 발현이 섬유세포 및 지지 세포에서 기능을 구제하더라도 유모 세포 및 뉴런의 기능을 손상시키는 것으로 밝혀졌다. 추가로, 내이에서의 GJB2의 혼재성 발현은 야생형 마우스의 청각을 손상시켰다 (도 2b).Surprisingly, it was found that indiscriminate expression of GJB2 in the cochlea impairs the function of hair cells and neurons, although it rescues function in fibrocytes and supporting cells. Additionally, mixed expression of GJB2 in the inner ear impaired hearing in wild-type mice (Fig. 2b).
간극 연접은 인접한 세포 사이에 저-저항성 경로를 생성한다. 그러나, 유모 세포 및 와우의 뉴런은 작은 형질도입 또는 시냅스 전류로 탈분극을 발생시키는 고-저항성 막에 의존한다. 어느 하나가 인접한 세포에 전기적으로 커플링되면, 탈분극이 단락될 것이고, 뇌에 대한 신호가 손실된다. 혼재성 GJB2 발현에 의해 유발된 청각 상실의 놀라운 현상은 GJB2를 정상적으로 발현하지 않는 유모 세포의 무차별한 간극-연접 커플링에 의해 설명될 수 있다. 따라서, 효과적인 유전자 요법 치료는 GJB2 돌연변이를 갖는 대상체에서 청각을 구제하기 위해 유전자를 정상적으로 발현하는 세포 (예를 들어, 섬유세포 및 지지 세포)에서의 외인성 GJB2의 세포-특이적 발현을 유도해야 한다.Gap junctions create low-resistance pathways between adjacent cells. However, hair cells and neurons of the cochlea rely on highly-resistive membranes to generate depolarization with small transduction or synaptic currents. If either is electrically coupled to an adjacent cell, the depolarization will be shorted and the signal to the brain is lost. The surprising phenomenon of hearing loss induced by confluent GJB2 expression can be explained by indiscriminate gap-junction coupling of hair cells that do not normally express GJB2. Thus, effective gene therapy treatment should induce cell-specific expression of exogenous GJB2 in cells that normally express the gene (eg, fibrocytes and supporting cells) to rescue hearing in subjects with GJB2 mutations.
세포 특이적 GJB2 발현을 달성하기 위해, GJB2 유전자의 시스-조절 요소를 평가하였다. 130 내지 >300 kb의 GJB2 상류의 큰 게놈 결실은 선천적으로 극심한 난청을 유발하는 것으로 밝혀졌다. 이들 결실의 중복 분석은 내이에서 GJB2 발현을 위한 중요한 인핸서(들)를 수용하는 것으로 의심되는 ~95 kb의 공유 영역을 밝혀내었다 (도 3a).To achieve cell specific GJB2 expression, cis-regulatory elements of the GJB2 gene were evaluated. Large genomic deletions from 130 to >300 kb upstream of GJB2 have been shown to cause congenital extreme hearing loss. Duplicate analysis of these deletions revealed a ∼95 kb shared region suspected to house the important enhancer(s) for GJB2 expression in the inner ear (FIG. 3A).
인간 환자에서 GJB2의 시스-조절 인핸서를 확인하기 위해, 환자 게놈 데이터, ATAC-Seq 및 시험관내 검정의 조합을 사용하였다. 의심되는 GJB2-관련 청각 상실을 갖는 환자를 대규모 병렬 서열분석과 커플링된 표적화된 게놈 풍부화 또는 게놈 서열분석으로 스크리닝하여 ~95.4 kb 윈도우 내의 비-코딩 질환-유발 변이체를 검색하였다 (도 3b). 오토스코프(OtoSCOPE) 패널로 스크리닝된 환자의 유전자형 및 표현형을 검토하였다. 초기 선택 라운드는 GJB2 코딩 서열에서 기지의 또는 예측된 병원성 변이체에 대해 이형접합이고 그의 청각 상실에 대해 음성 유전자 진단을 받은 모든 환자를 포함하였다. 다음으로, 환자의 코호트를 표현형에 기초하여 정밀화하였다. 시스-조절 요소에서의 돌연변이를 갖는 기능-상실 돌연변이를 트랜스로 보유하는 환자는 선천적으로 중증 내지 극심한 난청을 가질 것이다. GJB2 유전자좌에 대한 연관/대립유전자 분리를 나타내고 GJB2 내의 코딩 변이체가 부재하는 열성 난청을 갖는 패밀리를 또한 연구하였다.A combination of patient genomic data, ATAC-Seq and in vitro assays was used to identify cis-regulatory enhancers of GJB2 in human patients. Patients with suspected GJB2-related hearing loss were screened by targeted genome enrichment or genome sequencing coupled with massively parallel sequencing to search for non-coding disease-causing variants within the -95.4 kb window (FIG. 3B). The genotype and phenotype of patients screened with the OtoSCOPE panel were reviewed. The initial selection round included all patients who were heterozygous for a known or predicted pathogenic variant in the GJB2 coding sequence and had a negative genetic diagnosis for their hearing loss. Next, the cohort of patients was refined based on phenotype. Patients carrying loss-of-function mutations in trans with mutations in cis-regulatory elements will have congenital severe to extreme hearing loss. Families with recessive hearing loss that exhibit linkage/allelic segregation for the GJB2 locus and lack coding variants within GJB2 were also studied.
서열분석 후, 데이터를 브로드 인스티튜트(The Broad Institute)의 GATK 최선의 실시에 따라 맞춤 생물정보학 파이프라인에 의해 분석하였다. 간략하게, 미가공 서열을 버로우-휠러 얼라이너(Burrows-Wheeler Aligner)를 사용하여 게놈에 맵핑하고, 이어서 피카드(Picard)로 중복물을 제거하고, 변이체 검출(variant calling)을 위해 게놈 분석 도구 키트 (GATK), 및 변이체 주석화(variant annotation)를 위한 주석을 달기 위해 앙상블 변이체 효과 예측인자 및 dbNSFP를 맵핑하였다. 주석화 후에, 변이체를 품질, 부차 대립유전자 빈도 및 위치 (~95 kb 윈도우 내)에 기초하여 필터링하였다. 변이체는 DNA 요소의 백과사전 (ENCODE) 및 유전자형-조직 발현에 의해 규정된 바와 같은 조절 요소 내에 속하는 변이체를 기초로 하여 우선 순위를 매겼다. 100명 초과의 환자를 서열분석하고, 200개 초과의 후보 변이체를 확인하였다. 대략 5-10%의 DFNB1 환자는 비-코딩 영역에 제2 질환-유발 대립유전자를 갖는다.After sequencing, data were analyzed by a custom bioinformatics pipeline according to the GATK best practices of The Broad Institute. Briefly, the raw sequence was mapped to the genome using a Burrows-Wheeler Aligner, followed by removal of duplicates with a Picard, and a genome analysis tool kit (for variant calling) GATK), and ensemble variant effect predictors and dbNSFP were mapped to annotate for variant annotation. After annotation, variants were filtered based on quality, minor allele frequency and location (within a ~95 kb window). Variants were prioritized based on variants falling within regulatory elements as defined by the Encyclopedia of DNA Elements (ENCODE) and genotype-tissue expression. More than 100 patients were sequenced and more than 200 candidate variants were identified. Approximately 5-10% of DFNB1 patients have a second disease-causing allele in the non-coding region.
마우스 및 비-인간 영장류에서, ATAC-Seq (서열분석을 사용한 트랜스포사제-접근가능한 염색질에 대한 검정; 문헌 [Buenrostro et al., 2013])를 사용하여 와우에서 활성인 유전자에 대한 인핸서를 확인하였다. ATAC-Seq는 서열분석 어댑터를 게놈의 개방 영역에 삽입하는 과다활성 돌연변이체 Tn5 트랜스포사제를 사용한다. 이어서, 게놈 DNA를 어댑터로부터 서열분석하여 개방 염색질을 확인하였다.Identification of enhancers for genes active in the cochlea using ATAC-Seq (assay for transposase-accessible chromatin using sequencing; Buenrostro et al., 2013) in mice and non-human primates did ATAC-Seq uses a hyperactive mutant Tn5 transposase that inserts sequencing adapters into open regions of the genome. Genomic DNA was then sequenced from the adapters to confirm open chromatin.
와우가 정상 기능을 획득한 시점인 P2, P5 및 P8 기의 신생 마우스로부터 와우를 절개하였다. 성체 마카크 원숭이로부터 하나의 와우를 절개하였다. 이 데이터 세트는 와우에서의 유전자 조절 연구에 중요한 기여를 한다. 이는, 예를 들어 유전성 및 후천성 청각 상실 둘 다에서 빈번하게 손상되는 특정 세포 유형, 예컨대 유모 세포, 인접 줄기 세포, 및 나선 신경절 뉴런에서 유전자 발현을 유도하는 데 사용될 수 있다.Cochleas were dissected from neonatal mice at P2, P5 and P8 stages, when the cochleas had acquired normal function. One cochlea was dissected from an adult macaque monkey. This data set makes an important contribution to the study of gene regulation in the cochlea. It can be used, for example, to induce gene expression in certain cell types that are frequently damaged in both hereditary and acquired hearing loss, such as hair cells, adjacent stem cells, and spiral ganglion neurons.
마우스 Gjb2 유전자와 연관된 18개의 후보 인핸서가 확인되었다. 도 3c는 마우스 Gjb2 유전자의 영역에서 ~200 kb의 마우스 게놈 서열을 보여주고; 다수의 ATAC-Seq 판독체를 갖는 영역이 강조된다. 후속 연구는 포유동물 종 사이에서 보존되는 마우스 Gjb2 유전자 근처에 있는 인핸서에 초점을 맞추었다. 도 3c (상단)는 마우스 Gjb2 유전자 영역에서 ~300 kb에 걸친, 발생 단계 P2, P5 및 P8의 마우스 와우로부터의 ATAC-Seq의 UCSC 게놈 브라우저 뷰에서의 마우스 Gjb2 유전자 조절 요소 (GRE)의 확인을 나타낸다. 음영 영역은 추정 GRE를 함유하는 영역을 표시한다 (GRE를 함유하는 인간 및 마우스 영역 서열은 표 1에 열거됨). X-축은 마우스 게놈 내의 chr14 상의 게놈 영역이다. Y-축은 게놈 내의 특이적 영역에 정렬되는 ATAC-Seq로부터의 판독물의 수이다. 담청색 하이라이트는 판독물 파일업이 풍부한 전사상 활성 영역의 특징인 오픈 염색질의 영역을 나타내며, 이는 이들 영역에서의 보다 높은 활성을 시사한다. 영역 A 및 B는 마우스 Gjb2 자체 내의 전사상 활성 서열을 표시한다. 영역 C-M은 시스-조절 네트워크의 일부일 수 있는 Gjb2 주변에서 전사상 활성인 영역이다. GJB2 GRE 서열을 표 1에 열거된 영역 서열로 확인하였다. 도 3c (하단)는 특이적 마우스 Gjb2 GRE (GRE 2, 3, 5, 7, 및 9)로서 확인된 담청색 음영 영역 내 및 주변의 전사상 활성 영역을 보여준다. 인간 GJB2 GRE 서열은 마우스 Gjb2 GRE를 모델링함으로써 인 실리코로 확인하였다. 인간 GRE 1, 2, 3, 4, 5, 7 및 9의 뉴클레오티드 서열은 표 3에 제시되고, 이를 후속 실험에서 시험하였다.Eighteen candidate enhancers associated with the mouse Gjb2 gene were identified. Figure 3c shows ~200 kb of mouse genomic sequence in the region of the mouse Gjb2 gene; Regions with multiple ATAC-Seq reads are highlighted. Subsequent studies have focused on enhancers located near the mouse Gjb2 gene that are conserved among mammalian species. Figure 3c (top) shows the identification of the mouse Gjb2 gene regulatory element (GRE) in the UCSC Genome Browser View of ATAC-Seq from mouse cochleas at developmental stages P2, P5 and P8, spanning ~300 kb in the mouse Gjb2 gene region. indicate Shaded regions indicate regions containing putative GREs (human and mouse region sequences containing GREs are listed in Table 1). The X-axis is the genomic region on chr14 in the mouse genome. The Y-axis is the number of reads from ATAC-Seq that align to specific regions in the genome. Light blue highlights indicate regions of open chromatin that are characteristic of regions of transcriptional activity enriched in read pile-up, suggesting higher activity in these regions. Regions A and B represent transcriptionally active sequences within mouse Gjb2 itself. Regions C-M are transcriptionally active regions around Gjb2 that may be part of a cis-regulatory network. The GJB2 GRE sequence was confirmed with the region sequences listed in Table 1. Figure 3c (bottom) shows the transcriptionally active regions within and around light blue shaded regions identified as specific mouse Gjb2 GREs (
추가로, GJB2 유전자의 프로모터, 5' UTR 및/또는 3' UTR은 또한 천연 조절 서열을 함유한다. 프로모터, 5' UTR 및/또는 3' UTR을 포함하는 구축물을 설계하고, 세포 특이적 GJB2 발현에서의 그의 능력에 대해 시험하였다. 구축물을 rAAV 내로 패키징하고, 마우스의 내이 내로 주사하였다. 마커 유전자를 발현하는 세포 유형을 GJB2를 발현하는 세포 유형과 비교하였다. 예를 들어, 500 bp의 인간 GJB2 프로모터, 및 300 bp의 5' UTR, 이어서 GFP 및 인간 GJB2 3' UTR에 대한 코딩 서열을 포함하도록 C15 벡터를 구축하였다 (도 3d의 벡터 C15). C15 벡터는 AAV9-PHP.B 캡시드를 사용하여 rAAV 내로 패키징되었고, 이는 이전에 다수의 와우 세포 유형을 형질도입하는 데 효과적인 것으로 밝혀졌다 (Gyorgy et al., 2018). AAV9-PHP.B-C15 바이러스를 P0 마우스 새끼의 내이에 주사하였다. GJB2 발현을 GJB2를 표적화하는 항체를 사용하여 면역형광에 의해 검출하였다 (도 3f, 중간 패널). AAV9-PHP.B-c15 벡터로 형질도입되고 GJB2 인핸서 하에 GFP 마커 유전자를 발현하는 세포는 좌측 패널에 제시된다. 내이에서의 GJB2의 발현 패턴은 키쿠치(Kikuchi)에 의해 보고된 것과 일치하였다. 우측 패널에서, IHC 및 OHC (표시됨)는 또한 액틴을 형광 팔로이딘으로 표지함으로써 확인된다. 우측 패널에서, IHC 및 OHC (표시됨)는 또한 액틴을 형광 팔로이딘으로 표지함으로써 확인된다. 특히, AAV9-PHP.B-C15는 유모 세포에 효율적으로 형질도입될 수 있지만, 유모 세포에서 GFP 발현은 관찰되지 않았다. 이는 Gjb2 인핸서가 유모 세포에서 활성이 아니기 때문일 것이다. 도 3f는 측벽 (상단)으로부터 치간 세포 (하단)까지의 마우스 와우의 분절을 나타낸다. AAV9-PHP.B-C15 벡터로 형질도입되고 Gjb2 인핸서 하에 GFP 마커 유전자를 발현하는 세포는 좌측 패널에 제시된다. Gjb2를 정상적으로 발현하는 세포는 중간 패널에 제시된다. 우측 패널에서, IHC 및 OHC (표시됨)는 또한 액틴을 형광 팔로이딘으로 표지함으로써 확인된다. c15 구축물에 의해 유도된 GFP의 발현 패턴은 GJB2에 대한 동일한 항체를 사용하는 문헌 [Kikuchi et al., 1995]에 보고된 천연 Gjb2 발현과 일치한다. 특히, c15는 유모 세포에서 GFP 발현을 유도하지 않는다.Additionally, the promoter, 5' UTR and/or 3' UTR of the GJB2 gene also contains native regulatory sequences. Constructs containing promoters, 5' UTRs and/or 3' UTRs were designed and tested for their ability in cell specific GJB2 expression. The construct was packaged into rAAV and injected into the inner ear of mice. Cell types expressing the marker gene were compared to cell types expressing GJB2. For example, a C15 vector was constructed to contain a 500 bp human GJB2 promoter, and a 300 bp 5' UTR, followed by the coding sequences for GFP and human GJB2 3' UTR (vector C15 in FIG. 3D). The C15 vector was packaged into rAAV using the AAV9-PHP.B capsid, which was previously shown to be effective in transducing multiple cochlear cell types (Gyorgy et al., 2018). AAV9-PHP.B-C15 virus was injected into the inner ear of PO mouse pups. GJB2 expression was detected by immunofluorescence using an antibody targeting GJB2 (Fig. 3f, middle panel). Cells transduced with the AAV9-PHP.B-c15 vector and expressing the GFP marker gene under the GJB2 enhancer are shown in the left panel. The expression pattern of GJB2 in the inner ear was consistent with that reported by Kikuchi. In the right panel, IHC and OHC (indicated) are also identified by labeling actin with fluorescent phalloidin. In the right panel, IHC and OHC (indicated) are also identified by labeling actin with fluorescent phalloidin. In particular, AAV9-PHP.B-C15 could be efficiently transduced into hair cells, but no GFP expression was observed in hair cells. This may be because the Gjb2 enhancer is not active in hair cells. Figure 3f shows the segment of the mouse cochlea from the lateral wall (top) to the interdental cells (bottom). Cells transduced with the AAV9-PHP.B-C15 vector and expressing the GFP marker gene under the Gjb2 enhancer are shown in the left panel. Cells normally expressing Gjb2 are shown in the middle panel. In the right panel, IHC and OHC (indicated) are also identified by labeling actin with fluorescent phalloidin. The expression pattern of GFP induced by the c15 construct is consistent with native Gjb2 expression reported by Kikuchi et al., 1995 using the same antibody against GJB2. In particular, c15 does not induce GFP expression in hair cells.
추가로, 다른 구축물 (C20-C23)을 혼재성 닭 베타 액틴 (CBA) 프로모터 하에 외인성 GJB2 발현을 시험하기 위해 설계하였다. C20 벡터에서, 인간 GJB2 코딩 서열은 CBA 프로모터에 의해 유도되었다 (도 3e, 벡터 C20). C20 벡터를 rAAV 내로 패키징하고, 이를 마우스에서 P0 와우 내로 주사하였다. GJB2 발현을 GJB2 항체를 사용하여 면역형광을 갖는 유모 세포에서 확인하였다 (도 3g). 유모 세포에 의한 GJB2의 발현은 인접한 지지 세포에 대한 전기적 커플링을 생성하고, 정상적인 감각 수용체 전위를 단락시킬 것이다. 이 이론을 시험하기 위해, 여러 다른 벡터를 설계하였다. C21 벡터는 35delG 돌연변이를 보유하는 인간 GJB2 코딩 서열에 작동가능하게 연결된 CBA 프로모터를 포함한다. 활성 GJB2 단백질은 C21 벡터에 의해 생산될 수 없다. C22 벡터는 GJB2 코딩 서열을 갖지 않는 CBA 프로모터를 포함한다. C23 벡터는 유모 세포에 의해 정상적으로 발현되는 단백질인 인간 클라린 1의 발현을 유도하는 CBA 프로모터를 포함한다. AAV1 또는 AAV9-PHP.B 캡시드를 사용하여 벡터를 rAAV 내로 패키징하였다. rAAV를 P1에서 정원창 막을 통해 마우스의 내이 내로 주사하고, 청각 뇌간 반응 (ABR)을 P30에서 측정하였다 (8, 11 및 16 kHz에서의 역치를 평균함). 도 3h에 제시된 바와 같이, 비감염된 야생형 마우스는 30 dB 근처의 ABR 역치를 가졌고, 염수 모의 주사는 야생형 마우스에서 ABR 역치를 변화시키지 않았다. AAV1 또는 AAV9-PHP.B 캡시드에서 CBA 프로모터를 사용한 GJB2 발현은 역치를 30-40 dB 상승시켰다. 비교를 위해, 조건부 녹아웃 Cre+, Gjb2 fl/fl 마우스는 시험된 최고 수준 (90 dB)에서 반응을 나타내지 않았다. 추가로, AAV9-PHP.B-C20을 주사한 마우스는 발작 및 종종 사망을 포함한 신경계 증상을 흔히 나타내는 것으로 관찰되었다. 벡터 AAV9-PHP.B-C21 (불활성화 돌연변이를 갖는 GJB2 발현), AAV9-PHP.B-C22 (GJB2 코딩 서열 없음), 또는 AAV9-PHP.B-C23 (정상 유모-세포 단백질인 클라린 1 발현)에서는 치사성이 관찰되지 않았다. 또한, rAAV가 주사 전에 10배 또는 100배 희석되면, 어느 벡터에 의해서도 독성 또는 치사성이 관찰되지 않았다. 뉴런의 전기적 커플링이 항상성 시스템의 신경 조절을 손상시키는 AAV9-PHP.B의 뇌 향성으로 인해 GJB2를 코딩하는 소량의 rAAV가 뇌에 도달했을 가능성이 있다. 이는 예기치 않게 그러나 극적으로 독성을 감소시키기 위해 GJB2 발현을 적절한 세포로 제한할 필요성을 보여주었다.Additionally, other constructs (C20-C23) were designed to test exogenous GJB2 expression under the mixed chicken beta actin (CBA) promoter. In the C20 vector, the human GJB2 coding sequence was driven by the CBA promoter (Fig. 3e, vector C20). The C20 vector was packaged into rAAV and injected into the P0 cochlea in mice. GJB2 expression was confirmed in hair cells by immunofluorescence using the GJB2 antibody (FIG. 3g). Expression of GJB2 by hair cells will create electrical coupling to adjacent supporting cells and short-circuit normal sensory receptor potentials. To test this theory, several different vectors were designed. The C21 vector contains a CBA promoter operably linked to the human GJB2 coding sequence carrying the 35delG mutation. Active GJB2 protein cannot be produced by the C21 vector. The C22 vector contains the CBA promoter without the GJB2 coding sequence. The C23 vector contains a CBA promoter that drives expression of
Sox10-Cre+,Gjb2 fl/fl 녹아웃 마우스는 시험된 최고 수준 (90 dB)에서 반응이 없었다 (도 3h). 녹아웃에서, AAV1-CBA-GJB2 또는 AAV9-PHP.B-CBA-GJB2 rAAV는 구제를 일으키지 않았다. 청각을 구제하는 데 있어서 인핸서를 시험하기 위해 C70 구축물을 생산하였다. C70 구축물은 AAV 5' ITR, GJB2 기저 프로모터, GJB2 엑손 1 5' UTR, 코작 서열, 마우스 또는 인간 GJB2 코딩 서열, 임의적인 HA 태그, GJB2 엑손 2 3' UTR, WPRE, 소 성장 호르몬 폴리 A 신호, 및 AAV 3' ITR을 포함한다. C70 구축물을 AAV9-PHP.B 캡시드 단백질을 사용하여 rAAV 내로 패키징하고, 야생형 마우스 및 Sox10-Cre+,Gjb2 fl/fl 녹아웃 마우스 둘 다의 내이 내로 주사하였다. Gjb2 발현은 Sox10-Cre+,Gjb2 fl/fl 녹아웃 마우스에서 청각을 15-20 dB만큼 구제하였다. 동일한 벡터는 야생형 마우스에서 청각을 손상시키지 않았다 (도 3h). 도 3i-3l은 HA 태그를 갖거나 갖지 않는 마우스 GJB2 또는 인간 GJB2를 코딩하는 c70 벡터 플라스미드의 지도를 보여준다. 도 3m은 HA 태그를 갖거나 갖지 않는 마우스 GJB2 또는 인간 GJB2를 코딩하는 벡터 c.70의 개략도를 보여준다. 도 3n은 생성되고 시험된 추가의 벡터를 보여준다.Sox10-Cre+,Gjb2 fl/fl knockout mice were unresponsive at the highest level tested (90 dB) (Fig. 3h). In knockout, AAV1-CBA-GJB2 or AAV9-PHP.B-CBA-GJB2 rAAVs did not cause rescue. A C70 construct was produced to test the enhancer in rescuing hearing. The C70 construct contains an AAV 5' ITR, a GJB2 basal promoter, a
또한, 내이 세포에 대한 향성을 갖는 다른 AAV 캡시드 단백질을, 트랜스진 (예를 들어, GJB2 또는 GFP)을 마우스 및 영장류 둘 다에서 적절한 내이 세포에 전달하고 청각을 구제하는 그의 능력에 대해 시험하였다. 원래 뇌 향성을 위해 개발된 AAV-S 캡시드 단백질은 마우스 및 영장류 와우 둘 다에서 GJB2-발현 세포의 우수한 형질도입을 나타냈다 (도 4). AAV-S 캡시드 단백질, 및 GJB2 기저 프로모터 및 5' UTR 하에 GJB2의 발현을 유도하는 c70 벡터를 포함하는 rAAV를 패키징하였다. AAV-S-C70 rAAV를 Gjb2 조건부 녹아웃 마우스 내로 주사한다. 이들 마우스의 청각을 시험하였다. AAV-S-C70 rAAV는 AAV9-PHP.B-C70 rAAV와 유사하게, 또는 훨씬 더 우수하게 청각을 구제할 수 있다.In addition, other AAV capsid proteins with tropism for inner ear cells were tested for their ability to deliver a transgene (eg, GJB2 or GFP) to appropriate inner ear cells and rescue hearing in both mice and primates. The AAV-S capsid protein originally developed for brain orientation showed superior transduction of GJB2-expressing cells in both the mouse and primate cochlea ( FIG. 4 ). An rAAV containing the AAV-S capsid protein and a c70 vector driving expression of GJB2 under the GJB2 basal promoter and 5' UTR was packaged. AAV-S-C70 rAAV is injected into Gjb2 conditional knockout mice. The hearing of these mice was tested. AAV-S-C70 rAAV can rescue hearing similarly, or even better, than AAV9-PHP.B-C70 rAAV.
AAV-S-C70 rAAV를 야생형 마우스에 주사한다. C70 벡터는 항-HA 항체를 사용하여 내이에서 GJB2 발현의 용이한 검출을 가능하게 하는 HA 태그를 포함한다. GJB2 발현은 GJB2를 정상적으로 발현하는 코르티 기관 및 섬유세포의 지지 세포에서만 검출될 것으로 예상된다. 주사된 야생형 마우스의 청각은 또한 GJB2-연관 독성을 평가하도록 시험된다.AAV-S-C70 rAAV is injected into wild type mice. The C70 vector contains an HA tag that allows easy detection of GJB2 expression in the inner ear using an anti-HA antibody. GJB2 expression is expected to be detected only in the feeder cells of the organ of Corti and fibrocytes that normally express GJB2. Hearing of injected wild-type mice is also tested to assess GJB2-associated toxicity.
추가로, 비-인간 영장류 (NHP)의 내이 세포를 형질도입하는 AAV-S의 능력을 시험하였다. AAV-S 캡시드 단백질 및 GFP를 코딩하는 벡터를 포함하는 rAAV를 비-인간 영장류의 양쪽 귀에 주사하였다. 동물을 3주 후에 안락사시키고, 와우를 조직학용으로 준비하였다. GFP 발현은 이들 동물의 와우에서 평가된다. 마우스에서 유사한 실험을 병행하여 수행하였다.Additionally, the ability of AAV-S to transduce inner ear cells of non-human primates (NHP) was tested. An rAAV containing a vector encoding AAV-S capsid protein and GFP was injected into both ears of non-human primates. Animals were euthanized after 3 weeks, and cochleas were prepared for histology. GFP expression is assessed in the cochlea of these animals. Similar experiments were performed in parallel in mice.
GFP를 코딩하는 AAV-S 벡터를 후관 경로 (마우스에서 내이 전반에 걸쳐 벡터를 강건하게 전달함)를 사용하여 성체 마우스의 내이 내로 주사하였다. 주사 20일 후에 동물을 안락사시키고, 와우를 수거하였다.An AAV-S vector encoding GFP was injected into the inner ear of adult mice using the tracheal route (which robustly delivers the vector across the inner ear in mice). Animals were euthanized 20 days after injection and cochleas were harvested.
표 3에 열거된 GJB2 GRE가 이를 정상적으로 발현하는 세포에서 GJB2 발현을 가능하게 하고, GJB2를 정상적으로 발현하지 않는 세포에서 GJB2 발현을 방지하는지 여부를 시험하기 위해, GRE를 각각 기저 GJB2 프로모터 및 GJB2 엑손 1 5' UTR의 제어 하에 GFP, 인간 GJB2 또는 마우스 Gjb2 발현을 유도하는 AAV 벡터 내로 혼입시켰다. 벡터 지도가 도 5a-5u에 제시된다. 벡터는 5'에서 3'으로, AAV 5' ITR, 인간 GJB2 GRE, GJB2 기저 프로모터, 인간 GJB2 엑손 1 5' UTR, eGFR, 인간 GJB2 또는 마우스 Gjb2를 코딩하는 뉴클레오티드 서열, 및 GJB2 엑손 2 3' UTR을 포함한다. 벡터 c.81.1은 인간 GJB2 GRE1을 포함하고; 벡터 c.81.2는 인간 GJB2 GRE2를 포함하고; 벡터 c.81.3은 인간 GJB2 GRE3을 포함하고; 벡터 c.81.4는 인간 GJB2 GRE4를 포함하고; 벡터 c.81.5는 인간 GJB2 GRE5를 포함하고; 벡터 c.81.7은 인간 GJB2 GRE7을 포함하고; 벡터 c.81.8은 인간 GJB2 GRE8을 포함하고; 벡터 c.81.9는 인간 GJB2 GRE9를 포함한다 (도 5a-5u). 도 5v는 상기 기재된 바와 같은 eGFP, 마우스 GJB2 및 인간 GJB2를 코딩하는 c81.2, c81.3, c81.5, c81.7 및 c81.9의 개략도를 보여준다.To test whether the GJB2 GREs listed in Table 3 enable GJB2 expression in cells that normally express them and prevent GJB2 expression in cells that do not normally express GJB2, the GREs were isolated from the basal GJB2 promoter and
GFP를 코딩하는 c.81.2, c81.3, c81.5, c81.7, 및 c81.9 벡터를 각각 AAV9.PHP.B 캡시드 단백질을 사용하여 rAAV 내로 패키징하고, 야생형 마우스의 출생후 제1일에 정원창 막을 통해 주사하였다. 와우를 P6에서 조직학에 대해 고정시키고, 와우 조직에서 GFP 발현을 평가하였다.The c.81.2, c81.3, c81.5, c81.7, and c81.9 vectors encoding GFP were each packaged into rAAV using the AAV9.PHP.B capsid protein and
GJB2 유전자 조절 요소 5 (GJB2 GRE5, 리포터로서 eGFP를 코딩하는 벡터 c81.5에서)는 GJB2-발현 세포에 대한 eGFP의 표적 발현을 돕는 것으로 밝혀졌다. 도 6a는 코르티 기관 내의 및 내측의 다양한 지지 세포를 포함하는 eGFP 발현 세포의 형광 영상을 보여준다. 도 6b는 코르티 기관의 영역에서의 내인성 GJB2의 항체 표지를 보여준다. GJB2 발현은 외인성 eGFP의 발현과 크게 중복되었다. 도 6c는 유모 세포의 부동섬모를 나타낸 액틴의 제3 염색을 포함한 도 6a 및 6b의 오버레이이다. 유모 세포에서 eGFP는 발현되지 않았다. 도 6d는 eGFP 및 유모 세포에 대한 단백질 마커 MYO7A의 동결 절편 면역형광 영상을 보여준다. eGFP는 코르티 기관에서 다양한 지지 세포에서 발현되었지만, 유모 세포에서 발현된 MYO7A 발현과 중복되지 않았다. 인간 GJB2 또는 마우스 GJB2를 코딩하는 벡터를 의도된 세포에서의 GJB2 발현에 대해 시험할 것이다.GJB2 gene regulatory element 5 (GJB2 GRE5, in vector c81.5 encoding eGFP as reporter) was found to help target expression of eGFP to GJB2-expressing cells. 6A shows fluorescence images of eGFP expressing cells, including various supporting cells within and medial to the organ of Corti. 6B shows antibody labeling of endogenous GJB2 in the region of the organ of Corti. GJB2 expression largely overlapped with that of exogenous eGFP. 6C is an overlay of FIGS. 6A and 6B including a third staining of actin showing stereocilia of hair cells. eGFP was not expressed in hair cells. 6D shows frozen section immunofluorescence images of eGFP and MYO7A, a protein marker for hair cells. eGFP was expressed on various supporting cells in the organ of Corti, but did not overlap with MYO7A expression expressed on hair cells. Vectors encoding human GJB2 or mouse GJB2 will be tested for GJB2 expression in the intended cells.
도 7a-7d는 와우의 측벽에서의 벡터 c.81.5에 의한 eGFP 발현 패턴을 보여준다. 도 7a는 측벽의 섬유세포를 포함하는 세포에서의 eGFP 발현을 보여준다. 도 7b는 측벽의 영역에서의 내인성 GJB2의 항체 표지를 보여준다. GJB2 발현은 외인성 GFP와 크게 중복된다. 도 7c는 도 7a 및 7b의 오버레이 영상이다. eGFP는 Gjb2를 발현하는 세포에서 발현되었다는 점에 주목한다. 도 7d-7e는 코르티 기관의 지지 세포 및 측벽의 섬유세포에서의 GFP (도 7d) 및 GJB2 (도 7e)의 동결 절편 면역형광을 보여준다.7A-7D show the eGFP expression pattern by vector c.81.5 in the lateral wall of the cochlea. Figure 7a shows eGFP expression in cells containing fibroblasts of the lateral wall. Figure 7b shows antibody labeling of endogenous GJB2 in the region of the lateral wall. GJB2 expression largely overlaps with exogenous GFP. 7c is an overlay image of FIGS. 7a and 7b. Note that eGFP was expressed in cells expressing Gjb2. 7D-7E show frozen section immunofluorescence of GFP (FIG. 7D) and GJB2 (FIG. 7E) in the supporting cells of the organ of Corti and the fibrocytes of the lateral wall.
인간 결실에 기초하여 확인된 인간 GJB2 인핸서는 청각을 구제할 수 있고, 유사하게 GJB2 연관 독성을 유도하지 않는다.A human GJB2 enhancer identified on the basis of a human deletion can rescue hearing and similarly does not induce GJB2-associated toxicity.
참고문헌references
Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ (2013) Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nature methods 10:1213-1218.Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ (2013) Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nature methods 10:1213-1218.
Cohen-Salmon M, Ott T, Michel V, Hardelin JP, Perfettini I, Eybalin M, Wu T, Marcus DC, Wangemann P, Willecke K, Petit C (2002) Targeted ablation of connexin26 in the inner ear epithelial gap junction network causes hearing impairment and cell death. Curr Biol 12:1106-1111.Cohen-Salmon M, Ott T, Michel V, Hardelin JP, Perfettini I, Eybalin M, Wu T, Marcus DC, Wangemann P, Willecke K, Petit C (2002) Targeted ablation of connexin26 in the inner ear epithelial gap junction network causes hearing impairment and cell death. Curr Biol 12:1106-1111.
Crispino G, Di Pasquale G, Scimemi P, Rodriguez L, Galindo Ramirez F, De Siati RD, Santarelli RM, Arslan E, Bortolozzi M, Chiorini JA, Mammano F (2011) BAAV mediated GJB2 gene transfer restores gap junction coupling in cochlear organotypic cultures from deaf Cx26Sox10Cre mice. PloS one 6:e23279.Crispino G, Di Pasquale G, Scimemi P, Rodriguez L, Galindo Ramirez F, De Siati RD, Santarelli RM, Arslan E, Bortolozzi M, Chiorini JA, Mammano F (2011) BAAV mediated GJB2 gene transfer restores gap junction coupling in cochlear organotypic cultures from deaf Cx26Sox10Cre mice. PloS one 6:e23279.
Deverman BE, Pravdo PL, Simpson BP, Kumar SR, Chan KY, Banerjee A, Wu WL, Yang B, Huber N, Pasca SP, Gradinaru V (2016) Cre-dependent selection yields AAV variants for widespread gene transfer to the adult brain. Nat Biotechnol 34:204-209.Deverman BE, Pravdo PL, Simpson BP, Kumar SR, Chan KY, Banerjee A, Wu WL, Yang B, Huber N, Pasca SP, Gradinaru V (2016) Cre-dependent selection yields AAV variants for widespread gene transfer to the adult brain . Nat Biotechnol 34:204-209.
Feigenspan A, Janssen-Bienhold U, Hormuzdi S, Monyer H, Degen J, Sohl G, Willecke K, Ammermuller J, Weiler R (2004) Expression of connexin36 in cone pedicles and OFF-cone bipolar cells of the mouse retina. J Neurosci 24:3325-3334.Feigenspan A, Janssen-Bienhold U, Hormuzdi S, Monyer H, Degen J, Sohl G, Willecke K, Ammermuller J, Weiler R (2004) Expression of connexin36 in cone pedicles and OFF-cone bipolar cells of the mouse retina. J Neurosci 24:3325-3334.
Forge A, Becker D, Casalotti S, Edwards J, Marziano N, Nevill G (2003) Gap junctions in the inner ear: comparison of distribution patterns in different vertebrates and assessment of connexin composition in mammals. J Comp Neurol 467:207-231.Forge A, Becker D, Casalotti S, Edwards J, Marziano N, Nevill G (2003) Gap junctions in the inner ear: comparison of distribution patterns in different vertebrates and assessment of connexin composition in mammals. J Comp Neurol 467:207-231.
Gyorgy B, Sage C, Indzhykulian AA, Scheffer DI, Brisson AR, Tan S, Wu X, Volak A, Mu D, Tamvakologos PI, Li Y, Fitzpatrick Z, Ericsson M, Breakefield XO, Corey DP, Maguire CA (2017) Rescue of hearing by gene delivery to inner-ear hair cells using exosome-associated AAV. Mol Ther 25:379-391.Gyorgy B, Sage C, Indzhykulian AA, Scheffer DI, Brisson AR, Tan S, Wu X, Volak A, Mu D, Tamvakologos PI, Li Y, Fitzpatrick Z, Ericsson M, Breakefield XO, Corey DP, Maguire CA (2017) Rescue of hearing by gene delivery to inner-ear hair cells using exosome-associated AAV. Mol Ther 25:379-391.
Gyorgy B, Meijer EJ, Ivanchenko MV, Tenneson K, Emond F, Hanlon KS, Indzhykulian AA, Volak A, Karavitaki KD, Tamvakologos PI, Vezina M, Berezovskii VK, Born RT, O'Brien M, Lafond JF, Arsenijevic Y, Kenna MA, Maguire CA, Corey DP (2018) Gene Transfer with AAV9-PHP.B Rescues Hearing in a Mouse Model of Usher Syndrome 3A and Transduces Hair Cells in a Non-human Primate. Mol Ther Methods Clin Dev 13:1-13.Gyorgy B, Meijer EJ, Ivanchenko MV, Tenneson K, Emond F, Hanlon KS, Indzhykulian AA, Volak A, Karavitaki KD, Tamvakologos PI, Vezina M, Berezovskii VK, Born RT, O'Brien M, Lafond JF, Arsenijevic Y, Kenna MA, Maguire CA, Corey DP (2018) Gene Transfer with AAV9-PHP.B Rescues Hearing in a Mouse Model of Usher Syndrome 3A and Transduces Hair Cells in a Non-human Primate. Mol Ther Methods Clin Dev 13:1-13.
Iizuka T, Kamiya K, Gotoh S, Sugitani Y, Suzuki M, Noda T, Minowa O, Ikeda K (2015) Perinatal Gjb2 gene transfer rescues hearing in a mouse model of hereditary deafness. Hum Mol Genet 24:3651-3661.Iizuka T, Kamiya K, Gotoh S, Sugitani Y, Suzuki M, Noda T, Minowa O, Ikeda K (2015) Perinatal Gjb2 gene transfer rescues hearing in a mouse model of hereditary deafness. Hum Mol Genet 24:3651-3661.
Kelsell DP, Dunlop J, Stevens HP, Lench NJ, Liang JN, Parry G, Mueller RF, Leigh IM (1997) Connexin 26 mutations in hereditary non-syndromic sensorineural deafness. Nature 387:80-83.Kelsell DP, Dunlop J, Stevens HP, Lench NJ, Liang JN, Parry G, Mueller RF, Leigh IM (1997) Connexin 26 mutations in hereditary non-syndromic sensorineural deafness. Nature 387:80-83.
Kenna MA, Feldman HA, Neault MW, Frangulov A, Wu BL, Fligor B, Rehm HL (2010) Audiologic phenotype and progression in GJB2 (Connexin 26) hearing loss. Arch Otolaryngol Head Neck Surg 136:81-87.Kenna MA, Feldman HA, Neault MW, Frangulov A, Wu BL, Fligor B, Rehm HL (2010) Audiologic phenotype and progression in GJB2 (Connexin 26) hearing loss. Arch Otolaryngol Head Neck Surg 136:81-87.
Kikuchi T, Kimura RS, Paul DL, Adams JC (1995) Gap junctions in the rat cochlea: immunohistochemical and ultrastructural analysis. Anat Embryol (Berl) 191:101-118.Kikuchi T, Kimura RS, Paul DL, Adams JC (1995) Gap junctions in the rat cochlea: immunohistochemical and ultrastructural analysis. Anat Embryol (Berl) 191:101-118.
Li W, Wu J, Yang J, Sun S, Chai R, Chen ZY, Li H (2015) Notch inhibition induces mitotically generated hair cells in mammalian cochleae via activating the Wnt pathway. Proceedings of the National Academy of Sciences of the United States of America 112:166-171.Li W, Wu J, Yang J, Sun S, Chai R, Chen ZY, Li H (2015) Notch inhibition induces mitotically generated hair cells in mammalian cochleae via activating the Wnt pathway. Proceedings of the National Academy of Sciences of the United States of America 112:166-171.
Lin FR, Niparko JK, Ferrucci L (2011) Hearing loss prevalence in the United States. Arch Intern Med 171:1851- 1852.Lin FR, Niparko JK, Ferrucci L (2011) Hearing loss prevalence in the United States. Arch Intern Med 171:1851-1852.
Mason JA, Herrmann KR (1998) Universal infant hearing screening by automated auditory brainstem response measurement. Pediatrics 101:221-228.Mason JA, Herrmann KR (1998) Universal infant hearing screening by automated auditory brainstem response measurement. Pediatrics 101:221-228.
Shu Y, Tao Y, Wang Z, Tang Y, Li H, Dai P, Gao G, Chen ZY (2016) Identification of Adeno-Associated Viral Vectors That Target Neonatal and Adult Mammalian Inner Ear Cell Subtypes. Hum Gene Ther 27:687-699.Shu Y, Tao Y, Wang Z, Tang Y, Li H, Dai P, Gao G, Chen ZY (2016) Identification of Adeno-Associated Viral Vectors That Target Neonatal and Adult Mammalian Inner Ear Cell Subtypes. Hum Gene Ther 27:687-699.
Sun Y, Tang W, Chang Q, Wang Y, Kong W, Lin X (2009) Connexin30 null and conditional connexin26 null mice display distinct pattern and time course of cellular degeneration in the cochlea. J Comp Neurol 516:569-579.Sun Y, Tang W, Chang Q, Wang Y, Kong W, Lin X (2009) Connexin30 null and conditional connexin26 null mice display distinct patterns and time course of cellular degeneration in the cochlea. J Comp Neurol 516:569-579.
Takada Y, Beyer LA, Swiderski DL, O'Neal AL, Prieskorn DM, Shivatzki S, Avraham KB, Raphael Y (2014) Connexin 26 null mice exhibit spiral ganglion degeneration that can be blocked by BDNF gene therapy. Hearing research 309:124-135.Takada Y, Beyer LA, Swiderski DL, O'Neal AL, Prieskorn DM, Shivatzki S, Avraham KB, Raphael Y (2014) Connexin 26 null mice exhibit spiral ganglion degeneration that can be blocked by BDNF gene therapy. Hearing research 309:124-135.
Wang Y, Chang Q, Tang W, Sun Y, Zhou B, Li H, Lin X (2009) Targeted connexin26 ablation arrests postnatal development of the organ of Corti. Biochem Biophys Res Commun 385:33-37.Wang Y, Chang Q, Tang W, Sun Y, Zhou B, Li H, Lin X (2009) Targeted connexin26 ablation arrests postnatal development of the organ of Corti. Biochem Biophys Res Commun 385:33-37.
Watanabe K, Takeda K, Katori Y, Ikeda K, Oshima T, Yasumoto K, Saito H, Takasaka T, Shibahara S (2000) Expression of the Sox10 gene during mouse inner ear development. Brain Res Mol Brain Res 84:141-145.Watanabe K, Takeda K, Katori Y, Ikeda K, Oshima T, Yasumoto K, Saito H, Takasaka T, Shibahara S (2000) Expression of the Sox10 gene during mouse inner ear development. Brain Res Mol Brain Res 84:141-145.
Wise AK, Tu T, Atkinson PJ, Flynn BO, Sgro BE, Hume C, O'Leary SJ, Shepherd RK, Richardson RT (2011) The effect of deafness duration on neurotrophin gene therapy for spiral ganglion neuron protection. Hearing research 278:69-76.Wise AK, Tu T, Atkinson PJ, Flynn BO, Sgro BE, Hume C, O'Leary SJ, Shepherd RK, Richardson RT (2011) The effect of deafness duration on neurotrophin gene therapy for spiral ganglion neuron protection. Hearing research 278:69-76.
Yu Q, Wang Y, Chang Q, Wang J, Gong S, Li H, Lin X (2014) Virally expressed connexin26 restores gap junction function in the cochlea of conditional Gjb2 knockout mice. Gene Ther 21:71-80.Yu Q, Wang Y, Chang Q, Wang J, Gong S, Li H, Lin X (2014) Virally expressed connexin26 restores gap junction function in the cochlea of conditional Gjb2 knockout mice. Gene Ther 21:71-80.
Zelante L, Gasparini P, Estivill X, Melchionda S, D'Agruma L, Govea N, Mila M, Monica MD, Lutfi J, Shohat M, Mansfield E, Delgrosso K, Rappaport E, Surrey S, Fortina P (1997) Connexin26 mutations associated with the most common form of non-syndromic neurosensory autosomal recessive deafness (DFNB1) in Mediterraneans. Hum Mol Genet 6:1605-1609.Zelante L, Gasparini P, Estivill X, Melchionda S, D'Agruma L, Govea N, Mila M, Monica MD, Lutfi J, Shohat M, Mansfield E, Delgrosso K, Rappaport E, Surrey S, Fortina P (1997) Connexin26 Mutations associated with the most common form of non-syndromic neurosensory autosomal recessive deafness (DFNB1) in Mediterraneans. Hum Mol Genet 6:1605-1609.
다른 실시양태another embodiment
본 명세서에 개시된 모든 특색은 임의의 조합으로 조합될 수 있다. 본 명세서에 개시된 각각의 특색은 동일하거나, 동등하거나 또는 유사한 목적을 제공하는 대안적 특색으로 대체될 수 있다. 따라서, 달리 명백하게 언급되지 않는 한, 개시된 각각의 특색은 단지 일반적인 일련의 동등하거나 유사한 특색의 예이다.All features disclosed herein may be combined in any combination. Each feature disclosed in this specification may be replaced by an alternative feature serving the same, equivalent, or similar purpose. Thus, unless expressly stated otherwise, each feature disclosed is merely an example of a general series of equivalent or similar features.
상기 설명으로부터, 관련 기술분야의 통상의 기술자는 본 발명의 본질적인 특징을 용이하게 확인할 수 있고, 그의 취지 및 범주로부터 벗어나지 않으면서, 본 발명의 다양한 변화 및 변형을 만들어 이를 다양한 용법 및 조건에 적합화시킬 수 있다. 따라서, 다른 실시양태가 또한 청구범위 내에 있다.From the above description, those skilled in the art can readily ascertain the essential features of the present invention, and without departing from its spirit and scope, make various changes and modifications of the present invention to adapt it to various uses and conditions. can make it Accordingly, other embodiments are also within the scope of the claims.
등가물equivalent
본 발명의 몇 가지 실시양태가 본원에 기재되고 예시되었지만, 관련 기술분야의 통상의 기술자는 본원에 기재된 기능을 수행하고/거나 결과 및/또는 한 가지 이상의 이점을 수득하기 위한 각종 다른 수단 및/또는 구조를 용이하게 구상할 것이고, 각각의 이러한 변경 및/또는 변형은 본원에 기재된 본 발명의 실시양태의 범주 내에 있는 것으로 간주된다. 보다 일반적으로, 관련 기술분야의 통상의 기술자는 본원에 기재된 모든 파라미터, 치수, 물질 및 구성이 예시적인 것으로 의도되고, 실제 파라미터, 치수, 물질 및/또는 구성은 본 발명의 교시가 사용되는 구체적 적용 또는 적용들에 따라 좌우될 것임을 용이하게 인지할 것이다. 관련 기술분야의 통상의 기술자는 상용 실험만을 사용하여 본원에 기재된 구체적인 본 발명의 실시양태에 대한 다수의 등가물을 인식하거나 또는 확인할 수 있을 것이다. 따라서, 상기 실시양태는 단지 예로서 제시되고, 첨부된 청구범위 및 그에 대한 등가물의 범주 내에서, 본 발명의 실시양태는 구체적으로 기재되고 청구된 것과 달리 실시될 수 있는 것으로 이해되어야 한다. 본 개시내용의 본 발명의 실시양태는 본원에 기재된 각각의 개별 특색, 시스템, 물품, 물질, 키트 및/또는 방법에 관한 것이다. 또한, 2개 이상의 이러한 특색, 시스템, 물품, 물질, 키트 및/또는 방법의 임의의 조합은, 이러한 특색, 시스템, 물품, 물질, 키트 및/또는 방법이 상호 모순되지 않는 경우에, 본 개시내용의 본 발명의 범주 내에 포함된다.Although several embodiments of the present invention have been described and illustrated herein, those skilled in the art will recognize various other means and/or methods for performing the functions described herein and/or obtaining results and/or one or more advantages. Structures will be readily envisioned, and each such alteration and/or variation is considered to be within the scope of the embodiments of the invention described herein. More generally, those skilled in the relevant art will understand that all parameters, dimensions, materials and configurations described herein are intended to be illustrative, and that actual parameters, dimensions, materials and/or configurations are used for specific applications in which the teachings of the present invention are used. Or it will be readily appreciated that it will depend on the applications. Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific inventive embodiments described herein. It is therefore to be understood that the foregoing embodiments are presented by way of example only, and that within the scope of the appended claims and equivalents thereto, embodiments of the present invention may be practiced otherwise than as specifically described and claimed. Inventive embodiments of the present disclosure relate to each individual feature, system, article, material, kit, and/or method described herein. In addition, any combination of two or more of these features, systems, articles, materials, kits, and/or methods may be incorporated into the present disclosure, provided that such features, systems, articles, materials, kits, and/or methods do not contradict each other. are included within the scope of the present invention.
본원에 정의되고 사용된 모든 정의는 사전적 정의, 참조로 포함된 문헌에서의 정의, 및/또는 정의된 용어의 통상의 의미보다 우선하는 것으로 이해되어야 한다.All definitions defined and used herein are to be construed as taking precedence over dictionary definitions, definitions in documents incorporated by reference, and/or ordinary meanings of the defined terms.
본원에 개시된 모든 참고문헌, 특허 및 특허 출원은 각각이 인용된 대상과 관련하여 참조로 포함되며, 일부 경우에 문헌의 전체를 포괄할 수 있다.All references, patents and patent applications disclosed herein are each incorporated by reference with respect to the subject matter cited and may, in some cases, encompass the entirety of the document.
명세서 및 청구범위에서 본원에 사용된 단수형은, 달리 명백하게 나타내지 않는 한, "적어도 하나"를 의미하는 것으로 이해되어야 한다.As used herein in the specification and claims, the singular forms "a" and "an" are to be understood to mean "at least one" unless the context clearly dictates otherwise.
본원에서 본 명세서 및 청구범위에서 사용된 어구 "및/또는"은 이와 같이 결합된 요소, 즉 일부 경우에는 결합하여 존재하고 다른 경우에는 분리되어 존재하는 요소 중 "어느 하나 또는 둘 다"를 의미하는 것으로 이해되어야 한다. "및/또는"을 사용하여 열거된 다수의 요소들은 동일한 방식으로, 즉 그렇게 결합된 요소 중 "하나 이상"으로 해석되어야 한다. 구체적으로 확인된 요소와 관련되든 관련되지 않든, "및/또는" 절에 의해 구체적으로 확인된 요소 이외의 다른 요소가 임의로 존재할 수 있다. 따라서, 비제한적 예로서, "A 및/또는 B"에 대한 언급은, "포함하는"과 같은 개방형 언어와 함께 사용되는 경우에, 한 실시양태에서, A 단독 (임의로 B 이외의 요소를 포함함); 또 다른 실시양태에서, B 단독 (임의로 A 이외의 요소를 포함함); 또 다른 실시양태에서, A 및 B 둘 다 (임의로 다른 요소를 포함함) 등을 지칭할 수 있다.As used herein in the specification and claims, the phrase “and/or” refers to “either or both” of the elements so combined, i.e., present in conjunction in some cases and separate in other cases. should be understood as Multiple elements listed with "and/or" should be construed in the same manner, i.e., as "one or more" of the elements so conjoined. Other elements may optionally be present other than the elements specifically identified by the "and/or" clause, whether related or unrelated to the elements specifically identified. Thus, as a non-limiting example, a reference to “A and/or B,” when used in conjunction with open-ended language such as “comprising,” can, in one embodiment, refer to A alone (optionally including elements other than B). ); In another embodiment, B alone (optionally with elements other than A); in another embodiment, to both A and B (optionally including other elements), and the like.
명세서 및 청구범위에서 본원에 사용된 "또는"은 상기 정의된 바와 같은 "및/또는"과 동일한 의미를 갖는 것으로 이해되어야 한다. 예를 들어, 목록에서 항목을 분리할 때, "또는" 또는 "및/또는"은 포괄적인 것으로, 즉 다수의 요소 또는 요소 목록 중 적어도 하나를 포함할 뿐만 아니라 하나 초과를 포함하며, 임의로 추가의 열거되지 않은 항목을 포함하는 것으로 해석되어야 한다. 이와 반대로 명확히 나타낸 용어, 예컨대 "~ 중 오직 하나" 또는 "~ 중 정확히 하나", 또는 청구범위에서 사용될 때, "~로 이루어진" 만이 다수의 요소 또는 요소 목록 중 정확히 하나의 요소를 포함하는 것을 지칭할 것이다. 일반적으로, 본원에 사용된 용어 "또는"은 "어느 하나", "중 하나", "중 단지 하나" 또는 "중 정확히 하나"와 같은 배타성의 용어가 선행될 때 배타적 대안 (즉, "하나 또는 다른 하나이지만 둘 다는 아님")을 나타내는 것으로만 해석될 것이다. "본질적으로 이루어진"은 청구범위에 사용될 때 특허법의 분야에서 사용되는 바와 같은 그의 통상적인 의미를 가질 것이다.As used herein in the specification and claims, "or" should be understood to have the same meaning as "and/or" as defined above. For example, when separating items in a list, “or” or “and/or” is inclusive, i.e. includes at least one of a number of elements or lists of elements as well as more than one, optionally with additional It should be construed as including items not listed. To the contrary, only explicitly stated terms such as "only one of" or "exactly one of" or, when used in the claims, "consisting of" refer to the inclusion of exactly one element of a plurality of elements or a list of elements. something to do. In general, as used herein, the term "or" refers to an exclusive alternative (i.e., "one or one but not both"). “Consisting essentially of” when used in the claims shall have its ordinary meaning as used in the field of patent law.
하나 이상의 요소의 목록과 관련하여 명세서 및 청구범위에서 본원에 사용된 어구 "적어도 하나"는 요소의 목록에서 요소 중 임의의 하나 이상으로부터 선택된 적어도 하나의 요소를 의미하지만, 요소의 목록 내에 구체적으로 열거된 각각의 및 모든 요소 중 적어도 하나를 반드시 포함하는 것은 아니며, 요소의 목록에서 요소의 임의의 조합을 배제하는 것은 아닌 것으로 이해되어야 한다. 이러한 정의는 또한, 구체적으로 확인된 요소와 관련되든 관련되지 않든, 어구 "적어도 하나"가 지칭하는 요소의 목록 내에서 구체적으로 확인된 요소 이외의 요소가 임의로 존재할 수 있음을 허용한다. 따라서, 비제한적 예로서, "A 및 B 중 적어도 하나" (또는 동등하게, "A 또는 B 중 적어도 하나", 또는 동등하게 "A 및/또는 B 중 적어도 하나")는, 한 실시양태에서, B가 존재하지 않는 (및 임의로 B이외의 요소를 포함함) 임의로 하나 초과를 포함하는, 적어도 하나의 A; 또 다른 실시양태에서, A가 존재하지 않는 (및 임의로 A이외의 요소를 포함함) 임의로 하나 초과를 포함하는, 적어도 하나의 B; 또 다른 실시양태에서, 임의로 하나 초과를 포함하는 적어도 하나의 A, 및 임의로 하나 초과를 포함하는 적어도 하나의 B (및 임의로 다른 요소를 포함함) 등을 지칭할 수 있다.The phrase “at least one” as used herein in the specification and claims with reference to a list of one or more elements means at least one element selected from any one or more of the elements in the list of elements, but specifically recited within the list of elements. It should be understood that it does not necessarily include at least one of each and every element listed, and does not exclude any combination of elements from the list of elements. This definition also permits that there may optionally be elements other than the elements specifically identified within the list of elements to which the phrase “at least one” refers, whether related or unrelated to the elements specifically identified. Thus, as a non-limiting example, "at least one of A and B" (or equivalently, "at least one of A or B", or equivalently "at least one of A and/or B") means, in one embodiment, at least one A, optionally including more than one, in which no B is present (and optionally including elements other than B); in another embodiment, at least one B, optionally including more than one, in which A is absent (and optionally includes elements other than A); in another embodiment, at least one A, optionally including more than one, and at least one B, optionally including more than one (and optionally including other elements), and the like.
또한, 달리 명백하게 나타내지 않는 한, 1개 초과의 단계 또는 작용을 포함하는 본원에 청구된 임의의 방법에서, 방법의 단계 또는 작용의 순서는 반드시 방법의 단계 또는 작용이 언급된 순서로 제한되지는 않는 것으로 이해되어야 한다.Also, unless expressly indicated otherwise, in any method claimed herein that includes more than one step or action, the order of the steps or actions of the method is not necessarily limited to the order in which the steps or actions of the method are recited. should be understood as
SEQUENCE LISTING
<110> President and Fellows of Harvard College
<120> RECOMBINANT ADENO ASSOCIATED VIRUS (RAAV) ENCODING GJB2 AND USES
THEREOF
<130> H0824.70367WO00
<140> Not Yet Assigned
<141> 2021-09-14
<150> US 63/078,233
<151> 2020-09-14
<150> US 63/161,619
<151> 2021-03-16
<160> 111
<170> PatentIn version 3.5
<210> 1
<211> 225
<212> PRT
<213> Homo sapiens
<400> 1
Met Asp Trp Gly Thr Leu Gln Thr Ile Leu Gly Gly Val Asn Lys His
1 5 10 15
Ser Thr Ser Ile Gly Lys Ile Trp Leu Thr Val Leu Phe Ile Phe Arg
20 25 30
Ile Met Ile Leu Val Val Ala Ala Lys Glu Val Trp Gly Asp Glu Gln
35 40 45
Ala Asp Phe Val Cys Asn Thr Leu Gln Pro Gly Cys Lys Asn Val Cys
50 55 60
Tyr Asp His Tyr Phe Pro Ile Ser His Ile Arg Leu Trp Ala Leu Gln
65 70 75 80
Leu Ile Phe Val Ser Thr Pro Ala Leu Leu Val Ala Met His Val Ala
85 90 95
Tyr Arg Arg His Glu Lys Arg Lys Phe Ile Lys Gly Glu Ile Lys Ser
100 105 110
Glu Phe Lys Asp Ile Glu Glu Ile Lys Thr Gln Lys Val Arg Ile Glu
115 120 125
Gly Ser Leu Trp Trp Thr Tyr Thr Ser Ser Ile Phe Phe Arg Val Ile
130 135 140
Phe Glu Ala Ala Phe Met Tyr Val Phe Tyr Val Met Tyr Asp Gly Phe
145 150 155 160
Ser Met Gln Arg Leu Val Lys Cys Asn Ala Trp Pro Cys Pro Asn Thr
165 170 175
Val Asp Cys Phe Val Ser Arg Pro Thr Glu Lys Thr Val Phe Thr Val
180 185 190
Phe Met Ile Ala Val Ser Gly Ile Cys Ile Leu Leu Asn Val Thr Glu
195 200 205
Leu Cys Tyr Leu Leu Ile Arg Tyr Cys Ser Gly Lys Ser Lys Lys Pro
210 215 220
Val
225
<210> 2
<211> 678
<212> DNA
<213> Homo sapiens
<400> 2
atggattggg gcacgctgca gacgatcctg gggggtgtga acaaacactc caccagcatt 60
ggaaagatct ggctcaccgt cctcttcatt tttcgcatta tgatcctcgt tgtggctgca 120
aaggaggtgt ggggagatga gcaggccgac tttgtctgca acaccctgca gccaggctgc 180
aagaacgtgt gctacgatca ctacttcccc atctcccaca tccggctatg ggccctgcag 240
ctgatcttcg tgtccacgcc agcgctccta gtggccatgc acgtggccta ccggagacat 300
gagaagaaga ggaagttcat caagggggag ataaagagtg aatttaagga catcgaggag 360
atcaaaaccc agaaggtccg catcgaaggc tccctgtggt ggacctacac aagcagcatc 420
ttcttccggg tcatcttcga agccgccttc atgtacgtct tctatgtcat gtacgacggc 480
ttctccatgc agcggctggt gaagtgcaac gcctggcctt gtcccaacac tgtggactgc 540
tttgtgtccc ggcccacgga gaagactgtc ttcacagtgt tcatgattgc agtgtctgga 600
atttgcatcc tgctgaatgt cactgaattg tgttatttgc taattagata ttgttctggg 660
aagtcaaaaa agccagtt 678
<210> 3
<211> 226
<212> PRT
<213> Mus musculus
<400> 3
Met Asp Trp Gly Thr Leu Gln Ser Ile Leu Gly Gly Val Asn Lys His
1 5 10 15
Ser Thr Ser Ile Gly Lys Ile Trp Leu Thr Val Leu Phe Ile Phe Arg
20 25 30
Ile Met Ile Leu Val Val Ala Ala Lys Glu Val Trp Gly Asp Glu Gln
35 40 45
Ala Asp Phe Val Cys Asn Thr Leu Gln Pro Gly Cys Lys Asn Val Cys
50 55 60
Tyr Asp His His Phe Pro Ile Ser His Ile Arg Leu Trp Ala Leu Gln
65 70 75 80
Leu Ile Met Val Ser Thr Pro Ala Leu Leu Val Ala Met His Val Ala
85 90 95
Tyr Arg Arg His Glu Lys Lys Arg Lys Phe Met Lys Gly Glu Ile Lys
100 105 110
Asn Glu Phe Lys Asp Ile Glu Glu Ile Lys Thr Gln Lys Val Arg Ile
115 120 125
Glu Gly Ser Leu Trp Trp Thr Tyr Thr Thr Ser Ile Phe Phe Arg Val
130 135 140
Ile Phe Glu Ala Val Phe Met Tyr Val Phe Tyr Ile Met Tyr Asn Gly
145 150 155 160
Phe Phe Met Gln Arg Leu Val Lys Cys Asn Ala Trp Pro Cys Pro Asn
165 170 175
Thr Val Asp Cys Phe Ile Ser Arg Pro Thr Glu Lys Thr Val Phe Thr
180 185 190
Val Phe Met Ile Ser Val Ser Gly Ile Cys Ile Leu Leu Asn Ile Thr
195 200 205
Glu Leu Cys Tyr Leu Phe Val Arg Tyr Cys Ser Gly Lys Ser Lys Arg
210 215 220
Pro Val
225
<210> 4
<211> 678
<212> DNA
<213> Mus musculus
<400> 4
atggattggg gcacactcca gagcatcctc gggggtgtca acaaacactc caccagcatt 60
ggaaagatct ggctcacggt cctcttcatc ttccgcatca tgatcctcgt ggtggctgca 120
aaggaggtgt ggggagatga gcaagccgat tttgtctgca acacgctcca gcctggctgc 180
aagaatgtat gctacgacca ccacttcccc atctctcaca tccggctctg ggctctgcag 240
ctgatcatgg tgtccacgcc agccctcctg gtagctatgc atgtggccta ccggagacat 300
gaaaagaaac ggaagttcat gaagggagag ataaagaacg agtttaagga catcgaagag 360
atcaaaaccc agaaggtccg tatcgaaggg tccctgtggt ggacctacac caccagcatc 420
ttcttccggg tcatctttga agccgtcttc atgtacgtct tttacatcat gtacaatggc 480
ttcttcatgc aacgtctggt gaaatgcaac gcttggccct gccccaatac agtggactgc 540
ttcatttcca ggcccacaga aaagactgtc ttcaccgtgt ttatgatttc tgtgtctgga 600
atttgcattc tgctaaatat cacagagctg tgctatttgt tcgttaggta ttgctcagga 660
aagtccaaaa gaccagtc 678
<210> 5
<211> 500
<212> DNA
<213> Homo sapiens
<400> 5
acctgtctcc cgccgtggcg ccttttaacc gcaccccaca ccccgcctct tccctcggag 60
actgggaaag ttacggaggg ggcggcgccg cgggcggagc gcgcccggcc tctgggtcct 120
cagagcttcc cgggtccgcg aacccccgac cgcccccgaa agccccgaac cccccaagtc 180
cccttcgagg tcccgatctc ctagttcctt tgagccccca tgagttcccc aagtgccccc 240
agcgccctga gtctcccccg gttaccccga gcgccgcctc ccccagcccc ttggcggccc 300
gggtgaagcg ggggcggctg agagtcggga ccccccagga agcggcgccc cagaccccgg 360
ctccggcgct gtgccgtggg cggggttcag ggatggctgt ggtcgttgtc ctctgtactc 420
cgcatagtgc gagaggactt ggcatttatg agcgcttctt taatttttta ttgttagaga 480
aacaggcatt cctccaagga 500
<210> 6
<211> 4843
<212> DNA
<213> Homo sapiens
<400> 6
ctttgtggat ggcttggtgg cctcactgtc aggctggcac tgatggctca gttagcatat 60
ctgttttgat aagtgctgca acagtgcatt ataattgtgg gctgtggttt taatttcaaa 120
gtgtttctta aaagacacat tattttaaaa tgacagaaaa ttcaactccc tcggttactg 180
gcccagctaa gcgacgtcac tgcattgcag ttcagcgctg aagcttggga gagtcccaca 240
ctccttactg caagcggatg tggagaggcc agtggataat ctcctgtgag cccatggcct 300
tcttttcatc ccaggatgtg aattgtcttc actgattcat agttacaccc tgcctgccac 360
aaccaacgct ctcctaaaca agattccacc ctctccacaa tccggatgaa tcatctcttt 420
tccacccttc agagctggta gtgaatcctc cttcttcttt ttcttaaaag catcctcctc 480
tcctcatttt aggcaagttg catcccgttt tctgatggac tccagaagca ggctcgtagt 540
gaatgtcttt catgacccac agtcgctgcc acggggcacc aaggtcaggc agaaaccatc 600
cagtgccacc ttggtcagag gctaacagga gagaggtggc cacgaaagtt acatcagatt 660
gacataggcc tgtgaaacat ttagcttcac tgagcttggg aaagacaaca tcattggaaa 720
aaacaatatt ttagcccagg ttcagcactg acccattgat aatccagact gggaggccct 780
taggtgagct ggttgtcctg ctacagcacc cacagctcag gccagtcccg tcccaacagc 840
agaaccaccg aggacagcaa cattccgatt ttaacaaaag catcttatgg aattagacat 900
tcttcattgg ccctcactga gtggaaaaca ggatactccc cgaagtaaac tctctcctgg 960
tttacaacaa tacacctggc caagaatatg gggctgcagg aggaggggtt tatcctttgc 1020
cctcttccac ctgccaaacc caggtcatac acccttctac agacctgtcc agttaccatc 1080
agctgagaaa aatacagttc cgagaaaccc tatattgtta ttttataaag cttgagttga 1140
agctacctgt tttaaagatc ctttttcagg aagaggagta aattaagatt tactccccaa 1200
tgggctaggg ggtcatgggt taagaggggc tcagaagcag gacgaagttg ttttcaatat 1260
tcaagtcaga ggaggagctg ccctcctggc ctcccgaccc tgggcggtta catgcagctt 1320
cctaccgggc ccacgccatc ctgcaccgcc tggagggctg ccagaggcca gcggaggagt 1380
tggttcagtt ccttagggaa gacactaggt gaatcaccag gatccagaaa aggcaaaagg 1440
gactcttcac cccttaaatt tctccaccct taggtgatgg gtggtcgacc ttgcctggct 1500
gtccccagag ggttcctcca cccttctcac cagtgtctga aattgtgacc gactgtgcac 1560
agcagtttcg aaagggactc taaggtcaca tggggacacg gccgtaccac gcttctcaag 1620
gcagtcccag gtgcatggcc acggaaccca gctctcagca gctgttagtt aggtgagcgc 1680
tgttcgggct gccttcctcc tccagtgggg caggatcgag gcactgatgg aaccgtcctg 1740
aggacgcggg tctcagccgc acaccacctc ttcgcgaaca agggtcctaa aaattttcct 1800
tctaggcggg gagcacagcc cggaaacaga ccctcgtgaa gtgtttagga aaaagggaag 1860
ccactgaaat cttggccccg gggtaggccg ggatcggctg gctccgcgtt agttctaggc 1920
aaactccgcc caaatctctg cccggggatt tttctgcaga agccgctcca agaggtaaag 1980
gtcagttcct gcagcgaagg cttcctgctt caccggcgaa acggagcttt gcttcgaagc 2040
taagctttcg gtgaatttaa aacgtttggt ggcagtgggt caagtagcca ggcggctgcg 2100
ctagagtacc ccgaagggac atcggcgaca ccacaaacct cgcgctggcg gctcgcccgc 2160
gcctttttcc cctcccgcgc gcgcccggcc ccactcgcac cccgggcggt gccatcgcgt 2220
ccacttcccc ggccgcccca ttccagctcc ggagctcggc cgcagaaacg cccgctccag 2280
aaggcggccc ccgccccccg gcccaaggac gtgtgttggt ccagcccccc ggttccccga 2340
gacccacgcg gccgggcaac cgctctgggt ctcgcggtcc ctccccgcgc caggttcctg 2400
gccgggcagt ccggggccgg cgggctcacc tgcgtcggga ggaagcgcgg cggggccggg 2460
gcgggggtct cggcgttggg gtctctgcgc tggggctcct gcgctcctag gcgggtcctg 2520
ggccgggcgc cgccgagggg ctccgagtcg gggagaggag cgcgcgggcg ctgcggggcc 2580
gcaacacctg tctcccgccg tggcgccttt taaccgcacc ccacaccccg cctcttccct 2640
cggagactgg gaaagttacg gagggggcgg cgccgcgggc ggagcgcgcc cggcctctgg 2700
gtcctcagag cttcccgggt ccgcgaaccc ccgaccgccc ccgaaagccc cgaacccccc 2760
aagtcccctt cgaggtcccg atctcctagt tcctttgagc ccccatgagt tccccaagtg 2820
cccccagcgc cctgagtctc ccccggttac cccgagcgcc gcctccccca gccccttggc 2880
ggcccgggtg aagcgggggc ggctgagagt cgggaccccc caggaagcgg cgccccagac 2940
cccggctccg gcgctgtgcc gtgggcgggg ttcagggatg gctgtggtcg ttgtcctctg 3000
tactccgcat agtgcgagag gacttggcat ttatgagcgc ttctttaatt ttttattgtt 3060
agagaaacag gcattcctcc aaggactgaa gatctgttcg agtcgcggag gctgcgcggg 3120
cccgcgaggc tctcgcaggg ggacctaggc tgggtggcgg ggcagtgccc tctggaatgg 3180
gggttaacgg tggccgagga gggggcgccg ctggtgccgg cgaagtcccc gcttctttct 3240
cccctcaaaa tctcaccaat ccgaacgaac gccttctcga atttccgatt ttattcaatt 3300
actttcaaca atgtgccaag gactaaggtt gggggcggtg ggagagacaa gcctcgtttt 3360
tgccatggcc ggcagggggg tcccgccatc tgcggagggt gccccccgcg gcccccggcc 3420
cagccaactt cctcctcttt tcgcaactgg ggaactgcaa ggaggtgact cctttcgggg 3480
tgaggaggcc cagacttttc agaaaggaaa gagggcaggt aaaacctgcc aagccccttc 3540
ctgctcgatg cacacagcac gaaaggggga aactgatagg attctgcgga agaccgctgg 3600
ggggctggct ctgcactgca cacctgctgg gggctttctg gataccgtga aactttgtct 3660
cagattatga ggtctcagta tttgcatttg gttggggatt ttgatgtctt gcgatacaaa 3720
tgacagaaga cagatttgca cagcgcaagc ggatgaggga ctaagatgtg cagagcaggc 3780
tgggtgggga ctcccgggga ggtctccccc aacccccgcc ccacctcggg cacccacttc 3840
gcgatttttg cagaggggag ccaggtcaga ggtgcagcct ggtcccctcg cgctcacgtt 3900
tttacccagg tcagttcgaa gttaagtgga aatgatgatt aatcctgaca agtcagatct 3960
ggcctcagaa tggatttccc gtgattgcca ccattattag cattgacttt tccttgaaaa 4020
attggcgccc cgtggccatg ggccgaccta ggcagtttct gcagggacga gcgtgagttt 4080
tgtaccgcgg ttaccaccta ctttccagct ccaggtctta gtctaagagg gagtgtctgc 4140
tcatgaagag gcaaagcccc aggagctgcg aaaagccttg catggcccat ctgagagatg 4200
tgctgagtcg gcttgttaaa aatgacaggc aaagcctgtg gggtggggca gctttcttgg 4260
cctgagcgca tcttggttga gccagaggtg acttggggtg gggagtgggg cgccggttgg 4320
tgggttctcc ctttaatttc tcaaaggctg tggtgtttat gagtctgttg gaatcctggt 4380
tgggttggaa tgaaggaagg ttctagaacc attgtgggaa gctcgctagt aaagatggtt 4440
tggagatcgg aagttgactg actttccccc attgaaaaat gtcacctgag attttagtgc 4500
ctgtatcacg attataggct caactttctt ttccttgttt tctttgattt agttctcctt 4560
atgtgcaaaa ttactgtgtg atgttggcta gtcgtattat cacagccact ccgtgttttc 4620
aggatttgta gctggaagtc ctatagcact taagtcttca cttacagatc agcgcttgct 4680
tttattctgt tttgtgtgat ttctgctgtt ttcctgtgag ttggtgtttt cttcccaagt 4740
aggctcagga ctcctctagg gcaggacatt atatgcatgt acatagtgtc ctccagtgta 4800
ggggaggaga aggaggagag gtgaggtggg aaaagggtga ggg 4843
<210> 7
<211> 5178
<212> DNA
<213> Mus musculus
<400> 7
ccaaaaaggg acaaaaacag acaaacaaac aacaccaaca caaacaacaa cagcactaaa 60
acgagtctct gcacctaggt cttcgcacgc aggctggtag tcccaccctc aggtagggcc 120
tgtttggtta acgatccgtg tctgttttga tatgtgttgc aagtgagtgt tgcactgtgg 180
actatggttt taaccttgaa gtgattctaa aataaatata tgatgaaaaa tgacggaaaa 240
ttagctcagc ggttcaccag ttgctggtcc aaggagccac ctgatggggg ttttgccttg 300
ggtggcatca cagtgtatcc tgtctgagtg acacagtgtc tatatatggc ctgtgcccta 360
gatgagcctc cataagccaa tgaccttcta tttcatccca gggcaggaac cttccatggc 420
tacacctggt ctgtcacaat caacccctct tttgattaat cccatcttcc cggctgtcct 480
gactcacttg cttccacccc ttccttccaa gctgtaaaga atcctctgac tctttcttaa 540
aagcacccta ccctcctgct tagcaagtta catcctgttt cgcagtggac tcacagcagg 600
cgcagagaga agtccctcct tgtccctagt ggcggtggca gagcaccagg gaacccactt 660
gctggaaccc actcagctct gccttggaca gaggagatag ggccaggggc atgggaatta 720
aggaatactg acatacaccg gtaaaacatc aagtcctatc caacttggaa agcagaaaca 780
gacaggctcg gcaggttcag ccctgaccca tttataccta gactgtcaga ggccctttgg 840
gaagctggtt gtcctctgaa cagtctctca gctccatgtg gtctgccccc aacagcagaa 900
ggattgaaaa gcaacagtgt tccaagttta acaaaacaat ctgattggaa ttagaccttc 960
tgttcttcct tccccttctc ccgagtggag atcaggacat tgaaataaac atctacacac 1020
ctgacccaaa atacagagct ggaggatccc tttgcctgcc tatagcatcc acagactagc 1080
ccaattatta tcaacacaga aaaaaaaaaa aaccctcaat ttctgcgtaa actgtgcact 1140
tgtttataaa agtacttaag tgtttgttga atttgagttt accgtgttac ccaggatggc 1200
ttctaaatcc atgcagttgg agttagcaca acatgggggt gggggtaggg ggttaataca 1260
tctataatag cagaactctg gaggctgagg taggaggagt gtgctaactt gaggaaaact 1320
tttctgcaga gcaagaccct ggctcaagaa aacaaacacc aaaagagaca agaaaagaaa 1380
agaacagaac caaaacaaaa acaaacaaac aaacaaacaa aaaaccaaaa aatgggaagg 1440
ccggattgaa caaacaaggt caagaagaga gagagagaga gagagagaga gagagagaga 1500
gagagagaga gaaaactcca aaagaaaacc aaatagctgg gacatagctg tgggtcccgg 1560
catatctgat tgcagctgct tgtcttaaat ggcctttcta agtggaagga gaggttaaaa 1620
tttgacctca caaaggggtt aggagtacta agccagcagg tgaaatcgtc aatattcaac 1680
tgtggtgtag gaggtgattt ccaggctggc cttaggacta ggtcacacgc aggtccctac 1740
ctggcatggg acacctggag attgccttga accggtgaat cattcgctcc tgagtagaag 1800
ggagcttctc catgtttata gtatatactg catatgaccc ttatttgcct taaaggatac 1860
ttcggggagc tggtggactg cctctagatg ctgaccccac cgcaccctcc acccttctca 1920
taattcactg gctttgccca tagttcccaa aggactccgg ggtaagtgta gccatgactg 1980
agccaggctt ctcaggacaa tcccgtggac ctgagcaatg ggtcccattt aggcctacgc 2040
tcccttccct tccattgagg cagcaccaag gggctgatgc aattgtccta agggacaagt 2100
ttctcagcag cacgccatct gtgaacctgt gccttccctt ccagctgtaa cgtcccgcct 2160
ggacgcaaat ccttaaaaag catttaagga aagaaaaaaa aaaaaagcaa tcaaaatctc 2220
cacccgagtg caggttgggg ttccccagct cgcgggagcg gctacggccg cgcgttttgg 2280
gcggtcgccc acgtcacccc agtgctttag gtggtaaagg tcagtgtctt cccacggagg 2340
cttcctgctt aacaaatgaa actgagtttt cctgctcagc tttcggttag ctaaaaactt 2400
ttcaatggcg gcagacaacg cagccaggag gcctcgggaa aattctagcg aaggaatact 2460
ggcgacacgt cgcagtcgtg cgcggaacag cctggccccc gcgtccctcc ccaccccgcg 2520
ctgtgcggga cctcccggct caggctgtgc gcggcggtga gagcagccgg ctccaacccc 2580
gagccgggcc agacgcctgc agccgaagaa acgcgttcac agctcgggtc cctatgcacg 2640
ggtggcggtg gcccgtaggg accgcgcagc gcgttccggc ctcggtttcc caggaccgtg 2700
gcggcccgca cccctcctcg cacctcacgc gtccctactg gctgagtctc gcgccccagc 2760
caccgtgggg cgttgcggtc gggggcgggt tacaccagtg tgactcggtg gcgcggattg 2820
gcggtcgcac ctgtgtccgg aggagcgtgc agcgttgggt ggcgggaagc ggcgaggcgc 2880
tgtccccggt aaggagcagg tctgaagcgg gtcccggggc cgctcctggg ttggtccgaa 2940
atgggtcgcc ggctgatcct gtgctggtcg ccgcgggtcc cggtggaggc tgcgctcagt 3000
ggactggagc gccgccgact ggctgcgagt tgggagagcg gagcgcgccg cgcgctgcga 3060
tcctggacac ctgttggccg cggcgccttt taaccaaagc cctcaccccg cctctctcac 3120
cctggagcga ttgagaaagt tgcggaggag gcggctccca gtagcccgcc acccccagcg 3180
ccacgggcgg ggctctccgg gcacccagag ccgtcagggc ccgccgagtc gcgagctctc 3240
ctggagccta ggtcactccc caccccactc cgccccaccc cacccccagc tctctttgag 3300
ctcaaggctc ttccagtgtc ctgtcccgag cgcagcctga acagagctgg tagacctgtg 3360
tcttcaccca ggacgcaggt cgcaaagctc caagtcccag ctactcgctt ttgggggatt 3420
gggtgatgtt gaaagagagt tgatgttgct cttactactc tcactagtgg aaagtgtgct 3480
gttatattcg aagcttcgct gtagtaatat tatatatact tgtgtgtgtg tgtgtgtgtg 3540
tgtgtgtgtg tgtgtgtgcg tgttagataa acggacggta cagttttgtg ttggcctgca 3600
gcttccagta gcgcacagga gactcctctc ccgtagtgca gtgagctgag gcatctagaa 3660
ttcgggttca aggcagacta acagagggcg ccgccagggc tggccaaatt ctggcttcta 3720
tttctttgaa ttcccgattt aattcgatca ctttgaacag ggtgccagtg gctaggacag 3780
aagaagatgt agaggtgcgt ctccagggct ggcctggaag tggacttgtc acagtctctg 3840
gagggttctc tgcctgtgcc cccgctctct gtgtcctctt ttccacaact gaaagcattg 3900
caaggaaggg gcacccagat tctgccggtg caggggatgc ggaagggggg ggggagcaga 3960
agaggttagg caagcccatc cctcttggag tccaggatgc tgggaagacc tgggcagcct 4020
gcatctacct ctctccgcca agctgttcgt gggttttgag ggctcggtgt tccacattgc 4080
ttggctgtct ggatagtttt gagaggagtt acggtggaca ttcacaagag ctagctacgc 4140
tttgggatac ctaggccagc tagcttcacc ttactacttg caacccgagt cctacagctg 4200
ccaggtttgg aatgaaaacg gcacatcccc acaaagttcc ttcagattag ctttacacgc 4260
agtgaagaga ctgattcatt ctgacaaggc ccgtctggtc gaaggattgg ctttcaatga 4320
aaggaccatg gctgaaggta catgctttcc ctgtaaagct ggcacattgc cgcgggcaga 4380
cctgactgct cttgcttggg cagaggaagg ttgcacgctc gcttgctact acccccacct 4440
cctttctaac tgtaagtctt agtctaagag ggagtgtctc taaggaagag agcctcggat 4500
ctgtgtccag cccttcagag agagagagat gtgctgaatc agcttgtgtg gaataactgg 4560
ccaagcaaga tggggtggta caactccctt ggcctgagca catctaaaga tgaatcaaag 4620
aggagatgag gtagtggcag caggcagggg tggaaggatg ttggcacctt tagcttctca 4680
tgggtcgtac agtttccagt caattggagc ccctgttcag tgaggatgac agaagcttct 4740
agaatcattg taggaagctg gccagtaaaa gataggttgg agatcagaac tgcttcactt 4800
tctccattga acaatttctc ctgagggtta gtgcccacgt tatgattaca gcttcagcgt 4860
ctagctccct aacttgcttc tacagattcg cctaatggct gtgtgttggc tgatggtcac 4920
aggtgctggg aatattagga tgtatcgcta gctcatctcc tcctctgttc cagccatccc 4980
tccttgtttc ttgttttctc accaactaga ccagaggctc ctctagggta agaaatgcta 5040
aatttatttg tgtatgtgta ttctccagag ggggagaggg gagagggaag gagaagggag 5100
gggaagagag gcaaggagaa gggagaaggg aggagaaggg aggacagggg gacagaggaa 5160
gctagaaaag agctagga 5178
<210> 8
<211> 4964
<212> DNA
<213> Homo sapiens
<400> 8
taatccagat gttaacactg aaacttccaa gcaggggagt gaaatgagac tttcactttt 60
gacttcgtat actcctgtat tatttaagtg aaaatgtatt tatatattct ataattacaa 120
aaatcacatt ggttgccttt tcattttgaa atgagcaaaa gtgacagggc tgttaaaaag 180
ctaagtcact tgagcaataa cgtgatgtcc agaacagtgg ttccatggct cagccatgtc 240
gggggctgca ctgaggacag ggggccatct gccttctagg aggacactgt ggactggaat 300
attgttcctg ccttgaggag gagtctccca gcacagttac tgctgcttga ctgtcagagc 360
atgcgttttc ttagggaagt tgaaggcagc ctgtatctag taaggtggta tgcagtagtt 420
gcttaatgct gaatgtgtga aggaatgtgg ggctgtggag caggaggata aagtctgaac 480
ttggacctgt tgttctcagc tattcgaagc tttctcaagt ggaaaataga ctgactttgg 540
gtccatcaga gggcagaaca aatgctggag agcagatgct agaattccgt cttaaaacca 600
tgaatcctta cagcggcctg cgtggcctgc gccatctgtc ccagccacgc cctccttggc 660
cccatctccc cctttctcgc cctgactctt tggcatcctg gcctttccgt ctcactggga 720
tgcttcccta agagactcgt gtggtttgct gccctgtatc ctccggatct cctgaccacc 780
ctatgttagt tacattgcaa tttcccgttt ccctcatgac gtcttatttt cctccattta 840
aattacctgc agcaggtacc acctacaggg atctgttgag agtcggcctc cttcaatgtg 900
aagcctgatg ttttgttctg ttcacagcta tgcccccagc ccctaacagt tggtggcagt 960
cagtaaatat tgcctgggaa aacgaatcat tagccatgtg cagaaatgga acagcgtctc 1020
accaagttgg ggttgcccct ggaccctgtg aacactgggg cagctggggt gttcctactg 1080
tgcttgttac cggcttcagg aatcaaatgc actagagaat tgtagaagtg cggtccacat 1140
cctctgtgtg gtaggaccag ctgctgttgg cctctgagca ggatctctta cctctctgag 1200
cagtgccttc ctgttgccct cagcaagaat aacactaaca gcctaggact tcagagcact 1260
gctgcgaggt gcaaatgagg tgatatggga aaagcatttg gtgagatgta tggaaagtgt 1320
agagaccctg accagatgag tcaatggcct tcttcgttac tctgttgacc tttctttaat 1380
tacagagtcg catagctgtc accaccttat ccttttttgc tgctatattt gcccccagcc 1440
attcctctcc cggcttatgt ggctagactc acctgcctgt gctgcagtta ctccaggctt 1500
tgtgtaaatg tgcatttttt tccagccccc agtttatcaa gctttgcttg agtcacttgt 1560
atctgaaata ccatctgtca ctcttccagg ttgggatctg tctagtggaa aacagatgac 1620
agtcatatgt tacttagtgc tttactatgt ggagaacgtt tacataaatt atcttatttc 1680
attgccacta agccggggaa agattcagga aacccatttt aagatgagga cactgaggtc 1740
agggtaagtg agtgagcttt tacccacctc tcagctgctc tctagttgtc aaagaccaac 1800
ccgtgggggt ggctcaggcc cgacccctgc agcatattcc ttggggcctc ccaagtgggc 1860
ccgatctgct caccccagct gtgactgtct tttgacagga ggagggagca gcgaggctgc 1920
acccactgct cataaaaagc agagcttgtc cacgccgagg gctcggctgg gtgggaggcc 1980
gcttccacaa ggctttttct tgctccatac aaagtgcaga ctgatgcttt gagatatagt 2040
caggattatc attttcagag ctcaagctct aatttccagg catgtgacca gacctctcta 2100
tccattccta caagtggtcg agagtagccc ataattattt tggcttggtc ttttaatagc 2160
ttgagagtaa taatctacat agcttgtaga agtgaatgta cttattttaa aagttctgtg 2220
ttttttgatg ttgttgttgt ttgggacagg atcttgctgt cgcctaggct ggagtgcagt 2280
ggcacaatct cagctcactg cagcatggac ctcccaggtt caagcaatct tcccacctca 2340
gcctcctgag tagctgagac tacaggcaca tgttaccacg cctgcctggc taacattttt 2400
attttttata gaaacaatgt ctccctatat tgcccaggct ggttttgaac tcctgggctc 2460
aagtgatcct ctcgtctcag cctcccaaag tgttgggatt ataggtataa gcctctgcac 2520
ccagcttaaa aaatcctatt ttcacagtct atgtgcagag cattttggaa gtcaggtaga 2580
aaccatttcc cattttctat tacctgggtg atagttgact ggtttttgtt ctttgaaatc 2640
cattttaaaa gtgtatggtc ctctatgaaa atacttctaa ttattgatgt gtgaaatgct 2700
ttgaaatcct tggatggaaa tcttgtacca tgaaagaaca gaactgttgg tggtgtctct 2760
gggagaggct cacgagggcc gggcaagcct gtgggggtag caggcagtca ctcccatggg 2820
gacaggctga cctggcaggc ttatttccca tggaagtggg cactgaggaa taaaaagcag 2880
tttcaggcca ggtgcggtgg cccatgcctg taatccttgc actttaggag actgaggcag 2940
ggggatccct tcagcccagg agttcgagac cagactgggc aatatagtgg gacctcgttt 3000
ctacaaaaaa tgaaaaaatt agtggagtgt ggtggcacac tccagtggtc ccagctactt 3060
gggacgctga ggtgggagga tcgcttgagc ctgggaggca gaggttgcag tgagccaagg 3120
tcatgctatg agtaacattt tgaaggtcca cttctgggat tcatccagga gctaaacggg 3180
tcatgtccag ccaactcagc attcaccaag gtacgtttcc agaccaaaca ccacattgtc 3240
catagactga tatgcctcaa aaacctggta gaggtgggca cggggttagg tagaaatcat 3300
cttcctccct tccttcccca ccaaactttc tggtgacaga agcttttctg taactggggc 3360
agaatggggt cagacactct ggcaacttac ccattggtgt tatgaaatat aaaacattaa 3420
tgtatttata taaaaagtga tagatgaaat taaaatttgc tgttctatta aaaccatatt 3480
agattttaaa ttattataga gattatattt taatgtttta aatgtatttg atacattaca 3540
aaattatttt agttacaagc atatcattaa agctattctt tattattaca aaatgctttt 3600
acaatgctat tcttgacaac aggaaaatac ttaccctcac tgaaatatgt ggagtaccat 3660
tttttggaaa ccatgtcaag cataatggca atattcaggt tcaatcttcc tatagatctg 3720
ctcaatattt atctaaacct tagcttctat tcttttcaca tgttattagc tatattttca 3780
cttaaaaaat tggaggctga aggggtaagc aaacaaactt ttgaagtaga caaagctcat 3840
ctttaatcaa cagactttag agtccagtct ttccaaatct gtttttaacg acagaaactt 3900
ctccctcccc tgccccattt tgtcctcccc attaaatggt actgtgtcaa taaaattccc 3960
aagcgacctc tttaaatcag cgttctttcc gatgctggct accacagtca tggaaaaggg 4020
agatgtgttg gacaggcctg tcattacagg tagtagttgg tggtacatcc agtctgtatt 4080
tcttacacaa aattacatct aaatatttga catgaggcca tttgctatca taagccatca 4140
ctaggaactt ctagtctgtc tcactcgatt gaggctacaa tgttgttagg tgctatgacc 4200
acaatgaata caacagacag cctctcagct gtgctgcaaa gtattcataa ccaaaagacc 4260
atatttcaaa ttaaatcata gtagcgaatg acataccatt tacatattac aatctgagcc 4320
tctgaaacag ggggaacata taatggtatc cagaacatct ttacatcaaa ataacctatc 4380
atactacaaa gttttcactt ccaaaaagtg taacagagtt taaggcactg gtaactttgt 4440
ccactgttag agattaaaac ttccaaagca aatgaaagaa ccaatgttca cctttaacgt 4500
ggggaaagtt ggcaaaaaga accccaggag gacacccaaa ccttctctgt gtcctctgtg 4560
gaacctggct tttttctctt gtcctcagag aaagaaacaa atgccgatat cctctgttta 4620
aaatatgaaa gtaccttaca ccaataaccc ctaacagcct ggggtctcag tggaactaac 4680
ttaagtgaaa gaaaattaag acaggcatag aattaggcct ttgttttgag gctttagggg 4740
agcagagctc cattgtggca tctggagttt cacctgaggc ctacaggggt ttcaaatggt 4800
tgcatttaag gtcagaatct ttgtgttggg aaatgctagc gactgagcct tgacagctga 4860
gcacgggttg cctcatccct ctcatgctgt ctatttctta atctaacaac tgggcaatgc 4920
gttaaactgg cttttttgac ttcccagaac aatatctaat tagc 4964
<210> 9
<211> 5166
<212> DNA
<213> Mus musculus
<400> 9
catggagaga gatggataac tgagatttct gggcaagaga tgaaatgggc tgaatcccac 60
tcctgactgc acacacctct cagtgattta attagaaata aaaacaagtc tctacattaa 120
catttacata agtaacatca gccgtctttt ccattcaaag tgactgaagg agatggtgtt 180
gttaaaagat tgaaattaga cagcagcaac acgtctagaa gagcatccct ggggcagggt 240
tctgcctcaa caccacacag cactacacag caccacactt agcacaaggc tcctcgtggc 300
tcctcatgtc ccttcagcaa gtcaccagtg caccaggagg cgttggggag ggaactcctg 360
accacaatca cagcctgagg gttggagttg tgtttcagtc atcctggggg gcagggggag 420
cttaaactcg ttggcattta ctagggcagt acacagcagc cgctccacgt tgaacgagtg 480
gatgatcagc ctgagaatca aggctgggct gagcttggct ctatcctcaa ttatctgcag 540
agcgccctgg tagagaacag atctgccttt gagtttccaa gtgagagcgg agcaaggctg 600
ggcacagagc agggtggcaa ggtggctgct gtgggcacag cacagaagat actcaggggc 660
atagatcttc ctggtggctg cttggtctca tgttggtcag gtcacctcca tttttggcct 720
catcatcttc tgacatgcac ctgcttcatg cgtctgcttc ctggaaccca ttcctggctt 780
tttgtcttaa ttctctgagg caggtggctc cattgcttgt ctcctttagg tttcatctaa 840
gagggaccgt cacacacagc ctgtgtgggc atcatgctgg tgcctgacag tcctctctct 900
ctctctctct ctctctctct ctctctctct ctcccccccc cctctgctgt ggctttggcc 960
tctgcagaaa caatctatgg gatttgttga tatgctgcct ccttcaacac aaaggcttaa 1020
gttgtattta tcagctccag tcccagggaa taatcatgtc tggtgcttag ctggtgctca 1080
gtagatagca gctgatgaaa aaaaatcagg agggatacgt aggaactgac cacaaaatct 1140
tgtgggggtg cagttacacc acggactcca gcagtgttgc aacagatgta ggttgtgggc 1200
ctgtggagtt agtcttcatt gtgggagggg caactccaca aggcctatca acataacctc 1260
cgaggggttg gactactctt gctggccttc gatcttgaca attaccagtg ccttcttcac 1320
aacccctccc ccacccctgc acaggtgatg acttgatggt tcttaagttg caataagaat 1380
gacaggaagc aagcaggaag caagagatgt gatatacaca ttaggtcgta tggagaccct 1440
gacagagcaa acctgtaaca ttcattctta ctgtattagc ccctttctta gtcacttatt 1500
aatattcatt tagtcattta gtttttgctg tttgcttgat gcagagtctc atgaagttca 1560
ggctggcttt gaactaagta tgcagctgag gatagccttg aacttcaaat tctcctacct 1620
tcatttctga gccattggga atgcaggcat ccaccttgga gcgccatttc tatttattta 1680
ctttctctaa ggctggggat ggagcctatg gctgtgtgtg gtaggcacag gctggggatg 1740
gagcctatgg ctgtgtgtgg taggtagcat tttggcattg actcacttac tctccagccc 1800
ttgattcttt tgagttacag agtgatacca ttgcctgtca ctcatcttta ctgtgctttt 1860
gtgtatgcac ccagcccccc ttcctctgtt gacctggctg gtctctgagg tcactgtgtt 1920
atgtttattt cagtgtcaac ctgcacactc tcaagcttcc ggttaattga gctttgcagg 1980
agacattcct acttactctg tcattcacca tgtcactcag ggtctactga gtgggagaga 2040
gatgacatat taatgctaat atcattctac tgccctaggt ggaggagagg gtctgtgtga 2100
atcaccccat tgcttttcct aggggtgggg agtatttagg aagcccactg taaggtggag 2160
agcctaggcc agggtaagca cggagctccc ttccacccgt ggccacccat tcagcatttg 2220
caagctgctc cctggtgcat cacctagtta gaacagtggc acctgagaca gcttaggcct 2280
ggggaaacca atagaacact ctgttgttcc acttggacta gcagtggcct gtctctccac 2340
agggagcacc acccatgttg gggagcatca cctgtaacct ccagagttca ctcacaccaa 2400
ggcttcttct cttcacaaac tgccatctgc tagtatcagg atgatcatat tccagaggcc 2460
aagcttatgg ccagccctct ccgtcagtcc tatgaagtgg ttgttggcag tttgtaatta 2520
ttttggccct gttctttaat accttaagag taataatctt cataatgtgt aggagtggaa 2580
ctagccattt aaaaagctgt gcattctttt aacagggtac gtccaggaca ccctggcagg 2640
tgggagagac tattcacttt ttctactgtc caagtggacg tgggctaagt tgtatccctt 2700
tcgagctagg ttgtatggtc ctccataaaa acatagtatc actgatgttt aaaatgcctt 2760
gacagcctca gtgtgaagct tataatttaa aggatgatag tgtaggtacc acccaggaga 2820
gagacgtata gcctgtccct tacctgggac acgcttgcct ggcaaggtct gtcccgtggg 2880
aatagacatg gaggaaacaa agaacatggg ccacatgctt ctacacacac acacacacac 2940
acacacacac acacagagag agagagagag agagagagag agagagagag agagagagag 3000
agagagagag agtcttgcaa agttctgcag aggacggttc tcaaagtgta gtcttcacag 3060
tggaagatgt tttaattttt aaatataaag aggtttgttg ttgttgtttt ctgtgatact 3120
ggtgttccaa tatgggggcc cacacacgga gacaggtgtt ttagcgctga ttacacactg 3180
agcctaagga ccatgtaaac tgtgagttcc tctgcttctt ctagaaacgg aacggaactg 3240
atcccgtcac caggacttag catcctcctg ctgcactctg actctcagac cttgcagccc 3300
ttaggttggg gctcacggaa cctcttagag tgcgtggatt tgggcagcag tggtctgtct 3360
gttccctctc tctttatcaa gttttctagc cacagggtat tttttgtaac tggagcagaa 3420
tcccagaaca tgttgtaaca tgtgagcata cttctgggat gctttaagat ataaactatg 3480
aaatatatgt atatacaaat tagtatagct gggcatggtg gtgtgcacgt ttaatctcag 3540
tccttgggag gcagagacag gcagatttat gagagttcta ggccagtctg gtgacagagt 3600
gaggccctgt ttcaaagaca aaaacaaatc aaagccagaa aaacttacca ttggtcacgt 3660
tagagtttgg tattctatta aaaaccttat ttaattttaa agtatacaaa ataatcatat 3720
tttaataaag ggcatttagg ggtttacaaa attatatcag tgacaagcat gaaaccacaa 3780
ctcttattta ttgttacaaa atggctttcc aatgacattc ttggcaggaa gaagtgtccc 3840
ctgttggatt tgttgactgt catcttgtag gatacacata aggcatagtg gtaatggttc 3900
aacttgccct agaaaggtta catactgacc taaactagtt tcttctattt cttccaaata 3960
tccacatttc tgtttccagt taagaaggca atgctgaaga gggaggcaaa cacactttca 4020
aaagtagaaa aacttagttt taatcaacag gattgggagt ctagaagttt cattggttct 4080
ctgaaaacca ccccatttgg tttctgcacc attgaattgt cccatggcag tgaaattccc 4140
aagcaaaccc atgaagtccc tatcttctga tgctgactgc aacatcccac agctacagag 4200
tagacaaact ggtggggggt gggggtgggg tggggctgag ttaggctcat ggcaggtggc 4260
agttgtcggc atatcctatc tgtctcttac acaaaattac agttgactat tttaattgag 4320
gcctcttctt gtcagaagcc agcacgagac gcttccagtt tgtctcactt atgacaggca 4380
gtagggttat agccctgagc ccagcacgcc agtgatgaat acaataggtg ggccctcagc 4440
cacactgcag gtttcccata acccaaaggc caacatctta aagaccctgt gagatctggt 4500
tacacaccat gctcacttca cacactgaac ctctggacta ggaggaatgt ataatacttt 4560
ccagatcatt ttaggaaaaa aaagagccta tcttatttta aggttttcat taaaaaaaaa 4620
aagtacacag cacttgaagt attaatagct ttttgtccat tgttgcacac gtaaactatc 4680
aaagcaaata acagtatggc atttctttac ctttagctag gggtaacttg ggggggggga 4740
ctttctcagt ggcaccttcc tcaggaccgg gttcctctct cctgtcctca gaggaagaga 4800
aacaatgtga gatccctttg tttaaactgt gaatgtatcc tccaagcttg gtcgctacca 4860
gcacggggtc tcagtggaac taactttaga acccattaat acaggcatag aattgggcct 4920
ttgtttggga gctttggggg aagggaggcc cacggaggct tctggagttt cataggaggc 4980
ctccagggac ttcaaatggt ggcattttag atgggaatgt ttgtcttggg aactgctggt 5040
ggctgagctc tgccgactaa gcgactaagc atgggttgcc tcatcctctc cctccatctt 5100
tgctctagca gccaggcaat gcattagact ggtcttttgg actttcctga gcaataccta 5160
acgaac 5166
<210> 10
<211> 2504
<212> DNA
<213> Homo sapiens
<400> 10
aaggggacag gacatctctt tccaaaactt aggtttggtg actcctggat ttcacactct 60
ctgactgctt gggtgagggt ggaatggagg gctgtccccc accctcgcac ctgcacggtg 120
gcatgctttc ctcctactcc agggaattcc tcgtggcctc atggcctggg ctgtttctgg 180
cttcaagctc cacgtggcct ggccccagcg gtctggtcca ccttgtactc ggtgcccccg 240
ctgccccctg gcctcagctg gagtgacgca cctcatccat gcgggcctgg cgtctggaag 300
gtggctgggt ctctcgggct tgagcaccat catcttagct ccaacatgtc attattcctt 360
cctcactgag gacttttctg cttcctaatt ggttgttgaa gatgaggccc ccatgctctt 420
ttaagaaaac ctgttgtgcc ccaggcttgg ctgtgatggg cactgactca tacagaagta 480
gaaaggcctg ctgagtcatc aacactcgtg cgacgccctc gcattttcat taatgatggc 540
ctccctgcca cacgtgaatc actccagccc gagatctgaa accaggacac accccagggg 600
cgaggtgacg ctgagtgagc ccagctgtgt ccctttcatg agaactcaga gcacagggct 660
ctgtgtgcat ggccgtcccc tccagagagg aggaagtaaa tgccgggatt agtggaagat 720
catttccttc tatttgcctt ggcttacgtc tttcagaatt caaacacgtg cactgttgac 780
cctgcaatgg tggagttttt ggattttcct tcagtccgat tgctaaaata cttccctctc 840
atgtgagctg ttgtgaaagt catcagccag ataccattct aaaaacaaag aatgtgcttc 900
tcgtatgttg catgctggtt actgaaatat tagggaatta cataaaggtt ttctggggca 960
catattcaag ctgaatgata aaattgaagg tcacacaaag ctaaggtctt tcaaatcctg 1020
acccaattag ctctctgtta gctctctgac tttggacaag ctgtctggtc ctctgaagca 1080
tactttgttc gccctgggta ggggccctct gttttaacag cgtttggcag atgaaaacat 1140
ttgcaaagcc aaaggacaat gaaatctacg gaagcctacc atatgccaat gactccacca 1200
aatgttttct cttcttggga tcttctaaaa ttcatctgaa tacttataag ttatgcaaat 1260
tttggttatt aatctaggtt gtattacctt gggggaagtc agttaatctc tttgaactca 1320
gtttctttat ctgtgaacct gaaagaacac cttcaaactc caagggtggc tgtcagaatt 1380
aactatagag gtgcaggtat cagatgaaag ctataaaaca gtttacagat cttagatatt 1440
atgatggatg gctatgatac gtttctcgaa tcactgcttg ccaatgagct gtacaatctt 1500
cctgaagggg tctgcctttc caatctgggc agcaacagtt aatgacggtg tgccaggata 1560
tctgtgtctc cttttatctg ctccagactt taaacacacc ctctgattac atcacactat 1620
caatttgaaa aagggctcag agccaaaatc accactgtta gcgagttctc cagggctgcc 1680
tcctatcctc tggaggtggg gctctcgtct gcagaaatag gcataagggt tttctatggt 1740
ttttgtttgt tttaaagacg aaacatgttt tgggatcttt taagaatcct aatcgttgtg 1800
aaagaaactg aagtaagtta ctgttcaagt gactctcatt ctgctgtgaa tagtttctcc 1860
cacgtgaagt cagctcaaga gactgtgaat tgcttcagcc tacctgagac ctggtacaca 1920
gggaggcttc ctagccacgg aagaggagag cgtttgcagg aggagaagga ggagagaggg 1980
cccacgcagg tgacattctg gaaagggaat gctggtgcga aactgcctca cctactttgc 2040
tccttggatg ttcaggaaaa gccagcccca tccgccccag tccgagggcc tcactcatgg 2100
aacaaatgaa gctgagaaga ggagcttcct gttttccagc tgctggggtc atcattatct 2160
tcaggaagga ccccgaaaag catcgtgtgt tgttgcaaag gcctgcctta tcctggcccc 2220
caggtccctc tccgctggcc ctgtctactg gataagctga ggttgcacga agtaggtcca 2280
ggcctaatgt gacagtgaat aatatggtgt ttggccacac agagatgtgt gtaggtacaa 2340
aaaccaccat gcttttggcg gcaaagtaaa aaatgaagat gtcgtcaaac gatctgaact 2400
ctgatggaga ctgagcgaga gaccctggcc caaaacaatc actccatggc ggatgcgctc 2460
tggggtagac agctactgct ctcagagcag ctgttttcag gcca 2504
<210> 11
<211> 3870
<212> DNA
<213> Mus musculus
<400> 11
gtaagagcca attaggaagt tccagggtta gtaaaggcca atcagtaagc accagggtaa 60
gagccaatca gtaagctcca aggttagtaa gagccaatca gtaagctcca ggttagtaag 120
aaccaatcgg taagcaccag ggttagtaaa ggccaatcag taaactccag ggttagcaaa 180
gaccaatcag gaagttccag ggttagtaat ggccaatcag taagctcctg ggttagtaag 240
agcttctggt tttggtcctt caatcactgg cctgagcact catgtgattg gctaggctgg 300
ctaatcaacc agctgtggga atactatcca gtgatgggct tgcagacaga tgccacagca 360
tgtggcacct ttaatgtggg tgctgaggat acaaagtcag gtctctccac gcttgcatag 420
gaaacacttt accaaatgag ccatttttct cagtttcgat tttattttat tttttgagac 480
agggtcccac tgtatagctc aggttggaca cagacttgtg atactcctat cttggcctcc 540
ttgactactg gaattgcaag tgtgtggcac catgccagct ggaaaggtaa ctttctaagg 600
tacctctttc taaaatagat gttgaccttt tgtaaggaca gactaaacgc cccctgggct 660
tgaggctggc gccatccaga acagggtaga gcgtattgag cctggcaggt tgaatccatc 720
tcccaaatga agagggcagg tgggttttgg gggttgatga cgagggaggg gcagaaagag 780
ggagacaaga cagagagtgt tactcagtcc aggtactctc ttgaactaag agcacacagg 840
gaagaagggc ctcatctgag gccaaggtgt cattgtatcc ggtataaggg gacaggatca 900
cctcctttca tgttggagct cgtggatctt acattctcta atgcttgact agatgtgagt 960
ggagctagaa cacgtatctt ctcctggtca ccgcccaggg ttcgtgcgct tttcttactc 1020
ggtacatcat cctcatcgca gtgggctggt ctctggctgc ctcatccagt ttgtcgtctc 1080
agttcatacg gacaccccct ggcttgtcag tgctggccca gtaccctcgg gcctgagcac 1140
ctgtgatgcc cctgcctcca gctcttcctc cccagagtct gcaatgctat cattccttcc 1200
cggcccagag acttacgctt cctcattaga tgtgggagat gaggttctca agctccaaca 1260
aaccagtcct gacctcgttt tggcaggaac tcaaagagaa gtcagaagct tgctgaatca 1320
cccacaccgg ccggccggcc gagcatcctg gcaaggcctg taattagagc ctctctttca 1380
caccttgaat cttgagggcc ccacgtctga aatgaggggt gtcccagtgc ctgctgcaag 1440
tttatgagca gcacacagac tcctttcctt tggaactcag gggtgctgcc tgcgtctggc 1500
ttctgtggag gaggaagtaa tgtgtgtgga ttagtaaaag atcattttcc tgctgtttgt 1560
cttggcctcc gtgcttcaga attcaagcac ttgtactctt gaccctgcag tggtggctgg 1620
ttttgagtcc acttcctgtc tgatcgctaa actgctcctt ctctgaggac cttcagctga 1680
agccacttac ctgctaacac ttaattaatt aataattaat attgtaatta attttttgtt 1740
gcaggattgg cagtgaaacc caaaacgtca cacatgctaa gcaggcacgg ggccatcaaa 1800
tcattttctt aattttttac ttttttattt tttgtgtgtg acagggtctc aagtaaccca 1860
ggttgacctt aaacttcctg tgtggccaga atggctttga atctctggcc cttcttctcc 1920
ctcccatggt actgagatta caggtatgta ccaccatgcc tgacaccctg atgctgtggt 1980
ggactcaagg aatgcacata cctaagcttg aatgctcgct gttgaaatac tagagacatt 2040
taaaataatt tgccagttag gaaaagcttt ctatggcaca cagtccaatt gaatcttaac 2100
acacacacac acacacacac acacacacac acacacacac acacaagact taggtctttc 2160
aaattccagc ttggtggctt gttccatgtc ttctttggac aagccctcca gctctcctct 2220
cctctgctct cctccttggt aactaagggg aggccacgcc tactttattg gcatcctaga 2280
gatgccaaca ttggcaaaga gaagggacaa ttaaattcat tgaggcctgt gtggtgtgtc 2340
agcaactctg ccaaccactt tcttatcttg gtatcattta aattagtttg aacacttaaa 2400
aggttgtgta aatgtggctg tctagtatta gaagctgttt tgtattattg ttagttgtgt 2460
tccctcaggg gaagtgagct gccctgagct cagttcttta tctggaaact gggcctaata 2520
cctccagact caaatgactg tcacaggact tagctatgaa ggaaagggtt gaggcagaag 2580
tcagagcact ttacaaatat taggcgcact tactaatgct catgataaat tcttcaaatt 2640
gttgtgcgat aaagatcttg tcagggtttc tcaggcggct atctttccca tcagagctgt 2700
ctgtccaagt taaagacagc ttactggaat atttctgtat ccttttgtcc aatacaggat 2760
ttaaatatac cctgcgatta gattgtaatg ccaataaaaa gaaaagaggg gatgtcagag 2820
cataagccca gggtgacaac cctgggactg gcattctaga ttctggggag gagactcttt 2880
ctgggaagag aggctcatgg cgttttgcag tttttgtttt ctgttttaag acaggagttg 2940
ctttggggag ctttatctta agaatccgaa cggttgtgta ggcaagcaag caagcaaggc 3000
agctactgtt cggttgacct cgttctgctg tgaagaattt gcactgtgtg aagtgtgttc 3060
aggaaaccct gaatagcctt ggcacacctc cgacgtgctg cttcgtggta aagtttcctg 3120
tcctcaaaag agaagacatt taaaggaaga ggagggacca aagaacgggt cacctagaca 3180
acagggatct gggcacctgg taggaaggaa accttagctt atttactcct tgaatgttgg 3240
gagagaacag ccaggaccct gccctagagc ctcactcatg aaagctgaat ctgggacagt 3300
gagtcctccc ctctaactgc tcccagttcc actgtctcca gggtggatcc caagtggatg 3360
ctgtgtacat ggccttcatt ctggtgccta agctccactc tgtggaccct gtcaccaagt 3420
tggtgtgagg aaatgtaaca tttaatatta tgggtctggg ccacaccaat aaactacgag 3480
gcattgtagt caaagctgct gccgcctttc agtcacctga cctcggtggc cattgaataa 3540
gtgaccttgg tctaaaacaa ttgctccaat gttctgttct gatgctctgg gtggatcgct 3600
gcttgtgtca gagcagatgt ttccaggctg ttgctggggc caatgtcacc attcctgtta 3660
gtttcagatt gtctattagt tctagatagg gtctcattat atgagacacc ccaccctcct 3720
gcatggctca aaagtttact gatttttatt ctttgtgtgt aagtgtcttg tgtgcacgca 3780
catatatgtg caccatatgc attcctggtg gtaggaagct agaagagggg ctcagattct 3840
ctggaactgg agttacagat agtcgtgagt 3870
<210> 12
<211> 1768
<212> DNA
<213> Homo sapiens
<400> 12
atcacgcagc ccataccctg cggttctccg gggacttatg catcggccca agttgagggt 60
ttgtctgaac tgaaacccgc atcctagacc tggctttctt ctccccaaat ccaaggggac 120
accccggtga cccacaaaag cttagaaaat ccaacacgca gcaaatgaaa cgggggaaag 180
gggcaccggc cctcactctg gcctcttaga cacacgatat gaaaccttca taaaacctgt 240
tgtacaagtc aaaggggacc acgctggggt aaaagtcaaa ccagtccatc ctcgttcctc 300
tgcgtacaga gagagggtcc agcgcgggcg gcgcccactg ccatcgggcc ggggccgggg 360
cgcgtggaca ggagggtgcg gatagaggca gatcgggggc ccggtcgccc cacgtgcggc 420
cagacaccca tcccggccgc gctctgccgg ctctgatccg gtgccagaca ggagcgacag 480
gggcgaggtg gggaccagcc gccgacctca cctgttttgt tttcttggag gaaattcctc 540
cgctgggggg ccgaggtggc accgcccgct cgccccccgc aagacccagc cggtccgcgc 600
ccgcttacct gctctgcggc cggcggccct ggcgcgggct ctgcgcgggg cggcgccctt 660
cgctccggct gggcaggcag gtcgggctcg ggcgccgccg gctgtcgggc tctcgtcggg 720
tttcgggtga aggccccggc tcccacctgc tgcgcctttt aaccgcgccc caccccgcct 780
ctgccctgac gcggctcggg cgggctgcgg gaggcgagcg ctgtcactcg acgagccccc 840
cgcccccacc tacccggggc gcactagccg ctgggcgcgg accgtccccc tgaggagcaa 900
ggagtgcagg accggggctg tccctccggg gccggatgcg cagagcgggg acctttttcc 960
cgtggcgggg gcgcagggtg ggggacccct aagaagtgca cagtgcgcgg ggccctcttt 1020
ccggcccttg gagggaacgg ggtaccgggg atgcaggggg tagggctctc cctcgggagc 1080
gcagagggcg ggcccagccc cctctgcacg ggtgcaggtg tggggcgcct gctcaggccc 1140
tcgagggaac tcttcctccc tagtgcaccc gtggggagca gtgtgagggg caggctgtgt 1200
ttttgccagg acacatcctc agtctttctg ggtgatccag ccttctcata gcccgcgggg 1260
tgcacagacc tctcctatag gagcctggag gttctttatt aattaatgac cacttagagg 1320
aggtacaggg gttgttttta ttaattacct ccatcctttg aagactcctc cggggaagcg 1380
gagcaggcct tcctcgggac agtgcaccag gagagaccac attgcctccc cgcttttcag 1440
tcaagactag aaagctcagg gccagtacag ggagtggtgc aagggctggt ggggtggaaa 1500
cgttggaagc tatttaggca cctggcttta caggttcaaa cctgtcacgc atcggacaaa 1560
agatgtgtga cttgcttatt ctacaaaact gttcggtaat taaacgtccc cacctaaacc 1620
atatgccact tgttgggtca tattctccca cgaaacaatt aagatgtctg ttaaaggtca 1680
tggaatttga gccaagactt cataaaaatc cgctttccaa aatattttat ttgaggagaa 1740
caaggttctt aaagaatttg cccaagtc 1768
<210> 13
<211> 1751
<212> DNA
<213> Mus musculus
<400> 13
aatcatgcag cctgaatggg catttctctc caagtcgcag ggtttgactg accataaaca 60
tcattccttg ctgtgctttt ctgcccgctc cccaaatcga tgacagcccc aaaccagcaa 120
aggaaatgag aaaagggact taatccggac tctagtcact ttaaacagcc tggtgtgttt 180
ataaaacctg tcgtgcaagt cagaggggca tggtgcatgc agaagtcaaa ctagtccatc 240
ccagttccta ctgcagggca cgagggaggg ggcggcgcgg gtgacaacca ccctgccgcg 300
gttccagttc ccggtgggct cgcaaaggcg ggatgccgat gggaggcaga taaggatgct 360
ggcaaacccc cgcctccccc ccccccaccc cccgcatggt caagactgtc tgtaaccgcc 420
gggccgcctg gagatacttg ccaccccctc gtcccacaaa tctggcgaga aagggaacag 480
accacttcct ttacctgccc gggtttctcg gaggaaatgc tcccactcgc gcttacctgc 540
tcggtgggag ccggctccag gctcgcagcg gcactcagag ctcctaccct gagcgtaggt 600
tggatcaggc gccggcggtt cacagcggga atggaatcgg ggacagtgcg ggtggagccc 660
cggtttccac ctgtggcttc ttttaaccgc gcccccaccc cgcctctgcc tgacgccgca 720
cgggagggct gcgggagagg agcgcgggca ctcgacgcgc cttctgtggt gcgcaccgcc 780
ctctctccgg gacagaggag cggggcgggt ccccttctgt ggagcaaggg gcaggggacc 840
ttccctgtta gggccaggtc ttagtggtac tatattaggg cactcgttgg gatccttctt 900
ctgaagccag ggaccactgc gagtgtcccc taggagagac tccaggtgta ggctggtctt 960
cccttgggtt ggggacagaa ggcttgtccc ttcttgtgga tgtgggtgga gcgtggaccg 1020
cgatgggcaa gctcagccag atcccatcaa ggacagggaa aagttgcccg ctggggcctt 1080
gctggggctg gacactggag ggcccttaat gaagtgaggg ctatccagag tacggggaac 1140
aggcttgtgg acccagctag tagtgagtct ctcctgttgg tcatcctggt aggaagacaa 1200
ctggtttgtt ttcatccttt ctagaccctt tgggcaccct ctcctctaga gcagcctgga 1260
ggttctttat tccttaatga ccacttagga gtctcaaagg tttgttttta ttagtcatct 1320
gaatcccttc ctgcattgtc cagggaaggg gagtggactt ccatcttgag agatcccact 1380
gtgtctgctg tcacatcaag ggcagggtaa ggtcaaggca agcatagagg gtggtacagg 1440
gggtcctggg ctggaaatgt tggaagccat gtaaggacct agttttacag ggcctgccct 1500
gtgctacttc agacaagact tgtaacatgt gtaacttggt tattttacaa aattggctgg 1560
caggtatgtt cttacctgtt gggtcatatt ctcactttag ctacattcta cctgttggtt 1620
cacgttctct cacaaaacga gagtaatagt gcttcctaaa atgtctctcc caggtcatgg 1680
aggttgagtc aacgctttat aaaaacccac cttaataaaa tacttgaacc agagttctcg 1740
gaattggacc c 1751
<210> 14
<211> 3358
<212> DNA
<213> Homo sapiens
<400> 14
taaaagtgag caaacagctt gaaccaatct aaacagctta tttatttgag gtaataaact 60
tttccttctt cctgagtttt cctaaattct tctctatcat gaaaatagca ttaatagcta 120
aaattttaag tgtttagagg ttttgccttt caaatccagt aagtctccag agtcaacagg 180
tgctacaaga tgctactggc agtaacagtg cttctccagg attgtggtag gtggtgtcta 240
agggtctttt cagcttgaag gttctgtttc ccagttctgt ctcacttaag atcagatctt 300
ggtgagtata ttggcaaacc atttcattat ttaaatttgt aaaatacagg ctttaggccg 360
ggcgcggtgg ctcacacctg taatcccagc actttgggag gcccaggcgg gcagatcacc 420
tgaggttggg agtttgagac cagcctgacc aacatggtga aactacgtct ctactgaaaa 480
tacaaactta gccaggcttg gtggcacatg cctgtaatcc cagctactcg agaggctgag 540
gcaggagaat cgcttgaacc cgagaggcgg aggttgctgt gagctaagat tgtgccattg 600
cactccagct tgggcaacaa gaatgaaact ccatctcaaa aaaaaaaaaa caacaacaac 660
aacaaaaaca ggctttaatt gtatttcata ctctttaact aactagatat taactataaa 720
atattaacaa tttcaaattt ttgttaaagg aatacattta cacagcttaa aaattcaagt 780
ggaactaaaa ggtttacaag gcaatatttc agtcctctgc cccattctct gctcctccca 840
ccctgtatgc tgtcccagag gcaaccaacg cctttcattt tttagagctc ttctgacgtt 900
tacctttatg tttccaaata atgtgcttat tatgccattt actgattgct ggactttaga 960
cctgttgact ttttctgcta tggtagtgga ggctttagct ctgacctgag ccccactgct 1020
cctgctccac ccacacctct tccctcaccc tcatgacatg atcatggctc atactctggt 1080
caaatacata ttgttattta tattattttg actgcgagca taatgacgtc tggaccaagt 1140
tgtattctat gttacatttt cttttggttg caattgcctc ccttccctga gagtgaacca 1200
tgactggggt tttcatttgc ttggctttct atgtgtctat tgttcggctt ttcctactct 1260
tccaacaaat ctgtcatatg cccggaaaca attttttcaa gttcccagac atggttccgc 1320
acagtccatc tattccatct gtttctttcc cttttcccgg gggctgtggt ctgggcaggg 1380
tgctctggcc ctctgcccag tggtcccctg ggctcccctt gcctttcccc tgggccagag 1440
cttgtgcttt ctggagtccg tgtcttcctg tcttggtctc taccttcatt ttgctgaagc 1500
acacaccttc caggaacttc ctcaggaggg gaatgtggaa ctaaacttct atgcacataa 1560
agtcttcata tcaccctcaa acccgatctg tctccccgcc tccaatgtac tttcctttcc 1620
tctcttattt tctctgtttt tatgaactta cacctttttt cttcactatt gtgtaattgg 1680
catttaagat gggagtagag ataaatgcac ctgtgtaggc tcatactaac cacacgcctc 1740
agtgcatggg tgtttatcag acttctctca atcaagagct gcgctgagta cttgtgaagg 1800
ccctgcaggg ctggtgctga gtaagttcag gattgggcac ctctgagggg tgaggaaatg 1860
gaggttcaga gacgagaagg aacttcccca aggccacatg gttaatgatt ggaagatctg 1920
agattctaaa ccaaacctga gtcgatcact tccctttctg tccactgcac tgataactga 1980
agcccaaggg ctgaggccac acctcagcgt gtgaggatca gcagaggaga ccctgctggc 2040
tgcgggatgt ggataggctt tgaggaagag gaaaagcaca ggcaaaatgt caaagataag 2100
tgggaatgag gttccctgga gcatgagtcg caggtgctca ggaaggtgct ggcagctcta 2160
gagaaggcca gagagaagca cccagtggtg ggagccacag ccccaagaca caggctaaag 2220
ccccagccca gggtgggtga gctccaccct gtcacctatg gggttgcatg caagtggttc 2280
ctctaagcat tggcttcatc tgggaggcgg gggtgacatc gcttctttga gccttatttg 2340
gaggactaaa caacacatgc attttgtcat taggctggtg caaaagtaat tgtggttttt 2400
ttctattact tttaatggta aaaaccgcaa ttagttttgc agcaacatac taactttaaa 2460
gttcttaata catatgagat attatttcta tcagcttaga aggatccatt atgattgtag 2520
aagacctggg atgccagtct gaggaactct tcttttctta agcaaaggag aaacaaaata 2580
attctgatgg gggagtgact gaccccagtc tggctcaccg gcggctgtga agtcctgagt 2640
gtcctctggc agctgccttt gaaagcgcag tggtgtccgg ggctcgccac tgaatagcgt 2700
ttgttctcag aagggagccc ggtggaaaat ttgaagctgc agttaggaac tgtgtgtatg 2760
gccttggaaa ctgaagatgt tcctttaaaa gaaaaatcac agtgttttta aaactcagat 2820
gacagctttg accattatct gctttcctct cctgccagct ctagagtttt cttgggatgt 2880
tatcaaggat gatatcacaa caatgcccac ttctgttttg tttttaacct gaatgacaaa 2940
ttaccaatca gcagatgtag gccatccagg gaagtttctt ttaaatgctg gacttttgca 3000
aaaatgtaga gccttggtgg caattgtgat tctttttttt ttcttttctt ttccccaatg 3060
aaggtacttt tttttatgtc cagttttgga aggctcctga agattgtttg agaacttgac 3120
tgctgtgtca gggcagtgct gacactctct gttgccaact gttattcatt attccaaaaa 3180
atcagagaag caaaaacgac ccctccaaac aactccaaga caaactccaa gcaaaacaac 3240
aacacacaca caaacccaca attttccttt ggttgcttct gagaaggagt tttaatggta 3300
tagtaaatac agcatttatc ggatgatttt tgctgccatt gatatgtttc tcttcttg 3358
<210> 15
<211> 5018
<212> DNA
<213> Mus musculus
<400> 15
aggaggtgtg tcttcctgga ggaaatatgt cacaagggtg ggctttgagc atttaaaaat 60
ttaccccctt tccaggtttt tctctctgct tcctgcttat ggttcaagat acaaactctc 120
agcttccagc ttcagcccct ctgctctcag agatgctcat ctctctggaa ccatgggtcc 180
aaataaactc tttgttctat aagttaccat ggtcacggtg ctttaccaca gcaacagcaa 240
agtagctaat ataatctttt caaggccacg aaaaagagaa aggcaaacca agagtttggc 300
tgaccaaatc agctgagaac acaaaccttc ccatcctaaa ttccccaatg ttcttttatt 360
tttcatcatg caaatagcca ctgatattta aattatatta atgtgctcat tatggcagtt 420
tcatatattt atatattgta ctttgaacat attcacacac ctccaaatac cctcttctgt 480
cccccacatt ttaagactgg aagtctcgtt ttttcaaatc cattattagg tccttagggt 540
caatggggtc atatgatggt gtctgtggtt ctaattagtg gccagctgga tacctgcaga 600
atcaatgact agtgggtaaa aagtgagcag tcagggtcag cagctcacaa agcgtcagtg 660
agaggcggac aaagagagct ttcagcaacc cctaactggg tgggcagcat gtgagccaag 720
tgtgagtccc tcctttttgg acctgggaga ccagcagagt gtgcaggccc tccgttggct 780
tggcccaggt gataagctga cctcagcagg aattacctca gtcttagtcc agctcctgat 840
gtaagtctca ctcaaaacaa aacaaacaag cctagacaaa accagcttgt tgtctttttt 900
ctgttgtggg aactgctccc actcaggaat ttctcagtgg ccccctcaag gaagtttgct 960
tcttctctgc ttccttccac acatctgtgt ctttctggtt ggagaccatg gacttgagag 1020
ttcaagttga gcttccacta ccctaagtgc ctgggtcaag cacacctgcg ctgagaaggg 1080
tcctgccagt ctcaaaactg catcactaga tcagcagtat actctctcac ttaagcatgg 1140
agtggggagg tgcctttgta tgtcttagca atagtcatct acgtgatttt gaggtcattt 1200
tacttttaaa gtatataatc ttcaaaccaa attcaaagac taggcaaaat ttttaaatta 1260
gcttttaaaa aatgagctgg tttgcttact tccctgatct taattcctat aggcagtatt 1320
gtgaggtaac ttatttaggt ttagggatga tagagaaata atgtcttagg gttttactcc 1380
tgtgaacaga cactatgacc aaggcaacac ttataaagac aatgtttaat tggggctggc 1440
ttacaggttc agttgttcag tccattatca aggcaggaac atggcagtgt ctaggcaggt 1500
atggtgcagg aggagctgag agttctacag cttcatctga aggaagctac gagaatcctg 1560
gcttctagga agctaggatg aggatcttaa agcccacgct cacagtgaca cacttcttcc 1620
aacaaggcca cacctccaaa tagtgccact ccttgggcca agcatattca aatcactatg 1680
ggtactctta aaagaatgca tgttttagct ttaaacattg ttcatttatc cgtgtaacag 1740
actggtttga gatctctcag caaagggagt tatccttata cagggactct tttcattctt 1800
tttcttagtg catattcatt gtagatagtg ctgagttgta taaaggcttt atctatctat 1860
ctatctatct atctatctac atcccaaatg ttgcccccct ccccgtaccc cctcaaagag 1920
ttctttctcc cacccccatt ctctttgcct ttaagaggca acctcctctt atatctcccc 1980
aacctgatgc atcaaatctc tgcaggatta ggcctcaggc cagcccatgt atgctctttg 2040
gttggtgact cagtctctgg aagctcccag gggtccaggt tagttgacac tgttggtttt 2100
cttgtggggt tgccatctgc ttgagggcct tcaatccttc ccctaactct cccacagggg 2160
ttcccaacct ccagtcagtc cagtgtttat ctatgggtat ctggatatcc ccctctgtct 2220
catcagctgc tgggtacagc ctctcagagg cctgctatgc taggctcctg tctgcaagca 2280
caacatagta tcatcaatgg tgtgagtgat gggtgcctgc ccatgggatg ggtctcaaaa 2340
cgatctgatc actggtcagc cattccttca gtctttgctc catctttgtc cctgcctttc 2400
ttttagacaa gatcaatttg gggtcaaatt ataaaggcat tttcatgtta agtgtataat 2460
gtattttgac catgtttccc catatcctcc taccctccca tttgccctcc ccctttctca 2520
ttagtattct ttgttctaga caaatttact ctacttttat ggcatatgac acatacatga 2580
tttaatgaaa cataaaatgg agaatctaca gacaaaagaa agcatgaaat atttggctga 2640
agctgactca actcatttaa tatgacaacc tccatttccc tacaaataag agaatctcat 2700
tctttattgc agactaaaat tccacaggtg tatataccac atttctttcc ctatccctct 2760
gtctttggac acctaggcag gttccaccgt gtagctattg tgagtaatgc tgtagtcaac 2820
attgacatgc aagtgtctct gtgacatgtt gacacagagt tctctggata aacacatagg 2880
agtgtcgtag ctgaatggca gtcgattgag aaaacaaata ataaaagggt tggtgagcag 2940
gtgggaaaag gaaactttga acgcattgct ggtgagaagg aaagtcagtc tagctgctat 3000
ggaaatcagg gcgagggttc ctcaggccct aaaaccagaa ctgccttatg acccaggcag 3060
tcttgacagc tgttgttgtc tgtgcttaag ttcttgactc tgtcagacat agagaaacca 3120
gatctcaggc tagaagttcc ttctttctcc atgttccctt aaccaccctc ttctctcctg 3180
cctcagcctt gtagaagtgt gccttccatt aggcacctaa gaagaggaac ttgacagtca 3240
gctgccacct tctagtgact ggaagaacca aatattctgg atctgaataa aagattttac 3300
attctgcttt gtggctcaca ggagactcag tgacaggccc acctaagcac acacagaaca 3360
gtagagcgac aggttgaaac agcttccagg aggagtgggg ggaggacggg ctgaggaagt 3420
gggatgtgta attccagtag agaaagtcat tggaggtacg gaaggtgctg gcaaccctga 3480
gaaacagcag ctgatccacc agctgcaggg ccaggcctct ggatgcaaca gccaagtcag 3540
agcccagctg ggcctggctg tgttccacct gctccctggg tggccccagg caagtgactc 3600
ccctgagaac tggcttcagt agtgagaaga ggggtggggt gacaatagcc tctttacagg 3660
gttacctaga ggactaaata atgcacatac gcatacacac acacagacat gcacacatag 3720
acgcacacat agacacatag acacagacac acacacagaa acagacactg acacacacat 3780
acacatacac aaagacacac agaaacagac acatacatat atgtatacac acagagatat 3840
acaaatatac atacacacat ggacacaaac acacacatac agaaacagac acacagacac 3900
acacaccaac atataataca cacccatata acacacacat ataacacaca cacacaggca 3960
aacacatggg tttatgggct ctgcagtaca ataaggcttt attttcatca gcttagtcag 4020
cagtagccta caaatattag tgttcaaaag tattttctag gcaagggaga gacagaaagt 4080
ggttgtggtg gggagtgagg ctggtgactg tgagtgggca gtgtctagtg tctggggaca 4140
gctgagattg gcagcccact ggccactgac tagagttgct tcccacaagt gagtccagtg 4200
gaaattttta gtttgctctt agaaactgtg ccttcagcct tggaaactga agatgtttct 4260
ttaaaagaaa aatcgtgctt tttgaaactc aaatgagagc attgcctgcg gtctgctttt 4320
ctctctctct ctctcaccag ttttcctggg atgttatcag ggccaatcat cagaacaatg 4380
ctcacttcta tcttgtgtct aacctggatg acaaatggcc agtcagccga tgtaggtcac 4440
gcaaggaagt ctgtctttcg ggttggactg aggtagccgc agtgcgatgg ctgctttgtt 4500
gtttctttcc cttttcttgt cccaactaaa agcgcttctg gtctgggagt aggggcgact 4560
gaaggctgtt tgagaacttg actgctgggc ccctctaaca ttttctgttg ccaacagctt 4620
actccttttg ctaaaaaaaa aaaaaaaaaa aaaaaaagca aacaagccca aactacttct 4680
tcaaacaatt ctaagacacc acacaaacag aacagactga agccccagta acccagcttt 4740
cccagggatg tttgtgagaa ccagggtagt ttttgatcac tactaaattc tacttaaaca 4800
tttttaaagg atttcttttt cttctcgttt ttaaatttgt tcttcgaata caatgtattt 4860
ttgatcatat gtgcacccct cccccaaccc ctccttctat caagccaacc tggtgttccc 4920
tcccctcccc tctccctcct cctctccctc ccctccctct ctccttccct ttccctcatc 4980
tccccctccc cttcccctca tttccccctc cccttccc 5018
<210> 16
<211> 5079
<212> DNA
<213> Homo sapiens
<400> 16
gttttaatgg tatagtaaat acagcattta tcggatgatt tttgctgcca ttgatatgtt 60
tctcttcttg aaagaggaat tcaaatgaca atgaacattt ttggggtcct cttttatgga 120
gtttgatttt caggggattg tcaggcatgt cgtctccggg ttcccatgct gcacagtccc 180
agcactctct gtggctcagc cttcccgtcc cttgccctct gaataccttg ccgttgactg 240
aatggtcatc gttagcacag gtcatcacaa tacatgactc ctgggcagga ggaacagagg 300
agcggaggtt gtgccatgca tttaaaaccc agttagcatc ccagtgggtc ttccaaggcc 360
gaagatggca aaacgttttt attttacttt gttgaaatca tctgtttccc tccaaatggt 420
gggctgtttg ggcacaaggt catgttgtct tcaatttcat agccccggta cccagcaagg 480
atggctgccc ataggctcta ttaagatgcc gagtgcatcc gtggcacggc caggaggagt 540
gtgctgtggt cagccttcca gaaggaatca atctcctggg agaagtggag aagttggcct 600
gcagcagggg cctcgagaat ggcgggtctc atccaccacc agcaggctcg tctgttgccc 660
agcagtgtga tcctagctga ggtttattct ctttccctca ttagactgca gtctcctgaa 720
aggcagggtg tgcacctgac ttgtcttttt gtcccttcat cctgcgccct gcacggtttg 780
atcagtaaat ggtggctgag agacaaggga gtgggaagga aggaggtcag gaggggagag 840
aggtctgagt gcttgaaaga gtccctcctc tgcttcaggg gcttgttctg gggttttctg 900
gatcttcagt acttgcgggt aggatctgag ctctcccggc ccctggtggt tgttggccag 960
gcctggccag cttccagcag cacaggtcat cataatatat gactcctgga caggaggaac 1020
agaggagcgg aggtcgtgcc atgcatttaa aacccagtta gcatcccact gggtcttcca 1080
aggcggaaga tggcaaaacg tttttatttt actttgttga aatgcaggtt gttccttttt 1140
ttttaaccaa cttttatgtt ccaaggctaa aacatagcat aaaacaattt gaaaaagtcg 1200
gtttcaatgt ttcccattgt tcactgagag agggtcacac agggtgcaag gcaacagagg 1260
acaccattgc ttacgtagta cctcgtgagc tgcactgcga gaggcctttc aaaggaaggt 1320
tttatttagg aagcaaggaa tgattaaaaa ctgatggctc taatcaaatg agatttaaaa 1380
ttttccatta aaccttcata gttaggctgc atgcagtggc tcatgcttgt aactccagca 1440
ctttgggagg ctgagatggg aggatcactt gaggccagga ggttgaggct gcagtgagct 1500
gtgactgggg cactgcactt cagtctgagt gacagaggga gactgtatct caaaaaataa 1560
aaaaaattaa aaattaaaag aaataaacct ttaacattgg gtgtaatttt actttccatc 1620
tactccttct tcctcacctg caacgttcaa gagcaggagg gaagatgtga acacacattt 1680
gtgtgtgtgt gtaaacatgc tcatgtgttt ctaaattatc aagtcaggat aagaacttct 1740
actgtgaaat acagatatac aacaatatgt cccaagctat gtttaatgca cttttattat 1800
cctgctagtt cttctaaata tgatcattat acaatagttc tttttttttt tttttttgag 1860
atggagtctt gctctgtcac ctaggctgga gtgcagtagc gcaatctcgg ctcactgcaa 1920
cctccgcccc ccagattcaa gcaattatcc tgactcagcc tcccgagtag ctgggactac 1980
aggcgcgtgc caccacaccc agctaatttt tgtattttta gtagagacgg gggtcttgcc 2040
tcgtgggcca gtttggtctc gaactcctga cctcaggtga tccacccacc ttggcctccc 2100
aaagtgctag gattacaggt gtgagccact gtgcccggcc cattatacaa tagttctaca 2160
aagaaaattt aagagcaagc tctggcttag tctttgaaaa acaagtttgg aatttcctat 2220
acgagtggat aaaatgtcag ctcttggtat tgtccttaag acacagtaca tggtatttac 2280
tctcttttta tagggtaaag atagataaat ccccaaaggc cttggcattt aggaaacaat 2340
catgctttat ctattaactt actctttaag ctctgtcatt ttttgcgtct gagtgagaca 2400
ctctatttac tgagccacag accacctgct agataagcag agactcttcc agggcacaca 2460
gcctggagaa aaaacgcctg aatgcacaac tagaagtatt agcaagtctg gtttaactgt 2520
ccccaaatgt ctaactaaga atattagtgg gccaggcgca gtggctcacg cctgtaatcc 2580
cagcactttg ggaggccgag gcgggcggat catgaggtca ggagatcgag accatcctgg 2640
ctaacacagt gaaaccccat ctctactgaa aatacaaaaa aattagctgg acatggtggc 2700
agccacctgc tctagtccca gctactcggg aggctgaggc aggagaatgg catgaacccg 2760
ggaggcggag cttgcagtga gccgagcccg cgccactgca ctccagcctg ggcgatagag 2820
cgagactctg cctcaaaaaa aaaaaaagaa tattagtgaa tgattagtat atgggaaaca 2880
cctccggacc accctacatt attattagtc ttcactttgt ggtgggtaaa gataaaataa 2940
aagtagctac cgtttattga atgtttacca tgtgtggatg aaaaccatgt taatcattgt 3000
cttctttaat cctcacagca acctaatgaa gtaggtacta taattttgca gatagccaca 3060
ttgagggtga gtgaggttaa acaacttgct catatgactc aaaagtttgg aagccatttt 3120
caaatcagat gtggacaaag tgtgcctttt taaccattgt attattcagt cttcctatga 3180
agacacgcct ctatttgggg catttacttc ctatataact tgatgaaaaa aaacccagca 3240
ttttcattgc ttgcctataa aaactctaaa ggtgtttctg tgggagggtg tgttattcca 3300
ctcagctatt gataaatata gtcctgtctt aatgtttaat gtggatcttt tttctgtttc 3360
atgcttttct gaatttttga gtgaccatgt cactcagaaa agctttgaat cagcaacatt 3420
tccagtggac tgtagggaaa gcctgttgtt ttggtggaaa gtagagagtc acagatcccc 3480
aaccttcatc tgagccgtgg ttctgcatca gtacagacag gaaaccaact attaggagcc 3540
actacatgaa atagtatttc ctcaggtgag caaaaaattc ttttgctttt gtagattggc 3600
cctgtctata cgtggtagcc actagtcaca tgtggctttt gacgtttgca ttttaattaa 3660
ttaaagtgaa acacaattta aagttcagtc acccctgcca cactataagt gcccagtatt 3720
caatacaact gcccagtggc tgccatgctg ggcggcgcaa acgtagagca cttctgtcct 3780
ggctgaaaat tctactagac agagccatcc aggaatttgg actagcaagc accaagttca 3840
cagttagaga acacagttgc aggccaggcg cggtggctca cgcctgtaat cccagcactt 3900
tgggaggcca aggcggatgg atcacgaaat caggagtttg agaccagcct ggccagcacg 3960
gtgaaacccc atctctacta aaaatacaaa aaattagcca ggcatggtgg tgctcacctg 4020
taatcccagc tactcgggag gctgaggcag aagaatcact tgaacccagg aggcggaggt 4080
tgcagtgagc tgagattgcg tcactgcact ccagcctggg caatagagca agactctgtc 4140
tcaaaaaaaa aaaaaaaaaa aaaaaaaagg aaagaaaaag aaaaaagaga agacagctgc 4200
tttacaaagc aagagggctt caagaatctg gaaaccaaag gagcaatgtc ctttgagttt 4260
ctacaaattt gggccacact gattgggcct ttccacagcc aattccattt gccttcatta 4320
tggaaagtaa acagtttaac ttcctactga catgctctgc agtgcagaca gtaaacagta 4380
gctcaccgct gcttctgcca gctgctctcg ggtgttctac ttgggtgggg aacagcagca 4440
ctggcactgg cactggcccc ggtggcccca cagagcatgg ctccatcagg ctgggtgcta 4500
cagagggatg ccaagaacat ttgggcattg aatgcctctc tctctctctc tctctgaaat 4560
gaaaaccctc atcaattcaa caatagtttc tctaatagaa catatagtga tttgtttcat 4620
ctcaactgtt cccatacaat aatagaaagg agggagtctg tgcctgagag tgcctgcaaa 4680
ccccagggca caccagcccc gtggagccat aacagttgct cacagagaca gcccctcaca 4740
gcagcccccg gcacagtgac tcgtgtaatg aaagctggaa aattgcccag gaaaacctga 4800
agatgcattc ctgaagctcc cacactccaa cgcacgcaca cacagacttc tctcctggct 4860
ttaggaacat gaatttacct tgaatcttta aacttaattg aaaatcttgc aaaataacga 4920
gctttccttt gaatcttcat ggcactttgt aataaaatgt ctaaaagggg gccattccat 4980
gaaatcattt aattggcatt aatagtacac tattacttca tataaaatca taatcatata 5040
aatgtactta tataactcca tgtaaattaa tttatataa 5079
<210> 17
<211> 4077
<212> DNA
<213> Mus musculus
<400> 17
gggtagtttt tgatcactac taaattctac ttaaacattt ttaaaggatt tctttttctt 60
ctcgttttta aatttgttct tcgaatacaa tgtatttttg atcatatgtg cacccctccc 120
ccaacccctc cttctatcaa gccaacctgg tgttccctcc cctcccctct ccctcctcct 180
ctccctcccc tccctctctc cttccctttc cctcatctcc ccctcccctt cccctcattt 240
ccccctcccc ttcccctccc tcctccttcc cctccctttc tctcccctcc tttacctccc 300
ctctcttccc cttccccctc cctccctccc ttcctccttc ttctggaggt tatggtagca 360
ctaggagtca aatccagagc ctgacactca actgctgatt gaacccctga cccttcttat 420
tttttctgtc catgtttatt ttcttgaagg aggaattaca taaaaaatga gcctttcgga 480
ggtcttcctt ccttgagtct gctgttaggg atgagtcccg tttgaatttc tgtccatggc 540
agggtctagc gccgatttct ctctgatccc cagaacctca ccctgatgag gtttgtgcga 600
tgggtgacac taaacagtgt tttctactaa acagtgggct ttgtggggac agggtgacac 660
tgtcttccac ttgctctgag ttccccgcag gcatcacccc cttcctcccc actggtgccc 720
cactctctct atctgggtag gttgcaggcc ccctcacagt tctacctgga acgtgctgtg 780
gtcagcgcag gcaggagctg gctggccttt gtaagactgg ccaactagag cgatgcaaag 840
ccggcctggc accaacccgg gctgctctgc agaaagctag ctgatttcca gcctgagcag 900
gtgcctgtga ctccaggggc agggtctctg tcagacgcac ctctatccat ccttcatctt 960
atccctatgt tctgactgtt aaatggcaac tgagtgagga ggggaaggaa ggcagaggag 1020
gggtctgaga gggatttgag tgttcccagg cccttgcaga ggctgtcccg ggtctggagg 1080
gcttcagcca gggtgtccta tgtaacacag gatcctcaga tagcaggtac tgttaaagag 1140
gaggccatca cacctgtgca tttgagacca tgccaaagca aaaggtgtca acacccgcat 1200
tttactgcat ggaaatgtag ttcgttcctt ttcaaccttt tgtatcgtgg ggctgaagag 1260
atgatgtgaa aggactttaa aaactccact aggcttctct gctttgttca ctgtagaagg 1320
tcacagggag ttcaagaaaa caggctaggg ataggaggat gctcatgtgc ttctcttgtg 1380
agcggtggca gggccagctc cgtctcaaag caggctttat ctagaaactg gtgaggtggc 1440
aggagcttag gaggagggag aaattgattt aaatattttc attaaacact ccctcactga 1500
tggtaatttc acttgctctc tccctcttag ccccccacac ttcagaacag gagagagagg 1560
atactcgcat acacacacat ttaagtgcag gcacacacat agatatgtat ttctaaacca 1620
tttttcctgt gaatacaatg atgtgctccg atatatactt aagccagtct tactattaaa 1680
ccatctcttc taaaaaatat gatcaaaaca cagttgttct aaaagcaaac tctaaaagac 1740
tgacctagtc tctgacaatg agtttgaaaa agtgcagctc ttggtgttgt ctgcaaaccc 1800
aacactattt gttgacttga caggcaagac agacaaaccc tcaaagttaa tggtttctct 1860
attcgtttac tctgtaagtg ctctctgcat tcaagcgaga tactgcattg gctgacacat 1920
taaatatgct gagactcttc cagaacgcag caggcagaca acccacggtc aacagtgggg 1980
gaatggtatt tgtctggctt agttatctcc aaatgtctag agagagaata atagtatata 2040
atggtgcatg gaaaacaccc atgagccttg gtgtgttatt agtagtagtt actttatagt 2100
gggtaatgac aaaataaagg tagcttccag tttctgaagg tttactatgt gtggatgtaa 2160
cccttgctaa tcaccacctt agttaatcca aacaacagtc ccatgaagta tgactattat 2220
tatccccatt ttacagacaa acaaaatgag gactacagag gttaataact tgccccaagt 2280
catggtacca aagggtttgg gagccattat ttcagtcaaa ttctaaccaa gtgtgcttag 2340
ccatcgtgcc agaggttcca aggaaggagt ttgcttgttt gttttattta tatcacttga 2400
tgaaataaaa ctaccattcc cattacatat aaaacctcct atagatgcct ccttagcatg 2460
ctgtgtgatt ccactaagct gttgatagac acagtcctcg gggctggggg tgtgggtcat 2520
ttgttagcat gcatgaggtc ttgggtttga tccccagcac tgataaagct ggcatggtga 2580
tgtatgcctg tcaccccagg acttcagaga tggaggaagc cattcagtgc catcaccagc 2640
tacataatga gtaagaaaga gaccagcctg gaacacatgg cattttatct taaaaaaaaa 2700
aaaagacatt cgttttgaca tgtatatttt ttgcttttgt aaattttcaa gggaatgttt 2760
cacccagaag ctttgcactg ctgatggtac acgtctgaaa tgtcagcaat ccagaggctg 2820
aggcaggagg attattgagt tccaggtcag ctgggtctaa acacaggagg aaagtagagc 2880
tttgagtgga caccatgttc agatgctcaa tgatcttcag agttatgctt ttggcagaca 2940
ccacaccaac agaaaaacaa gaacaacaat tgccttcaaa gggagggcag ccttgtgaag 3000
ctctgattca aaggagaatt gtcctttgga gtctgaatga atttggaccg ctctttctga 3060
gcctttccaa ttctactggc atccacaact gaaaacaaac agcggtgccc tgattgccac 3120
agacactctc tgctgggcag acagcacacc gcagttccca ggctgttctg ccagcatctc 3180
tcaggtgttc agcctgggtg gggaattgca acatgtgtag caagccaggt ggccctgcag 3240
agcctgtctc caacttcgat gctgctgggg acacaaagaa cattagggca tggagtggct 3300
ctgtcagtct ctgtgaggga agcccttgct caccacataa catcattccc taggtgtgtt 3360
cctgcacata tcctaatttg ttttaactct gtatttatag tgagaattgt taagagaatc 3420
ttaggactga gcaggactga accagacaga gacagcagtt ccatgttgcc agacagatct 3480
tacacaggct tagcctggtc gcagccacca gaccaggtcc ctgttcagtg agaggtggaa 3540
agaaatacac atggattttt tttttcattt tttgctttgt aaatcatgtg ggagatggaa 3600
aagtttacac atagattttt tttttctttt cgttatttgt tttataagtc attactcact 3660
agcctaggct agcttggagc actctctgta gctcaggctg gccttgaact cttagcatct 3720
cagcttcagc ctcctgagaa ctgggattac atagctatga tactatacct ggcgcccaga 3780
tgtgtttaaa agcctcaact tcccaataga cctagacgct cctttctcag tctgaaggac 3840
acaaatgtac ctcaatctac aaacttaatc acaaatctct caagggtgtt tctgaaactt 3900
cagagcactt tggaacaaac tttcctagtg gggaggtttg tttcttcact catttaactg 3960
gcaaagtcac aactatacaa cttcatttat ttatataatt ctatctaact aatggaaata 4020
agaggtgagg ttagagaaga ggaataactt ttaatattct gtagtaaagt agtgaag 4077
<210> 18
<211> 1501
<212> DNA
<213> Mus musculus
<400> 18
gacttgcagt cttcaagaac ggatgatgcc ccaggcaaaa ggggtatcct accctgccac 60
ttagtgggcc ccaaaggaga ggcttctgct ctagggcaaa gcttcatttc cctcttcctt 120
tgagctcact tatttggaat gagtatgtct gccccttgcc tgccctatca tggtcttttg 180
ggaacacaca acaaacctgg ttttgccggt tcacagccag aggacggatt cccttctaca 240
tgggtctgcc tataccagat gatgtgatac tgtgttgact tgggacttgg agtggtttgg 300
gcatgggtta agactttggg ccagttggga tggggtaagt gcgtttagca tgtgaggatg 360
ctaaatatga acttggggga catagagaat atggagttat agacccagtg gtatccttcc 420
agatttgtaa ttaaatctgt acagttcaat acctcaaaat gtgactatat ttggagacag 480
ggcttccatg gggagatgac attgaaatgg ggccgtcagg atggactcta acctgaatga 540
tgtctttgta agagaatcat tagctacaaa gagagcccag gggcacacac ttagaaagga 600
tcccacaagg acacaggaag ggagtggaca tgtgcaaggc aggcagaggc ctcctgagaa 660
atcggttctg tctgcacctt gatcttggat atccagcctc tagaattatg aatgcattgc 720
cttctttgac aaatctgtat ctaaaagaaa ggagggtgtt atttgtttta gctcaagttc 780
tagtacaagg tcacttggcc ccttgtgctt gggtggagca tcataacatt tggcagaaga 840
cagccattcg tgtcatagga gataggatgc agaggacaag tggaagggga ggggactgga 900
cacataggca caacacccgt ggtgacctgc ttaccccagc tgggccgata cctcctgaga 960
ttccagcacc atccaaaaca gcaccatgag caggagaaca gatttgagag ccattatgca 1020
tgcaagccat aacagtgagg gaatacattt ctgctaagtc ataagtaata ctgacttcaa 1080
tcttaaaatc ccagggaagc tgatgaagct cagcggtaag gcacttgctg gcgtgctaga 1140
ggctctgggt tcccatccct cccagacaat ttaccagagt cttcccttgg tgttagcagt 1200
tttgggtcct cttgtcttca cattaaaact gacattcaca tggaatgatt tttgctaatg 1260
gtgagaaagg gttcatttta ttctcattaa gagggtcaac taagtaccac acacacacac 1320
acacacacac acacacacac accccacaga ttatttgcag cccctcggtc ttaagtgatg 1380
caattgctgt gcactcctgt cttgcaggct gtgctctgtt ctattggtgg ttcaccagcc 1440
tgtgccaaca ctgactggaa gaacaagctc tctctggttc atcttcacag tcttggttat 1500
t 1501
<210> 19
<211> 1909
<212> DNA
<213> Homo sapiens
<400> 19
gaatgtttac atgtacattt caaacccagt tttctaattg tgcagtctta atttcctagt 60
taatttcact ttacagataa gaagctctgg agacatggcc tttccggtta aagacacaga 120
gcccaggcac tgcccacggc ttcctccaca ctcatgctgc tttcccttag gtaagacaaa 180
cctcaccaaa gctgagactg gctcaagaaa cggggaagcc taatgcttgt aaacattccc 240
ttaattggaa gcattaggca ccaaaattct tcctaaaaaa tatgtaagcc ccaagaatga 300
aagggccatg gttagcacaa accgcacctc ctgagcccag caaaacccaa caggcacagt 360
gcagcacagc ctgggcggtc tctcaggtga gtctctgcct cgctcttgcc ctgtctgtca 420
cctcatctct gccaagtctg aaaatcctga gctccaggga ctgtgggaac ttcactagac 480
atgtgtgaac aactctacat tctgatccgt agcgtctccc taatgatgca catctaggaa 540
ggagagggag ggagagggag cgtgtgcatt ccttggagca acgaggacag cctagtgatt 600
tgcaaactct ttgcggcctc ctggtgggct tcagaatcaa tttgtgagtc ccaaccagaa 660
ttttctacat aattagaata aaacagagtt aagatatgag tgcatcgtat gttgcaagat 720
actgttttgt aaacgttgtt tcagatattt gtgagtgcac atgtgtgtgt gcagtaatgg 780
gtcacaaaat atatttactc tgggtcatgt tttaagaggg ctagaaggca acactaacat 840
aggatggttg gaagatggtc aggctcagaa catcagattt tgcctccttc cagggtacca 900
cttttatcaa gtcacacatt ccttcccgct ctgcttttgt gtttctcaat cgctatccaa 960
atttgcgcag aagtcaggaa tcacgtgggt aaagatttaa gctgtacttc tgtgttaatt 1020
aagcacgttg aagaagaggt gctctggggg aacgtggaga aggtgggtag cgagggctcc 1080
aggggctcag aaggtggcct cgaggggctc tcatctgcca tccttgtgag ggagaaagtc 1140
ctaaaccagt cgtaacattg ccagaacaag gggtcccaat ccagacctcc aaagagggtg 1200
cttggatctc tcatgggaag gaattcaagg tgagtcacaa agtgctgtga gaagagagag 1260
ttttttggaa gttacgcaga tacagagtag ggtgtcctca gaaagcaaga ggaggaactg 1320
cctcgtcttt aagtttttct tacataggag tcctctctat gtaaagacag agctaagctg 1380
tgtctctatg tgggtgggct gacagcgtga caaaatttat tattctgttg atttaaagaa 1440
aactatactc aatattttaa tgtgtaagta catcaagtca taattataat tatcttgaaa 1500
gcatatattg ttatgggtat tgggacctct ggacttttcg ttgtcatatg attgtatcct 1560
tgcaggtatc tttaggctgt ttcttcaact gtaaatatct tatgactgtg ggtcgtgacc 1620
ggcaaggaat ggagttggtt tttaaaatgg tgtcaccctg gctcttctat gctcctgttt 1680
ccctaacagt aatagcccag ccattctctc ccatgttctc ctctgccctc aacttcagaa 1740
tgaagtcaat ttttatttca gccaaaatag gaggattcta ttctgtctgt tgaggtctgc 1800
tgtggtctaa tgatgttaat aaccagtggc tgggcatgat tacacgacga ggattctaaa 1860
tcctgtttca tgtttccctc tgggcccact ggctatatga ccccttaaa 1909
<210> 20
<211> 1201
<212> DNA
<213> Mus musculus
<400> 20
gagtatatat gtttctaagc caggttccta actatgtagt attaatttcc taatgaaaca 60
ccctttacag gtagtgaggc ctttggagac cagggcttta aaggccaagt agctgaagcc 120
cagggtcttt ccatggcttc ttcctatgac tgtttatcta atagatgaga caaacctttt 180
caaaactgat tatcagttaa gttccaagaa agcaccactg taaatgttaa tgttcctttg 240
aaatggaagt atttagcgct ctgtgtgtgt gtgtgagtgt gtgtgtgttg tgcagttggg 300
tacatatatg cagatatgca caattgtttg tgtttgtggg tctttgtgtg tgtgtgcagg 360
tctaaagttt ttcttttcat tagttatggt ctaaagtggt tttaaaaaaa gaaaaagaag 420
agcagagaag gctatgatag catgaggttc ctttgggatt gtctggctta gaacgctagg 480
ttttcccatg ttttaacagc ttcccatgtc cttcccactc tgcctttgtc tttctcattg 540
tgatccagat ttgccccaga gggggagaac ccagtaggta agagttcacg ctgtacttcc 600
atgttaatta agtgatgtgg aagtcttgga aaggctgggc agtttttcct gtcttcccag 660
gagctggggg aggttcatcc ttaatggaac cagttccatg ccatccccag gaggcaagaa 720
gtctggaaac atcaataatt attcagtcac aacaacccac tttcctctct ccccctaatc 780
ctcaactgct gacttcagga caaagtccat ctgatttcaa tcagatagga agactagtta 840
gaggcctgcc ccagtttact ggctgcagca acaggaagca caggttacaa taccaagtga 900
ttccacgctg aaagcttcac tctgatcatc ctaccaggct gctacatgag cccttgaaag 960
cgaattatcc ccggagactt actttctata taacacatat atacttacat atacatgtcg 1020
actttgtttt ttcttgtatg ctgtaaagat gcctaggata catttaagga tgcaacataa 1080
aagtcacttt cttcatggag taattattat aatagtactt gtttctgggg gagcaaattg 1140
aaatgtttcc cagtgtgaac tgccaagtta aaacaacaaa aagctagttg gagctccccc 1200
t 1201
<210> 21
<211> 3995
<212> DNA
<213> Homo sapiens
<400> 21
ctaacatagg gtcgttagtg tcagaactga attaaattgt aggacatgca ggtggtgact 60
gcagagaatt ggagcattgc ttggagtgaa aaccaagccc acatatttgg tgtcaaaagt 120
gttatacaag tagaaaaaca ggttctcttt aatggaatat tattcagccg tattaaggaa 180
tgaggttcag acccatacta cagcacatat gaatctccaa aatattgtgt ttagtgaaat 240
aatatagaca caaaggacaa atactgtata attgcactta catgaggtgc ctggaatagg 300
caaatccata gagacaggca gtagaatcat ggttgccagg ggctgggcgg gagggagaat 360
ggagagttag tgcttaatgg gtacagagtt tctgtttaga ggtgatgaaa acagtttgga 420
aatagtggtg atgattgtac tatattgtga atgtatgtaa tgccactcac cgaacactct 480
aaagtgtttg aaatagcaaa tttctattat acgtatttta ccatagtttt taagttaatt 540
accatagttt ttaaaagtta ataggataat attccctgaa ccactataca ctttagattg 600
gtacactgtg tggcatgtgc attatatctc aatgaagttg ttaaaaacaa gatttaaaag 660
cagagattgg gtaaagtaaa ggtttgctct gtgctgagct gtgtggcatg tggacctgtt 720
ttcccaggag ggagcactcc tggggttttg gccgcagctg cacatcagcc ccctgtgcag 780
aggaggtatg gtgtgtgatc tggagattag ctgtttctag tgcagtattt acatttaaag 840
acattgctga gttaggcaga attttctata tccatttgta ttttgcttgg cattcacttt 900
cttacaaaaa tggacaatca agacaaagaa aacaaaaggt ccaattacta ctcttcattt 960
caccccaaag caaaacaata ttagttttca attttttttt cccatagaaa gcaataacag 1020
tcccatacta cctcctcttc catgaaagta gtgcttgaga tgccccaagg aaaaaccatt 1080
ctttccaaag atgaaagact ttgtacctgt caggtgaaga gatggaataa atgccactcc 1140
tagtgggtgt gggacttgtg cagcccctgg tccccagtta tctgcttatc agaatgtggt 1200
ttgcatatca cctttagcgg aattccttgg gatgcttgta attctggggg agatgtctgg 1260
agtctgcatt tttagccagt actcctatga cttaggcaca gtagggaacc actggtgcca 1320
ttccttcctt cctttcttcc ttccttcctt ccttccttct ttccttcctt ccttccttcc 1380
tccctccctc cgtccttccc tccctccttc tttctctctt tctttctttc ttcggagtct 1440
cactctgtca cccaagctgg attgcaatgg tgtgatcttg gctcactgca acctctgtct 1500
tctgggttca agtgattctc ctgcctcagc ctgctcagta gctggtatta taggtgtgca 1560
ccaccacacc cagctaattt ttttggattt tagtggaggg gtttcaccac gttgagcagg 1620
ctgatcttga actcctggct tcaaatgatc cacccgcctc agcctcccaa agtacttgga 1680
ttacaggcgt gaaccactgc gccctgctgc aatgcttttg ctttccgtat acaaggaggg 1740
gttgcaggct tgactctaaa atgattgact ttatggagga ccgtctcatg tctggatggt 1800
aagtgatagg ggagggggca accctaaatg ggatcccaat gacttgatga aagactggaa 1860
gatgagacac tttcaggtgt gcataatgga agacttacgt aggactagga ccaagcctct 1920
caattatact aagttgtcca tgattgacca gggatttgat gaaaatccca ctgccttcct 1980
agaaaggtta agagaggcct tggtaaagca cacctctcta tctcctgatt cagtcaaggg 2040
acagctaatc ctaaaggatg aatttggctg ggcatggtgg ctcatgcgtg taatcccagc 2100
actttgggag gctgaggtgg gaggatcacc tgaggtcaag agtttgagac cagccttgtc 2160
aacgtggtga aaccctgtct ctactaaaaa tacaaaaaaa attagctggg tgtggtggca 2220
ggtgcctgta atctcagcta ctcgggaggt ggaggcagga gaattgtttg aatctgggag 2280
gcagaggttt gcagggaacc tagatcgcac cattgcactc caacctgggt gacaagcaaa 2340
actccatctc aaaaaaataa aagggataaa tttattactc aagctgcccg atatcaggag 2400
gaagttgcag aaaggggccc tgggtccaga aagtacatta gaggacctcc tgaaaatggc 2460
caccttggtc ttttatgatt gagacaggga ggcctgggaa agagagagga gatacaggta 2520
ttccagggtg cacctgttaa cttctaaaga tatggcaaga acagttctct ctcttctaaa 2580
gtttatctgc ccccgtacaa ggtttaattt ctttcaccag ggtgaaacag cttggagtac 2640
aatgttgttg ttagtatatt tcacttatct ctgttggcac taaattcttt ccttgtataa 2700
tacacatgtt taacttatgc atacttgacc ttataaaact tgtttttttc tctcatgcct 2760
agaagccatc aaactccaaa tggtcaggca actggagcct cagatgatag ctcccctttg 2820
ctaggaaccc ttaaatagac ctctgggagg actctgactg ccattttctc caaaacaaca 2880
ccccttgtca gcaggaagca gcaagactgg tcatcaacca tattctaacg gcagtattcc 2940
tatgatttag ccagtgggcc gtgaccggca aggaatgtgc cttgttagtt tcaagatgga 3000
gttgattttt aaaatcatgt caccctggct cttctatgct cctgttcccc taacagtaat 3060
agcccagcca ttctctgcca tgttttcctc tgcccccagc ttccgaatga agtcaatttt 3120
tatttcttca acgtacctct tcagagggga aattatacag gaggggggca gggaagtgct 3180
gggtagagaa aggtggatcc ccagctaggg ttccaccccc acagacctag gtgaggaaag 3240
gcacttctgg cttcacaccc aaatgttgca ttttcgaaga ccaacctggc ctgccatgcc 3300
cccattctgg gcctataaaa acccaccacc ctagcggaca gacacacagg tggccagacg 3360
tcaagaacag cacatcagca gttgaagaca caaaagggtg gacgacaaga aggcatcaca 3420
agagaacgtc aagggagcac gccgatggaa gaacctgctg gcaggctatc cactgttggc 3480
atgaggggga gtttggctgg ggcagtcaga gaagagcccg gctgcatagc ggcccaattc 3540
caggggaaaa ccatctctct tttggctccc ccggcagaga gctacttctg ctcaataaaa 3600
cttggctttt attcaccaag cccaggtgtg atccgattct tccggtacac caaagcaaga 3660
atccctctgt ccttgtgaca aggtagaggg tctaattgag ctggttaata caagccacct 3720
atagagagca aactaagaaa gcaccctgta acacaggccc actggggctt caggagctgt 3780
aaacattcac ccctagacac tgccgtgggg tcggagcccc ccagcctgcc tatctgtatg 3840
ctcccctaga ggtttgtgca gtgaggcact gaggaagtga gccatactcc catccacgcc 3900
ctacaaaggg gataagggaa tctttcctgt ttcataagta gcaatctctg tggtaacagc 3960
ccctgtggtg atgccgtctc tctcggttct gccct 3995
<210> 22
<211> 1651
<212> DNA
<213> Mus musculus
<400> 22
tccttggcta ctttctctag ctcctccatt gggagcccta tgatccatcc attagctgac 60
tgatgacact gcattcttta atatatgggg tttgcactaa cttggggtag ttattgtcat 120
gtttgaacta aattatagga cctccagttg ctggagaatt gctctgtgtg gactgtccac 180
acatatttgg tttctaaaat gtcatataag cagacactgc agtttctcca cagtggaatc 240
ttacccgggc ataataaggg aagacattcg gcacaagctt caacacaggt gaaccttaga 300
aaacatgcta gtgaaataat ccacacccca aaggacaaac aggaaatgat tcttatacaa 360
gacacctggc agaggccagc ttaaagagac aggcagaaga tgtgagtccc aaggactgcg 420
gaggggaaat gacagccagt gttttgtggg tgctgagggc aacagtttgg agtagacaat 480
ggtgatgcag ggctgtgaac gggctcagtg ccgctcactg aaccaaacag cctaagtgtt 540
tataataaca aaagtaatac tgacatacac cttccgttgt ttgaaagagt taataaggta 600
acattcccca aatcacttta aacaggcaaa ctatgtgaaa tataaatctg tttctgtgaa 660
gctgcttttt taaatgcttc tcctatcaga ggtcagaaga aagaaggctt gctgggagtg 720
gagttggctg tgtatctcag acctgttttt gcaggaggag tgtgcgctcc gggatttggc 780
agcggctcga gtcatccctg tgagaggcag gcatggtgcg tgatcctggg gcttttctgt 840
ttctagtgtt ctatttattt taaagacatt gctgagttca gcagaaatgt ttcacatcca 900
tttgtatttt ccttggtact catttcctta caaaaatgac gatcaaagca aagaaaacag 960
agaatcttca ttttacccca aagcaaagtg agtgcacttc taataccata acagaaaaaa 1020
cgcttcgggc ccttaggaag tgctgaagaa gctgggcaag gtggtgggtg cctttagacc 1080
caaaggaaag tgattttctc caaatgtgag aggcctgcga tgatggggtg agtggccccc 1140
agaggatgtg gggactgact agcgctgtct ccgtctgtat gcccagtgaa gctgtgggtg 1200
ggacacaatt aacagcacaa gtctgagtgg tgagaccctc tgctgtgacg aaccctgcac 1260
tgatgttact gttgaaggta tctctcaagt gctcatgctg gaaactaagc ccccagtttc 1320
tagttgatgt tgtttggagg tgggatctta tgggagggga ttaggattag atgatgtcat 1380
aggggtgggg cctccacaat ggcattaatt gctttagagg aagcagacaa gaccaaacta 1440
gcacatttac gctgtcttac cgtgagagta atctgccatc ttctgaggca ggtgagttga 1500
tatcaccaga tgcccacacc atgcatttgg gctccacagt ctccagaatc ataggttttg 1560
aacctttatt ctttataagt tttctagact ggggcattct gttacagcag caagaactag 1620
actaatatac atccctcctt ccatctgccc a 1651
<210> 23
<211> 751
<212> DNA
<213> Mus musculus
<400> 23
tgtgtgcacc agctttgact gctgctggag gctgcccatt tcctgtgatc tcaaccagct 60
tttctgatag gccagtttat ctctggactc tggcctatgc ctgatacaga tgtaatcagg 120
catccaggaa gctatctata tggaggcaaa ggtcctttta ttcaggccac tggaagcctc 180
ttccataaag ttcagtagta cgagtacagt gtcctttcct gtgtacagcc cctcgctttc 240
tcttctggac tcccagctga gccagtgttt gagccaccca tcactctgaa aacagcatct 300
tcatctcctt aggctcagct tctcaagtca cacaggctac attgctgccc tcagggtgag 360
cctcccttca ttcatctcgg tgataattct aaacaatggc ctgtgtgtta tagaaaggcc 420
ctgcaagcat acatgttatc aacttactag ctgtgcccaa ggttgcatag ctagtaagtg 480
gtaagactga aatttgagcc taggggacca taactctaaa caatgttcta tccactaggc 540
ggtactgtgt agaccatggg ctcacacaca cacacacaca cacacacaca aaatgtattg 600
aataaaataa ttgtgggttt tgcatatttt cctgttttat gtcagcttga cacaagctag 660
aatcatttgt gaagagggac tctcaattga gaaaatgctt ccactttttg ttgttttgtt 720
tgttgttttt gcctgtcgga aagtctgcac t 751
<210> 24
<211> 490
<212> DNA
<213> Homo sapiens
<400> 24
ctgtggagtg cctatagcac tgtgtgtagg cagaatgcaa aggggacagt gtgggtgggg 60
acagtgttgg tgtagaaatg gcggggaggt tagattgcag gcacagaggg cctcagccat 120
ctcgagagcc cagacttcct ccctgaggtg atggcacttg gggaagtcag tcatggaagg 180
attttaagaa agatgtgaaa ggggcaggtt tctattttca gaaaaccatt ctgggccagt 240
ggaagatgga gtacacagga ccacaccttg gtgaagggag attgtaggag cctgggcttg 300
gtggcggggg acagtggaga gaacagcctg ggatgtatga acatggcaag tctcccttcc 360
tggacagtgg ggtttgccta tggtggacag aaggtgagat catcctttga aaaatgccac 420
ttcatagtgt ttccccagct gtgggccttc actcattgga gggtcaaata atcaatgtat 480
taggttgcaa 490
<210> 25
<211> 1505
<212> DNA
<213> Mus musculus
<400> 25
tcccagagaa cctaagcctg attcccagca cccaaaggac tgcttacaac caactgaaac 60
tccagttcag ggatccaaca ccctcttctg gcctctgtag gcaccaggct tgcatgtggt 120
acccagacat tcgtgcaagc aaaacactca tacatataaa aatagataaa taaatgccta 180
tttaaaaccc ttgcctcatc tgaaattatc tgaatgttga tttctttgga ttccctttcc 240
ttttgccctt gggaaaaata ggtcacccct gtgtcagtta ctgtatgttt tggtcactgt 300
tcatagtttt agagaggatg tctaggaggg cagggtcacc tgtggtgtgg caattgggag 360
ctccatgtgc agaaggaatg cagacacagc agcagagagt gcaggaggcc cggaaggttc 420
caccatcccc acagccccac ttcctccctc tgccgaaggg gttgggggtc aggcagaggc 480
tttaagaggg gcgtggacag ggtagatttc tgttttggga aaaccatcta tcagagggca 540
gaggacaggg tggaacccaa cacagctgag agcttgcaag gggctgggct gggcagcagt 600
gaagaggaac ctcacaggga ggagcccctg gggtgcaggg gctctgaaac tgccctgtga 660
aaaacactgc ctcattgtct tggcagtttg ggccctgacc cagtagcagc aggtcagaca 720
attgttatat aaagttccga aaattcaaac ctcccccttc ctccttcatc cttcttagct 780
acacgtgtgt ccatgagtgg cagagcaggc actcacatag aggtgtgccc actgcagcgg 840
ctacagcact aaagaaaatc cctctctccc cttcctctcc ccctttcttt tacttcaaag 900
cagagtctta ctatagggcc cggcccctgt gggctgctca cttttaatcc tctgccttgg 960
cctatctagc actgagatca cacacctgcc tgtgtcacta tgcctggctt ccagcacttc 1020
tttgagtgct gacagacacc tcaagtggaa aattcttgtc cttgcttcat ttgacagatc 1080
acagtgaaaa tgggagccca ctaaaaatac tttataggat taccctcggg ctgtgtctga 1140
ggcgggtagg taacataagg aatttcaggg ttagacttta gtcctgtcac caagacatct 1200
atctctttat acatataaaa gtattccaca gtctgaaaaa agctctgaaa tagagaatgc 1260
ttcttgtcca tagcatcata gatagagacc cttcagactt gtatataaaa cagaattgaa 1320
aagtcaattc aggtgtgcac acacacatgc atgcacgcac cagcacgcct gacatctctc 1380
agggctgccg ggcatcactc aggtgactgc ttgacgtgtt gatgtttgtg tctttggctt 1440
cttctttgag tcttttgttt ttcttctttt attttattta tgagacaggg ttgagttcat 1500
tgcat 1505
<210> 26
<211> 1840
<212> DNA
<213> Homo sapiens
<400> 26
cacaccattg catgcttcag ccgttgcccg tgctatttcc tcccttggaa agccctctac 60
tgtgaggccc tcacctctca accctctccc tggcccccat gttgtctatg tgatttcttg 120
ccatttaaaa atctacccag gtgtcagcgc ttgggcagtt tcctcacacc tctcacccag 180
ttcatcctcc cttgcttggt gctatttctg cccttgtcca tatccccacc acagcatgca 240
ctttggattc caggcacgct ccttgagtgt gaccccgagg ccctctgtgg gctcttggag 300
cagggcaaag ctgggtgtgc tggggcgcag cacgggcctg atgccctgag gttgtttgtt 360
gtgctgggct ggaggcgttc gaagaaacgt ccaaggaggc tgctagactc agttctttct 420
ttctgttttc cctccacctc ctctgctagt ggaagctcca tgtctcccag gctcgtgagc 480
tggcaaacac cccgcttgca tggttcagtg ttgtcgttgg cggcaggcgt acgtggaagg 540
ccagttacag agggtctcta gggctaatgc atttcacaac acaccgccct ctgacactcc 600
acgctctgct tttcctccag aaccactccc tttgcaaaac tctgtttcaa acaaaaagag 660
cacaaagagg ctgaccgtgc cttcctccaa ccaagctccc ctctccacag gtgcacagca 720
agagcccttt gtctgtgatg ggacaggcct gggctccagt gagcaagaca ggcactgtgg 780
gcccatccaa atattaactg tggacacttt cctactttga aaacatgaga ctttgtactc 840
agagccctgc cctccagaga acacaattac ttctgttttt cttttcctag tggaaggagg 900
cttgacactg gtgatggcct tgcctttaca atgctcaggg tttgggaaag tcagggccta 960
gggctgctga tctccaggca ctgtctgctt tccatctatc ctctctgctt ggtccctgaa 1020
aagcaggagg gagacaggag gaatgggagc atgaatgccc tcagggtcca cgggggatcc 1080
cggaaggcct agaacaccag gggtctgggc tccacccatg atggatcatg cctttggggg 1140
aagattggcc tacactcatg tcaagtaata agttttactt cctgcacctg gtgttaggtt 1200
ggttctaaga tgcagctgta acctgtgact aagatcaata tttttcatgt cactatctga 1260
tcatacaatg gtcaatttat cgatttagaa aattgttgca caacgaggca acaccgagtc 1320
atgacttaaa aaaaaaaaaa gtggatctaa ccgaagctag attgtggctt atcacctttg 1380
attgtcagtt tcttgggtca aatcttaatg ccacattgac cactgtgtca agagaggcca 1440
ggttccaact cagctccgtg tatagtgttc atggaatctc aatgctcatc aggcgctgct 1500
ggggctgggc ctcggggagg ggcaggctcc tgtcagcaca agtcaccagc acaggtttta 1560
accagccagt ctgggctact tttaccactg aagcagtggg gcgagaaact ctattttaca 1620
gtgtttctaa aacctctgtg agctaaaagt agaagcaact caaatgcccc tcacctgatg 1680
aataaacaaa cacagtgtgg catcctcgta caatggagta ttattcagcc atagaaaggg 1740
aggaaatagt tgtgctcgat acagtatgga tgaggcttgg agacatgatg ataagtgaaa 1800
agaagccaat cacaaaagga caaataatgt atgattccat 1840
<210> 27
<211> 1451
<212> DNA
<213> Mus musculus
<400> 27
taagccatca catgcttcaa ccatgggcta cttccacctg ctcccccccc ccccacacac 60
acacactgct acccctcacc cccagcttgg tgcctcactt ctcaggctat aatgctgctt 120
tcatggacat tccttgttct ttggaaacaa gggcccttcc ctctgcagag ttctcctgcc 180
tgaggctgtg tgttcttggt ttgtgggcct ttgcccagct ggtgcccagt gcaaggtgcc 240
ctgctaactg aacaaatgac cttgctcatc gtcatcttct tggtctccat ctttgtggtg 300
gagccttctg gaccaccggc aggtaccctt tgcaggacag cctatcctgc cctgtctccc 360
tacagagcca ctccctgaag ctgcagaaaa caagagagca tagaggtgac cctctccaca 420
ggtgtgtggc cagagccact catccacagt ggccaggccc atccaaatat taatgatggg 480
tgttttctgc tttgaagttg agaatgtcgg tcctcaagag tccaccctga agagaacaca 540
accacatctg tttccttcca gggaacaggg gctgcactgc ccttcttctc tgtccgtgcc 600
cagagcatgt atctgagcat gcccagagcc aaacacagca tctatttcct actgatcttc 660
acagctggac aggctcccac acagccagat gctccctggg gagcctcaaa agcaaggttc 720
accaggtgga gctctgggga aattgctttc aactctgtct tggcagggct tgccttctgc 780
acctggcttt aggagggctc caagatgcag cataacatgg gacggatatc aacgcttctg 840
tctgatctta taacaaaggt caatttgtaa agttgatacc accaagtcct ttcttccttc 900
ctttcttcca caccccgtcc tctctgagaa aatggatcca atagaagcta gagtgtgact 960
tgtaggttct gactgtcact tctttggggt gaattttaat gccaaatcag ccaggggcga 1020
agctgaggag agccaagttc acacacagtt cagcacgaag ttttaattca gtcccatccg 1080
tccgaatctg cactgctgtg ggtgggttaa agggagagca ggctcctgac agcatgtgct 1140
ccagcacagg tgagtctgtc acactttttc ctacagctgc caggcaagac gtcaagtcta 1200
cttaaggttt cttatgcctg gaatcgccta aaacgtaaag caatcaaaat gtctatcacc 1260
caaagagtag ccagacaaaa cacagcaggt ccttttatga agagtcctgt gtcacaagac 1320
acaggaatat caattctcag ccattaaaag gcacgctgta atgacactgg ccacgatatg 1380
ccacatctta gaaatattac aataagtcaa agaagccagc agcaaaaggc taactaatgt 1440
attatttcca t 1451
<210> 28
<211> 6212
<212> DNA
<213> Homo sapiens
<400> 28
ctctaggtgg tgaaaatgac cagatttggt tgtggggtca tagtggacac taaagatcag 60
caagggaaaa aagatgtgac tataaacttt ccattctcac agttgttttg agacccgagt 120
gtacgtttaa tgttttcaac agaagaggct gcatgaagaa gagtaagtta accgcgggga 180
ggctgtgaga atttttctgc gcggacaatg gagctcagtg tctgtttcag tgtttgtgct 240
ctctatagat acctggatga ttcttgggcc tcagtgtgtt ctcgctccct ccctgccgag 300
actcaaaggg atgatgcacg ctgcccagcc aaaaccagga cagaacgtct ttttccccgt 360
gggaatgcgc tcccggcgcc aattccaagg cctgcctggg tcctattcag gcagtgctgg 420
ggtgagcagc aggctcgggc ccagctgaca cggccagaga tccccagtga ctactttcct 480
gacatggcag agatggcaga tggagaatcc ataagcccca gttacacccg ggagctcaca 540
ctgtggcttc agtctccaag gagagtgggg agagccctgg ccctccgtga aggattgctt 600
ccgcccaagg ggggccagtg aacccgaatc actctgctgg atggtgctgg ggggctgatg 660
caatctgcat tccttcccct cgcacccctt acccctcgct acctccccct tctcatcctc 720
cccactcgca cctctccttc tcccacacct ggctgacacc cactcttgag tcactgtcag 780
ctccaagaca gaaccggcat cctgggtgct tggcaggagc caaaggagca tgttacagga 840
tctctggctt cacagatggg gagagagcag ttcagagaat tgcgggttcc acatttgctt 900
gaagtcactc atcagccttt atgttacatt acaacaaagc agcccagggg acatggactc 960
atagggtacc tggtgtttcc ccaactgtag gggggattcc gggacaaata aagtttgcca 1020
ctgggaccct cccccgaact gtgccctgtc ccactcctgt gacacactct ctgcccacaa 1080
gagagtggcc aacagtggag gctgagagtg accacctgcc tgccctcagt tattaaaggc 1140
tactggagaa caagccttga gtgcgtgctg agaacacatg cccctagctg ccatcaaaga 1200
gaatcacttc atatgatttt gaccataagc aaactcttcc accttcattt tttaaaataa 1260
cggctttatt gagatatgca tcacttacca tgaaactcac tcttttaaag tgtacaaccc 1320
agggttttca gtgtattcac ggaattgtgc aaccatcacc catcacccct aatttcagga 1380
catttttatc actccaaaaa gaaactttgc acacatcatt cttctctccc cacagcctct 1440
gacaactgct gatctatttt gtctctatgg atttagcagt catggacatt tcatatacat 1500
ggaatcatac actatatgtc ctttcatgac tgacatctgt cacttagcat gattttatga 1560
gattcatcat gttggagcat gcacccatgc ttccatcctt tctttttttt ttttcacagt 1620
cttgctctgt cgtgcaggct gaagtgcaat ggcacgattt tggctcactg caacctctgc 1680
ctcccaggtt caagccattc tcctgcctca gcctcccagg tagctgggac tacaggtatg 1740
tgccactatg cctggctaat ttttttgtat ttttagtaga gatggagttt caccatgctg 1800
gccaggctgg tctcaaactc ctgacctcaa gtgatctgcc cgcttcggcc tcccaaagtg 1860
ctgggattac agacgtgagc caccacatcc tttctaaggc tgaatagtat tgcactgtat 1920
ggatagacca catttagttt atctgcctgc tggcttatgg acaatgagtc actccacttt 1980
ttggctacta tgaatcatgc tgttgtgagc acttgtgtac atgtctttat atggatgtct 2040
gttttccctt ccattgggtt tgcttggggg tggaattgct gggccacctt ctttctccat 2100
gagtggagca tgcctatgcg cccatccccg catctcccat gtgtggaggc actgcccaag 2160
ctcgtctgta ctctgagtca cagggctgtg caccattacc gatcaccatc tatgggtcag 2220
ggacttatca atgagcaaga catagcccct gccatcacta actcacattc tgcatcgtcc 2280
tgtgccatcc ccaccacccc accttggtca ggcccagtgt ccaggtgtct tcaactgctc 2340
accttccccc tattttgttg ccctgaagtt catccagaca tcagggtgcc ctattgaaaa 2400
tgctagttaa tatgacctct ctgctctaac cccaatgttg gagtcttgtc atcagtggga 2460
tagagctggt gtgactgcac cagaccagtc aggttcaact tttatgaaag gaagttgtga 2520
gttgctttca gttgccatgg accccaagtc gtaggtcatg taagctgagc atgcccaaac 2580
ggaccaagca tgcaaccatg ggcagaacct gagtgctcag actgaggagc aggggctgaa 2640
ttaagaagca gagcatacat ggcaggatcc aggatccagg agccaatcag actgagtttg 2700
gcatcactcc atggcaggat ccaatcagat cacacctccc tgcagcacct cattgcaaga 2760
tccaatcaga ccacacctca ttaccctagg cttataaaat ccaggccagc cgctagcttg 2820
gggaggcaga tttgagtgtt tttttttttc tgtctccttg ccagactacc agcaaaaaag 2880
gttttctttt ctcaaaagcc ggtgtcatgg tattggcctc tgtgcacatt gggcagtgag 2940
cccactgatt gctcagtaac atgggcacac tctggggccc acacaagcca ggaatgatgt 3000
ggcctttacc tgctgctcca gctgcatctg agcccagtat cccctgaaca caaaccccca 3060
cctgcatgga gctgcatgcg gttctcgggt acctcctggc tatgttcagc tcctgtagat 3120
tccttcagat ccactccttc ccatttcctc atccaactgc ccagcagagt gcctactatg 3180
cgccacacac tgggattcag cagtaaacga cacaaacatg atccccaccc ttatccttct 3240
cccaggactc ttattaatct aaggctcacc tcccttcttg taacttccat gaactcatat 3300
gctccctctc agctcaggga cgttgctgga ggaagcaaga gagcagcaga tgaaccctta 3360
tgttcaggag gcagatggag ctcattcaaa gcccaccttg gcctcttctt aacccgaaga 3420
ttttagcaag tcatataacc tttgaactgc aactccctgg attgtggaat gcccaaagtg 3480
tgctgagcgt gaagtaaata atgcaagtgt aaagtgtgcg gcatggtcct ggttcatctc 3540
aggaggccgt taggaaacta gcacttattt ttgccagggc ttgagcatag aacatactaa 3600
tttccccaat ggcattatca cattgtatta ctttttattt acatgttctt tctcccctac 3660
caatctcaga gaatctcaag ggcagcaatg attaattatt aattttggaa tccttggttc 3720
ctggcacatt ccttgaaaat aaatcattgg cttactttcc actgattctc ttaattaccc 3780
ctgagaggca gagattggaa ttatactatg ctgagcagct caatgttttc ccagtaacag 3840
caggaaaatc ccaatgcaca gagaaggaac ctgaatgact taggtgggac acaccaggac 3900
agacacccgt ggtgatgaca ttctgtgccc ttcatcccac agagtggtct gtcttcacag 3960
tggtctcccc tcaccacact gagccctcaa acttcctctt tccgctgacc aaagtgcacc 4020
caggcctgct tgtccattca gacagatgcc agggccctct gcactccatc tgacctctgc 4080
aatatgccgg ttcctaataa gggagcagga tccaggtcca gttgttcaca cttctaattt 4140
cataccggca gcctcagtaa agttctgcca tcaggctaag gccccactga tcgtcgacct 4200
tttctgcata aagattcacc tccagggctc ttagaaaata ctgctgcctg gctaccaccc 4260
catccttagt gtgacatagg gttttttttt cttcttcttc tgttttttgt tttttttaga 4320
ataattaggc agctctgttg cccaggctgg agtgcagtgg catgatctca gctcactgca 4380
acctctgcct cctggttcaa gcaattctcc tacctcagcc tcttgagtac ctaggactat 4440
aggcacacgc caccatgccc ggctaatttt ttgtattttt agtagagacg gggtttcacc 4500
aggttagcca ggatggtctc aatctcctga ccttgtgatc cgcccacctc agcctcccaa 4560
agtgctggga ttacagacgt gaggcaccac acctggcctg ccccgggttg tttttttttt 4620
taaagctccc cagggatttg taagtgcata ccaaagactg ggaacccctg gcttagctca 4680
cagagcaaag agccttttga gggttcccct cgacagttgc tccctcacct ccagctgtgg 4740
ggccacacag agcgctgggc cattgtggtg ttagagacca gagttaaagg gactccatct 4800
gtaatatcca ggacaaatgg gctggcaggt gctgctcaaa cccttacaca cagatagtat 4860
ttggggaggt gaggtcaatt cccccattat ggaacgctgc ggttttaaaa gcaagcaaac 4920
aaacaaaaac aggaaaaaag tgagcttttt aaaactaagg taaaatttgt cctcaacttc 4980
ctggccttga ttgggctctg ctactagagc ggcagaagca actcacttcc ctgcttccac 5040
ggacctgttt catgtaatgc attttgcaga gatttgaaga cagggtcctt gacttgggca 5100
gctaacagcc tgaggctaga ggcagccacc cctgaacagt gaacaattct gcaaggcgcc 5160
tggcaatagt actatgcggg gagggggtag gaacaaggtg ctgcagggcg gggtggagga 5220
ggaaatgaat tctgcctggg agaagcggga gtgcgtattt gagtggggtc tggagcaggt 5280
gcatgcaaag aagcacctca aaggcacggg caggtgtgtg caggcgtggg caggcgtggg 5340
caggcgtggg aaggcgtggg caggcgtggg caggtgtggg caggcgtggg caggcgtggg 5400
caggcgtggg caggtgtggg caggtgtggg caggcatgtg ggcacggcac agggcttgtc 5460
caggccagat gccattaagc acaggtatct gtggtgggca ggggacacag tggaagcaga 5520
tagagaaggt ttgctggggt cccatggagg ggcgccttgt aggccatggt cactctaggc 5580
tgatgcaagg tgctcaaggt tgaaggcaga ggtgactgac ctgtgcttga gagagggtag 5640
ggaagagaag ctgccggact tgaggggctg aaattgtcct gtaatagtcc aggtcaggag 5700
tgttaatgat gccccagctc gggcagtgac tacggcaagg agagtttaac atgtggttca 5760
gttcagcaga catggggaac tcactatgtg tgaagcagga cacatcacgg aggcagccct 5820
caaatgcttg aagacagtaa tcctgcccct gtgctgtggc gggttcttta aggggtgtga 5880
cttcctcatc agacccattg ctctcacacc taatgatgct gccatgtggc agggctgtgg 5940
gcagagccat gccctagcag gggaagtgga ggacagcggc ggggagggag tgtgggcagg 6000
gctttcctgc cctctgggtc ctctcctctc tttcgtggca gggccttgag gtccattcgc 6060
tgggctgcac agaaggagga ctccagagcc ccccttgggt tcaggatttt atacacgcag 6120
cattccagac agatggaccc gtgtattgac aatgaaagca tgggagaact gtatttcttt 6180
ggtgattaaa gtaaatgcaa aagttatgat gc 6212
<210> 29
<211> 2501
<212> DNA
<213> Mus musculus
<400> 29
cctcagctgg aattaaccct acacagttcc tcagagccta gggcttagta aaaaggccaa 60
gcctgaccta tgacctctct gacatctgtc cttagcacgt gttcttttct ttccaagtac 120
attgtaccac catgatggcc tgtgccctcc tccccatcac ctccatacaa cgaatgagct 180
ctcatgagag cagagtggag gctggtgctg tggcctccac tcaggaattg tgaaccactc 240
caaccttctt ttgttaaaca ttacctagcc tcaaatatct tgtgatagca acagaagaga 300
ctaagatact taaaaatatc tatggatgaa gaaaatgacc aatgtgagga cgtcgtggat 360
attggccatc agcaaagaag agagcataaa gttcccattc tcacagatat tctgaaacct 420
gtgtatttca tttttgatgg aaaagagctg cacacagaat agtaagttag ctggagggaa 480
cttatgagcc tttttttttc cccctcacat aaacaacaat ggagcttagt gtccatttca 540
ttctctttgt gcttgactgg gacccagatg gctcactgtc cctcagtatg tccctgctcc 600
ctccctgctg agatctcatt ggctgtgacg cactgccctg ctccagccag gacactactg 660
tctttcttcc ccgtgggaat gtgttctcaa agccaactcc aacaacgctg acctgggcat 720
cacttgggtg gtgctggagt gagctgtagg ctctggtcct gctgttgtag cctggggtcc 780
tagttgtcat tcccctgaca cagcagagag agcaaacaac agaaccaatg gctgtagcca 840
catggtgaac agctagacct ccagaacaat aggagtaaat gcttctgcca cgaagtgtat 900
ggagaaccta aaccaatctt caggcagaac tggggccagg taccacacac agccctgccc 960
ctttctcagc tggctgttgc ccatgccaga gtcatgatca cccataggat tctcagaccc 1020
agggcattgt gtagctggag ctcaatgagt cttacgggcc ggaagcagcc aattcaggga 1080
actctgggtt ctgcgtttgc tttgcatcta tttggtgaga gacagtgtga gttcttccat 1140
tacaaaattc caatgtttaa agagcaaaca gtcaagaaac aagaaaaaaa aacccaaggg 1200
tgtgtctgtg tgtgtgtgtg tgtgcatgtg tttatgtatg tgcaggtaca tgttggggac 1260
atgtgcatgt gcatgtttac atgtgcatag agaggtcaga agacaacacc agctgttgtt 1320
ccccaagtac aatccatagt tcaaccccct gtgtgtgtgt gtgtgtgtgt ttatgtgtgc 1380
atatgctatg gaagtcaaag attgagtctg gtgtcttcaa ctgccctcta ccctattttc 1440
tgaaacagag tctctcacta aatctagacc tcactggttg ggcatccttg ttagccaatg 1500
agctcaacta tctgcccgtt tgttctctct ctctctctct ctctctctct ctctctctct 1560
ctctctctct ctctctctct ctctccataa atgaatgaat gtgtgttttt aaaaagagag 1620
tttaaaaaaa actaaggtgg catgtatccc agcttctctc cacaatccaa ctggaacggc 1680
tcaggccagc ctcatttcac gcagctcact ctatcaacac atctgctgca cagagcatgc 1740
tttgtgagtg actcaaagat cagaaccctg acttccaatg gcttatagcc taagggtaga 1800
gaagttacct gtattctggc aagataccag ggattgtagg aggggtagca acctggggag 1860
gagggaatgc actctgtgta ggagatgcag aaaggattgg aagagctggt gagtatttga 1920
gttggatgtt ggactgataa atgcagggag catctcacag gttgggatca ggcacaccgg 1980
taggatgttt catccatccg agtcaaatgg agggcaggtg tagggatttc aggttagagg 2040
gcagggaaag aaagtagaga ggagagcctg gggttgtgct ggagtgtgca cagagcactc 2100
agctggcact ttgaagaaca aagtggactg tccctggacg tgagactgag caggtaaggt 2160
gggttaagag acggtaagat cactactgca ataatccaaa ataagaacct ttatgatctc 2220
taggtgggat aacaaccagg gggagggact tttaacacac aattcagttc aacaggaact 2280
cgcacatcct ggaggcaaca cgtgaactgc gcaggctcag cagtcattgt ctgttctgcg 2340
tggtgctctt ccaagtggca cagtgtcttc atcagacctg gtgctcacat gactgatcta 2400
gtcacagaac aggccatgta tcaagttttg ggaaacagga agcaatggga gaaatgtatt 2460
ttattggtga ttaagtgaag tgcaaaagat aggacgtgct a 2501
<210> 30
<211> 347
<212> DNA
<213> Homo sapiens
<400> 30
gggttcgggg ctttcggggg cggtcggggg ttcgcggacc cgggaagctc tgaggaccca 60
gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt aactttccca gtctccgagg 120
gaagaggcgg ggtgtggggt gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc 180
cccgcagcgc ccgcgcgctc ctctccccga ctcggagccc ctcggcggcg cccggcccag 240
gacccgccta ggagcgcagg agccccagcg cagagacccc aacgccgaga cccccgcccc 300
ggccccgccg cgcttcctcc cgacgcagag caaaccgccc agagtag 347
<210> 31
<211> 1131
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 31
gggttcgggg ctttcggggg cggtcggggg ttcgcggacc cgggaagctc tgaggaccca 60
gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt aactttccca gtctccgagg 120
gaagaggcgg ggtgtggggt gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc 180
cccgcagcgc ccgcgcgctc ctctccccga ctcggagccc ctcggcggcg cccggcccag 240
gacccgccta ggagcgcagg agccccagcg cagagacccc aacgccgaga cccccgcccc 300
ggccccgccg cgcttcctcc cgacgcagag caaaccgccc agagtagaag caggtgagtt 360
tgtggtgtcg ccgatgtccc ttcggggtac tctagcgcag ccgcctggct acttgaccca 420
ctgccaccaa acgttttaaa ttcaccgaaa gcttagcttc gaagcaaagc tccgtttcgc 480
cggtgaagca ggaagccttc gctgcaggaa ctgaccttta cctcttggag cggcttctgc 540
agaaaaatcc ccgggcagag atttgggcgg agtttgccta gaactaacgc ggagccagcc 600
gatcccggcc taccccgggg ccaagatttt aaggggtgaa gagtcccttt tgccttttct 660
ggatcctggt gattcaccta gtgtcttccc taaggaactg aaccaactcc tccgctggcc 720
tctggcagcc ctccaggcgg tgcaggatgg cgtgggcccg gtaggaagct gcatgtaacc 780
gcccagggtc gggaggccag gagggcagct cctcctctga cttgaatatt gaaaacaaga 840
ggatgctttt aagaaaaaga agaaggagga ttcactacca gctctgaagg gtggaaaaga 900
gatgattcat ccggattgtg gagagggtgg aatcttgttt aggagagcgt tggttgtggc 960
aggcagggtg taactatgaa tcagtgaaga caattcacat cctgggatga aaagaaggcc 1020
atgggctcac aggagattat ccactggcct ctccacatcc gcttgcagta aggagtgtgg 1080
gactctccca agcttcagcg ctgaactgca atgcagtgac gtcgcttaag a 1131
<210> 32
<211> 1431
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 32
cgcattgccc agttgttaga ttaagaaata gacagcatga gagggatgag gcaacccgtg 60
ctcagctgtc aaggctcagt cgctagcatt tcccaacaca aagattctga ccttaaatgc 120
aaccatttga aacccctgta ggcctcaggt gaaactccag atgccacaat ggagctctgc 180
tcccctaaag cctcaaaaca aaggcctaat tctatgcctg tcttaatttt ctttcactta 240
agttagttcc actgagaccc caggctgtta ggggttattg gtgtaaggta ctttcatatt 300
ttaaacagag gatatcggca tttgtttctt tctctgagga caagagaaaa aagccaggtt 360
ccacagagga cacagagaag gtttgggtgt cctcctgggg ttctttttgc caactttccc 420
cacgttaaag gtgaacattg gttctttcat ttgctttgga agttttaatc tctaacagtg 480
gacaaagtta ccagtgcctt aaactctgtt acactttttg gaagtgaaaa ctttgtagta 540
tgataggtta ttttgatgta aagatgttct ggataccatt atatgttccc cctgtttcag 600
aggctcagat tgtaatatgt aaatggtatg tcattcgcta ctatgattta atttgaaata 660
tggtcttttg gttatgaata ctttgcagca cagctgagag gctgtctgtt gtattcattg 720
tggtcatagc acctaacaac attgtagcct caatcgagtg agacagacta gaagttccta 780
gtgatggctt atgatagcaa atggcctcat gtcaaatatt tagatgtaat tttgtgtaag 840
aaatacagac tggatgtacc accaactact acctgtaatg acaggcctgt ccaacacatc 900
tcccttttcc atgactgtgg tagccagcat cggaaagaac gctgatttaa agaggtcgct 960
tgggaatttt attgacacag taccatttaa tggggaggac aaaatggggc aggggaggga 1020
gaagtttctg tcgttaaaaa cagatttgga aagactggac tctaaagtct gttgattaaa 1080
gatgagcttt gtctacttca aaagtttgtt tgcttacccc ttcagcctcc aattttttaa 1140
gtgaaaatat agctaataac atgtgaaaag aatagaagct aaggtttaga taaatattga 1200
gcagatctat aggaagattg aacctgaata ttgccattat gcttgacatg gtttccaaaa 1260
aatggtactc cacatatttc agtgagggta agtattttcc tgttgtcaag aatagcattg 1320
taaaagcatt ttgtaataat aaagaatagc tttaatgata tgcttgtaac taaaataatt 1380
ttgtaatgta tcaaatacat ttaaaacatt aaaatataat ctctataata a 1431
<210> 33
<211> 743
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 33
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser
1 5 10 15
Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro
20 25 30
Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro
35 40 45
Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro
50 55 60
Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp
65 70 75 80
Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala
85 90 95
Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly
100 105 110
Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro
115 120 125
Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg
130 135 140
Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly
145 150 155 160
Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr
165 170 175
Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro
180 185 190
Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly
195 200 205
Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser
210 215 220
Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile
225 230 235 240
Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu
245 250 255
Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn
260 265 270
Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg
275 280 285
Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn
290 295 300
Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile
305 310 315 320
Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn
325 330 335
Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu
340 345 350
Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro
355 360 365
Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp
370 375 380
Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe
385 390 395 400
Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu
405 410 415
Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu
420 425 430
Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser
435 440 445
Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser
450 455 460
Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro
465 470 475 480
Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn
485 490 495
Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn
500 505 510
Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys
515 520 525
Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly
530 535 540
Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile
545 550 555 560
Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser
565 570 575
Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ser Thr Thr Leu
580 585 590
Tyr Ser Pro Ala Gln Ala Gln Thr Gly Trp Val Gln Asn Gln Gly Ile
595 600 605
Leu Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro
610 615 620
Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro
625 630 635 640
Leu Met Gly Gly Phe Gly Met Lys His Pro Pro Pro Gln Ile Leu Ile
645 650 655
Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Ala Phe Asn Lys Asp
660 665 670
Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val
675 680 685
Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro
690 695 700
Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Asn Asn Val Glu Phe
705 710 715 720
Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr
725 730 735
Arg Tyr Leu Thr Arg Asn Leu
740
<210> 34
<211> 149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 34
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactagggg ttcctagat 149
<210> 35
<211> 139
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 35
cccctagtga tggagttggc cactccctct ctgcgcgctc gctcgctcac tgaggccgcc 60
cgggcaaagc ccgggcgtcg ggcgaccttt ggtcgcccgg cctcagtgag cgagcgagcg 120
cgcagagagg gagtggcca 139
<210> 36
<211> 6374
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 36
ttgctggcct tttgctcaca tgtcctgcag gcagctgcgc gctcgctcgc tcactgaggc 60
cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag tgagcgagcg 120
agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc gcacgcgttt 180
aattaagacc tcgaagggga cttggggggt tcggggcttt cgggggcggt cgggggttcg 240
cggacccggg aagctctgag gacccagagg ccgggcgcgc tccgcccgcg gcgccgcccc 300
ctccgtaact ttcccagtct ccgagggaag aggcggggtg tggggtgcgg ttaaaaggcg 360
ccacggcggg agacaggtgt tgcggccccg cagcgcccgc gcgctcctct ccccgactcg 420
gagcccctcg gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc ccagcgcaga 480
gaccccaacg ccgagacccc cgccccggcc ccgccgcgct tcctcccgac gcagagcaaa 540
ccgcccagag tagaagcgga tccgccacca tggattgggg cacactccag agcatcctcg 600
ggggtgtcaa caaacactcc accagcattg gaaagatctg gctcacggtc ctcttcatct 660
tccgcatcat gatcctcgtg gtggctgcaa aggaggtgtg gggagatgag caagccgatt 720
ttgtctgcaa cacgctccag cctggctgca agaatgtatg ctacgaccac cacttcccca 780
tctctcacat ccggctctgg gctctgcagc tgatcatggt gtccacgcca gccctcctgg 840
tagctatgca tgtggcctac cggagacatg aaaagaaacg gaagttcatg aagggagaga 900
taaagaacga gtttaaggac atcgaagaga tcaaaaccca gaaggtccgt atcgaagggt 960
ccctgtggtg gacctacacc accagcatct tcttccgggt catctttgaa gccgtcttca 1020
tgtacgtctt ttacatcatg tacaatggct tcttcatgca acgtctggtg aaatgcaacg 1080
cttggccctg ccccaataca gtggactgct tcatttccag gcccacagaa aagactgtct 1140
tcaccgtgtt tatgatttct gtgtctggaa tttgcattct gctaaatatc acagagctgt 1200
gctatttgtt cgttaggtat tgctcaggaa agtccaaaag accagtctac ccatacgatg 1260
ttccagatta cgcttaaggc gcgccacccc tgcagggaat tccgcattgc ccagttgtta 1320
gattaagaaa tagacagcat gagagggatg aggcaacccg tgctcagctg tcaaggctca 1380
gtcgctagca tttcccaaca caaagattct gaccttaaat gcaaccattt gaaacccctg 1440
taggcctcag gtgaaactcc agatgccaca atggagctct gctcccctaa agcctcaaaa 1500
caaaggccta attctatgcc tgtcttaatt ttctttcact taagttagtt ccactgagac 1560
cccaggctgt taggggttat tggtgtaagg tactttcata ttttaaacag aggatatcgg 1620
catttgtttc tttctctgag gacaagagaa aaaagccagg ttccacagag gacacagaga 1680
aggtttgggt gtcctcctgg ggttcttttt gccaactttc cccacgttaa aggtgaacat 1740
tggttctttc atttgctttg gaagttttaa tctctaacag tggacaaagt taccagtgcc 1800
ttaaactctg ttacactttt tggaagtgaa aactttgtag tatgataggt tattttgatg 1860
taaagatgtt ctggatacca ttatatgttc cccctgtttc agaggctcag attgtaatat 1920
gtaaatggta tgtcattcgc tactatgatt taatttgaaa tatggtcttt tggttatgaa 1980
tactttgcag cacagctgag aggctgtctg ttgtattcat tgtggtcata gcacctaaca 2040
acattgtagc ctcaatcgag tgagacagac tagaagttcc tagtgatggc ttatgatagc 2100
aaatggcctc atgtcaaata tttagatgta attttgtgta agaaatacag actggatgta 2160
ccaccaacta ctacctgtaa tgacaggcct gtccaacaca tctccctttt ccatgactgt 2220
ggtagccagc atcggaaaga acgctgattt aaagaggtcg cttgggaatt ttattgacac 2280
agtaccattt aatggggagg acaaaatggg gcaggggagg gagaagtttc tgtcgttaaa 2340
aacagatttg gaaagactgg actctaaagt ctgttgatta aagatgagct ttgtctactt 2400
caaaagtttg tttgcttacc ccttcagcct ccaatttttt aagtgaaaat atagctaata 2460
acatgtgaaa agaatagaag ctaaggttta gataaatatt gagcagatct ataggaagat 2520
tgaacctgaa tattgccatt atgcttgaca tggtttccaa aaaatggtac tccacatatt 2580
tcagtgaggg taagtatttt cctgttgtca agaatagcat tgtaaaagca ttttgtaata 2640
ataaagaata gctttaatga tatgcttgta actaaaataa ttttgtaatg tatcaaatac 2700
atttaaaaca ttaaaatata atctctataa taatttaaaa tctaatatgg ttttaataga 2760
acagcgatat caagcttatc gataatcaac ctctggatta caaaatttgt gaaagattga 2820
ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct ttaatgcctt 2880
tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat aaatcctggt 2940
tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg 3000
tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag ctcctttccg 3060
ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc tgccttgccc 3120
gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg tcggggaaat 3180
catcgtcctt tccttggctg ctcgcctatg ttgccacctg gattctgcgc gggacgtcct 3240
tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg 3300
ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc tccctttggg 3360
ccgcctcccc gcgaattcat cgataccgag cgctgctcga gagatctgtg atagcggcca 3420
tcaagctggc tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 3480
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 3540
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 3600
gggaggattg ggaagacaat agcaggcatg ctggggacac gtgcggaccg agcggccgca 3660
ggaaccccta gtgatggagt tggccactcc ctctctgcgc gctcgctcgc tcactgaggc 3720
cgggcgacca aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg 3780
agcgcgcagc tgcctgcagg ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg 3840
tatttcacac cgcatacgtc aaagcaacca tagtacgcgc cctgtagcgg cgcattaagc 3900
gcggcgggtg tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc 3960
gctcctttcg ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct 4020
ctaaatcggg ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa 4080
aaacttgatt tgggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc 4140
cctttgacgt tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca 4200
ctcaacccta tctcgggcta ttcttttgat ttataaggga ttttgccgat ttcggcctat 4260
tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg 4320
tttacaattt tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag 4380
ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct cccggcatcc 4440
gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca 4500
tcaccgaaac gcgcgagacg aaagggcctc gtgatacgcc tatttttata ggttaatgtc 4560
atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt gcgcggaacc 4620
cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag acaataaccc 4680
tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc 4740
gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc agaaacgctg 4800
gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat cgaactggat 4860
ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc aatgatgagc 4920
acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg gcaagagcaa 4980
ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc agtcacagaa 5040
aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat aaccatgagt 5100
gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga gctaaccgct 5160
tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc ggagctgaat 5220
gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc aacaacgttg 5280
cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt aatagactgg 5340
atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc tggctggttt 5400
attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc agcactgggg 5460
ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca ggcaactatg 5520
gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca ttggtaactg 5580
tcagaccaag tttactcata tatactttag attgatttaa aacttcattt ttaatttaaa 5640
aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta acgtgagttt 5700
tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt 5760
tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt 5820
ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag 5880
ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta 5940
gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat 6000
aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg 6060
ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg 6120
agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac 6180
aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga 6240
aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt 6300
ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta 6360
cggttcctgg cctt 6374
<210> 37
<211> 700
<212> DNA
<213> Homo sapiens
<400> 37
ccatgatatg ttaagaaaag caaagtgtgg aatagtaggt aaaatattct atcttatgtg 60
caaaagggga aataaaagtc atcaatattc atgtagattc aattcacata tagattcata 120
tcacattcct atatatatag aaattctgga aagacacaaa ataaattaat aaaagttgtt 180
acttcattgt agtttttaaa gttttttgag tcttaagact tactttccac ttctgtagaa 240
aggaattaca aatcctttct ttatagagct atgtgatgaa ataaacataa agcatttggc 300
acacttcagg atagcaactt gtggattaat gattaacaca gtcacctttg caccagatta 360
cacccagaga ttccttcatt tatatttatg tggttttgtg tgtcagttat gcagtctaac 420
tcagtcattc aactatgtta cagctgcaac actctatttt tttctttggt acaggagtcg 480
ccctcttatc cactgtttca tttttgtggt tccagttacc tgtagtcaac cacagttgga 540
aaatatgata gcattttgag agagagactg catccaaaaa cttatattac aatatattgt 600
tatacattgt tataagtgtt gttttattat tctttattgt taatctctta ccattaagcc 660
ttatggtagg tttgtatgta taggaaaaaa cagattatat 700
<210> 38
<211> 700
<212> DNA
<213> Homo sapiens
<400> 38
atataatctg ttttttccta tacatacaaa cctaccataa ggcttaatgg taagagatta 60
acaataaaga ataataaaac aacacttata acaatgtata acaatatatt gtaatataag 120
tttttggatg cagtctctct ctcaaaatgc tatcatattt tccaactgtg gttgactaca 180
ggtaactgga accacaaaaa tgaaacagtg gataagaggg cgactcctgt accaaagaaa 240
aaaatagagt gttgcagctg taacatagtt gaatgactga gttagactgc ataactgaca 300
cacaaaacca cataaatata aatgaaggaa tctctgggtg taatctggtg caaaggtgac 360
tgtgttaatc attaatccac aagttgctat cctgaagtgt gccaaatgct ttatgtttat 420
ttcatcacat agctctataa agaaaggatt tgtaattcct ttctacagaa gtggaaagta 480
agtcttaaga ctcaaaaaac tttaaaaact acaatgaagt aacaactttt attaatttat 540
tttgtgtctt tccagaattt ctatatatat aggaatgtga tatgaatcta tatgtgaatt 600
gaatctacat gaatattgat gacttttatt tccccttttg cacataagat agaatatttt 660
acctactatt ccacactttg cttttcttaa catatcatgg 700
<210> 39
<211> 700
<212> DNA
<213> Homo sapiens
<400> 39
gcagagacct acagacagaa gtacatttta cactggatcc aggacacaca tcagtctgaa 60
aacacacaca tgaaccaaac gtttcctaaa gcattactta tccttgctaa tagcaacaca 120
ttctcatatt cttttatact tcatttaatt tcatataaaa aagaaaagga aaggaaagaa 180
atctatttct cagcccatta ataaggtcag gagcagcaac accagactag aagaaaagct 240
tacctataga tttttctgcc acctcttgag tgcgtccagc tttccgacaa gtctcagtgc 300
catctactgt gcgctctggg tattgcaatt gctttttttt tttttttttt ttttttttta 360
gaatgagact aagtcagaga acacaaagaa cttctttccc cacagtggag atggctctga 420
aagcgtttaa ggaatagctt agatgagtgg ctaacacatt ctcccggttc tgaattctaa 480
gaccacagac tccatgtcca gtccccaaag agaggctttg caagctacag aatacccctc 540
tgactgggac ctcaggagct aaactgacca cgtaattggt tctagaaagt gaaacgtttt 600
aatttgaaac atccaaatga gcattttgtg aaaagctact gccgtccatc aaatacaaca 660
cagccaggga gtcatcgctc tattgccctt gtcaatccta 700
<210> 40
<211> 700
<212> DNA
<213> Homo sapiens
<400> 40
taggattgac aagggcaata gagcgatgac tccctggctg tgttgtattt gatggacggc 60
agtagctttt cacaaaatgc tcatttggat gtttcaaatt aaaacgtttc actttctaga 120
accaattacg tggtcagttt agctcctgag gtcccagtca gaggggtatt ctgtagcttg 180
caaagcctct ctttggggac tggacatgga gtctgtggtc ttagaattca gaaccgggag 240
aatgtgttag ccactcatct aagctattcc ttaaacgctt tcagagccat ctccactgtg 300
gggaaagaag ttctttgtgt tctctgactt agtctcattc taaaaaaaaa aaaaaaaaaa 360
aaaaaaaagc aattgcaata cccagagcgc acagtagatg gcactgagac ttgtcggaaa 420
gctggacgca ctcaagaggt ggcagaaaaa tctataggta agcttttctt ctagtctggt 480
gttgctgctc ctgaccttat taatgggctg agaaatagat ttctttcctt tccttttctt 540
ttttatatga aattaaatga agtataaaag aatatgagaa tgtgttgcta ttagcaagga 600
taagtaatgc tttaggaaac gtttggttca tgtgtgtgtt ttcagactga tgtgtgtcct 660
ggatccagtg taaaatgtac ttctgtctgt aggtctctgc 700
<210> 41
<211> 700
<212> DNA
<213> Homo sapiens
<400> 41
atccattatt tgattagcca tttcaaaaac acatttacgg agatcttcat ctgggcagag 60
cattattcca ggcctctgaa gaaccaaaga tgattttgaa aggaggtcac agtgcagaca 120
gcaggtgtgt atataaggtg gctactttac aaaacaggat atggcaagct ggacatgaca 180
ggcacagcaa agtctctgaa cagagttcgg ggcatgaaat tgtttctttt gggggtcttc 240
aggaacaatt tcatgaaagc taaatcatga aagatagcag gcttttgcca ggaaaaaaaa 300
aaacaagact agtgattagt ttggcgtttt cggtttcttt gagaagcgaa ataacttatc 360
aaggactctt tttgccactt gatgttataa ttggttgata ggtctctcag aagccctttg 420
tgcaaactag aacctgcagg gatgtgcaaa gcctctctct gctgccatct gctgtcttac 480
aagaggtaac tgcaagaggt tgaatcctcc aatgccctgg ggattcccat tgcagggcag 540
gggcagcagc ctgtgttaat aaccacccga acagccacat gtacccctcc acaaaagtgt 600
cactgtctcc attgctctgg agtttgtatt cccaatttgt aatctttgtt agggcactca 660
taaaaaatta aaaacaaaaa ttcacacaaa catacactac 700
<210> 42
<211> 700
<212> DNA
<213> Homo sapiens
<400> 42
gtagtgtatg tttgtgtgaa tttttgtttt taatttttta tgagtgccct aacaaagatt 60
acaaattggg aatacaaact ccagagcaat ggagacagtg acacttttgt ggaggggtac 120
atgtggctgt tcgggtggtt attaacacag gctgctgccc ctgccctgca atgggaatcc 180
ccagggcatt ggaggattca acctcttgca gttacctctt gtaagacagc agatggcagc 240
agagagaggc tttgcacatc cctgcaggtt ctagtttgca caaagggctt ctgagagacc 300
tatcaaccaa ttataacatc aagtggcaaa aagagtcctt gataagttat ttcgcttctc 360
aaagaaaccg aaaacgccaa actaatcact agtcttgttt ttttttttcc tggcaaaagc 420
ctgctatctt tcatgattta gctttcatga aattgttcct gaagaccccc aaaagaaaca 480
atttcatgcc ccgaactctg ttcagagact ttgctgtgcc tgtcatgtcc agcttgccat 540
atcctgtttt gtaaagtagc caccttatat acacacctgc tgtctgcact gtgacctcct 600
ttcaaaatca tctttggttc ttcagaggcc tggaataatg ctctgcccag atgaagatct 660
ccgtaaatgt gtttttgaaa tggctaatca aataatggat 700
<210> 43
<211> 700
<212> DNA
<213> Homo sapiens
<400> 43
gctaattggg tcaggatttg aaagacctta gctttgtgtg accttcaatt ttatcattca 60
gcttgaatat gtgccccaga aaacctttat gtaattccct aatatttcag taaccagcat 120
gcaacatacg agaagcacat tctttgtttt tagaatggta tctggctgat gactttcaca 180
acagctcaca tgagagggaa gtattttagc aatcggactg aaggaaaatc caaaaactcc 240
accattgcag ggtcaacagt gcacgtgttt gaattctgaa agacgtaagc caaggcaaat 300
agaaggaaat gatcttccac taatcccggc atttacttcc tcctctctgg aggggacggc 360
catgcacaca gagccctgtg ctctgagttc tcatgaaagg gacacagctg ggctcactca 420
gcgtcacctc gcccctgggg tgtgtcctgg tttcagatct cgggctggag tgattcacgt 480
gtggcaggga ggccatcatt aatgaaaatg cgagggcgtc gcacgagtgt tgatgactca 540
gcaggccttt ctacttctgt atgagtcagt gcccatcaca gccaagcctg gggcacaaca 600
ggttttctta aaagagcatg ggggcctcat cttcaacaac caattaggaa gcagaaaagt 660
cctcagtgag gaaggaataa tgacatgttg gagctaagat 700
<210> 44
<211> 700
<212> DNA
<213> Homo sapiens
<400> 44
atcttagctc caacatgtca ttattccttc ctcactgagg acttttctgc ttcctaattg 60
gttgttgaag atgaggcccc catgctcttt taagaaaacc tgttgtgccc caggcttggc 120
tgtgatgggc actgactcat acagaagtag aaaggcctgc tgagtcatca acactcgtgc 180
gacgccctcg cattttcatt aatgatggcc tccctgccac acgtgaatca ctccagcccg 240
agatctgaaa ccaggacaca ccccaggggc gaggtgacgc tgagtgagcc cagctgtgtc 300
cctttcatga gaactcagag cacagggctc tgtgtgcatg gccgtcccct ccagagagga 360
ggaagtaaat gccgggatta gtggaagatc atttccttct atttgccttg gcttacgtct 420
ttcagaattc aaacacgtgc actgttgacc ctgcaatggt ggagtttttg gattttcctt 480
cagtccgatt gctaaaatac ttccctctca tgtgagctgt tgtgaaagtc atcagccaga 540
taccattcta aaaacaaaga atgtgcttct cgtatgttgc atgctggtta ctgaaatatt 600
agggaattac ataaaggttt tctggggcac atattcaagc tgaatgataa aattgaaggt 660
cacacaaagc taaggtcttt caaatcctga cccaattagc 700
<210> 45
<211> 658
<212> DNA
<213> Homo sapiens
<400> 45
cgcctcggcc tcccaaagtg ctgggattac aggcgtgagc caccaccgtg cctggcttat 60
acaagtaatt gtaaacgaaa aggaaaaaat ggagatacag ttttctcgtg catcttaaac 120
tttggtgctt aaaagcacca ttaaattctg ctttcacatg aacacacaca agattaccac 180
gtttgctctg ggctgctgcg tattggaagg acatacacat tcaacaaata tttgttgaac 240
ttccattctg tacacaaagc acaaagaaag attcgttcac agtccgtgtg ggtactggaa 300
agcagttcca gccctgcctg ccagggggca ccccaggcaa gcacatctca gtggctgcta 360
gaaagtgaat tgaggctgag tctctccaca cccaagtgtt aggcgttcta ggctcagaaa 420
gagacaatga caatgcgggc aattctctct tcactgtgtc ctcttctttg ctagaaatgt 480
tattagaata tggaaatgtg acattcagca ctaatcagtt tgacatatga atatatctat 540
acacatattt ctccctgaaa ttggcctaaa tactctttct tggaaccaaa tgagaagcaa 600
acaaccttta caactaaaca ttaaaccata agatgaacat cttagttgtc tacctaga 658
<210> 46
<211> 682
<212> DNA
<213> Homo sapiens
<400> 46
ttctaggtag acaactaaga tgttcatctt atggtttaat gtttagttgt aaaggttgtt 60
tgcttctcat ttggttccaa gaaagagtat ttaggccaat ttcagggaga aatatgtgta 120
tagatatatt catatgtcaa actgattagt gctgaatgtc acatttccat attctaataa 180
catttctagc aaagaagagg acacagtgaa gagagaattg cccgcattgt cattgtctct 240
ttctgagcct agaacgccta acacttgggt gtggagagac tcagcctcaa ttcactttct 300
agcagccact gagatgtgct tgcctggggt gccccctggc aggcagggct ggaactgctt 360
tccagtaccc acacggactg tgaacgaatc tttctttgtg ctttgtgtac agaatggaag 420
ttcaacaaat atttgttgaa tgtgtatgtc cttccaatac gcagcagccc agagcaaacg 480
tggtaatctt gtgtgtgttc atgtgaaagc agaatttaat ggtgctttta agcaccaaag 540
tttaagatgc acgagaaaac tgtatctcca ttttttcctt ttcgtttaca attacttgta 600
taagccaggc acggtggtgg ctcacgcctg taatcccagc actttgggag gccgaggcgg 660
gcggatcaca tgaggtcggg ag 682
<210> 47
<211> 135
<212> DNA
<213> Homo sapiens
<400> 47
gggttcgggg ctttcggggg cggtcggggg ttcgcggacc cgggaagctc tgaggaccca 60
gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt aactttccca gtctccgagg 120
gaagaggcgg ggtgt 135
<210> 48
<211> 7163
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 48
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tcagtgatgc ctgaaacctc 1320
agatggtact gaaccctcta tataatctgt tttttcctat acatacaaac ctaccataag 1380
gcttaatggt aagagattaa caataaagaa taataaaaca acacttataa caatgtataa 1440
caatatattg taatataagt ttttggatgc agtctctctc tcaaaatgct atcatatttt 1500
ccaactgtgg ttgactacag gtaactggaa ccacaaaaat gaaacagtgg ataagagggc 1560
gactcctgta ccaaagaaaa aaatagagtg ttgcagctgt aacatagttg aatgactgag 1620
ttagactgca taactgacac acaaaaccac ataaatataa atgaaggaat ctctgggtgt 1680
aatctggtgc aaaggtgact gtgttaatca ttaatccaca agttgctatc ctgaagtgtg 1740
ccaaatgctt tatgtttatt tcatcacata gctctataaa gaaaggattt gtaattcctt 1800
tctacagaag tggaaagtaa gtcttaagac tcaaaaaact ttaaaaacta caatgaagta 1860
acaactttta ttaatttatt ttgtgtcttt ccagaatttc tatatatata ggaatgtgat 1920
atgaatctat atgtgaattg aatctacatg aatattgatg acttttattt ccccttttgc 1980
acataagata gaatatttta cctactattc cacactttgc ttttcttaac atatcatggg 2040
atctttttat ataagtgaac aaagagtttc ttcattcttt cacacagttt aattaagacc 2100
tcgaagggga cttggggggt tcggggcttt cgggggcggt cgggggttcg cggacccggg 2160
aagctctgag gacccagagg ccgggcgcgc tccgcccgcg gcgccgcccc ctccgtaact 2220
ttcccagtct ccgagggaag aggcggggtg tggggtgcgg ttaaaaggcg ccacggcggg 2280
agacaggtgt tgcggccccg cagcgcccgc gcgctcctct ccccgactcg gagcccctcg 2340
gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc ccagcgcaga gaccccaacg 2400
ccgagacccc cgccccggcc ccgccgcgct tcctcccgac gcagagcaaa ccgcccagag 2460
tagaagccat ggtgagcaag ggcgaggagc tgttcaccgg ggtggtgccc atcctggtcg 2520
agctggacgg cgacgtaaac ggccacaagt tcagcgtgtc cggcgagggc gagggcgatg 2580
ccacctacgg caagctgacc ctgaagttca tctgcaccac cggcaagctg cccgtgccct 2640
ggcccaccct cgtgaccacc ctgacctacg gcgtgcagtg cttcagccgc taccccgacc 2700
acatgaagca gcacgacttc ttcaagtccg ccatgcccga aggctacgtc caggagcgca 2760
ccatcttctt caaggacgac ggcaactaca agacccgcgc cgaggtgaag ttcgagggcg 2820
acaccctggt gaaccgcatc gagctgaagg gcatcgactt caaggaggac ggcaacatcc 2880
tggggcacaa gctggagtac aactacaaca gccacaacgt ctatatcatg gccgacaagc 2940
agaagaacgg catcaaggtg aacttcaaga tccgccacaa catcgaggac ggcagcgtgc 3000
agctcgccga ccactaccag cagaacaccc ccatcggcga cggccccgtg ctgctgcccg 3060
acaaccacta cctgagcacc cagtccgccc tgagcaaaga ccccaacgag aagcgcgatc 3120
acatggtcct gctggagttc gtgaccgccg ccgggatcac tctcggcatg gacgagctgt 3180
acaagtaaag gcgcgccacc cctgcaggga attccgcatt gcccagttgt tagattaaga 3240
aatagacagc atgagaggga tgaggcaacc cgtgctcagc tgtcaaggct cagtcgctag 3300
catttcccaa cacaaagatt ctgaccttaa atgcaaccat ttgaaacccc tgtaggcctc 3360
aggtgaaact ccagatgcca caatggagct ctgctcccct aaagcctcaa aacaaaggcc 3420
taattctatg cctgtcttaa ttttctttca cttaagttag ttccactgag accccaggct 3480
gttaggggtt attggtgtaa ggtactttca tattttaaac agaggatatc ggcatttgtt 3540
tctttctctg aggacaagag aaaaaagcca ggttccacag aggacacaga gaaggtttgg 3600
gtgtcctcct ggggttcttt ttgccaactt tccccacgtt aaaggtgaac attggttctt 3660
tcatttgctt tggaagtttt aatctctaac agtggacaaa gttaccagtg ccttaaactc 3720
tgttacactt tttggaagtg aaaactttgt agtatgatag gttattttga tgtaaagatg 3780
ttctggatac cattatatgt tccccctgtt tcagaggctc agattgtaat atgtaaatgg 3840
tatgtcattc gctactatga tttaatttga aatatggtct tttggttatg aatactttgc 3900
agcacagctg agaggctgtc tgttgtattc attgtggtca tagcacctaa caacattgta 3960
gcctcaatcg agtgagacag actagaagtt cctagtgatg gcttatgata gcaaatggcc 4020
tcatgtcaaa tatttagatg taattttgtg taagaaatac agactggatg taccaccaac 4080
tactacctgt aatgacaggc ctgtccaaca catctccctt ttccatgact gtggtagcca 4140
gcatcggaaa gaacgctgat ttaaagaggt cgcttgggaa ttttattgac acagtaccat 4200
ttaatgggga ggacaaaatg gggcagggga gggagaagtt tctgtcgtta aaaacagatt 4260
tggaaagact ggactctaaa gtctgttgat taaagatgag ctttgtctac ttcaaaagtt 4320
tgtttgctta ccccttcagc ctccaatttt ttaagtgaaa atatagctaa taacatgtga 4380
aaagaataga agctaaggtt tagataaata ttgagcagat ctataggaag attgaacctg 4440
aatattgcca ttatgcttga catggtttcc aaaaaatggt actccacata tttcagtgag 4500
ggtaagtatt ttcctgttgt caagaatagc attgtaaaag cattttgtaa taataaagaa 4560
tagctttaat gatatgcttg taactaaaat aattttgtaa tgtatcaaat acatttaaaa 4620
cattaaaata taatctctat aataatttaa aatctaatat ggttttaata gaacagcgat 4680
atcaagctta tcgataatca acctctggat tacaaaattt gtgaaagatt gactggtatt 4740
cttaactatg ttgctccttt tacgctatgt ggatacgctg ctttaatgcc tttgtatcat 4800
gctattgctt cccgtatggc tttcattttc tcctccttgt ataaatcctg gttgctgtct 4860
ctttatgagg agttgtggcc cgttgtcagg caacgtggcg tggtgtgcac tgtgtttgct 4920
gacgcaaccc ccactggttg gggcattgcc accacctgtc agctcctttc cgggactttc 4980
gctttccccc tccctattgc cacggcggaa ctcatcgccg cctgccttgc ccgctgctgg 5040
acaggggctc ggctgttggg cactgacaat tccgtggtgt tgtcggggaa atcatcgtcc 5100
tttccttggc tgctcgccta tgttgccacc tggattctgc gcgggacgtc cttctgctac 5160
gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg gcctgctgcc ggctctgcgg 5220
cctcttccgc gtcttcgcct tcgccctcag acgagtcgga tctccctttg ggccgcctcc 5280
ccgcgaattc atcgataccg agcgctgctc gagagatctg tgatagcggc catcaagctg 5340
gctgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc 5400
ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt 5460
ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat 5520
tgggaagaca atagcaggca tgctggggac acgtgcggac cgagcggccg caggaacccc 5580
tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag gccgggcgac 5640
caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag cgagcgcgca 5700
gctgcctgca ggggcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac 5760
accgcatacg tcaaagcaac catagtacgc gccctgtagc ggcgcattaa gcgcggcggg 5820
tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt 5880
cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg 5940
ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga 6000
tttgggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac 6060
gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc 6120
tatctcgggc tattcttttg atttataagg gattttgccg atttcggcct attggttaaa 6180
aaatgagctg atttaacaaa aatttaacgc gaattttaac aaaatattaa cgtttacaat 6240
tttatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agccccgaca 6300
cccgccaaca cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag 6360
acaagctgtg accgtctccg ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa 6420
acgcgcgaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat 6480
aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg 6540
tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat 6600
gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat 6660
tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt 6720
aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag 6780
cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa 6840
agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg 6900
ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct 6960
tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac 7020
tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca 7080
caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat 7140
accaaacgac gagcgtgaca cca 7163
<210> 49
<211> 7247
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 49
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tgctatctat catcttgaag 1320
ggcttctgga acaagttaga atagagtcaa cactcatgaa ctgctgtagc aaaaaaaact 1380
atagatgtag gattgacaag ggcaatagag cgatgactcc ctggctgtgt tgtatttgat 1440
ggacggcagt agcttttcac aaaatgctca tttggatgtt tcaaattaaa acgtttcact 1500
ttctagaacc aattacgtgg tcagtttagc tcctgaggtc ccagtcagag gggtattctg 1560
tagcttgcaa agcctctctt tggggactgg acatggagtc tgtggtctta gaattcagaa 1620
ccgggagaat gtgttagcca ctcatctaag ctattcctta aacgctttca gagccatctc 1680
cactgtgggg aaagaagttc tttgtgttct ctgacttagt ctcattctaa aaaaaaaaaa 1740
aaaaaaaaaa aaaaagcaat tgcaataccc agagcgcaca gtagatggca ctgagacttg 1800
tcggaaagct ggacgcactc aagaggtggc agaaaaatct ataggtaagc ttttcttcta 1860
gtctggtgtt gctgctcctg accttattaa tgggctgaga aatagatttc tttcctttcc 1920
ttttcttttt tatatgaaat taaatgaagt ataaaagaat atgagaatgt gttgctatta 1980
gcaaggataa gtaatgcttt aggaaacgtt tggttcatgt gtgtgttttc agactgatgt 2040
gtgtcctgga tccagtgtaa aatgtacttc tgtctgtagg tctctgccac agaaaagttg 2100
gaaagccatt gttgtattcc atttccaggg caacaaaaga taccactgtc acttcatgtg 2160
aaatggtgtt gtttaattaa gacctcgaag gggacttggg gggttcgggg ctttcggggg 2220
cggtcggggg ttcgcggacc cgggaagctc tgaggaccca gaggccgggc gcgctccgcc 2280
cgcggcgccg ccccctccgt aactttccca gtctccgagg gaagaggcgg ggtgtggggt 2340
gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc cccgcagcgc ccgcgcgctc 2400
ctctccccga ctcggagccc ctcggcggcg cccggcccag gacccgccta ggagcgcagg 2460
agccccagcg cagagacccc aacgccgaga cccccgcccc ggccccgccg cgcttcctcc 2520
cgacgcagag caaaccgccc agagtagaag ccatggtgag caagggcgag gagctgttca 2580
ccggggtggt gcccatcctg gtcgagctgg acggcgacgt aaacggccac aagttcagcg 2640
tgtccggcga gggcgagggc gatgccacct acggcaagct gaccctgaag ttcatctgca 2700
ccaccggcaa gctgcccgtg ccctggccca ccctcgtgac caccctgacc tacggcgtgc 2760
agtgcttcag ccgctacccc gaccacatga agcagcacga cttcttcaag tccgccatgc 2820
ccgaaggcta cgtccaggag cgcaccatct tcttcaagga cgacggcaac tacaagaccc 2880
gcgccgaggt gaagttcgag ggcgacaccc tggtgaaccg catcgagctg aagggcatcg 2940
acttcaagga ggacggcaac atcctggggc acaagctgga gtacaactac aacagccaca 3000
acgtctatat catggccgac aagcagaaga acggcatcaa ggtgaacttc aagatccgcc 3060
acaacatcga ggacggcagc gtgcagctcg ccgaccacta ccagcagaac acccccatcg 3120
gcgacggccc cgtgctgctg cccgacaacc actacctgag cacccagtcc gccctgagca 3180
aagaccccaa cgagaagcgc gatcacatgg tcctgctgga gttcgtgacc gccgccggga 3240
tcactctcgg catggacgag ctgtacaagt aaaggcgcgc cacccctgca gggaattccg 3300
cattgcccag ttgttagatt aagaaataga cagcatgaga gggatgaggc aacccgtgct 3360
cagctgtcaa ggctcagtcg ctagcatttc ccaacacaaa gattctgacc ttaaatgcaa 3420
ccatttgaaa cccctgtagg cctcaggtga aactccagat gccacaatgg agctctgctc 3480
ccctaaagcc tcaaaacaaa ggcctaattc tatgcctgtc ttaattttct ttcacttaag 3540
ttagttccac tgagacccca ggctgttagg ggttattggt gtaaggtact ttcatatttt 3600
aaacagagga tatcggcatt tgtttctttc tctgaggaca agagaaaaaa gccaggttcc 3660
acagaggaca cagagaaggt ttgggtgtcc tcctggggtt ctttttgcca actttcccca 3720
cgttaaaggt gaacattggt tctttcattt gctttggaag ttttaatctc taacagtgga 3780
caaagttacc agtgccttaa actctgttac actttttgga agtgaaaact ttgtagtatg 3840
ataggttatt ttgatgtaaa gatgttctgg ataccattat atgttccccc tgtttcagag 3900
gctcagattg taatatgtaa atggtatgtc attcgctact atgatttaat ttgaaatatg 3960
gtcttttggt tatgaatact ttgcagcaca gctgagaggc tgtctgttgt attcattgtg 4020
gtcatagcac ctaacaacat tgtagcctca atcgagtgag acagactaga agttcctagt 4080
gatggcttat gatagcaaat ggcctcatgt caaatattta gatgtaattt tgtgtaagaa 4140
atacagactg gatgtaccac caactactac ctgtaatgac aggcctgtcc aacacatctc 4200
ccttttccat gactgtggta gccagcatcg gaaagaacgc tgatttaaag aggtcgcttg 4260
ggaattttat tgacacagta ccatttaatg gggaggacaa aatggggcag gggagggaga 4320
agtttctgtc gttaaaaaca gatttggaaa gactggactc taaagtctgt tgattaaaga 4380
tgagctttgt ctacttcaaa agtttgtttg cttacccctt cagcctccaa ttttttaagt 4440
gaaaatatag ctaataacat gtgaaaagaa tagaagctaa ggtttagata aatattgagc 4500
agatctatag gaagattgaa cctgaatatt gccattatgc ttgacatggt ttccaaaaaa 4560
tggtactcca catatttcag tgagggtaag tattttcctg ttgtcaagaa tagcattgta 4620
aaagcatttt gtaataataa agaatagctt taatgatatg cttgtaacta aaataatttt 4680
gtaatgtatc aaatacattt aaaacattaa aatataatct ctataataat ttaaaatcta 4740
atatggtttt aatagaacag cgatatcaag cttatcgata atcaacctct ggattacaaa 4800
atttgtgaaa gattgactgg tattcttaac tatgttgctc cttttacgct atgtggatac 4860
gctgctttaa tgcctttgta tcatgctatt gcttcccgta tggctttcat tttctcctcc 4920
ttgtataaat cctggttgct gtctctttat gaggagttgt ggcccgttgt caggcaacgt 4980
ggcgtggtgt gcactgtgtt tgctgacgca acccccactg gttggggcat tgccaccacc 5040
tgtcagctcc tttccgggac tttcgctttc cccctcccta ttgccacggc ggaactcatc 5100
gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt tgggcactga caattccgtg 5160
gtgttgtcgg ggaaatcatc gtcctttcct tggctgctcg cctatgttgc cacctggatt 5220
ctgcgcggga cgtccttctg ctacgtccct tcggccctca atccagcgga ccttccttcc 5280
cgcggcctgc tgccggctct gcggcctctt ccgcgtcttc gccttcgccc tcagacgagt 5340
cggatctccc tttgggccgc ctccccgcga attcatcgat accgagcgct gctcgagaga 5400
tctgtgatag cggccatcaa gctggctgtg ccttctagtt gccagccatc tgttgtttgc 5460
ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa 5520
aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg 5580
gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggacacgtgc 5640
ggaccgagcg gccgcaggaa cccctagtga tggagttggc cactccctct ctgcgcgctc 5700
gctcgctcac tgaggccggg cgaccaaagg tcgcccgacg cccgggcttt gcccgggcgg 5760
cctcagtgag cgagcgagcg cgcagctgcc tgcaggggcg cctgatgcgg tattttctcc 5820
ttacgcatct gtgcggtatt tcacaccgca tacgtcaaag caaccatagt acgcgccctg 5880
tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc 5940
cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg 6000
ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta gtgctttacg 6060
gcacctcgac cccaaaaaac ttgatttggg tgatggttca cgtagtgggc catcgccctg 6120
atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg gactcttgtt 6180
ccaaactgga acaacactca accctatctc gggctattct tttgatttat aagggatttt 6240
gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta acgcgaattt 6300
taacaaaata ttaacgttta caattttatg gtgcactctc agtacaatct gctctgatgc 6360
cgcatagtta agccagcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg 6420
tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca 6480
gaggttttca ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga tacgcctatt 6540
tttataggtt aatgtcatga taataatggt ttcttagacg tcaggtggca cttttcgggg 6600
aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct 6660
catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat 6720
tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc 6780
tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg 6840
ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg 6900
ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga 6960
cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta 7020
ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc 7080
tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc 7140
gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg 7200
ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacacca 7247
<210> 50
<211> 7243
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 50
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taagctacta actacaacca 1320
cgagattata gatgtttgct gatattgttc tcagtttggt tattgtgttg tttatgaatg 1380
aaagtagtgt atgtttgtgt gaatttttgt ttttaatttt ttatgagtgc cctaacaaag 1440
attacaaatt gggaatacaa actccagagc aatggagaca gtgacacttt tgtggagggg 1500
tacatgtggc tgttcgggtg gttattaaca caggctgctg cccctgccct gcaatgggaa 1560
tccccagggc attggaggat tcaacctctt gcagttacct cttgtaagac agcagatggc 1620
agcagagaga ggctttgcac atccctgcag gttctagttt gcacaaaggg cttctgagag 1680
acctatcaac caattataac atcaagtggc aaaaagagtc cttgataagt tatttcgctt 1740
ctcaaagaaa ccgaaaacgc caaactaatc actagtcttg tttttttttt tcctggcaaa 1800
agcctgctat ctttcatgat ttagctttca tgaaattgtt cctgaagacc cccaaaagaa 1860
acaatttcat gccccgaact ctgttcagag actttgctgt gcctgtcatg tccagcttgc 1920
catatcctgt tttgtaaagt agccacctta tatacacacc tgctgtctgc actgtgacct 1980
cctttcaaaa tcatctttgg ttcttcagag gcctggaata atgctctgcc cagatgaaga 2040
tctccgtaaa tgtgtttttg aaatggctaa tcaaataatg gataccctta ggtatttttg 2100
cagaaacact tggcagcctt ccataatatc cctactatga aatggaaact tgtgaatgag 2160
atgtggcttt aattaagacc tcgaagggga cttggggggt tcggggcttt cgggggcggt 2220
cgggggttcg cggacccggg aagctctgag gacccagagg ccgggcgcgc tccgcccgcg 2280
gcgccgcccc ctccgtaact ttcccagtct ccgagggaag aggcggggtg tggggtgcgg 2340
ttaaaaggcg ccacggcggg agacaggtgt tgcggccccg cagcgcccgc gcgctcctct 2400
ccccgactcg gagcccctcg gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc 2460
ccagcgcaga gaccccaacg ccgagacccc cgccccggcc ccgccgcgct tcctcccgac 2520
gcagagcaaa ccgcccagag tagaagccat ggtgagcaag ggcgaggagc tgttcaccgg 2580
ggtggtgccc atcctggtcg agctggacgg cgacgtaaac ggccacaagt tcagcgtgtc 2640
cggcgagggc gagggcgatg ccacctacgg caagctgacc ctgaagttca tctgcaccac 2700
cggcaagctg cccgtgccct ggcccaccct cgtgaccacc ctgacctacg gcgtgcagtg 2760
cttcagccgc taccccgacc acatgaagca gcacgacttc ttcaagtccg ccatgcccga 2820
aggctacgtc caggagcgca ccatcttctt caaggacgac ggcaactaca agacccgcgc 2880
cgaggtgaag ttcgagggcg acaccctggt gaaccgcatc gagctgaagg gcatcgactt 2940
caaggaggac ggcaacatcc tggggcacaa gctggagtac aactacaaca gccacaacgt 3000
ctatatcatg gccgacaagc agaagaacgg catcaaggtg aacttcaaga tccgccacaa 3060
catcgaggac ggcagcgtgc agctcgccga ccactaccag cagaacaccc ccatcggcga 3120
cggccccgtg ctgctgcccg acaaccacta cctgagcacc cagtccgccc tgagcaaaga 3180
ccccaacgag aagcgcgatc acatggtcct gctggagttc gtgaccgccg ccgggatcac 3240
tctcggcatg gacgagctgt acaagtaaag gcgcgccacc cctgcaggga attccgcatt 3300
gcccagttgt tagattaaga aatagacagc atgagaggga tgaggcaacc cgtgctcagc 3360
tgtcaaggct cagtcgctag catttcccaa cacaaagatt ctgaccttaa atgcaaccat 3420
ttgaaacccc tgtaggcctc aggtgaaact ccagatgcca caatggagct ctgctcccct 3480
aaagcctcaa aacaaaggcc taattctatg cctgtcttaa ttttctttca cttaagttag 3540
ttccactgag accccaggct gttaggggtt attggtgtaa ggtactttca tattttaaac 3600
agaggatatc ggcatttgtt tctttctctg aggacaagag aaaaaagcca ggttccacag 3660
aggacacaga gaaggtttgg gtgtcctcct ggggttcttt ttgccaactt tccccacgtt 3720
aaaggtgaac attggttctt tcatttgctt tggaagtttt aatctctaac agtggacaaa 3780
gttaccagtg ccttaaactc tgttacactt tttggaagtg aaaactttgt agtatgatag 3840
gttattttga tgtaaagatg ttctggatac cattatatgt tccccctgtt tcagaggctc 3900
agattgtaat atgtaaatgg tatgtcattc gctactatga tttaatttga aatatggtct 3960
tttggttatg aatactttgc agcacagctg agaggctgtc tgttgtattc attgtggtca 4020
tagcacctaa caacattgta gcctcaatcg agtgagacag actagaagtt cctagtgatg 4080
gcttatgata gcaaatggcc tcatgtcaaa tatttagatg taattttgtg taagaaatac 4140
agactggatg taccaccaac tactacctgt aatgacaggc ctgtccaaca catctccctt 4200
ttccatgact gtggtagcca gcatcggaaa gaacgctgat ttaaagaggt cgcttgggaa 4260
ttttattgac acagtaccat ttaatgggga ggacaaaatg gggcagggga gggagaagtt 4320
tctgtcgtta aaaacagatt tggaaagact ggactctaaa gtctgttgat taaagatgag 4380
ctttgtctac ttcaaaagtt tgtttgctta ccccttcagc ctccaatttt ttaagtgaaa 4440
atatagctaa taacatgtga aaagaataga agctaaggtt tagataaata ttgagcagat 4500
ctataggaag attgaacctg aatattgcca ttatgcttga catggtttcc aaaaaatggt 4560
actccacata tttcagtgag ggtaagtatt ttcctgttgt caagaatagc attgtaaaag 4620
cattttgtaa taataaagaa tagctttaat gatatgcttg taactaaaat aattttgtaa 4680
tgtatcaaat acatttaaaa cattaaaata taatctctat aataatttaa aatctaatat 4740
ggttttaata gaacagcgat atcaagctta tcgataatca acctctggat tacaaaattt 4800
gtgaaagatt gactggtatt cttaactatg ttgctccttt tacgctatgt ggatacgctg 4860
ctttaatgcc tttgtatcat gctattgctt cccgtatggc tttcattttc tcctccttgt 4920
ataaatcctg gttgctgtct ctttatgagg agttgtggcc cgttgtcagg caacgtggcg 4980
tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg gggcattgcc accacctgtc 5040
agctcctttc cgggactttc gctttccccc tccctattgc cacggcggaa ctcatcgccg 5100
cctgccttgc ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggtgt 5160
tgtcggggaa atcatcgtcc tttccttggc tgctcgccta tgttgccacc tggattctgc 5220
gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg 5280
gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag acgagtcgga 5340
tctccctttg ggccgcctcc ccgcgaattc atcgataccg agcgctgctc gagagatctg 5400
tgatagcggc catcaagctg gctgtgcctt ctagttgcca gccatctgtt gtttgcccct 5460
cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg 5520
aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc 5580
aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggac acgtgcggac 5640
cgagcggccg caggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc 5700
gctcactgag gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc 5760
agtgagcgag cgagcgcgca gctgcctgca ggggcgcctg atgcggtatt ttctccttac 5820
gcatctgtgc ggtatttcac accgcatacg tcaaagcaac catagtacgc gccctgtagc 5880
ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac acttgccagc 5940
gccctagcgc ccgctccttt cgctttcttc ccttcctttc tcgccacgtt cgccggcttt 6000
ccccgtcaag ctctaaatcg ggggctccct ttagggttcc gatttagtgc tttacggcac 6060
ctcgacccca aaaaacttga tttgggtgat ggttcacgta gtgggccatc gccctgatag 6120
acggtttttc gccctttgac gttggagtcc acgttcttta atagtggact cttgttccaa 6180
actggaacaa cactcaaccc tatctcgggc tattcttttg atttataagg gattttgccg 6240
atttcggcct attggttaaa aaatgagctg atttaacaaa aatttaacgc gaattttaac 6300
aaaatattaa cgtttacaat tttatggtgc actctcagta caatctgctc tgatgccgca 6360
tagttaagcc agccccgaca cccgccaaca cccgctgacg cgccctgacg ggcttgtctg 6420
ctcccggcat ccgcttacag acaagctgtg accgtctccg ggagctgcat gtgtcagagg 6480
ttttcaccgt catcaccgaa acgcgcgaga cgaaagggcc tcgtgatacg cctattttta 6540
taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat 6600
gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg 6660
agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa 6720
catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac 6780
ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac 6840
atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt 6900
ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tattgacgcc 6960
gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca 7020
ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc 7080
ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag 7140
gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa 7200
ccggagctga atgaagccat accaaacgac gagcgtgaca cca 7243
<210> 51
<211> 7253
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 51
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tctcagctgg agtgacgcac 1320
ctcatccatg cgggcctggc gtctggaagg tggctgggtc tctcgggctt gagcaccatc 1380
atcttagctc caacatgtca ttattccttc ctcactgagg acttttctgc ttcctaattg 1440
gttgttgaag atgaggcccc catgctcttt taagaaaacc tgttgtgccc caggcttggc 1500
tgtgatgggc actgactcat acagaagtag aaaggcctgc tgagtcatca acactcgtgc 1560
gacgccctcg cattttcatt aatgatggcc tccctgccac acgtgaatca ctccagcccg 1620
agatctgaaa ccaggacaca ccccaggggc gaggtgacgc tgagtgagcc cagctgtgtc 1680
cctttcatga gaactcagag cacagggctc tgtgtgcatg gccgtcccct ccagagagga 1740
ggaagtaaat gccgggatta gtggaagatc atttccttct atttgccttg gcttacgtct 1800
ttcagaattc aaacacgtgc actgttgacc ctgcaatggt ggagtttttg gattttcctt 1860
cagtccgatt gctaaaatac ttccctctca tgtgagctgt tgtgaaagtc atcagccaga 1920
taccattcta aaaacaaaga atgtgcttct cgtatgttgc atgctggtta ctgaaatatt 1980
agggaattac ataaaggttt tctggggcac atattcaagc tgaatgataa aattgaaggt 2040
cacacaaagc taaggtcttt caaatcctga cccaattagc tctctgttag ctctctgact 2100
ttggacaagc tgtctggtcc tctgaagcat actttgttcg ccctgggtag gggccctctg 2160
ttttaacagc gtttggcatt aattaagacc tcgaagggga cttggggggt tcggggcttt 2220
cgggggcggt cgggggttcg cggacccggg aagctctgag gacccagagg ccgggcgcgc 2280
tccgcccgcg gcgccgcccc ctccgtaact ttcccagtct ccgagggaag aggcggggtg 2340
tggggtgcgg ttaaaaggcg ccacggcggg agacaggtgt tgcggccccg cagcgcccgc 2400
gcgctcctct ccccgactcg gagcccctcg gcggcgcccg gcccaggacc cgcctaggag 2460
cgcaggagcc ccagcgcaga gaccccaacg ccgagacccc cgccccggcc ccgccgcgct 2520
tcctcccgac gcagagcaaa ccgcccagag tagaagccat ggtgagcaag ggcgaggagc 2580
tgttcaccgg ggtggtgccc atcctggtcg agctggacgg cgacgtaaac ggccacaagt 2640
tcagcgtgtc cggcgagggc gagggcgatg ccacctacgg caagctgacc ctgaagttca 2700
tctgcaccac cggcaagctg cccgtgccct ggcccaccct cgtgaccacc ctgacctacg 2760
gcgtgcagtg cttcagccgc taccccgacc acatgaagca gcacgacttc ttcaagtccg 2820
ccatgcccga aggctacgtc caggagcgca ccatcttctt caaggacgac ggcaactaca 2880
agacccgcgc cgaggtgaag ttcgagggcg acaccctggt gaaccgcatc gagctgaagg 2940
gcatcgactt caaggaggac ggcaacatcc tggggcacaa gctggagtac aactacaaca 3000
gccacaacgt ctatatcatg gccgacaagc agaagaacgg catcaaggtg aacttcaaga 3060
tccgccacaa catcgaggac ggcagcgtgc agctcgccga ccactaccag cagaacaccc 3120
ccatcggcga cggccccgtg ctgctgcccg acaaccacta cctgagcacc cagtccgccc 3180
tgagcaaaga ccccaacgag aagcgcgatc acatggtcct gctggagttc gtgaccgccg 3240
ccgggatcac tctcggcatg gacgagctgt acaagtaaag gcgcgccacc cctgcaggga 3300
attccgcatt gcccagttgt tagattaaga aatagacagc atgagaggga tgaggcaacc 3360
cgtgctcagc tgtcaaggct cagtcgctag catttcccaa cacaaagatt ctgaccttaa 3420
atgcaaccat ttgaaacccc tgtaggcctc aggtgaaact ccagatgcca caatggagct 3480
ctgctcccct aaagcctcaa aacaaaggcc taattctatg cctgtcttaa ttttctttca 3540
cttaagttag ttccactgag accccaggct gttaggggtt attggtgtaa ggtactttca 3600
tattttaaac agaggatatc ggcatttgtt tctttctctg aggacaagag aaaaaagcca 3660
ggttccacag aggacacaga gaaggtttgg gtgtcctcct ggggttcttt ttgccaactt 3720
tccccacgtt aaaggtgaac attggttctt tcatttgctt tggaagtttt aatctctaac 3780
agtggacaaa gttaccagtg ccttaaactc tgttacactt tttggaagtg aaaactttgt 3840
agtatgatag gttattttga tgtaaagatg ttctggatac cattatatgt tccccctgtt 3900
tcagaggctc agattgtaat atgtaaatgg tatgtcattc gctactatga tttaatttga 3960
aatatggtct tttggttatg aatactttgc agcacagctg agaggctgtc tgttgtattc 4020
attgtggtca tagcacctaa caacattgta gcctcaatcg agtgagacag actagaagtt 4080
cctagtgatg gcttatgata gcaaatggcc tcatgtcaaa tatttagatg taattttgtg 4140
taagaaatac agactggatg taccaccaac tactacctgt aatgacaggc ctgtccaaca 4200
catctccctt ttccatgact gtggtagcca gcatcggaaa gaacgctgat ttaaagaggt 4260
cgcttgggaa ttttattgac acagtaccat ttaatgggga ggacaaaatg gggcagggga 4320
gggagaagtt tctgtcgtta aaaacagatt tggaaagact ggactctaaa gtctgttgat 4380
taaagatgag ctttgtctac ttcaaaagtt tgtttgctta ccccttcagc ctccaatttt 4440
ttaagtgaaa atatagctaa taacatgtga aaagaataga agctaaggtt tagataaata 4500
ttgagcagat ctataggaag attgaacctg aatattgcca ttatgcttga catggtttcc 4560
aaaaaatggt actccacata tttcagtgag ggtaagtatt ttcctgttgt caagaatagc 4620
attgtaaaag cattttgtaa taataaagaa tagctttaat gatatgcttg taactaaaat 4680
aattttgtaa tgtatcaaat acatttaaaa cattaaaata taatctctat aataatttaa 4740
aatctaatat ggttttaata gaacagcgat atcaagctta tcgataatca acctctggat 4800
tacaaaattt gtgaaagatt gactggtatt cttaactatg ttgctccttt tacgctatgt 4860
ggatacgctg ctttaatgcc tttgtatcat gctattgctt cccgtatggc tttcattttc 4920
tcctccttgt ataaatcctg gttgctgtct ctttatgagg agttgtggcc cgttgtcagg 4980
caacgtggcg tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg gggcattgcc 5040
accacctgtc agctcctttc cgggactttc gctttccccc tccctattgc cacggcggaa 5100
ctcatcgccg cctgccttgc ccgctgctgg acaggggctc ggctgttggg cactgacaat 5160
tccgtggtgt tgtcggggaa atcatcgtcc tttccttggc tgctcgccta tgttgccacc 5220
tggattctgc gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc agcggacctt 5280
ccttcccgcg gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag 5340
acgagtcgga tctccctttg ggccgcctcc ccgcgaattc atcgataccg agcgctgctc 5400
gagagatctg tgatagcggc catcaagctg gctgtgcctt ctagttgcca gccatctgtt 5460
gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc 5520
taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt 5580
ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggac 5640
acgtgcggac cgagcggccg caggaacccc tagtgatgga gttggccact ccctctctgc 5700
gcgctcgctc gctcactgag gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc 5760
gggcggcctc agtgagcgag cgagcgcgca gctgcctgca ggggcgcctg atgcggtatt 5820
ttctccttac gcatctgtgc ggtatttcac accgcatacg tcaaagcaac catagtacgc 5880
gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac 5940
acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc tcgccacgtt 6000
cgccggcttt ccccgtcaag ctctaaatcg ggggctccct ttagggttcc gatttagtgc 6060
tttacggcac ctcgacccca aaaaacttga tttgggtgat ggttcacgta gtgggccatc 6120
gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta atagtggact 6180
cttgttccaa actggaacaa cactcaaccc tatctcgggc tattcttttg atttataagg 6240
gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa aatttaacgc 6300
gaattttaac aaaatattaa cgtttacaat tttatggtgc actctcagta caatctgctc 6360
tgatgccgca tagttaagcc agccccgaca cccgccaaca cccgctgacg cgccctgacg 6420
ggcttgtctg ctcccggcat ccgcttacag acaagctgtg accgtctccg ggagctgcat 6480
gtgtcagagg ttttcaccgt catcaccgaa acgcgcgaga cgaaagggcc tcgtgatacg 6540
cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt 6600
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 6660
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 6720
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 6780
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 6840
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 6900
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 6960
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 7020
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 7080
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 7140
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 7200
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca cca 7253
<210> 52
<211> 7057
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 52
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg ttctaggtag acaactaaga 1320
tgttcatctt atggtttaat gtttagttgt aaaggttgtt tgcttctcat ttggttccaa 1380
gaaagagtat ttaggccaat ttcagggaga aatatgtgta tagatatatt catatgtcaa 1440
actgattagt gctgaatgtc acatttccat attctaataa catttctagc aaagaagagg 1500
acacagtgaa gagagaattg cccgcattgt cattgtctct ttctgagcct agaacgccta 1560
acacttgggt gtggagagac tcagcctcaa ttcactttct agcagccact gagatgtgct 1620
tgcctggggt gccccctggc aggcagggct ggaactgctt tccagtaccc acacggactg 1680
tgaacgaatc tttctttgtg ctttgtgtac agaatggaag ttcaacaaat atttgttgaa 1740
tgtgtatgtc cttccaatac gcagcagccc agagcaaacg tggtaatctt gtgtgtgttc 1800
atgtgaaagc agaatttaat ggtgctttta agcaccaaag tttaagatgc acgagaaaac 1860
tgtatctcca ttttttcctt ttcgtttaca attacttgta taagccaggc acggtggtgg 1920
ctcacgcctg taatcccagc actttgggag gccgaggcgg gcggatcaca tgaggtcggg 1980
agttaattaa gacctcgaag gggacttggg gggttcgggg ctttcggggg cggtcggggg 2040
ttcgcggacc cgggaagctc tgaggaccca gaggccgggc gcgctccgcc cgcggcgccg 2100
ccccctccgt aactttccca gtctccgagg gaagaggcgg ggtgtggggt gcggttaaaa 2160
ggcgccacgg cgggagacag gtgttgcggc cccgcagcgc ccgcgcgctc ctctccccga 2220
ctcggagccc ctcggcggcg cccggcccag gacccgccta ggagcgcagg agccccagcg 2280
cagagacccc aacgccgaga cccccgcccc ggccccgccg cgcttcctcc cgacgcagag 2340
caaaccgccc agagtagaag ccatggtgag caagggcgag gagctgttca ccggggtggt 2400
gcccatcctg gtcgagctgg acggcgacgt aaacggccac aagttcagcg tgtccggcga 2460
gggcgagggc gatgccacct acggcaagct gaccctgaag ttcatctgca ccaccggcaa 2520
gctgcccgtg ccctggccca ccctcgtgac caccctgacc tacggcgtgc agtgcttcag 2580
ccgctacccc gaccacatga agcagcacga cttcttcaag tccgccatgc ccgaaggcta 2640
cgtccaggag cgcaccatct tcttcaagga cgacggcaac tacaagaccc gcgccgaggt 2700
gaagttcgag ggcgacaccc tggtgaaccg catcgagctg aagggcatcg acttcaagga 2760
ggacggcaac atcctggggc acaagctgga gtacaactac aacagccaca acgtctatat 2820
catggccgac aagcagaaga acggcatcaa ggtgaacttc aagatccgcc acaacatcga 2880
ggacggcagc gtgcagctcg ccgaccacta ccagcagaac acccccatcg gcgacggccc 2940
cgtgctgctg cccgacaacc actacctgag cacccagtcc gccctgagca aagaccccaa 3000
cgagaagcgc gatcacatgg tcctgctgga gttcgtgacc gccgccggga tcactctcgg 3060
catggacgag ctgtacaagt aaaggcgcgc cacccctgca gggaattccg cattgcccag 3120
ttgttagatt aagaaataga cagcatgaga gggatgaggc aacccgtgct cagctgtcaa 3180
ggctcagtcg ctagcatttc ccaacacaaa gattctgacc ttaaatgcaa ccatttgaaa 3240
cccctgtagg cctcaggtga aactccagat gccacaatgg agctctgctc ccctaaagcc 3300
tcaaaacaaa ggcctaattc tatgcctgtc ttaattttct ttcacttaag ttagttccac 3360
tgagacccca ggctgttagg ggttattggt gtaaggtact ttcatatttt aaacagagga 3420
tatcggcatt tgtttctttc tctgaggaca agagaaaaaa gccaggttcc acagaggaca 3480
cagagaaggt ttgggtgtcc tcctggggtt ctttttgcca actttcccca cgttaaaggt 3540
gaacattggt tctttcattt gctttggaag ttttaatctc taacagtgga caaagttacc 3600
agtgccttaa actctgttac actttttgga agtgaaaact ttgtagtatg ataggttatt 3660
ttgatgtaaa gatgttctgg ataccattat atgttccccc tgtttcagag gctcagattg 3720
taatatgtaa atggtatgtc attcgctact atgatttaat ttgaaatatg gtcttttggt 3780
tatgaatact ttgcagcaca gctgagaggc tgtctgttgt attcattgtg gtcatagcac 3840
ctaacaacat tgtagcctca atcgagtgag acagactaga agttcctagt gatggcttat 3900
gatagcaaat ggcctcatgt caaatattta gatgtaattt tgtgtaagaa atacagactg 3960
gatgtaccac caactactac ctgtaatgac aggcctgtcc aacacatctc ccttttccat 4020
gactgtggta gccagcatcg gaaagaacgc tgatttaaag aggtcgcttg ggaattttat 4080
tgacacagta ccatttaatg gggaggacaa aatggggcag gggagggaga agtttctgtc 4140
gttaaaaaca gatttggaaa gactggactc taaagtctgt tgattaaaga tgagctttgt 4200
ctacttcaaa agtttgtttg cttacccctt cagcctccaa ttttttaagt gaaaatatag 4260
ctaataacat gtgaaaagaa tagaagctaa ggtttagata aatattgagc agatctatag 4320
gaagattgaa cctgaatatt gccattatgc ttgacatggt ttccaaaaaa tggtactcca 4380
catatttcag tgagggtaag tattttcctg ttgtcaagaa tagcattgta aaagcatttt 4440
gtaataataa agaatagctt taatgatatg cttgtaacta aaataatttt gtaatgtatc 4500
aaatacattt aaaacattaa aatataatct ctataataat ttaaaatcta atatggtttt 4560
aatagaacag cgatatcaag cttatcgata atcaacctct ggattacaaa atttgtgaaa 4620
gattgactgg tattcttaac tatgttgctc cttttacgct atgtggatac gctgctttaa 4680
tgcctttgta tcatgctatt gcttcccgta tggctttcat tttctcctcc ttgtataaat 4740
cctggttgct gtctctttat gaggagttgt ggcccgttgt caggcaacgt ggcgtggtgt 4800
gcactgtgtt tgctgacgca acccccactg gttggggcat tgccaccacc tgtcagctcc 4860
tttccgggac tttcgctttc cccctcccta ttgccacggc ggaactcatc gccgcctgcc 4920
ttgcccgctg ctggacaggg gctcggctgt tgggcactga caattccgtg gtgttgtcgg 4980
ggaaatcatc gtcctttcct tggctgctcg cctatgttgc cacctggatt ctgcgcggga 5040
cgtccttctg ctacgtccct tcggccctca atccagcgga ccttccttcc cgcggcctgc 5100
tgccggctct gcggcctctt ccgcgtcttc gccttcgccc tcagacgagt cggatctccc 5160
tttgggccgc ctccccgcga attcatcgat accgagcgct gctcgagaga tctgtgatag 5220
cggccatcaa gctggctgtg ccttctagtt gccagccatc tgttgtttgc ccctcccccg 5280
tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa aatgaggaaa 5340
ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg gggcaggaca 5400
gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggacacgtgc ggaccgagcg 5460
gccgcaggaa cccctagtga tggagttggc cactccctct ctgcgcgctc gctcgctcac 5520
tgaggccggg cgaccaaagg tcgcccgacg cccgggcttt gcccgggcgg cctcagtgag 5580
cgagcgagcg cgcagctgcc tgcaggggcg cctgatgcgg tattttctcc ttacgcatct 5640
gtgcggtatt tcacaccgca tacgtcaaag caaccatagt acgcgccctg tagcggcgca 5700
ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta 5760
gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg ctttccccgt 5820
caagctctaa atcgggggct ccctttaggg ttccgattta gtgctttacg gcacctcgac 5880
cccaaaaaac ttgatttggg tgatggttca cgtagtgggc catcgccctg atagacggtt 5940
tttcgccctt tgacgttgga gtccacgttc tttaatagtg gactcttgtt ccaaactgga 6000
acaacactca accctatctc gggctattct tttgatttat aagggatttt gccgatttcg 6060
gcctattggt taaaaaatga gctgatttaa caaaaattta acgcgaattt taacaaaata 6120
ttaacgttta caattttatg gtgcactctc agtacaatct gctctgatgc cgcatagtta 6180
agccagcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg 6240
gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca 6300
ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga tacgcctatt tttataggtt 6360
aatgtcatga taataatggt ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc 6420
ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa 6480
taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc 6540
cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa 6600
acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa 6660
ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg 6720
atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga cgccgggcaa 6780
gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc 6840
acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc 6900
atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta 6960
accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag 7020
ctgaatgaag ccataccaaa cgacgagcgt gacacca 7057
<210> 53
<211> 212
<212> DNA
<213> Homo sapiens
<400> 53
ggggtgcggt taaaaggcgc cacggcggga gacaggtgtt gcggccccgc agcgcccgcg 60
cgctcctctc cccgactcgg agcccctcgg cggcgcccgg cccaggaccc gcctaggagc 120
gcaggagccc cagcgcagag accccaacgc cgagaccccc gccccggccc cgccgcgctt 180
cctcccgacg cagagcaaac cgcccagagt ag 212
<210> 54
<211> 784
<212> DNA
<213> Homo sapiens
<400> 54
aagcaggtga gtttgtggtg tcgccgatgt cccttcgggg tactctagcg cagccgcctg 60
gctacttgac ccactgccac caaacgtttt aaattcaccg aaagcttagc ttcgaagcaa 120
agctccgttt cgccggtgaa gcaggaagcc ttcgctgcag gaactgacct ttacctcttg 180
gagcggcttc tgcagaaaaa tccccgggca gagatttggg cggagtttgc ctagaactaa 240
cgcggagcca gccgatcccg gcctaccccg gggccaagat tttaaggggt gaagagtccc 300
ttttgccttt tctggatcct ggtgattcac ctagtgtctt ccctaaggaa ctgaaccaac 360
tcctccgctg gcctctggca gccctccagg cggtgcagga tggcgtgggc ccggtaggaa 420
gctgcatgta accgcccagg gtcgggaggc caggagggca gctcctcctc tgacttgaat 480
attgaaaaca agaggatgct tttaagaaaa agaagaagga ggattcacta ccagctctga 540
agggtggaaa agagatgatt catccggatt gtggagaggg tggaatcttg tttaggagag 600
cgttggttgt ggcaggcagg gtgtaactat gaatcagtga agacaattca catcctggga 660
tgaaaagaag gccatgggct cacaggagat tatccactgg cctctccaca tccgcttgca 720
gtaaggagtg tgggactctc ccaagcttca gcgctgaact gcaatgcagt gacgtcgctt 780
aaga 784
<210> 55
<211> 771
<212> DNA
<213> Homo sapiens
<400> 55
tcatccatgt ccctacaaag gacatgaact catcattttt tatggctgca taagtcgttc 60
tttcaaacac cctgcagtca gcttctcctc acgagaaacc acatgaaagc cctcggggaa 120
atgcctctcg ggatctactt ttctttgtgt gtatcctact tagcctatcg gtttctgctt 180
cctgtggggc tacagccgtc tcgtcttttt ctgctggctc ctttgctctg ttctccagtg 240
gctatcttct ttctcctttc tttcaaatgt tctcccttat cttctctgat acagacagaa 300
ggtcaggagc cacgcccatt acactgacag aacccgatgt cctgatgcgc tctgtgcctc 360
ccagatttgg atgtggatgc gaggcgagct ggccagagag caatcatttc agcgagggtc 420
gttattccca tcttctctct taggacggag gtagggggac ttctggcccc aaatgttcct 480
tcttccagct gtggctgcct ccatcccgca gagtgagcct ttaatttgga gatcctaatg 540
ccccagtgct gtgccaggca cagtacacgt tctgcatgga ggacggttta cgctcccctt 600
acagaagagg aaggacactc agaaggctga actgttctgc ctaaggtcac cgagttgcta 660
aggcaagaag cagcctccaa ttcctgcctt actgatttct gggatgtgaa accaaaaggg 720
tgaggcggca agccccggct gccctcgggg gctcttccca agtgctctct t 771
<210> 56
<211> 771
<212> DNA
<213> Homo sapiens
<400> 56
aagagagcac ttgggaagag cccccgaggg cagccggggc ttgccgcctc acccttttgg 60
tttcacatcc cagaaatcag taaggcagga attggaggct gcttcttgcc ttagcaactc 120
ggtgacctta ggcagaacag ttcagccttc tgagtgtcct tcctcttctg taaggggagc 180
gtaaaccgtc ctccatgcag aacgtgtact gtgcctggca cagcactggg gcattaggat 240
ctccaaatta aaggctcact ctgcgggatg gaggcagcca cagctggaag aaggaacatt 300
tggggccaga agtcccccta cctccgtcct aagagagaag atgggaataa cgaccctcgc 360
tgaaatgatt gctctctggc cagctcgcct cgcatccaca tccaaatctg ggaggcacag 420
agcgcatcag gacatcgggt tctgtcagtg taatgggcgt ggctcctgac cttctgtctg 480
tatcagagaa gataagggag aacatttgaa agaaaggaga aagaagatag ccactggaga 540
acagagcaaa ggagccagca gaaaaagacg agacggctgt agccccacag gaagcagaaa 600
ccgataggct aagtaggata cacacaaaga aaagtagatc ccgagaggca tttccccgag 660
ggctttcatg tggtttctcg tgaggagaag ctgactgcag ggtgtttgaa agaacgactt 720
atgcagccat aaaaaatgat gagttcatgt cctttgtagg gacatggatg a 771
<210> 57
<211> 699
<212> DNA
<213> Homo sapiens
<400> 57
cttgcttacc cagactcaga gaagtctccc tgttctgtcc tagctagtga ttcctgtgtt 60
gtgtgcattc gtcttttcca gagcaaaccg cccagagtag aagatggatt ggggcacgct 120
gcagacgatc ctggggggtg tgaacaaaca ctccaccagc attggaaaga tctggctcac 180
cgtcctcttc atttttcgca ttatgatcct cgttgtggct gcaaaggagg tgtggggaga 240
tgagcaggcc gactttgtct gcaacaccct gcagccaggc tgcaagaacg tgtgctacga 300
tcactacttc cccatctccc acatccggct atgggccctg cagctgatct tcgtgtccac 360
gccagcgctc ctagtggcca tgcacgtggc ctaccggaga catgagaaga agaggaagtt 420
catcaagggg gagataaaga gtgaatttaa ggacatcgag gagatcaaaa cccagaaggt 480
ccgcatcgaa ggctccctgt ggtggaccta cacaagcagc atcttcttcc gggtcatctt 540
cgaagccgcc ttcatgtacg tcttctatgt catgtacgac ggcttctcca tgcagcggct 600
ggtgaagtgc aacgcctggc cttgtcccaa cactgtggac tgctttgtgt cccggcccac 660
ggagaagact gtcttcacag tgttcatgat tgcagtgtc 699
<210> 58
<211> 699
<212> DNA
<213> Homo sapiens
<400> 58
gacactgcaa tcatgaacac tgtgaagaca gtcttctccg tgggccggga cacaaagcag 60
tccacagtgt tgggacaagg ccaggcgttg cacttcacca gccgctgcat ggagaagccg 120
tcgtacatga catagaagac gtacatgaag gcggcttcga agatgacccg gaagaagatg 180
ctgcttgtgt aggtccacca cagggagcct tcgatgcgga ccttctgggt tttgatctcc 240
tcgatgtcct taaattcact ctttatctcc cccttgatga acttcctctt cttctcatgt 300
ctccggtagg ccacgtgcat ggccactagg agcgctggcg tggacacgaa gatcagctgc 360
agggcccata gccggatgtg ggagatgggg aagtagtgat cgtagcacac gttcttgcag 420
cctggctgca gggtgttgca gacaaagtcg gcctgctcat ctccccacac ctcctttgca 480
gccacaacga ggatcataat gcgaaaaatg aagaggacgg tgagccagat ctttccaatg 540
ctggtggagt gtttgttcac accccccagg atcgtctgca gcgtgcccca atccatcttc 600
tactctgggc ggtttgctct ggaaaagacg aatgcacaca acacaggaat cactagctag 660
gacagaacag ggagacttct ctgagtctgg gtaagcaag 699
<210> 59
<211> 700
<212> DNA
<213> Homo sapiens
<400> 59
gcctgacaca gtctgagcct cctcaggcgg cctcaggggt tgggatagag tggagaattc 60
aggcaagaat gccaacccta gctccaggcc tgggacccac aggcctgggg aaaagagtgg 120
ttgccccgtc ttgagacagc cgaaaactgt gtccccagga ttgttggttt cataaaagca 180
agtagctagg gaggccacat ttacagggga tcacagaaca cttgggtagg ggcttgctgt 240
aggtgtcatc agggaagtgg gggacggcag gagggatgtg gcccagtacg cagatgaaga 300
caggtgatca tccgctgggc cacacgtggc agggatatgg gcagagtgag cttggctggc 360
cccaggctcc aaagctgccc agcccccgct gaaggtgagg cctcagctgg tgggaatgtc 420
accttccagg tgactggctg gctccaaagg cctttgcatg atctccagga gtttggaggg 480
gagaggccac attccaaatc cagcttgaaa agtgctctgt atcaccctca gcactgaggg 540
ggccagagtc taggaggaag gaggcacagg gttggggggc agccctgacc tggtggccgc 600
acctgccagg tcccgagaga caacccatct cacacacatt caaaaacaca caccagggag 660
cacatggcta aacaaatcgc actaaacgcc aggaaggcag 700
<210> 60
<211> 700
<212> DNA
<213> Homo sapiens
<400> 60
ctgccttcct ggcgtttagt gcgatttgtt tagccatgtg ctccctggtg tgtgtttttg 60
aatgtgtgtg agatgggttg tctctcggga cctggcaggt gcggccacca ggtcagggct 120
gccccccaac cctgtgcctc cttcctccta gactctggcc ccctcagtgc tgagggtgat 180
acagagcact tttcaagctg gatttggaat gtggcctctc ccctccaaac tcctggagat 240
catgcaaagg cctttggagc cagccagtca cctggaaggt gacattccca ccagctgagg 300
cctcaccttc agcgggggct gggcagcttt ggagcctggg gccagccaag ctcactctgc 360
ccatatccct gccacgtgtg gcccagcgga tgatcacctg tcttcatctg cgtactgggc 420
cacatccctc ctgccgtccc ccacttccct gatgacacct acagcaagcc cctacccaag 480
tgttctgtga tcccctgtaa atgtggcctc cctagctact tgcttttatg aaaccaacaa 540
tcctggggac acagttttcg gctgtctcaa gacggggcaa ccactctttt ccccaggcct 600
gtgggtccca ggcctggagc tagggttggc attcttgcct gaattctcca ctctatccca 660
acccctgagg ccgcctgagg aggctcagac tgtgtcaggc 700
<210> 61
<211> 6374
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 61
ttgctggcct tttgctcaca tgtcctgcag gcagctgcgc gctcgctcgc tcactgaggc 60
cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag tgagcgagcg 120
agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc gcacgcgttt 180
aattaagacc tcgaagggga cttggggggt tcggggcttt cgggggcggt cgggggttcg 240
cggacccggg aagctctgag gacccagagg ccgggcgcgc tccgcccgcg gcgccgcccc 300
ctccgtaact ttcccagtct ccgagggaag aggcggggtg tggggtgcgg ttaaaaggcg 360
ccacggcggg agacaggtgt tgcggccccg cagcgcccgc gcgctcctct ccccgactcg 420
gagcccctcg gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc ccagcgcaga 480
gaccccaacg ccgagacccc cgccccggcc ccgccgcgct tcctcccgac gcagagcaaa 540
ccgcccagag tagaagcgga tccgccacca tggattgggg cacgctgcag acgatcctgg 600
ggggtgtgaa caaacactcc accagcattg gaaagatctg gctcaccgtc ctcttcattt 660
ttcgcattat gatcctcgtt gtggctgcaa aggaggtgtg gggagatgag caggccgact 720
ttgtctgcaa caccctgcag ccaggctgca agaacgtgtg ctacgatcac tacttcccca 780
tctcccacat ccggctatgg gccctgcagc tgatcttcgt gtccacgcca gcgctcctag 840
tggccatgca cgtggcctac cggagacatg agaagaagag gaagttcatc aagggggaga 900
taaagagtga atttaaggac atcgaggaga tcaaaaccca gaaggtccgc atcgaaggct 960
ccctgtggtg gacctacaca agcagcatct tcttccgggt catcttcgaa gccgccttca 1020
tgtacgtctt ctatgtcatg tacgacggct tctccatgca gcggctggtg aagtgcaacg 1080
cctggccttg tcccaacact gtggactgct ttgtgtcccg gcccacggag aagactgtct 1140
tcacagtgtt catgattgca gtgtctggaa tttgcatcct gctgaatgtc actgaattgt 1200
gttatttgct aattagatat tgttctggga agtcaaaaaa gccagtttac ccatacgatg 1260
ttccagatta cgcttaaggc gcgccacccc tgcagggaat tccgcattgc ccagttgtta 1320
gattaagaaa tagacagcat gagagggatg aggcaacccg tgctcagctg tcaaggctca 1380
gtcgctagca tttcccaaca caaagattct gaccttaaat gcaaccattt gaaacccctg 1440
taggcctcag gtgaaactcc agatgccaca atggagctct gctcccctaa agcctcaaaa 1500
caaaggccta attctatgcc tgtcttaatt ttctttcact taagttagtt ccactgagac 1560
cccaggctgt taggggttat tggtgtaagg tactttcata ttttaaacag aggatatcgg 1620
catttgtttc tttctctgag gacaagagaa aaaagccagg ttccacagag gacacagaga 1680
aggtttgggt gtcctcctgg ggttcttttt gccaactttc cccacgttaa aggtgaacat 1740
tggttctttc atttgctttg gaagttttaa tctctaacag tggacaaagt taccagtgcc 1800
ttaaactctg ttacactttt tggaagtgaa aactttgtag tatgataggt tattttgatg 1860
taaagatgtt ctggatacca ttatatgttc cccctgtttc agaggctcag attgtaatat 1920
gtaaatggta tgtcattcgc tactatgatt taatttgaaa tatggtcttt tggttatgaa 1980
tactttgcag cacagctgag aggctgtctg ttgtattcat tgtggtcata gcacctaaca 2040
acattgtagc ctcaatcgag tgagacagac tagaagttcc tagtgatggc ttatgatagc 2100
aaatggcctc atgtcaaata tttagatgta attttgtgta agaaatacag actggatgta 2160
ccaccaacta ctacctgtaa tgacaggcct gtccaacaca tctccctttt ccatgactgt 2220
ggtagccagc atcggaaaga acgctgattt aaagaggtcg cttgggaatt ttattgacac 2280
agtaccattt aatggggagg acaaaatggg gcaggggagg gagaagtttc tgtcgttaaa 2340
aacagatttg gaaagactgg actctaaagt ctgttgatta aagatgagct ttgtctactt 2400
caaaagtttg tttgcttacc ccttcagcct ccaatttttt aagtgaaaat atagctaata 2460
acatgtgaaa agaatagaag ctaaggttta gataaatatt gagcagatct ataggaagat 2520
tgaacctgaa tattgccatt atgcttgaca tggtttccaa aaaatggtac tccacatatt 2580
tcagtgaggg taagtatttt cctgttgtca agaatagcat tgtaaaagca ttttgtaata 2640
ataaagaata gctttaatga tatgcttgta actaaaataa ttttgtaatg tatcaaatac 2700
atttaaaaca ttaaaatata atctctataa taatttaaaa tctaatatgg ttttaataga 2760
acagcgatat caagcttatc gataatcaac ctctggatta caaaatttgt gaaagattga 2820
ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct ttaatgcctt 2880
tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat aaatcctggt 2940
tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg 3000
tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag ctcctttccg 3060
ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc tgccttgccc 3120
gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg tcggggaaat 3180
catcgtcctt tccttggctg ctcgcctatg ttgccacctg gattctgcgc gggacgtcct 3240
tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg 3300
ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc tccctttggg 3360
ccgcctcccc gcgaattcat cgataccgag cgctgctcga gagatctgtg atagcggcca 3420
tcaagctggc tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 3480
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 3540
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 3600
gggaggattg ggaagacaat agcaggcatg ctggggacac gtgcggaccg agcggccgca 3660
ggaaccccta gtgatggagt tggccactcc ctctctgcgc gctcgctcgc tcactgaggc 3720
cgggcgacca aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg 3780
agcgcgcagc tgcctgcagg ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg 3840
tatttcacac cgcatacgtc aaagcaacca tagtacgcgc cctgtagcgg cgcattaagc 3900
gcggcgggtg tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc 3960
gctcctttcg ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct 4020
ctaaatcggg ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa 4080
aaacttgatt tgggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc 4140
cctttgacgt tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca 4200
ctcaacccta tctcgggcta ttcttttgat ttataaggga ttttgccgat ttcggcctat 4260
tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg 4320
tttacaattt tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag 4380
ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct cccggcatcc 4440
gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca 4500
tcaccgaaac gcgcgagacg aaagggcctc gtgatacgcc tatttttata ggttaatgtc 4560
atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt gcgcggaacc 4620
cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag acaataaccc 4680
tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc 4740
gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc agaaacgctg 4800
gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat cgaactggat 4860
ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc aatgatgagc 4920
acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg gcaagagcaa 4980
ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc agtcacagaa 5040
aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat aaccatgagt 5100
gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga gctaaccgct 5160
tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc ggagctgaat 5220
gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc aacaacgttg 5280
cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt aatagactgg 5340
atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc tggctggttt 5400
attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc agcactgggg 5460
ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca ggcaactatg 5520
gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca ttggtaactg 5580
tcagaccaag tttactcata tatactttag attgatttaa aacttcattt ttaatttaaa 5640
aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta acgtgagttt 5700
tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt 5760
tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt 5820
ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag 5880
ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta 5940
gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat 6000
aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg 6060
ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg 6120
agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac 6180
aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga 6240
aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt 6300
ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta 6360
cggttcctgg cctt 6374
<210> 62
<211> 6347
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 62
cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 60
aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 120
attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 180
tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 240
aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 300
gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 360
cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 420
gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca 480
agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 540
ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 600
cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 660
acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 720
gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 780
ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 840
agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 900
cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgtcc tgcaggcagc 960
tgcgcgctcg ctcgctcact gaggccgccc gggcaaagcc cgggcgtcgg gcgacctttg 1020
gtcgcccggc ctcagtgagc gagcgagcgc gcagagaggg agtggccaac tccatcacta 1080
ggggttcctg cggccgcacg cgtttaatta agacctcgaa ggggacttgg ggggttcggg 1140
gctttcgggg gcggtcgggg gttcgcggac ccgggaagct ctgaggaccc agaggccggg 1200
cgcgctccgc ccgcggcgcc gccccctccg taactttccc agtctccgag ggaagaggcg 1260
gggtgtgggg tgcggttaaa aggcgccacg gcgggagaca ggtgttgcgg ccccgcagcg 1320
cccgcgcgct cctctccccg actcggagcc cctcggcggc gcccggccca ggacccgcct 1380
aggagcgcag gagccccagc gcagagaccc caacgccgag acccccgccc cggccccgcc 1440
gcgcttcctc ccgacgcaga gcaaaccgcc cagagtagaa gcggatccgc caccatggat 1500
tggggcacac tccagagcat cctcgggggt gtcaacaaac actccaccag cattggaaag 1560
atctggctca cggtcctctt catcttccgc atcatgatcc tcgtggtggc tgcaaaggag 1620
gtgtggggag atgagcaagc cgattttgtc tgcaacacgc tccagcctgg ctgcaagaat 1680
gtatgctacg accaccactt ccccatctct cacatccggc tctgggctct gcagctgatc 1740
atggtgtcca cgccagccct cctggtagct atgcatgtgg cctaccggag acatgaaaag 1800
aaacggaagt tcatgaaggg agagataaag aacgagttta aggacatcga agagatcaaa 1860
acccagaagg tccgtatcga agggtccctg tggtggacct acaccaccag catcttcttc 1920
cgggtcatct ttgaagccgt cttcatgtac gtcttttaca tcatgtacaa tggcttcttc 1980
atgcaacgtc tggtgaaatg caacgcttgg ccctgcccca atacagtgga ctgcttcatt 2040
tccaggccca cagaaaagac tgtcttcacc gtgtttatga tttctgtgtc tggaatttgc 2100
attctgctaa atatcacaga gctgtgctat ttgttcgtta ggtattgctc aggaaagtcc 2160
aaaagaccag tctaaggcgc gccacccctg cagggaattc cgcattgccc agttgttaga 2220
ttaagaaata gacagcatga gagggatgag gcaacccgtg ctcagctgtc aaggctcagt 2280
cgctagcatt tcccaacaca aagattctga ccttaaatgc aaccatttga aacccctgta 2340
ggcctcaggt gaaactccag atgccacaat ggagctctgc tcccctaaag cctcaaaaca 2400
aaggcctaat tctatgcctg tcttaatttt ctttcactta agttagttcc actgagaccc 2460
caggctgtta ggggttattg gtgtaaggta ctttcatatt ttaaacagag gatatcggca 2520
tttgtttctt tctctgagga caagagaaaa aagccaggtt ccacagagga cacagagaag 2580
gtttgggtgt cctcctgggg ttctttttgc caactttccc cacgttaaag gtgaacattg 2640
gttctttcat ttgctttgga agttttaatc tctaacagtg gacaaagtta ccagtgcctt 2700
aaactctgtt acactttttg gaagtgaaaa ctttgtagta tgataggtta ttttgatgta 2760
aagatgttct ggataccatt atatgttccc cctgtttcag aggctcagat tgtaatatgt 2820
aaatggtatg tcattcgcta ctatgattta atttgaaata tggtcttttg gttatgaata 2880
ctttgcagca cagctgagag gctgtctgtt gtattcattg tggtcatagc acctaacaac 2940
attgtagcct caatcgagtg agacagacta gaagttccta gtgatggctt atgatagcaa 3000
atggcctcat gtcaaatatt tagatgtaat tttgtgtaag aaatacagac tggatgtacc 3060
accaactact acctgtaatg acaggcctgt ccaacacatc tcccttttcc atgactgtgg 3120
tagccagcat cggaaagaac gctgatttaa agaggtcgct tgggaatttt attgacacag 3180
taccatttaa tggggaggac aaaatggggc aggggaggga gaagtttctg tcgttaaaaa 3240
cagatttgga aagactggac tctaaagtct gttgattaaa gatgagcttt gtctacttca 3300
aaagtttgtt tgcttacccc ttcagcctcc aattttttaa gtgaaaatat agctaataac 3360
atgtgaaaag aatagaagct aaggtttaga taaatattga gcagatctat aggaagattg 3420
aacctgaata ttgccattat gcttgacatg gtttccaaaa aatggtactc cacatatttc 3480
agtgagggta agtattttcc tgttgtcaag aatagcattg taaaagcatt ttgtaataat 3540
aaagaatagc tttaatgata tgcttgtaac taaaataatt ttgtaatgta tcaaatacat 3600
ttaaaacatt aaaatataat ctctataata atttaaaatc taatatggtt ttaatagaac 3660
agcgatatca agcttatcga taatcaacct ctggattaca aaatttgtga aagattgact 3720
ggtattctta actatgttgc tccttttacg ctatgtggat acgctgcttt aatgcctttg 3780
tatcatgcta ttgcttcccg tatggctttc attttctcct ccttgtataa atcctggttg 3840
ctgtctcttt atgaggagtt gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg 3900
tttgctgacg caacccccac tggttggggc attgccacca cctgtcagct cctttccggg 3960
actttcgctt tccccctccc tattgccacg gcggaactca tcgccgcctg ccttgcccgc 4020
tgctggacag gggctcggct gttgggcact gacaattccg tggtgttgtc ggggaaatca 4080
tcgtcctttc cttggctgct cgcctatgtt gccacctgga ttctgcgcgg gacgtccttc 4140
tgctacgtcc cttcggccct caatccagcg gaccttcctt cccgcggcct gctgccggct 4200
ctgcggcctc ttccgcgtct tcgccttcgc cctcagacga gtcggatctc cctttgggcc 4260
gcctccccgc gaattcatcg ataccgagcg ctgctcgaga gatctgtgat agcggccatc 4320
aagctggctg tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc 4380
ttgaccctgg aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg 4440
cattgtctga gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg 4500
gaggattggg aagacaatag caggcatgct ggggacacgt gcggaccgag cggccgcagg 4560
aacccctagt gatggagttg gccactccct ctctgcgcgc tcgctcgctc actgaggccg 4620
ggcgaccaaa ggtcgcccga cgcccgggct ttgcccgggc ggcctcagtg agcgagcgag 4680
cgcgcagctg cctgcagggg cgcctgatgc ggtattttct ccttacgcat ctgtgcggta 4740
tttcacaccg catacgtcaa agcaaccata gtacgcgccc tgtagcggcg cattaagcgc 4800
ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc 4860
tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct 4920
aaatcggggg ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa 4980
acttgatttg ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg tttttcgccc 5040
tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact 5100
caaccctatc tcgggctatt cttttgattt ataagggatt ttgccgattt cggcctattg 5160
gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt 5220
tacaatttta tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagcc 5280
ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc 5340
ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc 5400
accgaaacgc gcgagacgaa agggcctcgt gatacgccta tttttatagg ttaatgtcat 5460
gataataatg gtttcttaga cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc 5520
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 5580
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 5640
ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 5700
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 5760
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 5820
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 5880
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 5940
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 6000
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 6060
tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 6120
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 6180
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 6240
ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 6300
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattg 6347
<210> 63
<211> 6347
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 63
cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 60
aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 120
attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 180
tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 240
aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 300
gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 360
cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 420
gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca 480
agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 540
ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 600
cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 660
acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 720
gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 780
ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 840
agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 900
cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgtcc tgcaggcagc 960
tgcgcgctcg ctcgctcact gaggccgccc gggcaaagcc cgggcgtcgg gcgacctttg 1020
gtcgcccggc ctcagtgagc gagcgagcgc gcagagaggg agtggccaac tccatcacta 1080
ggggttcctg cggccgcacg cgtttaatta agacctcgaa ggggacttgg ggggttcggg 1140
gctttcgggg gcggtcgggg gttcgcggac ccgggaagct ctgaggaccc agaggccggg 1200
cgcgctccgc ccgcggcgcc gccccctccg taactttccc agtctccgag ggaagaggcg 1260
gggtgtgggg tgcggttaaa aggcgccacg gcgggagaca ggtgttgcgg ccccgcagcg 1320
cccgcgcgct cctctccccg actcggagcc cctcggcggc gcccggccca ggacccgcct 1380
aggagcgcag gagccccagc gcagagaccc caacgccgag acccccgccc cggccccgcc 1440
gcgcttcctc ccgacgcaga gcaaaccgcc cagagtagaa gcggatccgc caccatggat 1500
tggggcacgc tgcagacgat cctggggggt gtgaacaaac actccaccag cattggaaag 1560
atctggctca ccgtcctctt catttttcgc attatgatcc tcgttgtggc tgcaaaggag 1620
gtgtggggag atgagcaggc cgactttgtc tgcaacaccc tgcagccagg ctgcaagaac 1680
gtgtgctacg atcactactt ccccatctcc cacatccggc tatgggccct gcagctgatc 1740
ttcgtgtcca cgccagcgct cctagtggcc atgcacgtgg cctaccggag acatgagaag 1800
aagaggaagt tcatcaaggg ggagataaag agtgaattta aggacatcga ggagatcaaa 1860
acccagaagg tccgcatcga aggctccctg tggtggacct acacaagcag catcttcttc 1920
cgggtcatct tcgaagccgc cttcatgtac gtcttctatg tcatgtacga cggcttctcc 1980
atgcagcggc tggtgaagtg caacgcctgg ccttgtccca acactgtgga ctgctttgtg 2040
tcccggccca cggagaagac tgtcttcaca gtgttcatga ttgcagtgtc tggaatttgc 2100
atcctgctga atgtcactga attgtgttat ttgctaatta gatattgttc tgggaagtca 2160
aaaaagccag tttaaggcgc gccacccctg cagggaattc cgcattgccc agttgttaga 2220
ttaagaaata gacagcatga gagggatgag gcaacccgtg ctcagctgtc aaggctcagt 2280
cgctagcatt tcccaacaca aagattctga ccttaaatgc aaccatttga aacccctgta 2340
ggcctcaggt gaaactccag atgccacaat ggagctctgc tcccctaaag cctcaaaaca 2400
aaggcctaat tctatgcctg tcttaatttt ctttcactta agttagttcc actgagaccc 2460
caggctgtta ggggttattg gtgtaaggta ctttcatatt ttaaacagag gatatcggca 2520
tttgtttctt tctctgagga caagagaaaa aagccaggtt ccacagagga cacagagaag 2580
gtttgggtgt cctcctgggg ttctttttgc caactttccc cacgttaaag gtgaacattg 2640
gttctttcat ttgctttgga agttttaatc tctaacagtg gacaaagtta ccagtgcctt 2700
aaactctgtt acactttttg gaagtgaaaa ctttgtagta tgataggtta ttttgatgta 2760
aagatgttct ggataccatt atatgttccc cctgtttcag aggctcagat tgtaatatgt 2820
aaatggtatg tcattcgcta ctatgattta atttgaaata tggtcttttg gttatgaata 2880
ctttgcagca cagctgagag gctgtctgtt gtattcattg tggtcatagc acctaacaac 2940
attgtagcct caatcgagtg agacagacta gaagttccta gtgatggctt atgatagcaa 3000
atggcctcat gtcaaatatt tagatgtaat tttgtgtaag aaatacagac tggatgtacc 3060
accaactact acctgtaatg acaggcctgt ccaacacatc tcccttttcc atgactgtgg 3120
tagccagcat cggaaagaac gctgatttaa agaggtcgct tgggaatttt attgacacag 3180
taccatttaa tggggaggac aaaatggggc aggggaggga gaagtttctg tcgttaaaaa 3240
cagatttgga aagactggac tctaaagtct gttgattaaa gatgagcttt gtctacttca 3300
aaagtttgtt tgcttacccc ttcagcctcc aattttttaa gtgaaaatat agctaataac 3360
atgtgaaaag aatagaagct aaggtttaga taaatattga gcagatctat aggaagattg 3420
aacctgaata ttgccattat gcttgacatg gtttccaaaa aatggtactc cacatatttc 3480
agtgagggta agtattttcc tgttgtcaag aatagcattg taaaagcatt ttgtaataat 3540
aaagaatagc tttaatgata tgcttgtaac taaaataatt ttgtaatgta tcaaatacat 3600
ttaaaacatt aaaatataat ctctataata atttaaaatc taatatggtt ttaatagaac 3660
agcgatatca agcttatcga taatcaacct ctggattaca aaatttgtga aagattgact 3720
ggtattctta actatgttgc tccttttacg ctatgtggat acgctgcttt aatgcctttg 3780
tatcatgcta ttgcttcccg tatggctttc attttctcct ccttgtataa atcctggttg 3840
ctgtctcttt atgaggagtt gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg 3900
tttgctgacg caacccccac tggttggggc attgccacca cctgtcagct cctttccggg 3960
actttcgctt tccccctccc tattgccacg gcggaactca tcgccgcctg ccttgcccgc 4020
tgctggacag gggctcggct gttgggcact gacaattccg tggtgttgtc ggggaaatca 4080
tcgtcctttc cttggctgct cgcctatgtt gccacctgga ttctgcgcgg gacgtccttc 4140
tgctacgtcc cttcggccct caatccagcg gaccttcctt cccgcggcct gctgccggct 4200
ctgcggcctc ttccgcgtct tcgccttcgc cctcagacga gtcggatctc cctttgggcc 4260
gcctccccgc gaattcatcg ataccgagcg ctgctcgaga gatctgtgat agcggccatc 4320
aagctggctg tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc 4380
ttgaccctgg aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg 4440
cattgtctga gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg 4500
gaggattggg aagacaatag caggcatgct ggggacacgt gcggaccgag cggccgcagg 4560
aacccctagt gatggagttg gccactccct ctctgcgcgc tcgctcgctc actgaggccg 4620
ggcgaccaaa ggtcgcccga cgcccgggct ttgcccgggc ggcctcagtg agcgagcgag 4680
cgcgcagctg cctgcagggg cgcctgatgc ggtattttct ccttacgcat ctgtgcggta 4740
tttcacaccg catacgtcaa agcaaccata gtacgcgccc tgtagcggcg cattaagcgc 4800
ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc 4860
tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct 4920
aaatcggggg ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa 4980
acttgatttg ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg tttttcgccc 5040
tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact 5100
caaccctatc tcgggctatt cttttgattt ataagggatt ttgccgattt cggcctattg 5160
gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt 5220
tacaatttta tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagcc 5280
ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc 5340
ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc 5400
accgaaacgc gcgagacgaa agggcctcgt gatacgccta tttttatagg ttaatgtcat 5460
gataataatg gtttcttaga cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc 5520
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 5580
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 5640
ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 5700
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 5760
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 5820
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 5880
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 5940
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 6000
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 6060
tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 6120
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 6180
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 6240
ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 6300
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattg 6347
<210> 64
<211> 7150
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 64
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taagagagca cttgggaaga 1320
gcccccgagg gcagccgggg cttgccgcct cacccttttg gtttcacatc ccagaaatca 1380
gtaaggcagg aattggaggc tgcttcttgc cttagcaact cggtgacctt aggcagaaca 1440
gttcagcctt ctgagtgtcc ttcctcttct gtaaggggag cgtaaaccgt cctccatgca 1500
gaacgtgtac tgtgcctggc acagcactgg ggcattagga tctccaaatt aaaggctcac 1560
tctgcgggat ggaggcagcc acagctggaa gaaggaacat ttggggccag aagtccccct 1620
acctccgtcc taagagagaa gatgggaata acgaccctcg ctgaaatgat tgctctctgg 1680
ccagctcgcc tcgcatccac atccaaatct gggaggcaca gagcgcatca ggacatcggg 1740
ttctgtcagt gtaatgggcg tggctcctga ccttctgtct gtatcagaga agataaggga 1800
gaacatttga aagaaaggag aaagaagata gccactggag aacagagcaa aggagccagc 1860
agaaaaagac gagacggctg tagccccaca ggaagcagaa accgataggc taagtaggat 1920
acacacaaag aaaagtagat cccgagaggc atttccccga gggctttcat gtggtttctc 1980
gtgaggagaa gctgactgca gggtgtttga aagaacgact tatgcagcca taaaaaatga 2040
tgagttcatg tcctttgtag ggacatggat gattaattaa gacctcgaag gggacttggg 2100
gggttcgggg ctttcggggg cggtcggggg ttcgcggacc cgggaagctc tgaggaccca 2160
gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt aactttccca gtctccgagg 2220
gaagaggcgg ggtgtggggt gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc 2280
cccgcagcgc ccgcgcgctc ctctccccga ctcggagccc ctcggcggcg cccggcccag 2340
gacccgccta ggagcgcagg agccccagcg cagagacccc aacgccgaga cccccgcccc 2400
ggccccgccg cgcttcctcc cgacgcagag caaaccgccc agagtagaag ccatggtgag 2460
caagggcgag gagctgttca ccggggtggt gcccatcctg gtcgagctgg acggcgacgt 2520
aaacggccac aagttcagcg tgtccggcga gggcgagggc gatgccacct acggcaagct 2580
gaccctgaag ttcatctgca ccaccggcaa gctgcccgtg ccctggccca ccctcgtgac 2640
caccctgacc tacggcgtgc agtgcttcag ccgctacccc gaccacatga agcagcacga 2700
cttcttcaag tccgccatgc ccgaaggcta cgtccaggag cgcaccatct tcttcaagga 2760
cgacggcaac tacaagaccc gcgccgaggt gaagttcgag ggcgacaccc tggtgaaccg 2820
catcgagctg aagggcatcg acttcaagga ggacggcaac atcctggggc acaagctgga 2880
gtacaactac aacagccaca acgtctatat catggccgac aagcagaaga acggcatcaa 2940
ggtgaacttc aagatccgcc acaacatcga ggacggcagc gtgcagctcg ccgaccacta 3000
ccagcagaac acccccatcg gcgacggccc cgtgctgctg cccgacaacc actacctgag 3060
cacccagtcc gccctgagca aagaccccaa cgagaagcgc gatcacatgg tcctgctgga 3120
gttcgtgacc gccgccggga tcactctcgg catggacgag ctgtacaagt aataaaggcg 3180
cgccacccct gcagggaatt ccgcattgcc cagttgttag attaagaaat agacagcatg 3240
agagggatga ggcaacccgt gctcagctgt caaggctcag tcgctagcat ttcccaacac 3300
aaagattctg accttaaatg caaccatttg aaacccctgt aggcctcagg tgaaactcca 3360
gatgccacaa tggagctctg ctcccctaaa gcctcaaaac aaaggcctaa ttctatgcct 3420
gtcttaattt tctttcactt aagttagttc cactgagacc ccaggctgtt aggggttatt 3480
ggtgtaaggt actttcatat tttaaacaga ggatatcggc atttgtttct ttctctgagg 3540
acaagagaaa aaagccaggt tccacagagg acacagagaa ggtttgggtg tcctcctggg 3600
gttctttttg ccaactttcc ccacgttaaa ggtgaacatt ggttctttca tttgctttgg 3660
aagttttaat ctctaacagt ggacaaagtt accagtgcct taaactctgt tacacttttt 3720
ggaagtgaaa actttgtagt atgataggtt attttgatgt aaagatgttc tggataccat 3780
tatatgttcc ccctgtttca gaggctcaga ttgtaatatg taaatggtat gtcattcgct 3840
actatgattt aatttgaaat atggtctttt ggttatgaat actttgcagc acagctgaga 3900
ggctgtctgt tgtattcatt gtggtcatag cacctaacaa cattgtagcc tcaatcgagt 3960
gagacagact agaagttcct agtgatggct tatgatagca aatggcctca tgtcaaatat 4020
ttagatgtaa ttttgtgtaa gaaatacaga ctggatgtac caccaactac tacctgtaat 4080
gacaggcctg tccaacacat ctcccttttc catgactgtg gtagccagca tcggaaagaa 4140
cgctgattta aagaggtcgc ttgggaattt tattgacaca gtaccattta atggggagga 4200
caaaatgggg caggggaggg agaagtttct gtcgttaaaa acagatttgg aaagactgga 4260
ctctaaagtc tgttgattaa agatgagctt tgtctacttc aaaagtttgt ttgcttaccc 4320
cttcagcctc caatttttta agtgaaaata tagctaataa catgtgaaaa gaatagaagc 4380
taaggtttag ataaatattg agcagatcta taggaagatt gaacctgaat attgccatta 4440
tgcttgacat ggtttccaaa aaatggtact ccacatattt cagtgagggt aagtattttc 4500
ctgttgtcaa gaatagcatt gtaaaagcat tttgtaataa taaagaatag ctttaatgat 4560
atgcttgtaa ctaaaataat tttgtaatgt atcaaataca tttaaaacat taaaatataa 4620
tctctataat aatttaaaat ctaatatggt tttaatagaa cagcgatatc aagcttatcg 4680
ataatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt aactatgttg 4740
ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct attgcttccc 4800
gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt tatgaggagt 4860
tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca 4920
ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct ttccccctcc 4980
ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca ggggctcggc 5040
tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt ccttggctgc 5100
tcgcctatgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc 5160
tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc 5220
ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg cgaattcatc 5280
gataccgagc gctgctcgag agatctgtga tagcggccat caagctggct gtgccttcta 5340
gttgccagcc atctgttgtt tgcccctccc ccgtgccttc cttgaccctg gaaggtgcca 5400
ctcccactgt cctttcctaa taaaatgagg aaattgcatc gcattgtctg agtaggtgtc 5460
attctattct ggggggtggg gtggggcagg acagcaaggg ggaggattgg gaagacaata 5520
gcaggcatgc tggggacacg tgcggaccga gcggccgcag gaacccctag tgatggagtt 5580
ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcgaccaa aggtcgcccg 5640
acgcccgggc tttgcccggg cggcctcagt gagcgagcga gcgcgcagct gcctgcaggg 5700
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatacgtca 5760
aagcaaccat agtacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg 5820
cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct 5880
tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg gctcccttta 5940
gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgattt gggtgatggt 6000
tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg 6060
ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat ctcgggctat 6120
tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt 6180
taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt ttacaatttt atggtgcact 6240
ctcagtacaa tctgctctga tgccgcatag ttaagccagc cccgacaccc gccaacaccc 6300
gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg cttacagaca agctgtgacc 6360
gtctccggga gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg cgcgagacga 6420
aagggcctcg tgatacgcct atttttatag gttaatgtca tgataataat ggtttcttag 6480
acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa 6540
atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat 6600
tgaaaaagga agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg 6660
gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa 6720
gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt 6780
gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt 6840
ggcgcggtat tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat 6900
tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg 6960
acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta 7020
cttctgacaa cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat 7080
catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag 7140
cgtgacacca 7150
<210> 65
<211> 7108
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 65
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taagagagca cttgggaaga 1320
gcccccgagg gcagccgggg cttgccgcct cacccttttg gtttcacatc ccagaaatca 1380
gtaaggcagg aattggaggc tgcttcttgc cttagcaact cggtgacctt aggcagaaca 1440
gttcagcctt ctgagtgtcc ttcctcttct gtaaggggag cgtaaaccgt cctccatgca 1500
gaacgtgtac tgtgcctggc acagcactgg ggcattagga tctccaaatt aaaggctcac 1560
tctgcgggat ggaggcagcc acagctggaa gaaggaacat ttggggccag aagtccccct 1620
acctccgtcc taagagagaa gatgggaata acgaccctcg ctgaaatgat tgctctctgg 1680
ccagctcgcc tcgcatccac atccaaatct gggaggcaca gagcgcatca ggacatcggg 1740
ttctgtcagt gtaatgggcg tggctcctga ccttctgtct gtatcagaga agataaggga 1800
gaacatttga aagaaaggag aaagaagata gccactggag aacagagcaa aggagccagc 1860
agaaaaagac gagacggctg tagccccaca ggaagcagaa accgataggc taagtaggat 1920
acacacaaag aaaagtagat cccgagaggc atttccccga gggctttcat gtggtttctc 1980
gtgaggagaa gctgactgca gggtgtttga aagaacgact tatgcagcca taaaaaatga 2040
tgagttcatg tcctttgtag ggacatggat gattaattaa gacctcgaag gggacttggg 2100
gggttcgggg ctttcggggg cggtcggggg ttcgcggacc cgggaagctc tgaggaccca 2160
gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt aactttccca gtctccgagg 2220
gaagaggcgg ggtgtggggt gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc 2280
cccgcagcgc ccgcgcgctc ctctccccga ctcggagccc ctcggcggcg cccggcccag 2340
gacccgccta ggagcgcagg agccccagcg cagagacccc aacgccgaga cccccgcccc 2400
ggccccgccg cgcttcctcc cgacgcagag caaaccgccc agagtagaag ccatggattg 2460
gggcacgctg cagacgatcc tggggggtgt gaacaaacac tccaccagca ttggaaagat 2520
ctggctcacc gtcctcttca tttttcgcat tatgatcctc gttgtggctg caaaggaggt 2580
gtggggagat gagcaggccg actttgtctg caacaccctg cagccaggct gcaagaacgt 2640
gtgctacgat cactacttcc ccatctccca catccggcta tgggccctgc agctgatctt 2700
cgtgtccacg ccagcgctcc tagtggccat gcacgtggcc taccggagac atgagaagaa 2760
gaggaagttc atcaaggggg agataaagag tgaatttaag gacatcgagg agatcaaaac 2820
ccagaaggtc cgcatcgaag gctccctgtg gtggacctac acaagcagca tcttcttccg 2880
ggtcatcttc gaagccgcct tcatgtacgt cttctatgtc atgtacgacg gcttctccat 2940
gcagcggctg gtgaagtgca acgcctggcc ttgtcccaac actgtggact gctttgtgtc 3000
ccggcccacg gagaagactg tcttcacagt gttcatgatt gcagtgtctg gaatttgcat 3060
cctgctgaat gtcactgaat tgtgttattt gctaattaga tattgttctg ggaagtcaaa 3120
aaagccagtt taaaggcgcg ccacccctgc agggaattcc gcattgccca gttgttagat 3180
taagaaatag acagcatgag agggatgagg caacccgtgc tcagctgtca aggctcagtc 3240
gctagcattt cccaacacaa agattctgac cttaaatgca accatttgaa acccctgtag 3300
gcctcaggtg aaactccaga tgccacaatg gagctctgct cccctaaagc ctcaaaacaa 3360
aggcctaatt ctatgcctgt cttaattttc tttcacttaa gttagttcca ctgagacccc 3420
aggctgttag gggttattgg tgtaaggtac tttcatattt taaacagagg atatcggcat 3480
ttgtttcttt ctctgaggac aagagaaaaa agccaggttc cacagaggac acagagaagg 3540
tttgggtgtc ctcctggggt tctttttgcc aactttcccc acgttaaagg tgaacattgg 3600
ttctttcatt tgctttggaa gttttaatct ctaacagtgg acaaagttac cagtgcctta 3660
aactctgtta cactttttgg aagtgaaaac tttgtagtat gataggttat tttgatgtaa 3720
agatgttctg gataccatta tatgttcccc ctgtttcaga ggctcagatt gtaatatgta 3780
aatggtatgt cattcgctac tatgatttaa tttgaaatat ggtcttttgg ttatgaatac 3840
tttgcagcac agctgagagg ctgtctgttg tattcattgt ggtcatagca cctaacaaca 3900
ttgtagcctc aatcgagtga gacagactag aagttcctag tgatggctta tgatagcaaa 3960
tggcctcatg tcaaatattt agatgtaatt ttgtgtaaga aatacagact ggatgtacca 4020
ccaactacta cctgtaatga caggcctgtc caacacatct cccttttcca tgactgtggt 4080
agccagcatc ggaaagaacg ctgatttaaa gaggtcgctt gggaatttta ttgacacagt 4140
accatttaat ggggaggaca aaatggggca ggggagggag aagtttctgt cgttaaaaac 4200
agatttggaa agactggact ctaaagtctg ttgattaaag atgagctttg tctacttcaa 4260
aagtttgttt gcttacccct tcagcctcca attttttaag tgaaaatata gctaataaca 4320
tgtgaaaaga atagaagcta aggtttagat aaatattgag cagatctata ggaagattga 4380
acctgaatat tgccattatg cttgacatgg tttccaaaaa atggtactcc acatatttca 4440
gtgagggtaa gtattttcct gttgtcaaga atagcattgt aaaagcattt tgtaataata 4500
aagaatagct ttaatgatat gcttgtaact aaaataattt tgtaatgtat caaatacatt 4560
taaaacatta aaatataatc tctataataa tttaaaatct aatatggttt taatagaaca 4620
gcgatatcaa gcttatcgat aatcaacctc tggattacaa aatttgtgaa agattgactg 4680
gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta atgcctttgt 4740
atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa tcctggttgc 4800
tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt 4860
ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc ctttccggga 4920
ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc cttgcccgct 4980
gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg gggaaatcat 5040
cgtcctttcc ttggctgctc gcctatgttg ccacctggat tctgcgcggg acgtccttct 5100
gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg ctgccggctc 5160
tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc ctttgggccg 5220
cctccccgcg aattcatcga taccgagcgc tgctcgagag atctgtgata gcggccatca 5280
agctggctgt gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct 5340
tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc 5400
attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg 5460
aggattggga agacaatagc aggcatgctg gggacacgtg cggaccgagc ggccgcagga 5520
acccctagtg atggagttgg ccactccctc tctgcgcgct cgctcgctca ctgaggccgg 5580
gcgaccaaag gtcgcccgac gcccgggctt tgcccgggcg gcctcagtga gcgagcgagc 5640
gcgcagctgc ctgcaggggc gcctgatgcg gtattttctc cttacgcatc tgtgcggtat 5700
ttcacaccgc atacgtcaaa gcaaccatag tacgcgccct gtagcggcgc attaagcgcg 5760
gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct 5820
cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta 5880
aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa 5940
cttgatttgg gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct 6000
ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc 6060
aaccctatct cgggctattc ttttgattta taagggattt tgccgatttc ggcctattgg 6120
ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgttt 6180
acaattttat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagccc 6240
cgacacccgc caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct 6300
tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca 6360
ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat ttttataggt taatgtcatg 6420
ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg cggaacccct 6480
atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga 6540
taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc 6600
cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg 6660
aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc 6720
aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact 6780
tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc 6840
ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag 6900
catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat 6960
aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt 7020
ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa 7080
gccataccaa acgacgagcg tgacacca 7108
<210> 66
<211> 7135
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 66
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taagagagca cttgggaaga 1320
gcccccgagg gcagccgggg cttgccgcct cacccttttg gtttcacatc ccagaaatca 1380
gtaaggcagg aattggaggc tgcttcttgc cttagcaact cggtgacctt aggcagaaca 1440
gttcagcctt ctgagtgtcc ttcctcttct gtaaggggag cgtaaaccgt cctccatgca 1500
gaacgtgtac tgtgcctggc acagcactgg ggcattagga tctccaaatt aaaggctcac 1560
tctgcgggat ggaggcagcc acagctggaa gaaggaacat ttggggccag aagtccccct 1620
acctccgtcc taagagagaa gatgggaata acgaccctcg ctgaaatgat tgctctctgg 1680
ccagctcgcc tcgcatccac atccaaatct gggaggcaca gagcgcatca ggacatcggg 1740
ttctgtcagt gtaatgggcg tggctcctga ccttctgtct gtatcagaga agataaggga 1800
gaacatttga aagaaaggag aaagaagata gccactggag aacagagcaa aggagccagc 1860
agaaaaagac gagacggctg tagccccaca ggaagcagaa accgataggc taagtaggat 1920
acacacaaag aaaagtagat cccgagaggc atttccccga gggctttcat gtggtttctc 1980
gtgaggagaa gctgactgca gggtgtttga aagaacgact tatgcagcca taaaaaatga 2040
tgagttcatg tcctttgtag ggacatggat gattaattaa gacctcgaag gggacttggg 2100
gggttcgggg ctttcggggg cggtcggggg ttcgcggacc cgggaagctc tgaggaccca 2160
gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt aactttccca gtctccgagg 2220
gaagaggcgg ggtgtggggt gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc 2280
cccgcagcgc ccgcgcgctc ctctccccga ctcggagccc ctcggcggcg cccggcccag 2340
gacccgccta ggagcgcagg agccccagcg cagagacccc aacgccgaga cccccgcccc 2400
ggccccgccg cgcttcctcc cgacgcagag caaaccgccc agagtagaag ccatggattg 2460
gggcacactc cagagcatcc tcgggggtgt caacaaacac tccaccagca ttggaaagat 2520
ctggctcacg gtcctcttca tcttccgcat catgatcctc gtggtggctg caaaggaggt 2580
gtggggagat gagcaagccg attttgtctg caacacgctc cagcctggct gcaagaatgt 2640
atgctacgac caccacttcc ccatctctca catccggctc tgggctctgc agctgatcat 2700
ggtgtccacg ccagccctcc tggtagctat gcatgtggcc taccggagac atgaaaagaa 2760
acggaagttc atgaagggag agataaagaa cgagtttaag gacatcgaag agatcaaaac 2820
ccagaaggtc cgtatcgaag ggtccctgtg gtggacctac accaccagca tcttcttccg 2880
ggtcatcttt gaagccgtct tcatgtacgt cttttacatc atgtacaatg gcttcttcat 2940
gcaacgtctg gtgaaatgca acgcttggcc ctgccccaat acagtggact gcttcatttc 3000
caggcccaca gaaaagactg tcttcaccgt gtttatgatt tctgtgtctg gaatttgcat 3060
tctgctaaat atcacagagc tgtgctattt gttcgttagg tattgctcag gaaagtccaa 3120
aagaccagtc tacccatacg atgttccaga ttacgcttaa aggcgcgcca cccctgcagg 3180
gaattccgca ttgcccagtt gttagattaa gaaatagaca gcatgagagg gatgaggcaa 3240
cccgtgctca gctgtcaagg ctcagtcgct agcatttccc aacacaaaga ttctgacctt 3300
aaatgcaacc atttgaaacc cctgtaggcc tcaggtgaaa ctccagatgc cacaatggag 3360
ctctgctccc ctaaagcctc aaaacaaagg cctaattcta tgcctgtctt aattttcttt 3420
cacttaagtt agttccactg agaccccagg ctgttagggg ttattggtgt aaggtacttt 3480
catattttaa acagaggata tcggcatttg tttctttctc tgaggacaag agaaaaaagc 3540
caggttccac agaggacaca gagaaggttt gggtgtcctc ctggggttct ttttgccaac 3600
tttccccacg ttaaaggtga acattggttc tttcatttgc tttggaagtt ttaatctcta 3660
acagtggaca aagttaccag tgccttaaac tctgttacac tttttggaag tgaaaacttt 3720
gtagtatgat aggttatttt gatgtaaaga tgttctggat accattatat gttccccctg 3780
tttcagaggc tcagattgta atatgtaaat ggtatgtcat tcgctactat gatttaattt 3840
gaaatatggt cttttggtta tgaatacttt gcagcacagc tgagaggctg tctgttgtat 3900
tcattgtggt catagcacct aacaacattg tagcctcaat cgagtgagac agactagaag 3960
ttcctagtga tggcttatga tagcaaatgg cctcatgtca aatatttaga tgtaattttg 4020
tgtaagaaat acagactgga tgtaccacca actactacct gtaatgacag gcctgtccaa 4080
cacatctccc ttttccatga ctgtggtagc cagcatcgga aagaacgctg atttaaagag 4140
gtcgcttggg aattttattg acacagtacc atttaatggg gaggacaaaa tggggcaggg 4200
gagggagaag tttctgtcgt taaaaacaga tttggaaaga ctggactcta aagtctgttg 4260
attaaagatg agctttgtct acttcaaaag tttgtttgct taccccttca gcctccaatt 4320
ttttaagtga aaatatagct aataacatgt gaaaagaata gaagctaagg tttagataaa 4380
tattgagcag atctatagga agattgaacc tgaatattgc cattatgctt gacatggttt 4440
ccaaaaaatg gtactccaca tatttcagtg agggtaagta ttttcctgtt gtcaagaata 4500
gcattgtaaa agcattttgt aataataaag aatagcttta atgatatgct tgtaactaaa 4560
ataattttgt aatgtatcaa atacatttaa aacattaaaa tataatctct ataataattt 4620
aaaatctaat atggttttaa tagaacagcg atatcaagct tatcgataat caacctctgg 4680
attacaaaat ttgtgaaaga ttgactggta ttcttaacta tgttgctcct tttacgctat 4740
gtggatacgc tgctttaatg cctttgtatc atgctattgc ttcccgtatg gctttcattt 4800
tctcctcctt gtataaatcc tggttgctgt ctctttatga ggagttgtgg cccgttgtca 4860
ggcaacgtgg cgtggtgtgc actgtgtttg ctgacgcaac ccccactggt tggggcattg 4920
ccaccacctg tcagctcctt tccgggactt tcgctttccc cctccctatt gccacggcgg 4980
aactcatcgc cgcctgcctt gcccgctgct ggacaggggc tcggctgttg ggcactgaca 5040
attccgtggt gttgtcgggg aaatcatcgt cctttccttg gctgctcgcc tatgttgcca 5100
cctggattct gcgcgggacg tccttctgct acgtcccttc ggccctcaat ccagcggacc 5160
ttccttcccg cggcctgctg ccggctctgc ggcctcttcc gcgtcttcgc cttcgccctc 5220
agacgagtcg gatctccctt tgggccgcct ccccgcgaat tcatcgatac cgagcgctgc 5280
tcgagagatc tgtgatagcg gccatcaagc tggctgtgcc ttctagttgc cagccatctg 5340
ttgtttgccc ctcccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt 5400
cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg 5460
gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg catgctgggg 5520
acacgtgcgg accgagcggc cgcaggaacc cctagtgatg gagttggcca ctccctctct 5580
gcgcgctcgc tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc 5640
ccgggcggcc tcagtgagcg agcgagcgcg cagctgcctg caggggcgcc tgatgcggta 5700
ttttctcctt acgcatctgt gcggtatttc acaccgcata cgtcaaagca accatagtac 5760
gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct 5820
acacttgcca gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg 5880
ttcgccggct ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt 5940
gctttacggc acctcgaccc caaaaaactt gatttgggtg atggttcacg tagtgggcca 6000
tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga 6060
ctcttgttcc aaactggaac aacactcaac cctatctcgg gctattcttt tgatttataa 6120
gggattttgc cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac 6180
gcgaatttta acaaaatatt aacgtttaca attttatggt gcactctcag tacaatctgc 6240
tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga cgcgccctga 6300
cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc cgggagctgc 6360
atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg cctcgtgata 6420
cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact 6480
tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg 6540
tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt 6600
atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct 6660
gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca 6720
cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc 6780
gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc 6840
cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg 6900
gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta 6960
tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc 7020
ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt 7080
gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga cacca 7135
<210> 67
<211> 7124
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 67
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tcagtgatgc ctgaaacctc 1320
agatggtact gaaccctcta tataatctgt tttttcctat acatacaaac ctaccataag 1380
gcttaatggt aagagattaa caataaagaa taataaaaca acacttataa caatgtataa 1440
caatatattg taatataagt ttttggatgc agtctctctc tcaaaatgct atcatatttt 1500
ccaactgtgg ttgactacag gtaactggaa ccacaaaaat gaaacagtgg ataagagggc 1560
gactcctgta ccaaagaaaa aaatagagtg ttgcagctgt aacatagttg aatgactgag 1620
ttagactgca taactgacac acaaaaccac ataaatataa atgaaggaat ctctgggtgt 1680
aatctggtgc aaaggtgact gtgttaatca ttaatccaca agttgctatc ctgaagtgtg 1740
ccaaatgctt tatgtttatt tcatcacata gctctataaa gaaaggattt gtaattcctt 1800
tctacagaag tggaaagtaa gtcttaagac tcaaaaaact ttaaaaacta caatgaagta 1860
acaactttta ttaatttatt ttgtgtcttt ccagaatttc tatatatata ggaatgtgat 1920
atgaatctat atgtgaattg aatctacatg aatattgatg acttttattt ccccttttgc 1980
acataagata gaatatttta cctactattc cacactttgc ttttcttaac atatcatggg 2040
atctttttat ataagtgaac aaagagtttc ttcattcttt cacacagttt aattaagacc 2100
tcgaagggga cttggggggt tcggggcttt cgggggcggt cgggggttcg cggacccggg 2160
aagctctgag gacccagagg ccgggcgcgc tccgcccgcg gcgccgcccc ctccgtaact 2220
ttcccagtct ccgagggaag aggcggggtg tggggtgcgg ttaaaaggcg ccacggcggg 2280
agacaggtgt tgcggccccg cagcgcccgc gcgctcctct ccccgactcg gagcccctcg 2340
gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc ccagcgcaga gaccccaacg 2400
ccgagacccc cgccccggcc ccgccgcgct tcctcccgac gcagagcaaa ccgcccagag 2460
tagaagccat ggattggggc acgctgcaga cgatcctggg gggtgtgaac aaacactcca 2520
ccagcattgg aaagatctgg ctcaccgtcc tcttcatttt tcgcattatg atcctcgttg 2580
tggctgcaaa ggaggtgtgg ggagatgagc aggccgactt tgtctgcaac accctgcagc 2640
caggctgcaa gaacgtgtgc tacgatcact acttccccat ctcccacatc cggctatggg 2700
ccctgcagct gatcttcgtg tccacgccag cgctcctagt ggccatgcac gtggcctacc 2760
ggagacatga gaagaagagg aagttcatca agggggagat aaagagtgaa tttaaggaca 2820
tcgaggagat caaaacccag aaggtccgca tcgaaggctc cctgtggtgg acctacacaa 2880
gcagcatctt cttccgggtc atcttcgaag ccgccttcat gtacgtcttc tatgtcatgt 2940
acgacggctt ctccatgcag cggctggtga agtgcaacgc ctggccttgt cccaacactg 3000
tggactgctt tgtgtcccgg cccacggaga agactgtctt cacagtgttc atgattgcag 3060
tgtctggaat ttgcatcctg ctgaatgtca ctgaattgtg ttatttgcta attagatatt 3120
gttctgggaa gtcaaaaaag ccagtttaaa ggcgcgccac ccctgcaggg aattccgcat 3180
tgcccagttg ttagattaag aaatagacag catgagaggg atgaggcaac ccgtgctcag 3240
ctgtcaaggc tcagtcgcta gcatttccca acacaaagat tctgacctta aatgcaacca 3300
tttgaaaccc ctgtaggcct caggtgaaac tccagatgcc acaatggagc tctgctcccc 3360
taaagcctca aaacaaaggc ctaattctat gcctgtctta attttctttc acttaagtta 3420
gttccactga gaccccaggc tgttaggggt tattggtgta aggtactttc atattttaaa 3480
cagaggatat cggcatttgt ttctttctct gaggacaaga gaaaaaagcc aggttccaca 3540
gaggacacag agaaggtttg ggtgtcctcc tggggttctt tttgccaact ttccccacgt 3600
taaaggtgaa cattggttct ttcatttgct ttggaagttt taatctctaa cagtggacaa 3660
agttaccagt gccttaaact ctgttacact ttttggaagt gaaaactttg tagtatgata 3720
ggttattttg atgtaaagat gttctggata ccattatatg ttccccctgt ttcagaggct 3780
cagattgtaa tatgtaaatg gtatgtcatt cgctactatg atttaatttg aaatatggtc 3840
ttttggttat gaatactttg cagcacagct gagaggctgt ctgttgtatt cattgtggtc 3900
atagcaccta acaacattgt agcctcaatc gagtgagaca gactagaagt tcctagtgat 3960
ggcttatgat agcaaatggc ctcatgtcaa atatttagat gtaattttgt gtaagaaata 4020
cagactggat gtaccaccaa ctactacctg taatgacagg cctgtccaac acatctccct 4080
tttccatgac tgtggtagcc agcatcggaa agaacgctga tttaaagagg tcgcttggga 4140
attttattga cacagtacca tttaatgggg aggacaaaat ggggcagggg agggagaagt 4200
ttctgtcgtt aaaaacagat ttggaaagac tggactctaa agtctgttga ttaaagatga 4260
gctttgtcta cttcaaaagt ttgtttgctt accccttcag cctccaattt tttaagtgaa 4320
aatatagcta ataacatgtg aaaagaatag aagctaaggt ttagataaat attgagcaga 4380
tctataggaa gattgaacct gaatattgcc attatgcttg acatggtttc caaaaaatgg 4440
tactccacat atttcagtga gggtaagtat tttcctgttg tcaagaatag cattgtaaaa 4500
gcattttgta ataataaaga atagctttaa tgatatgctt gtaactaaaa taattttgta 4560
atgtatcaaa tacatttaaa acattaaaat ataatctcta taataattta aaatctaata 4620
tggttttaat agaacagcga tatcaagctt atcgataatc aacctctgga ttacaaaatt 4680
tgtgaaagat tgactggtat tcttaactat gttgctcctt ttacgctatg tggatacgct 4740
gctttaatgc ctttgtatca tgctattgct tcccgtatgg ctttcatttt ctcctccttg 4800
tataaatcct ggttgctgtc tctttatgag gagttgtggc ccgttgtcag gcaacgtggc 4860
gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc caccacctgt 4920
cagctccttt ccgggacttt cgctttcccc ctccctattg ccacggcgga actcatcgcc 4980
gcctgccttg cccgctgctg gacaggggct cggctgttgg gcactgacaa ttccgtggtg 5040
ttgtcgggga aatcatcgtc ctttccttgg ctgctcgcct atgttgccac ctggattctg 5100
cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc cagcggacct tccttcccgc 5160
ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca gacgagtcgg 5220
atctcccttt gggccgcctc cccgcgaatt catcgatacc gagcgctgct cgagagatct 5280
gtgatagcgg ccatcaagct ggctgtgcct tctagttgcc agccatctgt tgtttgcccc 5340
tcccccgtgc cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat 5400
gaggaaattg catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg 5460
caggacagca agggggagga ttgggaagac aatagcaggc atgctgggga cacgtgcgga 5520
ccgagcggcc gcaggaaccc ctagtgatgg agttggccac tccctctctg cgcgctcgct 5580
cgctcactga ggccgggcga ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct 5640
cagtgagcga gcgagcgcgc agctgcctgc aggggcgcct gatgcggtat tttctcctta 5700
cgcatctgtg cggtatttca caccgcatac gtcaaagcaa ccatagtacg cgccctgtag 5760
cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag 5820
cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt tcgccggctt 5880
tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg ctttacggca 5940
cctcgacccc aaaaaacttg atttgggtga tggttcacgt agtgggccat cgccctgata 6000
gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac tcttgttcca 6060
aactggaaca acactcaacc ctatctcggg ctattctttt gatttataag ggattttgcc 6120
gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg cgaattttaa 6180
caaaatatta acgtttacaa ttttatggtg cactctcagt acaatctgct ctgatgccgc 6240
atagttaagc cagccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 6300
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 6360
gttttcaccg tcatcaccga aacgcgcgag acgaaagggc ctcgtgatac gcctattttt 6420
ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt ttcggggaaa 6480
tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat 6540
gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca 6600
acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca 6660
cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta 6720
catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt 6780
tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc gtattgacgc 6840
cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc 6900
accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc 6960
cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa 7020
ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga 7080
accggagctg aatgaagcca taccaaacga cgagcgtgac acca 7124
<210> 68
<211> 7151
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 68
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tcagtgatgc ctgaaacctc 1320
agatggtact gaaccctcta tataatctgt tttttcctat acatacaaac ctaccataag 1380
gcttaatggt aagagattaa caataaagaa taataaaaca acacttataa caatgtataa 1440
caatatattg taatataagt ttttggatgc agtctctctc tcaaaatgct atcatatttt 1500
ccaactgtgg ttgactacag gtaactggaa ccacaaaaat gaaacagtgg ataagagggc 1560
gactcctgta ccaaagaaaa aaatagagtg ttgcagctgt aacatagttg aatgactgag 1620
ttagactgca taactgacac acaaaaccac ataaatataa atgaaggaat ctctgggtgt 1680
aatctggtgc aaaggtgact gtgttaatca ttaatccaca agttgctatc ctgaagtgtg 1740
ccaaatgctt tatgtttatt tcatcacata gctctataaa gaaaggattt gtaattcctt 1800
tctacagaag tggaaagtaa gtcttaagac tcaaaaaact ttaaaaacta caatgaagta 1860
acaactttta ttaatttatt ttgtgtcttt ccagaatttc tatatatata ggaatgtgat 1920
atgaatctat atgtgaattg aatctacatg aatattgatg acttttattt ccccttttgc 1980
acataagata gaatatttta cctactattc cacactttgc ttttcttaac atatcatggg 2040
atctttttat ataagtgaac aaagagtttc ttcattcttt cacacagttt aattaagacc 2100
tcgaagggga cttggggggt tcggggcttt cgggggcggt cgggggttcg cggacccggg 2160
aagctctgag gacccagagg ccgggcgcgc tccgcccgcg gcgccgcccc ctccgtaact 2220
ttcccagtct ccgagggaag aggcggggtg tggggtgcgg ttaaaaggcg ccacggcggg 2280
agacaggtgt tgcggccccg cagcgcccgc gcgctcctct ccccgactcg gagcccctcg 2340
gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc ccagcgcaga gaccccaacg 2400
ccgagacccc cgccccggcc ccgccgcgct tcctcccgac gcagagcaaa ccgcccagag 2460
tagaagccat ggattggggc acactccaga gcatcctcgg gggtgtcaac aaacactcca 2520
ccagcattgg aaagatctgg ctcacggtcc tcttcatctt ccgcatcatg atcctcgtgg 2580
tggctgcaaa ggaggtgtgg ggagatgagc aagccgattt tgtctgcaac acgctccagc 2640
ctggctgcaa gaatgtatgc tacgaccacc acttccccat ctctcacatc cggctctggg 2700
ctctgcagct gatcatggtg tccacgccag ccctcctggt agctatgcat gtggcctacc 2760
ggagacatga aaagaaacgg aagttcatga agggagagat aaagaacgag tttaaggaca 2820
tcgaagagat caaaacccag aaggtccgta tcgaagggtc cctgtggtgg acctacacca 2880
ccagcatctt cttccgggtc atctttgaag ccgtcttcat gtacgtcttt tacatcatgt 2940
acaatggctt cttcatgcaa cgtctggtga aatgcaacgc ttggccctgc cccaatacag 3000
tggactgctt catttccagg cccacagaaa agactgtctt caccgtgttt atgatttctg 3060
tgtctggaat ttgcattctg ctaaatatca cagagctgtg ctatttgttc gttaggtatt 3120
gctcaggaaa gtccaaaaga ccagtctacc catacgatgt tccagattac gcttaaaggc 3180
gcgccacccc tgcagggaat tccgcattgc ccagttgtta gattaagaaa tagacagcat 3240
gagagggatg aggcaacccg tgctcagctg tcaaggctca gtcgctagca tttcccaaca 3300
caaagattct gaccttaaat gcaaccattt gaaacccctg taggcctcag gtgaaactcc 3360
agatgccaca atggagctct gctcccctaa agcctcaaaa caaaggccta attctatgcc 3420
tgtcttaatt ttctttcact taagttagtt ccactgagac cccaggctgt taggggttat 3480
tggtgtaagg tactttcata ttttaaacag aggatatcgg catttgtttc tttctctgag 3540
gacaagagaa aaaagccagg ttccacagag gacacagaga aggtttgggt gtcctcctgg 3600
ggttcttttt gccaactttc cccacgttaa aggtgaacat tggttctttc atttgctttg 3660
gaagttttaa tctctaacag tggacaaagt taccagtgcc ttaaactctg ttacactttt 3720
tggaagtgaa aactttgtag tatgataggt tattttgatg taaagatgtt ctggatacca 3780
ttatatgttc cccctgtttc agaggctcag attgtaatat gtaaatggta tgtcattcgc 3840
tactatgatt taatttgaaa tatggtcttt tggttatgaa tactttgcag cacagctgag 3900
aggctgtctg ttgtattcat tgtggtcata gcacctaaca acattgtagc ctcaatcgag 3960
tgagacagac tagaagttcc tagtgatggc ttatgatagc aaatggcctc atgtcaaata 4020
tttagatgta attttgtgta agaaatacag actggatgta ccaccaacta ctacctgtaa 4080
tgacaggcct gtccaacaca tctccctttt ccatgactgt ggtagccagc atcggaaaga 4140
acgctgattt aaagaggtcg cttgggaatt ttattgacac agtaccattt aatggggagg 4200
acaaaatggg gcaggggagg gagaagtttc tgtcgttaaa aacagatttg gaaagactgg 4260
actctaaagt ctgttgatta aagatgagct ttgtctactt caaaagtttg tttgcttacc 4320
ccttcagcct ccaatttttt aagtgaaaat atagctaata acatgtgaaa agaatagaag 4380
ctaaggttta gataaatatt gagcagatct ataggaagat tgaacctgaa tattgccatt 4440
atgcttgaca tggtttccaa aaaatggtac tccacatatt tcagtgaggg taagtatttt 4500
cctgttgtca agaatagcat tgtaaaagca ttttgtaata ataaagaata gctttaatga 4560
tatgcttgta actaaaataa ttttgtaatg tatcaaatac atttaaaaca ttaaaatata 4620
atctctataa taatttaaaa tctaatatgg ttttaataga acagcgatat caagcttatc 4680
gataatcaac ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt 4740
gctcctttta cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc 4800
cgtatggctt tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag 4860
ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc 4920
actggttggg gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc 4980
cctattgcca cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg 5040
ctgttgggca ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg 5100
ctcgcctatg ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc 5160
ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt 5220
cttcgccttc gccctcagac gagtcggatc tccctttggg ccgcctcccc gcgaattcat 5280
cgataccgag cgctgctcga gagatctgtg atagcggcca tcaagctggc tgtgccttct 5340
agttgccagc catctgttgt ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc 5400
actcccactg tcctttccta ataaaatgag gaaattgcat cgcattgtct gagtaggtgt 5460
cattctattc tggggggtgg ggtggggcag gacagcaagg gggaggattg ggaagacaat 5520
agcaggcatg ctggggacac gtgcggaccg agcggccgca ggaaccccta gtgatggagt 5580
tggccactcc ctctctgcgc gctcgctcgc tcactgaggc cgggcgacca aaggtcgccc 5640
gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg agcgcgcagc tgcctgcagg 5700
ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatacgtc 5760
aaagcaacca tagtacgcgc cctgtagcgg cgcattaagc gcggcgggtg tggtggttac 5820
gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg ctttcttccc 5880
ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt 5940
agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt tgggtgatgg 6000
ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac 6060
gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta tctcgggcta 6120
ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa atgagctgat 6180
ttaacaaaaa tttaacgcga attttaacaa aatattaacg tttacaattt tatggtgcac 6240
tctcagtaca atctgctctg atgccgcata gttaagccag ccccgacacc cgccaacacc 6300
cgctgacgcg ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac 6360
cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgagacg 6420
aaagggcctc gtgatacgcc tatttttata ggttaatgtc atgataataa tggtttctta 6480
gacgtcaggt ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta 6540
aatacattca aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata 6600
ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc 6660
ggcattttgc cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga 6720
agatcagttg ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct 6780
tgagagtttt cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg 6840
tggcgcggta ttatcccgta ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta 6900
ttctcagaat gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat 6960
gacagtaaga gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt 7020
acttctgaca acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga 7080
tcatgtaact cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga 7140
gcgtgacacc a 7151
<210> 69
<400> 69
000
<210> 70
<211> 7208
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 70
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tgctatctat catcttgaag 1320
ggcttctgga acaagttaga atagagtcaa cactcatgaa ctgctgtagc aaaaaaaact 1380
atagatgtag gattgacaag ggcaatagag cgatgactcc ctggctgtgt tgtatttgat 1440
ggacggcagt agcttttcac aaaatgctca tttggatgtt tcaaattaaa acgtttcact 1500
ttctagaacc aattacgtgg tcagtttagc tcctgaggtc ccagtcagag gggtattctg 1560
tagcttgcaa agcctctctt tggggactgg acatggagtc tgtggtctta gaattcagaa 1620
ccgggagaat gtgttagcca ctcatctaag ctattcctta aacgctttca gagccatctc 1680
cactgtgggg aaagaagttc tttgtgttct ctgacttagt ctcattctaa aaaaaaaaaa 1740
aaaaaaaaaa aaaaagcaat tgcaataccc agagcgcaca gtagatggca ctgagacttg 1800
tcggaaagct ggacgcactc aagaggtggc agaaaaatct ataggtaagc ttttcttcta 1860
gtctggtgtt gctgctcctg accttattaa tgggctgaga aatagatttc tttcctttcc 1920
ttttcttttt tatatgaaat taaatgaagt ataaaagaat atgagaatgt gttgctatta 1980
gcaaggataa gtaatgcttt aggaaacgtt tggttcatgt gtgtgttttc agactgatgt 2040
gtgtcctgga tccagtgtaa aatgtacttc tgtctgtagg tctctgccac agaaaagttg 2100
gaaagccatt gttgtattcc atttccaggg caacaaaaga taccactgtc acttcatgtg 2160
aaatggtgtt gtttaattaa gacctcgaag gggacttggg gggttcgggg ctttcggggg 2220
cggtcggggg ttcgcggacc cgggaagctc tgaggaccca gaggccgggc gcgctccgcc 2280
cgcggcgccg ccccctccgt aactttccca gtctccgagg gaagaggcgg ggtgtggggt 2340
gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc cccgcagcgc ccgcgcgctc 2400
ctctccccga ctcggagccc ctcggcggcg cccggcccag gacccgccta ggagcgcagg 2460
agccccagcg cagagacccc aacgccgaga cccccgcccc ggccccgccg cgcttcctcc 2520
cgacgcagag caaaccgccc agagtagaag ccatggattg gggcacgctg cagacgatcc 2580
tggggggtgt gaacaaacac tccaccagca ttggaaagat ctggctcacc gtcctcttca 2640
tttttcgcat tatgatcctc gttgtggctg caaaggaggt gtggggagat gagcaggccg 2700
actttgtctg caacaccctg cagccaggct gcaagaacgt gtgctacgat cactacttcc 2760
ccatctccca catccggcta tgggccctgc agctgatctt cgtgtccacg ccagcgctcc 2820
tagtggccat gcacgtggcc taccggagac atgagaagaa gaggaagttc atcaaggggg 2880
agataaagag tgaatttaag gacatcgagg agatcaaaac ccagaaggtc cgcatcgaag 2940
gctccctgtg gtggacctac acaagcagca tcttcttccg ggtcatcttc gaagccgcct 3000
tcatgtacgt cttctatgtc atgtacgacg gcttctccat gcagcggctg gtgaagtgca 3060
acgcctggcc ttgtcccaac actgtggact gctttgtgtc ccggcccacg gagaagactg 3120
tcttcacagt gttcatgatt gcagtgtctg gaatttgcat cctgctgaat gtcactgaat 3180
tgtgttattt gctaattaga tattgttctg ggaagtcaaa aaagccagtt taaaggcgcg 3240
ccacccctgc agggaattcc gcattgccca gttgttagat taagaaatag acagcatgag 3300
agggatgagg caacccgtgc tcagctgtca aggctcagtc gctagcattt cccaacacaa 3360
agattctgac cttaaatgca accatttgaa acccctgtag gcctcaggtg aaactccaga 3420
tgccacaatg gagctctgct cccctaaagc ctcaaaacaa aggcctaatt ctatgcctgt 3480
cttaattttc tttcacttaa gttagttcca ctgagacccc aggctgttag gggttattgg 3540
tgtaaggtac tttcatattt taaacagagg atatcggcat ttgtttcttt ctctgaggac 3600
aagagaaaaa agccaggttc cacagaggac acagagaagg tttgggtgtc ctcctggggt 3660
tctttttgcc aactttcccc acgttaaagg tgaacattgg ttctttcatt tgctttggaa 3720
gttttaatct ctaacagtgg acaaagttac cagtgcctta aactctgtta cactttttgg 3780
aagtgaaaac tttgtagtat gataggttat tttgatgtaa agatgttctg gataccatta 3840
tatgttcccc ctgtttcaga ggctcagatt gtaatatgta aatggtatgt cattcgctac 3900
tatgatttaa tttgaaatat ggtcttttgg ttatgaatac tttgcagcac agctgagagg 3960
ctgtctgttg tattcattgt ggtcatagca cctaacaaca ttgtagcctc aatcgagtga 4020
gacagactag aagttcctag tgatggctta tgatagcaaa tggcctcatg tcaaatattt 4080
agatgtaatt ttgtgtaaga aatacagact ggatgtacca ccaactacta cctgtaatga 4140
caggcctgtc caacacatct cccttttcca tgactgtggt agccagcatc ggaaagaacg 4200
ctgatttaaa gaggtcgctt gggaatttta ttgacacagt accatttaat ggggaggaca 4260
aaatggggca ggggagggag aagtttctgt cgttaaaaac agatttggaa agactggact 4320
ctaaagtctg ttgattaaag atgagctttg tctacttcaa aagtttgttt gcttacccct 4380
tcagcctcca attttttaag tgaaaatata gctaataaca tgtgaaaaga atagaagcta 4440
aggtttagat aaatattgag cagatctata ggaagattga acctgaatat tgccattatg 4500
cttgacatgg tttccaaaaa atggtactcc acatatttca gtgagggtaa gtattttcct 4560
gttgtcaaga atagcattgt aaaagcattt tgtaataata aagaatagct ttaatgatat 4620
gcttgtaact aaaataattt tgtaatgtat caaatacatt taaaacatta aaatataatc 4680
tctataataa tttaaaatct aatatggttt taatagaaca gcgatatcaa gcttatcgat 4740
aatcaacctc tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct 4800
ccttttacgc tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt 4860
atggctttca ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg 4920
tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact 4980
ggttggggca ttgccaccac ctgtcagctc ctttccggga ctttcgcttt ccccctccct 5040
attgccacgg cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg 5100
ttgggcactg acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc 5160
gcctatgttg ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc 5220
aatccagcgg accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt 5280
cgccttcgcc ctcagacgag tcggatctcc ctttgggccg cctccccgcg aattcatcga 5340
taccgagcgc tgctcgagag atctgtgata gcggccatca agctggctgt gccttctagt 5400
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 5460
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 5520
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 5580
aggcatgctg gggacacgtg cggaccgagc ggccgcagga acccctagtg atggagttgg 5640
ccactccctc tctgcgcgct cgctcgctca ctgaggccgg gcgaccaaag gtcgcccgac 5700
gcccgggctt tgcccgggcg gcctcagtga gcgagcgagc gcgcagctgc ctgcaggggc 5760
gcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc atacgtcaaa 5820
gcaaccatag tacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 5880
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 5940
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 6000
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgatttgg gtgatggttc 6060
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 6120
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cgggctattc 6180
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 6240
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaattttat ggtgcactct 6300
cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc 6360
tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 6420
ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa 6480
gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 6540
gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 6600
acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg 6660
aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc 6720
attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga 6780
tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga 6840
gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg 6900
cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc 6960
tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac 7020
agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact 7080
tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca 7140
tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg 7200
tgacacca 7208
<210> 71
<211> 7235
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 71
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tgctatctat catcttgaag 1320
ggcttctgga acaagttaga atagagtcaa cactcatgaa ctgctgtagc aaaaaaaact 1380
atagatgtag gattgacaag ggcaatagag cgatgactcc ctggctgtgt tgtatttgat 1440
ggacggcagt agcttttcac aaaatgctca tttggatgtt tcaaattaaa acgtttcact 1500
ttctagaacc aattacgtgg tcagtttagc tcctgaggtc ccagtcagag gggtattctg 1560
tagcttgcaa agcctctctt tggggactgg acatggagtc tgtggtctta gaattcagaa 1620
ccgggagaat gtgttagcca ctcatctaag ctattcctta aacgctttca gagccatctc 1680
cactgtgggg aaagaagttc tttgtgttct ctgacttagt ctcattctaa aaaaaaaaaa 1740
aaaaaaaaaa aaaaagcaat tgcaataccc agagcgcaca gtagatggca ctgagacttg 1800
tcggaaagct ggacgcactc aagaggtggc agaaaaatct ataggtaagc ttttcttcta 1860
gtctggtgtt gctgctcctg accttattaa tgggctgaga aatagatttc tttcctttcc 1920
ttttcttttt tatatgaaat taaatgaagt ataaaagaat atgagaatgt gttgctatta 1980
gcaaggataa gtaatgcttt aggaaacgtt tggttcatgt gtgtgttttc agactgatgt 2040
gtgtcctgga tccagtgtaa aatgtacttc tgtctgtagg tctctgccac agaaaagttg 2100
gaaagccatt gttgtattcc atttccaggg caacaaaaga taccactgtc acttcatgtg 2160
aaatggtgtt gtttaattaa gacctcgaag gggacttggg gggttcgggg ctttcggggg 2220
cggtcggggg ttcgcggacc cgggaagctc tgaggaccca gaggccgggc gcgctccgcc 2280
cgcggcgccg ccccctccgt aactttccca gtctccgagg gaagaggcgg ggtgtggggt 2340
gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc cccgcagcgc ccgcgcgctc 2400
ctctccccga ctcggagccc ctcggcggcg cccggcccag gacccgccta ggagcgcagg 2460
agccccagcg cagagacccc aacgccgaga cccccgcccc ggccccgccg cgcttcctcc 2520
cgacgcagag caaaccgccc agagtagaag ccatggattg gggcacactc cagagcatcc 2580
tcgggggtgt caacaaacac tccaccagca ttggaaagat ctggctcacg gtcctcttca 2640
tcttccgcat catgatcctc gtggtggctg caaaggaggt gtggggagat gagcaagccg 2700
attttgtctg caacacgctc cagcctggct gcaagaatgt atgctacgac caccacttcc 2760
ccatctctca catccggctc tgggctctgc agctgatcat ggtgtccacg ccagccctcc 2820
tggtagctat gcatgtggcc taccggagac atgaaaagaa acggaagttc atgaagggag 2880
agataaagaa cgagtttaag gacatcgaag agatcaaaac ccagaaggtc cgtatcgaag 2940
ggtccctgtg gtggacctac accaccagca tcttcttccg ggtcatcttt gaagccgtct 3000
tcatgtacgt cttttacatc atgtacaatg gcttcttcat gcaacgtctg gtgaaatgca 3060
acgcttggcc ctgccccaat acagtggact gcttcatttc caggcccaca gaaaagactg 3120
tcttcaccgt gtttatgatt tctgtgtctg gaatttgcat tctgctaaat atcacagagc 3180
tgtgctattt gttcgttagg tattgctcag gaaagtccaa aagaccagtc tacccatacg 3240
atgttccaga ttacgcttaa aggcgcgcca cccctgcagg gaattccgca ttgcccagtt 3300
gttagattaa gaaatagaca gcatgagagg gatgaggcaa cccgtgctca gctgtcaagg 3360
ctcagtcgct agcatttccc aacacaaaga ttctgacctt aaatgcaacc atttgaaacc 3420
cctgtaggcc tcaggtgaaa ctccagatgc cacaatggag ctctgctccc ctaaagcctc 3480
aaaacaaagg cctaattcta tgcctgtctt aattttcttt cacttaagtt agttccactg 3540
agaccccagg ctgttagggg ttattggtgt aaggtacttt catattttaa acagaggata 3600
tcggcatttg tttctttctc tgaggacaag agaaaaaagc caggttccac agaggacaca 3660
gagaaggttt gggtgtcctc ctggggttct ttttgccaac tttccccacg ttaaaggtga 3720
acattggttc tttcatttgc tttggaagtt ttaatctcta acagtggaca aagttaccag 3780
tgccttaaac tctgttacac tttttggaag tgaaaacttt gtagtatgat aggttatttt 3840
gatgtaaaga tgttctggat accattatat gttccccctg tttcagaggc tcagattgta 3900
atatgtaaat ggtatgtcat tcgctactat gatttaattt gaaatatggt cttttggtta 3960
tgaatacttt gcagcacagc tgagaggctg tctgttgtat tcattgtggt catagcacct 4020
aacaacattg tagcctcaat cgagtgagac agactagaag ttcctagtga tggcttatga 4080
tagcaaatgg cctcatgtca aatatttaga tgtaattttg tgtaagaaat acagactgga 4140
tgtaccacca actactacct gtaatgacag gcctgtccaa cacatctccc ttttccatga 4200
ctgtggtagc cagcatcgga aagaacgctg atttaaagag gtcgcttggg aattttattg 4260
acacagtacc atttaatggg gaggacaaaa tggggcaggg gagggagaag tttctgtcgt 4320
taaaaacaga tttggaaaga ctggactcta aagtctgttg attaaagatg agctttgtct 4380
acttcaaaag tttgtttgct taccccttca gcctccaatt ttttaagtga aaatatagct 4440
aataacatgt gaaaagaata gaagctaagg tttagataaa tattgagcag atctatagga 4500
agattgaacc tgaatattgc cattatgctt gacatggttt ccaaaaaatg gtactccaca 4560
tatttcagtg agggtaagta ttttcctgtt gtcaagaata gcattgtaaa agcattttgt 4620
aataataaag aatagcttta atgatatgct tgtaactaaa ataattttgt aatgtatcaa 4680
atacatttaa aacattaaaa tataatctct ataataattt aaaatctaat atggttttaa 4740
tagaacagcg atatcaagct tatcgataat caacctctgg attacaaaat ttgtgaaaga 4800
ttgactggta ttcttaacta tgttgctcct tttacgctat gtggatacgc tgctttaatg 4860
cctttgtatc atgctattgc ttcccgtatg gctttcattt tctcctcctt gtataaatcc 4920
tggttgctgt ctctttatga ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc 4980
actgtgtttg ctgacgcaac ccccactggt tggggcattg ccaccacctg tcagctcctt 5040
tccgggactt tcgctttccc cctccctatt gccacggcgg aactcatcgc cgcctgcctt 5100
gcccgctgct ggacaggggc tcggctgttg ggcactgaca attccgtggt gttgtcgggg 5160
aaatcatcgt cctttccttg gctgctcgcc tatgttgcca cctggattct gcgcgggacg 5220
tccttctgct acgtcccttc ggccctcaat ccagcggacc ttccttcccg cggcctgctg 5280
ccggctctgc ggcctcttcc gcgtcttcgc cttcgccctc agacgagtcg gatctccctt 5340
tgggccgcct ccccgcgaat tcatcgatac cgagcgctgc tcgagagatc tgtgatagcg 5400
gccatcaagc tggctgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg 5460
ccttccttga ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt 5520
gcatcgcatt gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc 5580
aagggggagg attgggaaga caatagcagg catgctgggg acacgtgcgg accgagcggc 5640
cgcaggaacc cctagtgatg gagttggcca ctccctctct gcgcgctcgc tcgctcactg 5700
aggccgggcg accaaaggtc gcccgacgcc cgggctttgc ccgggcggcc tcagtgagcg 5760
agcgagcgcg cagctgcctg caggggcgcc tgatgcggta ttttctcctt acgcatctgt 5820
gcggtatttc acaccgcata cgtcaaagca accatagtac gcgccctgta gcggcgcatt 5880
aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc 5940
gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca 6000
agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc acctcgaccc 6060
caaaaaactt gatttgggtg atggttcacg tagtgggcca tcgccctgat agacggtttt 6120
tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc aaactggaac 6180
aacactcaac cctatctcgg gctattcttt tgatttataa gggattttgc cgatttcggc 6240
ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta acaaaatatt 6300
aacgtttaca attttatggt gcactctcag tacaatctgc tctgatgccg catagttaag 6360
ccagccccga cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc 6420
atccgcttac agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc 6480
gtcatcaccg aaacgcgcga gacgaaaggg cctcgtgata cgcctatttt tataggttaa 6540
tgtcatgata ataatggttt cttagacgtc aggtggcact tttcggggaa atgtgcgcgg 6600
aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata 6660
accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg 6720
tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac 6780
gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact 6840
ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat 6900
gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga 6960
gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac 7020
agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat 7080
gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac 7140
cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct 7200
gaatgaagcc ataccaaacg acgagcgtga cacca 7235
<210> 72
<211> 7262
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 72
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taactgggca atgcgttaaa 1320
ctggcttttt tgacttccca gaacaatatc taattagcaa ataacacaat tcagtgacat 1380
tcagcaggat gcaaattcca gacactgcaa tcatgaacac tgtgaagaca gtcttctccg 1440
tgggccggga cacaaagcag tccacagtgt tgggacaagg ccaggcgttg cacttcacca 1500
gccgctgcat ggagaagccg tcgtacatga catagaagac gtacatgaag gcggcttcga 1560
agatgacccg gaagaagatg ctgcttgtgt aggtccacca cagggagcct tcgatgcgga 1620
ccttctgggt tttgatctcc tcgatgtcct taaattcact ctttatctcc cccttgatga 1680
acttcctctt cttctcatgt ctccggtagg ccacgtgcat ggccactagg agcgctggcg 1740
tggacacgaa gatcagctgc agggcccata gccggatgtg ggagatgggg aagtagtgat 1800
cgtagcacac gttcttgcag cctggctgca gggtgttgca gacaaagtcg gcctgctcat 1860
ctccccacac ctcctttgca gccacaacga ggatcataat gcgaaaaatg aagaggacgg 1920
tgagccagat ctttccaatg ctggtggagt gtttgttcac accccccagg atcgtctgca 1980
gcgtgcccca atccatcttc tactctgggc ggtttgctct ggaaaagacg aatgcacaca 2040
acacaggaat cactagctag gacagaacag ggagacttct ctgagtctgg gtaagcaagc 2100
atgcttaaat ctcttcctga gcaaacacca actcttacac aacctcacca aaacaggtga 2160
agacagaacc aacttagttt gtcattaatt aagacctcga aggggacttg gggggttcgg 2220
ggctttcggg ggcggtcggg ggttcgcgga cccgggaagc tctgaggacc cagaggccgg 2280
gcgcgctccg cccgcggcgc cgccccctcc gtaactttcc cagtctccga gggaagaggc 2340
ggggtgtggg gtgcggttaa aaggcgccac ggcgggagac aggtgttgcg gccccgcagc 2400
gcccgcgcgc tcctctcccc gactcggagc ccctcggcgg cgcccggccc aggacccgcc 2460
taggagcgca ggagccccag cgcagagacc ccaacgccga gacccccgcc ccggccccgc 2520
cgcgcttcct cccgacgcag agcaaaccgc ccagagtaga agccatggtg agcaagggcg 2580
aggagctgtt caccggggtg gtgcccatcc tggtcgagct ggacggcgac gtaaacggcc 2640
acaagttcag cgtgtccggc gagggcgagg gcgatgccac ctacggcaag ctgaccctga 2700
agttcatctg caccaccggc aagctgcccg tgccctggcc caccctcgtg accaccctga 2760
cctacggcgt gcagtgcttc agccgctacc ccgaccacat gaagcagcac gacttcttca 2820
agtccgccat gcccgaaggc tacgtccagg agcgcaccat cttcttcaag gacgacggca 2880
actacaagac ccgcgccgag gtgaagttcg agggcgacac cctggtgaac cgcatcgagc 2940
tgaagggcat cgacttcaag gaggacggca acatcctggg gcacaagctg gagtacaact 3000
acaacagcca caacgtctat atcatggccg acaagcagaa gaacggcatc aaggtgaact 3060
tcaagatccg ccacaacatc gaggacggca gcgtgcagct cgccgaccac taccagcaga 3120
acacccccat cggcgacggc cccgtgctgc tgcccgacaa ccactacctg agcacccagt 3180
ccgccctgag caaagacccc aacgagaagc gcgatcacat ggtcctgctg gagttcgtga 3240
ccgccgccgg gatcactctc ggcatggacg agctgtacaa gtaataaagg cgcgccaccc 3300
ctgcagggaa ttccgcattg cccagttgtt agattaagaa atagacagca tgagagggat 3360
gaggcaaccc gtgctcagct gtcaaggctc agtcgctagc atttcccaac acaaagattc 3420
tgaccttaaa tgcaaccatt tgaaacccct gtaggcctca ggtgaaactc cagatgccac 3480
aatggagctc tgctccccta aagcctcaaa acaaaggcct aattctatgc ctgtcttaat 3540
tttctttcac ttaagttagt tccactgaga ccccaggctg ttaggggtta ttggtgtaag 3600
gtactttcat attttaaaca gaggatatcg gcatttgttt ctttctctga ggacaagaga 3660
aaaaagccag gttccacaga ggacacagag aaggtttggg tgtcctcctg gggttctttt 3720
tgccaacttt ccccacgtta aaggtgaaca ttggttcttt catttgcttt ggaagtttta 3780
atctctaaca gtggacaaag ttaccagtgc cttaaactct gttacacttt ttggaagtga 3840
aaactttgta gtatgatagg ttattttgat gtaaagatgt tctggatacc attatatgtt 3900
ccccctgttt cagaggctca gattgtaata tgtaaatggt atgtcattcg ctactatgat 3960
ttaatttgaa atatggtctt ttggttatga atactttgca gcacagctga gaggctgtct 4020
gttgtattca ttgtggtcat agcacctaac aacattgtag cctcaatcga gtgagacaga 4080
ctagaagttc ctagtgatgg cttatgatag caaatggcct catgtcaaat atttagatgt 4140
aattttgtgt aagaaataca gactggatgt accaccaact actacctgta atgacaggcc 4200
tgtccaacac atctcccttt tccatgactg tggtagccag catcggaaag aacgctgatt 4260
taaagaggtc gcttgggaat tttattgaca cagtaccatt taatggggag gacaaaatgg 4320
ggcaggggag ggagaagttt ctgtcgttaa aaacagattt ggaaagactg gactctaaag 4380
tctgttgatt aaagatgagc tttgtctact tcaaaagttt gtttgcttac cccttcagcc 4440
tccaattttt taagtgaaaa tatagctaat aacatgtgaa aagaatagaa gctaaggttt 4500
agataaatat tgagcagatc tataggaaga ttgaacctga atattgccat tatgcttgac 4560
atggtttcca aaaaatggta ctccacatat ttcagtgagg gtaagtattt tcctgttgtc 4620
aagaatagca ttgtaaaagc attttgtaat aataaagaat agctttaatg atatgcttgt 4680
aactaaaata attttgtaat gtatcaaata catttaaaac attaaaatat aatctctata 4740
ataatttaaa atctaatatg gttttaatag aacagcgata tcaagcttat cgataatcaa 4800
cctctggatt acaaaatttg tgaaagattg actggtattc ttaactatgt tgctcctttt 4860
acgctatgtg gatacgctgc tttaatgcct ttgtatcatg ctattgcttc ccgtatggct 4920
ttcattttct cctccttgta taaatcctgg ttgctgtctc tttatgagga gttgtggccc 4980
gttgtcaggc aacgtggcgt ggtgtgcact gtgtttgctg acgcaacccc cactggttgg 5040
ggcattgcca ccacctgtca gctcctttcc gggactttcg ctttccccct ccctattgcc 5100
acggcggaac tcatcgccgc ctgccttgcc cgctgctgga caggggctcg gctgttgggc 5160
actgacaatt ccgtggtgtt gtcggggaaa tcatcgtcct ttccttggct gctcgcctat 5220
gttgccacct ggattctgcg cgggacgtcc ttctgctacg tcccttcggc cctcaatcca 5280
gcggaccttc cttcccgcgg cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt 5340
cgccctcaga cgagtcggat ctccctttgg gccgcctccc cgcgaattca tcgataccga 5400
gcgctgctcg agagatctgt gatagcggcc atcaagctgg ctgtgccttc tagttgccag 5460
ccatctgttg tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact 5520
gtcctttcct aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt 5580
ctggggggtg gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat 5640
gctggggaca cgtgcggacc gagcggccgc aggaacccct agtgatggag ttggccactc 5700
cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg 5760
gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag ctgcctgcag gggcgcctga 5820
tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcatacgt caaagcaacc 5880
atagtacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt 5940
gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct 6000
cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg 6060
atttagtgct ttacggcacc tcgaccccaa aaaacttgat ttgggtgatg gttcacgtag 6120
tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa 6180
tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcgggct attcttttga 6240
tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa 6300
atttaacgcg aattttaaca aaatattaac gtttacaatt ttatggtgca ctctcagtac 6360
aatctgctct gatgccgcat agttaagcca gccccgacac ccgccaacac ccgctgacgc 6420
gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg 6480
gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgagac gaaagggcct 6540
cgtgatacgc ctatttttat aggttaatgt catgataata atggtttctt agacgtcagg 6600
tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc 6660
aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag 6720
gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg 6780
ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt 6840
gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt 6900
tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt 6960
attatcccgt attgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa 7020
tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag 7080
agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac 7140
aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac 7200
tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac 7260
ca 7262
<210> 73
<211> 7220
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 73
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taactgggca atgcgttaaa 1320
ctggcttttt tgacttccca gaacaatatc taattagcaa ataacacaat tcagtgacat 1380
tcagcaggat gcaaattcca gacactgcaa tcatgaacac tgtgaagaca gtcttctccg 1440
tgggccggga cacaaagcag tccacagtgt tgggacaagg ccaggcgttg cacttcacca 1500
gccgctgcat ggagaagccg tcgtacatga catagaagac gtacatgaag gcggcttcga 1560
agatgacccg gaagaagatg ctgcttgtgt aggtccacca cagggagcct tcgatgcgga 1620
ccttctgggt tttgatctcc tcgatgtcct taaattcact ctttatctcc cccttgatga 1680
acttcctctt cttctcatgt ctccggtagg ccacgtgcat ggccactagg agcgctggcg 1740
tggacacgaa gatcagctgc agggcccata gccggatgtg ggagatgggg aagtagtgat 1800
cgtagcacac gttcttgcag cctggctgca gggtgttgca gacaaagtcg gcctgctcat 1860
ctccccacac ctcctttgca gccacaacga ggatcataat gcgaaaaatg aagaggacgg 1920
tgagccagat ctttccaatg ctggtggagt gtttgttcac accccccagg atcgtctgca 1980
gcgtgcccca atccatcttc tactctgggc ggtttgctct ggaaaagacg aatgcacaca 2040
acacaggaat cactagctag gacagaacag ggagacttct ctgagtctgg gtaagcaagc 2100
atgcttaaat ctcttcctga gcaaacacca actcttacac aacctcacca aaacaggtga 2160
agacagaacc aacttagttt gtcattaatt aagacctcga aggggacttg gggggttcgg 2220
ggctttcggg ggcggtcggg ggttcgcgga cccgggaagc tctgaggacc cagaggccgg 2280
gcgcgctccg cccgcggcgc cgccccctcc gtaactttcc cagtctccga gggaagaggc 2340
ggggtgtggg gtgcggttaa aaggcgccac ggcgggagac aggtgttgcg gccccgcagc 2400
gcccgcgcgc tcctctcccc gactcggagc ccctcggcgg cgcccggccc aggacccgcc 2460
taggagcgca ggagccccag cgcagagacc ccaacgccga gacccccgcc ccggccccgc 2520
cgcgcttcct cccgacgcag agcaaaccgc ccagagtaga agccatggat tggggcacgc 2580
tgcagacgat cctggggggt gtgaacaaac actccaccag cattggaaag atctggctca 2640
ccgtcctctt catttttcgc attatgatcc tcgttgtggc tgcaaaggag gtgtggggag 2700
atgagcaggc cgactttgtc tgcaacaccc tgcagccagg ctgcaagaac gtgtgctacg 2760
atcactactt ccccatctcc cacatccggc tatgggccct gcagctgatc ttcgtgtcca 2820
cgccagcgct cctagtggcc atgcacgtgg cctaccggag acatgagaag aagaggaagt 2880
tcatcaaggg ggagataaag agtgaattta aggacatcga ggagatcaaa acccagaagg 2940
tccgcatcga aggctccctg tggtggacct acacaagcag catcttcttc cgggtcatct 3000
tcgaagccgc cttcatgtac gtcttctatg tcatgtacga cggcttctcc atgcagcggc 3060
tggtgaagtg caacgcctgg ccttgtccca acactgtgga ctgctttgtg tcccggccca 3120
cggagaagac tgtcttcaca gtgttcatga ttgcagtgtc tggaatttgc atcctgctga 3180
atgtcactga attgtgttat ttgctaatta gatattgttc tgggaagtca aaaaagccag 3240
tttaaaggcg cgccacccct gcagggaatt ccgcattgcc cagttgttag attaagaaat 3300
agacagcatg agagggatga ggcaacccgt gctcagctgt caaggctcag tcgctagcat 3360
ttcccaacac aaagattctg accttaaatg caaccatttg aaacccctgt aggcctcagg 3420
tgaaactcca gatgccacaa tggagctctg ctcccctaaa gcctcaaaac aaaggcctaa 3480
ttctatgcct gtcttaattt tctttcactt aagttagttc cactgagacc ccaggctgtt 3540
aggggttatt ggtgtaaggt actttcatat tttaaacaga ggatatcggc atttgtttct 3600
ttctctgagg acaagagaaa aaagccaggt tccacagagg acacagagaa ggtttgggtg 3660
tcctcctggg gttctttttg ccaactttcc ccacgttaaa ggtgaacatt ggttctttca 3720
tttgctttgg aagttttaat ctctaacagt ggacaaagtt accagtgcct taaactctgt 3780
tacacttttt ggaagtgaaa actttgtagt atgataggtt attttgatgt aaagatgttc 3840
tggataccat tatatgttcc ccctgtttca gaggctcaga ttgtaatatg taaatggtat 3900
gtcattcgct actatgattt aatttgaaat atggtctttt ggttatgaat actttgcagc 3960
acagctgaga ggctgtctgt tgtattcatt gtggtcatag cacctaacaa cattgtagcc 4020
tcaatcgagt gagacagact agaagttcct agtgatggct tatgatagca aatggcctca 4080
tgtcaaatat ttagatgtaa ttttgtgtaa gaaatacaga ctggatgtac caccaactac 4140
tacctgtaat gacaggcctg tccaacacat ctcccttttc catgactgtg gtagccagca 4200
tcggaaagaa cgctgattta aagaggtcgc ttgggaattt tattgacaca gtaccattta 4260
atggggagga caaaatgggg caggggaggg agaagtttct gtcgttaaaa acagatttgg 4320
aaagactgga ctctaaagtc tgttgattaa agatgagctt tgtctacttc aaaagtttgt 4380
ttgcttaccc cttcagcctc caatttttta agtgaaaata tagctaataa catgtgaaaa 4440
gaatagaagc taaggtttag ataaatattg agcagatcta taggaagatt gaacctgaat 4500
attgccatta tgcttgacat ggtttccaaa aaatggtact ccacatattt cagtgagggt 4560
aagtattttc ctgttgtcaa gaatagcatt gtaaaagcat tttgtaataa taaagaatag 4620
ctttaatgat atgcttgtaa ctaaaataat tttgtaatgt atcaaataca tttaaaacat 4680
taaaatataa tctctataat aatttaaaat ctaatatggt tttaatagaa cagcgatatc 4740
aagcttatcg ataatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt 4800
aactatgttg ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct 4860
attgcttccc gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt 4920
tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac 4980
gcaaccccca ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct 5040
ttccccctcc ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca 5100
ggggctcggc tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt 5160
ccttggctgc tcgcctatgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc 5220
ccttcggccc tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct 5280
cttccgcgtc ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg 5340
cgaattcatc gataccgagc gctgctcgag agatctgtga tagcggccat caagctggct 5400
gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc cttgaccctg 5460
gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc gcattgtctg 5520
agtaggtgtc attctattct ggggggtggg gtggggcagg acagcaaggg ggaggattgg 5580
gaagacaata gcaggcatgc tggggacacg tgcggaccga gcggccgcag gaacccctag 5640
tgatggagtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcgaccaa 5700
aggtcgcccg acgcccgggc tttgcccggg cggcctcagt gagcgagcga gcgcgcagct 5760
gcctgcaggg gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc 5820
gcatacgtca aagcaaccat agtacgcgcc ctgtagcggc gcattaagcg cggcgggtgt 5880
ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc 5940
tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg 6000
gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgattt 6060
gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt 6120
ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat 6180
ctcgggctat tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa 6240
tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt ttacaatttt 6300
atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagc cccgacaccc 6360
gccaacaccc gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg cttacagaca 6420
agctgtgacc gtctccggga gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg 6480
cgcgagacga aagggcctcg tgatacgcct atttttatag gttaatgtca tgataataat 6540
ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt 6600
atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct 6660
tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg cccttattcc 6720
cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa 6780
agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg 6840
taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt 6900
tctgctatgt ggcgcggtat tatcccgtat tgacgccggg caagagcaac tcggtcgccg 6960
catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac 7020
ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc 7080
ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt ttttgcacaa 7140
catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc 7200
aaacgacgag cgtgacacca 7220
<210> 74
<211> 7247
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 74
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taactgggca atgcgttaaa 1320
ctggcttttt tgacttccca gaacaatatc taattagcaa ataacacaat tcagtgacat 1380
tcagcaggat gcaaattcca gacactgcaa tcatgaacac tgtgaagaca gtcttctccg 1440
tgggccggga cacaaagcag tccacagtgt tgggacaagg ccaggcgttg cacttcacca 1500
gccgctgcat ggagaagccg tcgtacatga catagaagac gtacatgaag gcggcttcga 1560
agatgacccg gaagaagatg ctgcttgtgt aggtccacca cagggagcct tcgatgcgga 1620
ccttctgggt tttgatctcc tcgatgtcct taaattcact ctttatctcc cccttgatga 1680
acttcctctt cttctcatgt ctccggtagg ccacgtgcat ggccactagg agcgctggcg 1740
tggacacgaa gatcagctgc agggcccata gccggatgtg ggagatgggg aagtagtgat 1800
cgtagcacac gttcttgcag cctggctgca gggtgttgca gacaaagtcg gcctgctcat 1860
ctccccacac ctcctttgca gccacaacga ggatcataat gcgaaaaatg aagaggacgg 1920
tgagccagat ctttccaatg ctggtggagt gtttgttcac accccccagg atcgtctgca 1980
gcgtgcccca atccatcttc tactctgggc ggtttgctct ggaaaagacg aatgcacaca 2040
acacaggaat cactagctag gacagaacag ggagacttct ctgagtctgg gtaagcaagc 2100
atgcttaaat ctcttcctga gcaaacacca actcttacac aacctcacca aaacaggtga 2160
agacagaacc aacttagttt gtcattaatt aagacctcga aggggacttg gggggttcgg 2220
ggctttcggg ggcggtcggg ggttcgcgga cccgggaagc tctgaggacc cagaggccgg 2280
gcgcgctccg cccgcggcgc cgccccctcc gtaactttcc cagtctccga gggaagaggc 2340
ggggtgtggg gtgcggttaa aaggcgccac ggcgggagac aggtgttgcg gccccgcagc 2400
gcccgcgcgc tcctctcccc gactcggagc ccctcggcgg cgcccggccc aggacccgcc 2460
taggagcgca ggagccccag cgcagagacc ccaacgccga gacccccgcc ccggccccgc 2520
cgcgcttcct cccgacgcag agcaaaccgc ccagagtaga agccatggat tggggcacac 2580
tccagagcat cctcgggggt gtcaacaaac actccaccag cattggaaag atctggctca 2640
cggtcctctt catcttccgc atcatgatcc tcgtggtggc tgcaaaggag gtgtggggag 2700
atgagcaagc cgattttgtc tgcaacacgc tccagcctgg ctgcaagaat gtatgctacg 2760
accaccactt ccccatctct cacatccggc tctgggctct gcagctgatc atggtgtcca 2820
cgccagccct cctggtagct atgcatgtgg cctaccggag acatgaaaag aaacggaagt 2880
tcatgaaggg agagataaag aacgagttta aggacatcga agagatcaaa acccagaagg 2940
tccgtatcga agggtccctg tggtggacct acaccaccag catcttcttc cgggtcatct 3000
ttgaagccgt cttcatgtac gtcttttaca tcatgtacaa tggcttcttc atgcaacgtc 3060
tggtgaaatg caacgcttgg ccctgcccca atacagtgga ctgcttcatt tccaggccca 3120
cagaaaagac tgtcttcacc gtgtttatga tttctgtgtc tggaatttgc attctgctaa 3180
atatcacaga gctgtgctat ttgttcgtta ggtattgctc aggaaagtcc aaaagaccag 3240
tctacccata cgatgttcca gattacgctt aaaggcgcgc cacccctgca gggaattccg 3300
cattgcccag ttgttagatt aagaaataga cagcatgaga gggatgaggc aacccgtgct 3360
cagctgtcaa ggctcagtcg ctagcatttc ccaacacaaa gattctgacc ttaaatgcaa 3420
ccatttgaaa cccctgtagg cctcaggtga aactccagat gccacaatgg agctctgctc 3480
ccctaaagcc tcaaaacaaa ggcctaattc tatgcctgtc ttaattttct ttcacttaag 3540
ttagttccac tgagacccca ggctgttagg ggttattggt gtaaggtact ttcatatttt 3600
aaacagagga tatcggcatt tgtttctttc tctgaggaca agagaaaaaa gccaggttcc 3660
acagaggaca cagagaaggt ttgggtgtcc tcctggggtt ctttttgcca actttcccca 3720
cgttaaaggt gaacattggt tctttcattt gctttggaag ttttaatctc taacagtgga 3780
caaagttacc agtgccttaa actctgttac actttttgga agtgaaaact ttgtagtatg 3840
ataggttatt ttgatgtaaa gatgttctgg ataccattat atgttccccc tgtttcagag 3900
gctcagattg taatatgtaa atggtatgtc attcgctact atgatttaat ttgaaatatg 3960
gtcttttggt tatgaatact ttgcagcaca gctgagaggc tgtctgttgt attcattgtg 4020
gtcatagcac ctaacaacat tgtagcctca atcgagtgag acagactaga agttcctagt 4080
gatggcttat gatagcaaat ggcctcatgt caaatattta gatgtaattt tgtgtaagaa 4140
atacagactg gatgtaccac caactactac ctgtaatgac aggcctgtcc aacacatctc 4200
ccttttccat gactgtggta gccagcatcg gaaagaacgc tgatttaaag aggtcgcttg 4260
ggaattttat tgacacagta ccatttaatg gggaggacaa aatggggcag gggagggaga 4320
agtttctgtc gttaaaaaca gatttggaaa gactggactc taaagtctgt tgattaaaga 4380
tgagctttgt ctacttcaaa agtttgtttg cttacccctt cagcctccaa ttttttaagt 4440
gaaaatatag ctaataacat gtgaaaagaa tagaagctaa ggtttagata aatattgagc 4500
agatctatag gaagattgaa cctgaatatt gccattatgc ttgacatggt ttccaaaaaa 4560
tggtactcca catatttcag tgagggtaag tattttcctg ttgtcaagaa tagcattgta 4620
aaagcatttt gtaataataa agaatagctt taatgatatg cttgtaacta aaataatttt 4680
gtaatgtatc aaatacattt aaaacattaa aatataatct ctataataat ttaaaatcta 4740
atatggtttt aatagaacag cgatatcaag cttatcgata atcaacctct ggattacaaa 4800
atttgtgaaa gattgactgg tattcttaac tatgttgctc cttttacgct atgtggatac 4860
gctgctttaa tgcctttgta tcatgctatt gcttcccgta tggctttcat tttctcctcc 4920
ttgtataaat cctggttgct gtctctttat gaggagttgt ggcccgttgt caggcaacgt 4980
ggcgtggtgt gcactgtgtt tgctgacgca acccccactg gttggggcat tgccaccacc 5040
tgtcagctcc tttccgggac tttcgctttc cccctcccta ttgccacggc ggaactcatc 5100
gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt tgggcactga caattccgtg 5160
gtgttgtcgg ggaaatcatc gtcctttcct tggctgctcg cctatgttgc cacctggatt 5220
ctgcgcggga cgtccttctg ctacgtccct tcggccctca atccagcgga ccttccttcc 5280
cgcggcctgc tgccggctct gcggcctctt ccgcgtcttc gccttcgccc tcagacgagt 5340
cggatctccc tttgggccgc ctccccgcga attcatcgat accgagcgct gctcgagaga 5400
tctgtgatag cggccatcaa gctggctgtg ccttctagtt gccagccatc tgttgtttgc 5460
ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa 5520
aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg 5580
gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggacacgtgc 5640
ggaccgagcg gccgcaggaa cccctagtga tggagttggc cactccctct ctgcgcgctc 5700
gctcgctcac tgaggccggg cgaccaaagg tcgcccgacg cccgggcttt gcccgggcgg 5760
cctcagtgag cgagcgagcg cgcagctgcc tgcaggggcg cctgatgcgg tattttctcc 5820
ttacgcatct gtgcggtatt tcacaccgca tacgtcaaag caaccatagt acgcgccctg 5880
tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc 5940
cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg 6000
ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta gtgctttacg 6060
gcacctcgac cccaaaaaac ttgatttggg tgatggttca cgtagtgggc catcgccctg 6120
atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg gactcttgtt 6180
ccaaactgga acaacactca accctatctc gggctattct tttgatttat aagggatttt 6240
gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta acgcgaattt 6300
taacaaaata ttaacgttta caattttatg gtgcactctc agtacaatct gctctgatgc 6360
cgcatagtta agccagcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg 6420
tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca 6480
gaggttttca ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga tacgcctatt 6540
tttataggtt aatgtcatga taataatggt ttcttagacg tcaggtggca cttttcgggg 6600
aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct 6660
catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat 6720
tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc 6780
tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg 6840
ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg 6900
ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga 6960
cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta 7020
ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc 7080
tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc 7140
gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg 7200
ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacacca 7247
<210> 75
<211> 7204
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 75
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taagctacta actacaacca 1320
cgagattata gatgtttgct gatattgttc tcagtttggt tattgtgttg tttatgaatg 1380
aaagtagtgt atgtttgtgt gaatttttgt ttttaatttt ttatgagtgc cctaacaaag 1440
attacaaatt gggaatacaa actccagagc aatggagaca gtgacacttt tgtggagggg 1500
tacatgtggc tgttcgggtg gttattaaca caggctgctg cccctgccct gcaatgggaa 1560
tccccagggc attggaggat tcaacctctt gcagttacct cttgtaagac agcagatggc 1620
agcagagaga ggctttgcac atccctgcag gttctagttt gcacaaaggg cttctgagag 1680
acctatcaac caattataac atcaagtggc aaaaagagtc cttgataagt tatttcgctt 1740
ctcaaagaaa ccgaaaacgc caaactaatc actagtcttg tttttttttt tcctggcaaa 1800
agcctgctat ctttcatgat ttagctttca tgaaattgtt cctgaagacc cccaaaagaa 1860
acaatttcat gccccgaact ctgttcagag actttgctgt gcctgtcatg tccagcttgc 1920
catatcctgt tttgtaaagt agccacctta tatacacacc tgctgtctgc actgtgacct 1980
cctttcaaaa tcatctttgg ttcttcagag gcctggaata atgctctgcc cagatgaaga 2040
tctccgtaaa tgtgtttttg aaatggctaa tcaaataatg gataccctta ggtatttttg 2100
cagaaacact tggcagcctt ccataatatc cctactatga aatggaaact tgtgaatgag 2160
atgtggcttt aattaagacc tcgaagggga cttggggggt tcggggcttt cgggggcggt 2220
cgggggttcg cggacccggg aagctctgag gacccagagg ccgggcgcgc tccgcccgcg 2280
gcgccgcccc ctccgtaact ttcccagtct ccgagggaag aggcggggtg tggggtgcgg 2340
ttaaaaggcg ccacggcggg agacaggtgt tgcggccccg cagcgcccgc gcgctcctct 2400
ccccgactcg gagcccctcg gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc 2460
ccagcgcaga gaccccaacg ccgagacccc cgccccggcc ccgccgcgct tcctcccgac 2520
gcagagcaaa ccgcccagag tagaagccat ggattggggc acgctgcaga cgatcctggg 2580
gggtgtgaac aaacactcca ccagcattgg aaagatctgg ctcaccgtcc tcttcatttt 2640
tcgcattatg atcctcgttg tggctgcaaa ggaggtgtgg ggagatgagc aggccgactt 2700
tgtctgcaac accctgcagc caggctgcaa gaacgtgtgc tacgatcact acttccccat 2760
ctcccacatc cggctatggg ccctgcagct gatcttcgtg tccacgccag cgctcctagt 2820
ggccatgcac gtggcctacc ggagacatga gaagaagagg aagttcatca agggggagat 2880
aaagagtgaa tttaaggaca tcgaggagat caaaacccag aaggtccgca tcgaaggctc 2940
cctgtggtgg acctacacaa gcagcatctt cttccgggtc atcttcgaag ccgccttcat 3000
gtacgtcttc tatgtcatgt acgacggctt ctccatgcag cggctggtga agtgcaacgc 3060
ctggccttgt cccaacactg tggactgctt tgtgtcccgg cccacggaga agactgtctt 3120
cacagtgttc atgattgcag tgtctggaat ttgcatcctg ctgaatgtca ctgaattgtg 3180
ttatttgcta attagatatt gttctgggaa gtcaaaaaag ccagtttaaa ggcgcgccac 3240
ccctgcaggg aattccgcat tgcccagttg ttagattaag aaatagacag catgagaggg 3300
atgaggcaac ccgtgctcag ctgtcaaggc tcagtcgcta gcatttccca acacaaagat 3360
tctgacctta aatgcaacca tttgaaaccc ctgtaggcct caggtgaaac tccagatgcc 3420
acaatggagc tctgctcccc taaagcctca aaacaaaggc ctaattctat gcctgtctta 3480
attttctttc acttaagtta gttccactga gaccccaggc tgttaggggt tattggtgta 3540
aggtactttc atattttaaa cagaggatat cggcatttgt ttctttctct gaggacaaga 3600
gaaaaaagcc aggttccaca gaggacacag agaaggtttg ggtgtcctcc tggggttctt 3660
tttgccaact ttccccacgt taaaggtgaa cattggttct ttcatttgct ttggaagttt 3720
taatctctaa cagtggacaa agttaccagt gccttaaact ctgttacact ttttggaagt 3780
gaaaactttg tagtatgata ggttattttg atgtaaagat gttctggata ccattatatg 3840
ttccccctgt ttcagaggct cagattgtaa tatgtaaatg gtatgtcatt cgctactatg 3900
atttaatttg aaatatggtc ttttggttat gaatactttg cagcacagct gagaggctgt 3960
ctgttgtatt cattgtggtc atagcaccta acaacattgt agcctcaatc gagtgagaca 4020
gactagaagt tcctagtgat ggcttatgat agcaaatggc ctcatgtcaa atatttagat 4080
gtaattttgt gtaagaaata cagactggat gtaccaccaa ctactacctg taatgacagg 4140
cctgtccaac acatctccct tttccatgac tgtggtagcc agcatcggaa agaacgctga 4200
tttaaagagg tcgcttggga attttattga cacagtacca tttaatgggg aggacaaaat 4260
ggggcagggg agggagaagt ttctgtcgtt aaaaacagat ttggaaagac tggactctaa 4320
agtctgttga ttaaagatga gctttgtcta cttcaaaagt ttgtttgctt accccttcag 4380
cctccaattt tttaagtgaa aatatagcta ataacatgtg aaaagaatag aagctaaggt 4440
ttagataaat attgagcaga tctataggaa gattgaacct gaatattgcc attatgcttg 4500
acatggtttc caaaaaatgg tactccacat atttcagtga gggtaagtat tttcctgttg 4560
tcaagaatag cattgtaaaa gcattttgta ataataaaga atagctttaa tgatatgctt 4620
gtaactaaaa taattttgta atgtatcaaa tacatttaaa acattaaaat ataatctcta 4680
taataattta aaatctaata tggttttaat agaacagcga tatcaagctt atcgataatc 4740
aacctctgga ttacaaaatt tgtgaaagat tgactggtat tcttaactat gttgctcctt 4800
ttacgctatg tggatacgct gctttaatgc ctttgtatca tgctattgct tcccgtatgg 4860
ctttcatttt ctcctccttg tataaatcct ggttgctgtc tctttatgag gagttgtggc 4920
ccgttgtcag gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt 4980
ggggcattgc caccacctgt cagctccttt ccgggacttt cgctttcccc ctccctattg 5040
ccacggcgga actcatcgcc gcctgccttg cccgctgctg gacaggggct cggctgttgg 5100
gcactgacaa ttccgtggtg ttgtcgggga aatcatcgtc ctttccttgg ctgctcgcct 5160
atgttgccac ctggattctg cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc 5220
cagcggacct tccttcccgc ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc 5280
ttcgccctca gacgagtcgg atctcccttt gggccgcctc cccgcgaatt catcgatacc 5340
gagcgctgct cgagagatct gtgatagcgg ccatcaagct ggctgtgcct tctagttgcc 5400
agccatctgt tgtttgcccc tcccccgtgc cttccttgac cctggaaggt gccactccca 5460
ctgtcctttc ctaataaaat gaggaaattg catcgcattg tctgagtagg tgtcattcta 5520
ttctgggggg tggggtgggg caggacagca agggggagga ttgggaagac aatagcaggc 5580
atgctgggga cacgtgcgga ccgagcggcc gcaggaaccc ctagtgatgg agttggccac 5640
tccctctctg cgcgctcgct cgctcactga ggccgggcga ccaaaggtcg cccgacgccc 5700
gggctttgcc cgggcggcct cagtgagcga gcgagcgcgc agctgcctgc aggggcgcct 5760
gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatac gtcaaagcaa 5820
ccatagtacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc 5880
gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt 5940
ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc 6000
cgatttagtg ctttacggca cctcgacccc aaaaaacttg atttgggtga tggttcacgt 6060
agtgggccat cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt 6120
aatagtggac tcttgttcca aactggaaca acactcaacc ctatctcggg ctattctttt 6180
gatttataag ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa 6240
aaatttaacg cgaattttaa caaaatatta acgtttacaa ttttatggtg cactctcagt 6300
acaatctgct ctgatgccgc atagttaagc cagccccgac acccgccaac acccgctgac 6360
gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc 6420
gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag acgaaagggc 6480
ctcgtgatac gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca 6540
ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat 6600
tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa 6660
aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt 6720
tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag 6780
ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt 6840
tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg 6900
gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag 6960
aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta 7020
agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg 7080
acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta 7140
actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac 7200
acca 7204
<210> 76
<211> 7231
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 76
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taagctacta actacaacca 1320
cgagattata gatgtttgct gatattgttc tcagtttggt tattgtgttg tttatgaatg 1380
aaagtagtgt atgtttgtgt gaatttttgt ttttaatttt ttatgagtgc cctaacaaag 1440
attacaaatt gggaatacaa actccagagc aatggagaca gtgacacttt tgtggagggg 1500
tacatgtggc tgttcgggtg gttattaaca caggctgctg cccctgccct gcaatgggaa 1560
tccccagggc attggaggat tcaacctctt gcagttacct cttgtaagac agcagatggc 1620
agcagagaga ggctttgcac atccctgcag gttctagttt gcacaaaggg cttctgagag 1680
acctatcaac caattataac atcaagtggc aaaaagagtc cttgataagt tatttcgctt 1740
ctcaaagaaa ccgaaaacgc caaactaatc actagtcttg tttttttttt tcctggcaaa 1800
agcctgctat ctttcatgat ttagctttca tgaaattgtt cctgaagacc cccaaaagaa 1860
acaatttcat gccccgaact ctgttcagag actttgctgt gcctgtcatg tccagcttgc 1920
catatcctgt tttgtaaagt agccacctta tatacacacc tgctgtctgc actgtgacct 1980
cctttcaaaa tcatctttgg ttcttcagag gcctggaata atgctctgcc cagatgaaga 2040
tctccgtaaa tgtgtttttg aaatggctaa tcaaataatg gataccctta ggtatttttg 2100
cagaaacact tggcagcctt ccataatatc cctactatga aatggaaact tgtgaatgag 2160
atgtggcttt aattaagacc tcgaagggga cttggggggt tcggggcttt cgggggcggt 2220
cgggggttcg cggacccggg aagctctgag gacccagagg ccgggcgcgc tccgcccgcg 2280
gcgccgcccc ctccgtaact ttcccagtct ccgagggaag aggcggggtg tggggtgcgg 2340
ttaaaaggcg ccacggcggg agacaggtgt tgcggccccg cagcgcccgc gcgctcctct 2400
ccccgactcg gagcccctcg gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc 2460
ccagcgcaga gaccccaacg ccgagacccc cgccccggcc ccgccgcgct tcctcccgac 2520
gcagagcaaa ccgcccagag tagaagccat ggattggggc acactccaga gcatcctcgg 2580
gggtgtcaac aaacactcca ccagcattgg aaagatctgg ctcacggtcc tcttcatctt 2640
ccgcatcatg atcctcgtgg tggctgcaaa ggaggtgtgg ggagatgagc aagccgattt 2700
tgtctgcaac acgctccagc ctggctgcaa gaatgtatgc tacgaccacc acttccccat 2760
ctctcacatc cggctctggg ctctgcagct gatcatggtg tccacgccag ccctcctggt 2820
agctatgcat gtggcctacc ggagacatga aaagaaacgg aagttcatga agggagagat 2880
aaagaacgag tttaaggaca tcgaagagat caaaacccag aaggtccgta tcgaagggtc 2940
cctgtggtgg acctacacca ccagcatctt cttccgggtc atctttgaag ccgtcttcat 3000
gtacgtcttt tacatcatgt acaatggctt cttcatgcaa cgtctggtga aatgcaacgc 3060
ttggccctgc cccaatacag tggactgctt catttccagg cccacagaaa agactgtctt 3120
caccgtgttt atgatttctg tgtctggaat ttgcattctg ctaaatatca cagagctgtg 3180
ctatttgttc gttaggtatt gctcaggaaa gtccaaaaga ccagtctacc catacgatgt 3240
tccagattac gcttaaaggc gcgccacccc tgcagggaat tccgcattgc ccagttgtta 3300
gattaagaaa tagacagcat gagagggatg aggcaacccg tgctcagctg tcaaggctca 3360
gtcgctagca tttcccaaca caaagattct gaccttaaat gcaaccattt gaaacccctg 3420
taggcctcag gtgaaactcc agatgccaca atggagctct gctcccctaa agcctcaaaa 3480
caaaggccta attctatgcc tgtcttaatt ttctttcact taagttagtt ccactgagac 3540
cccaggctgt taggggttat tggtgtaagg tactttcata ttttaaacag aggatatcgg 3600
catttgtttc tttctctgag gacaagagaa aaaagccagg ttccacagag gacacagaga 3660
aggtttgggt gtcctcctgg ggttcttttt gccaactttc cccacgttaa aggtgaacat 3720
tggttctttc atttgctttg gaagttttaa tctctaacag tggacaaagt taccagtgcc 3780
ttaaactctg ttacactttt tggaagtgaa aactttgtag tatgataggt tattttgatg 3840
taaagatgtt ctggatacca ttatatgttc cccctgtttc agaggctcag attgtaatat 3900
gtaaatggta tgtcattcgc tactatgatt taatttgaaa tatggtcttt tggttatgaa 3960
tactttgcag cacagctgag aggctgtctg ttgtattcat tgtggtcata gcacctaaca 4020
acattgtagc ctcaatcgag tgagacagac tagaagttcc tagtgatggc ttatgatagc 4080
aaatggcctc atgtcaaata tttagatgta attttgtgta agaaatacag actggatgta 4140
ccaccaacta ctacctgtaa tgacaggcct gtccaacaca tctccctttt ccatgactgt 4200
ggtagccagc atcggaaaga acgctgattt aaagaggtcg cttgggaatt ttattgacac 4260
agtaccattt aatggggagg acaaaatggg gcaggggagg gagaagtttc tgtcgttaaa 4320
aacagatttg gaaagactgg actctaaagt ctgttgatta aagatgagct ttgtctactt 4380
caaaagtttg tttgcttacc ccttcagcct ccaatttttt aagtgaaaat atagctaata 4440
acatgtgaaa agaatagaag ctaaggttta gataaatatt gagcagatct ataggaagat 4500
tgaacctgaa tattgccatt atgcttgaca tggtttccaa aaaatggtac tccacatatt 4560
tcagtgaggg taagtatttt cctgttgtca agaatagcat tgtaaaagca ttttgtaata 4620
ataaagaata gctttaatga tatgcttgta actaaaataa ttttgtaatg tatcaaatac 4680
atttaaaaca ttaaaatata atctctataa taatttaaaa tctaatatgg ttttaataga 4740
acagcgatat caagcttatc gataatcaac ctctggatta caaaatttgt gaaagattga 4800
ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct ttaatgcctt 4860
tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat aaatcctggt 4920
tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg 4980
tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag ctcctttccg 5040
ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc tgccttgccc 5100
gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg tcggggaaat 5160
catcgtcctt tccttggctg ctcgcctatg ttgccacctg gattctgcgc gggacgtcct 5220
tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg 5280
ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc tccctttggg 5340
ccgcctcccc gcgaattcat cgataccgag cgctgctcga gagatctgtg atagcggcca 5400
tcaagctggc tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 5460
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 5520
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 5580
gggaggattg ggaagacaat agcaggcatg ctggggacac gtgcggaccg agcggccgca 5640
ggaaccccta gtgatggagt tggccactcc ctctctgcgc gctcgctcgc tcactgaggc 5700
cgggcgacca aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg 5760
agcgcgcagc tgcctgcagg ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg 5820
tatttcacac cgcatacgtc aaagcaacca tagtacgcgc cctgtagcgg cgcattaagc 5880
gcggcgggtg tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc 5940
gctcctttcg ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct 6000
ctaaatcggg ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa 6060
aaacttgatt tgggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc 6120
cctttgacgt tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca 6180
ctcaacccta tctcgggcta ttcttttgat ttataaggga ttttgccgat ttcggcctat 6240
tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg 6300
tttacaattt tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag 6360
ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct cccggcatcc 6420
gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca 6480
tcaccgaaac gcgcgagacg aaagggcctc gtgatacgcc tatttttata ggttaatgtc 6540
atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt gcgcggaacc 6600
cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag acaataaccc 6660
tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc 6720
gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc agaaacgctg 6780
gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat cgaactggat 6840
ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc aatgatgagc 6900
acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg gcaagagcaa 6960
ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc agtcacagaa 7020
aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat aaccatgagt 7080
gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga gctaaccgct 7140
tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc ggagctgaat 7200
gaagccatac caaacgacga gcgtgacacc a 7231
<210> 77
<211> 7214
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 77
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tctcagctgg agtgacgcac 1320
ctcatccatg cgggcctggc gtctggaagg tggctgggtc tctcgggctt gagcaccatc 1380
atcttagctc caacatgtca ttattccttc ctcactgagg acttttctgc ttcctaattg 1440
gttgttgaag atgaggcccc catgctcttt taagaaaacc tgttgtgccc caggcttggc 1500
tgtgatgggc actgactcat acagaagtag aaaggcctgc tgagtcatca acactcgtgc 1560
gacgccctcg cattttcatt aatgatggcc tccctgccac acgtgaatca ctccagcccg 1620
agatctgaaa ccaggacaca ccccaggggc gaggtgacgc tgagtgagcc cagctgtgtc 1680
cctttcatga gaactcagag cacagggctc tgtgtgcatg gccgtcccct ccagagagga 1740
ggaagtaaat gccgggatta gtggaagatc atttccttct atttgccttg gcttacgtct 1800
ttcagaattc aaacacgtgc actgttgacc ctgcaatggt ggagtttttg gattttcctt 1860
cagtccgatt gctaaaatac ttccctctca tgtgagctgt tgtgaaagtc atcagccaga 1920
taccattcta aaaacaaaga atgtgcttct cgtatgttgc atgctggtta ctgaaatatt 1980
agggaattac ataaaggttt tctggggcac atattcaagc tgaatgataa aattgaaggt 2040
cacacaaagc taaggtcttt caaatcctga cccaattagc tctctgttag ctctctgact 2100
ttggacaagc tgtctggtcc tctgaagcat actttgttcg ccctgggtag gggccctctg 2160
ttttaacagc gtttggcatt aattaagacc tcgaagggga cttggggggt tcggggcttt 2220
cgggggcggt cgggggttcg cggacccggg aagctctgag gacccagagg ccgggcgcgc 2280
tccgcccgcg gcgccgcccc ctccgtaact ttcccagtct ccgagggaag aggcggggtg 2340
tggggtgcgg ttaaaaggcg ccacggcggg agacaggtgt tgcggccccg cagcgcccgc 2400
gcgctcctct ccccgactcg gagcccctcg gcggcgcccg gcccaggacc cgcctaggag 2460
cgcaggagcc ccagcgcaga gaccccaacg ccgagacccc cgccccggcc ccgccgcgct 2520
tcctcccgac gcagagcaaa ccgcccagag tagaagccat ggattggggc acgctgcaga 2580
cgatcctggg gggtgtgaac aaacactcca ccagcattgg aaagatctgg ctcaccgtcc 2640
tcttcatttt tcgcattatg atcctcgttg tggctgcaaa ggaggtgtgg ggagatgagc 2700
aggccgactt tgtctgcaac accctgcagc caggctgcaa gaacgtgtgc tacgatcact 2760
acttccccat ctcccacatc cggctatggg ccctgcagct gatcttcgtg tccacgccag 2820
cgctcctagt ggccatgcac gtggcctacc ggagacatga gaagaagagg aagttcatca 2880
agggggagat aaagagtgaa tttaaggaca tcgaggagat caaaacccag aaggtccgca 2940
tcgaaggctc cctgtggtgg acctacacaa gcagcatctt cttccgggtc atcttcgaag 3000
ccgccttcat gtacgtcttc tatgtcatgt acgacggctt ctccatgcag cggctggtga 3060
agtgcaacgc ctggccttgt cccaacactg tggactgctt tgtgtcccgg cccacggaga 3120
agactgtctt cacagtgttc atgattgcag tgtctggaat ttgcatcctg ctgaatgtca 3180
ctgaattgtg ttatttgcta attagatatt gttctgggaa gtcaaaaaag ccagtttaaa 3240
ggcgcgccac ccctgcaggg aattccgcat tgcccagttg ttagattaag aaatagacag 3300
catgagaggg atgaggcaac ccgtgctcag ctgtcaaggc tcagtcgcta gcatttccca 3360
acacaaagat tctgacctta aatgcaacca tttgaaaccc ctgtaggcct caggtgaaac 3420
tccagatgcc acaatggagc tctgctcccc taaagcctca aaacaaaggc ctaattctat 3480
gcctgtctta attttctttc acttaagtta gttccactga gaccccaggc tgttaggggt 3540
tattggtgta aggtactttc atattttaaa cagaggatat cggcatttgt ttctttctct 3600
gaggacaaga gaaaaaagcc aggttccaca gaggacacag agaaggtttg ggtgtcctcc 3660
tggggttctt tttgccaact ttccccacgt taaaggtgaa cattggttct ttcatttgct 3720
ttggaagttt taatctctaa cagtggacaa agttaccagt gccttaaact ctgttacact 3780
ttttggaagt gaaaactttg tagtatgata ggttattttg atgtaaagat gttctggata 3840
ccattatatg ttccccctgt ttcagaggct cagattgtaa tatgtaaatg gtatgtcatt 3900
cgctactatg atttaatttg aaatatggtc ttttggttat gaatactttg cagcacagct 3960
gagaggctgt ctgttgtatt cattgtggtc atagcaccta acaacattgt agcctcaatc 4020
gagtgagaca gactagaagt tcctagtgat ggcttatgat agcaaatggc ctcatgtcaa 4080
atatttagat gtaattttgt gtaagaaata cagactggat gtaccaccaa ctactacctg 4140
taatgacagg cctgtccaac acatctccct tttccatgac tgtggtagcc agcatcggaa 4200
agaacgctga tttaaagagg tcgcttggga attttattga cacagtacca tttaatgggg 4260
aggacaaaat ggggcagggg agggagaagt ttctgtcgtt aaaaacagat ttggaaagac 4320
tggactctaa agtctgttga ttaaagatga gctttgtcta cttcaaaagt ttgtttgctt 4380
accccttcag cctccaattt tttaagtgaa aatatagcta ataacatgtg aaaagaatag 4440
aagctaaggt ttagataaat attgagcaga tctataggaa gattgaacct gaatattgcc 4500
attatgcttg acatggtttc caaaaaatgg tactccacat atttcagtga gggtaagtat 4560
tttcctgttg tcaagaatag cattgtaaaa gcattttgta ataataaaga atagctttaa 4620
tgatatgctt gtaactaaaa taattttgta atgtatcaaa tacatttaaa acattaaaat 4680
ataatctcta taataattta aaatctaata tggttttaat agaacagcga tatcaagctt 4740
atcgataatc aacctctgga ttacaaaatt tgtgaaagat tgactggtat tcttaactat 4800
gttgctcctt ttacgctatg tggatacgct gctttaatgc ctttgtatca tgctattgct 4860
tcccgtatgg ctttcatttt ctcctccttg tataaatcct ggttgctgtc tctttatgag 4920
gagttgtggc ccgttgtcag gcaacgtggc gtggtgtgca ctgtgtttgc tgacgcaacc 4980
cccactggtt ggggcattgc caccacctgt cagctccttt ccgggacttt cgctttcccc 5040
ctccctattg ccacggcgga actcatcgcc gcctgccttg cccgctgctg gacaggggct 5100
cggctgttgg gcactgacaa ttccgtggtg ttgtcgggga aatcatcgtc ctttccttgg 5160
ctgctcgcct atgttgccac ctggattctg cgcgggacgt ccttctgcta cgtcccttcg 5220
gccctcaatc cagcggacct tccttcccgc ggcctgctgc cggctctgcg gcctcttccg 5280
cgtcttcgcc ttcgccctca gacgagtcgg atctcccttt gggccgcctc cccgcgaatt 5340
catcgatacc gagcgctgct cgagagatct gtgatagcgg ccatcaagct ggctgtgcct 5400
tctagttgcc agccatctgt tgtttgcccc tcccccgtgc cttccttgac cctggaaggt 5460
gccactccca ctgtcctttc ctaataaaat gaggaaattg catcgcattg tctgagtagg 5520
tgtcattcta ttctgggggg tggggtgggg caggacagca agggggagga ttgggaagac 5580
aatagcaggc atgctgggga cacgtgcgga ccgagcggcc gcaggaaccc ctagtgatgg 5640
agttggccac tccctctctg cgcgctcgct cgctcactga ggccgggcga ccaaaggtcg 5700
cccgacgccc gggctttgcc cgggcggcct cagtgagcga gcgagcgcgc agctgcctgc 5760
aggggcgcct gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatac 5820
gtcaaagcaa ccatagtacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt 5880
tacgcgcagc gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt 5940
cccttccttt ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc gggggctccc 6000
tttagggttc cgatttagtg ctttacggca cctcgacccc aaaaaacttg atttgggtga 6060
tggttcacgt agtgggccat cgccctgata gacggttttt cgccctttga cgttggagtc 6120
cacgttcttt aatagtggac tcttgttcca aactggaaca acactcaacc ctatctcggg 6180
ctattctttt gatttataag ggattttgcc gatttcggcc tattggttaa aaaatgagct 6240
gatttaacaa aaatttaacg cgaattttaa caaaatatta acgtttacaa ttttatggtg 6300
cactctcagt acaatctgct ctgatgccgc atagttaagc cagccccgac acccgccaac 6360
acccgctgac gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt 6420
gaccgtctcc gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag 6480
acgaaagggc ctcgtgatac gcctattttt ataggttaat gtcatgataa taatggtttc 6540
ttagacgtca ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt 6600
ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata 6660
atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 6720
tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 6780
tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 6840
ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 6900
atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 6960
ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 7020
catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 7080
cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 7140
ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 7200
cgagcgtgac acca 7214
<210> 78
<211> 7241
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 78
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tctcagctgg agtgacgcac 1320
ctcatccatg cgggcctggc gtctggaagg tggctgggtc tctcgggctt gagcaccatc 1380
atcttagctc caacatgtca ttattccttc ctcactgagg acttttctgc ttcctaattg 1440
gttgttgaag atgaggcccc catgctcttt taagaaaacc tgttgtgccc caggcttggc 1500
tgtgatgggc actgactcat acagaagtag aaaggcctgc tgagtcatca acactcgtgc 1560
gacgccctcg cattttcatt aatgatggcc tccctgccac acgtgaatca ctccagcccg 1620
agatctgaaa ccaggacaca ccccaggggc gaggtgacgc tgagtgagcc cagctgtgtc 1680
cctttcatga gaactcagag cacagggctc tgtgtgcatg gccgtcccct ccagagagga 1740
ggaagtaaat gccgggatta gtggaagatc atttccttct atttgccttg gcttacgtct 1800
ttcagaattc aaacacgtgc actgttgacc ctgcaatggt ggagtttttg gattttcctt 1860
cagtccgatt gctaaaatac ttccctctca tgtgagctgt tgtgaaagtc atcagccaga 1920
taccattcta aaaacaaaga atgtgcttct cgtatgttgc atgctggtta ctgaaatatt 1980
agggaattac ataaaggttt tctggggcac atattcaagc tgaatgataa aattgaaggt 2040
cacacaaagc taaggtcttt caaatcctga cccaattagc tctctgttag ctctctgact 2100
ttggacaagc tgtctggtcc tctgaagcat actttgttcg ccctgggtag gggccctctg 2160
ttttaacagc gtttggcatt aattaagacc tcgaagggga cttggggggt tcggggcttt 2220
cgggggcggt cgggggttcg cggacccggg aagctctgag gacccagagg ccgggcgcgc 2280
tccgcccgcg gcgccgcccc ctccgtaact ttcccagtct ccgagggaag aggcggggtg 2340
tggggtgcgg ttaaaaggcg ccacggcggg agacaggtgt tgcggccccg cagcgcccgc 2400
gcgctcctct ccccgactcg gagcccctcg gcggcgcccg gcccaggacc cgcctaggag 2460
cgcaggagcc ccagcgcaga gaccccaacg ccgagacccc cgccccggcc ccgccgcgct 2520
tcctcccgac gcagagcaaa ccgcccagag tagaagccat ggattggggc acactccaga 2580
gcatcctcgg gggtgtcaac aaacactcca ccagcattgg aaagatctgg ctcacggtcc 2640
tcttcatctt ccgcatcatg atcctcgtgg tggctgcaaa ggaggtgtgg ggagatgagc 2700
aagccgattt tgtctgcaac acgctccagc ctggctgcaa gaatgtatgc tacgaccacc 2760
acttccccat ctctcacatc cggctctggg ctctgcagct gatcatggtg tccacgccag 2820
ccctcctggt agctatgcat gtggcctacc ggagacatga aaagaaacgg aagttcatga 2880
agggagagat aaagaacgag tttaaggaca tcgaagagat caaaacccag aaggtccgta 2940
tcgaagggtc cctgtggtgg acctacacca ccagcatctt cttccgggtc atctttgaag 3000
ccgtcttcat gtacgtcttt tacatcatgt acaatggctt cttcatgcaa cgtctggtga 3060
aatgcaacgc ttggccctgc cccaatacag tggactgctt catttccagg cccacagaaa 3120
agactgtctt caccgtgttt atgatttctg tgtctggaat ttgcattctg ctaaatatca 3180
cagagctgtg ctatttgttc gttaggtatt gctcaggaaa gtccaaaaga ccagtctacc 3240
catacgatgt tccagattac gcttaaaggc gcgccacccc tgcagggaat tccgcattgc 3300
ccagttgtta gattaagaaa tagacagcat gagagggatg aggcaacccg tgctcagctg 3360
tcaaggctca gtcgctagca tttcccaaca caaagattct gaccttaaat gcaaccattt 3420
gaaacccctg taggcctcag gtgaaactcc agatgccaca atggagctct gctcccctaa 3480
agcctcaaaa caaaggccta attctatgcc tgtcttaatt ttctttcact taagttagtt 3540
ccactgagac cccaggctgt taggggttat tggtgtaagg tactttcata ttttaaacag 3600
aggatatcgg catttgtttc tttctctgag gacaagagaa aaaagccagg ttccacagag 3660
gacacagaga aggtttgggt gtcctcctgg ggttcttttt gccaactttc cccacgttaa 3720
aggtgaacat tggttctttc atttgctttg gaagttttaa tctctaacag tggacaaagt 3780
taccagtgcc ttaaactctg ttacactttt tggaagtgaa aactttgtag tatgataggt 3840
tattttgatg taaagatgtt ctggatacca ttatatgttc cccctgtttc agaggctcag 3900
attgtaatat gtaaatggta tgtcattcgc tactatgatt taatttgaaa tatggtcttt 3960
tggttatgaa tactttgcag cacagctgag aggctgtctg ttgtattcat tgtggtcata 4020
gcacctaaca acattgtagc ctcaatcgag tgagacagac tagaagttcc tagtgatggc 4080
ttatgatagc aaatggcctc atgtcaaata tttagatgta attttgtgta agaaatacag 4140
actggatgta ccaccaacta ctacctgtaa tgacaggcct gtccaacaca tctccctttt 4200
ccatgactgt ggtagccagc atcggaaaga acgctgattt aaagaggtcg cttgggaatt 4260
ttattgacac agtaccattt aatggggagg acaaaatggg gcaggggagg gagaagtttc 4320
tgtcgttaaa aacagatttg gaaagactgg actctaaagt ctgttgatta aagatgagct 4380
ttgtctactt caaaagtttg tttgcttacc ccttcagcct ccaatttttt aagtgaaaat 4440
atagctaata acatgtgaaa agaatagaag ctaaggttta gataaatatt gagcagatct 4500
ataggaagat tgaacctgaa tattgccatt atgcttgaca tggtttccaa aaaatggtac 4560
tccacatatt tcagtgaggg taagtatttt cctgttgtca agaatagcat tgtaaaagca 4620
ttttgtaata ataaagaata gctttaatga tatgcttgta actaaaataa ttttgtaatg 4680
tatcaaatac atttaaaaca ttaaaatata atctctataa taatttaaaa tctaatatgg 4740
ttttaataga acagcgatat caagcttatc gataatcaac ctctggatta caaaatttgt 4800
gaaagattga ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct 4860
ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat 4920
aaatcctggt tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg 4980
gtgtgcactg tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag 5040
ctcctttccg ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc 5100
tgccttgccc gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg 5160
tcggggaaat catcgtcctt tccttggctg ctcgcctatg ttgccacctg gattctgcgc 5220
gggacgtcct tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc 5280
ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc 5340
tccctttggg ccgcctcccc gcgaattcat cgataccgag cgctgctcga gagatctgtg 5400
atagcggcca tcaagctggc tgtgccttct agttgccagc catctgttgt ttgcccctcc 5460
cccgtgcctt ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag 5520
gaaattgcat cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag 5580
gacagcaagg gggaggattg ggaagacaat agcaggcatg ctggggacac gtgcggaccg 5640
agcggccgca ggaaccccta gtgatggagt tggccactcc ctctctgcgc gctcgctcgc 5700
tcactgaggc cgggcgacca aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag 5760
tgagcgagcg agcgcgcagc tgcctgcagg ggcgcctgat gcggtatttt ctccttacgc 5820
atctgtgcgg tatttcacac cgcatacgtc aaagcaacca tagtacgcgc cctgtagcgg 5880
cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg accgctacac ttgccagcgc 5940
cctagcgccc gctcctttcg ctttcttccc ttcctttctc gccacgttcg ccggctttcc 6000
ccgtcaagct ctaaatcggg ggctcccttt agggttccga tttagtgctt tacggcacct 6060
cgaccccaaa aaacttgatt tgggtgatgg ttcacgtagt gggccatcgc cctgatagac 6120
ggtttttcgc cctttgacgt tggagtccac gttctttaat agtggactct tgttccaaac 6180
tggaacaaca ctcaacccta tctcgggcta ttcttttgat ttataaggga ttttgccgat 6240
ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga attttaacaa 6300
aatattaacg tttacaattt tatggtgcac tctcagtaca atctgctctg atgccgcata 6360
gttaagccag ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct 6420
cccggcatcc gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt 6480
ttcaccgtca tcaccgaaac gcgcgagacg aaagggcctc gtgatacgcc tatttttata 6540
ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt 6600
gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag 6660
acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca 6720
tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc 6780
agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat 6840
cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc 6900
aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg 6960
gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc 7020
agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat 7080
aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga 7140
gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc 7200
ggagctgaat gaagccatac caaacgacga gcgtgacacc a 7241
<210> 79
<211> 7251
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 79
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tcgacctgaa cgattaaggc 1320
aaaacttcga aatgtgcccc agcagagatt tatttttcag ggggtgtttt gcattccagc 1380
ccctctgcct tcctggcgtt tagtgcgatt tgtttagcca tgtgctccct ggtgtgtgtt 1440
tttgaatgtg tgtgagatgg gttgtctctc gggacctggc aggtgcggcc accaggtcag 1500
ggctgccccc caaccctgtg cctccttcct cctagactct ggccccctca gtgctgaggg 1560
tgatacagag cacttttcaa gctggatttg gaatgtggcc tctcccctcc aaactcctgg 1620
agatcatgca aaggcctttg gagccagcca gtcacctgga aggtgacatt cccaccagct 1680
gaggcctcac cttcagcggg ggctgggcag ctttggagcc tggggccagc caagctcact 1740
ctgcccatat ccctgccacg tgtggcccag cggatgatca cctgtcttca tctgcgtact 1800
gggccacatc cctcctgccg tcccccactt ccctgatgac acctacagca agcccctacc 1860
caagtgttct gtgatcccct gtaaatgtgg cctccctagc tacttgcttt tatgaaacca 1920
acaatcctgg ggacacagtt ttcggctgtc tcaagacggg gcaaccactc ttttccccag 1980
gcctgtgggt cccaggcctg gagctagggt tggcattctt gcctgaattc tccactctat 2040
cccaacccct gaggccgcct gaggaggctc agactgtgtc aggctaggag gacagtcaaa 2100
ccacaaaaac atgcctttta agaagtataa gcacaaatcc ctctttgatg ttatataaaa 2160
gctcagtgtc actttaatta agacctcgaa ggggacttgg ggggttcggg gctttcgggg 2220
gcggtcgggg gttcgcggac ccgggaagct ctgaggaccc agaggccggg cgcgctccgc 2280
ccgcggcgcc gccccctccg taactttccc agtctccgag ggaagaggcg gggtgtgggg 2340
tgcggttaaa aggcgccacg gcgggagaca ggtgttgcgg ccccgcagcg cccgcgcgct 2400
cctctccccg actcggagcc cctcggcggc gcccggccca ggacccgcct aggagcgcag 2460
gagccccagc gcagagaccc caacgccgag acccccgccc cggccccgcc gcgcttcctc 2520
ccgacgcaga gcaaaccgcc cagagtagaa gccatggtga gcaagggcga ggagctgttc 2580
accggggtgg tgcccatcct ggtcgagctg gacggcgacg taaacggcca caagttcagc 2640
gtgtccggcg agggcgaggg cgatgccacc tacggcaagc tgaccctgaa gttcatctgc 2700
accaccggca agctgcccgt gccctggccc accctcgtga ccaccctgac ctacggcgtg 2760
cagtgcttca gccgctaccc cgaccacatg aagcagcacg acttcttcaa gtccgccatg 2820
cccgaaggct acgtccagga gcgcaccatc ttcttcaagg acgacggcaa ctacaagacc 2880
cgcgccgagg tgaagttcga gggcgacacc ctggtgaacc gcatcgagct gaagggcatc 2940
gacttcaagg aggacggcaa catcctgggg cacaagctgg agtacaacta caacagccac 3000
aacgtctata tcatggccga caagcagaag aacggcatca aggtgaactt caagatccgc 3060
cacaacatcg aggacggcag cgtgcagctc gccgaccact accagcagaa cacccccatc 3120
ggcgacggcc ccgtgctgct gcccgacaac cactacctga gcacccagtc cgccctgagc 3180
aaagacccca acgagaagcg cgatcacatg gtcctgctgg agttcgtgac cgccgccggg 3240
atcactctcg gcatggacga gctgtacaag taataaaggc gcgccacccc tgcagggaat 3300
tccgcattgc ccagttgtta gattaagaaa tagacagcat gagagggatg aggcaacccg 3360
tgctcagctg tcaaggctca gtcgctagca tttcccaaca caaagattct gaccttaaat 3420
gcaaccattt gaaacccctg taggcctcag gtgaaactcc agatgccaca atggagctct 3480
gctcccctaa agcctcaaaa caaaggccta attctatgcc tgtcttaatt ttctttcact 3540
taagttagtt ccactgagac cccaggctgt taggggttat tggtgtaagg tactttcata 3600
ttttaaacag aggatatcgg catttgtttc tttctctgag gacaagagaa aaaagccagg 3660
ttccacagag gacacagaga aggtttgggt gtcctcctgg ggttcttttt gccaactttc 3720
cccacgttaa aggtgaacat tggttctttc atttgctttg gaagttttaa tctctaacag 3780
tggacaaagt taccagtgcc ttaaactctg ttacactttt tggaagtgaa aactttgtag 3840
tatgataggt tattttgatg taaagatgtt ctggatacca ttatatgttc cccctgtttc 3900
agaggctcag attgtaatat gtaaatggta tgtcattcgc tactatgatt taatttgaaa 3960
tatggtcttt tggttatgaa tactttgcag cacagctgag aggctgtctg ttgtattcat 4020
tgtggtcata gcacctaaca acattgtagc ctcaatcgag tgagacagac tagaagttcc 4080
tagtgatggc ttatgatagc aaatggcctc atgtcaaata tttagatgta attttgtgta 4140
agaaatacag actggatgta ccaccaacta ctacctgtaa tgacaggcct gtccaacaca 4200
tctccctttt ccatgactgt ggtagccagc atcggaaaga acgctgattt aaagaggtcg 4260
cttgggaatt ttattgacac agtaccattt aatggggagg acaaaatggg gcaggggagg 4320
gagaagtttc tgtcgttaaa aacagatttg gaaagactgg actctaaagt ctgttgatta 4380
aagatgagct ttgtctactt caaaagtttg tttgcttacc ccttcagcct ccaatttttt 4440
aagtgaaaat atagctaata acatgtgaaa agaatagaag ctaaggttta gataaatatt 4500
gagcagatct ataggaagat tgaacctgaa tattgccatt atgcttgaca tggtttccaa 4560
aaaatggtac tccacatatt tcagtgaggg taagtatttt cctgttgtca agaatagcat 4620
tgtaaaagca ttttgtaata ataaagaata gctttaatga tatgcttgta actaaaataa 4680
ttttgtaatg tatcaaatac atttaaaaca ttaaaatata atctctataa taatttaaaa 4740
tctaatatgg ttttaataga acagcgatat caagcttatc gataatcaac ctctggatta 4800
caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta cgctatgtgg 4860
atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt tcattttctc 4920
ctccttgtat aaatcctggt tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca 4980
acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc actggttggg gcattgccac 5040
cacctgtcag ctcctttccg ggactttcgc tttccccctc cctattgcca cggcggaact 5100
catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca ctgacaattc 5160
cgtggtgttg tcggggaaat catcgtcctt tccttggctg ctcgcctatg ttgccacctg 5220
gattctgcgc gggacgtcct tctgctacgt cccttcggcc ctcaatccag cggaccttcc 5280
ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt cttcgccttc gccctcagac 5340
gagtcggatc tccctttggg ccgcctcccc gcgaattcat cgataccgag cgctgctcga 5400
gagatctgtg atagcggcca tcaagctggc tgtgccttct agttgccagc catctgttgt 5460
ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg tcctttccta 5520
ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc tggggggtgg 5580
ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg ctggggacac 5640
gtgcggaccg agcggccgca ggaaccccta gtgatggagt tggccactcc ctctctgcgc 5700
gctcgctcgc tcactgaggc cgggcgacca aaggtcgccc gacgcccggg ctttgcccgg 5760
gcggcctcag tgagcgagcg agcgcgcagc tgcctgcagg ggcgcctgat gcggtatttt 5820
ctccttacgc atctgtgcgg tatttcacac cgcatacgtc aaagcaacca tagtacgcgc 5880
cctgtagcgg cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg accgctacac 5940
ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc gccacgttcg 6000
ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga tttagtgctt 6060
tacggcacct cgaccccaaa aaacttgatt tgggtgatgg ttcacgtagt gggccatcgc 6120
cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat agtggactct 6180
tgttccaaac tggaacaaca ctcaacccta tctcgggcta ttcttttgat ttataaggga 6240
ttttgccgat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga 6300
attttaacaa aatattaacg tttacaattt tatggtgcac tctcagtaca atctgctctg 6360
atgccgcata gttaagccag ccccgacacc cgccaacacc cgctgacgcg ccctgacggg 6420
cttgtctgct cccggcatcc gcttacagac aagctgtgac cgtctccggg agctgcatgt 6480
gtcagaggtt ttcaccgtca tcaccgaaac gcgcgagacg aaagggcctc gtgatacgcc 6540
tatttttata ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc 6600
ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc 6660
cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga 6720
gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt 6780
ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag 6840
tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag 6900
aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta 6960
ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg 7020
agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca 7080
gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag 7140
gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc 7200
gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc a 7251
<210> 80
<211> 7209
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 80
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tcgacctgaa cgattaaggc 1320
aaaacttcga aatgtgcccc agcagagatt tatttttcag ggggtgtttt gcattccagc 1380
ccctctgcct tcctggcgtt tagtgcgatt tgtttagcca tgtgctccct ggtgtgtgtt 1440
tttgaatgtg tgtgagatgg gttgtctctc gggacctggc aggtgcggcc accaggtcag 1500
ggctgccccc caaccctgtg cctccttcct cctagactct ggccccctca gtgctgaggg 1560
tgatacagag cacttttcaa gctggatttg gaatgtggcc tctcccctcc aaactcctgg 1620
agatcatgca aaggcctttg gagccagcca gtcacctgga aggtgacatt cccaccagct 1680
gaggcctcac cttcagcggg ggctgggcag ctttggagcc tggggccagc caagctcact 1740
ctgcccatat ccctgccacg tgtggcccag cggatgatca cctgtcttca tctgcgtact 1800
gggccacatc cctcctgccg tcccccactt ccctgatgac acctacagca agcccctacc 1860
caagtgttct gtgatcccct gtaaatgtgg cctccctagc tacttgcttt tatgaaacca 1920
acaatcctgg ggacacagtt ttcggctgtc tcaagacggg gcaaccactc ttttccccag 1980
gcctgtgggt cccaggcctg gagctagggt tggcattctt gcctgaattc tccactctat 2040
cccaacccct gaggccgcct gaggaggctc agactgtgtc aggctaggag gacagtcaaa 2100
ccacaaaaac atgcctttta agaagtataa gcacaaatcc ctctttgatg ttatataaaa 2160
gctcagtgtc actttaatta agacctcgaa ggggacttgg ggggttcggg gctttcgggg 2220
gcggtcgggg gttcgcggac ccgggaagct ctgaggaccc agaggccggg cgcgctccgc 2280
ccgcggcgcc gccccctccg taactttccc agtctccgag ggaagaggcg gggtgtgggg 2340
tgcggttaaa aggcgccacg gcgggagaca ggtgttgcgg ccccgcagcg cccgcgcgct 2400
cctctccccg actcggagcc cctcggcggc gcccggccca ggacccgcct aggagcgcag 2460
gagccccagc gcagagaccc caacgccgag acccccgccc cggccccgcc gcgcttcctc 2520
ccgacgcaga gcaaaccgcc cagagtagaa gccatggatt ggggcacgct gcagacgatc 2580
ctggggggtg tgaacaaaca ctccaccagc attggaaaga tctggctcac cgtcctcttc 2640
atttttcgca ttatgatcct cgttgtggct gcaaaggagg tgtggggaga tgagcaggcc 2700
gactttgtct gcaacaccct gcagccaggc tgcaagaacg tgtgctacga tcactacttc 2760
cccatctccc acatccggct atgggccctg cagctgatct tcgtgtccac gccagcgctc 2820
ctagtggcca tgcacgtggc ctaccggaga catgagaaga agaggaagtt catcaagggg 2880
gagataaaga gtgaatttaa ggacatcgag gagatcaaaa cccagaaggt ccgcatcgaa 2940
ggctccctgt ggtggaccta cacaagcagc atcttcttcc gggtcatctt cgaagccgcc 3000
ttcatgtacg tcttctatgt catgtacgac ggcttctcca tgcagcggct ggtgaagtgc 3060
aacgcctggc cttgtcccaa cactgtggac tgctttgtgt cccggcccac ggagaagact 3120
gtcttcacag tgttcatgat tgcagtgtct ggaatttgca tcctgctgaa tgtcactgaa 3180
ttgtgttatt tgctaattag atattgttct gggaagtcaa aaaagccagt ttaaaggcgc 3240
gccacccctg cagggaattc cgcattgccc agttgttaga ttaagaaata gacagcatga 3300
gagggatgag gcaacccgtg ctcagctgtc aaggctcagt cgctagcatt tcccaacaca 3360
aagattctga ccttaaatgc aaccatttga aacccctgta ggcctcaggt gaaactccag 3420
atgccacaat ggagctctgc tcccctaaag cctcaaaaca aaggcctaat tctatgcctg 3480
tcttaatttt ctttcactta agttagttcc actgagaccc caggctgtta ggggttattg 3540
gtgtaaggta ctttcatatt ttaaacagag gatatcggca tttgtttctt tctctgagga 3600
caagagaaaa aagccaggtt ccacagagga cacagagaag gtttgggtgt cctcctgggg 3660
ttctttttgc caactttccc cacgttaaag gtgaacattg gttctttcat ttgctttgga 3720
agttttaatc tctaacagtg gacaaagtta ccagtgcctt aaactctgtt acactttttg 3780
gaagtgaaaa ctttgtagta tgataggtta ttttgatgta aagatgttct ggataccatt 3840
atatgttccc cctgtttcag aggctcagat tgtaatatgt aaatggtatg tcattcgcta 3900
ctatgattta atttgaaata tggtcttttg gttatgaata ctttgcagca cagctgagag 3960
gctgtctgtt gtattcattg tggtcatagc acctaacaac attgtagcct caatcgagtg 4020
agacagacta gaagttccta gtgatggctt atgatagcaa atggcctcat gtcaaatatt 4080
tagatgtaat tttgtgtaag aaatacagac tggatgtacc accaactact acctgtaatg 4140
acaggcctgt ccaacacatc tcccttttcc atgactgtgg tagccagcat cggaaagaac 4200
gctgatttaa agaggtcgct tgggaatttt attgacacag taccatttaa tggggaggac 4260
aaaatggggc aggggaggga gaagtttctg tcgttaaaaa cagatttgga aagactggac 4320
tctaaagtct gttgattaaa gatgagcttt gtctacttca aaagtttgtt tgcttacccc 4380
ttcagcctcc aattttttaa gtgaaaatat agctaataac atgtgaaaag aatagaagct 4440
aaggtttaga taaatattga gcagatctat aggaagattg aacctgaata ttgccattat 4500
gcttgacatg gtttccaaaa aatggtactc cacatatttc agtgagggta agtattttcc 4560
tgttgtcaag aatagcattg taaaagcatt ttgtaataat aaagaatagc tttaatgata 4620
tgcttgtaac taaaataatt ttgtaatgta tcaaatacat ttaaaacatt aaaatataat 4680
ctctataata atttaaaatc taatatggtt ttaatagaac agcgatatca agcttatcga 4740
taatcaacct ctggattaca aaatttgtga aagattgact ggtattctta actatgttgc 4800
tccttttacg ctatgtggat acgctgcttt aatgcctttg tatcatgcta ttgcttcccg 4860
tatggctttc attttctcct ccttgtataa atcctggttg ctgtctcttt atgaggagtt 4920
gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg tttgctgacg caacccccac 4980
tggttggggc attgccacca cctgtcagct cctttccggg actttcgctt tccccctccc 5040
tattgccacg gcggaactca tcgccgcctg ccttgcccgc tgctggacag gggctcggct 5100
gttgggcact gacaattccg tggtgttgtc ggggaaatca tcgtcctttc cttggctgct 5160
cgcctatgtt gccacctgga ttctgcgcgg gacgtccttc tgctacgtcc cttcggccct 5220
caatccagcg gaccttcctt cccgcggcct gctgccggct ctgcggcctc ttccgcgtct 5280
tcgccttcgc cctcagacga gtcggatctc cctttgggcc gcctccccgc gaattcatcg 5340
ataccgagcg ctgctcgaga gatctgtgat agcggccatc aagctggctg tgccttctag 5400
ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac 5460
tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga gtaggtgtca 5520
ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg aagacaatag 5580
caggcatgct ggggacacgt gcggaccgag cggccgcagg aacccctagt gatggagttg 5640
gccactccct ctctgcgcgc tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga 5700
cgcccgggct ttgcccgggc ggcctcagtg agcgagcgag cgcgcagctg cctgcagggg 5760
cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg catacgtcaa 5820
agcaaccata gtacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc 5880
gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt 5940
cctttctcgc cacgttcgcc ggctttcccc gtcaagctct aaatcggggg ctccctttag 6000
ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgatttg ggtgatggtt 6060
cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt 6120
tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc tcgggctatt 6180
cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt 6240
aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt tacaatttta tggtgcactc 6300
tcagtacaat ctgctctgat gccgcatagt taagccagcc ccgacacccg ccaacacccg 6360
ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg 6420
tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcgagacgaa 6480
agggcctcgt gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga 6540
cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa 6600
tacattcaaa tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt 6660
gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg 6720
cattttgcct tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag 6780
atcagttggg tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg 6840
agagttttcg ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg 6900
gcgcggtatt atcccgtatt gacgccgggc aagagcaact cggtcgccgc atacactatt 6960
ctcagaatga cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga 7020
cagtaagaga attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac 7080
ttctgacaac gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc 7140
atgtaactcg ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc 7200
gtgacacca 7209
<210> 81
<211> 7236
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 81
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tcgacctgaa cgattaaggc 1320
aaaacttcga aatgtgcccc agcagagatt tatttttcag ggggtgtttt gcattccagc 1380
ccctctgcct tcctggcgtt tagtgcgatt tgtttagcca tgtgctccct ggtgtgtgtt 1440
tttgaatgtg tgtgagatgg gttgtctctc gggacctggc aggtgcggcc accaggtcag 1500
ggctgccccc caaccctgtg cctccttcct cctagactct ggccccctca gtgctgaggg 1560
tgatacagag cacttttcaa gctggatttg gaatgtggcc tctcccctcc aaactcctgg 1620
agatcatgca aaggcctttg gagccagcca gtcacctgga aggtgacatt cccaccagct 1680
gaggcctcac cttcagcggg ggctgggcag ctttggagcc tggggccagc caagctcact 1740
ctgcccatat ccctgccacg tgtggcccag cggatgatca cctgtcttca tctgcgtact 1800
gggccacatc cctcctgccg tcccccactt ccctgatgac acctacagca agcccctacc 1860
caagtgttct gtgatcccct gtaaatgtgg cctccctagc tacttgcttt tatgaaacca 1920
acaatcctgg ggacacagtt ttcggctgtc tcaagacggg gcaaccactc ttttccccag 1980
gcctgtgggt cccaggcctg gagctagggt tggcattctt gcctgaattc tccactctat 2040
cccaacccct gaggccgcct gaggaggctc agactgtgtc aggctaggag gacagtcaaa 2100
ccacaaaaac atgcctttta agaagtataa gcacaaatcc ctctttgatg ttatataaaa 2160
gctcagtgtc actttaatta agacctcgaa ggggacttgg ggggttcggg gctttcgggg 2220
gcggtcgggg gttcgcggac ccgggaagct ctgaggaccc agaggccggg cgcgctccgc 2280
ccgcggcgcc gccccctccg taactttccc agtctccgag ggaagaggcg gggtgtgggg 2340
tgcggttaaa aggcgccacg gcgggagaca ggtgttgcgg ccccgcagcg cccgcgcgct 2400
cctctccccg actcggagcc cctcggcggc gcccggccca ggacccgcct aggagcgcag 2460
gagccccagc gcagagaccc caacgccgag acccccgccc cggccccgcc gcgcttcctc 2520
ccgacgcaga gcaaaccgcc cagagtagaa gccatggatt ggggcacact ccagagcatc 2580
ctcgggggtg tcaacaaaca ctccaccagc attggaaaga tctggctcac ggtcctcttc 2640
atcttccgca tcatgatcct cgtggtggct gcaaaggagg tgtggggaga tgagcaagcc 2700
gattttgtct gcaacacgct ccagcctggc tgcaagaatg tatgctacga ccaccacttc 2760
cccatctctc acatccggct ctgggctctg cagctgatca tggtgtccac gccagccctc 2820
ctggtagcta tgcatgtggc ctaccggaga catgaaaaga aacggaagtt catgaaggga 2880
gagataaaga acgagtttaa ggacatcgaa gagatcaaaa cccagaaggt ccgtatcgaa 2940
gggtccctgt ggtggaccta caccaccagc atcttcttcc gggtcatctt tgaagccgtc 3000
ttcatgtacg tcttttacat catgtacaat ggcttcttca tgcaacgtct ggtgaaatgc 3060
aacgcttggc cctgccccaa tacagtggac tgcttcattt ccaggcccac agaaaagact 3120
gtcttcaccg tgtttatgat ttctgtgtct ggaatttgca ttctgctaaa tatcacagag 3180
ctgtgctatt tgttcgttag gtattgctca ggaaagtcca aaagaccagt ctacccatac 3240
gatgttccag attacgctta aaggcgcgcc acccctgcag ggaattccgc attgcccagt 3300
tgttagatta agaaatagac agcatgagag ggatgaggca acccgtgctc agctgtcaag 3360
gctcagtcgc tagcatttcc caacacaaag attctgacct taaatgcaac catttgaaac 3420
ccctgtaggc ctcaggtgaa actccagatg ccacaatgga gctctgctcc cctaaagcct 3480
caaaacaaag gcctaattct atgcctgtct taattttctt tcacttaagt tagttccact 3540
gagaccccag gctgttaggg gttattggtg taaggtactt tcatatttta aacagaggat 3600
atcggcattt gtttctttct ctgaggacaa gagaaaaaag ccaggttcca cagaggacac 3660
agagaaggtt tgggtgtcct cctggggttc tttttgccaa ctttccccac gttaaaggtg 3720
aacattggtt ctttcatttg ctttggaagt tttaatctct aacagtggac aaagttacca 3780
gtgccttaaa ctctgttaca ctttttggaa gtgaaaactt tgtagtatga taggttattt 3840
tgatgtaaag atgttctgga taccattata tgttccccct gtttcagagg ctcagattgt 3900
aatatgtaaa tggtatgtca ttcgctacta tgatttaatt tgaaatatgg tcttttggtt 3960
atgaatactt tgcagcacag ctgagaggct gtctgttgta ttcattgtgg tcatagcacc 4020
taacaacatt gtagcctcaa tcgagtgaga cagactagaa gttcctagtg atggcttatg 4080
atagcaaatg gcctcatgtc aaatatttag atgtaatttt gtgtaagaaa tacagactgg 4140
atgtaccacc aactactacc tgtaatgaca ggcctgtcca acacatctcc cttttccatg 4200
actgtggtag ccagcatcgg aaagaacgct gatttaaaga ggtcgcttgg gaattttatt 4260
gacacagtac catttaatgg ggaggacaaa atggggcagg ggagggagaa gtttctgtcg 4320
ttaaaaacag atttggaaag actggactct aaagtctgtt gattaaagat gagctttgtc 4380
tacttcaaaa gtttgtttgc ttaccccttc agcctccaat tttttaagtg aaaatatagc 4440
taataacatg tgaaaagaat agaagctaag gtttagataa atattgagca gatctatagg 4500
aagattgaac ctgaatattg ccattatgct tgacatggtt tccaaaaaat ggtactccac 4560
atatttcagt gagggtaagt attttcctgt tgtcaagaat agcattgtaa aagcattttg 4620
taataataaa gaatagcttt aatgatatgc ttgtaactaa aataattttg taatgtatca 4680
aatacattta aaacattaaa atataatctc tataataatt taaaatctaa tatggtttta 4740
atagaacagc gatatcaagc ttatcgataa tcaacctctg gattacaaaa tttgtgaaag 4800
attgactggt attcttaact atgttgctcc ttttacgcta tgtggatacg ctgctttaat 4860
gcctttgtat catgctattg cttcccgtat ggctttcatt ttctcctcct tgtataaatc 4920
ctggttgctg tctctttatg aggagttgtg gcccgttgtc aggcaacgtg gcgtggtgtg 4980
cactgtgttt gctgacgcaa cccccactgg ttggggcatt gccaccacct gtcagctcct 5040
ttccgggact ttcgctttcc ccctccctat tgccacggcg gaactcatcg ccgcctgcct 5100
tgcccgctgc tggacagggg ctcggctgtt gggcactgac aattccgtgg tgttgtcggg 5160
gaaatcatcg tcctttcctt ggctgctcgc ctatgttgcc acctggattc tgcgcgggac 5220
gtccttctgc tacgtccctt cggccctcaa tccagcggac cttccttccc gcggcctgct 5280
gccggctctg cggcctcttc cgcgtcttcg ccttcgccct cagacgagtc ggatctccct 5340
ttgggccgcc tccccgcgaa ttcatcgata ccgagcgctg ctcgagagat ctgtgatagc 5400
ggccatcaag ctggctgtgc cttctagttg ccagccatct gttgtttgcc cctcccccgt 5460
gccttccttg accctggaag gtgccactcc cactgtcctt tcctaataaa atgaggaaat 5520
tgcatcgcat tgtctgagta ggtgtcattc tattctgggg ggtggggtgg ggcaggacag 5580
caagggggag gattgggaag acaatagcag gcatgctggg gacacgtgcg gaccgagcgg 5640
ccgcaggaac ccctagtgat ggagttggcc actccctctc tgcgcgctcg ctcgctcact 5700
gaggccgggc gaccaaaggt cgcccgacgc ccgggctttg cccgggcggc ctcagtgagc 5760
gagcgagcgc gcagctgcct gcaggggcgc ctgatgcggt attttctcct tacgcatctg 5820
tgcggtattt cacaccgcat acgtcaaagc aaccatagta cgcgccctgt agcggcgcat 5880
taagcgcggc gggtgtggtg gttacgcgca gcgtgaccgc tacacttgcc agcgccctag 5940
cgcccgctcc tttcgctttc ttcccttcct ttctcgccac gttcgccggc tttccccgtc 6000
aagctctaaa tcgggggctc cctttagggt tccgatttag tgctttacgg cacctcgacc 6060
ccaaaaaact tgatttgggt gatggttcac gtagtgggcc atcgccctga tagacggttt 6120
ttcgcccttt gacgttggag tccacgttct ttaatagtgg actcttgttc caaactggaa 6180
caacactcaa ccctatctcg ggctattctt ttgatttata agggattttg ccgatttcgg 6240
cctattggtt aaaaaatgag ctgatttaac aaaaatttaa cgcgaatttt aacaaaatat 6300
taacgtttac aattttatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa 6360
gccagccccg acacccgcca acacccgctg acgcgccctg acgggcttgt ctgctcccgg 6420
catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag aggttttcac 6480
cgtcatcacc gaaacgcgcg agacgaaagg gcctcgtgat acgcctattt ttataggtta 6540
atgtcatgat aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg 6600
gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat 6660
aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc 6720
gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa 6780
cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac 6840
tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga 6900
tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag 6960
agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca 7020
cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca 7080
tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa 7140
ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc 7200
tgaatgaagc cataccaaac gacgagcgtg acacca 7236
<210> 82
<211> 7018
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 82
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg ttctaggtag acaactaaga 1320
tgttcatctt atggtttaat gtttagttgt aaaggttgtt tgcttctcat ttggttccaa 1380
gaaagagtat ttaggccaat ttcagggaga aatatgtgta tagatatatt catatgtcaa 1440
actgattagt gctgaatgtc acatttccat attctaataa catttctagc aaagaagagg 1500
acacagtgaa gagagaattg cccgcattgt cattgtctct ttctgagcct agaacgccta 1560
acacttgggt gtggagagac tcagcctcaa ttcactttct agcagccact gagatgtgct 1620
tgcctggggt gccccctggc aggcagggct ggaactgctt tccagtaccc acacggactg 1680
tgaacgaatc tttctttgtg ctttgtgtac agaatggaag ttcaacaaat atttgttgaa 1740
tgtgtatgtc cttccaatac gcagcagccc agagcaaacg tggtaatctt gtgtgtgttc 1800
atgtgaaagc agaatttaat ggtgctttta agcaccaaag tttaagatgc acgagaaaac 1860
tgtatctcca ttttttcctt ttcgtttaca attacttgta taagccaggc acggtggtgg 1920
ctcacgcctg taatcccagc actttgggag gccgaggcgg gcggatcaca tgaggtcggg 1980
agttaattaa gacctcgaag gggacttggg gggttcgggg ctttcggggg cggtcggggg 2040
ttcgcggacc cgggaagctc tgaggaccca gaggccgggc gcgctccgcc cgcggcgccg 2100
ccccctccgt aactttccca gtctccgagg gaagaggcgg ggtgtggggt gcggttaaaa 2160
ggcgccacgg cgggagacag gtgttgcggc cccgcagcgc ccgcgcgctc ctctccccga 2220
ctcggagccc ctcggcggcg cccggcccag gacccgccta ggagcgcagg agccccagcg 2280
cagagacccc aacgccgaga cccccgcccc ggccccgccg cgcttcctcc cgacgcagag 2340
caaaccgccc agagtagaag ccatggattg gggcacgctg cagacgatcc tggggggtgt 2400
gaacaaacac tccaccagca ttggaaagat ctggctcacc gtcctcttca tttttcgcat 2460
tatgatcctc gttgtggctg caaaggaggt gtggggagat gagcaggccg actttgtctg 2520
caacaccctg cagccaggct gcaagaacgt gtgctacgat cactacttcc ccatctccca 2580
catccggcta tgggccctgc agctgatctt cgtgtccacg ccagcgctcc tagtggccat 2640
gcacgtggcc taccggagac atgagaagaa gaggaagttc atcaaggggg agataaagag 2700
tgaatttaag gacatcgagg agatcaaaac ccagaaggtc cgcatcgaag gctccctgtg 2760
gtggacctac acaagcagca tcttcttccg ggtcatcttc gaagccgcct tcatgtacgt 2820
cttctatgtc atgtacgacg gcttctccat gcagcggctg gtgaagtgca acgcctggcc 2880
ttgtcccaac actgtggact gctttgtgtc ccggcccacg gagaagactg tcttcacagt 2940
gttcatgatt gcagtgtctg gaatttgcat cctgctgaat gtcactgaat tgtgttattt 3000
gctaattaga tattgttctg ggaagtcaaa aaagccagtt taaaggcgcg ccacccctgc 3060
agggaattcc gcattgccca gttgttagat taagaaatag acagcatgag agggatgagg 3120
caacccgtgc tcagctgtca aggctcagtc gctagcattt cccaacacaa agattctgac 3180
cttaaatgca accatttgaa acccctgtag gcctcaggtg aaactccaga tgccacaatg 3240
gagctctgct cccctaaagc ctcaaaacaa aggcctaatt ctatgcctgt cttaattttc 3300
tttcacttaa gttagttcca ctgagacccc aggctgttag gggttattgg tgtaaggtac 3360
tttcatattt taaacagagg atatcggcat ttgtttcttt ctctgaggac aagagaaaaa 3420
agccaggttc cacagaggac acagagaagg tttgggtgtc ctcctggggt tctttttgcc 3480
aactttcccc acgttaaagg tgaacattgg ttctttcatt tgctttggaa gttttaatct 3540
ctaacagtgg acaaagttac cagtgcctta aactctgtta cactttttgg aagtgaaaac 3600
tttgtagtat gataggttat tttgatgtaa agatgttctg gataccatta tatgttcccc 3660
ctgtttcaga ggctcagatt gtaatatgta aatggtatgt cattcgctac tatgatttaa 3720
tttgaaatat ggtcttttgg ttatgaatac tttgcagcac agctgagagg ctgtctgttg 3780
tattcattgt ggtcatagca cctaacaaca ttgtagcctc aatcgagtga gacagactag 3840
aagttcctag tgatggctta tgatagcaaa tggcctcatg tcaaatattt agatgtaatt 3900
ttgtgtaaga aatacagact ggatgtacca ccaactacta cctgtaatga caggcctgtc 3960
caacacatct cccttttcca tgactgtggt agccagcatc ggaaagaacg ctgatttaaa 4020
gaggtcgctt gggaatttta ttgacacagt accatttaat ggggaggaca aaatggggca 4080
ggggagggag aagtttctgt cgttaaaaac agatttggaa agactggact ctaaagtctg 4140
ttgattaaag atgagctttg tctacttcaa aagtttgttt gcttacccct tcagcctcca 4200
attttttaag tgaaaatata gctaataaca tgtgaaaaga atagaagcta aggtttagat 4260
aaatattgag cagatctata ggaagattga acctgaatat tgccattatg cttgacatgg 4320
tttccaaaaa atggtactcc acatatttca gtgagggtaa gtattttcct gttgtcaaga 4380
atagcattgt aaaagcattt tgtaataata aagaatagct ttaatgatat gcttgtaact 4440
aaaataattt tgtaatgtat caaatacatt taaaacatta aaatataatc tctataataa 4500
tttaaaatct aatatggttt taatagaaca gcgatatcaa gcttatcgat aatcaacctc 4560
tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct ccttttacgc 4620
tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt atggctttca 4680
ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg tggcccgttg 4740
tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact ggttggggca 4800
ttgccaccac ctgtcagctc ctttccggga ctttcgcttt ccccctccct attgccacgg 4860
cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg ttgggcactg 4920
acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc gcctatgttg 4980
ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc aatccagcgg 5040
accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt cgccttcgcc 5100
ctcagacgag tcggatctcc ctttgggccg cctccccgcg aattcatcga taccgagcgc 5160
tgctcgagag atctgtgata gcggccatca agctggctgt gccttctagt tgccagccat 5220
ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc 5280
tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat tctattctgg 5340
ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc aggcatgctg 5400
gggacacgtg cggaccgagc ggccgcagga acccctagtg atggagttgg ccactccctc 5460
tctgcgcgct cgctcgctca ctgaggccgg gcgaccaaag gtcgcccgac gcccgggctt 5520
tgcccgggcg gcctcagtga gcgagcgagc gcgcagctgc ctgcaggggc gcctgatgcg 5580
gtattttctc cttacgcatc tgtgcggtat ttcacaccgc atacgtcaaa gcaaccatag 5640
tacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc 5700
gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc 5760
acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt 5820
agtgctttac ggcacctcga ccccaaaaaa cttgatttgg gtgatggttc acgtagtggg 5880
ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt 5940
ggactcttgt tccaaactgg aacaacactc aaccctatct cgggctattc ttttgattta 6000
taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt 6060
aacgcgaatt ttaacaaaat attaacgttt acaattttat ggtgcactct cagtacaatc 6120
tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc 6180
tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc 6240
tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg 6300
atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc 6360
acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat 6420
atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag 6480
agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt 6540
cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt 6600
gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga gagttttcgc 6660
cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta 6720
tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac 6780
ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa 6840
ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg 6900
atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc 6960
cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacacca 7018
<210> 83
<211> 7045
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 83
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagagggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg ttctaggtag acaactaaga 1320
tgttcatctt atggtttaat gtttagttgt aaaggttgtt tgcttctcat ttggttccaa 1380
gaaagagtat ttaggccaat ttcagggaga aatatgtgta tagatatatt catatgtcaa 1440
actgattagt gctgaatgtc acatttccat attctaataa catttctagc aaagaagagg 1500
acacagtgaa gagagaattg cccgcattgt cattgtctct ttctgagcct agaacgccta 1560
acacttgggt gtggagagac tcagcctcaa ttcactttct agcagccact gagatgtgct 1620
tgcctggggt gccccctggc aggcagggct ggaactgctt tccagtaccc acacggactg 1680
tgaacgaatc tttctttgtg ctttgtgtac agaatggaag ttcaacaaat atttgttgaa 1740
tgtgtatgtc cttccaatac gcagcagccc agagcaaacg tggtaatctt gtgtgtgttc 1800
atgtgaaagc agaatttaat ggtgctttta agcaccaaag tttaagatgc acgagaaaac 1860
tgtatctcca ttttttcctt ttcgtttaca attacttgta taagccaggc acggtggtgg 1920
ctcacgcctg taatcccagc actttgggag gccgaggcgg gcggatcaca tgaggtcggg 1980
agttaattaa gacctcgaag gggacttggg gggttcgggg ctttcggggg cggtcggggg 2040
ttcgcggacc cgggaagctc tgaggaccca gaggccgggc gcgctccgcc cgcggcgccg 2100
ccccctccgt aactttccca gtctccgagg gaagaggcgg ggtgtggggt gcggttaaaa 2160
ggcgccacgg cgggagacag gtgttgcggc cccgcagcgc ccgcgcgctc ctctccccga 2220
ctcggagccc ctcggcggcg cccggcccag gacccgccta ggagcgcagg agccccagcg 2280
cagagacccc aacgccgaga cccccgcccc ggccccgccg cgcttcctcc cgacgcagag 2340
caaaccgccc agagtagaag ccatggattg gggcacactc cagagcatcc tcgggggtgt 2400
caacaaacac tccaccagca ttggaaagat ctggctcacg gtcctcttca tcttccgcat 2460
catgatcctc gtggtggctg caaaggaggt gtggggagat gagcaagccg attttgtctg 2520
caacacgctc cagcctggct gcaagaatgt atgctacgac caccacttcc ccatctctca 2580
catccggctc tgggctctgc agctgatcat ggtgtccacg ccagccctcc tggtagctat 2640
gcatgtggcc taccggagac atgaaaagaa acggaagttc atgaagggag agataaagaa 2700
cgagtttaag gacatcgaag agatcaaaac ccagaaggtc cgtatcgaag ggtccctgtg 2760
gtggacctac accaccagca tcttcttccg ggtcatcttt gaagccgtct tcatgtacgt 2820
cttttacatc atgtacaatg gcttcttcat gcaacgtctg gtgaaatgca acgcttggcc 2880
ctgccccaat acagtggact gcttcatttc caggcccaca gaaaagactg tcttcaccgt 2940
gtttatgatt tctgtgtctg gaatttgcat tctgctaaat atcacagagc tgtgctattt 3000
gttcgttagg tattgctcag gaaagtccaa aagaccagtc tacccatacg atgttccaga 3060
ttacgcttaa aggcgcgcca cccctgcagg gaattccgca ttgcccagtt gttagattaa 3120
gaaatagaca gcatgagagg gatgaggcaa cccgtgctca gctgtcaagg ctcagtcgct 3180
agcatttccc aacacaaaga ttctgacctt aaatgcaacc atttgaaacc cctgtaggcc 3240
tcaggtgaaa ctccagatgc cacaatggag ctctgctccc ctaaagcctc aaaacaaagg 3300
cctaattcta tgcctgtctt aattttcttt cacttaagtt agttccactg agaccccagg 3360
ctgttagggg ttattggtgt aaggtacttt catattttaa acagaggata tcggcatttg 3420
tttctttctc tgaggacaag agaaaaaagc caggttccac agaggacaca gagaaggttt 3480
gggtgtcctc ctggggttct ttttgccaac tttccccacg ttaaaggtga acattggttc 3540
tttcatttgc tttggaagtt ttaatctcta acagtggaca aagttaccag tgccttaaac 3600
tctgttacac tttttggaag tgaaaacttt gtagtatgat aggttatttt gatgtaaaga 3660
tgttctggat accattatat gttccccctg tttcagaggc tcagattgta atatgtaaat 3720
ggtatgtcat tcgctactat gatttaattt gaaatatggt cttttggtta tgaatacttt 3780
gcagcacagc tgagaggctg tctgttgtat tcattgtggt catagcacct aacaacattg 3840
tagcctcaat cgagtgagac agactagaag ttcctagtga tggcttatga tagcaaatgg 3900
cctcatgtca aatatttaga tgtaattttg tgtaagaaat acagactgga tgtaccacca 3960
actactacct gtaatgacag gcctgtccaa cacatctccc ttttccatga ctgtggtagc 4020
cagcatcgga aagaacgctg atttaaagag gtcgcttggg aattttattg acacagtacc 4080
atttaatggg gaggacaaaa tggggcaggg gagggagaag tttctgtcgt taaaaacaga 4140
tttggaaaga ctggactcta aagtctgttg attaaagatg agctttgtct acttcaaaag 4200
tttgtttgct taccccttca gcctccaatt ttttaagtga aaatatagct aataacatgt 4260
gaaaagaata gaagctaagg tttagataaa tattgagcag atctatagga agattgaacc 4320
tgaatattgc cattatgctt gacatggttt ccaaaaaatg gtactccaca tatttcagtg 4380
agggtaagta ttttcctgtt gtcaagaata gcattgtaaa agcattttgt aataataaag 4440
aatagcttta atgatatgct tgtaactaaa ataattttgt aatgtatcaa atacatttaa 4500
aacattaaaa tataatctct ataataattt aaaatctaat atggttttaa tagaacagcg 4560
atatcaagct tatcgataat caacctctgg attacaaaat ttgtgaaaga ttgactggta 4620
ttcttaacta tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc 4680
atgctattgc ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttgctgt 4740
ctctttatga ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc actgtgtttg 4800
ctgacgcaac ccccactggt tggggcattg ccaccacctg tcagctcctt tccgggactt 4860
tcgctttccc cctccctatt gccacggcgg aactcatcgc cgcctgcctt gcccgctgct 4920
ggacaggggc tcggctgttg ggcactgaca attccgtggt gttgtcgggg aaatcatcgt 4980
cctttccttg gctgctcgcc tatgttgcca cctggattct gcgcgggacg tccttctgct 5040
acgtcccttc ggccctcaat ccagcggacc ttccttcccg cggcctgctg ccggctctgc 5100
ggcctcttcc gcgtcttcgc cttcgccctc agacgagtcg gatctccctt tgggccgcct 5160
ccccgcgaat tcatcgatac cgagcgctgc tcgagagatc tgtgatagcg gccatcaagc 5220
tggctgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg ccttccttga 5280
ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt gcatcgcatt 5340
gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc aagggggagg 5400
attgggaaga caatagcagg catgctgggg acacgtgcgg accgagcggc cgcaggaacc 5460
cctagtgatg gagttggcca ctccctctct gcgcgctcgc tcgctcactg aggccgggcg 5520
accaaaggtc gcccgacgcc cgggctttgc ccgggcggcc tcagtgagcg agcgagcgcg 5580
cagctgcctg caggggcgcc tgatgcggta ttttctcctt acgcatctgt gcggtatttc 5640
acaccgcata cgtcaaagca accatagtac gcgccctgta gcggcgcatt aagcgcggcg 5700
ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct 5760
ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca agctctaaat 5820
cgggggctcc ctttagggtt ccgatttagt gctttacggc acctcgaccc caaaaaactt 5880
gatttgggtg atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg 5940
acgttggagt ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaac 6000
cctatctcgg gctattcttt tgatttataa gggattttgc cgatttcggc ctattggtta 6060
aaaaatgagc tgatttaaca aaaatttaac gcgaatttta acaaaatatt aacgtttaca 6120
attttatggt gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga 6180
cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac 6240
agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg 6300
aaacgcgcga gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata 6360
ataatggttt cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt 6420
tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa 6480
atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt 6540
attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa 6600
gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac 6660
agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt 6720
aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt 6780
cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat 6840
cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac 6900
actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg 6960
cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc 7020
ataccaaacg acgagcgtga cacca 7045
<210> 84
<211> 700
<212> DNA
<213> Cynomolgus macaque
<400> 84
atggcaccag cttttgaaaa aagaaaacct ttttgctggt agtctggcaa ggagacagaa 60
aaaaaccact cacatctgcc tccccaggct gggggctggg ccggatttta taaggatagg 120
gtaatgaggg gtggtctgtt tggatcttgc aatgaggtgc tgctgggagg tgtgatctga 180
ttggatcctg ccatggagtg atgccaaagc tccatctgat tggatcctgg atcctgccgt 240
gtgtgctctg cttcttaatg caacccctgc tcctcagtct gagcccttag attctgccca 300
cggttgcacg cttggttcac tttggcatgc tcaggttaca tgaccttcag cttggggtcc 360
atggcaactg aaaagcaact cacaacttcc tttcataaaa attgaacctg actggtctgg 420
tgcagtcaca ccagctctat cccattgatg acaggaccgc atcatgggga ttagagcaga 480
gaggtcatag taactagcat tttcaagagg gcaccctgat gtctggatga acttcagggc 540
aacaaaatag cgggcaggtg agcagttgaa gacacccaga cactgggcct gaccaaggtg 600
gggtggtggg gatggcacag gaggacacag gatgggaatt aatgagggca ggggctttgt 660
cttgctcact gataagtcca tggcacatag agggtgatcg 700
<210> 85
<211> 700
<212> DNA
<213> Cynomolgus macaque
<400> 85
cgatcaccct ctatgtgcca tggacttatc agtgagcaag acaaagcccc tgccctcatt 60
aattcccatc ctgtgtcctc ctgtgccatc cccaccaccc caccttggtc aggcccagtg 120
tctgggtgtc ttcaactgct cacctgcccg ctattttgtt gccctgaagt tcatccagac 180
atcagggtgc cctcttgaaa atgctagtta ctatgacctc tctgctctaa tccccatgat 240
gcggtcctgt catcaatggg atagagctgg tgtgactgca ccagaccagt caggttcaat 300
ttttatgaaa ggaagttgtg agttgctttt cagttgccat ggaccccaag ctgaaggtca 360
tgtaacctga gcatgccaaa gtgaaccaag cgtgcaaccg tgggcagaat ctaagggctc 420
agactgagga gcaggggttg cattaagaag cagagcacac acggcaggat ccaggatcca 480
atcagatgga gctttggcat cactccatgg caggatccaa tcagatcaca cctcccagca 540
gcacctcatt gcaagatcca aacagaccac ccctcattac cctatcctta taaaatccgg 600
cccagccccc agcctgggga ggcagatgtg agtggttttt ttctgtctcc ttgccagact 660
accagcaaaa aggttttctt ttttcaaaag ctggtgccat 700
<210> 86
<211> 700
<212> DNA
<213> Cynomolgus macaque
<400> 86
ccgttaggaa aagaaaaaca gaaggaattg tgttctctgg agggcagggc tctgagtact 60
gagtctcatg ttttcaaagt cggaaagtgt ccacagttaa tatttggatg ggcccacagt 120
gcccgtcttg ctcgccggag cccaggcctg tcccatcaca gacaaagggc tcttgctgtg 180
cacctgtgga gaggggagct tggctgggga aggcagggtc agcctctttg tgctcttttt 240
gtttgaagca gagttttgca aagggagtgg ctctggaaga aaagcagagc gtggagtgtc 300
agaggccggc gtgttgtgaa atgcataagc cctggagacc ctctgtaact ggccttcaca 360
cacgcccgcc gccaaggaca acactgaacc acggaagcgg ggtgtttgcc agctcacgag 420
acggggagac atgaagcttc taccagcaga ggagctggag gggaaacaga aagaaagaac 480
tgagtctagc agcctccttg gacatttctt ccaacgcctc cagcccagca caacaaacaa 540
cctcagggca tccggcccgt gtcgcgccct ggcacaccca actctgccct gctccaagag 600
cccacagagg gcctcggggt cacactcaag gagcatgctt ggaatccaaa gtgcatgctg 660
tggtggggag atggacaagg acagaaatag cacccagcaa 700
<210> 87
<211> 700
<212> DNA
<213> Cynomolgus macaque
<400> 87
ttgctgggtg ctatttctgt ccttgtccat ctccccacca cagcatgcac tttggattcc 60
aagcatgctc cttgagtgtg accccgaggc cctctgtggg ctcttggagc agggcagagt 120
tgggtgtgcc agggcgcgac acgggccgga tgccctgagg ttgtttgttg tgctgggctg 180
gaggcgttgg aagaaatgtc caaggaggct gctagactca gttctttctt tctgtttccc 240
ctccagctcc tctgctggta gaagcttcat gtctccccgt ctcgtgagct ggcaaacacc 300
ccgcttccgt ggttcagtgt tgtccttggc ggcgggcgtg tgtgaaggcc agttacagag 360
ggtctccagg gcttatgcat ttcacaacac gccggcctct gacactccac gctctgcttt 420
tcttccagag ccactccctt tgcaaaactc tgcttcaaac aaaaagagca caaagaggct 480
gaccctgcct tccccagcca agctcccctc tccacaggtg cacagcaaga gccctttgtc 540
tgtgatggga caggcctggg ctccggcgag caagacgggc actgtgggcc catccaaata 600
ttaactgtgg acactttccg actttgaaaa catgagactc agtactcaga gccctgccct 660
ccagagaaca caattccttc tgtttttctt ttcctaacgg 700
<210> 88
<211> 510
<212> DNA
<213> Cynomolgus macaque
<400> 88
aaaaaagaat cacaattgcc accaaggctc tatgttttcg caaaagtcca gcatttaaaa 60
gaaacttcct gcatggccta catctgctga ttggtaattt gtcgttcagg ttaaaaacaa 120
aacaagcggg cattgttgtg atatcatcct tgataacatc ccaagaaaac tctagagctg 180
gcaagagagg aaagcagata atggtcaaag ctgtcatctg agttttaaaa acactgtgat 240
ttttctttta aaggaacatc ttcagtttcc aaggccatac acacggctcc taactgcagc 300
ttaaaatttt ccactgggct cccttctgag aacaaacgct attcagtggc gagtgccgga 360
caccactgcg ctttcaaagg tggctgccag aggacactca ggacttcaca gcagccggta 420
agccagactg gggtcagtca ctcccccatc agaattattt tgtttctcct ttgcttagga 480
aaggaaggat tcctcagatt ggcatcccag 510
<210> 89
<211> 510
<212> DNA
<213> Cynomolgus macaque
<400> 89
ctgggatgcc aatctgagga atccttcctt tcctaagcaa aggagaaaca aaataattct 60
gatgggggag tgactgaccc cagtctggct taccggctgc tgtgaagtcc tgagtgtcct 120
ctggcagcca cctttgaaag cgcagtggtg tccggcactc gccactgaat agcgtttgtt 180
ctcagaaggg agcccagtgg aaaattttaa gctgcagtta ggagccgtgt gtatggcctt 240
ggaaactgaa gatgttcctt taaaagaaaa atcacagtgt ttttaaaact cagatgacag 300
ctttgaccat tatctgcttt cctctcttgc cagctctaga gttttcttgg gatgttatca 360
aggatgatat cacaacaatg cccgcttgtt ttgtttttaa cctgaacgac aaattaccaa 420
tcagcagatg taggccatgc aggaagtttc ttttaaatgc tggacttttg cgaaaacata 480
gagccttggt ggcaattgtg attctttttt 510
<210> 90
<211> 643
<212> DNA
<213> Cynomolgus macaque
<400> 90
ataatgagca acataaggtt aaaataacat tgcaacccca tggaagcaag agaaatggaa 60
attattaata aatggaccac atgtaaggga atgctgtggt tctattgtag agattacaga 120
gagcaattta ggagagccag gcgctggggg caagagggaa atgaaacgaa aaccgaaggg 180
atttgttcag gaagaaaaat gaaaacagat aaaaggtgtt catttcaaag cttccctctt 240
tcccagcatt tttctgaagt agagtttgaa aggaaagcaa aataactgca aaccaataca 300
gtggcacgag ttcactgacg cagagctagg aacgacgtcc agagatctcc agccccgcct 360
cccgttctgg gtcacctggc tccttgacag ccctgaaaac tgcctgtgca aatctccagg 420
catgttatac ccatgagcgg ggacgtgtgg caccgacaaa gggacctgta cacctttgaa 480
gtatcctggg agaccagact cacattccac acacgctcac gagtcactga gcagccccat 540
tggaaatacg tggcaccgtc tcattccata tttgaccaaa accagtgttt acccagctca 600
gccgatagtt tcattttttt aaccaaacct aatgcagaat ggc 643
<210> 91
<211> 643
<212> DNA
<213> Cynomolgus macaque
<400> 91
gccattctgc attaggtttg gttaaaaaaa tgaaactatc ggctgagctg ggtaaacact 60
ggttttggtc aaatatggaa tgagacggtg ccacgtattt ccaatggggc tgctcagtga 120
ctcgtgagcg tgtgtggaat gtgagtctgg tctcccagga tacttcaaag gtgtacaggt 180
ccctttgtcg gtgccacacg tccccgctca tgggtataac atgcctggag atttgcacag 240
gcagttttca gggctgtcaa ggagccaggt gacccagaac gggaggcggg gctggagatc 300
tctggacgtc gttcctagct ctgcgtcagt gaactcgtgc cactgtattg gtttgcagtt 360
attttgcttt cctttcaaac tctacttcag aaaaatgctg ggaaagaggg aagctttgaa 420
atgaacacct tttatctgtt ttcatttttc ttcctgaaca aatcccttcg gttttcgttt 480
catttccctc ttgcccccag cgcctggctc tcctaaattg ctctctgtaa tctctacaat 540
agaaccacag cattccctta catgtggtcc atttattaat aatttccatt tctcttgctt 600
ccatggggtt gcaatgttat tttaacctta tgttgctcat tat 643
<210> 92
<211> 542
<212> DNA
<213> Cynomolgus macaque
<400> 92
cacgtcttgt aattttttta ctgaatgtta gacattgcat ataaaagact atccaggagt 60
gttttgtttt tgttttttct agtgagtgca agtcccttgc tctctgccag ttggctggaa 120
tgagaatctg atcagatttc atcaagagtc aggttgagct gagactgagc ggtagtgttc 180
actaaattga gtgcaccact gatatctaat ggaaacaagg acattttact ttgctcctca 240
gcctaacctg aatttcctat gccaccactg tataatggct ggtttctttg gttctcctaa 300
tgtgtgagct ggaagcaggt tgagacatag atttcatatc attttggctt cccttgcatc 360
taacatggct ccacaattca agcactatga aattgtttaa ctgttttcca gtcttgcctc 420
cacagccact tttgcagtaa aatcacggat gggggtgacg ttgagccaaa ctatttttgc 480
atttggtgga cttctaaatt ccaatccagc tccaaatctt ttggcagatt tttcttaaag 540
gt 542
<210> 93
<211> 542
<212> DNA
<213> Cynomolgus macaque
<400> 93
acctttaaga aaaatctgcc aaaagatttg gagctggatt ggaatttaga agtccaccaa 60
atgcaaaaat agtttggctc aacgtcaccc ccatccgtga ttttactgca aaagtggctg 120
tggaggcaag actggaaaac agttaaacaa tttcatagtg cttgaattgt ggagccatgt 180
tagatgcaag ggaagccaaa atgatatgaa atctatgtct caacctgctt ccagctcaca 240
cattaggaga accaaagaaa ccagccatta tacagtggtg gcataggaaa ttcaggttag 300
gctgaggagc aaagtaaaat gtccttgttt ccattagata tcagtggtgc actcaattta 360
gtgaacacta ccgctcagtc tcagctcaac ctgactcttg atgaaatctg atcagattct 420
cattccagcc aactggcaga gagcaaggga cttgcactca ctagaaaaaa caaaaacaaa 480
acactcctgg atagtctttt atatgcaatg tctaacattc agtaaaaaaa ttacaagacg 540
tg 542
<210> 94
<211> 523
<212> DNA
<213> Cynomolgus macaque
<400> 94
cggcagagac ctacagacca aagtacattt cacactggat ccaggacaca catcagtctg 60
aaagcacaca catgaaccaa acgtttccta aagcattact tacccttgct aatagcaaca 120
cattctcata ttcttttata cttcatttaa tttcatttaa aaaagaaaaa gataggaaag 180
aaatctattt ctccgcccat taataaggtc agacgcagca acgctagact agaagaaaag 240
tttacctact gatttttctc ccacctcctg agtgcgcaca gctttccgac aagtgtcagt 300
gccatctact gtgcgctctg ggtactgcaa tagccttttt tttttttttt ttttttttta 360
gaatgagact aaatgagaga acacaaagaa cttctttccc cacagtggag atggctctga 420
aagcgtttaa ggaatggctt agatgagtgg ctaacacatt atcccagttc tgaattctaa 480
gaccacagac tccatgtccg atccccaaag agaggctttg caa 523
<210> 95
<211> 523
<212> DNA
<213> Cynomolgus macaque
<400> 95
ttgcaaagcc tctctttggg gatcggacat ggagtctgtg gtcttagaat tcagaactgg 60
gataatgtgt tagccactca tctaagccat tccttaaacg ctttcagagc catctccact 120
gtggggaaag aagttctttg tgttctctca tttagtctca ttctaaaaaa aaaaaaaaaa 180
aaaaaaaagg ctattgcagt acccagagcg cacagtagat ggcactgaca cttgtcggaa 240
agctgtgcgc actcaggagg tgggagaaaa atcagtaggt aaacttttct tctagtctag 300
cgttgctgcg tctgacctta ttaatgggcg gagaaataga tttctttcct atctttttct 360
tttttaaatg aaattaaatg aagtataaaa gaatatgaga atgtgttgct attagcaagg 420
gtaagtaatg ctttaggaaa cgtttggttc atgtgtgtgc tttcagactg atgtgtgtcc 480
tggatccagt gtgaaatgta ctttggtctg taggtctctg ccg 523
<210> 96
<211> 579
<212> DNA
<213> Cynomolgus macaque
<400> 96
ggtgtgtata tcaggtggtt actttacaaa acaggatgtg gcaagctgga cctgatagac 60
acatcaaagc ctctgaacag agttcagggc atgaaatggt ttcttttggg ggtcttcagg 120
aacaatttca tgaaagctaa atcatgaaag atagcagact tttgccagga aaaaaaaaca 180
aaacaaaacg agactagtga ttagtttggc gttttcggtt tctttgagaa gcgaaataac 240
ttatcaagga ctctttgtgc cgcttgatgt tctaatcggt tgatgggtct ctcagaagcc 300
ctttctgcaa actagaacct gcagggatgt gcaaagcctc tctctgctgc catctgctgt 360
cttacaagag gtcactgcga gaggctgaat cccccaatgc cttggggatt cccactgcag 420
ggcaggggcg ccagcctgtg ttacaaccac ctgaacggcc acgtggacct tccacaaaag 480
tgtcactgtt tccattgctc tggtgtttgt attcccaatg tgtagtcttt gttagggcac 540
tcacaaaaag ttaaaaacaa aaattcacac aagcataca 579
<210> 97
<211> 579
<212> DNA
<213> Cynomolgus macaque
<400> 97
tgtatgcttg tgtgaatttt tgtttttaac tttttgtgag tgccctaaca aagactacac 60
attgggaata caaacaccag agcaatggaa acagtgacac ttttgtggaa ggtccacgtg 120
gccgttcagg tggttgtaac acaggctggc gcccctgccc tgcagtggga atccccaagg 180
cattggggga ttcagcctct cgcagtgacc tcttgtaaga cagcagatgg cagcagagag 240
aggctttgca catccctgca ggttctagtt tgcagaaagg gcttctgaga gacccatcaa 300
ccgattagaa catcaagcgg cacaaagagt ccttgataag ttatttcgct tctcaaagaa 360
accgaaaacg ccaaactaat cactagtctc gttttgtttt gttttttttt cctggcaaaa 420
gtctgctatc tttcatgatt tagctttcat gaaattgttc ctgaagaccc ccaaaagaaa 480
ccatttcatg ccctgaactc tgttcagagg ctttgatgtg tctatcaggt ccagcttgcc 540
acatcctgtt ttgtaaagta accacctgat atacacacc 579
<210> 98
<211> 700
<212> DNA
<213> Cynomolgus macaque
<400> 98
ggtcaggatt tgaaagacct tagctttgtg tgaccttcag ttttatcatt cagtttgaat 60
atgtgcccca gaaaaccttt atgtaatttc ctaatatttc agtaacatat ttcacaacat 120
acaagcagca cattctcttt ttttagaatg gtgtctcgct gatgactttg acgacagctc 180
acgtgagagg gaagtatttc agcaatcaga ccgaaggaga atccaaaaac cccactattg 240
cggggtcaag agtgcacgtg tttgaattct gaaagatgta agccaaggca aacagaagga 300
aatgatcttc cactaatccc tgcatttact tcctcctctc tggaggggac ggccacacac 360
acagagccct gtgctctgac ttctcctgaa ggggacacag ctgggctcac tcagtgtcac 420
ctcgcccctg gggtgtgccc gggtttcaga tctcaggctg gagtgattca cgtgtagcag 480
ggaggccgtc attaatgaaa atgcaggggc gtcgcgggag tgttgatgat tcagcaggcc 540
tttctacttc tctatgagtc agtacccgtc gcagccaagc ctggggcaga acaggttttc 600
ttaaaagagc atgggggcct cgtcttcaac aaccaattag gaggcagaaa agtcctcagt 660
gaggaaggaa taatgacatg ttggagctaa gatgatggtg 700
<210> 99
<211> 700
<212> DNA
<213> Cynomolgus macaque
<400> 99
caccatcatc ttagctccaa catgtcatta ttccttcctc actgaggact tttctgcctc 60
ctaattggtt gttgaagacg aggcccccat gctcttttaa gaaaacctgt tctgccccag 120
gcttggctgc gacgggtact gactcataga gaagtagaaa ggcctgctga atcatcaaca 180
ctcccgcgac gcccctgcat tttcattaat gacggcctcc ctgctacacg tgaatcactc 240
cagcctgaga tctgaaaccc gggcacaccc caggggcgag gtgacactga gtgagcccag 300
ctgtgtcccc ttcaggagaa gtcagagcac agggctctgt gtgtgtggcc gtcccctcca 360
gagaggagga agtaaatgca gggattagtg gaagatcatt tccttctgtt tgccttggct 420
tacatctttc agaattcaaa cacgtgcact cttgaccccg caatagtggg gtttttggat 480
tctccttcgg tctgattgct gaaatacttc cctctcacgt gagctgtcgt caaagtcatc 540
agcgagacac cattctaaaa aaagagaatg tgctgcttgt atgttgtgaa atatgttact 600
gaaatattag gaaattacat aaaggttttc tggggcacat attcaaactg aatgataaaa 660
ctgaaggtca cacaaagcta aggtctttca aatcctgacc 700
<210> 100
<211> 532
<212> DNA
<213> Cynomolgus macaque
<400> 100
gttttttcat gcatcttaaa ctttggtgct taaagaaaag caccattaaa tcctgctctc 60
acacgaacac acacaagatt accacgtttg ctctgggctg ccgcgtatag gaaggacata 120
tacattcaat aaatatttgt tgaacttcca ttctgtacac aaagcacaaa gaaagattcg 180
ttcacagtcc gcgtgggtac aggaaagcag ttccagccct gcctgccagg gggcacccca 240
ggcaagcaca tctcagtggc tgcaagaaag tcagcgagtt gaggctgagt ctctctctat 300
acccaagtgt taggtgttct aggctcaaag agagacaatg acaatgcggg caattctctc 360
ttcactgtgt ccctttcttt gctagaaatg ttattagaat gtggaaatgt gacccgtcga 420
ttgagaattc agcactaatc agtttgacat atgagtatat ctacatagac acatatttct 480
ccctgaaatt gtcctaaaca ctgtcttcct tgaaaccaaa tgagaaggaa ac 532
<210> 101
<211> 532
<212> DNA
<213> Cynomolgus macaque
<400> 101
gtttccttct catttggttt caaggaagac agtgtttagg acaatttcag ggagaaatat 60
gtgtctatgt agatatactc atatgtcaaa ctgattagtg ctgaattctc aatcgacggg 120
tcacatttcc acattctaat aacatttcta gcaaagaaag ggacacagtg aagagagaat 180
tgcccgcatt gtcattgtct ctctttgagc ctagaacacc taacacttgg gtatagagag 240
agactcagcc tcaactcgct gactttcttg cagccactga gatgtgcttg cctggggtgc 300
cccctggcag gcagggctgg aactgctttc ctgtacccac gcggactgtg aacgaatctt 360
tctttgtgct ttgtgtacag aatggaagtt caacaaatat ttattgaatg tatatgtcct 420
tcctatacgc ggcagcccag agcaaacgtg gtaatcttgt gtgtgttcgt gtgagagcag 480
gatttaatgg tgcttttctt taagcaccaa agtttaagat gcatgaaaaa ac 532
<210> 102
<211> 120
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 102
gacctcgaag gggacttggg gggttcgggg ctttcggggg cggtcggggg ttcgcggacc 60
cgggaagctc tgaggaccca gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt 120
<210> 103
<211> 228
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 103
aactttccca gtctccgagg gaagaggcgg ggtgtggggt gcggttaaaa ggcgccacgg 60
cgggagacag gtgttgcggc cccgcagcgc ccgcgcgctc ctctccccga ctcggagccc 120
ctcggcggcg cccggcccag gacccgccta ggagcgcagg agccccagcg cagagacccc 180
aacgccgaga cccccgcccc ggccccgccg cgcttcctcc cgacgcag 228
<210> 104
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 104
agcaaaccgc ccagagtaga ag 22
<210> 105
<211> 370
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 105
gacctcgaag gggacttggg gggttcgggg ctttcggggg cggtcggggg ttcgcggacc 60
cgggaagctc tgaggaccca gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt 120
aactttccca gtctccgagg gaagaggcgg ggtgtggggt gcggttaaaa ggcgccacgg 180
cgggagacag gtgttgcggc cccgcagcgc ccgcgcgctc ctctccccga ctcggagccc 240
ctcggcggcg cccggcccag gacccgccta ggagcgcagg agccccagcg cagagacccc 300
aacgccgaga cccccgcccc ggccccgccg cgcttcctcc cgacgcagag caaaccgccc 360
agagtagaag 370
<210> 106
<211> 130
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 106
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt 60
ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120
aggggttcct 130
<210> 107
<211> 130
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 107
aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg 60
ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc 120
gagcgcgcag 130
<210> 108
<211> 602
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 108
gataatcaac ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt 60
gctcctttta cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc 120
cgtatggctt tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag 180
ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc 240
actggttggg gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc 300
cctattgcca cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg 360
ctgttgggca ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg 420
ctcgcctgtg ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc 480
ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt 540
cttcgccttc gccctcagac gagtcggatc tccctttggg ccgcctcccc gcatcggact 600
ag 602
<210> 109
<211> 237
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 109
gtcgactaga gctcgctgat cagcctcgac tgtgccttct agttgccagc catctgttgt 60
ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg tcctttccta 120
ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc tggggggtgg 180
ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg ctgggga 237
<210> 110
<211> 3493
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 110
gacctcgaag gggacttggg gggttcgggg ctttcggggg cggtcggggg ttcgcggacc 60
cgggaagctc tgaggaccca gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt 120
aactttccca gtctccgagg gaagaggcgg ggtgtggggt gcggttaaaa ggcgccacgg 180
cgggagacag gtgttgcggc cccgcagcgc ccgcgcgctc ctctccccga ctcggagccc 240
ctcggcggcg cccggcccag gacccgccta ggagcgcagg agccccagcg cagagacccc 300
aacgccgaga cccccgcccc ggccccgccg cgcttcctcc cgacgcagag caaaccgccc 360
agagtagaag cggatccgcc accatggatt ggggcacgct gcagacgatc ctggggggtg 420
tgaacaaaca ctccaccagc attggaaaga tctggctcac cgtcctcttc atttttcgca 480
ttatgatcct cgttgtggct gcaaaggagg tgtggggaga tgagcaggcc gactttgtct 540
gcaacaccct gcagccaggc tgcaagaacg tgtgctacga tcactacttc cccatctccc 600
acatccggct atgggccctg cagctgatct tcgtgtccac gccagcgctc ctagtggcca 660
tgcacgtggc ctaccggaga catgagaaga agaggaagtt catcaagggg gagataaaga 720
gtgaatttaa ggacatcgag gagatcaaaa cccagaaggt ccgcatcgaa ggctccctgt 780
ggtggaccta cacaagcagc atcttcttcc gggtcatctt cgaagccgcc ttcatgtacg 840
tcttctatgt catgtacgac ggcttctcca tgcagcggct ggtgaagtgc aacgcctggc 900
cttgtcccaa cactgtggac tgctttgtgt cccggcccac ggagaagact gtcttcacag 960
tgttcatgat tgcagtgtct ggaatttgca tcctgctgaa tgtcactgaa ttgtgttatt 1020
tgctaattag atattgttct gggaagtcaa aaaagccagt ttacccatac gatgttccag 1080
attacgctta aggcgcgcca cccctgcagg gaattccgca ttgcccagtt gttagattaa 1140
gaaatagaca gcatgagagg gatgaggcaa cccgtgctca gctgtcaagg ctcagtcgct 1200
agcatttccc aacacaaaga ttctgacctt aaatgcaacc atttgaaacc cctgtaggcc 1260
tcaggtgaaa ctccagatgc cacaatggag ctctgctccc ctaaagcctc aaaacaaagg 1320
cctaattcta tgcctgtctt aattttcttt cacttaagtt agttccactg agaccccagg 1380
ctgttagggg ttattggtgt aaggtacttt catattttaa acagaggata tcggcatttg 1440
tttctttctc tgaggacaag agaaaaaagc caggttccac agaggacaca gagaaggttt 1500
gggtgtcctc ctggggttct ttttgccaac tttccccacg ttaaaggtga acattggttc 1560
tttcatttgc tttggaagtt ttaatctcta acagtggaca aagttaccag tgccttaaac 1620
tctgttacac tttttggaag tgaaaacttt gtagtatgat aggttatttt gatgtaaaga 1680
tgttctggat accattatat gttccccctg tttcagaggc tcagattgta atatgtaaat 1740
ggtatgtcat tcgctactat gatttaattt gaaatatggt cttttggtta tgaatacttt 1800
gcagcacagc tgagaggctg tctgttgtat tcattgtggt catagcacct aacaacattg 1860
tagcctcaat cgagtgagac agactagaag ttcctagtga tggcttatga tagcaaatgg 1920
cctcatgtca aatatttaga tgtaattttg tgtaagaaat acagactgga tgtaccacca 1980
actactacct gtaatgacag gcctgtccaa cacatctccc ttttccatga ctgtggtagc 2040
cagcatcgga aagaacgctg atttaaagag gtcgcttggg aattttattg acacagtacc 2100
atttaatggg gaggacaaaa tggggcaggg gagggagaag tttctgtcgt taaaaacaga 2160
tttggaaaga ctggactcta aagtctgttg attaaagatg agctttgtct acttcaaaag 2220
tttgtttgct taccccttca gcctccaatt ttttaagtga aaatatagct aataacatgt 2280
gaaaagaata gaagctaagg tttagataaa tattgagcag atctatagga agattgaacc 2340
tgaatattgc cattatgctt gacatggttt ccaaaaaatg gtactccaca tatttcagtg 2400
agggtaagta ttttcctgtt gtcaagaata gcattgtaaa agcattttgt aataataaag 2460
aatagcttta atgatatgct tgtaactaaa ataattttgt aatgtatcaa atacatttaa 2520
aacattaaaa tataatctct ataataattt aaaatctaat atggttttaa tagaacagcg 2580
atatcaagct tatcgatgat aatcaacctc tggattacaa aatttgtgaa agattgactg 2640
gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta atgcctttgt 2700
atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa tcctggttgc 2760
tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt 2820
ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc ctttccggga 2880
ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc cttgcccgct 2940
gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg gggaaatcat 3000
cgtcctttcc ttggctgctc gcctgtgttg ccacctggat tctgcgcggg acgtccttct 3060
gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg ctgccggctc 3120
tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc ctttgggccg 3180
cctccccgca tcggactagg aattcatcga taccgagcgc tgctcgagag atctgtgata 3240
gcggccatca agctgggtcg actagagctc gctgatcagc ctcgactgtg ccttctagtt 3300
gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa ggtgccactc 3360
ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt 3420
ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa gacaatagca 3480
ggcatgctgg gga 3493
<210> 111
<211> 3918
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 111
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt 60
ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120
aggggttcct tgtagttaat gattaacccg ccatgctact tatctaccag ggtaatgggg 180
atcctctaga acgcgtttaa ttaagacctc gaaggggact tggggggttc ggggctttcg 240
ggggcggtcg ggggttcgcg gacccgggaa gctctgagga cccagaggcc gggcgcgctc 300
cgcccgcggc gccgccccct ccgtaacttt cccagtctcc gagggaagag gcggggtgtg 360
gggtgcggtt aaaaggcgcc acggcgggag acaggtgttg cggccccgca gcgcccgcgc 420
gctcctctcc ccgactcgga gcccctcggc ggcgcccggc ccaggacccg cctaggagcg 480
caggagcccc agcgcagaga ccccaacgcc gagacccccg ccccggcccc gccgcgcttc 540
ctcccgacgc agagcaaacc gcccagagta gaagcggatc cgccaccatg gattggggca 600
cgctgcagac gatcctgggg ggtgtgaaca aacactccac cagcattgga aagatctggc 660
tcaccgtcct cttcattttt cgcattatga tcctcgttgt ggctgcaaag gaggtgtggg 720
gagatgagca ggccgacttt gtctgcaaca ccctgcagcc aggctgcaag aacgtgtgct 780
acgatcacta cttccccatc tcccacatcc ggctatgggc cctgcagctg atcttcgtgt 840
ccacgccagc gctcctagtg gccatgcacg tggcctaccg gagacatgag aagaagagga 900
agttcatcaa gggggagata aagagtgaat ttaaggacat cgaggagatc aaaacccaga 960
aggtccgcat cgaaggctcc ctgtggtgga cctacacaag cagcatcttc ttccgggtca 1020
tcttcgaagc cgccttcatg tacgtcttct atgtcatgta cgacggcttc tccatgcagc 1080
ggctggtgaa gtgcaacgcc tggccttgtc ccaacactgt ggactgcttt gtgtcccggc 1140
ccacggagaa gactgtcttc acagtgttca tgattgcagt gtctggaatt tgcatcctgc 1200
tgaatgtcac tgaattgtgt tatttgctaa ttagatattg ttctgggaag tcaaaaaagc 1260
cagtttaccc atacgatgtt ccagattacg cttaaggcgc gccacccctg cagggaattc 1320
cgcattgccc agttgttaga ttaagaaata gacagcatga gagggatgag gcaacccgtg 1380
ctcagctgtc aaggctcagt cgctagcatt tcccaacaca aagattctga ccttaaatgc 1440
aaccatttga aacccctgta ggcctcaggt gaaactccag atgccacaat ggagctctgc 1500
tcccctaaag cctcaaaaca aaggcctaat tctatgcctg tcttaatttt ctttcactta 1560
agttagttcc actgagaccc caggctgtta ggggttattg gtgtaaggta ctttcatatt 1620
ttaaacagag gatatcggca tttgtttctt tctctgagga caagagaaaa aagccaggtt 1680
ccacagagga cacagagaag gtttgggtgt cctcctgggg ttctttttgc caactttccc 1740
cacgttaaag gtgaacattg gttctttcat ttgctttgga agttttaatc tctaacagtg 1800
gacaaagtta ccagtgcctt aaactctgtt acactttttg gaagtgaaaa ctttgtagta 1860
tgataggtta ttttgatgta aagatgttct ggataccatt atatgttccc cctgtttcag 1920
aggctcagat tgtaatatgt aaatggtatg tcattcgcta ctatgattta atttgaaata 1980
tggtcttttg gttatgaata ctttgcagca cagctgagag gctgtctgtt gtattcattg 2040
tggtcatagc acctaacaac attgtagcct caatcgagtg agacagacta gaagttccta 2100
gtgatggctt atgatagcaa atggcctcat gtcaaatatt tagatgtaat tttgtgtaag 2160
aaatacagac tggatgtacc accaactact acctgtaatg acaggcctgt ccaacacatc 2220
tcccttttcc atgactgtgg tagccagcat cggaaagaac gctgatttaa agaggtcgct 2280
tgggaatttt attgacacag taccatttaa tggggaggac aaaatggggc aggggaggga 2340
gaagtttctg tcgttaaaaa cagatttgga aagactggac tctaaagtct gttgattaaa 2400
gatgagcttt gtctacttca aaagtttgtt tgcttacccc ttcagcctcc aattttttaa 2460
gtgaaaatat agctaataac atgtgaaaag aatagaagct aaggtttaga taaatattga 2520
gcagatctat aggaagattg aacctgaata ttgccattat gcttgacatg gtttccaaaa 2580
aatggtactc cacatatttc agtgagggta agtattttcc tgttgtcaag aatagcattg 2640
taaaagcatt ttgtaataat aaagaatagc tttaatgata tgcttgtaac taaaataatt 2700
ttgtaatgta tcaaatacat ttaaaacatt aaaatataat ctctataata atttaaaatc 2760
taatatggtt ttaatagaac agcgatatca agcttatcga tgataatcaa cctctggatt 2820
acaaaatttg tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg 2880
gatacgctgc tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct 2940
cctccttgta taaatcctgg ttgctgtctc tttatgagga gttgtggccc gttgtcaggc 3000
aacgtggcgt ggtgtgcact gtgtttgctg acgcaacccc cactggttgg ggcattgcca 3060
ccacctgtca gctcctttcc gggactttcg ctttccccct ccctattgcc acggcggaac 3120
tcatcgccgc ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt 3180
ccgtggtgtt gtcggggaaa tcatcgtcct ttccttggct gctcgcctgt gttgccacct 3240
ggattctgcg cgggacgtcc ttctgctacg tcccttcggc cctcaatcca gcggaccttc 3300
cttcccgcgg cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt cgccctcaga 3360
cgagtcggat ctccctttgg gccgcctccc cgcatcggac taggaattca tcgataccga 3420
gcgctgctcg agagatctgt gatagcggcc atcaagctgg gtcgactaga gctcgctgat 3480
cagcctcgac tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 3540
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 3600
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 3660
gggaggattg ggaagacaat agcaggcatg ctggggacac gtgcggaccg agcggccgcg 3720
gtaccaaacc taggtaatac ccattaccct ggtagataag tagcatggcg ggttaatcat 3780
taactacaag gaacccctag tgatggagtt ggccactccc tctctgcgcg ctcgctcgct 3840
cactgaggcc gggcgaccaa aggtcgcccg acgcccgggc tttgcccggg cggcctcagt 3900
gagcgagcga gcgcgcag 3918
SEQUENCE LISTING
<110> President and Fellows of Harvard College
<120> RECOMBINANT ADENO ASSOCIATED VIRUS (RAAV) ENCODING GJB2 AND USES
THEREOF
<130> H0824.70367WO00
<140> Not Yet Assigned
<141> 2021-09-14
<150> US 63/078,233
<151> 2020-09-14
<150> US 63/161,619
<151> 2021-03-16
<160> 111
<170> PatentIn version 3.5
<210> 1
<211> 225
<212> PRT
<213> Homo sapiens
<400> 1
Met Asp Trp Gly Thr Leu Gln Thr Ile Leu Gly Gly Val Asn Lys His
1 5 10 15
Ser Thr Ser Ile Gly Lys Ile Trp Leu Thr Val Leu Phe Ile Phe Arg
20 25 30
Ile Met Ile Leu Val Val Ala Ala Lys Glu Val Trp Gly Asp Glu Gln
35 40 45
Ala Asp Phe Val Cys Asn Thr Leu Gln Pro Gly Cys Lys Asn Val Cys
50 55 60
Tyr Asp His Tyr Phe Pro Ile Ser His Ile Arg Leu Trp Ala Leu Gln
65 70 75 80
Leu Ile Phe Val Ser Thr Pro Ala Leu Leu Val Ala Met His Val Ala
85 90 95
Tyr Arg Arg His Glu Lys Arg Lys Phe Ile Lys Gly Glu Ile Lys Ser
100 105 110
Glu Phe Lys Asp Ile Glu Glu Ile Lys Thr Gln Lys Val Arg Ile Glu
115 120 125
Gly Ser Leu Trp Trp Thr Tyr Thr Ser Ser Ile Phe Phe Arg Val Ile
130 135 140
Phe Glu Ala Ala Phe Met Tyr Val Phe Tyr Val Met Tyr Asp Gly Phe
145 150 155 160
Ser Met Gln Arg Leu Val Lys Cys Asn Ala Trp Pro Cys Pro Asn Thr
165 170 175
Val Asp Cys Phe Val Ser Arg Pro Thr Glu Lys Thr Val Phe Thr Val
180 185 190
Phe Met Ile Ala Val Ser Gly Ile Cys Ile Leu Leu Asn Val Thr Glu
195 200 205
Leu Cys Tyr Leu Leu Ile Arg Tyr Cys Ser Gly Lys Ser Lys Lys Pro
210 215 220
Val
225
<210> 2
<211> 678
<212> DNA
<213> Homo sapiens
<400> 2
atggattggg gcacgctgca gacgatcctg gggggtgtga acaaacactc caccagcatt 60
ggaaagatct ggctcaccgt cctcttcatt tttcgcatta tgatcctcgt tgtggctgca 120
aaggaggtgt ggggagatga gcaggccgac tttgtctgca acaccctgca gccaggctgc 180
aagaacgtgt gctacgatca ctacttcccc atctcccaca tccggctatg ggccctgcag 240
ctgatcttcg tgtccacgcc agcgctccta gtggccatgc acgtggccta ccggagacat 300
gagaagaaga ggaagttcat caagggggag ataaagagg aatttaagga catcgaggag 360
atcaaaaccc agaaggtccg catcgaaggc tccctgtggt ggacctacac aagcagcatc 420
ttcttccggg tcatcttcga agccgccttc atgtacgtct tctatgtcat gtacgacggc 480
ttctccatgc agcggctggt gaagtgcaac gcctggcctt gtcccaacac tgtggactgc 540
tttgtgtccc ggcccacgga gaagactgtc ttcacagtgt tcatgattgc agtgtctgga 600
atttgcatcc tgctgaatgt cactgaattg tgttatttgc taattagata ttgttctggg 660
aagtcaaaaa agccagtt 678
<210> 3
<211> 226
<212> PRT
213 <213>
<400> 3
Met Asp Trp Gly Thr Leu Gln Ser Ile Leu Gly Gly Val Asn Lys His
1 5 10 15
Ser Thr Ser Ile Gly Lys Ile Trp Leu Thr Val Leu Phe Ile Phe Arg
20 25 30
Ile Met Ile Leu Val Val Ala Ala Lys Glu Val Trp Gly Asp Glu Gln
35 40 45
Ala Asp Phe Val Cys Asn Thr Leu Gln Pro Gly Cys Lys Asn Val Cys
50 55 60
Tyr Asp His His Phe Pro Ile Ser His Ile Arg Leu Trp Ala Leu Gln
65 70 75 80
Leu Ile Met Val Ser Thr Pro Ala Leu Leu Val Ala Met His Val Ala
85 90 95
Tyr Arg Arg His Glu Lys Lys Arg Lys Phe Met Lys Gly Glu Ile Lys
100 105 110
Asn Glu Phe Lys Asp Ile Glu Glu Ile Lys Thr Gln Lys Val Arg Ile
115 120 125
Glu Gly Ser Leu Trp Trp Thr Tyr Thr Thr Ser Ile Phe Phe Arg Val
130 135 140
Ile Phe Glu Ala Val Phe Met Tyr Val Phe Tyr Ile Met Tyr Asn Gly
145 150 155 160
Phe Phe Met Gln Arg Leu Val Lys Cys Asn Ala Trp Pro Cys Pro Asn
165 170 175
Thr Val Asp Cys Phe Ile Ser Arg Pro Thr Glu Lys Thr Val Phe Thr
180 185 190
Val Phe Met Ile Ser Val Ser Gly Ile Cys Ile Leu Leu Asn Ile Thr
195 200 205
Glu Leu Cys Tyr Leu Phe Val Arg Tyr Cys Ser Gly Lys Ser Lys Arg
210 215 220
Pro Val
225
<210> 4
<211> 678
<212> DNA
213 <213>
<400> 4
atggattggg gcacactcca gagcatcctc gggggtgtca acaaacactc caccagcatt 60
ggaaagatct ggctcacggt cctcttcatc ttccgcatca tgatcctcgt ggtggctgca 120
aaggaggtgt ggggagatga gcaagccgat tttgtctgca acacgctcca gcctggctgc 180
aagaatgtat gctacgacca ccacttcccc atctctcaca tccggctctg ggctctgcag 240
ctgatcatgg tgtccacgcc agccctcctg gtagctatgc atgtggccta ccggagacat 300
gaaaagaaac ggaagttcat gaagggagag ataaagaacg agtttaagga catcgaagag 360
atcaaaaccc agaaggtccg tatcgaaggg tccctgtggt ggacctacac caccagcatc 420
ttcttccggg tcatctttga agccgtcttc atgtacgtct tttacatcat gtacaatggc 480
ttcttcatgc aacgtctggt gaaatgcaac gcttggccct gccccaatac agtggactgc 540
ttcatttcca ggcccacaga aaagactgtc ttcaccgtgt ttatgatttc tgtgtctgga 600
atttgcattc tgctaaatat cacagagctg tgctatttgt tcgttaggta ttgctcagga 660
aagtccaaaa gaccagtc 678
<210> 5
<211> 500
<212> DNA
<213> Homo sapiens
<400> 5
acctgtctcc cgccgtggcg ccttttaacc gcaccccaca ccccgcctct tccctcggag 60
actgggaaag ttacggaggg ggcggcgccg cgggcggagc gcgcccggcc tctgggtcct 120
cagagcttcc cgggtccgcg aacccccgac cgcccccgaa agccccgaac cccccaagtc 180
cccttcgagg tcccgatctc ctagttcctt tgagccccca tgagttcccc aagtgccccc 240
agcgccctga gtctcccccg gttaccccga gcgccgcctc ccccagcccc ttggcggccc 300
gggtgaagcg ggggcggctg agagtcggga ccccccagga agcggcgccc cagaccccgg 360
ctccggcgct gtgccgtggg cggggttcag ggatggctgt ggtcgttgtc ctctgtactc 420
cgcatagtgc gagaggactt ggcatttatg agcgcttctt taatttttta ttgttagaga 480
aacaggcatt cctccaagga 500
<210> 6
<211> 4843
<212> DNA
<213> Homo sapiens
<400> 6
ctttgtggat ggcttggtgg cctcactgtc aggctggcac tgatggctca gttagcatat 60
ctgttttgat aagtgctgca acagtgcatt ataattgtgg gctgtggttt taatttcaaa 120
gtgtttctta aaagacacat tattttaaaa tgacagaaaa ttcaactccc tcggttactg 180
gcccagctaa gcgacgtcac tgcattgcag ttcagcgctg aagcttggga gagtcccaca 240
ctccttactg caagcggatg tggagaggcc agtggataat ctcctgtgag cccatggcct 300
tcttttcatc ccaggatgg aattgtcttc actgattcat agttacaccc tgcctgccac 360
aaccaacgct ctcctaaaca agattccacc ctctccacaa tccggatgaa tcatctcttt 420
tccacccttc agagctggta gtgaatcctc cttcttcttt ttcttaaaag catcctcctc 480
tcctcatttt aggcaagttg catcccgttt tctgatggac tccagaagca ggctcgtagt 540
gaatgtcttt catgacccac agtcgctgcc acggggcacc aaggtcaggc agaaaccatc 600
cagtgccacc ttggtcagag gctaacagga gagaggtggc cacgaaagtt acatcagatt 660
gacataggcc tgtgaaacat ttagcttcac tgagcttggg aaagacaaca tcattggaaa 720
aaacaatatt ttagcccagg ttcagcactg acccattgat aatccagact gggaggccct 780
taggtgagct ggttgtcctg ctacagcacc cacagctcag gccagtcccg tcccaacagc 840
agaaccaccg aggacagcaa cattccgatt ttaacaaaag catcttatgg aattagacat 900
tcttcattgg ccctcactga gtggaaaaca ggatactccc cgaagtaaac tctctcctgg 960
tttacaacaa tacacctggc caagaatatg gggctgcagg aggaggggtt tatcctttgc 1020
cctcttccac ctgccaaacc caggtcatac acccttctac agacctgtcc agttaccatc 1080
agctgagaaa aatacagttc cgagaaaccc tatattgtta ttttataaag cttgagttga 1140
agctacctgt tttaaagatc ctttttcagg aagaggagta aattaagatt tactccccaa 1200
tgggctaggg ggtcatgggt taagaggggc tcagaagcag gacgaagttg ttttcaatat 1260
tcaagtcaga ggaggagctg ccctcctggc ctcccgaccc tgggcggtta catgcagctt 1320
cctaccgggc ccacgccatc ctgcaccgcc tggagggctg ccagaggcca gcggaggagt 1380
tggttcagtt ccttagggaa gacactaggt gaatcaccag gatccagaaa aggcaaaagg 1440
gactcttcac cccttaaatt tctccaccct taggtgatgg gtggtcgacc ttgcctggct 1500
gtccccagag ggttcctcca cccttctcac cagtgtctga aattgtgacc gactgtgcac 1560
agcagtttcg aaagggactc taaggtcaca tggggacacg gccgtaccac gcttctcaag 1620
gcagtcccag gtgcatggcc acggaaccca gctctcagca gctgttagtt aggtgagcgc 1680
tgttcgggct gccttcctcc tccagtgggg caggatcgag gcactgatgg aaccgtcctg 1740
aggacgcggg tctcagccgc acaccacctc ttcgcgaaca agggtcctaa aaattttcct 1800
tctaggcggg gagcacagcc cggaaacaga ccctcgtgaa gtgtttagga aaaagggaag 1860
ccactgaaat cttggccccg gggtaggccg ggatcggctg gctccgcgtt agttctaggc 1920
aaactccgcc caaatctctg cccggggatt tttctgcaga agccgctcca agaggtaaag 1980
gtcagttcct gcagcgaagg cttcctgctt caccggcgaa acggagcttt gcttcgaagc 2040
taagctttcg gtgaatttaa aacgtttggt ggcagtgggt caagtagcca ggcggctgcg 2100
ctagagtacc ccgaagggac atcggcgaca ccacaaacct cgcgctggcg gctcgcccgc 2160
gcctttttcc cctcccgcgc gcgcccggcc ccactcgcac cccgggcggt gccatcgcgt 2220
ccacttcccc ggccgcccca ttccagctcc ggagctcggc cgcagaaacg cccgctccag 2280
aaggcggccc ccgccccccg gcccaaggac gtgtgttggt ccagcccccc ggttccccga 2340
gacccacgcg gccgggcaac cgctctgggt ctcgcggtcc ctccccgcgc caggttcctg 2400
gccgggcagt ccggggccgg cgggctcacc tgcgtcggga ggaagcgcgg cggggccggg 2460
gcgggggtct cggcgttggg gtctctgcgc tggggctcct gcgctcctag gcgggtcctg 2520
ggccgggcgc cgccgagggg ctccgagtcg gggagaggag cgcgcgggcg ctgcggggcc 2580
gcaacacctg tctcccgccg tggcgccttt taaccgcacc ccacaccccg cctcttccct 2640
cggagactgg gaaagttacg gagggggcgg cgccgcgggc ggagcgcgcc cggcctctgg 2700
gtcctcagag cttcccgggt ccgcgaaccc ccgaccgccc ccgaaagccc cgaacccccc 2760
aagtcccctt cgaggtcccg atctcctagt tcctttgagc ccccatgagt tccccaagtg 2820
cccccagcgc cctgagtctc ccccggttac cccgagcgcc gcctccccca gccccttggc 2880
ggcccgggtg aagcgggggc ggctgagagt cgggaccccc caggaagcgg cgccccagac 2940
cccggctccg gcgctgtgcc gtgggcgggg ttcagggatg gctgtggtcg ttgtcctctg 3000
tactccgcat agtgcgagag gacttggcat ttatgagcgc ttctttaatt ttttattgtt 3060
agagaaacag gcattcctcc aaggactgaa gatctgttcg agtcgcggag gctgcgcggg 3120
cccgcgaggc tctcgcaggg ggacctaggc tgggtggcgg ggcagtgccc tctggaatgg 3180
gggttaacgg tggccgagga gggggcgccg ctggtgccgg cgaagtcccc gcttctttct 3240
cccctcaaaa tctcaccaat ccgaacgaac gccttctcga atttccgatt ttattcaatt 3300
actttcaaca atgtgccaag gactaaggtt gggggcggtg ggagagacaa gcctcgtttt 3360
tgccatggcc ggcagggggg tcccgccatc tgcggagggt gccccccgcg gcccccggcc 3420
cagccaactt cctcctcttt tcgcaactgg ggaactgcaa ggaggtgact cctttcgggg 3480
tgaggaggcc cagacttttc agaaaggaaa gagggcaggt aaaacctgcc aagccccttc 3540
ctgctcgatg cacacagcac gaaaggggga aactgatagg attctgcgga agaccgctgg 3600
ggggctggct ctgcactgca cacctgctgg gggctttctg gataccgtga aactttgtct 3660
cagattatga ggtctcagta tttgcatttg gttggggatt ttgatgtctt gcgatacaaa 3720
tgacagaaga cagatttgca cagcgcaagc ggatgaggga ctaagatgtg cagagcaggc 3780
tgggtgggga ctcccgggga ggtctccccc aacccccgcc ccacctcggg cacccacttc 3840
gcgatttttg cagaggggag ccaggtcaga ggtgcagcct ggtcccctcg cgctcacgtt 3900
tttacccagg tcagttcgaa gttaagtgga aatgatgatt aatcctgaca agtcagatct 3960
ggcctcagaa tggatttccc gtgattgcca ccatattag cattgacttt tccttgaaaa 4020
attggcgccc cgtggccatg ggccgaccta ggcagtttct gcagggacga gcgtgagttt 4080
tgtaccgcgg ttaccaccta ctttccagct ccaggtctta gtctaagagg gagtgtctgc 4140
tcatgaagag gcaaagcccc aggagctgcg aaaagccttg catggcccat ctgagagatg 4200
tgctgagtcg gcttgttaaa aatgacaggc aaagcctgtg gggtggggca gctttcttgg 4260
cctgagcgca tcttggttga gccagaggtg acttggggtg gggagtgggg cgccggttgg 4320
tgggttctcc ctttaatttc tcaaaggctg tggtgtttat gagtctgttg gaatcctggt 4380
tgggttggaa tgaaggaagg ttctagaacc attgtgggaa gctcgctagt aaagatggtt 4440
tggagatcgg aagttgactg actttccccc attgaaaaat gtcacctgag attttagtgc 4500
ctgtatcacg attataggct caactttctt ttccttgttt tctttgattt agttctcctt 4560
atgtgcaaaa ttactgtgtg atgttggcta gtcgtattat cacagccact ccgtgttttc 4620
aggatttgta gctggaagtc ctatagcact taagtcttca cttacagatc agcgcttgct 4680
tttatctgt tttgtgtgat ttctgctgtt ttcctgtgag ttggtgtttt cttcccaagt 4740
aggctcagga ctcctctagg gcaggacatt atatgcatgt acatagtgtc ctccagtgta 4800
ggggaggaga aggagggagag gtgaggtggg aaaagggtga ggg 4843
<210> 7
<211> 5178
<212> DNA
213 <213>
<400> 7
ccaaaaaggg acaaaaacag acaaacaaac aacaccaaca caaacaacaa cagcactaaa 60
acgagtctct gcacctaggt cttcgcacgc aggctggtag tcccaccctc aggtagggcc 120
tgtttggtta acgatccgtg tctgttttga tatgtgttgc aagtgagtgt tgcactgtgg 180
actatggttt taaccttgaa gtgattctaa aataaatata tgatgaaaaa tgacggaaaa 240
ttagctcagc ggttcaccag ttgctggtcc aaggagccac ctgatggggg ttttgccttg 300
ggtggcatca cagtgtatcc tgtctgagtg acacagtgtc tatatatggc ctgtgcccta 360
gatgagcctc cataagccaa tgaccttcta tttcatccca gggcaggaac cttccatggc 420
tacacctggt ctgtcacaat caacccctct tttgattaat cccatcttcc cggctgtcct 480
gactcacttg cttccaccccc ttccttccaa gctgtaaaga atcctctgac tctttcttaa 540
aagcacccta ccctcctgct tagcaagtta catcctgttt cgcagtggac tcacagcagg 600
cgcagagaga agtccctcct tgtccctagt ggcggtggca gagcaccagg gaacccactt 660
gctggaaccc actcagctct gccttggaca gaggagatag ggccaggggc atgggaatta 720
aggaatactg acatacaccg gtaaaacatc aagtcctatc caacttggaa agcagaaaca 780
gacaggctcg gcaggttcag ccctgaccca tttataccta gactgtcaga ggccctttgg 840
gaagctggtt gtcctctgaa cagtctctca gctccatgtg gtctgccccc aacagcagaa 900
ggattgaaaa gcaacagtgt tccaagttta acaaaacaat ctgattggaa ttagaccttc 960
tgttcttcct tccccttctc ccgagtggag atcaggacat tgaaataaac atctacacac 1020
ctgacccaaa atacagagct ggaggatccc tttgcctgcc tatagcatcc acagactagc 1080
ccaattatta tcaacacaga aaaaaaaaaa aaccctcaat ttctgcgtaa actgtgcact 1140
tgtttataaa agtacttaag tgtttgttga atttgagttt accgtgttac ccaggatggc 1200
ttctaaatcc atgcagttgg agttagcaca acatgggggt gggggtaggg ggttaataca 1260
tctataatag cagaactctg gaggctgagg taggaggagt gtgctaactt gaggaaaact 1320
tttctgcaga gcaagaccct ggctcaagaa aacaaacacc aaaagagaca agaaaagaaa 1380
agaacagaac caaaacaaaa acaaacaaac aaacaaacaa aaaaccaaaa aatgggaagg 1440
ccggattgaa caaacaaggt caagaagaga gagagagaga gagagagaga gagagagaga 1500
gagagagaga gaaaactcca aaagaaaacc aaatagctgg gacatagctg tgggtcccgg 1560
catatctgat tgcagctgct tgtcttaaat ggcctttcta agtggaagga gaggttaaaa 1620
tttgacctca caaaggggtt aggagtacta agccagcagg tgaaatcgtc aatattcaac 1680
tgtggtgtag gaggtgattt ccaggctggc cttaggacta ggtcacacgc aggtccctac 1740
ctggcatggg acacctggag attgccttga accggtgaat cattcgctcc tgagtagaag 1800
ggagcttctc catgtttata gtatatactg catatgaccc ttaatttgcct taaaggatac 1860
ttcggggagc tggtggactg cctctagatg ctgaccccac cgcaccctcc acccttctca 1920
taattcactg gctttgccca tagttcccaa aggactccgg ggtaagtgta gccatgactg 1980
agccaggctt ctcaggacaa tcccgtggac ctgagcaatg ggtcccattt aggcctacgc 2040
tcccttccct tccattgagg cagcaccaag gggctgatgc aattgtccta aggggacaagt 2100
ttctcagcag cacgccatct gtgaacctgt gccttccctt ccagctgtaa cgtcccgcct 2160
ggacgcaaat ccttaaaaag catttaagga aagaaaaaaa aaaaaagcaa tcaaaatctc 2220
cacccgagtg caggttgggg ttccccagct cgcgggagcg gctacggccg cgcgttttgg 2280
gcggtcgccc acgtcacccc agtgctttag gtggtaaagg tcagtgtctt cccacggagg 2340
cttcctgctt aacaaatgaa actgagtttt cctgctcagc tttcggttag ctaaaaactt 2400
ttcaatggcg gcagacaacg cagccaggag gcctcgggaa aattctagcg aaggaatact 2460
ggcgacacgt cgcagtcgtg cgcggaacag cctggccccc gcgtccctcc ccaccccgcg 2520
ctgtgcggga cctcccggct caggctgtgc gcggcggtga gagcagccgg ctccaacccc 2580
gagccgggcc agacgcctgc agccgaagaa acgcgttcac agctcgggtc cctatgcacg 2640
ggtggcggtg gcccgtaggg accgcgcagc gcgttccggc ctcggtttcc caggaccgtg 2700
gcggcccgca cccctcctcg cacctcacgc gtccctactg gctgagtctc gcgccccagc 2760
caccgtgggg cgttgcggtc gggggcgggt tacaccagtg tgactcggtg gcgcggattg 2820
gcggtcgcac ctgtgtccgg aggagcgtgc agcgttgggt ggcgggaagc ggcgaggcgc 2880
tgtccccggt aaggagcagg tctgaagcgg gtcccggggc cgctcctggg ttggtccgaa 2940
atgggtcgcc ggctgatcct gtgctggtcg ccgcgggtcc cggtggaggc tgcgctcagt 3000
ggactggagc gccgccgact ggctgcgagt tgggaggcg gagcgcgccg cgcgctgcga 3060
tcctggacac ctgttggccg cggcgccttt taaccaaagc cctcaccccg cctctctcac 3120
cctggagcga ttgagaaagt tgcggaggag gcggctccca gtagcccgcc acccccagcg 3180
ccacgggcgg ggctctccgg gcacccagag ccgtcagggc ccgccgagtc gcgagctctc 3240
ctggagccta ggtcactccc caccccactc cgccccaccc cacccccagc tctctttgag 3300
ctcaaggctc ttccagtgtc ctgtcccgag cgcagcctga acagagctgg tagacctgtg 3360
tcttcaccca ggacgcaggt cgcaaagctc caagtcccag ctactcgctt ttgggggatt 3420
gggtgatgtt gaaagagagt tgatgttgct cttactactc tcactagtgg aaagtgtgct 3480
gttatattcg aagcttcgct gtagtaatat tatatatact tgtgtgtgtg tgtgtgtgtg 3540
tgtgtgtgg tgtgtgtgcg tgttagataa acggacggta cagttttgtg ttggcctgca 3600
gcttccagta gcgcacagga gactcctctc ccgtagtgca gtgagctgag gcatctagaa 3660
ttcgggttca aggcagacta acagagggcg ccgccagggc tggccaaatt ctggcttcta 3720
tttctttgaa ttcccgattt aattcgatca ctttgaacag ggtgccagtg gctaggacag 3780
aagaagatgt agaggtgcgt ctccagggct ggcctggaag tggacttgtc acagtctctg 3840
gagggttctc tgcctgtgcc cccgctctct gtgtcctctt ttccacaact gaaagcattg 3900
caaggaaggg gcacccagat tctgccggtg caggggatgc ggaagggggg ggggagcaga 3960
agaggttagg caagcccatc cctcttggag tccaggatgc tgggaagacc tgggcagcct 4020
gcatctacct ctctccgcca agctgttcgt gggttttgag ggctcggtgt tccacattgc 4080
ttggctgtct ggatagtttt gagaggagtt acggtggaca ttcacaagag ctagctacgc 4140
tttgggatac ctaggccagc tagcttcacc ttactacttg caacccgagt cctacagctg 4200
ccaggtttgg aatgaaaacg gcacatcccc acaaagttcc ttcagattag ctttacacgc 4260
agtgaagaga ctgattcatt ctgacaaggc ccgtctggtc gaaggattgg ctttcaatga 4320
aaggaccatg gctgaaggta catgctttcc ctgtaaagct ggcacattgc cgcgggcaga 4380
cctgactgct cttgcttggg cagaggaagg ttgcacgctc gcttgctact acccccacct 4440
cctttctaac tgtaagtctt agtctaagag ggaggtgtctc taaggaagag agcctcggat 4500
ctgtgtccag cccttcagag agagagagat gtgctgaatc agcttgtgtg gaataactgg 4560
ccaagcaaga tggggtggta caactccctt ggcctgagca catctaaaga tgaatcaaag 4620
aggagatgag gtagtggcag caggcagggg tggaaggatg ttggcacctt tagcttctca 4680
tgggtcgtac agtttccagt caattggagc ccctgttcag tgaggatgac agaagcttct 4740
agaatcattg taggaagctg gccagtaaaa gataggttgg agatcagaac tgcttcactt 4800
tctccattga acaatttctc ctgagggtta gtgcccacgt tatgattaca gcttcagcgt 4860
ctagctccct aacttgcttc tacagattcg cctaatggct gtgtgttggc tgatggtcac 4920
aggtgctggg aatattagga tgtatcgcta gctcatctcc tcctctgttc cagccatccc 4980
tccttgtttc ttgttttctc accaactaga ccagaggctc ctctagggta agaaatgcta 5040
aatttatttg tgtatgtgta ttctccagag ggggagaggg gagagggaag gagaagggag 5100
gggaagagag gcaaggagaa gggagaaggg aggagaaggg aggacagggg gacagaggaa 5160
gctagaaaag agctagga 5178
<210> 8
<211> 4964
<212> DNA
<213> Homo sapiens
<400> 8
taatccagat gttaacactg aaacttccaa gcaggggagt gaaatgagac tttcactttt 60
gacttcgtat actcctgtat tatttaagtg aaaatgtatt tatatattct ataattacaa 120
aaatcacatt ggttgccttt tcattttgaa atgagcaaaa gtgacagggc tgttaaaaag 180
ctaagtcact tgagcaataa cgtgatgtcc agaacagtgg ttccatggct cagccatgtc 240
gggggctgca ctgaggacag ggggccatct gccttctagg aggacactgt ggactggaat 300
attgttcctg ccttgaggag gagtctccca gcacagttac tgctgcttga ctgtcagagc 360
atgcgttttc ttagggaagt tgaaggcagc ctgtatctag taaggtggta tgcagtagtt 420
gcttaatgct gaatgtgtga aggaatgtgg ggctgtggag caggaggata aagtctgaac 480
ttggacctgt tgttctcagc tattcgaagc tttctcaagt ggaaaataga ctgactttgg 540
gtccatcaga gggcagaaca aatgctggag agcagatgct agaattccgt cttaaaacca 600
tgaatcctta cagcggcctg cgtggcctgc gccatctgtc ccagccacgc cctccttggc 660
cccatctccc cctttctcgc cctgactctt tggcatcctg gcctttccgt ctcactggga 720
tgcttcccta agagactcgt gtggtttgct gccctgtatc ctccggatct cctgaccacc 780
ctatgttagt tacattgcaa tttcccgttt ccctcatgac gtcttatttt cctccattta 840
aattacctgc agcaggtacc acctacaggg atctgttgag agtcggcctc cttcaatgtg 900
aagcctgatg ttttgttctg ttcacagcta tgcccccagc ccctaacagt tggtggcagt 960
cagtaaatat tgcctgggaa aacgaatcat tagccatgtg cagaaatgga acagcgtctc 1020
accaagttgg ggttgcccct ggaccctgg aacactgggg cagctggggt gttcctactg 1080
tgcttgttac cggcttcagg aatcaaatgc actagagaat tgtagaagtg cggtccacat 1140
cctctgtgtg gtaggaccag ctgctgttgg cctctgagca ggatctctta cctctctgag 1200
cagtgccttc ctgttgccct cagcaagaat aacactaaca gcctaggact tcagagcact 1260
gctgcgaggt gcaaatgagg tgatatggga aaagcatttg gtgagatgta tggaaagtgt 1320
agagaccctg accagatgag tcaatggcct tcttcgttac tctgttgacc tttctttaat 1380
tacagagtcg catagctgtc accaccttat ccttttttgc tgctatattt gcccccagcc 1440
attcctctcc cggcttatgt ggctagactc acctgcctgt gctgcagtta ctccaggctt 1500
tgtgtaaatg tgcatttttt tccagccccc agtttatcaa gctttgcttg agtcacttgt 1560
atctgaaata ccatctgtca ctcttccagg ttgggatctg tctagtggaa aacagatgac 1620
agtcatatgt tacttagtgc tttactatgt ggagaacgtt tacataaatt atcttatttc 1680
attgccacta agccggggaa agattcagga aacccatttt aagatgagga cactgaggtc 1740
agggtaagtg agtgagcttt tacccacctc tcagctgctc tctagttgtc aaagaccaac 1800
ccgtgggggt ggctcaggcc cgacccctgc agcatattcc ttggggcctc ccaagtgggc 1860
ccgatctgct caccccagct gtgactgtct tttgacagga ggagggagca gcgaggctgc 1920
acccactgct cataaaaagc agagcttgtc cacgccgagg gctcggctgg gtgggaggcc 1980
gcttccacaa ggctttttct tgctccatac aaagtgcaga ctgatgcttt gagatatagt 2040
caggattatc attttcagag ctcaagctct aatttccagg catgtgacca gacctctcta 2100
tccattccta caagtggtcg agagtagccc ataattattt tggcttggtc ttttaatagc 2160
2220 ttgagagtaa taatctacat agcttgtaga agtgaatgta cttattttaa
ttttttgatg ttgttgttgt ttgggacagg atcttgctgt cgcctaggct ggagtgcagt 2280
ggcacaatct cagctcactg cagcatggac ctcccaggtt caagcaatct tcccacctca 2340
gcctcctgag tagctgagac tacaggcaca tgttaccacg cctgcctggc taacattttt 2400
attttttata gaaacaatgt ctccctatat tgcccaggct ggttttgaac tcctgggctc 2460
aagtgatcct ctcgtctcag cctcccaaag tgttgggatt ataggtataa gcctctgcac 2520
ccagcttaaa aaatcctatt ttcacagtct atgtgcagag cattttggaa gtcaggtaga 2580
aaccatttcc cattttctat tacctgggtg atagttgact ggtttttgtt ctttgaaatc 2640
cattttaaaa gtgtatggtc ctctatgaaa atacttctaa ttattgatgt gtgaaatgct 2700
ttgaaatcct tggatgggaaa tcttgtacca tgaaagaaca gaactgttgg tggtgtctct 2760
gggagaggct cacgagggcc gggcaagcct gtgggggtag caggcagtca ctcccatggg 2820
gacaggctga cctggcaggc ttatttccca tggaagtggg cactgaggaa taaaaagcag 2880
tttcaggcca ggtgcggtgg cccatgcctg taatccttgc actttaggag actgaggcag 2940
ggggatccct tcagcccagg agttcgagac cagactgggc aatatagtgg gacctcgttt 3000
ctacaaaaaa tgaaaaaatt agtggagtgt ggtggcacac tccagtggtc ccagctactt 3060
gggacgctga ggtggggagga tcgcttgagc ctgggaggca gaggttgcag tgagccaagg 3120
tcatgctatg agtaacattt tgaaggtcca cttctgggat tcatccagga gctaaacggg 3180
tcatgtccag ccaactcagc attcaccaag gtacgtttcc agaccaaaca ccacattgtc 3240
catagactga tatgcctcaa aaacctggta gaggtgggca cggggttagg tagaaatcat 3300
cttcctccct tccttcccca ccaaactttc tggtgacaga agcttttctg taactggggc 3360
agaatggggt cagacactct ggcaacttac ccattggtgt tatgaaatat aaaacattaa 3420
tgtatttata taaaaagtga tagatgaaat taaaatttgc tgttctatta aaaccatatt 3480
agattttaaa ttattataga gattatattt taatgtttta aatgtatttg atacattaca 3540
aaattatttt agttacaagc atatcattaa agctattctt tattattaca aaatgctttt 3600
acaatgctat tcttgacaac aggaaaatac ttaccctcac tgaaatatgt ggagtaccat 3660
tttttggaaa ccatgtcaag cataatggca atattcaggt tcaatcttcc tatagatctg 3720
ctcaatattt atctaaacct tagcttctat tcttttcaca tgttattagc tatattttca 3780
cttaaaaaat tggaggctga aggggtaagc aaacaaactt ttgaagtaga caaagctcat 3840
ctttaatcaa cagactttag agtccagtct ttccaaatct gtttttaacg acagaaactt 3900
ctccctcccc tgccccattt tgtcctcccc attaaatggt actgtgtcaa taaaattccc 3960
aagcgacctc tttaaatcag cgttctttcc gatgctggct accacagtca tggaaaaggg 4020
agatgtgttg gacaggcctg tcattacagg tagtagttgg tggtacatcc agtctgtatt 4080
tcttacacaa aattacatct aaatatttga catgaggcca tttgctatca taagccatca 4140
ctaggaactt ctagtctgtc tcactcgatt gaggctacaa tgttgttagg tgctatgacc 4200
acaatgaata caacagacag cctctcagct gtgctgcaaa gtattcataa ccaaaagacc 4260
atatttcaaa ttaaatcata gtagcgaatg acataccatt tacatattac aatctgagcc 4320
tctgaaacag ggggaacata taatggtatc cagaacatct ttacatcaaa ataacctatc 4380
atactacaaa gttttcactt ccaaaaagtg taacagagtt taaggcactg gtaactttgt 4440
ccactgttag agattaaaac ttccaaagca aatgaaagaa ccaatgttca cctttaacgt 4500
ggggaaagtt ggcaaaaaga accccaggag gacacccaaa ccttctctgt gtcctctgtg 4560
gaacctggct tttttctctt gtcctcagag aaagaaacaa atgccgatat cctctgttta 4620
aaatatgaaa gtaccttaca ccaataaccc ctaacagcct ggggtctcag tggaactaac 4680
ttaagtgaaa gaaaattaag acaggcatag aattaggcct ttgttttgag gctttagggg 4740
agcagagctc cattgtggca tctggagttt cacctgaggc ctacaggggt ttcaaatggt 4800
tgcatttaag gtcagaatct ttgtgttggg aaatgctagc gactgagcct tgacagctga 4860
gcacgggttg cctcatccct ctcatgctgt ctatttctta atctaacaac tgggcaatgc 4920
gttaaactgg cttttttgac ttcccagaac aatatctaat tagc 4964
<210> 9
<211> 5166
<212> DNA
213 <213>
<400> 9
catggagaga gatggataac tgagatttct gggcaagaga tgaaatgggc tgaatcccac 60
tcctgactgc acacacctct cagtgattta attagaaata aaaacaagtc tctacattaa 120
catttacata agtaaca gccgtctttt ccattcaaag tgactgaagg agatggtgtt 180
gttaaaagat tgaaattaga cagcagcaac acgtctagaa gagcatccct ggggcagggt 240
tctgcctcaa caccacacag cactacacag caccacactt agcacaaggc tcctcgtggc 300
tcctcatgtc ccttcagcaa gtcaccagtg caccaggagg cgttggggag ggaactcctg 360
accacaatca cagcctgagg gttggagttg tgtttcagtc atcctggggg gcaggggggag 420
cttaaactcg ttggcattta ctagggcagt acacagcagc cgctccacgt tgaacgagtg 480
gatgatcagc ctgagaatca aggctgggct gagcttggct ctatcctcaa ttatctgcag 540
agcgccctgg tagagaacag atctgccttt gagtttccaa gtgagagcgg agcaaggctg 600
ggcacagagc agggtggcaa ggtggctgct gtgggcacag cacagaagat actcaggggc 660
atagatcttc ctggtggctg cttggtctca tgttggtcag gtcacctcca tttttggcct 720
catcatcttc tgacatgcac ctgcttcatg cgtctgcttc ctggaaccca ttcctggctt 780
tttgtcttaa ttctctgagg caggtggctc cattgcttgt ctcctttagg tttcatctaa 840
gagggaccgt cacacacagc ctgtgtggggc atcatgctgg tgcctgacag tcctctctct 900
ctctctctct ctctctctct ctctctctct ctcccccccc cctctgctgt ggctttggcc 960
tctgcagaaa caatctatgg gatttgttga tatgctgcct ccttcaacac aaaggcttaa 1020
gttgtattta tcagctccag tcccagggaa taatcatgtc tggtgcttag ctggtgctca 1080
gtagatagca gctgatgaaa aaaaatcagg agggatacgt aggaactgac cacaaaatct 1140
tgtgggggtg cagttacacc acggactcca gcagtgttgc aacagatgta ggttgtgggc 1200
ctgtggagtt agtcttcatt gtgggagggg caactccaca aggcctatca acataacctc 1260
cgaggggttg gactactctt gctggccttc gatcttgaca attaccagtg ccttcttcac 1320
aacccctccc ccacccctgc acaggtgatg acttgatggt tcttaagttg caataagaat 1380
gacaggaagc aagcaggaag caagagatgt gatatacaca ttaggtcgta tggagaccct 1440
gacagagcaa acctgtaaca ttcattctta ctgttatagc ccctttctta gtcacttatt 1500
aatattcatt tagtcattta gtttttgctg tttgcttgat gcagagtctc atgaagttca 1560
ggctggcttt gaactaagta tgcagctgag gatagccttg aacttcaaat tctcctacct 1620
tcatttctga gccattggga atgcaggcat ccaccttgga gcgccatttc tatttattta 1680
ctttctctaa ggctggggat ggagcctatg gctgtgtgtg gtaggcacag gctggggatg 1740
gagcctatgg ctgtgtgtgg taggtagcat tttggcattg actcacttac tctccagccc 1800
ttgattcttt tgagttacag agtgatacca ttgcctgtca ctcatcttta ctgtgctttt 1860
gtgtatgcac ccagcccccc ttcctctgtt gacctggctg gtctctgagg tcactgtgtt 1920
atgttattt cagtgtcaac ctgcacactc tcaagcttcc ggttaattga gctttgcagg 1980
agacattcct acttactctg tcattcacca tgtcactcag ggtctactga gtggggagaga 2040
gatgacatat taatgctaat atcattctac tgccctaggt ggaggagagg gtctgtgtga 2100
atcaccccat tgcttttcct aggggtgggg agtatttagg aagccccactg taaggtggag 2160
agcctaggcc agggtaagca cggagctccc ttccacccgt ggccacccat tcagcatttg 2220
caagctgctc cctggtgcat cacctagtta gaacagtggc acctgagaca gcttaggcct 2280
ggggaaacca atagaacact ctgttgttcc acttggacta gcagtggcct gtctctccac 2340
agggagcacc acccatgttg gggagcatca cctgtaacct ccagagttca ctcacaccaa 2400
ggcttcttct cttcacaaac tgccatctgc tagtatcagg atgatcatat tccagaggcc 2460
aagcttatgg ccagccctct ccgtcagtcc tatgaagtgg ttgttggcag tttgtaatta 2520
ttttggccct gttctttaat accttaagag taataatctt cataatgtgt aggagtgggaa 2580
ctagccattt aaaaagctgt gcattctttt aacagggtac gtccaggaca ccctggcagg 2640
tgggagac tattcacttt ttctactgtc caagtggacg tgggctaagt tgtatccctt 2700
tcgagctagg ttgtatggtc ctccataaaa acatagtatc actgatgttt aaaatgcctt 2760
gacagcctca gtgtgaagct tataatttaa aggatgatag tgtaggtacc acccaggaga 2820
gagacgtata gcctgtccct tacctgggac acgcttgcct ggcaaggtct gtcccgtggg 2880
aatagacatg gaggaaacaa agaacatggg ccacatgctt ctacacacac acacacacac 2940
3000
agagagagag agtcttgcaa agttctgcag aggacggttc tcaaagtgta gtcttcacag 3060
tggaagatgt tttaattttt aaatataaag aggtttgttg ttgttgtttt ctgtgatact 3120
ggtgttccaa tatgggggcc cacacacgga gacaggtgtt ttagcgctga ttacacactg 3180
agcctaagga ccatgtaaac tgtgagttcc tctgcttctt ctagaaacgg aacggaactg 3240
atcccgtcac caggacttag catcctcctg ctgcactctg actctcagac cttgcagccc 3300
ttaggttggg gctcacggaa cctcttagag tgcgtggatt tgggcagcag tggtctgtct 3360
gttccctctc tctttatcaa gttttctagc cacagggtat tttttgtaac tggagcagaa 3420
tcccagaaca tgttgtaaca tgtgagcata cttctgggat gctttaagat ataaactatg 3480
aaatatatgt atatacaaat tagtatagct gggcatggtg gtgtgcacgt ttaatctcag 3540
tccttgggag gcagagacag gcagatttat gagagttcta ggccagtctg gtgacagagt 3600
gaggccctgt ttcaaagaca aaaacaaatc aaagccagaa aaacttacca ttggtcacgt 3660
tagagtttgg tattctatta aaaaccttat ttaattttaa agtatacaaa ataatcatat 3720
tttaataaag ggcatttagg ggtttacaaa attatatcag tgacaagcat gaaaccacaa 3780
ctcttattta ttgttacaaa atggctttcc aatgacattc ttggcaggaa gaagtgtccc 3840
ctgttggatt tgttgactgt catcttgtag gatacacata aggcatagtg gtaatggttc 3900
aacttgccct agaaaggtta catactgacc taaactagtt tcttctattt cttccaaata 3960
tccacatttc tgtttccagt taagaaggca atgctgaaga gggaggcaaa cacactttca 4020
aaagtagaaa aacttagttt taatcaacag gattgggagt ctagaagttt cattggttct 4080
ctgaaaacca ccccatttgg tttctgcacc attgaattgt cccatggcag tgaaattccc 4140
aagcaaaccc atgaagtccc tatcttctga tgctgactgc aacatcccac agctacagag 4200
tagacaaact ggtggggggt gggggtgggg tggggctgag ttaggctcat ggcaggtggc 4260
agttgtcggc atatcctatc tgtctcttac acaaaattac agttgactat tttaattgag 4320
gcctcttctt gtcagaagcc agcacgagac gcttccagtt tgtctcactt atgacaggca 4380
gtagggttat agccctgagc ccagcacgcc agtgatgaat acaataggtg ggccctcagc 4440
cacactgcag gtttcccata acccaaaggc caacatctta aagaccctgt gagatctggt 4500
tacacaccat gctcacttca cacactgaac ctctggacta ggaggaatgt ataatacttt 4560
ccagatcatt ttaggaaaaa aaagagccta tcttatttta aggttttcat taaaaaaaaa 4620
aagtacacag cacttgaagt attaatagct ttttgtccat tgttgcacac gtaaactatc 4680
aaagcaaata acagtatggc atttctttac ctttagctag gggtaacttg ggggggggga 4740
ctttctcagt ggcaccttcc tcaggaccgg gttcctctct cctgtcctca gaggaagaga 4800
aacaatgtga gatccctttg tttaaactgt gaatgtatcc tccaagcttg gtcgctacca 4860
gcacggggtc tcagtggaac taactttaga acccattaat acaggcatag aattgggcct 4920
ttgtttggga gctttggggg aagggaggcc cacggaggct tctggagttt cataggaggc 4980
ctccagggac ttcaaatggt ggcattttag atgggaatgt ttgtcttggg aactgctggt 5040
ggctgagctc tgccgactaa gcgactaagc atgggttgcc tcatcctctc cctccatctt 5100
tgctctagca gccaggcaat gcattagact ggtcttttgg actttcctga gcaataccta 5160
ACGAAC 5166
<210> 10
<211> 2504
<212> DNA
<213> Homo sapiens
<400> 10
aaggggacag gacatctctt tccaaaactt aggtttggtg actcctggat ttcacactct 60
ctgactgctt gggtgagggt ggaatggagg gctgtccccc accctcgcac ctgcacggtg 120
gcatgctttc ctcctactcc agggaattcc tcgtggcctc atggcctggg ctgtttctgg 180
cttcaagctc cacgtggcct ggccccagcg gtctggtcca ccttgtactc ggtgcccccg 240
ctgccccctg gcctcagctg gagtgacgca cctcatccat gcgggcctgg cgtctggaag 300
gtggctgggt ctctcgggct tgagcaccat catcttagct ccaacatgtc attattcctt 360
cctcactgag gacttttctg cttcctaatt ggttgttgaa gatgaggccc ccatgctctt 420
ttaagaaaac ctgttgtgcc ccaggcttgg ctgtgatggg cactgactca tacagaagta 480
gaaaggcctg ctgagtcatc aacactcgtg cgacgccctc gcattttcat taatgatggc 540
ctccctgcca cacgtgaatc actccagccc gagatctgaa accaggacac accccagggg 600
cgaggtgacg ctgagtgagc ccagctgtgt ccctttcatg agaactcaga gcacagggct 660
ctgtgtgcat ggccgtcccc tccagagagg aggaagtaaa tgccgggatt agtggaagat 720
catttccttc tatttgcctt ggcttacgtc tttcagaatt caaacacgtg cactgttgac 780
cctgcaatgg tggagttttt ggattttcct tcagtccgat tgctaaaata cttccctctc 840
atgtgagctg ttgtgaaagt catcagccag ataccattct aaaaacaaag aatgtgcttc 900
tcgtatgttg catgctggtt actgaaatat tagggaatta cataaaggtt ttctggggca 960
catattcaag ctgaatgata aaattgaagg tcacacaaag ctaaggtctt tcaaatcctg 1020
acccaattag ctctctgtta gctctctgac tttggacaag ctgtctggtc ctctgaagca 1080
tactttgttc gccctgggta ggggccctct gttttaacag cgtttggcag atgaaaacat 1140
ttgcaaagcc aaaggacaat gaaatctacg gaagcctacc atatgccaat gactccacca 1200
aatgttttct cttcttggga tcttctaaaa ttcatctgaa tacttataag ttatgcaaat 1260
tttggttat aatctaggtt gtattacctt gggggaagtc agttaatctc tttgaactca 1320
gtttctttat ctgtgaacct gaaagaacac cttcaaactc caagggtggc tgtcagaatt 1380
aactatagag gtgcaggtat cagatgaaag ctataaaaca gtttacagat cttagatatt 1440
atgatggatg gctatgatac gtttctcgaa tcactgcttg ccaatgagct gtacaatctt 1500
cctgaagggg tctgcctttc caatctgggc agcaacagtt aatgacggtg tgccaggata 1560
tctgtgtctc cttttatctg ctccagactt taaacacacc ctctgattac atcacactat 1620
caatttgaaa aagggctcag agccaaaatc accactgtta gcgagttctc cagggctgcc 1680
tcctatcctc tggaggtggg gctctcgtct gcagaaatag gcataagggt tttctatggt 1740
ttttgtttgt tttaaagacg aaacatgttt tgggatcttt taagaatcct aatcgttgtg 1800
aaagaaactg aagtaagtta ctgttcaagt gactctcatt ctgctgtgaa tagtttctcc 1860
cacgtgaagt cagctcaaga gactgtgaat tgcttcagcc tacctgagac ctggtacaca 1920
gggaggcttc ctagccacgg aagaggagag cgtttgcagg aggagaagga ggagagaggg 1980
cccacgcagg tgacattctg gaaagggaat gctggtgcga aactgcctca cctactttgc 2040
tccttggatg ttcaggaaaa gccagcccca tccgccccag tccgagggcc tcactcatgg 2100
aacaaatgaa gctgagaaga ggagcttcct gttttccagc tgctggggtc atcattatct 2160
tcaggaagga ccccgaaaag catcgtgtgt tgttgcaaag gcctgcctta tcctggcccc 2220
caggtccctc tccgctggcc ctgtctactg gataagctga ggttgcacga agtaggtcca 2280
ggcctaatgt gacagtgaat aatatggtgt ttggccacac agagatgtgt gtaggtacaa 2340
aaaccaccat gcttttggcg gcaaagtaaa aaatgaagat gtcgtcaaac gatctgaact 2400
ctgatggaga ctgagcgaga gaccctggcc caaaacaatc actccatggc ggatgcgctc 2460
tggggtagac agctactgct ctcagagcag ctgttttcag gcca 2504
<210> 11
<211> 3870
<212> DNA
213 <213>
<400> 11
gtaagagcca attaggaagt tccagggtta gtaaaggcca atcagtaagc accagggtaa 60
gagccaatca gtaagctcca aggttagtaa gagccaatca gtaagctcca ggttagtaag 120
aaccaatcgg taagcaccag ggttagtaaa ggccaatcag taaactccag ggttagcaaa 180
gaccaatcag gaagttccag ggttagtaat ggccaatcag taagctcctg ggttagtaag 240
agcttctggt tttggtcctt caatcactgg cctgagcact catgtgattg gctaggctgg 300
ctaatcaacc agctgtggga atactatcca gtgatgggct tgcagacaga tgccacagca 360
tgtggcacct ttaatgtggg tgctgaggat acaaagtcag gtctctccac gcttgcatag 420
gaaacacttt accaaatgag ccatttttct cagtttcgat tttatttat tttttgagac 480
agggtcccac tgtatagctc aggttggaca cagacttgtg atactcctat cttggcctcc 540
ttgactactg gaattgcaag tgtgtggcac catgccagct ggaaaggtaa ctttctaagg 600
tacctctttc taaaatagat gttgaccttt tgtaaggaca gactaaacgc cccctgggct 660
tgaggctggc gccatccaga acagggtaga gcgtattgag cctggcaggt tgaatccatc 720
780
ggagacaaga cagagagtgt tactcagtcc aggtactctc ttgaactaag agcacacagg 840
gaagaagggc ctcatctgag gccaaggtgt cattgtatcc ggtataaggg gacaggatca 900
cctcctttca tgttggagct cgtggatctt acattctcta atgcttgact agatgtgagt 960
ggagctagaa cacgtatctt ctcctggtca ccgcccaggg ttcgtgcgct tttcttactc 1020
ggtacatcat cctcatcgca gtgggctggt ctctggctgc ctcatccagt ttgtcgtctc 1080
agttcatacg gacaccccct ggcttgtcag tgctggccca gtaccctcgg gcctgagcac 1140
ctgtgatgcc cctgcctcca gctcttcctc cccagagtct gcaatgctat cattccttcc 1200
cggcccagag acttacgctt cctcattaga tgtggggagat gaggttctca agctccaaca 1260
aaccagtcct gacctcgttt tggcaggaac tcaaagagaa gtcagaagct tgctgaatca 1320
cccacaccgg ccggccggcc gagcatcctg gcaaggcctg taattagagc ctctctttca 1380
caccttgaat cttgagggcc ccacgtctga aatgaggggt gtcccagtgc ctgctgcaag 1440
tttatgagca gcacacagac tcctttcctt tggaactcag gggtgctgcc tgcgtctggc 1500
ttctgtggag gaggaagtaa tgtgtgtgga ttagtaaaag atcattttcc tgctgtttgt 1560
cttggcctcc gtgcttcaga attcaagcac ttgtactctt gaccctgcag tggtggctgg 1620
ttttgagtcc acttcctgtc tgatcgctaa actgctcctt ctctgaggac cttcagctga 1680
agccacttac ctgctaacac ttaattaatt aataattaat attgtaatta attttttgtt 1740
gcaggattgg cagtgaaacc caaaacgtca cacatgctaa gcaggcacgg ggccatcaaa 1800
tcattttctt aattttttac tttttttt tttgtgtgtg acagggtctc aagtaaccca 1860
ggttgacctt aaacttcctg tgtggccaga atggctttga atctctggcc cttcttctcc 1920
ctcccatggt actgagatta caggtatgta ccaccatgcc tgacaccctg atgctgtggt 1980
ggactcaagg aatgcacata cctaagcttg aatgctcgct gttgaaatac tagagacatt 2040
taaaataatt tgccagttag gaaaagcttt ctatggcaca cagtccaatt gaatcttaac 2100
acacacacac acacacacac acacacacac acacacacac acacaagact taggtctttc 2160
aaattccagc ttggtggctt gttccatgtc ttctttggac aagccctcca gctctcctct 2220
cctctgctct cctccttggt aactaagggg aggccacgcc tactttatg gcatcctaga 2280
gatgccaaca ttggcaaaga gaagggacaa ttaaattcat tgaggcctgt gtggtgtgtc 2340
agcaactctg ccaaccactt tcttatcttg gtatcattta aattagtttg aacacttaaa 2400
aggttgtgta aatgtggctg tctagtatta gaagctgttt tgtattattg ttagttgtgt 2460
tccctcaggg gaagtgagct gccctgagct cagttcttta tctggaaact gggcctaata 2520
cctccagact caaatgactg tcacaggact tagctatgaa ggaaagggtt gaggcagaag 2580
tcagagcact ttacaaatat taggcgcact tactaatgct catgataaat tcttcaaatt 2640
gttgtgcgat aaagatcttg tcagggtttc tcaggcggct atctttccca tcagagctgt 2700
ctgtccaagt taaagacagc ttactggaat atttctgtat ccttttgtcc aatacaggat 2760
ttaaatatac cctgcgatta gattgtaatg ccaataaaaa gaaaagaggg gatgtcagag 2820
cataagccca gggtgacaac cctgggactg gcattctaga ttctggggag gagactcttt 2880
ctgggaagag aggctcatgg cgttttgcag tttttgtttt ctgttttaag acaggagttg 2940
ctttggggag ctttatctta agaatccgaa cggttgtgta ggcaagcaag caagcaaggc 3000
agctactgtt cggttgacct cgttctgctg tgaagaattt gcactgtgtg aagtgtgttc 3060
aggaaaccct gaatagcctt ggcacacctc cgacgtgctg cttcgtggta aagtttcctg 3120
tcctcaaaag agaagacatt taaaggaaga ggagggacca aagaacgggt cacctagaca 3180
acagggatct gggcacctgg taggaaggaa accttagctt atttactcct tgaatgttgg 3240
gagagaacag ccaggaccct gccctagagc ctcactcatg aaagctgaat ctggggacagt 3300
gagtcctccc ctctaactgc tcccagttcc actgtctcca gggtggatcc caagtggatg 3360
ctgtgtacat ggccttcatt ctggtgccta agctccactc tgtggaccct gtcaccaagt 3420
tggtgtgagg aaatgtaaca tttaatatta tgggtctggg ccacaccaat aaactacgag 3480
gcattgtagt caaagctgct gccgcctttc agtcacctga cctcggtggc cattgaataa 3540
gtgaccttgg tctaaaacaa ttgctccaat gttctgttct gatgctctgg gtggatcgct 3600
gcttgtgtca gagcagatgt ttccaggctg ttgctggggc caatgtcacc attcctgtta 3660
gtttcagatt gtctattagt tctagatagg gtctcattat atgagacacc ccaccctcct 3720
gcatggctca aaagtttact gatttttat ctttgtgtgt aagtgtcttg tgtgcacgca 3780
catatatggg caccatatgc attcctggtg gtaggaagct agaagagggg ctcagattct 3840
ctggaactgg agttacagat agtcgtgagt 3870
<210> 12
<211> 1768
<212> DNA
<213> Homo sapiens
<400> 12
atcacgcagc ccataccctg cggttctccg gggacttatg catcggccca agttgagggt 60
ttgtctgaac tgaaacccgc atcctagacc tggctttctt ctccccaaat ccaaggggac 120
accccggtga cccacaaaag cttagaaaat ccaacacgca gcaaatgaaa cgggggaaag 180
gggcaccggc cctcactctg gcctcttaga cacacgatat gaaaccttca taaaacctgt 240
tgtacaagtc aaaggggacc acgctggggt aaaagtcaaa ccagtccatc ctcgttcctc 300
tgcgtacaga gagagggtcc agcgcgggcg gcgcccactg ccatcgggcc ggggccgggg 360
cgcgtggaca ggaggggtgcg gatagaggca gatcgggggc ccggtcgccc cacgtgcggc 420
cagacaccca tcccggccgc gctctgccgg ctctgatccg gtgccagaca ggagcgacag 480
gggcgaggtg gggaccagcc gccgacctca cctgttttgt tttcttggag gaaattcctc 540
cgctgggggg ccgaggtggc accgcccgct cgccccccgc aagacccagc cggtccgcgc 600
ccgcttacct gctctgcggc cggcggccct ggcgcgggct ctgcgcgggg cggcgccctt 660
cgctccggct gggcaggcag gtcgggctcg ggcgccgccg gctgtcgggc tctcgtcggg 720
tttcgggtga aggccccggc tcccacctgc tgcgcctttt aaccgcgccc caccccgcct 780
ctgccctgac gcggctcggg cgggctgcgg gaggcgagcg ctgtcactcg acgagccccc 840
cgcccccacc tacccggggc gcactagccg ctgggcgcgg accgtccccc tgaggagcaa 900
ggagtgcagg accggggctg tccctccggg gccggatgcg cagagcgggg acctttttcc 960
cgtggcgggg gcgcagggtg ggggacccct aagaagtgca cagtgcgcgg ggccctcttt 1020
ccggcccttg gagggaacgg ggtaccgggg atgcaggggg tagggctctc cctcgggagc 1080
gcagagggcg ggcccagccc cctctgcacg ggtgcaggtg tggggcgcct gctcaggccc 1140
tcgagggaac tcttcctccc tagtgcaccc gtggggagca gtgtgagggg caggctgtgt 1200
ttttgccagg acacatcctc agtctttctg ggtgatccag ccttctcata gcccgcgggg 1260
tgcacagacc tctcctatag gagcctggag gttctttatt aattaatgac cacttagagg 1320
aggtacaggg gttgttttta ttaattacct ccatcctttg aagactcctc cggggaagcg 1380
gagcaggcct tcctcgggac agtgcaccag gagagaccac attgcctccc cgcttttcag 1440
tcaagactag aaagctcagg gccagtacag ggagtggtgc aagggctggt ggggtgggaaa 1500
cgttggaagc tatttaggca cctggcttta caggttcaaa cctgtcacgc atcggacaaa 1560
agatgtgtga cttgcttatt ctacaaaact gttcggtaat taaacgtccc cacctaaacc 1620
atatgccact tgttgggtca tattctccca cgaaacaatt aagatgtctg ttaaaggtca 1680
tggaatttga gccaagactt cataaaaatc cgctttccaa aatattttat ttgaggagaa 1740
caaggttctt aaagaatttg cccaagtc 1768
<210> 13
<211> 1751
<212> DNA
213 <213>
<400> 13
aatcatgcag cctgaatggg catttctctc caagtcgcag ggtttgactg accataaaca 60
tcattccttg ctgtgctttt ctgcccgctc cccaaatcga tgacagcccc aaaccagcaa 120
aggaaatgag aaaagggact taatccggac tctagtcact ttaaacagcc tggtgtgttt 180
ataaaacctg tcgtgcaagt cagaggggca tggtgcatgc agaagtcaaa ctagtccatc 240
ccagttccta ctgcagggca cgagggaggg ggcggcgcgg gtgacaacca ccctgccgcg 300
gttccagttc ccggtgggct cgcaaaggcg ggatgccgat gggaggcaga taaggatgct 360
ggcaaacccc cgcctccccc ccccccaccc cccgcatggt caagactgtc tgtaaccgcc 420
gggccgcctg gagatacttg ccaccccctc gtcccacaaa tctggcgaga aagggaacag 480
accacttcct ttacctgccc gggtttctcg gaggaaatgc tcccactcgc gcttacctgc 540
tcggtgggag ccggctccag gctcgcagcg gcactcagag ctcctaccct gagcgtaggt 600
tggatcaggc gccggcggtt cacagcggga atggaatcgg ggacagtgcg ggtggagccc 660
cggtttccac ctgtggcttc ttttaaccgc gcccccaccc cgcctctgcc tgacgccgca 720
cgggagggct gcgggagagg agcgcgggca ctcgacgcgc cttctgtggt gcgcaccgcc 780
ctctctccgg gacagaggag cggggcgggt ccccttctgt ggagcaaggg gcaggggacc 840
ttccctgtta gggccaggtc ttagtggtac tatattaggg cactcgttgg gatccttctt 900
ctgaagccag ggaccactgc gagtgtcccc taggagagac tccaggtgta ggctggtctt 960
cccttgggtt ggggacagaa ggcttgtccc ttcttgtgga tgtgggtgga gcgtggaccg 1020
cgatgggcaa gctcagccag atcccatcaa ggacagggaa aagttgcccg ctggggcctt 1080
gctggggctg gacactggag ggcccttaat gaagtgaggg ctatccagag tacggggaac 1140
aggcttgtgg acccagctag tagtgagtct ctcctgttgg tcatcctggt aggaagacaa 1200
ctggtttgtt ttcatccttt ctagaccctt tgggcaccct ctcctctaga gcagcctgga 1260
ggttctttat tccttaatga ccacttagga gtctcaaagg tttgttttta ttagtcatct 1320
gaatcccttc ctgcattgtc cagggaaggg gagtggactt ccatcttgag agatccccact 1380
gtgtctgctg tcacatcaag ggcagggtaa ggtcaaggca agcatagagg gtggtacagg 1440
gggtcctggg ctggaaatgt tggaagccat gtaaggacct agttttacag ggcctgccct 1500
gtgctacttc agacaagact tgtaacatgt gtaacttggt tattttacaa aattggctgg 1560
caggtatgtt ctacctgtt gggtcatatt ctcactttag ctacattcta cctgttggtt 1620
cacgttctct cacaaaacga gagtaatagt gcttcctaaa atgtctctcc caggtcatgg 1680
aggttgagtc aacgctttat aaaaacccac cttaataaaa tacttgaacc agagttctcg 1740
gaattggacc c 1751
<210> 14
<211> 3358
<212> DNA
<213> Homo sapiens
<400> 14
taaaagtgag caaacagctt gaaccaatct aaacagctta tttattgag gtaataaact 60
tttccttctt cctgagtttt cctaaattct tctctatcat gaaaatagca ttaatagcta 120
aaattttaag tgtttagagg ttttgccttt caaatccagt aagtctccag agtcaacagg 180
tgctacaaga tgctactggc agtaacagtg cttctccagg attgtggtag gtggtgtcta 240
agggtctttt cagcttgaag gttctgtttc ccagttctgt ctcacttaag atcagatctt 300
ggtgagtata ttggcaaacc atttcattat ttaaatttgt aaaatacagg ctttaggccg 360
ggcgcggtgg ctcacacctg taatcccagc actttgggag gcccaggcgg gcagatcacc 420
tgaggttggg agtttgagac cagcctgacc aacatggtga aactacgtct ctactgaaaa 480
tacaaactta gccaggcttg gtggcacatg cctgtaatcc cagctactcg agaggctgag 540
gcaggagaat cgcttgaacc cgagaggcgg aggttgctgt gagctaagat tgtgccattg 600
cactccagct tgggcaacaa gaatgaaact ccatctcaaa aaaaaaaaaa caacaacaac 660
aacaaaaaca ggctttaatt gtatttcata ctctttaact aactagatat taactataaa 720
atattaacaa tttcaaattt ttgttaaagg aatacattta cacagcttaa aaattcaagt 780
ggaactaaaa ggtttacaag gcaatatttc agtcctctgc cccattctct gctcctccca 840
ccctgtatgc tgtcccagag gcaaccaacg cctttcattt tttagagctc ttctgacgtt 900
tacctttatg tttccaaata atgtgcttat tatgccattt actgattgct ggactttaga 960
cctgttgact ttttctgcta tggtagtgga ggctttagct ctgacctgag ccccactgct 1020
cctgctccac ccaccctct tccctcaccc tcatgacatg atcatggctc atactctggt 1080
caaatacata ttgttattta tattattttg actgcgagca taatgacgtc tggaccaagt 1140
tgtattctat gttacatttt cttttggttg caattgcctc ccttccctga gagtgaacca 1200
tgactggggt tttcatttgc ttggctttct atgtgtctat tgttcggctt ttcctactct 1260
tccaacaaat ctgtcatatg cccggaaaca attttttcaa gttcccagac atggttccgc 1320
acagtccatc tattccatct gtttctttcc cttttcccgg gggctgtggt ctgggcaggg 1380
tgctctggcc ctctgcccag tggtcccctg ggctcccctt gcctttcccc tgggccagag 1440
cttgtgcttt ctggagtccg tgtcttcctg tcttggtctc taccttcatt ttgctgaagc 1500
acacaccttc caggaacttc ctcaggaggg gaatgtggaa ctaaacttct atgcacataa 1560
agtcttcata tcaccctcaa acccgatctg tctccccgcc tccaatgtac tttcctttcc 1620
tctcttattt tctctgtttt tatgaactta cacctttttt cttcactatt gtgtaattgg 1680
catttaagat gggagtagag ataaatgcac ctggtgtaggc tcatactaac cacacgcctc 1740
agtgcatggg tgtttatcag acttctctca atcaagagct gcgctgagta cttgtgaagg 1800
ccctgcaggg ctggtgctga gtaagttcag gattgggcac ctctgagggg tgaggaaatg 1860
gaggttcaga gacgagaagg aacttcccca aggccacatg gttaatgatt ggaagatctg 1920
agattctaaa ccaaacctga gtcgatcact tccctttctg tccactgcac tgataactga 1980
agcccaaggg ctgaggccac acctcagcgt gtgaggatca gcagaggaga ccctgctggc 2040
tgcgggatgt ggataggctt tgaggaagag gaaaagcaca ggcaaaatgt caaagataag 2100
tgggaatgag gttccctgga gcatgagtcg caggtgctca ggaaggtgct ggcagctcta 2160
gagaaggcca gagagaagca cccagtggtg ggagccacag ccccaagaca caggctaaag 2220
ccccagccca gggtgggtga gctccaccct gtcacctatg gggttgcatg caagtggttc 2280
ctctaagcat tggcttcatc tgggaggcgg gggtgacatc gcttctttga gccttatttg 2340
gaggactaaa caacacatgc attttgtcat taggctggtg caaaagtaat tgtggttttt 2400
ttctattact tttaatggta aaaaccgcaa ttagttttgc agcaacatac taactttaaa 2460
gttcttaata catatgagat attatttcta tcagcttaga aggatccatt atgattgtag 2520
aagacctggg atgccagtct gaggaactct tcttttctta agcaaaggag aaacaaaata 2580
attctgatgg ggggagtgact gaccccagtc tggctcaccg gcggctgtga agtcctgagt 2640
gtcctctggc agctgccttt gaaagcgcag tggtgtccgg ggctcgccac tgaatagcgt 2700
ttgttctcag aagggagccc ggtggaaaat ttgaagctgc agttaggaac tgtgtgtatg 2760
gccttggaaa ctgaagatgt tcctttaaaa gaaaaatcac agtgttttta aaactcagat 2820
gacagctttg accattatct gctttcctct cctgccagct ctagagtttt cttgggatgt 2880
tatcaaggat gatatcacaa caatgcccac ttctgttttg tttttaacct gaatgacaaa 2940
ttaaccaatca gcagatgtag gccatccagg gaagtttctt ttaaatgctg gacttttgca 3000
aaaatgtaga gccttggtgg caattgtgat tctttttttt ttcttttctt ttccccaatg 3060
aaggtacttt tttttatgtc cagttttgga aggctcctga agattgtttg agaacttgac 3120
tgctgtgtca gggcagtgct gacactctct gttgccaact gttattcatt attccaaaaa 3180
atcagagaag caaaaacgac ccctccaaac aactccaaga caaactccaa gcaaaacaac 3240
aacacacaca caaacccaca attttccttt ggttgcttct gagaaggagt tttaatggta 3300
tagtaaatac agcatttatc ggatgatttt tgctgccatt gatatgtttc tcttcttg 3358
<210> 15
<211> 5018
<212> DNA
213 <213>
<400> 15
aggaggtgtg tcttcctgga ggaaatatgt cacaagggtg ggctttgagc atttaaaaat 60
ttaccccctt tccaggtttt tctctctgct tcctgcttat ggttcaagat acaaactctc 120
agcttccagc ttcagcccct ctgctctcag agatgctcat ctctctggaa ccatgggtcc 180
aaataaactc tttgttctat aagttaccat ggtcacggtg ctttaccaca gcaacagcaa 240
agtagctaat ataatctttt caaggccacg aaaaagagaa aggcaaacca agagtttggc 300
tgaccaaatc agctgagaac acaaaccttc ccatcctaaa ttccccaatg ttcttttatt 360
tttcatcatg caaatagcca ctgatattta aattatatta atgtgctcat tatggcagtt 420
tcatatattt atatattgta ctttgaacat attcacacac ctccaaatac cctcttctgt 480
cccccacatt ttaagactgg aagtctcgtt ttttcaaatc cattattagg tccttagggt 540
caatggggtc atatgatggt gtctgtggtt ctaattagtg gccagctgga tacctgcaga 600
atcaatgact agtgggtaaa aagtgagcag tcagggtcag cagctcacaa agcgtcagtg 660
agaggcggac aaagagagct ttcagcaacc cctaactggg tgggcagcat gtgagccaag 720
tgtgagtccc tcctttttgg acctgggaga ccagcagagt gtgcaggccc tccgttggct 780
tggcccaggt gataagctga cctcagcagg aattacctca gtcttagtcc agctcctgat 840
gtaagtctca ctcaaaacaa aacaaacaag cctagacaaa accagcttgt tgtctttttt 900
ctgttgtggg aactgctccc actcaggaat ttctcagtgg ccccctcaag gaagtttgct 960
tcttctctgc ttccttccac acatctgtgt ctttctggtt ggagaccatg gacttgagag 1020
ttcaagttga gcttccacta ccctaagtgc ctgggtcaag cacacctgcg ctgagaaggg 1080
tcctgccagt ctcaaaactg catcactaga tcagcagtat actctctcac ttaagcatgg 1140
agtggggagg tgcctttgta tgtcttagca atagtcatct acgtgatttt gaggtcattt 1200
tacttttaaa gtatataatc ttcaaaccaa attcaaagac taggcaaaat ttttaaatta 1260
gcttttaaaa aatgagctgg tttgcttact tccctgatct taattcctat aggcagtatt 1320
gtgaggtaac ttatttaggt ttagggatga tagagaaata atgtcttagg gttttactcc 1380
tgtgaacaga cactatgacc aaggcaacac ttataaagac aatgtttaat tggggctggc 1440
ttacaggttc agttgttcag tccattatca aggcaggaac atggcagtgt ctaggcaggt 1500
atggtgcagg aggagctgag agttctacag cttcatctga aggaagctac gagaatcctg 1560
gcttctagga agctaggatg aggatcttaa agcccacgct cacagtgaca cacttcttcc 1620
aacaaggcca cacctccaaa tagtgccact ccttgggcca agcatattca aatcactatg 1680
ggtactctta aaagaatgca tgttttagct ttaaacattg ttcatttatc cgtgtaacag 1740
actggtttga gatctctcag caaagggagt tatccttata cagggactct tttcattctt 1800
tttcttagtg catattcatt gtagatagtg ctgagttgta taaaggcttt atctatctat 1860
ctatctatct atctatctac atcccaaatg ttgcccccct ccccgtaccc cctcaaagag 1920
ttctttctcc cacccccatt ctctttgcct ttaagaggca acctcctctt atatctcccc 1980
aacctgatgc atcaaatctc tgcaggatta ggcctcaggc cagcccatgt atgctctttg 2040
gttggtgact cagtctctgg aagctcccag gggtccaggt tagttgacac tgttggtttt 2100
cttgtggggt tgccatctgc ttgagggcct tcaatccttc ccctaactct cccacagggg 2160
ttcccaacct ccagtcagtc cagtgtttat ctatgggtat ctggatatcc ccctctgtct 2220
catcagctgc tgggtacagc ctctcagagg cctgctatgc taggctcctg tctgcaagca 2280
caacatagta tcatcaatgg tgtgagtgat gggtgcctgc ccatgggatg ggtctcaaaa 2340
cgatctgatc actggtcagc cattccttca gtctttgctc catctttgtc cctgcctttc 2400
ttttagacaa gatcaatttg gggtcaaatt ataaaggcat tttcatgtta agtgtataat 2460
gtattttgac catgtttccc catatcctcc taccctccca tttgccctcc ccctttctca 2520
ttagtattct ttgttctaga caaatttact ctacttttat ggcatatgac acatacatga 2580
tttaatgaaa cataaaatgg agaatctaca gacaaaagaa agcatgaaat atttggctga 2640
agctgactca actcatttaa tatgacaacc tccatttccc tacaaataag agaatctcat 2700
tctttattgc agactaaaat tccacaggtg tatataccac atttctttcc ctatccctct 2760
gtctttggac acctaggcag gttccaccgt gtagctattg tgagtaatgc tgtagtcaac 2820
attgacatgc aagtgtctct gtgacatgtt gacacagagt tctctggata aacacatagg 2880
agtgtcgtag ctgaatggca gtcgattgag aaaacaaata ataaaagggt tggtgagcag 2940
gtgggaaaag gaaactttga acgcattgct ggtgagaagg aaagtcagtc tagctgctat 3000
ggaaatcagg gcgagggttc ctcaggccct aaaaccagaa ctgccttatg acccaggcag 3060
tcttgacagc tgttgttgtc tgtgcttaag ttcttgactc tgtcagacat agagaaacca 3120
gatctcaggc tagaagttcc ttctttctcc atgttccctt aaccaccctc ttctctcctg 3180
cctcagcctt gtagaagtgt gccttccatt aggcacctaa gaagaggaac ttgacagtca 3240
gctgccacct tctagtgact ggaagaacca aatattctgg atctgaataa aagattttac 3300
attctgcttt gtggctcaca ggagactcag tgacaggccc acctaagcac acacagaaca 3360
gtagagcgac aggttgaaac agcttccagg aggagtgggg ggaggacggg ctgaggaagt 3420
gggatgtgta attccagtag agaaagtcat tggaggtacg gaaggtgctg gcaaccctga 3480
gaaacagcag ctgatccacc agctgcaggg ccaggcctct ggatgcaaca gccaagtcag 3540
agcccagctg ggcctggctg tgttccacct gctccctggg tggccccagg caagtgactc 3600
ccctgagaac tggcttcagt agtgagaaga ggggtggggt gacaatagcc tctttacagg 3660
gttacctaga ggactaaata atgcacatac gcatacacac acacagacat gcacacatag 3720
acgcacacat agacacatag acacagacac acacacagaa acagacactg acacacacat 3780
acacatacac aaagacacac agaaacagac acatacatat atgtatacac acagagatat 3840
acaaatatac atacacacat ggacacaaac acacacatac agaaacagac acacagacac 3900
acacaccaac atataataca cacccatata acacacacat ataacacaca cacacaggca 3960
aacacatggg tttatgggct ctgcagtaca ataaggcttt attttcatca gcttagtcag 4020
cagtagccta caaatattag tgttcaaaag tattttctag gcaagggaga gacagaaagt 4080
ggttgtggtg gggagtgagg ctggtgactg tgagtgggca gtgtctagtg tctggggaca 4140
gctgagattg gcagccccact ggccactgac tagagttgct tcccacaagt gagtccagtg 4200
gaaattttta gtttgctctt agaaactgtg ccttcagcct tggaaactga agatgtttct 4260
ttaaaagaaa aatcgtgctt tttgaaactc aaatgagagc attgcctgcg gtctgctttt 4320
ctctctctct ctctcaccag ttttcctggg atgttatcag ggccaatcat cagaacaatg 4380
ctcacttcta tcttgtgtct aacctggatg acaaatggcc agtcagccga tgtaggtcac 4440
gcaaggaagt ctgtctttcg ggttggactg aggtagccgc agtgcgatgg ctgctttgtt 4500
gtttctttcc cttttcttgt cccaactaaa agcgcttctg gtctgggagt aggggcgact 4560
gaaggctgtt tgagaacttg actgctgggc ccctctaaca ttttctgttg ccaacagctt 4620
actccttttg ctaaaaaaaa aaaaaaaaaa aaaaaaagca aacaagccca aactacttct 4680
tcaaacaatt ctaagacacc acacaaacag aacagactga agccccagta acccagcttt 4740
cccagggatg tttgtgagaa ccagggtagt ttttgatcac tactaaattc tacttaaaca 4800
tttttaaagg atttcttttt cttctcgttt ttaaatttgt tcttcgaata caatgtattt 4860
ttgatcatat gtgcacccct cccccaaccc ctccttctat caagccaacc tggtgttccc 4920
tcccctcccc tctccctcct cctctccctc ccctccctct ctccttccct ttccctcatc 4980
tccccctccc cttcccctca tttccccctc cccttccc 5018
<210> 16
<211> 5079
<212> DNA
<213> Homo sapiens
<400> 16
gttttaatgg tatagtaaat acagcattta tcggatgatt tttgctgcca ttgatatgtt 60
tctcttcttg aaagaggaat tcaaatgaca atgaacattt ttggggtcct cttttatgga 120
gtttgatttt caggggattg tcaggcatgt cgtctccggg ttcccatgct gcacagtccc 180
agcactctct gtggctcagc cttcccgtcc cttgccctct gaataccttg ccgttgactg 240
aatggtcatc gttagcacag gtcatcacaa tacatgactc ctgggcagga ggaacagagg 300
agcggaggtt gtgccatgca tttaaaaccc agttagcatc ccagtgggtc ttccaaggcc 360
gaagatggca aaacgttttt attttacttt gttgaaatca tctgtttccc tccaaatggt 420
gggctgtttg ggcacaaggt catgttgtct tcaatttcat agccccggta cccagcaagg 480
atggctgccc ataggctcta ttaagatgcc gagtgcatcc gtggcacggc caggaggagt 540
gtgctgtggt cagccttcca gaaggaatca atctcctggg agaagtggag aagttggcct 600
gcagcagggg cctcgagaat ggcgggtctc atccaccacc agcaggctcg tctgttgccc 660
agcagtgtga tcctagctga ggttattct ctttccctca ttagactgca gtctcctgaa 720
aggcagggtg tgcacctgac ttgtcttttt gtcccttcat cctgcgccct gcacggtttg 780
atcagtaaat ggtggctgag agacaaggga gtgggaagga aggaggtcag gaggggagag 840
aggtctgagt gcttgaaaga gtccctcctc tgcttcaggg gcttgttctg gggttttctg 900
gatcttcagt acttgcgggt aggatctgag ctctcccggc ccctggtggt tgttggccag 960
gcctggccag cttccagcag cacaggtcat cataatatat gactcctgga caggaggaac 1020
agaggagcgg aggtcgtgcc atgcatttaa aacccagtta gcatcccact gggtcttcca 1080
aggcggaaga tggcaaaacg ttttatttt actttgttga aatgcaggtt gttccttttt 1140
ttttaaccaa cttttatgtt ccaaggctaa aacatagcat aaaacaattt gaaaaagtcg 1200
gtttcaatgt ttcccattgt tcactgagag agggtcacac agggtgcaag gcaacagagg 1260
acaccattgc ttacgtagta cctcgtgagc tgcactgcga gaggcctttc aaaggaaggt 1320
tttatttagg aagcaaggaa tgattaaaaa ctgatggctc taatcaaatg agatttaaaa 1380
ttttccatta aaccttcata gttaggctgc atgcagtggc tcatgcttgt aactccagca 1440
ctttgggagg ctgagatggg aggatcactt gaggccagga ggttgaggct gcagtgagct 1500
gtgactgggg cactgcactt cagtctgagt gacagaggga gactgtatct caaaaaataa 1560
aaaaaattaa aaattaaaag aaataaacct ttaacattgg gtgtaatttt actttccatc 1620
tactccttct tcctcacctg caacgttcaa gagcaggagg gaagatgtga acacacattt 1680
gtgtgtgtgt gtaaacatgc tcatgtgttt ctaaattatc aagtcaggat aagaacttct 1740
actgtgaaat acagatatac aacaatatgt cccaagctat gtttaatgca cttttattat 1800
cctgctagtt cttctaaata tgatcattat acaatagttc tttttttttt tttttttgag 1860
atggagtctt gctctgtcac ctaggctgga gtgcagtagc gcaatctcgg ctcactgcaa 1920
cctccgcccc ccagattcaa gcaattatcc tgactcagcc tcccgagtag ctgggactac 1980
aggcgcgtgc caccacaccc agctaatttt tgtattttta gtagagacgg gggtcttgcc 2040
2100
aaagtgctag gattacaggt gtgagccact gtgcccggcc cattatacaa tagttctaca 2160
aagaaaattt aagagcaagc tctggcttag tctttgaaaa acaagtttgg aatttcctat 2220
acgagtggat aaaatgtcag ctcttggtat tgtccttaag acacagtaca tggtatttac 2280
tctcttttta tagggtaaag atagataaat ccccaaaggc cttggcattt aggaaacaat 2340
catgctttat ctattaactt actctttaag ctctgtcatt ttttgcgtct gagtgagaca 2400
ctctatttac tgagccacag accacctgct agataagcag agactcttcc agggcacaca 2460
gcctggagaa aaaacgcctg aatgcacaac tagaagtatt agcaagtctg gtttaactgt 2520
ccccaaatgt ctaactaaga atattagtgg gccaggcgca gtggctcacg cctgtaatcc 2580
cagcactttg ggaggccgag gcgggcggat catgaggtca ggagatcgag accatcctgg 2640
ctaacacagt gaaaccccat ctctactgaa aatacaaaaa aattagctgg acatggtggc 2700
agccacctgc tctagtccca gctactcggg aggctgaggc aggagaatgg catgaacccg 2760
ggaggcggag cttgcagtga gccgagcccg cgccactgca ctccagcctg ggcgatagag 2820
cgagactctg cctcaaaaaa aaaaaaagaa tattagtgaa tgattagtat atgggaaaca 2880
cctccggacc accctacatt attattagtc ttcactttgt ggtgggtaaa gataaaataa 2940
aagtagctac cgtttattga atgtttacca tgtgtggatg aaaaccatgt taatcattgt 3000
cttctttaat cctcacagca acctaatgaa gtaggtacta taattttgca gatagccaca 3060
ttgagggtga gtgaggttaa acaacttgct catatgactc aaaagtttgg aagccatttt 3120
caaatcagat gtggacaaag tgtgcctttt taaccattgt attattcagt cttcctatga 3180
agacacgcct ctatttgggg catttacttc ctatataact tgatgaaaaa aaacccagca 3240
ttttcattgc ttgcctataa aaactctaaa ggtgtttctg tgggagggtg tgttattcca 3300
ctcagctatt gataaatata gtcctgtctt aatgtttaat gtggatcttt tttctgtttc 3360
atgcttttct gaatttttga gtgaccatgt cactcagaaa agctttgaat cagcaacatt 3420
tccagtggac tgtaggggaaa gcctgttgtt ttggtggaaa gtagagagtc acagatcccc 3480
aaccttcatc tgagccgtgg ttctgcatca gtacagacag gaaaccaact attaggagcc 3540
actacatgaa atagtatttc ctcaggtgag caaaaaattc ttttgctttt gtagattggc 3600
cctgtctata cgtggtagcc actagtcaca tgtggctttt gacgtttgca ttttaattaa 3660
ttaaagtgaa acacaattta aagttcagtc acccctgcca cactataagt gcccagtatt 3720
caatacaact gcccagtggc tgccatgctg ggcggcgcaa acgtagagca cttctgtcct 3780
ggctgaaaat tctactagac agagccatcc aggaatttgg actagcaagc accaagttca 3840
cagttagaga acacagttgc aggccaggcg cggtggctca cgcctgtaat cccagcactt 3900
tgggaggcca aggcggatgg atcacgaaat caggagtttg agaccagcct ggccagcacg 3960
gtgaaacccc atctctacta aaaatacaaa aaattagcca ggcatggtgg tgctcacctg 4020
taatcccagc tactcggggg gctgaggcag aagaatcact tgaacccagg aggcggaggt 4080
tgcagtgagc tgagattgcg tcactgcact ccagcctggg caatagagca agactctgtc 4140
tcaaaaaaaa aaaaaaaaaa aaaaaaaagg aaagaaaaag aaaaaagaga agacagctgc 4200
tttacaaagc aagagggctt caagaatctg gaaaccaaag gagcaatgtc ctttgagttt 4260
ctacaaattt gggccacact gattgggcct ttccacagcc aattccattt gccttcatta 4320
tggaaagtaa acagtttaac ttcctactga catgctctgc agtgcagaca gtaaacagta 4380
gctcaccgct gcttctgcca gctgctctcg ggtgttctac ttgggtgggg aacagcagca 4440
ctggcactgg cactggcccc ggtggcccca cagagcatgg ctccatcagg ctgggtgcta 4500
cagagggatg ccaagaacat ttgggcattg aatgcctctc tctctctctc tctctgaaat 4560
gaaaaccctc atcaattcaa caatagtttc tctaatagaa catatagtga tttgtttcat 4620
ctcaactgtt cccatacaat aatagaaagg agggagtctg tgcctgagag tgcctgcaaa 4680
ccccagggca caccagcccc gtggagccat aacagttgct cacagagaca gcccctcaca 4740
gcagccccg gcacagtgac tcgtgtaatg aaagctggaa aattgcccag gaaaacctga 4800
agatgcattc ctgaagctcc cacactccaa cgcacgcaca cacagacttc tctcctggct 4860
ttaggaacat gaatttacct tgaatcttta aacttaattg aaaatcttgc aaaataacga 4920
gctttccttt gaatcttcat ggcactttgt aataaaatgt ctaaaagggg gccattccat 4980
gaaatcattt aattggcatt aatagtacac tattacttca tataaaatca taatcatata 5040
aatgtactta tataactcca tgtaaattaa tttatataa 5079
<210> 17
<211> 4077
<212> DNA
213 <213>
<400> 17
gggtagtttt tgatcactac taaattctac ttaaacattt ttaaaggatt tcttttctt 60
ctcgttttta aatttgttct tcgaatacaa tgtatttttg atcatatgtg cacccctccc 120
ccaacccctc cttctatcaa gccaacctgg tgttccctcc cctcccctct ccctcctcct 180
ctccctcccc tccctctctc cttccctttc cctcatctcc ccctcccctt cccctcattt 240
ccccctcccc ttcccctccc tcctccttcc cctccctttc tctcccctcc tttacctccc 300
ctctcttccc cttccccctc cctccctccc ttcctccttc ttctggaggt tatggtagca 360
ctaggagtca aatccagagc ctgacactca actgctgatt gaacccctga cccttcttat 420
tttttctgtc catgtttatt ttcttgaagg aggaattaca taaaaaatga gcctttcgga 480
ggtcttcctt ccttgagtct gctgttaggg atgagtcccg tttgaatttc tgtccatggc 540
agggtctagc gccgatttct ctctgatccc cagaacctca ccctgatgag gtttgtgcga 600
tgggtgacac taaacagtgt tttctactaa acagtgggct ttgtggggac agggtgacac 660
tgtcttccac ttgctctgag ttccccgcag gcatcacccc cttcctcccc actggtgccc 720
cactctctct atctgggtag gttgcaggcc ccctcacagt tctacctgga acgtgctggg 780
gtcagcgcag gcaggagctg gctggccttt gtaagactgg ccaactagag cgatgcaaag 840
ccggcctggc accaacccgg gctgctctgc agaaagctag ctgatttcca gcctgagcag 900
gtgcctgtga ctccaggggc agggtctctg tcagacgcac ctctatccat ccttcatctt 960
atccctatgt tctgactgtt aaatggcaac tgagtgagga ggggaaggaa ggcagaggag 1020
gggtctgaga gggatttgag tgttcccagg cccttgcaga ggctgtcccg ggtctggagg 1080
gcttcagcca gggtgtccta tgtaacacag gatcctcaga tagcaggtac tgttaaagag 1140
gaggccatca cacctgtgca tttgagacca tgccaaagca aaaggtgtca acacccgcat 1200
tttactgcat ggaaatgtag ttcgttcctt ttcaaccttt tgtatcgtgg ggctgaagag 1260
atgatgtgaa aggactttaa aaactccact aggcttctct gctttgttca ctgtagaagg 1320
tcacagggag ttcaagaaaa caggctaggg ataggaggat gctcatgtgc ttctcttggg 1380
agcggtggca gggccagctc cgtctcaaag caggctttat ctagaaactg gtgaggtggc 1440
aggagcttag gaggagggag aaattgattt aaatattttc attaaacact ccctcactga 1500
tggtaatttc acttgctctc tccctcttag ccccccacac ttcagaacag gagagagagg 1560
atactcgcat acacacacat ttaagtgcag gcacacacat agatatgtat ttctaaacca 1620
tttttcctgt gaatacaatg atgtgctccg atatatactt aagccagtct tactattaaa 1680
ccatctcttc taaaaaatat gatcaaaaca cagttgttct aaaagcaaac tctaaaagac 1740
tgacctagtc tctgacaatg agtttgaaaa agtgcagctc ttggtgttgt ctgcaaaccc 1800
aacactattt gttgacttga caggcaagac agacaaaccc tcaaagttaa tggtttctct 1860
attcgtttac tctgtaagtg ctctctgcat tcaagcgaga tactgcattg gctgacacat 1920
taaatatgct gagactcttc cagaacgcag caggcagaca acccacggtc aacagtgggg 1980
gaatggtatt tgtctggctt agttatctcc aaatgtctag agagagaata atagtatata 2040
atggtgcatg gaaaacaccc atgagccttg gtgtgttatt agtagtagtt actttatagt 2100
gggtaatgac aaaataaagg tagcttccag tttctgaagg tttactatgt gtggatgtaa 2160
cccttgctaa tcaccacctt agttaatcca aacaacagtc ccatgaagta tgactattat 2220
tatccccatt ttacagacaa acaaaatgag gactacagag gttaataact tgccccaagt 2280
catggtacca aagggtttgg gagccattat ttcagtcaaa ttctaaccaa gtgtgcttag 2340
ccatcgtgcc agaggttcca aggaaggagt ttgcttgttt gttttattta tatcacttga 2400
tgaaataaaa ctaccattcc cattacatat aaaacctcct atagatgcct ccttagcatg 2460
ctgtgtgatt ccactaagct gttgatagac acagtcctcg gggctggggg tgtgggtcat 2520
ttgttagcat gcatgaggtc ttgggtttga tccccagcac tgataaagct ggcatggtga 2580
tgtatgcctg tcaccccagg acttcagaga tggaggaagc cattcagtgc catcaccagc 2640
tacataatga gtaagaaaga gaccagcctg gaacacatgg cattttatct taaaaaaaaa 2700
aaaagacatt cgttttgaca tgtatatttt ttgcttttgt aaattttcaa gggaatgttt 2760
cacccagaag ctttgcactg ctgatggtac acgtctgaaa tgtcagcaat ccagaggctg 2820
aggcaggagg attattgagt tccaggtcag ctgggtctaa acacaggagg aaagtagagc 2880
tttgagtgga caccatgttc agatgctcaa tgatcttcag agttatgctt ttggcagaca 2940
ccacaccaac agaaaaacaa gaacaacaat tgccttcaaa gggagggcag ccttgtgaag 3000
ctctgattca aaggagaatt gtcctttgga gtctgaatga atttggaccg ctctttctga 3060
gcctttccaa ttctactggc atccacaact gaaaacaaac agcggtgccc tgattgccac 3120
agacactctc tgctgggcag acagcacacc gcagttccca ggctgttctg ccagcatctc 3180
tcaggtgttc agcctgggtg gggaattgca acatgtgtag caagccaggt ggccctgcag 3240
agcctgtctc caacttcgat gctgctgggg acacaaagaa cattagggca tggagtggct 3300
ctgtcagtct ctgtgaggga agcccttgct caccacataa catcattccc taggtgtgtt 3360
cctgcacata tcctaatttg ttttaactct gtatttatag tgagaattgt taagagaatc 3420
ttaggactga gcaggactga accagacaga gacagcagtt ccatgttgcc agacagatct 3480
tacacaggct tagcctggtc gcagccacca gaccaggtcc ctgttcagtg agaggtggaa 3540
agaaatacac atggattttt tttttcattt tttgctttgt aaatcatggg ggagatgggaa 3600
aagtttacac atagattttt tttttctttt cgttatttgt tttataagtc attactcact 3660
agcctaggct agcttggagc actctctgta gctcaggctg gccttgaact cttagcatct 3720
cagcttcagc ctcctgagaa ctgggattac atagctatga tactatacct ggcgcccaga 3780
tgtgtttaaa agcctcaact tcccaataga cctagacgct cctttctcag tctgaaggac 3840
acaaatgtac ctcaatctac aaacttaatc acaaatctct caagggtgtt tctgaaactt 3900
cagagcactt tggaacaaac tttcctagtg gggaggtttg tttcttcact catttaactg 3960
gcaaagtcac aactatacaa cttcatttat ttatataatt ctatctaact aatggaaata 4020
agaggtgagg ttagagaaga ggaataactt ttaatattct gtagtaaagt agtgaag 4077
<210> 18
<211> 1501
<212> DNA
213 <213>
<400> 18
gacttgcagt cttcaagaac ggatgatgcc ccaggcaaaa ggggtatcct accctgccac 60
ttagtgggcc ccaaaggaga ggcttctgct ctagggcaaa gcttcatttc cctcttcctt 120
tgagctcact tatttggaat gagtatgtct gccccttgcc tgccctatca tggtcttttg 180
ggaacacaca acaaacctgg ttttgccggt tcacagccag aggacggatt cccttctaca 240
tgggtctgcc tataccagat gatgtgatac tgtgttgact tgggacttgg agtggtttgg 300
gcatgggtta agactttggg ccagttggga tggggtaagt gcgtttagca tgtgaggatg 360
ctaaatatga acttggggga catagagaat atggagttat agacccagtg gtatccttcc 420
agatttgtaa ttaaatctgt acagttcaat acctcaaaat gtgactatat ttggagacag 480
ggcttccatg gggagatgac attgaaatgg ggccgtcagg atggactcta acctgaatga 540
tgtctttgta agagaatcat tagctacaaa gagagcccag gggcacacac ttagaaagga 600
tcccacaagg acacaggaag ggaggtggaca tgtgcaaggc aggcagaggc ctcctgagaa 660
atcggttctg tctgcacctt gatcttggat atccagcctc tagaattatg aatgcattgc 720
cttctttgac aaatctgtat ctaaaagaaa ggaggggtgtt atttgtttta gctcaagttc 780
tagtacaagg tcacttggcc ccttgtgctt gggtggagca tcataacatt tggcagaaga 840
cagccattcg tgtcatagga gataggatgc agaggacaag tggaagggga ggggactgga 900
cacataggca caacacccgt ggtgacctgc ttaccccagc tgggccgata cctcctgaga 960
ttccagcacc atccaaaaca gcaccatgag caggagaaca gatttgagag ccattatgca 1020
tgcaagccat aacagtgagg gaatacattt ctgctaagtc ataagtaata ctgacttcaa 1080
tcttaaaatc ccagggaagc tgatgaagct cagcggtaag gcacttgctg gcgtgctaga 1140
ggctctgggt tcccatccct cccagacaat ttaccagagt cttcccttgg tgttagcagt 1200
tttgggtcct cttgtcttca cattaaaact gacattcaca tggaatgatt tttgctaatg 1260
gtgagaaagg gttcatttta ttctcattaa gagggtcaac taagtaccac acacacacac 1320
acacacacac acacacacac accccacaga ttaatttgcag cccctcggtc ttaagtgatg 1380
caattgctgt gcactcctgt cttgcaggct gtgctctgtt ctattggtgg ttcaccagcc 1440
tgtgccaaca ctgactggaa gaacaagctc tctctggttc atcttcacag tcttggttat 1500
t-1501
<210> 19
<211> 1909
<212> DNA
<213> Homo sapiens
<400> 19
gaatgtttac atgtacattt caaacccagt tttctaattg tgcagtctta atttcctagt 60
taatttcact ttacagataa gaagctctgg agacatggcc tttccggtta aagacacaga 120
gcccaggcac tgcccacggc ttcctccaca ctcatgctgc tttcccttag gtaagacaaa 180
cctcaccaaa gctgagactg gctcaagaaa cggggaagcc taatgcttgt aaacattccc 240
ttaattgggaa gcattaggca ccaaaattct tcctaaaaaa tatgtaagcc ccaagaatga 300
aagggccatg gttagcacaa accgcacctc ctgagcccag caaaacccaa caggcacagt 360
gcagcacagc ctgggcggtc tctcaggtga gtctctgcct cgctcttgcc ctgtctgtca 420
cctcatctct gccaagtctg aaaatcctga gctccaggga ctgtgggaac ttcactagac 480
atgtgtgaac aactctacat tctgatccgt agcgtctccc taatgatgca catctaggaa 540
ggagaggggag ggagaggggag cgtgtgcatt ccttggagca acgaggacag cctagtgatt 600
tgcaaactct ttgcggcctc ctggtgggct tcagaatcaa tttgtgagtc ccaaccagaa 660
ttttctacat aattagaata aaacagagtt aagatatgag tgcatcgtat gttgcaagat 720
actgttttgt aaacgttgtt tcagatattt gtgagtgcac atgtgtgtgt gcagtaatgg 780
gtcacaaaat atatttactc tgggtcatgt tttaagaggg ctagaaggca acactaacat 840
aggatggttg gaagatggtc aggctcagaa catcagattt tgcctccttc cagggtacca 900
cttttatcaa gtcacacatt ccttcccgct ctgcttttgt gtttctcaat cgctatccaa 960
atttgcgcag aagtcaggaa tcacgtgggt aaagatttaa gctgtacttc tggtttaatt 1020
aagcacgttg aagaagaggt gctctggggg aacgtggaga aggtgggtag cgagggctcc 1080
aggggctcag aaggtggcct cgaggggctc tcatctgcca tccttgtgag ggagaaagtc 1140
ctaaaccagt cgtaacattg ccagaacaag gggtcccaat ccagacctcc aaagagggtg 1200
cttggatctc tcatgggaag gaattcaagg tgagtcacaa agtgctgtga gaagagagag 1260
ttttttggaa gttacgcaga tacagagtag ggtgtcctca gaaagcaaga ggaggaactg 1320
cctcgtcttt aagtttttct tacataggag tcctctctat gtaaagacag agctaagctg 1380
tgtctctatg tgggtgggct gacagcgtga caaaatttat tattctgttg atttaaagaa 1440
aactatactc aatattttaa tgtgtaagta catcaagtca taattataat tatcttgaaa 1500
gcatatattg ttatgggtat tgggacctct ggacttttcg ttgtcatatg attgtatcct 1560
tgcaggtatc tttaggctgt ttcttcaact gtaaatatct tatgactgtg ggtcgtgacc 1620
ggcaaggaat ggagttggtt tttaaaatgg tgtcaccctg gctcttctat gctcctgttt 1680
ccctaacagt aatagcccag ccattctctc ccatgttctc ctctgccctc aacttcagaa 1740
tgaagtcaat tttatttca gccaaaatag gaggattcta ttctgtctgt tgaggtctgc 1800
tgtggtctaa tgatgttaat aaccagtggc tgggcatgat tacacgacga ggattctaaa 1860
tcctgtttca tgtttccctc tgggccccact ggctatatga ccccttaaa 1909
<210> 20
<211> 1201
<212> DNA
213 <213>
<400> 20
gagtatatat gtttctaagc caggttccta actatgtagt attaatttcc taatgaaaca 60
ccctttacag gtagtgaggc ctttggagac cagggcttta aaggccaagt agctgaagcc 120
cagggtcttt ccatggcttc ttcctatgac tgtttatcta atagatgaga caaacctttt 180
caaaactgat tatcagttaa gttccaagaa agcaccactg taaatgttaa tgttcctttg 240
aaatggaagt atttagcgct ctgtgtgtgt gtgtgagtgt gtgtgtgttg tgcagttggg 300
tacatatatg cagatatgca caattgtttg tgtttgtggg tctttgtgg tgtgtgcagg 360
tctaaagttt ttcttttcat tagttatggt ctaaagtggt tttaaaaaaa gaaaaagaag 420
agcagagaag gctatgatag catgaggttc ctttgggatt gtctggctta gaacgctagg 480
ttttcccatg ttttaacagc ttcccatgtc cttcccactc tgcctttgtc tttctcattg 540
tgatccagat ttgccccaga gggggagaac ccagtaggta agagttcacg ctgtacttcc 600
atgttaatta agtgatgtgg aagtcttgga aaggctgggc agtttttcct gtcttcccag 660
gagctggggg aggttcatcc ttaatggaac cagttccatg ccatccccag gaggcaagaa 720
gtctggaaac atcaataatt attcagtcac aacaacccac tttcctctct ccccctaatc 780
ctcaactgct gacttcagga caaagtccat ctgatttcaa tcagatagga agactagtta 840
gaggcctgcc ccagtttact ggctgcagca acaggaagca caggttacaa taccaagtga 900
ttccacgctg aaagcttcac tctgatcatc ctaccaggct gctacatgag cccttgaaag 960
cgaattatcc ccggagactt actttctata taacacatat atacttacat atacatgtcg 1020
actttgtttt ttcttgtatg ctgtaaagat gcctaggata catttaagga tgcaacataa 1080
aagtcacttt cttcatggag taattattat aatagtactt gtttctgggg gagcaaattg 1140
aaatgtttcc cagtgtgaac tgccaagtta aaacaacaaa aagctagttg gagctccccc 1200
t-1201
<210> 21
<211> 3995
<212> DNA
<213> Homo sapiens
<400> 21
ctaacatagg gtcgttagtg tcagaactga attaaattgt agggacatgca ggtggtgact 60
gcagagaatt ggagcattgc ttggagtgaa aaccaagccc acatatttgg tgtcaaaagt 120
gttatacaag tagaaaaaca ggttctcttt aatggaatat tattcagccg tattaaggaa 180
tgaggttcag acccatacta cagcacatat gaatctccaa aatattgtgt ttagtgaaat 240
aatatagaca caaaggacaa atactgtata attgcactta catgaggtgc ctggaatagg 300
caaatccata gagacaggca gtagaatcat ggttgccagg ggctgggcgg gagggagaat 360
ggagagttag tgcttaatgg gtacagagtt tctgtttaga ggtgatgaaa acagtttgga 420
aatagtggtg atgattgtac tatattgtga atgtatgtaa tgccactcac cgaacactct 480
aaagtgtttg aaatagcaaa tttctattat acgtatttta ccatagtttt taagttaatt 540
accatagttt ttaaaagtta ataggataat attccctgaa ccactataca ctttagattg 600
gtacactgtg tggcatgtgc attatatctc aatgaagttg ttaaaaacaa gatttaaaag 660
cagagattgg gtaaagtaaa ggtttgctct gtgctgagct gtgtggcatg tggacctgtt 720
ttcccaggag ggagcactcc tggggttttg gccgcagctg cacatcagcc ccctgtgcag 780
aggaggtatg gtgtgtgatc tggagattag ctgtttctag tgcagtattt acatttaaag 840
acattgctga gttaggcaga attttctata tccatttgta ttttgcttgg cattcacttt 900
cttacaaaaa tggacaatca agacaaagaa aacaaaaggt ccaattacta ctcttcattt 960
caccccaaag caaaacaata ttagttttca attttttttt cccatagaaa gcaataacag 1020
tcccatacta cctcctcttc catgaaagta gtgcttgaga tgccccaagg aaaaaccatt 1080
ctttccaaag atgaaagact ttgtacctgt caggtgaaga gatggaataa atgccactcc 1140
tagtgggtgt gggacttgtg cagcccctgg tccccagtta tctgcttatc agaatgtggt 1200
ttgcatatca cctttagcgg aattccttgg gatgcttgta attctggggg agatgtctgg 1260
agtctgcatt tttagccagt actcctatga cttaggcaca gtaggggaacc actggtgcca 1320
ttccttcctt cctttcttcc ttccttcctt ccttccttct ttccttcctt ccttccttcc 1380
tccctccctc cgtccttccc tccctccttc tttctctctt tctttctttc ttcggagtct 1440
cactctgtca cccaagctgg attgcaatgg tgtgatcttg gctcactgca acctctgtct 1500
tctgggttca agtgattctc ctgcctcagc ctgctcagta gctggtatta taggtgtgca 1560
ccaccacacc cagctaattt ttttggattt tagtggaggg gtttcaccac gttgagcagg 1620
ctgatcttga actcctggct tcaaatgatc cacccgcctc agcctcccaa agtacttgga 1680
ttacaggcgt gaaccactgc gccctgctgc aatgcttttg ctttccgtat acaaggaggg 1740
gttgcaggct tgactctaaa atgattgact ttatggagga ccgtctcatg tctggatggt 1800
aagtgatagg ggagggggca accctaaatg ggatcccaat gacttgatga aagactggaa 1860
gatgagacac tttcaggtgt gcataatgga agacttacgt aggactagga ccaagcctct 1920
caattatact aagttgtcca tgattgacca gggatttgat gaaaatccca ctgccttcct 1980
agaaaggtta agagaggcct tggtaaagca cacctctcta tctcctgatt cagtcaaggg 2040
acagctaatc ctaaaggatg aatttggctg ggcatggtgg ctcatgcgtg taatcccagc 2100
actttgggag gctgaggtgg gaggatcacc tgaggtcaag agtttgagac cagccttgtc 2160
aacgtggtga aaccctgtct ctactaaaaa tacaaaaaaa attagctggg tgtggtggca 2220
ggtgcctgta atctcagcta ctcgggaggt ggaggcagga gaattgtttg aatctggggag 2280
gcagaggttt gcagggaacc tagatcgcac cattgcactc caacctgggt gacaagcaaa 2340
actccatctc aaaaaaataa aagggataaa tttattactc aagctgcccg atatcaggag 2400
gaagttgcag aaaggggccc tgggtccaga aagtacatta gaggacctcc tgaaaatggc 2460
caccttggtc ttttatgatt gagacaggga ggcctgggaa agagagagga gatacaggta 2520
ttccagggtg cacctgttaa cttctaaaga tatggcaaga acagttctct ctcttctaaa 2580
gtttatctgc ccccgtacaa ggtttaattt ctttcaccag ggtgaaacag cttggagtac 2640
aatgttgttg ttagtatatt tcacttatct ctgttggcac taaattcttt ccttgtataa 2700
tacacatgtt taacttatgc atacttgacc ttataaaact tgtttttttc tctcatgcct 2760
agaagccatc aaactccaaa tggtcaggca actggagcct cagatgatag ctcccctttg 2820
ctaggaaccc ttaaatagac ctctgggagg actctgactg ccattttctc caaaacaaca 2880
ccccttgtca gcaggaagca gcaagactgg tcatcaacca tattctaacg gcagtattcc 2940
tatgatttag ccagtgggcc gtgaccggca aggaatgtgc cttgttagtt tcaagatgga 3000
gttgattttt aaaatcatgt caccctggct cttctatgct cctgttcccc taacagtaat 3060
agcccagcca ttctctgcca tgttttcctc tgcccccagc ttccgaatga agtcaatttt 3120
tatttcttca acgtacctct tcagagggga aattatacag gaggggggca gggaagtgct 3180
gggtagagaa aggtggatcc ccagctaggg ttccaccccc acagacctag gtgaggaaag 3240
gcacttctgg cttcacaccc aaatgttgca ttttcgaaga ccaacctggc ctgccatgcc 3300
cccattctgg gcctataaaa acccaccacc ctagcggaca gacacacagg tggccagacg 3360
tcaagaacag cacatcagca gttgaagaca caaaagggtg gacgacaaga aggcatcaca 3420
agagaacgtc aagggagcac gccgatggaa gaacctgctg gcaggctatc cactgttggc 3480
atgaggggga gtttggctgg ggcagtcaga gaagagcccg gctgcatagc ggcccaattc 3540
caggggaaaa ccatctctct tttggctccc ccggcagaga gctacttctg ctcaataaaa 3600
cttggctttt attcaccaag cccaggtgtg atccgattct tccggtacac caaagcaaga 3660
atccctctgt ccttgtgaca aggtagaggg tctaattgag ctggttaata caagccacct 3720
atagagagca aactaagaaa gcaccctgta acacaggccc actggggctt caggagctgt 3780
aaacattcac ccctagacac tgccgtgggg tcggagcccc ccagcctgcc tatctgtatg 3840
ctcccctaga ggtttgtgca gtgaggcact gaggaagtga gccatactcc catccacgcc 3900
ctacaaaggg gataagggaa tctttcctgt ttcataagta gcaatctctg tggtaacagc 3960
ccctgtggtg atgccgtctc tctcggttct gccct 3995
<210> 22
<211> 1651
<212> DNA
213 <213>
<400> 22
tccttggcta ctttctctag ctcctccatt gggagcccta tgatccatcc attagctgac 60
tgatgacact gcattcttta atatatgggg tttgcactaa cttggggtag ttatgtcat 120
gtttgaacta aattatagga cctccagttg ctggagaatt gctctgtgtg gactgtccac 180
acatatttgg tttctaaaat gtcatataag cagacactgc agtttctcca cagtggaatc 240
ttacccgggc ataataaggg aagacattcg gcacaagctt caacacaggt gaaccttaga 300
360
gacacctggc agaggccagc ttaaagagac aggcagaaga tgtgagtccc aaggactgcg 420
gaggggaaat gacagccagt gttttgtggg tgctgagggc aacagtttgg agtagacaat 480
ggtgatgcag ggctgtgaac gggctcagtg ccgctcactg aaccaaacag cctaagtgtt 540
tataataaca aaagtaatac tgacatacac cttccgttgt ttgaaagagt taataaggta 600
acattcccca aatcacttta aacaggcaaa ctatgtgaaa tataaatctg tttctgtgaa 660
gctgcttttt taaatgcttc tcctatcaga ggtcagaaga aagaaggctt gctggggagtg 720
gagttggctg tgtatctcag acctgttttt gcaggaggag tgtgcgctcc gggatttggc 780
agcggctcga gtcatccctg tgagaggcag gcatggtgcg tgatcctggg gcttttctgt 840
ttctagtgtt ctatttattt taaagacatt gctgagttca gcagaaatgt ttcacatcca 900
tttgtatttt ccttggtact catttcctta caaaaatgac gatcaaagca aagaaaacag 960
agaatcttca ttttacccca aagcaaagtg agtgcacttc taataccata acagaaaaaa 1020
cgcttcgggc ccttaggaag tgctgaagaa gctgggcaag gtggtgggtg cctttagacc 1080
caaaggaaag tgattttctc caaatgtgag aggcctgcga tgatggggtg agtggccccc 1140
agaggatgtg gggactgact agcgctgtct ccgtctgtat gcccagtgaa gctgtgggtg 1200
ggacacaatt aacagcacaa gtctgagtgg tgagaccctc tgctgtgacg aaccctgcac 1260
tgatgttact gttgaaggta tctctcaagt gctcatgctg gaaactaagc ccccagtttc 1320
tagttgatgt tgtttggagg tgggatctta tgggaggggga ttaggattag atgatgtcat 1380
aggggtgggg cctccacaat ggcattaatt gctttagagg aagcagacaa gaccaaacta 1440
gcacatttac gctgtcttac cgtgagagta atctgccatc ttctgaggca ggtgagttga 1500
tatcaccaga tgcccacacc atgcatttgg gctccacagt ctccagaatc ataggttttg 1560
aacctttat ctttataagt tttctagact ggggcattct gttacagcag caagaactag 1620
actaatatac atccctcctt ccatctgccc a 1651
<210> 23
<211> 751
<212> DNA
213 <213>
<400> 23
tgtgtgcacc agctttgact gctgctggag gctgcccatt tcctgtgatc tcaaccagct 60
tttctgatag gccagtttat ctctggactc tggcctatgc ctgatacaga tgtaatcagg 120
catccaggaa gctatctata tggaggcaaa ggtcctttta ttcaggccac tggaagcctc 180
ttccataaag ttcagtagta cgagtacagt gtcctttcct gtgtacagcc cctcgctttc 240
tcttctggac tcccagctga gccagtgttt gagccaccca tcactctgaa aacagcatct 300
tcatctcctt aggctcagct tctcaagtca cacaggctac attgctgccc tcagggtgag 360
cctcccttca ttcatctcgg tgataattct aaacaatggc ctgtgtgtta tagaaaggcc 420
ctgcaagcat acatgttatc aacttactag ctgtgcccaa ggttgcatag ctagtaagtg 480
gtaagactga aatttgagcc taggggacca taactctaaa caatgttcta tccactaggc 540
ggtactgtgt agaccatggg ctcacacaca cacacacaca cacacacaca aaatgtattg 600
aataaaataa ttgtgggttt tgcatatttt cctgttttat gtcagcttga cacaagctag 660
aatcatttgt gaagagggac tctcaattga gaaaatgctt ccactttttg ttgttttgtt 720
tgttgttttt gcctgtcgga aagtctgcac t 751
<210> 24
<211> 490
<212> DNA
<213> Homo sapiens
<400> 24
ctgtggagtg cctatagcac tgtgtgtagg cagaatgcaa aggggacagt gtgggtgggg 60
acagtgttgg tgtagaaatg gcggggaggt tagattgcag gcacagaggg cctcagccat 120
ctcgagagcc cagacttcct ccctgaggtg atggcacttg gggaagtcag tcatggaagg 180
attttaagaa agatgtgaaa ggggcaggtt tctattttca gaaaaccatt ctgggccagt 240
ggaagatgga gtacacagga ccacaccttg gtgaagggag attgtaggag cctgggcttg 300
gtggcggggg acagtggaga gaacagcctg ggatgtatga acatggcaag tctcccttcc 360
tggacagtgg ggtttgccta tggtggacag aaggtgagat catcctttga aaaatgccac 420
ttcatagtgt ttccccagct gtgggccttc actcattgga gggtcaaata atcaatgtat 480
taggttgcaa 490
<210> 25
<211> 1505
<212> DNA
213 <213>
<400> 25
tcccagagaa cctaagcctg attcccagca cccaaaggac tgcttacaac caactgaaac 60
tccagttcag ggatccaaca ccctcttctg gcctctgtag gcaccaggct tgcatgtggt 120
acccagacat tcgtgcaagc aaaacactca tacatataaa aatagataaa taaatgccta 180
tttaaaaccc ttgcctcatc tgaaattatc tgaatgttga tttctttgga ttccctttcc 240
ttttgccctt gggaaaaata ggtcacccct gtgtcagtta ctgtatgttt tggtcactgt 300
tcatagtttt agagaggatg tctaggaggg cagggtcacc tgtggtgtgg caattggggag 360
ctccatgtgc agaaggaatg cagacacagc agcagagagt gcaggaggcc cggaaggttc 420
caccatcccc acagccccac ttcctccctc tgccgaaggg gttgggggtc aggcagaggc 480
tttaagaggg gcgtggacag ggtagatttc tgttttggga aaaccatcta tcagagggca 540
gaggacaggg tggaacccaa cacagctgag agcttgcaag gggctgggct gggcagcagt 600
gaagaggaac ctcacaggga ggagcccctg gggtgcaggg gctctgaaac tgccctgtga 660
aaaacactgc ctcattgtct tggcagtttg ggccctgacc cagtagcagc aggtcagaca 720
attgttatat aaagttccga aaattcaaac ctcccccttc ctccttcatc cttcttagct 780
acacgtgtgt ccatgagtgg cagagcaggc actcacatag aggtgtgccc actgcagcgg 840
ctacagcact aaagaaaatc cctctctccc cttcctctcc ccctttcttt tacttcaaag 900
cagagtctta ctatagggcc cggcccctgt gggctgctca cttttaatcc tctgccttgg 960
cctatctagc actgagatca cacacctgcc tgtgtcacta tgcctggctt ccagcacttc 1020
tttgagtgct gacagacacc tcaagtggaa aattcttgtc cttgcttcat ttgacagatc 1080
acagtgaaaa tgggagccca ctaaaaatac tttataggat taccctcggg ctgtgtctga 1140
ggcgggtagg taacataagg aatttcaggg ttagacttta gtcctgtcac caagacatct 1200
atctctttat acatataaaa gtattccaca gtctgaaaaa agctctgaaa tagagaatgc 1260
ttcttgtcca tagcatcata gatagagacc cttcagactt gtatataaaa cagaattgaa 1320
aagtcaattc aggtgtgcac acacacatgc atgcacgcac cagcacgcct gacatctctc 1380
agggctgccg ggcatcactc aggtgactgc ttgacgtgtt gatgtttgtg tctttggctt 1440
cttctttgag tcttttgttt ttcttctttt attttattta tgagacaggg ttgagttcat 1500
tgcat 1505
<210> 26
<211> 1840
<212> DNA
<213> Homo sapiens
<400> 26
cacaccattg catgcttcag ccgttgcccg tgctatttcc tcccttggaa agccctctac 60
tgtgaggccc tcacctctca accctctccc tggcccccat gttgtctatg tgatttcttg 120
ccatttaaaa atctacccag gtgtcagcgc ttgggcagtt tcctcacacc tctcacccag 180
ttcatcctcc cttgcttggt gctatttctg cccttgtcca tatcccccacc acagcatgca 240
ctttggattc caggcacgct ccttgagtgt gaccccgagg ccctctgtgg gctcttggag 300
cagggcaaag ctgggtgtgc tggggcgcag cacgggcctg atgccctgag gttgtttgtt 360
gtgctgggct ggaggcgttc gaagaaacgt ccaaggaggc tgctagactc agttctttct 420
ttctgttttc cctccacctc ctctgctagt ggaagctcca tgtctcccag gctcgtgagc 480
tggcaaacac cccgcttgca tggttcagtg ttgtcgttgg cggcaggcgt acgtggaagg 540
ccagttacag agggtctcta gggctaatgc atttcacaac acaccgccct ctgacactcc 600
acgctctgct tttcctccag aaccactccc tttgcaaaac tctgtttcaa acaaaaagag 660
cacaaagagg ctgaccgtgc cttcctccaa ccaagctccc ctctccacag gtgcacagca 720
agagcccttt gtctgtgatg ggacaggcct gggctccagt gagcaagaca ggcactgtgg 780
gcccatccaa atattaactg tggacacttt cctactttga aaacatgaga ctttgtactc 840
agagccctgc cctccagaga acacaattac ttctgttttt cttttcctag tggaaggagg 900
cttgacactg gtgatggcct tgcctttaca atgctcaggg tttgggaaag tcagggccta 960
gggctgctga tctccaggca ctgtctgctt tccatctatc ctctctgctt ggtccctgaa 1020
aagcaggagg gagacaggag gaatggggagc atgaatgccc tcagggtcca cgggggatcc 1080
cggaaggcct agaacaccag gggtctgggc tccacccatg atggatcatg cctttggggg 1140
aagattggcc tacactcatg tcaagtaata agttttactt cctgcacctg gtgttaggtt 1200
ggttctaaga tgcagctgta acctgtgact aagatcaata tttttcatgt cactatctga 1260
tcatacaatg gtcaatttat cgatttagaa aattgttgca caacgaggca acaccgagtc 1320
atgacttaaa aaaaaaaaaa gtggatctaa ccgaagctag attgtggctt atcacctttg 1380
attgtcagtt tcttgggtca aatcttaatg ccacattgac cactgtgtca agagaggcca 1440
ggttccaact cagctccgtg tatagtgttc atggaatctc aatgctcatc aggcgctgct 1500
ggggctgggc ctcggggagg ggcaggctcc tgtcagcaca agtcaccagc acaggtttta 1560
accagccagt ctgggctact tttaccactg aagcagtggg gcgagaaact ctattttaca 1620
gtgtttctaa aacctctgtg agctaaaagt agaagcaact caaatgcccc tcacctgatg 1680
aataaacaaa cacagtgtgg catcctcgta caatggagta ttattcagcc atagaaaggg 1740
aggaaatagt tgtgctcgat acagtatgga tgaggcttgg agacatgatg ataagtgaaa 1800
agaagccaat cacaaaagga caaataatgt atgattccat 1840
<210> 27
<211> 1451
<212> DNA
213 <213>
<400> 27
taagccatca catgcttcaa ccatgggcta cttccacctg ctcccccccc ccccacacac 60
acacactgct acccctcacc cccagcttgg tgcctcactt ctcaggctat aatgctgctt 120
tcatggacat tccttgttct ttggaaacaa gggcccttcc ctctgcagag ttctcctgcc 180
tgaggctgtg tgttcttggt ttgtgggcct ttgcccagct ggtgcccagt gcaaggtgcc 240
ctgctaactg aacaaatgac cttgctcatc gtcatcttct tggtctccat ctttgtggtg 300
gagccttctg gaccaccggc aggtaccctt tgcaggacag cctatcctgc cctgtctccc 360
tacagagcca ctccctgaag ctgcagaaaa caagagagca tagaggtgac cctctccaca 420
ggtgtgtggc cagagccact catccacagt ggccaggccc atccaaatat taatgatggg 480
tgttttctgc tttgaagttg agaatgtcgg tcctcaagag tccaccctga agagaacaca 540
accacatctg tttccttcca gggaacaggg gctgcactgc ccttcttctc tgtccgtgcc 600
cagagcatgt atctgagcat gcccagagcc aaacacagca tctatttcct actgatcttc 660
acagctggac aggctcccac acagccagat gctccctggg gagcctcaaa agcaaggttc 720
accaggtgga gctctgggga aattgctttc aactctgtct tggcagggct tgccttctgc 780
acctggcttt aggagggctc caagatgcag cataacatgg gacggatatc aacgcttctg 840
tctgatctta taacaaaggt caatttgtaa agttgatacc accaagtcct ttcttccttc 900
ctttcttcca caccccgtcc tctctgagaa aatggatcca atagaagcta gagtgtgact 960
tgtaggttct gactgtcact tctttggggt gaattttaat gccaaatcag ccaggggcga 1020
agctgaggag agccaagttc acacacagtt cagcacgaag ttttaattca gtcccatccg 1080
tccgaatctg cactgctgtg ggtgggttaa agggagagca ggctcctgac agcatgtgct 1140
ccagcacagg tgagtctgtc acactttttc ctacagctgc caggcaagac gtcaagtcta 1200
cttaaggttt cttatgcctg gaatcgccta aaacgtaaag caatcaaaat gtctatcacc 1260
caaagagtag ccagacaaaa cacagcaggt ccttttatga agagtcctgt gtcacaagac 1320
acaggaatat caattctcag ccattaaaag gcacgctgta atgacactgg ccacgatatg 1380
ccacatctta gaaatattac aataagtcaa agaagccagc agcaaaaggc taactaatgt 1440
attatttcca t 1451
<210> 28
<211> 6212
<212> DNA
<213> Homo sapiens
<400> 28
ctctaggtgg tgaaaatgac cagatttggt tgtggggtca tagtggacac taaagatcag 60
caaggggaaaa aagatgtgac tataaacttt ccattctcac agttgttttg agacccgagt 120
gtacgtttaa tgttttcaac agaagaggct gcatgaagaa gagtaagtta accgcgggga 180
ggctgtgaga atttttctgc gcggacaatg gagctcagtg tctgtttcag tgtttgtgct 240
ctctatagat acctggatga ttcttgggcc tcagtgtgtt ctcgctccct ccctgccgag 300
actcaaaggg atgatgcacg ctgcccagcc aaaaccagga cagaacgtct ttttccccgt 360
gggaatgcgc tcccggcgcc aattccaagg cctgcctggg tcctattcag gcagtgctgg 420
ggtgagcagc aggctcgggc ccagctgaca cggccagaga tccccagtga ctactttcct 480
gacatggcag agatggcaga tggagaatcc ataagcccca gttacacccg ggagctcaca 540
ctgtggcttc agtctccaag gagagtgggg agagccctgg ccctccgtga aggattgctt 600
ccgcccaagg ggggccagtg aacccgaatc actctgctgg atggtgctgg ggggctgatg 660
caatctgcat tccttcccct cgcacccctt acccctcgct acctccccct tctcatcctc 720
cccactcgca cctctccttc tcccacacct ggctgacacc cactcttgag tcactgtcag 780
ctccaagaca gaaccggcat cctgggtgct tggcaggagc caaaggagca tgttacagga 840
tctctggctt cacagatggg gagagagcag ttcagagaat tgcgggttcc acatttgctt 900
gaagtcactc atcagccttt atgttacatt acaacaaagc agcccagggg acatggactc 960
atagggtacc tggtgtttcc ccaactgtag gggggattcc gggacaaata aagtttgcca 1020
ctgggaccct cccccgaact gtgccctgtc ccactcctgt gacacactct ctgccccacaa 1080
gagagtggcc aacagtggag gctgagagtg accacctgcc tgccctcagt tattaaaggc 1140
tactggagaa caagccttga gtgcgtgctg agaacacatg cccctagctg ccatcaaaga 1200
gaatcacttc atatgatttt gaccataagc aaactcttcc accttcattt tttaaaataa 1260
cggctttatt gagatatgca tcacttacca tgaaactcac tcttttaaag tgtacaaccc 1320
agggttttca gtgtattcac ggaattgtgc aaccatcacc catcacccct aatttcagga 1380
catttttatc actccaaaaa gaaactttgc acacatcatt cttctctccc cacagcctct 1440
gacaactgct gatctatttt gtctctatgg atttagcagt catggacatt tcatatacat 1500
ggaatcatac actatatgtc ctttcatgac tgacatctgt cacttagcat gattttatga 1560
gattcatcat gttggagcat gcacccatgc ttccatcctt tctttttttt ttttcacagt 1620
cttgctctgt cgtgcaggct gaagtgcaat ggcacgattt tggctcactg caacctctgc 1680
ctcccaggtt caagccattc tcctgcctca gcctcccagg tagctgggac tacaggtatg 1740
tgccactatg cctggctaat ttttttgtat ttttagtaga gatggagttt caccatgctg 1800
gccaggctgg tctcaaactc ctgacctcaa gtgatctgcc cgcttcggcc tcccaaagtg 1860
ctgggattac agacgtgagc caccacatcc tttctaaggc tgaatagtat tgcactgtat 1920
ggatagacca catttagttt atctgcctgc tggcttatgg acaatgagtc actccacttt 1980
ttggctacta tgaatcatgc tgttgtgagc acttgtgtac atgtctttat atggatgtct 2040
gttttccctt ccattgggtt tgcttggggg tggaattgct gggccacctt ctttctccat 2100
gagtggagca tgcctatgcg cccatccccg catctcccat gtgtggaggc actgcccaag 2160
ctcgtctgta ctctgagtca cagggctgg caccattacc gatcaccatc tatgggtcag 2220
ggacttatca atgagcaaga catagcccct gccatcacta actcacattc tgcatcgtcc 2280
tgtgccatcc ccaccacccc accttggtca ggcccagtgt ccaggtgtct tcaactgctc 2340
accttccccc tattttgttg ccctgaagtt catccagaca tcagggtgcc ctattgaaaa 2400
tgctagttaa tatgacctct ctgctctaac cccaatgttg gagtcttgtc atcagtggga 2460
tagagctggt gtgactgcac cagaccagtc aggttcaact tttatgaaag gaagttgtga 2520
gttgctttca gttgccatgg accccaagtc gtaggtcatg taagctgagc atgcccaaac 2580
ggaccaagca tgcaaccatg ggcagaacct gagtgctcag actgaggagc aggggctgaa 2640
ttaagaagca gagcatacat ggcaggatcc aggatccagg agccaatcag actgagtttg 2700
gcatcactcc atggcaggat ccaatcagat cacacctccc tgcagcacct cattgcaaga 2760
tccaatcaga ccacacctca ttaccctagg cttataaaat ccaggccagc cgctagcttg 2820
gggaggcaga tttgagtgtt tttttttttc tgtctccttg ccagactacc agcaaaaaag 2880
gttttctttt ctcaaaagcc ggtgtcatgg tattggcctc tgtgcacatt gggcagtgag 2940
cccactgatt gctcagtaac atgggcacac tctggggccc acacaagcca ggaatgatgt 3000
ggcctttacc tgctgctcca gctgcatctg agcccagtat cccctgaaca caaaccccca 3060
cctgcatgga gctgcatgcg gttctcgggt acctcctggc tatgttcagc tcctgtagat 3120
tccttcagat ccactccttc ccatttcctc atccaactgc ccagcagagt gcctactatg 3180
cgccacacac tgggattcag cagtaaacga cacaaacatg atccccaccc ttatccttct 3240
cccaggactc ttattaatct aaggctcacc tcccttcttg taacttccat gaactcatat 3300
gctccctctc agctcaggga cgttgctgga ggaagcaaga gagcagcaga tgaaccctta 3360
tgttcaggag gcagatggag ctcattcaaa gcccaccttg gcctcttctt aacccgaaga 3420
ttttagcaag tcatataacc tttgaactgc aactccctgg attgtggaat gcccaaagtg 3480
tgctgagcgt gaagtaaata atgcaagtgt aaagtgtgcg gcatggtcct ggttcatctc 3540
aggaggccgt taggaaacta gcacttattt ttgccagggc ttgagcatag aacatatactaa 3600
tttccccaat ggcattatca cattgtatta ctttttattt acatgttctt tctcccctac 3660
caatctcaga gaatctcaag ggcagcaatg attaattatt aattttggaa tccttggttc 3720
ctggcacatt ccttgaaaat aaatcattgg cttactttcc actgattctc ttaattaccc 3780
ctgagaggca gagattggaa ttatactatg ctgagcagct caatgttttc ccagtaacag 3840
caggaaaatc ccaatgcaca gagaaggaac ctgaatgact taggtgggac acaccaggac 3900
agacaccccgt ggtgatgaca ttctgtgccc ttcatcccac agagtggtct gtcttcacag 3960
tggtctcccc tcaccacact gagccctcaa acttcctctt tccgctgacc aaagtgcacc 4020
caggcctgct tgtccattca gacagatgcc agggccctct gcactccatc tgacctctgc 4080
aatatgccgg ttcctaataa gggagcagga tccaggtcca gttgttcaca cttctaattt 4140
cataccggca gcctcagtaa agttctgcca tcaggctaag gccccactga tcgtcgacct 4200
tttctgcata aagattcacc tccagggctc ttagaaaata ctgctgcctg gctaccaccc 4260
catccttagt gtgacatagg gttttttttt cttcttcttc tgttttttgt tttttttaga 4320
ataattaggc agctctgttg cccaggctgg agtgcagtgg catgatctca gctcactgca 4380
acctctgcct cctggttcaa gcaattctcc tacctcagcc tcttgagtac ctaggactat 4440
aggcacacgc caccatgccc ggctaatttt ttgtattttt agtagagacg gggtttcacc 4500
aggttagcca ggatggtctc aatctcctga ccttgtgatc cgcccacctc agcctcccaa 4560
agtgctggga ttacagacgt gaggcaccac acctggcctg ccccgggttg tttttttttt 4620
taaagctccc cagggatttg taagtgcata ccaaagactg ggaacccctg gcttagctca 4680
cagagcaaag agccttttga gggttcccct cgacagttgc tccctcacct ccagctgtgg 4740
ggccacacag agcgctgggc cattgtggtg ttagagacca gagttaaagg gactccatct 4800
gtaatatcca ggacaaatgg gctggcaggt gctgctcaaa cccttacaca cagatagtat 4860
ttggggaggt gaggtcaatt cccccattat ggaacgctgc ggttttaaaa gcaagcaaac 4920
aaacaaaaac aggaaaaaag tgagcttttt aaaactaagg taaaatttgt cctcaacttc 4980
ctggccttga ttgggctctg ctactagagc ggcagaagca actcacttcc ctgcttccac 5040
ggacctgttt catgtaatgc attttgcaga gatttgaaga cagggtcctt gacttgggca 5100
gctaacagcc tgaggctaga ggcagccacc cctgaacagt gaacaattct gcaaggcgcc 5160
tggcaatagt actatgcggg gagggggtag gaacaaggtg ctgcagggcg gggtggagga 5220
ggaaatgaat tctgcctggg agaagcggga gtgcgtattt gagtggggtc tggagcaggt 5280
gcatgcaaag aagcacctca aaggcacggg caggtgtggg caggcgtggg caggcgtggg 5340
caggcgtggg aaggcgtggg caggcgtggg caggtgtggg caggcgtggg caggcgtggg 5400
caggcgtggg caggtgtggg caggtgtggg caggcatgtg ggcacggcac agggcttgtc 5460
caggccagat gccattaagc acaggtatct gtggtgggca ggggacacag tggaagcaga 5520
tagagaaggt ttgctggggt cccatggagg ggcgccttgt aggccatggt cactctaggc 5580
tgatgcaagg tgctcaaggt tgaaggcaga ggtgactgac ctgtgcttga gagagggtag 5640
ggaagagaag ctgccggact tgaggggctg aaattgtcct gtaatagtcc aggtcaggag 5700
tgttaatgat gccccagctc gggcagtgac tacggcaagg agagtttaac atgtggttca 5760
gttcagcaga catggggaac tcactatgtg tgaagcagga cacatcacgg aggcagccct 5820
caaatgcttg aagacagtaa tcctgcccct gtgctgtggc gggttcttta aggggtgtga 5880
cttcctcatc agacccattg ctctcacacc taatgatgct gccatgtggc agggctgtgg 5940
gcagagccat gccctagcag gggaagtgga ggacagcggc ggggagggag tgggggcagg 6000
gctttcctgc cctctgggtc ctctcctctc tttcgtggca gggccttgag gtccattcgc 6060
tgggctgcac agaaggagga ctccagagcc ccccttgggt tcaggatttt atacacgcag 6120
cattccagac agatggaccc gtgtattgac aatgaaagca tgggagaact gtatttcttt 6180
ggtgattaaa gtaaatgcaa aagttatgat gc 6212
<210> 29
<211> 2501
<212> DNA
213 <213>
<400> 29
cctcagctgg aattaaccct acacagttcc tcagagccta gggcttagta aaaaggccaa 60
gcctgaccta tgacctctct gacatctgtc cttagcacgt gttcttttct ttccaagtac 120
attgtaccac catgatggcc tgtgccctcc tcccccatcac ctccatacaa cgaatgagct 180
ctcatgagag cagagtggag gctggtgctg tggcctccac tcaggaattg tgaaccactc 240
caaccttctt ttgttaaaca ttacctagcc tcaaatatct tgtgatagca acagaagaga 300
ctaagatact taaaaatatc tatggatgaa gaaaatgacc aatgtgagga cgtcgtggat 360
attggccatc agcaaagaag agagcataaa gttcccattc tcacagatat tctgaaacct 420
gtgtatttca tttttgatgg aaaagagctg cacacagaat agtaagttag ctggaggggaa 480
cttatgagcc tttttttttc cccctcacat aaacaacaat ggagcttagt gtccatttca 540
ttctctttgt gcttgactgg gacccagatg gctcactgtc cctcagtatg tccctgctcc 600
ctccctgctg agatctcatt ggctgtgacg cactgccctg ctccagccag gacactactg 660
tctttcttcc ccgtgggaat gtgttctcaa agccaactcc aacaacgctg acctgggcat 720
cacttgggtg gtgctggagt gagctgtagg ctctggtcct gctgttgtag cctggggtcc 780
tagttgtcat tcccctgaca cagcagagag agcaaacaac agaaccaatg gctgtagcca 840
catggtgaac agctagacct ccagaacaat aggagtaaat gcttctgcca cgaagtgtat 900
ggagaaccta aaccaatctt caggcagaac tggggccagg taccacacac agccctgccc 960
ctttctcagc tggctgttgc ccatgccaga gtcatgatca cccataggat tctcagaccc 1020
agggcattgt gtagctggag ctcaatgagt cttacgggcc ggaagcagcc aattcaggga 1080
actctgggtt ctgcgtttgc tttgcatcta tttggtgaga gacagtgtga gttcttccat 1140
tacaaaattc caatgtttaa agagcaaaca gtcaagaaac aagaaaaaaa aacccaaggg 1200
tgtgtctgtg tgtgtgtgtg tgtgcatgtg tttatgtatg tgcaggtaca tgttggggac 1260
atgtgcatgt gcatgtttac atgtgcatag agaggtcaga agacaacacc agctgttgtt 1320
ccccaagtac aatccatagt tcaaccccct gtgtgtgtgt gtgtgtgtgt ttatgtgtgc 1380
atatgctatg gaagtcaaag attgagtctg gtgtcttcaa ctgccctcta ccctattttc 1440
tgaaacagag tctctcacta aatctagacc tcactggttg ggcatccttg ttagccaatg 1500
agctcaacta tctgcccgtt tgttctctct ctctctctct ctctctctct ctctctctct 1560
ctctctctct ctctctctct ctctccataa atgaatgaat gtgtgttttt aaaaagagag 1620
tttaaaaaaa actaaggtgg catgtatccc agcttctctc cacaatccaa ctggaacggc 1680
tcaggccagc ctcatttcac gcagctcact ctatcaacac atctgctgca cagagcatgc 1740
tttgtgagtg actcaaagat cagaaccctg acttccaatg gcttatagcc taagggtaga 1800
gaagttacct gtattctggc aagataccag ggattgtagg aggggtagca acctggggag 1860
gagggaatgc actctgtgta ggagatgcag aaaggattgg aagagctggt gagtatttga 1920
gttggatgtt ggactgataa atgcagggag catctcacag gttgggatca ggcacaccgg 1980
taggatgttt catccatccg agtcaaatgg agggcaggtg tagggatttc aggttagagg 2040
gcagggaaag aaagtagaga ggagagcctg gggttgtgct ggaggtgtgca cagagcactc 2100
agctggcact ttgaagaaca aagtggactg tccctggacg tgagactgag caggtaaggt 2160
gggttaagag acggtaagat cactactgca ataatccaaa ataagaacct ttatgatctc 2220
taggtgggat aacaaccagg gggagggact tttaacacac aattcagttc aacaggaact 2280
cgcacatcct ggaggcaaca cgtgaactgc gcaggctcag cagtcattgt ctgttctgcg 2340
tggtgctctt ccaagtggca cagtgtcttc atcagacctg gtgctcacat gactgatcta 2400
gtcacagaac aggccatgta tcaagttttg ggaaacagga agcaatggga gaaatgtatt 2460
ttaattggtga ttaagtgaag tgcaaaagat aggacgtgct a 2501
<210> 30
<211> 347
<212> DNA
<213> Homo sapiens
<400> 30
gggttcgggg ctttcggggg cggtcggggg ttcgcggacc cgggaagctc tgaggaccca 60
gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt aactttccca gtctccgagg 120
gaagaggcgg ggtgtggggt gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc 180
cccgcagcgc ccgcgcgctc ctctccccga ctcggagccc ctcggcggcg cccggcccag 240
gacccgccta ggagcgcagg agccccagcg cagagacccc aacgccgaga cccccgcccc 300
ggccccgccg cgcttcctcc cgacgcagag caaaccgccc agagtag 347
<210> 31
<211> 1131
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 31
gggttcgggg ctttcggggg cggtcggggg ttcgcggacc cgggaagctc tgaggaccca 60
gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt aactttccca gtctccgagg 120
gaagaggcgg ggtgtggggt gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc 180
cccgcagcgc ccgcgcgctc ctctccccga ctcggagccc ctcggcggcg cccggcccag 240
gacccgccta ggagcgcagg agccccagcg cagagacccc aacgccgaga cccccgcccc 300
ggccccgccg cgcttcctcc cgacgcagag caaaccgccc agagtagaag caggtgagtt 360
tgtggtgtcg ccgatgtccc ttcggggtac tctagcgcag ccgcctggct acttgaccca 420
ctgccaccaa acgttttaaa ttcaccgaaa gcttagcttc gaagcaaagc tccgtttcgc 480
cggtgaagca ggaagccttc gctgcaggaa ctgaccttta cctcttggag cggcttctgc 540
agaaaaatcc ccgggcagag atttgggcgg agtttgccta gaactaacgc ggagccagcc 600
gatcccggcc taccccgggg ccaagatttt aaggggtgaa gagtcccttt tgccttttct 660
ggatcctggt gattcaccta gtgtcttccc taaggaactg aaccaactcc tccgctggcc 720
tctggcagcc ctccaggcgg tgcaggatgg cgtgggcccg gtaggaagct gcatgtaacc 780
gcccagggtc gggaggccag gagggcagct cctcctctga cttgaatatt gaaaacaaga 840
ggatgctttt aagaaaaaga agaaggagga ttcactacca gctctgaagg gtggaaaaga 900
gatgattcat ccggattgg gagagggtgg aatcttgttt aggagagcgt tggttgtggc 960
aggcagggtg taactatgaa tcagtgaaga caattcacat cctgggatga aaagaaggcc 1020
atgggctcac aggagattat ccactggcct ctccacatcc gcttgcagta aggagtgtgg 1080
gactctccca agcttcagcg ctgaactgca atgcagtgac gtcgcttaag a 1131
<210> 32
<211> 1431
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 32
cgcattgccc agttgttaga ttaagaaata gacagcatga gagggatgag gcaacccgtg 60
ctcagctgtc aaggctcagt cgctagcatt tcccaacaca aagattctga ccttaaatgc 120
aaccatttga aacccctgta ggcctcaggt gaaactccag atgccacaat ggagctctgc 180
tcccctaaag cctcaaaaca aaggcctaat tctatgcctg tcttaatttt ctttcactta 240
agttagttcc actgagaccc caggctgtta ggggttattg gtgtaaggta ctttcatatt 300
ttaaacagag gatatcggca tttgtttctt tctctgagga caagagaaaa aagccaggtt 360
ccacagagga cacagagaag gtttgggtgt cctcctgggg ttctttttgc caactttccc 420
cacgttaaag gtgaacattg gttctttcat ttgctttgga agttttaatc tctaacagtg 480
gacaaagtta ccagtgcctt aaactctgtt acactttttg gaagtgaaaa ctttgtagta 540
tgataggtta ttttgatgta aagatgttct ggataccatt atatgttccc cctgtttcag 600
aggctcagat tgtaatatgt aaatggtatg tcattcgcta ctatgattta atttgaaata 660
tggtcttttg gttatgaata ctttgcagca cagctgagag gctgtctgtt gtattcattg 720
tggtcatagc acctaacaac attgtagcct caatcgagtg agacagacta gaagttccta 780
gtgatggctt atgatagcaa atggcctcat gtcaaatatt tagatgtaat tttgtgtaag 840
aaatacagac tggatgtacc accaactact acctgtaatg acaggcctgt ccaacacatc 900
tcccttttcc atgactgtgg tagccagcat cggaaagaac gctgatttaa agaggtcgct 960
tgggaatttt attgacacag taccatttaa tggggaggac aaaatggggc agggggaggga 1020
gaagtttctg tcgttaaaaa cagatttgga aagactggac tctaaagtct gttgattaaa 1080
gatgagcttt gtctacttca aaagtttgtt tgcttacccc ttcagcctcc aattttttaa 1140
gtgaaaatat agctaataac atgtgaaaag aatagaagct aaggtttaga taaatattga 1200
gcagatctat aggaagattg aacctgaata ttgccattat gcttgacatg gtttccaaaa 1260
aatggtactc cacatatttc agtgagggta agtattttcc tgttgtcaag aatagcattg 1320
taaaagcatt ttgtaataat aaagaatagc tttaatgata tgcttgtaac taaaataatt 1380
ttgtaatgta tcaaatacat ttaaaacatt aaaatataat ctctataata a 1431
<210> 33
<211> 743
<212> PRT
<213> artificial sequence
<220>
<223> synthetic
<400> 33
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser
1 5 10 15
Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro
20 25 30
Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro
35 40 45
Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro
50 55 60
Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp
65 70 75 80
Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala
85 90 95
Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly
100 105 110
Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro
115 120 125
Leu Gly Leu Val Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg
130 135 140
Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly
145 150 155 160
Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr
165 170 175
Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro
180 185 190
Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly
195 200 205
Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser
210 215 220
Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile
225 230 235 240
Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu
245 250 255
Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn
260 265 270
Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg
275 280 285
Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn
290 295 300
Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile
305 310 315 320
Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn
325 330 335
Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu
340 345 350
Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro
355 360 365
Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp
370 375 380
Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe
385 390 395 400
Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu
405 410 415
Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu
420 425 430
Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser
435 440 445
Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser
450 455 460
Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro
465 470 475 480
Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn
485 490 495
Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn
500 505 510
Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys
515 520 525
Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly
530 535 540
Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile
545 550 555 560
Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser
565 570 575
Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ser Thr Thr Leu
580 585 590
Tyr Ser Pro Ala Gln Ala Gln Thr Gly Trp Val Gln Asn Gln Gly Ile
595 600 605
Leu Pro Gly Met Val Trp Gln Asp Arg Asp Val Tyr Leu Gln Gly Pro
610 615 620
Ile Trp Ala Lys Ile Pro His Thr Asp Gly Asn Phe His Pro Ser Pro
625 630 635 640
Leu Met Gly Gly Phe Gly Met Lys His Pro Pro Pro Gln Ile Leu Ile
645 650 655
Lys Asn Thr Pro Val Pro Ala Asp Pro Pro Thr Ala Phe Asn Lys Asp
660 665 670
Lys Leu Asn Ser Phe Ile Thr Gln Tyr Ser Thr Gly Gln Val Ser Val
675 680 685
Glu Ile Glu Trp Glu Leu Gln Lys Glu Asn Ser Lys Arg Trp Asn Pro
690 695 700
Glu Ile Gln Tyr Thr Ser Asn Tyr Tyr Lys Ser Asn Asn Val Glu Phe
705 710 715 720
Ala Val Asn Thr Glu Gly Val Tyr Ser Glu Pro Arg Pro Ile Gly Thr
725 730 735
Arg Tyr Leu Thr Arg Asn Leu
740
<210> 34
<211> 149
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 34
ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60
cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120
gccaactcca tcactaggggg ttcctagat 149
<210> 35
<211> 139
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 35
cccctagtga tggagttggc cactccctct ctgcgcgctc gctcgctcac tgaggccgcc 60
cgggcaaagc ccgggcgtcg ggcgaccttt ggtcgcccgg cctcagtgag cgagcgagcg 120
cgcagagagg gagtggcca 139
<210> 36
<211> 6374
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 36
ttgctggcct tttgctcaca tgtcctgcag gcagctgcgc gctcgctcgc tcactgaggc 60
cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag tgagcgagcg 120
agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc gcacgcgttt 180
aattaagacc tcgaagggga cttggggggt tcggggcttt cgggggcggt cgggggttcg 240
cggacccggg aagctctgag gacccagagg ccgggcgcgc tccgcccgcg gcgccgcccc 300
ctccgtaact ttcccagtct ccgagggaag aggcggggtg tggggtgcgg ttaaaaggcg 360
ccacggcggg agacaggtgt tgcggccccg cagcgcccgc gcgctcctct ccccgactcg 420
gagcccctcg gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc ccagcgcaga 480
gaccccaacg ccgagacccc cgccccggcc ccgccgcgct tcctcccgac gcagagcaaa 540
ccgcccagag tagaagcgga tccgccacca tggattgggg cacactccag agcatcctcg 600
ggggtgtcaa caaacactcc accagcattg gaaagatctg gctcacggtc ctcttcatct 660
tccgcatcat gatcctcgtg gtggctgcaa aggaggtgtg gggagatgag caagccgatt 720
ttgtctgcaa cacgctccag cctggctgca agaatgtatg ctacgaccac cacttcccca 780
tctctcacat ccggctctgg gctctgcagc tgatcatggt gtccacgcca gccctcctgg 840
tagctatgca tgtggcctac cggagacatg aaaagaaacg gaagttcatg aagggagaga 900
taaagaacga gtttaaggac atcgaagaga tcaaaccca gaaggtccgt atcgaagggt 960
ccctgtggtg gacctacacc accagcatct tcttccgggt catctttgaa gccgtcttca 1020
tgtacgtctt ttacatcatg tacaatggct tcttcatgca acgtctggtg aaatgcaacg 1080
cttggccctg ccccaataca gtggactgct tcatttccag gcccacagaa aagactgtct 1140
tcaccgtgtt tatgatttct gtgtctggaa tttgcattct gctaaatatc acagagctgt 1200
gctatttgtt cgttaggtat tgctcaggaa agtccaaaag accagtctac ccatacgatg 1260
ttccagatta cgcttaaggc gcgccacccc tgcagggaat tccgcattgc ccagttgtta 1320
gattaagaaa tagacagcat gagagggatg aggcaacccg tgctcagctg tcaaggctca 1380
gtcgctagca tttcccaaca caaagattct gaccttaaat gcaaccattt gaaacccctg 1440
taggcctcag gtgaaactcc agatgccaca atggagctct gctcccctaa agcctcaaaa 1500
caaaggccta attctatgcc tgtcttaatt ttctttcact taagttagtt ccactgagac 1560
cccaggctgt taggggttat tggtgtaagg tactttcata ttttaaacag aggatatcgg 1620
catttgtttc tttctctgag gacaagagaa aaaagccagg ttccacagag gacacagaga 1680
aggtttgggt gtcctcctgg ggttcttttt gccaactttc cccacgttaa aggtgaacat 1740
tggttctttc atttgctttg gaagttttaa tctctaacag tggacaaagt taccagtgcc 1800
ttaaactctg ttacactttt tggaagtgaa aactttgtag tatgataggt tattttgatg 1860
taaagatgtt ctggatacca ttatatgttc cccctgtttc agaggctcag attgtaatat 1920
gtaaatggta tgtcattcgc tactatgatt taatttgaaa tatggtcttt tggttatgaa 1980
tactttgcag cacagctgag aggctgtctg ttgtattcat tgtggtcata gcacctaaca 2040
acattgtagc ctcaatcgag tgagacagac tagaagttcc tagtgatggc ttatgatagc 2100
aaatggcctc atgtcaaata tttagatgta attttgtgta agaaatacag actggatgta 2160
ccaccaacta ctacctgtaa tgacaggcct gtccaacaca tctccctttt ccatgactgt 2220
ggtagccagc atcggaaaga acgctgattt aaagaggtcg cttgggaatt ttattgacac 2280
agtaccattt aatggggagg acaaaatggg gcaggggagg gagaagtttc tgtcgttaaa 2340
aacagatttg gaaagactgg actctaaagt ctgttgatta aagatgagct ttgtctactt 2400
caaaagtttg tttgcttacc ccttcagcct ccaatttttt aagtgaaaat atagctaata 2460
acatgtgaaa agaatagaag ctaaggttta gataaatatt gagcagatct ataggaagat 2520
tgaacctgaa tattgccatt atgcttgaca tggtttccaa aaaatggtac tccacatatt 2580
tcagtgaggg taagtatttt cctgttgtca agaatagcat tgtaaaagca ttttgtaata 2640
ataaagaata gctttaatga tatgcttgta actaaaataa ttttgtaatg tatcaaatac 2700
atttaaaaca ttaaaatata atctctataa taatttaaaa tctaatatgg ttttaataga 2760
acagcgatat caagcttatc gataatcaac ctctggatta caaaatttgt gaaagattga 2820
ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct ttaatgcctt 2880
tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat aaatcctggt 2940
tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg 3000
tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag ctcctttccg 3060
ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc tgccttgccc 3120
gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg tcggggaaat 3180
catcgtcctt tccttggctg ctcgcctatg ttgccacctg gattctgcgc gggacgtcct 3240
tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg 3300
ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc tccctttggg 3360
ccgcctcccc gcgaattcat cgataccgag cgctgctcga gagatctgtg atagcggcca 3420
tcaagctggc tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 3480
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 3540
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 3600
gggaggattg ggaagacaat agcaggcatg ctggggacac gtgcggaccg agcggccgca 3660
ggaaccccta gtgatggagt tggccactcc ctctctgcgc gctcgctcgc tcactgaggc 3720
cgggcgacca aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg 3780
agcgcgcagc tgcctgcagg ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg 3840
tatttcacac cgcatacgtc aaagcaacca tagtacgcgc cctgtagcgg cgcattaagc 3900
gcggcgggtg tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc 3960
gctcctttcg ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct 4020
ctaaatcggg ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa 4080
aaacttgatt tgggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc 4140
cctttgacgt tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca 4200
ctcaacccta tctcgggcta ttcttttgat ttataaggga ttttgccgat ttcggcctat 4260
tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg 4320
tttacaattt tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag 4380
ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct cccggcatcc 4440
gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca 4500
tcaccgaaac gcgcgagacg aaagggcctc gtgatacgcc tatttttata ggttaatgtc 4560
atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt gcgcggaacc 4620
cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag acaataaccc 4680
tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc 4740
gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc agaaacgctg 4800
gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat cgaactggat 4860
ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc aatgatgagc 4920
acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg gcaagagcaa 4980
ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc agtcacagaa 5040
aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat aaccatgagt 5100
gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga gctaaccgct 5160
tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc ggagctgaat 5220
gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc aacaacgttg 5280
cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt aatagactgg 5340
atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc tggctggttt 5400
attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc agcactgggg 5460
ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca ggcaactatg 5520
gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca ttggtaactg 5580
tcagaccaag tttactcata tatactttag attgatttaa aacttcattt ttaatttaaa 5640
aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta acgtgagttt 5700
tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt 5760
tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt 5820
ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag 5880
ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta 5940
gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat 6000
aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg 6060
ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg 6120
agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac 6180
aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga 6240
aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt 6300
ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta 6360
cggttcctgg cctt 6374
<210> 37
<211> 700
<212> DNA
<213> Homo sapiens
<400> 37
ccatgatatg ttaagaaaag caaagtgtgg aatagtaggt aaaatattct atcttatgtg 60
caaaagggga aataaaagtc atcaatattc atgtagattc aattcacata tagattcata 120
tcacattcct atatatatag aaattctgga aagacacaaa ataaattaat aaaagttgtt 180
acttcattgt agtttttaaa gttttttgag tcttaagact tactttccac ttctgtagaa 240
aggaattaca aatcctttct ttatagagct atgtgatgaa ataaacataa agcatttggc 300
acacttcagg atagcaactt gtggattaat gattaacaca gtcacctttg caccagatta 360
cacccagaga ttccttcatt tatatttatg tggttttgtg tgtcagttat gcagtctaac 420
tcagtcattc aactatgtta cagctgcaac actctatttt tttctttggt acaggagtcg 480
ccctcttatc cactgtttca tttttgtggt tccagttacc tgtagtcaac cacagttgga 540
aaatatgata gcattttgag agagagactg catccaaaaa ctttatattac aatatattgt 600
tatacattgt tataagtgtt gttttattat tctttattgt taatctctta ccattaagcc 660
ttatggtagg tttgtatgta taggaaaaaa cagattatat 700
<210> 38
<211> 700
<212> DNA
<213> Homo sapiens
<400> 38
atataatctg ttttttccta tacatacaaa cctaccataa ggcttaatgg taagagatta 60
acaataaaga ataataaaac aacacttata acaatgtata acaatatatt gtaatataag 120
tttttggatg cagtctctct ctcaaaatgc tatcatattt tccaactgtg gttgactaca 180
ggtaactgga accacaaaaa tgaaacagtg gataagaggg cgactcctgt accaaagaaa 240
aaaatagagt gttgcagctg taacatagtt gaatgactga gttagactgc ataactgaca 300
cacaaaacca cataaatata aatgaaggaa tctctgggtg taatctggtg caaaggtgac 360
tgtgttaatc attaatccac aagttgctat cctgaagtgt gccaaatgct ttatgtttat 420
ttcatcacat agctctataa agaaaggatt tgtaattcct ttctacagaa gtggaaagta 480
agtcttaaga ctcaaaaaac tttaaaaact acaatgaagt aacaactttt attaatttat 540
tttgtgtctt tccagaattt ctatatatat aggaatgtga tatgaatcta tatgtgaatt 600
gaatctacat gaatattgat gacttttat tccccttttg cacataagat agaatatttt 660
acctactatt ccacactttg cttttcttaa catatcatgg 700
<210> 39
<211> 700
<212> DNA
<213> Homo sapiens
<400> 39
gcagagacct acagacagaa gtacatttta cactggatcc aggacacaca tcagtctgaa 60
aacacacaca tgaaccaaac gtttcctaaa gcattactta tccttgctaa tagcaacaca 120
ttctcatatt cttttatact tcatttaatt tcatataaaa aagaaaagga aaggaaagaa 180
atctatttct cagcccatta ataaggtcag gagcagcaac accagactag aagaaaagct 240
tacctataga tttttctgcc acctcttgag tgcgtccagc tttccgacaa gtctcagtgc 300
catctactgt gcgctctggg tattgcaatt gctttttttt tttttttttt ttttttttta 360
gaatgagact aagtcagaga acacaaagaa cttctttccc cacagtggag atggctctga 420
aagcgtttaa ggaatagctt agatgagtgg ctaacacat ctcccggttc tgaattctaa 480
gaccacagac tccatgtcca gtccccaaag agaggctttg caagctacag aatacccctc 540
tgactgggac ctcaggagct aaactgacca cgtaattggt tctagaaagt gaaacgtttt 600
aatttgaaac atccaaatga gcattttgg aaaagctact gccgtccatc aaatacaaca 660
cagccaggga gtcatcgctc tattgccctt gtcaatccta 700
<210> 40
<211> 700
<212> DNA
<213> Homo sapiens
<400> 40
taggattgac aagggcaata gagcgatgac tccctggctg tgttgtattt gatggacggc 60
agtagctttt cacaaaatgc tcatttggat gtttcaaatt aaaacgtttc actttctaga 120
accaattacg tggtcagttt agctcctgag gtcccagtca gaggggtatt ctgtagcttg 180
caaagcctct ctttggggac tggacatgga gtctgtggtc ttagaattca gaaccggggag 240
aatgtgttag ccactcatct aagctattcc ttaaacgctt tcagagccat ctccactgtg 300
gggaaagaag ttctttgtgt tctctgactt agtctcattc taaaaaaaaa aaaaaaaaaa 360
aaaaaaaagc aattgcaata cccagagcgc acagtagatg gcactgagac ttgtcggaaa 420
gctggacgca ctcaagaggt ggcagaaaaa tctataggta agcttttctt ctagtctggt 480
gttgctgctc ctgaccttat taatgggctg agaaatagat ttctttcctt tccttttctt 540
ttttatatga aattaaatga agtataaaag aatatgagaa tgtgttgcta ttagcaagga 600
taagtaatgc tttaggaaac gtttggttca tgtgtgtgtt ttcagactga tgtgtgtcct 660
ggatccagtg taaaatgtac ttctgtctgt aggtctctgc 700
<210> 41
<211> 700
<212> DNA
<213> Homo sapiens
<400> 41
atccattatt tgattagcca tttcaaaaac acatttacgg agatcttcat ctgggcagag 60
cattattcca ggcctctgaa gaaccaaaga tgattttgaa aggaggtcac agtgcagaca 120
gcaggtgtgt atataaggtg gctactttac aaaacaggat atggcaagct ggacatgaca 180
ggcacagcaa agtctctgaa cagagttcgg ggcatgaaat tgtttctttt gggggtcttc 240
aggaacaatt tcatgaaagc taaatcatga aagatagcag gcttttgcca ggaaaaaaaa 300
aaacaagact agtgattagt ttggcgtttt cggtttcttt gagaagcgaa ataacttatc 360
aaggactctt tttgccactt gatgttataa ttggttgata ggtctctcag aagccctttg 420
tgcaaactag aacctgcagg gatgtgcaaa gcctctctct gctgccatct gctgtcttac 480
aagaggtaac tgcaagaggt tgaatcctcc aatgccctgg ggattcccat tgcagggcag 540
gggcagcagc ctgtgttaat aaccacccga acagccacat gtacccctcc acaaaagtgt 600
cactgtctcc attgctctgg agtttgtatt cccaatttgt aatctttgtt agggcactca 660
taaaaaatta aaaacaaaaa ttcacacaaa catacactac 700
<210> 42
<211> 700
<212> DNA
<213> Homo sapiens
<400> 42
gtagtgtatg tttgtgtgaa tttttgtttt taatttttta tgagtgccct aacaaagatt 60
acaaattggg aatacaaact ccagagcaat ggagacagtg acacttttgt ggaggggtac 120
atgtggctgt tcgggtggtt attaacacag gctgctgccc ctgccctgca atgggaatcc 180
ccagggcatt ggaggattca acctcttgca gttacctctt gtaagacagc agatggcagc 240
agagagaggc tttgcacatc cctgcaggtt ctagtttgca caaagggctt ctgagagacc 300
tatcaaccaa ttataacatc aagtggcaaa aagagtcctt gataagttat ttcgcttctc 360
aaagaaaccg aaaacgccaa actaatcact agtcttgttt ttttttttcc tggcaaaagc 420
ctgctatctt tcatgattta gctttcatga aattgttcct gaagaccccc aaaagaaaca 480
atttcatgcc ccgaactctg ttcagagact ttgctgtgcc tgtcatgtcc agcttgccat 540
atcctgtttt gtaaagtagc caccttatat acacacctgc tgtctgcact gtgacctcct 600
ttcaaaatca tctttggttc ttcagaggcc tggaataatg ctctgcccag atgaagatct 660
ccgtaaatgt gtttttgaaa tggctaatca aataatggat 700
<210> 43
<211> 700
<212> DNA
<213> Homo sapiens
<400> 43
gctaattggg tcaggatttg aaagacctta gctttgtgg accttcaatt ttatcattca 60
gcttgaatat gtgccccaga aaacctttat gtaattccct aatatttcag taaccagcat 120
gcaacatacg agaagcacat tctttgtttt tagaatggta tctggctgat gactttcaca 180
acagctcaca tgagagggaa gtattttagc aatcggactg aaggaaaatc caaaaactcc 240
accattgcag ggtcaacagt gcacgtgttt gaattctgaa agacgtaagc caaggcaaat 300
agaaggaaat gatcttccac taatcccggc atttacttcc tcctctctgg aggggacggc 360
catgcacaca gagccctgtg ctctgagttc tcatgaaagg gacacagctg ggctcactca 420
gcgtcacctc gcccctgggg tgtgtcctgg tttcagatct cgggctggag tgattcacgt 480
gtggcaggga ggccatcatt aatgaaaatg cgagggcgtc gcacgagtgt tgatgactca 540
gcaggccttt ctacttctgt atgagtcagt gcccatcaca gccaagcctg gggcacaaca 600
ggttttctta aaagagcatg ggggcctcat cttcaacaac caattaggaa gcagaaaagt 660
cctcagtgag gaaggaataa tgacatgttg gagctaagat 700
<210> 44
<211> 700
<212> DNA
<213> Homo sapiens
<400> 44
atcttagctc caacatgtca ttattccttc ctcactgagg acttttctgc ttcctaattg 60
gttgttgaag atgaggcccc catgctcttt taagaaaacc tgttgtgccc caggcttggc 120
tgtgatgggc actgactcat acagaagtag aaaggcctgc tgagtcatca acactcgtgc 180
gacgccctcg cattttcatt aatgatggcc tccctgccac acgtgaatca ctccagcccg 240
agatctgaaa ccaggacaca ccccaggggc gaggtgacgc tgagtgagcc cagctgtgtc 300
cctttcatga gaactcagag cacagggctc tgtgtgcatg gccgtcccct ccagagagga 360
ggaagtaaat gccgggatta gtggaagatc atttccttct atttgccttg gcttacgtct 420
ttcagaattc aaacacgtgc actgttgacc ctgcaatggt ggagtttttg gattttcctt 480
cagtccgatt gctaaaatac ttccctctca tgtgagctgt tgtgaaagtc atcagccaga 540
taccattcta aaaacaaaga atgtgcttct cgtatgttgc atgctggtta ctgaaatatt 600
agggaattac ataaaggttt tctggggcac atattcaagc tgaatgataa aattgaaggt 660
cacacaaagc taaggtcttt caaatcctga cccaattagc 700
<210> 45
<211> 658
<212> DNA
<213> Homo sapiens
<400> 45
cgcctcggcc tcccaaagtg ctgggattac aggcgtgagc caccaccgtg cctggcttat 60
acaagtaatt gtaaacgaaa aggaaaaaat ggagatacag ttttctcgtg catcttaaac 120
tttggtgctt aaaagcacca ttaaattctg ctttcacatg aacacacaca agattaccac 180
gtttgctctg ggctgctgcg tattggaagg acatacacat tcaacaaata tttgttgaac 240
ttccattctg tacacaaagc acaaagaaag attcgttcac agtccgtgtg ggtactggaa 300
agcagttcca gccctgcctg ccagggggca ccccaggcaa gcacatctca gtggctgcta 360
gaaagtgaat tgaggctgag tctctccaca cccaagtgtt aggcgttcta ggctcagaaa 420
gagacaatga caatgcgggc aattctctct tcactgtgtc ctcttctttg ctagaaatgt 480
tattagaata tggaaatgtg acattcagca ctaatcagtt tgacatatga atatatctat 540
acacatattt ctccctgaaa ttggcctaaa tactctttct tggaaccaaa tgagaagcaa 600
acaaccttta caactaaaca ttaaaccata agatgaacat cttagttgtc tacctaga 658
<210> 46
<211> 682
<212> DNA
<213> Homo sapiens
<400> 46
ttctaggtag acaactaaga tgttcatctt atggtttaat gtttagttgt aaaggttgtt 60
tgcttctcat ttggttccaa gaaagagtat ttaggccaat ttcagggaga aatatgtgta 120
tagatatatt catatgtcaa actgattagt gctgaatgtc acatttccat attctaataa 180
catttctagc aaagaagagg acacagtgaa gagagaattg cccgcattgt cattgtctct 240
ttctgagcct agaacgccta acacttgggt gtggagagac tcagcctcaa ttcactttct 300
agcagccact gagatgtgct tgcctggggt gccccctggc aggcagggct ggaactgctt 360
tccagtaccc acacggactg tgaacgaatc tttctttgtg ctttgtgtac agaatggaag 420
ttcaacaaat atttgttgaa tgtgtatgtc cttccaatac gcagcagccc agagcaaacg 480
tggtaatctt gtgtgtgttc atgtgaaagc agaatttaat ggtgctttta agcaccaaag 540
tttaagatgc acgagaaaac tgtatctcca ttttttcctt ttcgtttaca attacttgta 600
taagccaggc acggtggtgg ctcacgcctg taatcccagc actttggggg gccgaggcgg 660
gcggatcaca tgaggtcggg ag 682
<210> 47
<211> 135
<212> DNA
<213> Homo sapiens
<400> 47
gggttcgggg ctttcggggg cggtcggggg ttcgcggacc cgggaagctc tgaggaccca 60
gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt aactttccca gtctccgagg 120
gaagaggcgg ggtgt 135
<210> 48
<211> 7163
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 48
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tcagtgatgc ctgaaacctc 1320
agatggtact gaaccctcta tataatctgt tttttcctat acatacaaac ctaccataag 1380
gcttaatggt aagagattaa caataaagaa taataaaaca acacttataa caatgtataa 1440
caatatattg taatataagt ttttggatgc agtctctctc tcaaaatgct atcatatttt 1500
ccaactgtgg ttgactacag gtaactggaa ccacaaaaat gaaacagtgg ataagagggc 1560
gactcctgta ccaaagaaaa aaatagagtg ttgcagctgt aacatagttg aatgactgag 1620
ttagactgca taactgacac acaaaaccac ataaatataa atgaaggaat ctctgggtgt 1680
aatctggtgc aaaggtgact gtgttaatca ttaatccaca agttgctatc ctgaagtgtg 1740
ccaaatgctt tatgtttatt tcatcacata gctctataaa gaaaggattt gtaattcctt 1800
tctacagaag tggaaagtaa gtcttaagac tcaaaaaact ttaaaaacta caatgaagta 1860
acaactttta ttaatttatt ttgtgtcttt ccagaatttc tatatatata ggaatgtgat 1920
atgaatctat atgtgaattg aatctacatg aatattgatg acttttatt ccccttttgc 1980
acataagata gaatatttta cctactattc cacactttgc ttttcttaac atatcatggg 2040
atctttttat ataagtgaac aaagagtttc ttcattcttt cacacagttt aattaagacc 2100
tcgaagggga cttggggggt tcggggcttt cgggggcggt cgggggttcg cggacccggg 2160
aagctctgag gacccagagg ccgggcgcgc tccgcccgcg gcgccgcccc ctccgtaact 2220
ttcccagtct ccgagggaag aggcggggtg tggggtgcgg ttaaaaggcg ccacggcggg 2280
agacaggtgt tgcggccccg cagcgcccgc gcgctcctct ccccgactcg gagcccctcg 2340
gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc ccagcgcaga gaccccaacg 2400
ccgagacccc cgccccggcc ccgccgcgct tcctcccgac gcagagcaaa ccgcccagag 2460
tagaagccat ggtgagcaag ggcgaggagc tgttcaccgg ggtggtgccc atcctggtcg 2520
agctggacgg cgacgtaaac ggccacaagt tcagcgtgtc cggcgagggc gagggcgatg 2580
ccacctacgg caagctgacc ctgaagttca tctgcaccac cggcaagctg cccgtgccct 2640
ggcccaccct cgtgaccacc ctgacctacg gcgtgcagtg cttcagccgc taccccgacc 2700
acatgaagca gcacgacttc ttcaagtccg ccatgcccga aggctacgtc caggagcgca 2760
ccatcttctt caaggacgac ggcaactaca agacccgcgc cgaggtgaag ttcgagggcg 2820
acaccctggt gaaccgcatc gagctgaagg gcatcgactt caaggaggac ggcaacatcc 2880
tggggcacaa gctggagtac aactacaaca gccacaacgt ctatatcatg gccgacaagc 2940
agaagaacgg catcaaggtg aacttcaaga tccgccacaa catcgaggac ggcagcgtgc 3000
agctcgccga ccactaccag cagaacaccc ccatcggcga cggccccgtg ctgctgcccg 3060
acaaccacta cctgagcacc cagtccgccc tgagcaaaga ccccaacgag aagcgcgatc 3120
acatggtcct gctggagttc gtgaccgccg ccgggatcac tctcggcatg gacgagctgt 3180
acaagtaaag gcgcgccacc cctgcaggga attccgcatt gcccagttgt tagattaaga 3240
aatagacagc atgagaggga tgaggcaacc cgtgctcagc tgtcaaggct cagtcgctag 3300
catttcccaa cacaaagatt ctgaccttaa atgcaaccat ttgaaacccc tgtaggcctc 3360
aggtgaaact ccagatgcca caatggagct ctgctcccct aaagcctcaa aacaaaggcc 3420
taattctatg cctgtcttaa ttttctttca cttaagttag ttccactgag accccaggct 3480
gttaggggtt attggtgtaa ggtactttca tattttaaac agaggatatc ggcatttgtt 3540
tctttctctg aggacaagag aaaaaagcca ggttccacag aggacacaga gaaggtttgg 3600
gtgtcctcct ggggttcttt ttgccaactt tccccacgtt aaaggtgaac attggttctt 3660
tcatttgctt tggaagtttt aatctctaac agtggacaaa gttaccagtg ccttaaactc 3720
tgttacactt tttggaagtg aaaactttgt agtatgatag gttattttga tgtaaagatg 3780
ttctggatac cattatatgt tccccctgtt tcagaggctc agattgtaat atgtaaatgg 3840
tatgtcattc gctactatga tttaatttga aatatggtct tttggttatg aatactttgc 3900
agcacagctg agaggctgtc tgttgtattc attgtggtca tagcacctaa caacattgta 3960
gcctcaatcg agtgagacag actagaagtt cctagtgatg gcttatgata gcaaatggcc 4020
tcatgtcaaa tatttagatg taattttgtg taagaaatac agactggatg taccaccaac 4080
tactacctgt aatgacaggc ctgtccaaca catctccctt ttccatgact gtggtagcca 4140
gcatcggaaa gaacgctgat ttaaagaggt cgcttgggaa ttttattgac acagtaccat 4200
ttaatgggga ggacaaaatg gggcagggga gggagaagtt tctgtcgtta aaaacagatt 4260
tggaaagact ggactctaaa gtctgttgat taaagatgag ctttgtctac ttcaaaagtt 4320
tgtttgctta ccccttcagc ctccaatttt ttaagtgaaa atatagctaa taacatgtga 4380
aaagaataga agctaaggtt tagataaata ttgagcagat ctataggaag attgaacctg 4440
aatattgcca ttatgcttga catggtttcc aaaaaatggt actccacata tttcagtgag 4500
ggtaagtatt ttcctgttgt caagaatagc attgtaaaag cattttgtaa taataaagaa 4560
tagctttaat gatatgcttg taactaaaat aattttgtaa tgtatcaaat acatttaaaa 4620
cattaaaata taatctctat aataatttaa aatctaatat ggttttaata gaacagcgat 4680
atcaagctta tcgataatca acctctggat tacaaaattt gtgaaagatt gactggtatt 4740
cttaactatg ttgctccttt tacgctatgt ggatacgctg ctttaatgcc tttgtatcat 4800
gctattgctt cccgtatggc tttcattttc tcctccttgt ataaatcctg gttgctgtct 4860
ctttatgagg agttgtggcc cgttgtcagg caacgtggcg tggtgtgcac tgtgtttgct 4920
gacgcaaccc ccactggttg gggcattgcc accacctgtc agctcctttc cgggactttc 4980
gctttccccc tccctattgc cacggcggaa ctcatcgccg cctgccttgc ccgctgctgg 5040
acaggggctc ggctgttggg cactgacaat tccgtggtgt tgtcgggggaa atcatcgtcc 5100
tttccttggc tgctcgccta tgttgccacc tggattctgc gcgggacgtc cttctgctac 5160
gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg gcctgctgcc ggctctgcgg 5220
cctcttccgc gtcttcgcct tcgccctcag acgagtcgga tctccctttg ggccgcctcc 5280
ccgcgaattc atcgataccg agcgctgctc gagagatctg tgatagcggc catcaagctg 5340
gctgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc 5400
ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt 5460
ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat 5520
tgggaagaca atagcaggca tgctggggac acgtgcggac cgagcggccg caggaacccc 5580
tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag gccgggcgac 5640
caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag cgagcgcgca 5700
gctgcctgca ggggcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac 5760
accgcatacg tcaaagcaac catagtacgc gccctgtagc ggcgcattaa gcgcggcggg 5820
tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt 5880
cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg 5940
ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga 6000
tttgggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac 6060
gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc 6120
tatctcgggc tattcttttg atttataagg gattttgccg atttcggcct attggttaaa 6180
aaatgagctg atttaacaaa aatttaacgc gaattttaac aaaatattaa cgtttacaat 6240
tttatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agccccgaca 6300
cccgccaaca cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag 6360
acaagctgg accgtctccg ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa 6420
acgcgcgaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat 6480
aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg 6540
tttattttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat 6600
gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat 6660
tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt 6720
aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag 6780
cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa 6840
agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg 6900
ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct 6960
tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac 7020
tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca 7080
caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat 7140
accaaacgac gagcgtgaca cca 7163
<210> 49
<211> 7247
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 49
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tgctatctat catcttgaag 1320
ggcttctgga acaagttaga atagagtcaa cactcatgaa ctgctgtagc aaaaaaaact 1380
atagatgtag gattgacaag ggcaatagag cgatgactcc ctggctgtgt tgtatttgat 1440
ggacggcagt agcttttcac aaaatgctca tttggatgtt tcaaattaaa acgtttcact 1500
ttctagaacc aattacgtgg tcagtttagc tcctgaggtc ccagtcagag gggtattctg 1560
tagcttgcaa agcctctctt tggggactgg acatggagtc tgtggtctta gaattcagaa 1620
ccgggagaat gtgttagcca ctcatctaag ctattcctta aacgctttca gagccatctc 1680
cactgtgggg aaagaagttc tttgtgttct ctgacttagt ctcattctaa aaaaaaaaaa 1740
aaaaaaaaaa aaaaagcaat tgcaataccc agagcgcaca gtagatggca ctgagacttg 1800
tcggaaagct ggacgcactc aagaggtggc agaaaaatct ataggtaagc ttttcttcta 1860
gtctggtgtt gctgctcctg accttattaa tgggctgaga aatagatttc tttcctttcc 1920
ttttcttttt tatatgaaat taaatgaagt ataaaagaat atgagaatgt gttgctatta 1980
gcaaggataa gtaatgcttt aggaaacgtt tggttcatgt gtgtgttttc agactgatgt 2040
gtgtcctgga tccagtgtaa aatgtacttc tgtctgtagg tctctgccac agaaaagttg 2100
gaaagccatt gttgtattcc atttccaggg caacaaaaga taccactgtc acttcatgtg 2160
aaatggtgtt gtttaattaa gacctcgaag gggacttggg gggttcgggg ctttcggggg 2220
cggtcggggg ttcgcggacc cgggaagctc tgaggaccca gaggccgggc gcgctccgcc 2280
cgcggcgccg ccccctccgt aactttccca gtctccgagg gaagaggcgg ggtgtggggt 2340
gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc cccgcagcgc ccgcgcgctc 2400
ctctccccga ctcggagccc ctcggcggcg cccggcccag gacccgccta ggagcgcagg 2460
agccccagcg cagagacccc aacgccgaga cccccgcccc ggccccgccg cgcttcctcc 2520
cgacgcagag caaaccgccc agagtagaag ccatggtgag caagggcgag gagctgttca 2580
ccggggtggt gcccatcctg gtcgagctgg acggcgacgt aaacggccac aagttcagcg 2640
tgtccggcga gggcgagggc gatgccacct acggcaagct gaccctgaag ttcatctgca 2700
ccaccggcaa gctgcccgtg ccctggccca ccctcgtgac caccctgacc tacggcgtgc 2760
agtgcttcag ccgctacccc gaccacatga agcagcacga cttcttcaag tccgccatgc 2820
ccgaaggcta cgtccaggag cgcaccatct tcttcaagga cgacggcaac tacaagaccc 2880
gcgccgaggt gaagttcgag ggcgacaccc tggtgaaccg catcgagctg aagggcatcg 2940
acttcaagga ggacggcaac atcctggggc acaagctgga gtacaactac aacagccaca 3000
acgtctatat catggccgac aagcagaaga acggcatcaa ggtgaacttc aagatccgcc 3060
acaacatcga ggacggcagc gtgcagctcg ccgaccacta ccagcagaac acccccatcg 3120
gcgacggccc cgtgctgctg cccgacaacc actacctgag cacccagtcc gccctgagca 3180
aagaccccaa cgagaagcgc gatcacatgg tcctgctgga gttcgtgacc gccgccggga 3240
tcactctcgg catggacgag ctgtacaagt aaaggcgcgc cacccctgca gggaattccg 3300
cattgcccag ttgttagatt aagaaataga cagcatgaga gggatgaggc aacccgtgct 3360
cagctgtcaa ggctcagtcg ctagcatttc ccaacacaaa gattctgacc ttaaatgcaa 3420
ccatttgaaa cccctgtagg cctcaggtga aactccagat gccacaatgg agctctgctc 3480
ccctaaagcc tcaaaacaaa ggcctaattc tatgcctgtc ttaattttct ttcacttaag 3540
ttagttccac tgagacccca ggctgttagg ggttattggt gtaaggtact ttcatatttt 3600
aaacagagga tatcggcatt tgtttctttc tctgaggaca agagaaaaaa gccaggttcc 3660
acagaggaca cagagaaggt ttgggtgtcc tcctggggtt ctttttgcca actttcccca 3720
cgttaaaggt gaacattggt tctttcattt gctttggaag ttttaatctc taacagtgga 3780
caaagttacc agtgccttaa actctgttac actttttgga agtgaaaact ttgtagtatg 3840
ataggttatt ttgatgtaaa gatgttctgg ataccattat atgttccccc tgtttcagag 3900
gctcagattg taatatgtaa atggtatgtc attcgctact atgatttaat ttgaaatatg 3960
gtcttttggt tatgaatact ttgcagcaca gctgagaggc tgtctgttgt attcattgtg 4020
gtcatagcac ctaacaacat tgtagcctca atcgagtgag acagactaga agttcctagt 4080
gatggcttat gatagcaaat ggcctcatgt caaatattta gatgtaattt tgtgtaagaa 4140
atacagactg gatgtaccac caactactac ctgtaatgac aggcctgtcc aacacatctc 4200
ccttttccat gactgtggta gccagcatcg gaaagaacgc tgatttaaag aggtcgcttg 4260
ggaattttat tgacacagta ccatttaatg gggaggacaa aatggggcag gggagggaga 4320
agtttctgtc gttaaaaaca gatttggaaa gactggactc taaagtctgt tgattaaaga 4380
tgagctttgt ctacttcaaa agtttgtttg cttacccctt cagcctccaa ttttttaagt 4440
gaaaatatag ctaataacat gtgaaaagaa tagaagctaa ggtttagata aatattgagc 4500
agatctatag gaagattgaa cctgaatatt gccattatgc ttgacatggt ttccaaaaaa 4560
tggtactcca catatttcag tgagggtaag tattttcctg ttgtcaagaa tagcattgta 4620
aaagcatttt gtaataataa agaatagctt taatgatatg cttgtaacta aaataatttt 4680
gtaatgtatc aaatacattt aaaacattaa aatataatct ctataataat ttaaaatcta 4740
atatggtttt aatagaacag cgatatcaag cttatcgata atcaacctct ggattacaaa 4800
atttgtgaaa gattgactgg tattcttaac tatgttgctc cttttacgct atgtggatac 4860
gctgctttaa tgcctttgta tcatgctatt gcttcccgta tggctttcat tttctcctcc 4920
ttgtataaat cctggttgct gtctctttat gaggagttgt ggcccgttgt caggcaacgt 4980
ggcgtggtgt gcactgtgtt tgctgacgca acccccactg gttggggcat tgccaccacc 5040
tgtcagctcc tttccgggac tttcgctttc cccctcccta ttgccacggc ggaactcatc 5100
gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt tgggcactga caattccgtg 5160
gtgttgtcgg ggaaatcatc gtcctttcct tggctgctcg cctatgttgc cacctggatt 5220
ctgcgcggga cgtccttctg ctacgtccct tcggccctca atccagcgga ccttccttcc 5280
cgcggcctgc tgccggctct gcggcctctt ccgcgtcttc gccttcgccc tcagacgagt 5340
cggatctccc tttgggccgc ctccccgcga attcatcgat accgagcgct gctcgagaga 5400
tctgtgatag cggccatcaa gctggctgtg ccttctagtt gccagccatc tgttgtttgc 5460
ccctcccctg tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa 5520
aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg 5580
gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggacacgtgc 5640
ggaccgagcg gccgcaggaa cccctagtga tggagttggc cactccctct ctgcgcgctc 5700
gctcgctcac tgaggccggg cgaccaaagg tcgcccgacg cccgggcttt gcccgggcgg 5760
cctcagtgag cgagcgagcg cgcagctgcc tgcaggggcg cctgatgcgg tattttctcc 5820
ttacgcatct gtgcggtatt tcacaccgca tacgtcaaag caaccatagt acgcgccctg 5880
tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc 5940
cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg 6000
ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta gtgctttacg 6060
gcacctcgac cccaaaaaac ttgatttggg tgatggttca cgtagtgggc catcgccctg 6120
atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg gactcttgtt 6180
ccaaactgga acaacactca accctatctc gggctattct tttgatttat aagggatttt 6240
gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta acgcgaattt 6300
taacaaaata ttaacgttta caattttatg gtgcactctc agtacaatct gctctgatgc 6360
cgcatagtta agccagcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg 6420
tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca 6480
gaggttttca ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga tacgcctatt 6540
tttataggtt aatgtcatga taataatggt ttcttagacg tcaggtggca cttttcgggg 6600
aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct 6660
catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat 6720
tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc 6780
tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg 6840
ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg 6900
ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga 6960
cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta 7020
ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc 7080
tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc 7140
gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg 7200
ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacacca 7247
<210> 50
<211> 7243
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 50
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taagctacta actacaacca 1320
cgagattata gatgtttgct gatattgttc tcagtttggt tattgtgttg tttatgaatg 1380
aaagtagtgt atgtttgtgt gaatttttgt ttttaatttt ttatgagtgc cctaacaaag 1440
attacaaatt gggaatacaa actccagagc aatggagaca gtgacacttt tgtggagggg 1500
tacatgtggc tgttcggggg gttattaaca caggctgctg cccctgccct gcaatgggaa 1560
tccccagggc attggaggat tcaacctctt gcagttacct cttgtaagac agcagatggc 1620
agcagagaga ggctttgcac atccctgcag gttctagttt gcacaaaggg cttctgagag 1680
acctatcaac caattataac atcaagtggc aaaaagagtc cttgataagt tatttcgctt 1740
ctcaaagaaa ccgaaaacgc caaactaatc actagtcttg tttttttttt tcctggcaaa 1800
agcctgctat ctttcatgat ttagctttca tgaaattgtt cctgaagacc cccaaaagaa 1860
acaatttcat gccccgaact ctgttcagag actttgctgt gcctgtcatg tccagcttgc 1920
catatcctgt tttgtaaagt agccacctta tatacacacc tgctgtctgc actgtgacct 1980
cctttcaaaa tcatctttgg ttcttcagag gcctggaata atgctctgcc cagatgaaga 2040
tctccgtaaa tgtgtttttg aaatggctaa tcaaataatg gataccctta ggtatttttg 2100
cagaaacact tggcagcctt ccataatatc cctactatga aatggaaact tgtgaatgag 2160
atgtggcttt aattaagacc tcgaagggga cttggggggt tcggggcttt cgggggcggt 2220
cgggggttcg cggacccggg aagctctgag gacccagagg ccgggcgcgc tccgcccgcg 2280
gcgccgcccc ctccgtaact ttcccagtct ccgagggaag aggcggggtg tggggtgcgg 2340
ttaaaaggcg ccacggcggg agacaggtgt tgcggccccg cagcgcccgc gcgctcctct 2400
ccccgactcg gagcccctcg gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc 2460
ccagcgcaga gaccccaacg ccgagacccc cgccccggcc ccgccgcgct tcctcccgac 2520
gcagagcaaa ccgcccagag tagaagccat ggtgagcaag ggcgaggagc tgttcaccgg 2580
ggtggtgccc atcctggtcg agctggacgg cgacgtaaac ggccacaagt tcagcgtgtc 2640
cggcgagggc gagggcgatg ccacctacgg caagctgacc ctgaagttca tctgcaccac 2700
cggcaagctg cccgtgccct ggcccaccct cgtgaccacc ctgacctacg gcgtgcagtg 2760
cttcagccgc taccccgacc acatgaagca gcacgacttc ttcaagtccg ccatgcccga 2820
aggctacgtc caggagcgca ccatcttctt caaggacgac ggcaactaca agacccgcgc 2880
cgaggtgaag ttcgagggcg acaccctggt gaaccgcatc gagctgaagg gcatcgactt 2940
caaggaggac ggcaacatcc tggggcacaa gctggagtac aactacaaca gccacaacgt 3000
ctatatcatg gccgacaagc agaagaacgg catcaaggtg aacttcaaga tccgccacaa 3060
catcgaggac ggcagcgtgc agctcgccga ccactaccag cagaacaccc ccatcggcga 3120
cggccccgtg ctgctgcccg acaaccacta cctgagcacc cagtccgccc tgagcaaaga 3180
ccccaacgag aagcgcgatc acatggtcct gctggagttc gtgaccgccg ccgggatcac 3240
tctcggcatg gacgagctgt acaagtaaag gcgcgccacc cctgcaggga attccgcatt 3300
gcccagttgt tagattaaga aatagacagc atgagaggga tgaggcaacc cgtgctcagc 3360
tgtcaaggct cagtcgctag catttcccaa cacaaagatt ctgaccttaa atgcaaccat 3420
ttgaaacccc tgtaggcctc aggtgaaact ccagatgcca caatggagct ctgctcccct 3480
aaagcctcaa aacaaaggcc taattctatg cctgtcttaa ttttctttca cttaagttag 3540
ttccactgag accccaggct gttaggggtt attggtgtaa ggtactttca tattttaaac 3600
agaggatatc ggcatttgtt tctttctctg aggacaagag aaaaaagcca ggttccacag 3660
aggacacaga gaaggtttgg gtgtcctcct ggggttcttt ttgccaactt tccccacgtt 3720
aaaggtgaac attggttctt tcatttgctt tggaagtttt aatctctaac agtggacaaa 3780
gttaccagtg ccttaaactc tgttacactt tttggaagtg aaaactttgt agtatgatag 3840
gttattttga tgtaaagatg ttctggatac cattatatgt tccccctgtt tcagaggctc 3900
agattgtaat atgtaaatgg tatgtcattc gctactatga tttaatttga aatatggtct 3960
tttggttatg aatactttgc agcacagctg agaggctgtc tgttgtattc attgtggtca 4020
tagcacctaa caacattgta gcctcaatcg agtgagacag actagaagtt cctagtgatg 4080
gcttatgata gcaaatggcc tcatgtcaaa tatttagatg taattttgtg taagaaatac 4140
agactggatg taccaccaac tactacctgt aatgacaggc ctgtccaaca catctccctt 4200
ttccatgact gtggtagcca gcatcggaaa gaacgctgat ttaaagaggt cgcttgggaa 4260
ttttattgac acagtaccat ttaatgggga ggacaaaatg gggcagggga gggagaagtt 4320
tctgtcgtta aaaacagatt tggaaagact ggactctaaa gtctgttgat taaagatgag 4380
ctttgtctac ttcaaaagtt tgtttgctta ccccttcagc ctccaatttt ttaagtgaaa 4440
atatagctaa taacatgtga aaagaataga agctaaggtt tagataaata ttgagcagat 4500
ctataggaag attgaacctg aatattgcca ttatgcttga catggtttcc aaaaaatggt 4560
actccacata tttcagtgag ggtaagtatt ttcctgttgt caagaatagc attgtaaaag 4620
cattttgtaa taataaagaa tagctttaat gatatgcttg taactaaaat aattttgtaa 4680
tgtatcaaat acatttaaaa cattaaaata taatctctat aataatttaa aatctaatat 4740
ggttttaata gaacagcgat atcaagctta tcgataatca acctctggat tacaaaattt 4800
gtgaaagatt gactggtatt cttaactatg ttgctccttt tacgctatgt ggatacgctg 4860
ctttaatgcc tttgtatcat gctattgctt cccgtatggc tttcattttc tcctccttgt 4920
ataaatcctg gttgctgtct ctttatgagg agttgtggcc cgttgtcagg caacgtggcg 4980
tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg gggcattgcc accacctgtc 5040
agctcctttc cgggactttc gctttccccc tccctattgc cacggcggaa ctcatcgccg 5100
cctgccttgc ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggtgt 5160
tgtcggggaa atcatcgtcc tttccttggc tgctcgccta tgttgccacc tggattctgc 5220
gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc agcggacctt ccttcccgcg 5280
gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag acgagtcgga 5340
tctccctttg ggccgcctcc ccgcgaattc atcgataccg agcgctgctc gagagatctg 5400
tgatagcggc catcaagctg gctgtgcctt ctagttgcca gccatctgtt gtttgcccct 5460
cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg 5520
aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc 5580
aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggac acgtgcggac 5640
cgagcggccg caggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc 5700
gctcactgag gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc 5760
agtgagcgag cgagcgcgca gctgcctgca ggggcgcctg atgcggtatt ttctcctac 5820
gcatctgtgc ggtatttcac accgcatacg tcaaagcaac catagtacgc gccctgtagc 5880
ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac acttgccagc 5940
gccctagcgc ccgctccttt cgctttcttc ccttcctttc tcgccacgtt cgccggcttt 6000
ccccgtcaag ctctaaatcg ggggctccct ttagggttcc gatttagtgc tttacggcac 6060
ctcgacccca aaaaacttga tttgggtgat ggttcacgta gtgggccatc gccctgatag 6120
acggtttttc gccctttgac gttggagtcc acgttcttta atagtggact cttgttccaa 6180
actggaacaa cactcaaccc tatctcgggc tattcttttg atttataagg gattttgccg 6240
atttcggcct attggttaaa aaatgagctg atttaacaaa aatttaacgc gaattttaac 6300
aaaatattaa cgtttacaat tttatggtgc actctcagta caatctgctc tgatgccgca 6360
tagttaagcc agccccgaca cccgccaaca cccgctgacg cgccctgacg ggcttgtctg 6420
ctcccggcat ccgcttacag acaagctgtg accgtctccg ggagctgcat gtgtcagagg 6480
ttttcaccgt catcaccgaa acgcgcgaga cgaaagggcc tcgtgatacg cctattttta 6540
taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat 6600
gtgcgcggaa cccctatttg tttattttc taaatacatt caaatatgta tccgctcatg 6660
agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa 6720
catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac 6780
ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac 6840
atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt 6900
ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tatttcccg tattgacgcc 6960
gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca 7020
ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc 7080
ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggacccgaag 7140
gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa 7200
ccggagctga atgaagccat accaaacgac gagcgtgaca cca 7243
<210> 51
<211> 7253
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 51
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tctcagctgg agtgacgcac 1320
ctcatccatg cgggcctggc gtctggaagg tggctgggtc tctcgggctt gagcaccatc 1380
atcttagctc caacatgtca ttattccttc ctcactgagg acttttctgc ttcctaattg 1440
gttgttgaag atgaggcccc catgctcttt taagaaaacc tgttgtgccc caggcttggc 1500
tgtgatgggc actgactcat acagaagtag aaaggcctgc tgagtcatca acactcgtgc 1560
gacgccctcg cattttcatt aatgatggcc tccctgccac acgtgaatca ctccagcccg 1620
agatctgaaa ccaggacaca ccccaggggc gaggtgacgc tgagtgagcc cagctgtgtc 1680
cctttcatga gaactcagag cacagggctc tgtgtgcatg gccgtcccct ccagagagga 1740
ggaagtaaat gccgggatta gtggaagatc atttccttct atttgccttg gcttacgtct 1800
ttcagaattc aaacacgtgc actgttgacc ctgcaatggt ggagtttttg gattttcctt 1860
cagtccgatt gctaaaatac ttccctctca tgtgagctgt tgtgaaagtc atcagccaga 1920
taccattcta aaaacaaaga atgtgcttct cgtatgttgc atgctggtta ctgaaatatt 1980
agggaattac ataaaggttt tctggggcac atattcaagc tgaatgataa aattgaaggt 2040
cacacaaagc taaggtcttt caaatcctga cccaattagc tctctgttag ctctctgact 2100
ttggacaagc tgtctggtcc tctgaagcat actttgttcg ccctgggtag gggccctctg 2160
ttttaacagc gtttggcatt aattaagacc tcgaagggga cttggggggt tcggggcttt 2220
cgggggcggt cgggggttcg cggacccggg aagctctgag gacccagagg ccgggcgcgc 2280
tccgcccgcg gcgccgcccc ctccgtaact ttcccagtct ccgagggaag aggcggggtg 2340
tggggtgcgg ttaaaaggcg ccacggcggg agacaggtgt tgcggccccg cagcgcccgc 2400
gcgctcctct ccccgactcg gagcccctcg gcggcgcccg gcccaggacc cgcctaggag 2460
cgcaggagcc ccagcgcaga gaccccaacg ccgagacccc cgccccggcc ccgccgcgct 2520
tcctcccgac gcagagcaaa ccgcccagag tagaagccat ggtgagcaag ggcgaggagc 2580
tgttcaccgg ggtggtgccc atcctggtcg agctggacgg cgacgtaaac ggccacaagt 2640
tcagcgtgtc cggcgagggc gagggcgatg ccacctacgg caagctgacc ctgaagttca 2700
tctgcaccac cggcaagctg cccgtgccct ggcccaccct cgtgaccacc ctgacctacg 2760
gcgtgcagtg cttcagccgc taccccgacc acatgaagca gcacgacttc ttcaagtccg 2820
ccatgcccga aggctacgtc caggagcgca ccatcttctt caaggacgac ggcaactaca 2880
agacccgcgc cgaggtgaag ttcgagggcg acaccctggt gaaccgcatc gagctgaagg 2940
gcatcgactt caaggaggac ggcaacatcc tggggcacaa gctggagtac aactacaaca 3000
gccacaacgt ctatatcatg gccgacaagc agaagaacgg catcaaggtg aacttcaaga 3060
tccgccacaa catcgaggac ggcagcgtgc agctcgccga ccactaccag cagaacaccc 3120
ccatcggcga cggccccgtg ctgctgcccg acaaccacta cctgagcacc cagtccgccc 3180
tgagcaaaga ccccaacgag aagcgcgatc acatggtcct gctggagttc gtgaccgccg 3240
ccgggatcac tctcggcatg gacgagctgt acaagtaaag gcgcgccacc cctgcaggga 3300
attccgcatt gcccagttgt tagattaaga aatagacagc atgagaggga tgaggcaacc 3360
cgtgctcagc tgtcaaggct cagtcgctag catttcccaa cacaaagatt ctgaccttaa 3420
atgcaaccat ttgaaacccc tgtaggcctc aggtgaaact ccagatgcca caatggagct 3480
ctgctcccct aaagcctcaa aacaaaggcc taattctatg cctgtcttaa ttttctttca 3540
cttaagttag ttccactgag accccaggct gttaggggtt attggtgtaa ggtactttca 3600
tattttaaac agaggatatc ggcatttgtt tctttctctg aggacaagag aaaaaagcca 3660
ggttccacag aggacacaga gaaggtttgg gtgtcctcct ggggttcttt ttgccaactt 3720
tccccacgtt aaaggtgaac attggttctt tcatttgctt tggaagtttt aatctctaac 3780
agtggacaaa gttaccagtg ccttaaactc tgttacactt tttggaagtg aaaactttgt 3840
agtatgatag gttattttga tgtaaagatg ttctggatac cattatatgt tccccctgtt 3900
tcagaggctc agattgtaat atgtaaatgg tatgtcattc gctactatga tttaatttga 3960
aatatggtct tttggttatg aatactttgc agcacagctg agaggctgtc tgttgtattc 4020
attgtggtca tagcacctaa caacattgta gcctcaatcg agtgagacag actagaagtt 4080
cctagtgatg gcttatgata gcaaatggcc tcatgtcaaa tatttagatg taattttggg 4140
taagaaatac agactggatg taccaccaac tactacctgt aatgacaggc ctgtccaaca 4200
catctccctt ttccatgact gtggtagcca gcatcggaaa gaacgctgat ttaaagaggt 4260
cgcttgggaa tttattgac acagtaccat ttaatgggga ggacaaaatg gggcagggga 4320
gggagaagtt tctgtcgtta aaaacagatt tggaaagact ggactctaaa gtctgttgat 4380
taaagatgag ctttgtctac ttcaaaagtt tgtttgctta ccccttcagc ctccaatttt 4440
ttaagtgaaa atatagctaa taacatgtga aaagaataga agctaaggtt tagataaata 4500
ttgagcagat ctataggaag attgaacctg aatattgcca ttatgcttga catggtttcc 4560
aaaaaatggt actccacata tttcagtgag ggtaagtatt ttcctgttgt caagaatagc 4620
attgtaaaag cattttgtaa taataaagaa tagctttaat gatatgcttg taactaaaat 4680
aattttgtaa tgtatcaaat acatttaaaa cattaaaata taatctctat aataatttaa 4740
aatctaatat ggttttaata gaacagcgat atcaagctta tcgataatca acctctggat 4800
tacaaaattt gtgaaagatt gactggtatt cttaactatg ttgctccttt tacgctatgt 4860
ggatacgctg ctttaatgcc tttgtatcat gctattgctt cccgtatggc tttcattttc 4920
tcctccttgt ataaatcctg gttgctgtct ctttatgagg agttgtggcc cgttgtcagg 4980
caacgtggcg tggtgtgcac tgtgtttgct gacgcaaccc ccactggttg gggcattgcc 5040
accacctgtc agctcctttc cgggactttc gctttccccc tccctattgc cacggcggaa 5100
ctcatcgccg cctgccttgc ccgctgctgg acaggggctc ggctgttggg cactgacaat 5160
tccgtggtgt tgtcgggggaa atcatcgtcc tttccttggc tgctcgccta tgttgccacc 5220
tggattctgc gcgggacgtc cttctgctac gtcccttcgg ccctcaatcc agcggacctt 5280
ccttcccgcg gcctgctgcc ggctctgcgg cctcttccgc gtcttcgcct tcgccctcag 5340
acgagtcgga tctccctttg ggccgcctcc ccgcgaattc atcgataccg agcgctgctc 5400
gagagatctg tgatagcggc catcaagctg gctgtgcctt ctagttgcca gccatctgtt 5460
gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc 5520
taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt 5580
ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggac 5640
acgtgcggac cgagcggccg caggaacccc tagtgatgga gttggccact ccctctctgc 5700
gcgctcgctc gctcactgag gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc 5760
gggcggcctc agtgagcgag cgagcgcgca gctgcctgca ggggcgcctg atgcggtatt 5820
ttctccttac gcatctgtgc ggtatttcac accgcatacg tcaaagcaac catagtacgc 5880
gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg tgaccgctac 5940
acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc tcgccacgtt 6000
cgccggcttt ccccgtcaag ctctaaatcg ggggctccct ttagggttcc gatttagtgc 6060
tttacggcac ctcgacccca aaaaacttga tttgggtgat ggttcacgta gtgggccatc 6120
gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta atagtggact 6180
cttgttccaa actggaacaa cactcaaccc tatctcgggc tattcttttg atttataagg 6240
gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa aatttaacgc 6300
gaattttaac aaaatattaa cgtttacaat tttatggtgc actctcagta caatctgctc 6360
tgatgccgca tagttaagcc agccccgaca cccgccaaca cccgctgacg cgccctgacg 6420
ggcttgtctg ctcccggcat ccgcttacag acaagctgtg accgtctccg ggagctgcat 6480
gtgtcagagg ttttcaccgt catcaccgaa acgcgcgaga cgaaagggcc tcgtgatacg 6540
cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt 6600
tcggggaaat gtgcgcggaa cccctatttg tttattttc taaatacatt caaatatgta 6660
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 6720
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 6780
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 6840
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 6900
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 6960
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 7020
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 7080
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 7140
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 7200
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca cca 7253
<210> 52
<211> 7057
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 52
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg ttctaggtag acaactaaga 1320
tgttcatctt atggtttaat gtttagttgt aaaggttgtt tgcttctcat ttggttccaa 1380
gaaagagtat ttaggccaat ttcagggaga aatatgtgta tagatatatt catatgtcaa 1440
actgattagt gctgaatgtc acatttccat attctaataa catttctagc aaagaagagg 1500
acacagtgaa gagagaattg cccgcattgt cattgtctct ttctgagcct agaacgccta 1560
acacttgggt gtggagagac tcagcctcaa ttcactttct agcagccact gagatgtgct 1620
tgcctggggt gccccctggc aggcagggct ggaactgctt tccagtaccc acacggactg 1680
tgaacgaatc tttctttgtg ctttgtgtac agaatggaag ttcaacaaat atttgttgaa 1740
tgtgtatgtc cttccaatac gcagcagccc agagcaaacg tggtaatctt gtgtgtgttc 1800
atgtgaaagc agaatttaat ggtgctttta agcaccaaag tttaagatgc acgagaaaac 1860
tgtatctcca ttttttcctt ttcgtttaca attacttgta taagccaggc acggtggtgg 1920
ctcacgcctg taatcccagc actttgggag gccgaggcgg gcggatcaca tgaggtcggg 1980
agttaattaa gacctcgaag gggacttggg gggttcgggg ctttcggggg cggtcggggg 2040
ttcgcggacc cgggaagctc tgaggaccca gaggccgggc gcgctccgcc cgcggcgccg 2100
ccccctccgt aactttccca gtctccgagg gaagaggcgg ggtgtggggt gcggttaaaa 2160
ggcgccacgg cgggagacag gtgttgcggc cccgcagcgc ccgcgcgctc ctctccccga 2220
ctcggagccc ctcggcggcg cccggcccag gacccgccta ggagcgcagg agccccagcg 2280
cagagacccc aacgccgaga cccccgcccc ggccccgccg cgcttcctcc cgacgcagag 2340
caaaccgccc agagtagaag ccatggtgag caagggcgag gagctgttca ccggggtggt 2400
gcccatcctg gtcgagctgg acggcgacgt aaacggccac aagttcagcg tgtccggcga 2460
gggcgagggc gatgccacct acggcaagct gaccctgaag ttcatctgca ccaccggcaa 2520
gctgcccgtg ccctggccca ccctcgtgac caccctgacc tacggcgtgc agtgcttcag 2580
ccgctacccc gaccacatga agcagcacga cttcttcaag tccgccatgc ccgaaggcta 2640
cgtccaggag cgcaccatct tcttcaagga cgacggcaac tacaagaccc gcgccgaggt 2700
gaagttcgag ggcgacaccc tggtgaaccg catcgagctg aagggcatcg acttcaagga 2760
ggacggcaac atcctggggc acaagctgga gtacaactac aacagccaca acgtctatat 2820
catggccgac aagcagaaga acggcatcaa ggtgaacttc aagatccgcc acaacatcga 2880
ggacggcagc gtgcagctcg ccgaccacta ccagcagaac acccccatcg gcgacggccc 2940
cgtgctgctg cccgacaacc actacctgag cacccagtcc gccctgagca aagaccccaa 3000
cgagaagcgc gatcacatgg tcctgctgga gttcgtgacc gccgccggga tcactctcgg 3060
catggacgag ctgtacaagt aaaggcgcgc cacccctgca gggaattccg cattgcccag 3120
ttgttagatt aagaaataga cagcatgaga gggatgaggc aacccgtgct cagctgtcaa 3180
ggctcagtcg ctagcatttc ccaacacaaa gattctgacc ttaaatgcaa ccatttgaaa 3240
cccctgtagg cctcaggtga aactccagat gccacaatgg agctctgctc ccctaaagcc 3300
tcaaaacaaa ggcctaattc tatgcctgtc ttaattttct ttcacttaag ttagttccac 3360
tgagaccccca ggctgttagg ggttattggt gtaaggtact ttcatatttt aaacagagga 3420
tatcggcatt tgtttctttc tctgaggaca agagaaaaaa gccaggttcc acagaggaca 3480
cagagaaggt ttgggtgtcc tcctggggtt ctttttgcca actttcccca cgttaaaggt 3540
gaacattggt tctttcattt gctttggaag ttttaatctc taacagtgga caaagttacc 3600
agtgccttaa actctgttac actttttgga agtgaaaact ttgtagtatg ataggttatt 3660
ttgatgtaaa gatgttctgg ataccattat atgttccccc tgtttcagag gctcagattg 3720
taatatgtaa atggtatgtc attcgctact atgatttaat ttgaaatatg gtcttttggt 3780
tatgaatact ttgcagcaca gctgagaggc tgtctgttgt attcattgtg gtcatagcac 3840
ctaacaacat tgtagcctca atcgagtgag acagactaga agttcctagt gatggcttat 3900
gatagcaaat ggcctcatgt caaatattta gatgtaattt tgtgtaagaa atacagactg 3960
gatgtaccac caactactac ctgtaatgac aggcctgtcc aacacatctc ccttttccat 4020
gactgtggta gccagcatcg gaaagaacgc tgatttaaag aggtcgcttg ggaattttat 4080
tgacacagta ccatttaatg gggaggacaa aatggggcag gggagggaga agtttctgtc 4140
gttaaaaaca gatttggaaa gactggactc taaagtctgt tgattaaaga tgagctttgt 4200
ctacttcaaa agtttgtttg cttacccctt cagcctccaa ttttttaagt gaaaatatag 4260
ctaataacat gtgaaaagaa tagaagctaa ggtttagata aatattgagc agatctatag 4320
gaagattgaa cctgaatatt gccattatgc ttgacatggt ttccaaaaaa tggtactcca 4380
catatttcag tgagggtaag tattttcctg ttgtcaagaa tagcattgta aaagcatttt 4440
gtaataataa agaatagctt taatgatatg cttgtaacta aaataatttt gtaatgtatc 4500
aaatacattt aaaacattaa aatataatct ctataataat ttaaaatcta atatggtttt 4560
aatagaacag cgatatcaag cttatcgata atcaacctct ggattacaaa atttgtgaaa 4620
gattgactgg tattcttaac tatgttgctc cttttacgct atgtggatac gctgctttaa 4680
tgcctttgta tcatgctatt gcttcccgta tggctttcat tttctcctcc ttgtataaat 4740
cctggttgct gtctctttat gaggagttgt ggcccgttgt caggcaacgt ggcgtggtgt 4800
gcactgtgtt tgctgacgca acccccactg gttggggcat tgccaccacc tgtcagctcc 4860
tttccgggac tttcgctttc cccctcccta ttgccacggc ggaactcatc gccgcctgcc 4920
ttgcccgctg ctggacaggg gctcggctgt tgggcactga caattccgtg gtgttgtcgg 4980
ggaaatcatc gtcctttcct tggctgctcg cctatgttgc cacctggatt ctgcgcggga 5040
cgtccttctg ctacgtccct tcggccctca atccagcgga ccttccttcc cgcggcctgc 5100
tgccggctct gcggcctctt ccgcgtcttc gccttcgccc tcagacgagt cggatctccc 5160
tttgggccgc ctccccgcga attcatcgat accgagcgct gctcgagaga tctgtgatag 5220
cggccatcaa gctggctgtg ccttctagtt gccagccatc tgttgtttgc ccctcccccg 5280
tgccttcctt gaccctggaa ggtgccactc ccactgtcct ttcctaataa aatgaggaaa 5340
ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg gggtggggtg gggcaggaca 5400
gcaaggggga ggattgggaa gacaatagca ggcatgctgg ggacacgtgc ggaccgagcg 5460
gccgcaggaa cccctagtga tggagttggc cactccctct ctgcgcgctc gctcgctcac 5520
tgaggccggg cgaccaaagg tcgcccgacg cccgggcttt gcccgggcgg cctcagtgag 5580
cgagcgagcg cgcagctgcc tgcaggggcg cctgatgcgg tattttctcc ttacgcatct 5640
gtgcggtatt tcacaccgca tacgtcaaag caaccatagt acgcgccctg tagcggcgca 5700
ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg ctacacttgc cagcgcccta 5760
gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca cgttcgccgg ctttccccgt 5820
caagctctaa atcgggggct ccctttaggg ttccgattta gtgctttacg gcacctcgac 5880
cccaaaaaac ttgatttggg tgatggttca cgtagtgggc catcgccctg atagacggtt 5940
tttcgccctt tgacgttgga gtccacgttc tttaataggg gactcttgtt ccaaactgga 6000
acaacactca accctatctc gggctattct tttgatttat aagggatttt gccgatttcg 6060
gcctattggt taaaaaatga gctgatttaa caaaaattta acgcgaattt taacaaaata 6120
ttaacgttta caattttatg gtgcactctc agtacaatct gctctgatgc cgcatagtta 6180
agccagcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg 6240
gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca 6300
ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga tacgcctatt tttataggtt 6360
aatgtcatga taataatggt ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc 6420
ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa 6480
taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc 6540
cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa 6600
acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa 6660
ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg 6720
atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga cgccgggcaa 6780
gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc 6840
acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc 6900
atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta 6960
accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag 7020
ctgaatgaag ccataccaaa cgacgagcgt gacacca 7057
<210> 53
<211> 212
<212> DNA
<213> Homo sapiens
<400> 53
ggggtgcggt taaaaggcgc cacggcggga gacaggtgtt gcggccccgc agcgcccgcg 60
cgctcctctc cccgactcgg agcccctcgg cggcgcccgg cccaggaccc gcctaggagc 120
gcaggagccc cagcgcagag accccaacgc cgagacccccc gccccggccc cgccgcgctt 180
cctcccgacg cagagcaaac cgcccagagt ag 212
<210> 54
<211> 784
<212> DNA
<213> Homo sapiens
<400> 54
aagcaggtga gtttgtggtg tcgccgatgt cccttcgggg tactctagcg cagccgcctg 60
gctacttgac ccactgccac caaacgtttt aaattcaccg aaagcttagc ttcgaagcaa 120
agctccgttt cgccggtgaa gcaggaagcc ttcgctgcag gaactgacct ttacctcttg 180
gagcggcttc tgcagaaaaa tccccgggca gagatttggg cggagtttgc ctagaactaa 240
cgcggagcca gccgatcccg gcctaccccg gggccaagat tttaaggggt gaagagtccc 300
ttttgccttt tctggatcct ggtgattcac ctagtgtctt ccctaaggaa ctgaaccaac 360
tcctccgctg gcctctggca gccctccagg cggtgcagga tggcgtgggc ccggtaggaa 420
gctgcatgta accgcccagg gtcgggaggc caggagggca gctcctcctc tgacttgaat 480
attgaaaaca agaggatgct tttaagaaaa agaagaagga ggattcacta ccagctctga 540
agggtgggaaa agagatgatt catccggatt gtggagaggg tggaatcttg tttaggagag 600
cgttggttgt ggcaggcagg gtgtaactat gaatcagtga agacaattca catcctggga 660
tgaaaagaag gccatgggct cacaggagat tatccactgg cctctccaca tccgcttgca 720
gtaaggagtg tgggactctc ccaagcttca gcgctgaact gcaatgcagt gacgtcgctt 780
aaga 784
<210> 55
<211> 771
<212> DNA
<213> Homo sapiens
<400> 55
tcatccatgt ccctacaaag gacatgaact catcattttt tatggctgca taagtcgttc 60
tttcaaacac cctgcagtca gcttctcctc acgagaaacc acatgaaagc cctcggggaa 120
atgcctctcg ggatctactt ttctttgtgt gtatcctact tagcctatcg gtttctgctt 180
cctgtggggc tacagccgtc tcgtcttttt ctgctggctc ctttgctctg ttctccagtg 240
gctatcttct ttctcctttc tttcaaatgt tctccccttat cttctctgat acagacagaa 300
ggtcaggagc cacgcccatt acactgacag aacccgatgt cctgatgcgc tctgtgcctc 360
ccagatttgg atgtggatgc gaggcgagct ggccagagag caatcatttc agcgagggtc 420
gttatccca tcttctctct taggacggag gtagggggac ttctggcccc aaatgttcct 480
tcttccagct gtggctgcct ccatcccgca gagtgagcct ttaatttgga gatcctaatg 540
ccccagtgct gtgccaggca cagtacacgt tctgcatgga ggacggttta cgctcccctt 600
acagaagagg aaggacactc agaaggctga actgttctgc ctaaggtcac cgagttgcta 660
aggcaagaag cagcctccaa ttcctgcctt actgatttct gggatgtgaa accaaaaggg 720
tgaggcggca agccccggct gccctcgggg gctcttccca agtgctctct t 771
<210> 56
<211> 771
<212> DNA
<213> Homo sapiens
<400> 56
aagagagcac ttgggaagag cccccgaggg cagccggggc ttgccgcctc acccttttgg 60
tttcacatcc cagaaatcag taaggcagga attggaggct gcttcttgcc ttagcaactc 120
ggtgacctta ggcagaacag ttcagccttc tgagtgtcct tcctcttctg taaggggagc 180
gtaaaccgtc ctccatgcag aacgtgtact gtgcctggca cagcactggg gcattaggat 240
ctccaaatta aaggctcact ctgcgggatg gaggcagcca cagctggaag aaggaacatt 300
tggggccaga agtcccccta cctccgtcct aagagagaag atgggaataa cgaccctcgc 360
tgaaatgatt gctctctggc cagctcgcct cgcatccaca tccaaatctg ggaggcacag 420
agcgcatcag gacatcgggt tctgtcagtg taatgggcgt ggctcctgac cttctgtctg 480
tatcagagaa gataagggag aacatttgaa agaaaggaga aagaagatag ccactggaga 540
acagagcaaa ggagccagca gaaaaagacg agacggctgt agccccacag gaagcagaaa 600
ccgataggct aagtaggata cacacaaaga aaagtagatc ccgagaggca tttccccgag 660
ggctttcatg tggtttctcg tgaggagaag ctgactgcag ggtgtttgaa agaacgactt 720
atgcagccat aaaaaatgat gagttcatgt cctttgtagg gacatggatg a 771
<210> 57
<211> 699
<212> DNA
<213> Homo sapiens
<400> 57
cttgcttacc cagactcaga gaagtctccc tgttctgtcc tagctagtga ttcctgtgtt 60
gtgtgcattc gtcttttcca gagcaaaccg cccagagtag aagatggatt ggggcacgct 120
gcagacgatc ctggggggtg tgaacaaaca ctccaccagc attggaaaga tctggctcac 180
cgtcctcttc atttttcgca ttatgatcct cgttgtggct gcaaaggagg tgtggggaga 240
tgagcaggcc gactttgtct gcaacaccct gcagccaggc tgcaagaacg tgtgctacga 300
tcactacttc cccatctccc acatccggct atgggccctg cagctgatct tcgtgtccac 360
gccagcgctc ctagtggcca tgcacgtggc ctaccggaga catgagaaga agaggaagtt 420
catcaagggg gagataaaga gtgaatttaa ggacatcgag gagatcaaaa cccagaaggt 480
ccgcatcgaa ggctccctgt ggtggaccta cacaagcagc atcttcttcc gggtcatctt 540
cgaagccgcc ttcatgtacg tcttctatgt catgtacgac ggcttctcca tgcagcggct 600
ggtgaagtgc aacgcctggc cttgtcccaa cactgtggac tgctttgtgt cccggcccac 660
ggagaagact gtcttcacag tgttcatgat tgcagtgtc 699
<210> 58
<211> 699
<212> DNA
<213> Homo sapiens
<400> 58
gacactgcaa tcatgaacac tgtgaagaca gtcttctccg tgggccggga cacaaagcag 60
tccacagtgt tgggacaagg ccaggcgttg cacttcacca gccgctgcat ggagaagccg 120
tcgtacatga catagaagac gtacatgaag gcggcttcga agatgacccg gaagaagatg 180
ctgcttgtgt aggtccacca cagggagcct tcgatgcgga ccttctgggt tttgatctcc 240
tcgatgtcct taaattcact ctttatctcc cccttgatga acttcctctt cttctcatgt 300
ctccggtagg ccacgtgcat ggccactagg agcgctggcg tggacacgaa gatcagctgc 360
agggcccata gccggatgtg ggagatgggg aagtagtgat cgtagcacac gttcttgcag 420
cctggctgca gggtgttgca gacaaagtcg gcctgctcat ctccccacac ctcctttgca 480
gccacaacga ggatcataat gcgaaaaatg aagaggacgg tgagccagat ctttccaatg 540
ctggtggagt gtttgttcac accccccagg atcgtctgca gcgtgcccca atccatcttc 600
tactctgggc ggtttgctct ggaaaagacg aatgcacaca acacaggaat cactagctag 660
gacagaacag ggagacttct ctgagtctgg gtaagcaag 699
<210> 59
<211> 700
<212> DNA
<213> Homo sapiens
<400> 59
gcctgacaca gtctgagcct cctcaggcgg cctcaggggt tgggatagag tggagaattc 60
aggcaagaat gccaacccta gctccaggcc tgggacccac aggcctgggg aaaagagtgg 120
ttgccccgtc ttgagacagc cgaaaactgt gtccccagga ttgttggttt cataaaagca 180
agtagctagg gaggccacat ttacagggga tcacagaaca cttgggtagg ggcttgctgt 240
aggtgtcatc agggaagtgg gggacggcag gagggatgtg gcccagtacg cagatgaaga 300
caggtgatca tccgctgggc cacacgtggc agggatatgg gcagagtgag cttggctggc 360
cccaggctcc aaagctgccc agcccccgct gaaggtgagg cctcagctgg tgggaatgtc 420
accttccagg tgactggctg gctccaaagg cctttgcatg atctccagga gtttggaggg 480
gagaggccac attccaaatc cagcttgaaa agtgctctgt atcaccctca gcactgaggg 540
ggccagagtc taggaggaag gaggcacagg gttggggggc agccctgacc tggtggccgc 600
acctgccagg tcccgagaga caacccatct cacacacatt caaaaacaca caccagggag 660
cacatggcta aacaaatcgc actaaacgcc aggaaggcag 700
<210> 60
<211> 700
<212> DNA
<213> Homo sapiens
<400> 60
ctgccttcct ggcgtttagt gcgatttgtt tagccatgtg ctccctggtg tgtgtttttg 60
aatgtgtgtg agatgggttg tctctcggga cctggcaggt gcggccacca ggtcagggct 120
gccccccaac cctgtgcctc cttcctccta gactctggcc ccctcagtgc tgagggtgat 180
acagagcact tttcaagctg gatttggaat gtggcctctc ccctccaaac tcctggagat 240
catgcaaagg cctttggagc cagccagtca cctggaaggt gacattccca ccagctgagg 300
cctcaccttc agcgggggct gggcagcttt ggagcctggg gccagccaag ctcactctgc 360
ccatatccct gccacgtgg gcccagcgga tgatcacctg tcttcatctg cgtactgggc 420
cacatccctc ctgccgtccc ccacttccct gatgacacct acagcaagcc cctacccaag 480
tgttctgtga tcccctgtaa atgtggcctc cctagctact tgcttttatg aaaccaacaa 540
tcctggggac acagttttcg gctgtctcaa gacggggcaa ccactctttt ccccaggcct 600
gtgggtccca ggcctggagc tagggttggc attcttgcct gaattctcca ctctatccca 660
acccctgagg ccgcctgagg aggctcagac tgtgtcaggc 700
<210> 61
<211> 6374
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 61
ttgctggcct tttgctcaca tgtcctgcag gcagctgcgc gctcgctcgc tcactgaggc 60
cgcccgggca aagcccgggc gtcgggcgac ctttggtcgc ccggcctcag tgagcgagcg 120
agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctgcggcc gcacgcgttt 180
aattaagacc tcgaagggga cttggggggt tcggggcttt cgggggcggt cgggggttcg 240
cggacccggg aagctctgag gacccagagg ccgggcgcgc tccgcccgcg gcgccgcccc 300
ctccgtaact ttcccagtct ccgagggaag aggcggggtg tggggtgcgg ttaaaaggcg 360
ccacggcggg agacaggtgt tgcggccccg cagcgcccgc gcgctcctct ccccgactcg 420
gagcccctcg gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc ccagcgcaga 480
gaccccaacg ccgagacccc cgccccggcc ccgccgcgct tcctcccgac gcagagcaaa 540
ccgcccagag tagaagcgga tccgccacca tggattgggg cacgctgcag acgatcctgg 600
ggggtgtgaa caaacactcc accagcattg gaaagatctg gctcaccgtc ctcttcattt 660
ttcgcattat gatcctcgtt gtggctgcaa aggaggtgtg gggagatgag caggccgact 720
ttgtctgcaa caccctgcag ccaggctgca agaacgtgtg ctacgatcac tacttcccca 780
tctcccacat ccggctatgg gccctgcagc tgatcttcgt gtccacgcca gcgctcctag 840
tggccatgca cgtggcctac cggagacatg agaagaagag gaagttcatc aagggggaga 900
taaagagtga atttaaggac atcgaggaga tcaaaaccca gaaggtccgc atcgaaggct 960
ccctgtggtg gacctacaca agcagcatct tcttccgggt catcttcgaa gccgccttca 1020
tgtacgtctt ctatgtcatg tacgacggct tctccatgca gcggctggtg aagtgcaacg 1080
cctggccttg tcccaacact gtggactgct ttgtgtcccg gcccacggag aagactgtct 1140
tcacagtgtt catgattgca gtgtctggaa tttgcatcct gctgaatgtc actgaattgt 1200
gttattgct aattagatat tgttctggga agtcaaaaaa gccagtttac ccatacgatg 1260
ttccagatta cgcttaaggc gcgccacccc tgcagggaat tccgcattgc ccagttgtta 1320
gattaagaaa tagacagcat gagagggatg aggcaacccg tgctcagctg tcaaggctca 1380
gtcgctagca tttcccaaca caaagattct gaccttaaat gcaaccattt gaaacccctg 1440
taggcctcag gtgaaactcc agatgccaca atggagctct gctcccctaa agcctcaaaa 1500
caaaggccta attctatgcc tgtcttaatt ttctttcact taagttagtt ccactgagac 1560
cccaggctgt taggggttat tggtgtaagg tactttcata ttttaaacag aggatatcgg 1620
catttgtttc tttctctgag gacaagagaa aaaagccagg ttccacagag gacacagaga 1680
aggtttgggt gtcctcctgg ggttcttttt gccaactttc cccacgttaa aggtgaacat 1740
tggttctttc atttgctttg gaagttttaa tctctaacag tggacaaagt taccagtgcc 1800
ttaaactctg ttacactttt tggaagtgaa aactttgtag tatgataggt tattttgatg 1860
taaagatgtt ctggatacca ttatatgttc cccctgtttc agaggctcag attgtaatat 1920
gtaaatggta tgtcattcgc tactatgatt taatttgaaa tatggtcttt tggttatgaa 1980
tactttgcag cacagctgag aggctgtctg ttgtattcat tgtggtcata gcacctaaca 2040
acattgtagc ctcaatcgag tgagacagac tagaagttcc tagtgatggc ttatgatagc 2100
aaatggcctc atgtcaaata tttagatgta attttgtgta agaaatacag actggatgta 2160
ccaccaacta ctacctgtaa tgacaggcct gtccaacaca tctccctttt ccatgactgt 2220
ggtagccagc atcggaaaga acgctgattt aaagaggtcg cttgggaatt ttattgacac 2280
agtaccattt aatggggagg acaaaatggg gcaggggagg gagaagtttc tgtcgttaaa 2340
aacagatttg gaaagactgg actctaaagt ctgttgatta aagatgagct ttgtctactt 2400
caaaagtttg tttgcttacc ccttcagcct ccaatttttt aagtgaaaat atagctaata 2460
acatgtgaaa agaatagaag ctaaggttta gataaatatt gagcagatct ataggaagat 2520
tgaacctgaa tattgccatt atgcttgaca tggtttccaa aaaatggtac tccacatatt 2580
tcagtgaggg taagtatttt cctgttgtca agaatagcat tgtaaaagca ttttgtaata 2640
ataaagaata gctttaatga tatgcttgta actaaaataa ttttgtaatg tatcaaatac 2700
atttaaaaca ttaaaatata atctctataa taatttaaaa tctaatatgg ttttaataga 2760
acagcgatat caagcttatc gataatcaac ctctggatta caaaatttgt gaaagattga 2820
ctggtattct taactatgtt gctcctttta cgctatgtgg atacgctgct ttaatgcctt 2880
tgtatcatgc tattgcttcc cgtatggctt tcattttctc ctccttgtat aaatcctggt 2940
tgctgtctct ttatgaggag ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg 3000
tgtttgctga cgcaaccccc actggttggg gcattgccac cacctgtcag ctcctttccg 3060
ggactttcgc tttccccctc cctattgcca cggcggaact catcgccgcc tgccttgccc 3120
gctgctggac aggggctcgg ctgttgggca ctgacaattc cgtggtgttg tcggggaaat 3180
catcgtcctt tccttggctg ctcgcctatg ttgccacctg gattctgcgc gggacgtcct 3240
tctgctacgt cccttcggcc ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg 3300
ctctgcggcc tcttccgcgt cttcgccttc gccctcagac gagtcggatc tccctttggg 3360
ccgcctcccc gcgaattcat cgataccgag cgctgctcga gagatctgtg atagcggcca 3420
tcaagctggc tgtgccttct agttgccagc catctgttgt ttgcccctcc cccgtgcctt 3480
ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag gaaattgcat 3540
cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag gacagcaagg 3600
gggaggattg ggaagacaat agcaggcatg ctggggacac gtgcggaccg agcggccgca 3660
ggaaccccta gtgatggagt tggccactcc ctctctgcgc gctcgctcgc tcactgaggc 3720
cgggcgacca aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg 3780
agcgcgcagc tgcctgcagg ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg 3840
tatttcacac cgcatacgtc aaagcaacca tagtacgcgc cctgtagcgg cgcattaagc 3900
gcggcgggtg tggtggttac gcgcagcgtg accgctacac ttgccagcgc cctagcgccc 3960
gctcctttcg ctttcttccc ttcctttctc gccacgttcg ccggctttcc ccgtcaagct 4020
ctaaatcggg ggctcccttt agggttccga tttagtgctt tacggcacct cgaccccaaa 4080
aaacttgatt tgggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc 4140
cctttgacgt tggagtccac gttctttaat agtggactct tgttccaaac tggaacaaca 4200
ctcaacccta tctcgggcta ttcttttgat ttataaggga ttttgccgat ttcggcctat 4260
tggttaaaaa atgagctgat ttaacaaaaa tttaacgcga attttaacaa aatattaacg 4320
tttacaattt tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag 4380
ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct cccggcatcc 4440
gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca 4500
tcaccgaaac gcgcgagacg aaagggcctc gtgatacgcc tatttttata ggttaatgtc 4560
atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt gcgcggaacc 4620
cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag acaataaccc 4680
tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc 4740
gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc agaaacgctg 4800
gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat cgaactggat 4860
ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc aatgatgagc 4920
acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg gcaagagcaa 4980
ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc agtcacagaa 5040
aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat aaccatgagt 5100
gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga gctaaccgct 5160
tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc ggagctgaat 5220
gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc aacaacgttg 5280
cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt aatagactgg 5340
atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc tggctggttt 5400
attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc agcactgggg 5460
ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca ggcaactatg 5520
gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca ttggtaactg 5580
tcagaccaag tttactcata tatactttag attgatttaa aacttcattt ttaatttaaa 5640
aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta acgtgagttt 5700
tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt 5760
tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt 5820
ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag 5880
ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta 5940
gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat 6000
aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg 6060
ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg 6120
agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac 6180
aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga 6240
aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt 6300
ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta 6360
cggttcctgg cctt 6374
<210> 62
<211> 6347
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 62
cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 60
aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 120
attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 180
tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 240
aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 300
gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 360
cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 420
gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca 480
agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 540
ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 600
cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 660
acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 720
gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 780
ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 840
agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 900
cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgtcc tgcaggcagc 960
tgcgcgctcg ctcgctcact gaggccgccc gggcaaagcc cgggcgtcgg gcgacctttg 1020
gtcgcccggc ctcagtgagc gagcgagcgc gcagagaggg agtggccaac tccatcacta 1080
ggggttcctg cggccgcacg cgtttaatta agacctcgaa ggggacttgg ggggttcggg 1140
gctttcgggg gcggtcgggg gttcgcggac ccgggaagct ctgaggaccc agaggccggg 1200
cgcgctccgc ccgcggcgcc gccccctccg taactttccc agtctccgag ggaagaggcg 1260
gggtgtgggg tgcggttaaa aggcgccacg gcgggagaca ggtgttgcgg ccccgcagcg 1320
cccgcgcgct cctctccccg actcggagcc cctcggcggc gcccggccca ggacccgcct 1380
aggagcgcag gagccccagc gcagagaccc caacgccgag acccccgccc cggccccgcc 1440
gcgcttcctc ccgacgcaga gcaaaccgcc cagagtagaa gcggatccgc caccatggat 1500
tggggcacac tccagagcat cctcgggggt gtcaacaaac actccaccag cattggaaag 1560
atctggctca cggtcctctt catcttccgc atcatgatcc tcgtggtggc tgcaaaggag 1620
gtgtggggag atgagcaagc cgattttgtc tgcaacacgc tccagcctgg ctgcaagaat 1680
gtatgctacg accaccactt ccccatctct cacatccggc tctgggctct gcagctgatc 1740
atggtgtcca cgccagccct cctggtagct atgcatgtgg cctaccggag acatgaaaag 1800
aaacggaagt tcatgaaggg agagataaag aacgagttta aggacatcga agagatcaaa 1860
acccagaagg tccgtatcga agggtccctg tggtggacct acaccaccag catcttcttc 1920
cgggtcatct ttgaagccgt cttcatgtac gtcttttaca tcatgtacaa tggcttcttc 1980
atgcaacgtc tggtgaaatg caacgcttgg ccctgcccca atacagtgga ctgcttcatt 2040
tccaggccca cagaaaagac tgtcttcacc gtgtttatga tttctgtgtc tggaatttgc 2100
attctgctaa atatcacaga gctgtgctat ttgttcgtta ggtattgctc aggaaagtcc 2160
aaaagaccag tctaaggcgc gccacccctg cagggaattc cgcattgccc agttgttaga 2220
ttaagaaata gacagcatga gagggatgag gcaacccgtg ctcagctgtc aaggctcagt 2280
cgctagcatt tcccaacaca aagattctga ccttaaatgc aaccatttga aacccctgta 2340
ggcctcaggt gaaactccag atgccacaat ggagctctgc tcccctaaag cctcaaaaca 2400
aaggcctaat tctatgcctg tcttaatttt ctttcactta agttagttcc actgagaccc 2460
caggctgtta ggggttattg gtgtaaggta ctttcatatt ttaaacagag gatatcggca 2520
tttgtttctt tctctgagga caagagaaaa aagccaggtt ccacagagga cacagagaag 2580
gtttgggtgt cctcctgggg ttctttttgc caactttccc cacgttaaag gtgaacattg 2640
gttctttcat ttgctttgga agttttaatc tctaacagtg gacaaagtta ccagtgcctt 2700
aaactctgtt acactttttg gaagtgaaaa ctttgtagta tgataggtta ttttgatgta 2760
aagatgttct ggataccatt atatgttccc cctgtttcag aggctcagat tgtaatatgt 2820
aaatggtatg tcattcgcta ctatgattta atttgaaata tggtcttttg gttatgaata 2880
ctttgcagca cagctgagag gctgtctgtt gtattcattg tggtcatagc acctaacaac 2940
attgtagcct caatcgagtg agacagacta gaagttccta gtgatggctt atgatagcaa 3000
atggcctcat gtcaaatatt tagatgtaat tttgtgtaag aaatacagac tggatgtacc 3060
accaactact acctgtaatg acaggcctgt ccaacacatc tcccttttcc atgactgtgg 3120
tagccagcat cggaaagaac gctgatttaa agaggtcgct tgggaatttt attgacacag 3180
taccatttaa tggggaggac aaaatggggc aggggaggga gaagtttctg tcgttaaaaa 3240
cagatttgga aagactggac tctaaagtct gttgattaaa gatgagcttt gtctacttca 3300
aaagtttgtt tgcttacccc ttcagcctcc aattttttaa gtgaaaatat agctaataac 3360
atgtgaaaag aatagaagct aaggtttaga taaatattga gcagatctat aggaagattg 3420
aacctgaata ttgccattat gcttgacatg gtttccaaaa aatggtactc cacatatttc 3480
agtgagggta agtattttcc tgttgtcaag aatagcattg taaaagcatt ttgtaataat 3540
aaagaatagc tttaatgata tgcttgtaac taaaataatt ttgtaatgta tcaaatacat 3600
ttaaaacatt aaaatataat ctctataata atttaaaatc taatatggtt ttaatagaac 3660
agcgatatca agcttatcga taatcaacct ctggattaca aaatttgtga aagattgact 3720
ggtattctta actatgttgc tccttttacg ctatgtggat acgctgcttt aatgcctttg 3780
tatcatgcta ttgcttcccg tatggctttc attttctcct ccttgtataa atcctggttg 3840
ctgtctcttt atgaggagtt gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg 3900
tttgctgacg caacccccac tggttggggc attgccacca cctgtcagct cctttccggg 3960
actttcgctt tccccctccc tattgccacg gcggaactca tcgccgcctg ccttgcccgc 4020
tgctggacag gggctcggct gttgggcact gacaattccg tggtgttgtc ggggaaatca 4080
tcgtcctttc cttggctgct cgcctatgtt gccacctgga ttctgcgcgg gacgtccttc 4140
tgctacgtcc cttcggccct caatccagcg gaccttcctt cccgcggcct gctgccggct 4200
ctgcggcctc ttccgcgtct tcgccttcgc cctcagacga gtcggatctc cctttgggcc 4260
gcctccccgc gaattcatcg ataccgagcg ctgctcgaga gatctgtgat agcggccatc 4320
aagctggctg tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc 4380
ttgaccctgg aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg 4440
cattgtctga gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg 4500
gaggattggg aagacaatag caggcatgct ggggacacgt gcggaccgag cggccgcagg 4560
aacccctagt gatggagttg gccactccct ctctgcgcgc tcgctcgctc actgaggccg 4620
ggcgaccaaa ggtcgcccga cgcccgggct ttgcccgggc ggcctcagtg agcgagcgag 4680
cgcgcagctg cctgcagggg cgcctgatgc ggtattttct ccttacgcat ctgtgcggta 4740
tttcacaccg catacgtcaa agcaaccata gtacgcgccc tgtagcggcg cattaagcgc 4800
ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc 4860
tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct 4920
aaatcggggg ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa 4980
acttgatttg ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg tttttcgccc 5040
tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact 5100
caaccctatc tcgggctatt cttttgattt ataagggatt ttgccgattt cggcctattg 5160
gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt 5220
tacaatttta tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagcc 5280
ccgaccccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc 5340
ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc 5400
accgaaacgc gcgagacgaa agggcctcgt gatacgccta tttttatagg ttaatgtcat 5460
gataataatg gtttcttaga cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc 5520
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 5580
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 5640
ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 5700
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 5760
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 5820
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 5880
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 5940
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 6000
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 6060
tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 6120
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 6180
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 6240
ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 6300
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattg 6347
<210> 63
<211> 6347
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 63
cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 60
aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 120
attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 180
tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 240
aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 300
gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 360
cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 420
gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca 480
agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 540
ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 600
cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 660
acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 720
gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 780
ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 840
agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 900
cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgtcc tgcaggcagc 960
tgcgcgctcg ctcgctcact gaggccgccc gggcaaagcc cgggcgtcgg gcgacctttg 1020
gtcgcccggc ctcagtgagc gagcgagcgc gcagagaggg agtggccaac tccatcacta 1080
ggggttcctg cggccgcacg cgtttaatta agacctcgaa ggggacttgg ggggttcggg 1140
gctttcgggg gcggtcgggg gttcgcggac ccgggaagct ctgaggaccc agaggccggg 1200
cgcgctccgc ccgcggcgcc gccccctccg taactttccc agtctccgag ggaagaggcg 1260
gggtgtgggg tgcggttaaa aggcgccacg gcgggagaca ggtgttgcgg ccccgcagcg 1320
cccgcgcgct cctctccccg actcggagcc cctcggcggc gcccggccca ggacccgcct 1380
aggagcgcag gagccccagc gcagagaccc caacgccgag acccccgccc cggccccgcc 1440
gcgcttcctc ccgacgcaga gcaaaccgcc cagagtagaa gcggatccgc caccatggat 1500
tggggcacgc tgcagacgat cctggggggt gtgaacaaac actccaccag cattggaaag 1560
atctggctca ccgtcctctt catttttcgc attatgatcc tcgttgtggc tgcaaaggag 1620
gtgtggggag atgagcaggc cgactttgtc tgcaacaccc tgcagccagg ctgcaagaac 1680
gtgtgctacg atcactactt ccccatctcc cacatccggc tatgggccct gcagctgatc 1740
ttcgtgtcca cgccagcgct cctagtggcc atgcacgtgg cctaccggag acatgagaag 1800
aagaggaagt tcatcaaggg ggagataaag agtgaattta aggacatcga ggagatcaaa 1860
acccagaagg tccgcatcga aggctccctg tggtggacct acacaagcag catcttcttc 1920
cgggtcatct tcgaagccgc cttcatgtac gtcttctatg tcatgtacga cggcttctcc 1980
atgcagcggc tggtgaagtg caacgcctgg ccttgtccca acactgtgga ctgctttgtg 2040
tcccggccca cggagaagac tgtcttcaca gtgttcatga ttgcagtgtc tggaatttgc 2100
atcctgctga atgtcactga attgtgttat ttgctaatta gatattgttc tgggaagtca 2160
aaaaagccag tttaaggcgc gccacccctg cagggaattc cgcattgccc agttgttaga 2220
ttaagaaata gacagcatga gagggatgag gcaacccgtg ctcagctgtc aaggctcagt 2280
cgctagcatt tcccaacaca aagattctga ccttaaatgc aaccatttga aacccctgta 2340
ggcctcaggt gaaactccag atgccacaat ggagctctgc tcccctaaag cctcaaaaca 2400
aaggcctaat tctatgcctg tcttaatttt ctttcactta agttagttcc actgagaccc 2460
caggctgtta ggggttattg gtgtaaggta ctttcatatt ttaaacagag gatatcggca 2520
tttgtttctt tctctgagga caagagaaaa aagccaggtt ccacagagga cacagagaag 2580
gtttgggtgt cctcctgggg ttctttttgc caactttccc cacgttaaag gtgaacattg 2640
gttctttcat ttgctttgga agttttaatc tctaacagtg gacaaagtta ccagtgcctt 2700
aaactctgtt acactttttg gaagtgaaaa ctttgtagta tgataggtta ttttgatgta 2760
aagatgttct ggataccatt atatgttccc cctgtttcag aggctcagat tgtaatatgt 2820
aaatggtatg tcattcgcta ctatgattta atttgaaata tggtcttttg gttatgaata 2880
ctttgcagca cagctgagag gctgtctgtt gtattcattg tggtcatagc acctaacaac 2940
attgtagcct caatcgagtg agacagacta gaagttccta gtgatggctt atgatagcaa 3000
atggcctcat gtcaaatatt tagatgtaat tttgtgtaag aaatacagac tggatgtacc 3060
accaactact acctgtaatg acaggcctgt ccaacacatc tcccttttcc atgactgtgg 3120
tagccagcat cggaaagaac gctgatttaa agaggtcgct tgggaatttt attgacacag 3180
taccatttaa tggggaggac aaaatggggc aggggaggga gaagtttctg tcgttaaaaa 3240
cagatttgga aagactggac tctaaagtct gttgattaaa gatgagcttt gtctacttca 3300
aaagtttgtt tgcttacccc ttcagcctcc aattttttaa gtgaaaatat agctaataac 3360
atgtgaaaag aatagaagct aaggtttaga taaatattga gcagatctat aggaagattg 3420
aacctgaata ttgccattat gcttgacatg gtttccaaaa aatggtactc cacatatttc 3480
agtgagggta agtattttcc tgttgtcaag aatagcattg taaaagcatt ttgtaataat 3540
aaagaatagc tttaatgata tgcttgtaac taaaataatt ttgtaatgta tcaaatacat 3600
ttaaaacatt aaaatataat ctctataata atttaaaatc taatatggtt ttaatagaac 3660
agcgatatca agcttatcga taatcaacct ctggattaca aaatttgtga aagattgact 3720
ggtattctta actatgttgc tccttttacg ctatgtggat acgctgcttt aatgcctttg 3780
tatcatgcta ttgcttcccg tatggctttc attttctcct ccttgtataa atcctggttg 3840
ctgtctcttt atgaggagtt gtggcccgtt gtcaggcaac gtggcgtggt gtgcactgtg 3900
tttgctgacg caacccccac tggttggggc attgccacca cctgtcagct cctttccggg 3960
actttcgctt tccccctccc tattgccacg gcggaactca tcgccgcctg ccttgcccgc 4020
tgctggacag gggctcggct gttgggcact gacaattccg tggtgttgtc ggggaaatca 4080
tcgtcctttc cttggctgct cgcctatgtt gccacctgga ttctgcgcgg gacgtccttc 4140
tgctacgtcc cttcggccct caatccagcg gaccttcctt cccgcggcct gctgccggct 4200
ctgcggcctc ttccgcgtct tcgccttcgc cctcagacga gtcggatctc cctttgggcc 4260
gcctccccgc gaattcatcg ataccgagcg ctgctcgaga gatctgtgat agcggccatc 4320
aagctggctg tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc 4380
ttgaccctgg aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg 4440
cattgtctga gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg 4500
gaggattggg aagacaatag caggcatgct ggggacacgt gcggaccgag cggccgcagg 4560
aacccctagt gatggagttg gccactccct ctctgcgcgc tcgctcgctc actgaggccg 4620
ggcgaccaaa ggtcgcccga cgcccgggct ttgcccgggc ggcctcagtg agcgagcgag 4680
cgcgcagctg cctgcagggg cgcctgatgc ggtattttct ccttacgcat ctgtgcggta 4740
tttcacaccg catacgtcaa agcaaccata gtacgcgccc tgtagcggcg cattaagcgc 4800
ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc 4860
tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct 4920
aaatcggggg ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa 4980
acttgatttg ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg tttttcgccc 5040
tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact 5100
caaccctatc tcgggctatt cttttgattt ataagggatt ttgccgattt cggcctattg 5160
gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt 5220
tacaatttta tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagcc 5280
ccgaccccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc 5340
ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc 5400
accgaaacgc gcgagacgaa agggcctcgt gatacgccta tttttatagg ttaatgtcat 5460
gataataatg gtttcttaga cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc 5520
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 5580
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 5640
ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 5700
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 5760
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 5820
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 5880
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 5940
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 6000
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 6060
tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 6120
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 6180
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 6240
ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 6300
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattg 6347
<210> 64
<211> 7150
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 64
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taagagagca cttgggaaga 1320
gccccccgagg gcagccgggg cttgccgcct cacccttttg gtttcacatc ccagaaatca 1380
gtaaggcagg aattggaggc tgcttcttgc cttagcaact cggtgacctt aggcagaaca 1440
gttcagcctt ctgagtgtcc ttcctcttct gtaaggggag cgtaaaccgt cctccatgca 1500
gaacgtgtac tgtgcctggc acagcactgg ggcattagga tctccaaatt aaaggctcac 1560
tctgcgggat ggaggcagcc acagctggaa gaaggaacat ttggggccag aagtccccct 1620
acctccgtcc taagagagaa gatgggaata acgaccctcg ctgaaatgat tgctctctgg 1680
ccagctcgcc tcgcatccac atccaaatct gggaggcaca gagcgcatca ggacatcggg 1740
ttctgtcagt gtaatgggcg tggctcctga ccttctgtct gtatcagaga agataaggga 1800
gaacatttga aagaaaggag aaagaagata gccactggag aacagagcaa aggagccagc 1860
agaaaaagac gagacggctg tagccccaca ggaagcagaa accgataggc taagtaggat 1920
acacacaaag aaaagtagat cccgagaggc atttccccga gggctttcat gtggtttctc 1980
gtgaggagaa gctgactgca gggtgtttga aagaacgact tatgcagcca taaaaaatga 2040
tgagttcatg tcctttgtag ggacatggat gattaattaa gacctcgaag gggacttggg 2100
gggttcgggg ctttcggggg cggtcggggg ttcgcggacc cgggaagctc tgaggaccca 2160
gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt aactttccca gtctccgagg 2220
gaagaggcgg ggtgtggggt gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc 2280
cccgcagcgc ccgcgcgctc ctctccccga ctcggagccc ctcggcggcg cccggcccag 2340
gacccgccta ggagcgcagg agccccagcg cagagacccc aacgccgaga cccccgcccc 2400
ggccccgccg cgcttcctcc cgacgcagag caaaccgccc agagtagaag ccatggtgag 2460
caagggcgag gagctgttca ccggggtggt gcccatcctg gtcgagctgg acggcgacgt 2520
aaacggccac aagttcagcg tgtccggcga gggcgagggc gatgccacct acggcaagct 2580
gaccctgaag ttcatctgca ccaccggcaa gctgccccgtg ccctggccca ccctcgtgac 2640
caccctgacc tacggcgtgc agtgcttcag ccgctacccc gaccacatga agcagcacga 2700
cttcttcaag tccgccatgc ccgaaggcta cgtccaggag cgcaccatct tcttcaagga 2760
cgacggcaac tacaagaccc gcgccgaggt gaagttcgag ggcgacaccc tggtgaaccg 2820
catcgagctg aagggcatcg acttcaagga ggacggcaac atcctggggc acaagctgga 2880
gtacaactac aacagccaca acgtctatat catggccgac aagcagaaga acggcatcaa 2940
ggtgaacttc aagatccgcc acaacatcga ggacggcagc gtgcagctcg ccgaccacta 3000
ccagcagaac acccccatcg gcgacggccc cgtgctgctg cccgacaacc actacctgag 3060
cacccagtcc gccctgagca aagaccccaa cgagaagcgc gatcacatgg tcctgctgga 3120
gttcgtgacc gccgccggga tcactctcgg catggacgag ctgtacaagt aataaaggcg 3180
cgccacccct gcagggaatt ccgcattgcc cagttgttag attaagaaat agacagcatg 3240
agagggatga ggcaacccgt gctcagctgt caaggctcag tcgctagcat ttcccaacac 3300
aaagattctg accttaaatg caaccatttg aaacccctgt aggcctcagg tgaaactcca 3360
gatgccacaa tggagctctg ctcccctaaa gcctcaaaac aaaggcctaa ttctatgcct 3420
gtcttaattt tctttcactt aagttagttc cactgagacc ccaggctgtt aggggttatt 3480
ggtgtaaggt actttcatat tttaaacaga ggatatcggc atttgtttct ttctctgagg 3540
acaagagaaa aaagccaggt tccacagagg acacagagaa ggtttgggtg tcctcctggg 3600
gttctttttg ccaactttcc ccacgttaaa ggtgaacatt ggttctttca tttgctttgg 3660
aagttttaat ctctaacagt ggacaaagtt accagtgcct taaactctgt taacacttttt 3720
ggaagtgaaa actttgtagt atgataggtt attttgatgt aaagatgttc tggataccat 3780
tatatgttcc ccctgtttca gaggctcaga ttgtaatatg taaatggtat gtcattcgct 3840
actatgattt aatttgaaat atggtctttt ggttatgaat actttgcagc acagctgaga 3900
ggctgtctgt tgtattcatt gtggtcatag cacctaacaa cattgtagcc tcaatcgagt 3960
gagacagact agaagttcct agtgatggct tatgatagca aatggcctca tgtcaaatat 4020
ttagatgtaa ttttgtgtaa gaaatacaga ctggatgtac caccaactac tacctgtaat 4080
gacaggcctg tccaacacat ctcccttttc catgactgtg gtagccagca tcggaaagaa 4140
cgctgattta aagaggtcgc ttgggaattt tattgacaca gtaccattta atggggagga 4200
caaaatgggg caggggaggg agaagtttct gtcgttaaaa acagatttgg aaagactgga 4260
ctctaaagtc tgttgattaa agatgagctt tgtctacttc aaaagtttgt ttgcttaccc 4320
cttcagcctc caatttttta agtgaaaata tagctaataa catgtgaaaa gaatagaagc 4380
taaggtttag ataaatattg agcagatcta taggaagatt gaacctgaat attgccatta 4440
tgcttgacat ggtttccaaa aaatggtact ccacatattt cagtgagggt aagtattttc 4500
ctgttgtcaa gaatagcatt gtaaaagcat tttgtaataa taaagaatag ctttaatgat 4560
atgcttgtaa ctaaaataat tttgtaatgt atcaaataca tttaaaacat taaaatataa 4620
tctctataat aatttaaaat ctaatatggt tttaatagaa cagcgatatc aagcttatcg 4680
ataatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt aactatgttg 4740
ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct attgcttccc 4800
gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt tatgaggagt 4860
tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac gcaaccccca 4920
ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct ttccccctcc 4980
ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca ggggctcggc 5040
tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt ccttggctgc 5100
tcgcctatgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc ccttcggccc 5160
tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct cttccgcgtc 5220
ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg cgaattcatc 5280
gataccgagc gctgctcgag agatctgtga tagcggccat caagctggct gtgccttcta 5340
gttgccagcc atctgttgtt tgcccctccc ccgtgccttc cttgaccctg gaaggtgcca 5400
ctcccactgt cctttcctaa taaaatgagg aaattgcatc gcattgtctg agtaggtgtc 5460
attctattct ggggggtggg gtggggcagg acagcaaggg ggaggattgg gaagacaata 5520
gcaggcatgc tggggacacg tgcggaccga gcggccgcag gaacccctag tgatggagtt 5580
ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcgaccaa aggtcgcccg 5640
acgcccgggc tttgcccggg cggcctcagt gagcgagcga gcgcgcagct gcctgcaggg 5700
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatacgtca 5760
aagcaaccat agtacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg 5820
cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct 5880
tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg gctcccttta 5940
gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgattt gggtgatggt 6000
tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg 6060
ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat ctcgggctat 6120
tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt 6180
taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt ttacaatttt atggtgcact 6240
ctcagtacaa tctgctctga tgccgcatag ttaagccagc cccgacaccc gccaacaccc 6300
gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg cttacagaca agctgtgacc 6360
gtctccggga gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg cgcgagacga 6420
aagggcctcg tgatacgcct atttttatag gttaatgtca tgataataat ggtttcttag 6480
acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa 6540
atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat 6600
tgaaaaagga agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg 6660
gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa 6720
gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt 6780
gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt 6840
ggcgcggtat tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat 6900
tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg 6960
acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta 7020
cttctgacaa cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat 7080
catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag 7140
cgtgacacca 7150
<210> 65
<211> 7108
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 65
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taagagagca cttgggaaga 1320
gccccccgagg gcagccgggg cttgccgcct cacccttttg gtttcacatc ccagaaatca 1380
gtaaggcagg aattggaggc tgcttcttgc cttagcaact cggtgacctt aggcagaaca 1440
gttcagcctt ctgagtgtcc ttcctcttct gtaaggggag cgtaaaccgt cctccatgca 1500
gaacgtgtac tgtgcctggc acagcactgg ggcattagga tctccaaatt aaaggctcac 1560
tctgcgggat ggaggcagcc acagctggaa gaaggaacat ttggggccag aagtccccct 1620
acctccgtcc taagagagaa gatgggaata acgaccctcg ctgaaatgat tgctctctgg 1680
ccagctcgcc tcgcatccac atccaaatct gggaggcaca gagcgcatca ggacatcggg 1740
ttctgtcagt gtaatgggcg tggctcctga ccttctgtct gtatcagaga agataaggga 1800
gaacatttga aagaaaggag aaagaagata gccactggag aacagagcaa aggagccagc 1860
agaaaaagac gagacggctg tagccccaca ggaagcagaa accgataggc taagtaggat 1920
acacacaaag aaaagtagat cccgagaggc atttccccga gggctttcat gtggtttctc 1980
gtgaggagaa gctgactgca gggtgtttga aagaacgact tatgcagcca taaaaaatga 2040
tgagttcatg tcctttgtag ggacatggat gattaattaa gacctcgaag gggacttggg 2100
gggttcgggg ctttcggggg cggtcggggg ttcgcggacc cgggaagctc tgaggaccca 2160
gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt aactttccca gtctccgagg 2220
gaagaggcgg ggtgtggggt gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc 2280
cccgcagcgc ccgcgcgctc ctctccccga ctcggagccc ctcggcggcg cccggcccag 2340
gacccgccta ggagcgcagg agccccagcg cagagacccc aacgccgaga cccccgcccc 2400
ggccccgccg cgcttcctcc cgacgcagag caaaccgccc agagtagaag ccatggattg 2460
gggcacgctg cagacgatcc tggggggtgt gaacaaacac tccaccagca ttggaaagat 2520
ctggctcacc gtcctcttca tttttcgcat tatgatcctc gttgtggctg caaaggaggt 2580
gtggggagat gagcaggccg actttgtctg caacaccctg cagccaggct gcaagaacgt 2640
gtgctacgat cactacttcc ccatctccca catccggcta tgggccctgc agctgatctt 2700
cgtgtccacg ccagcgctcc tagtggccat gcacgtggcc taccggagac atgagaagaa 2760
gaggaagttc atcaaggggg agataaagag tgaatttaag gacatcgagg agatcaaaac 2820
ccagaaggtc cgcatcgaag gctccctgg gtggacctac acaagcagca tcttcttccg 2880
ggtcatcttc gaagccgcct tcatgtacgt cttctatgtc atgtacgacg gcttctccat 2940
gcagcggctg gtgaagtgca acgcctggcc ttgtcccaac actgtggact gctttgtgtc 3000
ccggcccacg gagaagactg tcttcacagt gttcatgatt gcagtgtctg gaatttgcat 3060
cctgctgaat gtcactgaat tgtgttattt gctaattaga tattgttctg ggaagtcaaa 3120
aaagccagtt taaaggcgcg ccacccctgc agggaattcc gcattgccca gttgttagat 3180
taagaaatag acagcatgag agggatgagg caacccgtgc tcagctgtca aggctcagtc 3240
gctagcattt cccaacacaa agattctgac cttaaatgca accatttgaa acccctgtag 3300
gcctcaggtg aaactccaga tgccacaatg gagctctgct cccctaaagc ctcaaaacaa 3360
aggcctaatt ctatgcctgt cttaattttc tttcacttaa gttagttcca ctgagacccc 3420
aggctgttag gggtattgg tgtaaggtac tttcatattt taaacagagg atatcggcat 3480
ttgtttcttt ctctgaggac aagagaaaaa agccaggttc cacagaggac acagagaagg 3540
tttgggtgtc ctcctggggt tctttttgcc aactttcccc acgttaaagg tgaacattgg 3600
ttctttcatt tgctttggaa gttttaatct ctaacagtgg acaaagttac cagtgcctta 3660
aactctgtta cactttttgg aagtgaaaac tttgtagtat gataggttat tttgatgtaa 3720
agatgttctg gataccatta tatgttcccc ctgtttcaga ggctcagatt gtaatatgta 3780
aatggtatgt cattcgctac tatgatttaa tttgaaatat ggtcttttgg ttatgaatac 3840
tttgcagcac agctgagagg ctgtctgttg tattcattgt ggtcatagca cctaacaaca 3900
ttgtagcctc aatcgagtga gacagactag aagttcctag tgatggctta tgatagcaaa 3960
tggcctcatg tcaaatattt agatgtaatt ttgtgtaaga aatacagact ggatgtacca 4020
ccaactacta cctgtaatga caggcctgtc caacacatct cccttttcca tgactgtggt 4080
agccagcatc ggaaagaacg ctgatttaaa gaggtcgctt gggaatttta ttgacacagt 4140
accatttaat ggggaggaca aaatggggca ggggagggag aagtttctgt cgttaaaaac 4200
agatttggaa agactggact ctaaagtctg ttgattaaag atgagctttg tctacttcaa 4260
aagtttgttt gcttacccct tcagcctcca attttttaag tgaaaatata gctaataaca 4320
tgtgaaaaga atagaagcta aggtttagat aaatattgag cagatctata ggaagattga 4380
acctgaatat tgccattatg cttgacatgg tttccaaaaa atggtactcc acatatttca 4440
gtgagggtaa gtattttcct gttgtcaaga atagcattgt aaaagcattt tgtaataata 4500
aagaatagct ttaatgatat gcttgtaact aaaataattt tgtaatgtat caaatacatt 4560
taaaacatta aaatataatc tctataataa tttaaaatct aatatggttt taatagaaca 4620
gcgatatcaa gcttatcgat aatcaacctc tggattacaa aatttgtgaa agattgactg 4680
gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta atgcctttgt 4740
atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa tcctggttgc 4800
tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt 4860
ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc ctttccggga 4920
ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc cttgcccgct 4980
gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg gggaaatcat 5040
cgtcctttcc ttggctgctc gcctatgttg ccacctggat tctgcgcggg acgtccttct 5100
gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg ctgccggctc 5160
tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc ctttgggccg 5220
cctccccgcg aattcatcga taccgagcgc tgctcgagag atctgtgata gcggccatca 5280
agctggctgt gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct 5340
tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc 5400
attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg 5460
aggattggga agacaatagc aggcatgctg gggacacgtg cggaccgagc ggccgcagga 5520
acccctagtg atggagttgg ccactccctc tctgcgcgct cgctcgctca ctgaggccgg 5580
gcgaccaaag gtcgcccgac gcccgggctt tgcccgggcg gcctcagtga gcgagcgagc 5640
gcgcagctgc ctgcaggggc gcctgatgcg gtattttctc cttacgcatc tgtgcggtat 5700
ttcacaccgc atacgtcaaa gcaaccatag tacgcgccct gtagcggcgc attaagcgcg 5760
gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct 5820
cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta 5880
aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa 5940
cttgatttgg gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct 6000
ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc 6060
aaccctatct cgggctattc ttttgattta taagggattt tgccgatttc ggcctattgg 6120
ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgttt 6180
acaattttat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagccc 6240
cgacacccgc caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct 6300
tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca 6360
ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat ttttataggt taatgtcatg 6420
ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg cggaacccct 6480
atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga 6540
taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc 6600
cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg 6660
aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc 6720
aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact 6780
tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc 6840
ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag 6900
catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat 6960
aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt 7020
ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa 7080
gccataccaa acgacgagcg tgacacca 7108
<210> 66
<211> 7135
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 66
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taagagagca cttgggaaga 1320
gccccccgagg gcagccgggg cttgccgcct cacccttttg gtttcacatc ccagaaatca 1380
gtaaggcagg aattggaggc tgcttcttgc cttagcaact cggtgacctt aggcagaaca 1440
gttcagcctt ctgagtgtcc ttcctcttct gtaaggggag cgtaaaccgt cctccatgca 1500
gaacgtgtac tgtgcctggc acagcactgg ggcattagga tctccaaatt aaaggctcac 1560
tctgcgggat ggaggcagcc acagctggaa gaaggaacat ttggggccag aagtccccct 1620
acctccgtcc taagagagaa gatgggaata acgaccctcg ctgaaatgat tgctctctgg 1680
ccagctcgcc tcgcatccac atccaaatct gggaggcaca gagcgcatca ggacatcggg 1740
ttctgtcagt gtaatgggcg tggctcctga ccttctgtct gtatcagaga agataaggga 1800
gaacatttga aagaaaggag aaagaagata gccactggag aacagagcaa aggagccagc 1860
agaaaaagac gagacggctg tagccccaca ggaagcagaa accgataggc taagtaggat 1920
acacacaaag aaaagtagat cccgagaggc atttccccga gggctttcat gtggtttctc 1980
gtgaggagaa gctgactgca gggtgtttga aagaacgact tatgcagcca taaaaaatga 2040
tgagttcatg tcctttgtag ggacatggat gattaattaa gacctcgaag gggacttggg 2100
gggttcgggg ctttcggggg cggtcggggg ttcgcggacc cgggaagctc tgaggaccca 2160
gaggccgggc gcgctccgcc cgcggcgccg ccccctccgt aactttccca gtctccgagg 2220
gaagaggcgg ggtgtggggt gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc 2280
cccgcagcgc ccgcgcgctc ctctccccga ctcggagccc ctcggcggcg cccggcccag 2340
gacccgccta ggagcgcagg agccccagcg cagagacccc aacgccgaga cccccgcccc 2400
ggccccgccg cgcttcctcc cgacgcagag caaaccgccc agagtagaag ccatggattg 2460
gggcacactc cagagcatcc tcgggggtgt caacaaacac tccaccagca ttggaaagat 2520
ctggctcacg gtcctcttca tcttccgcat catgatcctc gtggtggctg caaaggaggt 2580
gtggggagat gagcaagccg attttgtctg caacacgctc cagcctggct gcaagaatgt 2640
atgctacgac caccacttcc ccatctctca catccggctc tgggctctgc agctgatcat 2700
ggtgtccacg ccagccctcc tggtagctat gcatgtggcc taccggagac atgaaaagaa 2760
acggaagttc atgaagggag agataaagaa cgagtttaag gacatcgaag agatcaaaac 2820
ccagaaggtc cgtatcgaag ggtccctgg gtggacctac accaccagca tcttcttccg 2880
ggtcatcttt gaagccgtct tcatgtacgt cttttacatc atgtacaatg gcttcttcat 2940
gcaacgtctg gtgaaatgca acgcttggcc ctgccccaat acagtggact gcttcatttc 3000
caggcccaca gaaaagactg tcttcaccgt gtttatgatt tctgtgtctg gaatttgcat 3060
tctgctaaat atcacagagc tgtgctattt gttcgttagg tattgctcag gaaagtccaa 3120
aagaccagtc tacccatacg atgttccaga ttacgcttaa aggcgcgcca cccctgcagg 3180
gaattccgca ttgcccagtt gttagattaa gaaatagaca gcatgagagg gatgaggcaa 3240
cccgtgctca gctgtcaagg ctcagtcgct agcatttccc aacacaaaga ttctgacctt 3300
aaatgcaacc atttgaaacc cctgtaggcc tcaggtgaaa ctccagatgc cacaatggag 3360
ctctgctccc ctaaagcctc aaaacaaagg cctaattcta tgcctgtctt aattttcttt 3420
cacttaagtt agttccactg agaccccagg ctgttagggg ttattggtgt aaggtacttt 3480
catattttaa acagaggata tcggcatttg tttctttctc tgaggacaag agaaaaaagc 3540
caggttccac agaggacaca gagaaggttt gggtgtcctc ctggggttct ttttgccaac 3600
tttccccacg ttaaaggtga acattggttc tttcatttgc tttggaagtt ttaatctcta 3660
acagtggaca aagttaccag tgccttaaac tctgttacac tttttggaag tgaaaacttt 3720
gtagtatgat aggttatttt gatgtaaaga tgttctggat accattatat gttccccctg 3780
tttcagaggc tcagattgta atatgtaaat ggtatgtcat tcgctactat gatttaattt 3840
gaaatatggt cttttggtta tgaatacttt gcagcacagc tgagaggctg tctgttgtat 3900
tcattgtggt catagcacct aacaacattg tagcctcaat cgagtgagac agactagaag 3960
ttcctagtga tggcttatga tagcaaatgg cctcatgtca aatatttaga tgtaattttg 4020
tgtaagaaat acagactgga tgtaccacca actactacct gtaatgacag gcctgtccaa 4080
cacatctccc ttttccatga ctgtggtagc cagcatcgga aagaacgctg atttaaagag 4140
gtcgcttggg aatttattg acacagtacc atttaatggg gaggacaaaa tggggcaggg 4200
gagggagaag tttctgtcgt taaaaacaga tttggaaaga ctggactcta aagtctgttg 4260
attaaagatg agctttgtct acttcaaaag tttgtttgct taccccttca gcctccaatt 4320
ttttaagtga aaatatagct aataacatgt gaaaagaata gaagctaagg tttagataaa 4380
tattgagcag atctatagga agattgaacc tgaatattgc cattatgctt gacatggttt 4440
ccaaaaaatg gtactccaca tatttcagtg agggtaagta ttttcctgtt gtcaagaata 4500
gcattgtaaa agcattttgt aataataaag aatagcttta atgatatgct tgtaactaaa 4560
ataattttgt aatgtatcaa atacatttaa aacattaaaa tataatctct ataataattt 4620
aaaatctaat atggttttaa tagaacagcg atatcaagct tatcgataat caacctctgg 4680
attacaaaat ttgtgaaaga ttgactggta ttcttaacta tgttgctcct tttacgctat 4740
gtggatacgc tgctttaatg cctttgtatc atgctattgc ttcccgtatg gctttcattt 4800
tctcctcctt gtataaatcc tggttgctgt ctctttatga ggagttgtgg cccgttgtca 4860
ggcaacgtgg cgtggtgtgc actgtgtttg ctgacgcaac ccccactggt tggggcattg 4920
ccaccacctg tcagctcctt tccgggactt tcgctttccc cctccctatt gccacggcgg 4980
aactcatcgc cgcctgcctt gcccgctgct ggacaggggc tcggctgttg ggcactgaca 5040
attccgtggt gttgtcgggg aaatcatcgt cctttccttg gctgctcgcc tatgttgcca 5100
cctggattct gcgcgggacg tccttctgct acgtcccttc ggccctcaat ccagcggacc 5160
ttccttcccg cggcctgctg ccggctctgc ggcctcttcc gcgtcttcgc cttcgccctc 5220
agacgagtcg gatctccctt tgggccgcct ccccgcgaat tcatcgatac cgagcgctgc 5280
tcgagagatc tgtgatagcg gccatcaagc tggctgtgcc ttctagttgc cagccatctg 5340
ttgtttgccc ctccccccgtg ccttccttga ccctggaagg tgccactccc actgtccttt 5400
cctaataaaa tgaggaaatt gcatcgcatt gtctgagtag gtgtcattct attctggggg 5460
gtggggtggg gcaggacagc aagggggagg attgggaaga caatagcagg catgctgggg 5520
acacgtgcgg accgagcggc cgcaggaacc cctagtgatg gagttggcca ctccctctct 5580
gcgcgctcgc tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc 5640
ccgggcggcc tcagtgagcg agcgagcgcg cagctgcctg caggggcgcc tgatgcggta 5700
ttttctcctt acgcatctgt gcggtatttc acaccgcata cgtcaaagca accatagtac 5760
gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct 5820
acacttgcca gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg 5880
ttcgccggct ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt 5940
gctttacggc acctcgaccc caaaaaactt gatttgggtg atggttcacg tagtggccca 6000
tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga 6060
ctcttgttcc aaactggaac aacactcaac cctatctcgg gctattcttt tgatttataa 6120
gggattttgc cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac 6180
gcgaatttta acaaaatatt aacgtttaca attttatggt gcactctcag tacaatctgc 6240
tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga cgcgccctga 6300
cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc cgggagctgc 6360
atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg cctcgtgata 6420
cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact 6480
tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg 6540
tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt 6600
atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct 6660
gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca 6720
cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc 6780
gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc 6840
cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg 6900
gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta 6960
tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc 7020
ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt 7080
gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga cacca 7135
<210> 67
<211> 7124
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 67
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tcagtgatgc ctgaaacctc 1320
agatggtact gaaccctcta tataatctgt tttttcctat acatacaaac ctaccataag 1380
gcttaatggt aagagattaa caataaagaa taataaaaca acacttataa caatgtataa 1440
caatatattg taatataagt ttttggatgc agtctctctc tcaaaatgct atcatatttt 1500
ccaactgtgg ttgactacag gtaactggaa ccacaaaaat gaaacagtgg ataagagggc 1560
gactcctgta ccaaagaaaa aaatagagtg ttgcagctgt aacatagttg aatgactgag 1620
ttagactgca taactgacac acaaaaccac ataaatataa atgaaggaat ctctgggtgt 1680
aatctggtgc aaaggtgact gtgttaatca ttaatccaca agttgctatc ctgaagtgtg 1740
ccaaatgctt tatgtttatt tcatcacata gctctataaa gaaaggattt gtaattcctt 1800
tctacagaag tggaaagtaa gtcttaagac tcaaaaaact ttaaaaacta caatgaagta 1860
acaactttta ttaatttatt ttgtgtcttt ccagaatttc tatatatata ggaatgtgat 1920
atgaatctat atgtgaattg aatctacatg aatattgatg acttttatt ccccttttgc 1980
acataagata gaatatttta cctactattc cacactttgc ttttcttaac atatcatggg 2040
atctttttat ataagtgaac aaagagtttc ttcattcttt cacacagttt aattaagacc 2100
tcgaagggga cttggggggt tcggggcttt cgggggcggt cgggggttcg cggacccggg 2160
aagctctgag gacccagagg ccgggcgcgc tccgcccgcg gcgccgcccc ctccgtaact 2220
ttcccagtct ccgagggaag aggcggggtg tggggtgcgg ttaaaaggcg ccacggcggg 2280
agacaggtgt tgcggccccg cagcgcccgc gcgctcctct ccccgactcg gagcccctcg 2340
gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc ccagcgcaga gaccccaacg 2400
ccgagacccc cgccccggcc ccgccgcgct tcctcccgac gcagagcaaa ccgcccagag 2460
tagaagccat ggattggggc acgctgcaga cgatcctggg gggtgtgaac aaacactcca 2520
ccagcattgg aaagatctgg ctcaccgtcc tcttcatttt tcgcattatg atcctcgttg 2580
tggctgcaaa ggaggtgtgg ggagatgagc aggccgactt tgtctgcaac accctgcagc 2640
caggctgcaa gaacgtgtgc tacgatcact acttccccat ctcccacatc cggctatggg 2700
ccctgcagct gatcttcgtg tccacgccag cgctcctagt ggccatgcac gtggcctacc 2760
ggagacatga gaagaagagg aagttcatca aggggggagat aaagagtgaa tttaaggaca 2820
tcgaggagat caaaacccag aaggtccgca tcgaaggctc cctgtggtgg acctacacaa 2880
gcagcatctt cttccgggtc atcttcgaag ccgccttcat gtacgtcttc tatgtcatgt 2940
acgacggctt ctccatgcag cggctggtga agtgcaacgc ctggccttgt cccaacactg 3000
tggactgctt tgtgtcccgg cccacggaga agactgtctt cacagtgttc atgattgcag 3060
tgtctggaat ttgcatcctg ctgaatgtca ctgaattgtg ttatttgcta attagatatt 3120
gttctgggaa gtcaaaaaag ccagtttaaa ggcgcgccac ccctgcaggg aattccgcat 3180
tgcccagttg ttagattaag aaatagacag catgagaggg atgaggcaac ccgtgctcag 3240
ctgtcaaggc tcagtcgcta gcatttccca acacaaagat tctgacctta aatgcaacca 3300
tttgaaaccc ctgtaggcct caggtgaaac tccagatgcc acaatggagc tctgctcccc 3360
taaagcctca aaacaaaggc ctaattctat gcctgtctta attttctttc acttaagtta 3420
gttccactga gaccccaggc tgttaggggt tattggtgta aggtactttc atattttaaa 3480
cagaggatat cggcatttgt ttctttctct gaggacaaga gaaaaaagcc aggttccaca 3540
gaggacacag agaaggtttg ggtgtcctcc tggggttctt tttgccaact ttccccacgt 3600
taaaggtgaa cattggttct ttcatttgct ttggaagttt taatctctaa cagtggacaa 3660
agttaccagt gccttaaact ctgttacact ttttggaagt gaaaactttg tagtatgata 3720
ggttatttg atgtaaagat gttctggata ccattatatg ttccccctgt ttcagaggct 3780
cagattgtaa tatgtaaatg gtatgtcatt cgctactatg atttaatttg aaatatggtc 3840
ttttggttat gaatactttg cagcacagct gagaggctgt ctgttgtatt cattgtggtc 3900
atagcaccta acaacattgt agcctcaatc gagtgagaca gactagaagt tcctagtgat 3960
ggcttatgat agcaaatggc ctcatgtcaa atatttagat gtaattttgt gtaagaaata 4020
cagactggat gtaccaccaa ctactacctg taatgacagg cctgtccaac acatctccct 4080
tttccatgac tgtggtagcc agcatcggaa agaacgctga tttaaagagg tcgcttggga 4140
attttatga cacagtacca tttaatgggg aggacaaaat ggggcagggg agggagaagt 4200
ttctgtcgtt aaaaacagat ttggaaagac tggactctaa agtctgttga ttaaagatga 4260
gctttgtcta cttcaaaagt ttgtttgctt accccttcag cctccaattt tttaagtgaa 4320
aatatagcta ataacatgtg aaaagaatag aagctaaggt ttagataaat attgagcaga 4380
tctataggaa gattgaacct gaatattgcc attatgcttg acatggtttc caaaaaatgg 4440
tactccacat atttcagtga gggtaagtat tttcctgttg tcaagaatag cattgtaaaa 4500
gcattttgta ataataaaga atagctttaa tgatatgctt gtaactaaaa taattttgta 4560
atgtatcaaa tacatttaaa acattaaaat ataatctcta taataattta aaatctaata 4620
tggttttaat agaacagcga tatcaagctt atcgataatc aacctctgga ttacaaaatt 4680
tgtgaaagat tgactggtat tcttaactat gttgctcctt ttacgctatg tggatacgct 4740
gctttaatgc ctttgtatca tgctattgct tcccgtatgg ctttcatttt ctcctccttg 4800
tataaatcct ggttgctgtc tctttatgag gagttgtggc ccgttgtcag gcaacgtggc 4860
gtggtgtgca ctgtgtttgc tgacgcaacc cccactggtt ggggcattgc caccacctgt 4920
cagctccttt ccgggacttt cgctttcccc ctccctattg ccacggcgga actcatcgcc 4980
gcctgccttg cccgctgctg gacaggggct cggctgttgg gcactgacaa ttccgtggtg 5040
ttgtcgggga aatcatcgtc ctttccttgg ctgctcgcct atgttgccac ctggattctg 5100
cgcgggacgt ccttctgcta cgtcccttcg gccctcaatc cagcggacct tccttcccgc 5160
ggcctgctgc cggctctgcg gcctcttccg cgtcttcgcc ttcgccctca gacgagtcgg 5220
atctcccttt gggccgcctc cccgcgaatt catcgatacc gagcgctgct cgagagatct 5280
gtgatagcgg ccatcaagct ggctgtgcct tctagttgcc agccatctgt tgtttgcccc 5340
tcccccgtgc cttccttgac cctggaaggt gccactccca ctgtcctttc ctaataaaat 5400
gaggaaattg catcgcattg tctgagtagg tgtcattcta ttctgggggg tggggtgggg 5460
caggacagca agggggagga ttgggaagac aatagcaggc atgctgggga cacgtgcgga 5520
ccgagcggcc gcaggaaccc ctagtgatgg agttggccac tccctctctg cgcgctcgct 5580
cgctcactga ggccgggcga ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct 5640
cagtgagcga gcgagcgcgc agctgcctgc aggggcgcct gatgcggtat tttctcctta 5700
cgcatctgtg cggtatttca caccgcatac gtcaaagcaa ccatagtacg cgccctgtag 5760
cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag 5820
cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt tcgccggctt 5880
tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg ctttacggca 5940
cctcgacccc aaaaaacttg atttgggtga tggttcacgt agtgggccat cgccctgata 6000
gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac tcttgttcca 6060
aactggaaca acactcaacc ctatctcggg ctattctttt gatttataag ggattttgcc 6120
gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg cgaattttaa 6180
caaaatatta acgtttacaa ttttatggtg cactctcagt acaatctgct ctgatgccgc 6240
atagttaagc cagccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 6300
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 6360
gttttcaccg tcatcaccga aacgcgcgag acgaaagggc ctcgtgatac gcctattttt 6420
ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt ttcggggaaa 6480
tgtgcgcgga acccctattt gtttatttt ctaaatacat tcaaatatgt atccgctcat 6540
gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca 6600
acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca 6660
cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta 6720
catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt 6780
tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc gtattgacgc 6840
cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc 6900
accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc 6960
cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa 7020
ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga 7080
accggagctg aatgaagcca taccaaacga cgagcgtgac acca 7124
<210> 68
<211> 7151
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 68
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tcagtgatgc ctgaaacctc 1320
agatggtact gaaccctcta tataatctgt tttttcctat acatacaaac ctaccataag 1380
gcttaatggt aagagattaa caataaagaa taataaaaca acacttataa caatgtataa 1440
caatatattg taatataagt ttttggatgc agtctctctc tcaaaatgct atcatatttt 1500
ccaactgtgg ttgactacag gtaactggaa ccacaaaaat gaaacagtgg ataagagggc 1560
gactcctgta ccaaagaaaa aaatagagtg ttgcagctgt aacatagttg aatgactgag 1620
ttagactgca taactgacac acaaaaccac ataaatataa atgaaggaat ctctgggtgt 1680
aatctggtgc aaaggtgact gtgttaatca ttaatccaca agttgctatc ctgaagtgtg 1740
ccaaatgctt tatgtttatt tcatcacata gctctataaa gaaaggattt gtaattcctt 1800
tctacagaag tggaaagtaa gtcttaagac tcaaaaaact ttaaaaacta caatgaagta 1860
acaactttta ttaatttatt ttgtgtcttt ccagaatttc tatatatata ggaatgtgat 1920
atgaatctat atgtgaattg aatctacatg aatattgatg acttttatt ccccttttgc 1980
acataagata gaatatttta cctactattc cacactttgc ttttcttaac atatcatggg 2040
atctttttat ataagtgaac aaagagtttc ttcattcttt cacacagttt aattaagacc 2100
tcgaagggga cttggggggt tcggggcttt cgggggcggt cgggggttcg cggacccggg 2160
aagctctgag gacccagagg ccgggcgcgc tccgcccgcg gcgccgcccc ctccgtaact 2220
ttcccagtct ccgagggaag aggcggggtg tggggtgcgg ttaaaaggcg ccacggcggg 2280
agacaggtgt tgcggccccg cagcgcccgc gcgctcctct ccccgactcg gagcccctcg 2340
gcggcgcccg gcccaggacc cgcctaggag cgcaggagcc ccagcgcaga gaccccaacg 2400
ccgagacccc cgccccggcc ccgccgcgct tcctcccgac gcagagcaaa ccgcccagag 2460
tagaagccat ggattggggc acactccaga gcatcctcgg gggtgtcaac aaacactcca 2520
ccagcattgg aaagatctgg ctcacggtcc tcttcatctt ccgcatcatg atcctcgtgg 2580
tggctgcaaa ggaggtgtgg ggagatgagc aagccgattt tgtctgcaac acgctccagc 2640
ctggctgcaa gaatgtatgc tacgaccacc acttccccat ctctcacatc cggctctggg 2700
ctctgcagct gatcatggtg tccacgccag ccctcctggt agctatgcat gtggcctacc 2760
2820
tcgaagagat caaaacccag aaggtccgta tcgaagggtc cctgtggtgg acctacacca 2880
ccagcatctt cttccgggtc atctttgaag ccgtcttcat gtacgtcttt tacatcatgt 2940
acaatggctt cttcatgcaa cgtctggtga aatgcaacgc ttggccctgc cccaatacag 3000
tggactgctt catttccagg cccacagaaa agactgtctt caccgtgttt atgatttctg 3060
tgtctggaat ttgcattctg ctaaatatca cagagctgtg ctatttgttc gttaggtatt 3120
gctcaggaaa gtccaaaaga ccagtctacc catacgatgt tccagattac gcttaaaggc 3180
gcgccacccc tgcagggaat tccgcattgc ccagttgtta gattaagaaa tagacagcat 3240
gagagggatg aggcaacccg tgctcagctg tcaaggctca gtcgctagca tttcccaaca 3300
caaagattct gaccttaaat gcaaccattt gaaacccctg taggcctcag gtgaaactcc 3360
agatgccaca atggagctct gctcccctaa agcctcaaaa caaaggccta attctatgcc 3420
tgtcttaatt ttctttcact taagttagtt ccactgagac cccaggctgt taggggttat 3480
tggtgtaagg tactttcata ttttaaacag aggatatcgg catttgtttc tttctctgag 3540
gacaagagaa aaaagccagg ttccacagag gacacagaga aggtttgggt gtcctcctgg 3600
ggttcttttt gccaactttc cccacgttaa aggtgaacat tggttctttc atttgctttg 3660
gaagttttaa tctctaacag tggacaaagt taccagtgcc ttaaactctg ttacactttt 3720
tggaagtgaa aactttgtag tatgataggt tattttgatg taaagatgtt ctggatacca 3780
ttatatgttc cccctgtttc agaggctcag attgtaatat gtaaatggta tgtcattcgc 3840
tactatgatt taatttgaaa tatggtcttt tggttatgaa tactttgcag cacagctgag 3900
aggctgtctg ttgtattcat tgtggtcata gcacctaaca acattgtagc ctcaatcgag 3960
tgagacagac tagaagttcc tagtgatggc ttatgatagc aaatggcctc atgtcaaata 4020
tttagatgta attttgtgta agaaatacag actggatgta ccaccaacta ctacctgtaa 4080
tgacaggcct gtccaacaca tctccctttt ccatgactgt ggtagccagc atcggaaaga 4140
acgctgattt aaagaggtcg cttgggaatt ttattgacac agtaccattt aatggggagg 4200
acaaaatggg gcaggggagg gagaagtttc tgtcgttaaa aacagatttg gaaagactgg 4260
actctaaagt ctgttgatta aagatgagct ttgtctactt caaaagtttg tttgcttacc 4320
ccttcagcct ccaatttttt aagtgaaaat atagctaata acatgtgaaa agaatagaag 4380
ctaaggttta gataaatatt gagcagatct ataggaagat tgaacctgaa tattgccatt 4440
atgcttgaca tggtttccaa aaaatggtac tccacatatt tcagtgaggg taagtatttt 4500
cctgttgtca agaatagcat tgtaaaagca ttttgtaata ataaagaata gctttaatga 4560
tatgcttgta actaaaataa ttttgtaatg tatcaaatac atttaaaaca ttaaaatata 4620
atctctataa taatttaaaa tctaatatgg ttttaataga acagcgatat caagcttatc 4680
gataatcaac ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt 4740
gctcctttta cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc 4800
cgtatggctt tcattttctc ctccttgtat aaatcctggt tgctgtctct ttatgaggag 4860
ttgtggcccg ttgtcaggca acgtggcgtg gtgtgcactg tgtttgctga cgcaaccccc 4920
actggttggg gcattgccac cacctgtcag ctcctttccg ggactttcgc tttccccctc 4980
cctattgcca cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg 5040
ctgttgggca ctgacaattc cgtggtgttg tcggggaaat catcgtcctt tccttggctg 5100
ctcgcctatg ttgccacctg gattctgcgc gggacgtcct tctgctacgt cccttcggcc 5160
ctcaatccag cggaccttcc ttcccgcggc ctgctgccgg ctctgcggcc tcttccgcgt 5220
cttcgccttc gccctcagac gagtcggatc tccctttggg ccgcctcccc gcgaattcat 5280
cgataccgag cgctgctcga gagatctgtg atagcggcca tcaagctggc tgtgccttct 5340
agttgccagc catctgttgt ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc 5400
actcccactg tcctttccta ataaaatgag gaaattgcat cgcattgtct gagtaggtgt 5460
cattctattc tggggggtgg ggtggggcag gacagcaagg gggaggattg ggaagacaat 5520
agcaggcatg ctggggacac gtgcggaccg agcggccgca ggaaccccta gtgatggagt 5580
tggccactcc ctctctgcgc gctcgctcgc tcactgaggc cgggcgacca aaggtcgccc 5640
gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg agcgcgcagc tgcctgcagg 5700
ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatacgtc 5760
aaagcaacca tagtacgcgc cctgtagcgg cgcattaagc gcggcgggtg tggtggttac 5820
gcgcagcgtg accgctacac ttgccagcgc cctagcgccc gctcctttcg ctttcttccc 5880
ttcctttctc gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt 5940
agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt tgggtgatgg 6000
ttcacgtagt gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac 6060
gttctttaat agtggactct tgttccaaac tggaacaaca ctcaacccta tctcgggcta 6120
ttcttttgat ttataaggga ttttgccgat ttcggcctat tggttaaaaa atgagctgat 6180
ttaacaaaaa tttaacgcga attttaacaa aatattaacg tttacaattt tatggtgcac 6240
tctcagtaca atctgctctg atgccgcata gttaagccag ccccgacacc cgccaacacc 6300
cgctgacgcg ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac 6360
cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgagacg 6420
aaagggcctc gtgatacgcc tatttttata ggttaatgtc atgataataa tggtttctta 6480
gacgtcaggt ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta 6540
aatacattca aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata 6600
ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc 6660
ggcattttgc cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga 6720
agatcagttg ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct 6780
tgagagtttt cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg 6840
tggcgcggta ttatcccgta ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta 6900
ttctcagaat gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat 6960
gacagtaaga gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt 7020
acttctgaca acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga 7080
tcatgtaact cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga 7140
gcgtgacacc a 7151
<210> 69
<400> 69
000
<210> 70
<211> 7208
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 70
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tgctatctat catcttgaag 1320
ggcttctgga acaagttaga atagagtcaa cactcatgaa ctgctgtagc aaaaaaaact 1380
atagatgtag gattgacaag ggcaatagag cgatgactcc ctggctgtgt tgtatttgat 1440
ggacggcagt agcttttcac aaaatgctca tttggatgtt tcaaattaaa acgtttcact 1500
ttctagaacc aattacgtgg tcagtttagc tcctgaggtc ccagtcagag gggtattctg 1560
tagcttgcaa agcctctctt tggggactgg acatggagtc tgtggtctta gaattcagaa 1620
ccgggagaat gtgttagcca ctcatctaag ctattcctta aacgctttca gagccatctc 1680
cactgtgggg aaagaagttc tttgtgttct ctgacttagt ctcattctaa aaaaaaaaaa 1740
aaaaaaaaaa aaaaagcaat tgcaataccc agagcgcaca gtagatggca ctgagacttg 1800
tcggaaagct ggacgcactc aagaggtggc agaaaaatct ataggtaagc ttttcttcta 1860
gtctggtgtt gctgctcctg accttattaa tgggctgaga aatagatttc tttcctttcc 1920
ttttcttttt tatatgaaat taaatgaagt ataaaagaat atgagaatgt gttgctatta 1980
gcaaggataa gtaatgcttt aggaaacgtt tggttcatgt gtgtgttttc agactgatgt 2040
gtgtcctgga tccagtgtaa aatgtacttc tgtctgtagg tctctgccac agaaaagttg 2100
gaaagccatt gttgtattcc atttccaggg caacaaaaga taccactgtc acttcatgtg 2160
aaatggtgtt gtttaattaa gacctcgaag gggacttggg gggttcgggg ctttcggggg 2220
cggtcggggg ttcgcggacc cgggaagctc tgaggaccca gaggccgggc gcgctccgcc 2280
cgcggcgccg ccccctccgt aactttccca gtctccgagg gaagaggcgg ggtgtggggt 2340
gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc cccgcagcgc ccgcgcgctc 2400
ctctccccga ctcggagccc ctcggcggcg cccggcccag gacccgccta ggagcgcagg 2460
agccccagcg cagagacccc aacgccgaga cccccgcccc ggccccgccg cgcttcctcc 2520
cgacgcagag caaaccgccc agagtagaag ccatggattg gggcacgctg cagacgatcc 2580
tggggggtgt gaacaaacac tccaccagca ttggaaagat ctggctcacc gtcctcttca 2640
tttttcgcat tatgatcctc gttgtggctg caaaggaggt gtggggagat gagcaggccg 2700
actttgtctg caacaccctg cagccaggct gcaagaacgt gtgctacgat cactacttcc 2760
ccatctccca catccggcta tgggccctgc agctgatctt cgtgtccacg ccagcgctcc 2820
tagtggccat gcacgtggcc taccggagac atgagaagaa gaggaagttc atcaaggggg 2880
agataaagag tgaatttaag gacatcgagg agatcaaaac ccagaaggtc cgcatcgaag 2940
gctccctgg gtggacctac acaagcagca tcttcttccg ggtcatcttc gaagccgcct 3000
tcatgtacgt cttctatgtc atgtacgacg gcttctccat gcagcggctg gtgaagtgca 3060
acgcctggcc ttgtcccaac actgtggact gctttgtgtc ccggcccacg gagaagactg 3120
tcttcacagt gttcatgatt gcagtgtctg gaatttgcat cctgctgaat gtcactgaat 3180
tgtgtattt gctaattaga tattgttctg ggaagtcaaa aaagccagtt taaaggcgcg 3240
ccacccctgc agggaattcc gcattgccca gttgttagat taagaaatag acagcatgag 3300
agggatgagg caacccgtgc tcagctgtca aggctcagtc gctagcattt cccaacacaa 3360
agattctgac cttaaatgca accatttgaa acccctgtag gcctcaggtg aaactccaga 3420
tgccacaatg gagctctgct cccctaaagc ctcaaaacaa aggcctaatt ctatgcctgt 3480
cttaattttc tttcacttaa gttagttcca ctgagacccc aggctgttag gggttattgg 3540
tgtaaggtac tttcatattt taaacagagg atatcggcat ttgtttcttt ctctgaggac 3600
aagagaaaaa agccaggttc cacagaggac acagagaagg tttgggtgtc ctcctggggt 3660
tctttttgcc aactttcccc acgttaaagg tgaacattgg ttctttcatt tgctttggaa 3720
gttttaatct ctaacagtgg acaaagttac cagtgcctta aactctgtta cactttttgg 3780
aagtgaaaac tttgtagtat gataggttat tttgatgtaa agatgttctg gataccatta 3840
tatgttcccc ctgtttcaga ggctcagatt gtaatatgta aatggtatgt cattcgctac 3900
tatgatttaa tttgaaatat ggtcttttgg ttatgaatac tttgcagcac agctgagagg 3960
ctgtctgttg tattcattgt ggtcatagca cctaacaaca ttgtagcctc aatcgagtga 4020
gacagactag aagttcctag tgatggctta tgatagcaaa tggcctcatg tcaaatattt 4080
agatgtaatt ttgtgtaaga aatacagact ggatgtacca ccaactacta cctgtaatga 4140
caggcctgtc caacacatct cccttttcca tgactgtggt agccagcatc ggaaagaacg 4200
ctgatttaaa gaggtcgctt gggaatttta ttgacacagt accatttaat ggggaggaca 4260
aaatggggca ggggagggag aagtttctgt cgttaaaaac agatttggaa agactggact 4320
ctaaagtctg ttgattaaag atgagctttg tctacttcaa aagtttgttt gcttacccct 4380
tcagcctcca attttttaag tgaaaatata gctaataaca tgtgaaaaga atagaagcta 4440
aggtttagat aaatattgag cagatctata ggaagattga acctgaatat tgccattatg 4500
cttgacatgg tttccaaaaa atggtactcc acatatttca gtgagggtaa gtattttcct 4560
gttgtcaaga atagcattgt aaaagcattt tgtaataata aagaatagct ttaatgatat 4620
gcttgtaact aaaataattt tgtaatgtat caaatacatt taaaacatta aaatataatc 4680
tctataataa tttaaaatct aatatggttt taatagaaca gcgatatcaa gcttatcgat 4740
aatcaacctc tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct 4800
ccttttacgc tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt 4860
atggctttca ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg 4920
tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact 4980
ggttggggca ttgccacac ctgtcagctc ctttccggga ctttcgcttt ccccctccct 5040
attgccacgg cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg 5100
ttgggcactg acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc 5160
gcctatgttg ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc 5220
aatccagcgg accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt 5280
cgccttcgcc ctcagacgag tcggatctcc ctttgggccg cctccccgcg aattcatcga 5340
taccgagcgc tgctcgagag atctgtgata gcggccatca agctggctgt gccttctagt 5400
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 5460
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 5520
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 5580
aggcatgctg gggacacgtg cggaccgagc ggccgcagga acccctagtg atggagttgg 5640
ccactccctc tctgcgcgct cgctcgctca ctgaggccgg gcgaccaaag gtcgcccgac 5700
gcccgggctt tgcccgggcg gcctcagtga gcgagcgagc gcgcagctgc ctgcaggggc 5760
gcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc atacgtcaaa 5820
gcaaccatag tacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 5880
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 5940
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 6000
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgatttgg gtgatggttc 6060
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 6120
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cgggctattc 6180
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 6240
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaattttat ggtgcactct 6300
cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc 6360
tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 6420
ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa 6480
gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 6540
gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 6600
acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg 6660
aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc 6720
attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga 6780
tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga 6840
gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg 6900
cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc 6960
tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac 7020
agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact 7080
tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca 7140
tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg 7200
tgacacca 7208
<210> 71
<211> 7235
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 71
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg tgctatctat catcttgaag 1320
ggcttctgga acaagttaga atagagtcaa cactcatgaa ctgctgtagc aaaaaaaact 1380
atagatgtag gattgacaag ggcaatagag cgatgactcc ctggctgtgt tgtatttgat 1440
ggacggcagt agcttttcac aaaatgctca tttggatgtt tcaaattaaa acgtttcact 1500
ttctagaacc aattacgtgg tcagtttagc tcctgaggtc ccagtcagag gggtattctg 1560
tagcttgcaa agcctctctt tggggactgg acatggagtc tgtggtctta gaattcagaa 1620
ccgggagaat gtgttagcca ctcatctaag ctattcctta aacgctttca gagccatctc 1680
cactgtgggg aaagaagttc tttgtgttct ctgacttagt ctcattctaa aaaaaaaaaa 1740
aaaaaaaaaa aaaaagcaat tgcaataccc agagcgcaca gtagatggca ctgagacttg 1800
tcggaaagct ggacgcactc aagaggtggc agaaaaatct ataggtaagc ttttcttcta 1860
gtctggtgtt gctgctcctg accttattaa tgggctgaga aatagatttc tttcctttcc 1920
ttttcttttt tatatgaaat taaatgaagt ataaaagaat atgagaatgt gttgctatta 1980
gcaaggataa gtaatgcttt aggaaacgtt tggttcatgt gtgtgttttc agactgatgt 2040
gtgtcctgga tccagtgtaa aatgtacttc tgtctgtagg tctctgccac agaaaagttg 2100
gaaagccatt gttgtattcc atttccaggg caacaaaaga taccactgtc acttcatgtg 2160
aaatggtgtt gtttaattaa gacctcgaag gggacttggg gggttcgggg ctttcggggg 2220
cggtcggggg ttcgcggacc cgggaagctc tgaggaccca gaggccgggc gcgctccgcc 2280
cgcggcgccg ccccctccgt aactttccca gtctccgagg gaagaggcgg ggtgtggggt 2340
gcggttaaaa ggcgccacgg cgggagacag gtgttgcggc cccgcagcgc ccgcgcgctc 2400
ctctccccga ctcggagccc ctcggcggcg cccggcccag gacccgccta ggagcgcagg 2460
agccccagcg cagagacccc aacgccgaga cccccgcccc ggccccgccg cgcttcctcc 2520
cgacgcagag caaaccgccc agagtagaag ccatggattg gggcacactc cagagcatcc 2580
tcgggggtgt caacaaacac tccaccagca ttggaaagat ctggctcacg gtcctcttca 2640
tcttccgcat catgatcctc gtggtggctg caaaggaggt gtggggagat gagcaagccg 2700
attttgtctg caacacgctc cagcctggct gcaagaatgt atgctacgac caccacttcc 2760
ccatctctca catccggctc tgggctctgc agctgatcat ggtgtccacg ccagccctcc 2820
tggtagctat gcatgtggcc taccggagac atgaaaagaa acggaagttc atgaagggag 2880
agataaagaa cgagtttaag gacatcgaag agatcaaaac ccagaaggtc cgtatcgaag 2940
ggtccctgtg gtggacctac accaccagca tcttcttccg ggtcatcttt gaagccgtct 3000
tcatgtacgt cttttacatc atgtacaatg gcttcttcat gcaacgtctg gtgaaatgca 3060
acgcttggcc ctgccccaat acagtggact gcttcatttc caggcccaca gaaaagactg 3120
tcttcaccgt gtttatgatt tctgtgtctg gaatttgcat tctgctaaat atcacagagc 3180
tgtgctattt gttcgttagg tattgctcag gaaagtccaa aagaccagtc tacccatacg 3240
atgttccaga ttacgcttaa aggcgcgcca cccctgcagg gaattccgca ttgcccagtt 3300
gttagattaa gaaatagaca gcatgagagg gatgaggcaa cccgtgctca gctgtcaagg 3360
ctcagtcgct agcatttccc aacacaaaga ttctgacctt aaatgcaacc atttgaaacc 3420
cctgtaggcc tcaggtgaaa ctccagatgc cacaatggag ctctgctccc ctaaagcctc 3480
aaaacaaagg cctaattcta tgcctgtctt aattttcttt cacttaagtt agttccactg 3540
agaccccagg ctgttagggg ttattggtgt aaggtacttt catattttaa acagaggata 3600
tcggcatttg tttctttctc tgaggacaag agaaaaaagc caggttccac agaggacaca 3660
gagaaggttt gggtgtcctc ctggggttct ttttgccaac tttccccacg ttaaaggtga 3720
acattggttc tttcatttgc tttggaagtt ttaatctcta acagtggaca aagttaccag 3780
tgccttaaac tctgttacac tttttggaag tgaaaacttt gtagtatgat aggttatttt 3840
gatgtaaaga tgttctggat accattatat gttccccctg tttcagaggc tcagattgta 3900
atatgtaaat ggtatgtcat tcgctactat gatttaattt gaaatatggt cttttggtta 3960
tgaatacttt gcagcacagc tgagaggctg tctgttgtat tcattgtggt catagcacct 4020
aacaacattg tagcctcaat cgagtgagac agactagaag ttcctagtga tggcttatga 4080
tagcaaatgg cctcatgtca aatatttaga tgtaattttg tgtaagaaat acagactgga 4140
tgtaccacca actactacct gtaatgacag gcctgtccaa cacatctccc ttttccatga 4200
ctgtggtagc cagcatcgga aagaacgctg atttaaagag gtcgcttggg aattttattg 4260
acacagtacc atttaatggg gaggacaaaa tggggcaggg gagggagaag tttctgtcgt 4320
taaaaacaga tttggaaaga ctggactcta aagtctgttg attaaagatg agctttgtct 4380
acttcaaaag tttgtttgct taccccttca gcctccaatt ttttaagtga aaatatagct 4440
aataacatgt gaaaagaata gaagctaagg tttagataaa tattgagcag atctatagga 4500
agattgaacc tgaatattgc cattatgctt gacatggttt ccaaaaaatg gtactccaca 4560
tatttcagtg agggtaagta ttttcctgtt gtcaagaata gcattgtaaa agcattttgt 4620
aataataaag aatagcttta atgatatgct tgtaactaaa ataattttgt aatgtatcaa 4680
atacatttaa aacattaaaa tataatctct ataataattt aaaatctaat atggttttaa 4740
tagaacagcg atatcaagct tatcgataat caacctctgg attacaaaat ttgtgaaaga 4800
ttgactggta ttcttaacta tgttgctcct tttacgctat gtggatacgc tgctttaatg 4860
cctttgtatc atgctattgc ttcccgtatg gctttcattt tctcctcctt gtataaatcc 4920
tggttgctgt ctctttatga ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc 4980
actgtgtttg ctgacgcaac ccccactggt tggggcattg ccaccacctg tcagctcctt 5040
tccgggactt tcgctttccc cctccctatt gccacggcgg aactcatcgc cgcctgcctt 5100
gcccgctgct ggacaggggc tcggctgttg ggcactgaca attccgtggt gttgtcgggg 5160
aaatcatcgt cctttccttg gctgctcgcc tatgttgcca cctggattct gcgcgggacg 5220
tccttctgct acgtcccttc ggccctcaat ccagcggacc ttccttcccg cggcctgctg 5280
ccggctctgc ggcctcttcc gcgtcttcgc cttcgccctc agacgagtcg gatctccctt 5340
tgggccgcct ccccgcgaat tcatcgatac cgagcgctgc tcgagagatc tgtgatagcg 5400
gccatcaagc tggctgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg 5460
ccttccttga ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt 5520
gcatcgcatt gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc 5580
aagggggagg attgggaaga caatagcagg catgctgggg acacgtgcgg accgagcggc 5640
cgcaggaacc cctagtgatg gagttggcca ctccctctct gcgcgctcgc tcgctcactg 5700
aggccgggcg accaaaggtc gcccgacgcc cgggctttgc ccgggcggcc tcagtgagcg 5760
agcgagcgcg cagctgcctg caggggcgcc tgatgcggta ttttctcctt acgcatctgt 5820
gcggtatttc acaccgcata cgtcaaagca accatagtac gcgccctgta gcggcgcatt 5880
aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc 5940
gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca 6000
agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc acctcgaccc 6060
caaaaaactt gatttgggtg atggttcacg tagtgggcca tcgccctgat agacggtttt 6120
tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc aaactggaac 6180
aacactcaac cctatctcgg gctattcttt tgatttataa gggattttgc cgatttcggc 6240
ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta acaaaatatt 6300
aacgtttaca attttatggt gcactctcag tacaatctgc tctgatgccg catagttaag 6360
ccagccccga cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc 6420
atccgcttac agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc 6480
gtcatcaccg aaacgcgcga gacgaaaggg cctcgtgata cgcctatttt tataggttaa 6540
tgtcatgata ataatggttt cttagacgtc aggtggcact tttcgggggaa atgtgcgcgg 6600
aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata 6660
accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg 6720
tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac 6780
gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact 6840
ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat 6900
gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga 6960
gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac 7020
agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat 7080
gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac 7140
cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct 7200
gaatgaagcc ataccaaacg acgagcgtga cacca 7235
<210> 72
<211> 7262
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 72
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taactgggca atgcgttaaa 1320
ctggcttttt tgacttccca gaacaatatc taattagcaa ataacacaat tcagtgacat 1380
tcagcaggat gcaaattcca gacactgcaa tcatgaacac tgtgaagaca gtcttctccg 1440
tgggccggga cacaaagcag tccacagtgt tgggacaagg ccaggcgttg cacttcacca 1500
gccgctgcat ggagaagccg tcgtacatga catagaagac gtacatgaag gcggcttcga 1560
agatgacccg gaagaagatg ctgcttgtgt aggtccacca cagggagcct tcgatgcgga 1620
ccttctgggt tttgatctcc tcgatgtcct taaattcact ctttatctcc cccttgatga 1680
acttcctctt cttctcatgt ctccggtagg ccacgtgcat ggccactagg agcgctggcg 1740
tggacacgaa gatcagctgc agggcccata gccggatgtg ggagatgggg aagtagtgat 1800
cgtagcacac gttcttgcag cctggctgca gggtgttgca gacaaagtcg gcctgctcat 1860
ctccccacac ctcctttgca gccacaacga ggatcataat gcgaaaaatg aagaggacgg 1920
tgagccagat ctttccaatg ctggtggagt gtttgttcac accccccagg atcgtctgca 1980
gcgtgcccca atccatcttc tactctgggc ggtttgctct ggaaaagacg aatgcacaca 2040
acacaggaat cactagctag gacagaacag ggagacttct ctgagtctgg gtaagcaagc 2100
atgcttaaat ctcttcctga gcaaacacca actcttacac aacctcacca aaacaggtga 2160
agacagaacc aacttagttt gtcattaatt aagacctcga aggggacttg gggggttcgg 2220
ggctttcggg ggcggtcggg ggttcgcgga cccgggaagc tctgaggacc cagaggccgg 2280
gcgcgctccg cccgcggcgc cgccccctcc gtaactttcc cagtctccga gggaagaggc 2340
ggggtgtggg gtgcggttaa aaggcgccac ggcgggagac aggtgttgcg gccccgcagc 2400
gcccgcgcgc tcctctcccc gactcggagc ccctcggcgg cgcccggccc aggacccgcc 2460
taggagcgca ggagccccag cgcagagacc ccaacgccga gacccccgcc ccggccccgc 2520
cgcgcttcct cccgacgcag agcaaaccgc ccagagtaga agccatggtg agcaagggcg 2580
aggagctgtt caccggggtg gtgcccatcc tggtcgagct ggacggcgac gtaaacggcc 2640
acaagttcag cgtgtccggc gagggcgagg gcgatgccac ctacggcaag ctgaccctga 2700
agttcatctg caccaccggc aagctgcccg tgccctggcc caccctcgtg accaccctga 2760
cctacggcgt gcagtgcttc agccgctacc ccgaccacat gaagcagcac gacttcttca 2820
agtccgccat gcccgaaggc tacgtccagg agcgcaccat cttcttcaag gacgacggca 2880
actacaagac ccgcgccgag gtgaagttcg agggcgacac cctggtgaac cgcatcgagc 2940
tgaagggcat cgacttcaag gaggacggca acatcctggg gcacaagctg gagtacaact 3000
acaacagcca caacgtctat atcatggccg acaagcagaa gaacggcatc aaggtgaact 3060
tcaagatccg ccacaacatc gaggacggca gcgtgcagct cgccgaccac taccagcaga 3120
acacccccat cggcgacggc cccgtgctgc tgcccgacaa ccactacctg agcacccagt 3180
ccgccctgag caaagacccc aacgagaagc gcgatcacat ggtcctgctg gagttcgtga 3240
ccgccgccgg gatcactctc ggcatggacg agctgtacaa gtaataaagg cgcgccaccc 3300
ctgcagggaa ttccgcattg cccagttgtt agattaagaa atagacagca tgagagggat 3360
gaggcaaccc gtgctcagct gtcaaggctc agtcgctagc atttcccaac acaaagattc 3420
tgaccttaaa tgcaaccatt tgaaacccct gtaggcctca ggtgaaactc cagatgccac 3480
aatggagctc tgctccccta aagcctcaaa acaaaggcct aattctatgc ctgtcttaat 3540
tttctttcac ttaagttagt tccactgaga ccccaggctg ttaggggtta ttggtgtaag 3600
gtactttcat attttaaaca gaggatatcg gcatttgttt ctttctctga ggacaagaga 3660
aaaaagccag gttccacaga ggacacagag aaggtttggg tgtcctcctg gggttctttt 3720
tgccaacttt ccccacgtta aaggtgaaca ttggttcttt catttgcttt ggaagtttta 3780
atctctaaca gtggacaaag ttaccagtgc cttaaactct gttacacttt ttggaagtga 3840
aaactttgta gtatgatagg ttattttgat gtaaagatgt tctggatacc attatatgtt 3900
ccccctgttt cagaggctca gattgtaata tgtaaatggt atgtcattcg ctactatgat 3960
ttaatttgaa atatggtctt ttggttatga atactttgca gcacagctga gaggctgtct 4020
gttgtattca ttgtggtcat agcacctaac aacattgtag cctcaatcga gtgagacaga 4080
ctagaagttc ctagtgatgg cttatgatag caaatggcct catgtcaaat atttagatgt 4140
aattttgtgt aagaaataca gactggatgt accaccaact actacctgta atgacaggcc 4200
tgtccaacac atctcccttt tccatgactg tggtagccag catcggaaag aacgctgatt 4260
taaagaggtc gcttgggaat tttatgaca cagtaccatt taatggggag gacaaaatgg 4320
ggcaggggag ggagaagttt ctgtcgttaa aaacagattt ggaaagactg gactctaaag 4380
tctgttgatt aaagatgagc tttgtctact tcaaaagttt gtttgcttac cccttcagcc 4440
tccaattttt taagtgaaaa tatagctaat aacatgtgaa aagaatagaa gctaaggttt 4500
agataaatat tgagcagatc tataggaaga ttgaacctga atattgccat tatgcttgac 4560
atggtttcca aaaaatggta ctccacatat ttcagtgagg gtaagtattt tcctgttgtc 4620
aagaatagca ttgtaaaagc attttgtaat aataaagaat agctttaatg atatgcttgt 4680
aactaaaata attttgtaat gtatcaaata catttaaaac attaaaatat aatctctata 4740
ataatttaaa atctaatatg gttttaatag aacagcgata tcaagcttat cgataatcaa 4800
cctctggatt acaaaatttg tgaaagattg actggtattc ttaactatgt tgctcctttt 4860
acgctatgtg gatacgctgc tttaatgcct ttgtatcatg ctattgcttc ccgtatggct 4920
ttcattttct cctccttgta taaatcctgg ttgctgtctc tttatgagga gttgtggccc 4980
gttgtcaggc aacgtggcgt ggtgtgcact gtgtttgctg acgcaacccc cactggttgg 5040
ggcattgcca ccacctgtca gctcctttcc gggactttcg ctttccccct ccctattgcc 5100
acggcggaac tcatcgccgc ctgccttgcc cgctgctgga caggggctcg gctgttgggc 5160
actgacaatt ccgtggtgtt gtcggggaaa tcatcgtcct ttccttggct gctcgcctat 5220
gttgccacct ggattctgcg cgggacgtcc ttctgctacg tcccttcggc cctcaatcca 5280
gcggaccttc cttcccgcgg cctgctgccg gctctgcggc ctcttccgcg tcttcgcctt 5340
cgccctcaga cgagtcggat ctccctttgg gccgcctccc cgcgaattca tcgataccga 5400
gcgctgctcg agagatctgt gatagcggcc atcaagctgg ctgtgccttc tagttgccag 5460
ccatctgttg tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact 5520
gtcctttcct aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt 5580
ctggggggtg gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat 5640
gctggggaca cgtgcggacc gagcggccgc aggaacccct agtgatggag ttggccactc 5700
cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg 5760
gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag ctgcctgcag gggcgcctga 5820
tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcatacgt caaagcaacc 5880
atagtacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt 5940
gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct 6000
cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg 6060
atttagtgct ttacggcacc tcgaccccaa aaaacttgat ttgggtgatg gttcacgtag 6120
tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa 6180
tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcgggct attcttttga 6240
tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa 6300
atttaacgcg aattttaaca aaatattaac gtttacaatt ttatggtgca ctctcagtac 6360
aatctgctct gatgccgcat agttaagcca gccccgacac ccgccaacac ccgctgacgc 6420
gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg 6480
gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgagac gaaagggcct 6540
cgtgatacgc ctatttttat aggttaatgt catgataata atggtttctt agacgtcagg 6600
tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc 6660
aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag 6720
gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg 6780
ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt 6840
gggtgcacga gtggggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt 6900
tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt 6960
attatcccgt attgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa 7020
tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag 7080
agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac 7140
aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac 7200
tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac 7260
ca 7262
<210> 73
<211> 7220
<212> DNA
<213> artificial sequence
<220>
<223> synthetic
<400> 73
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 60
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 120
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 180
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 240
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 300
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 360
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 420
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 480
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 540
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 600
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 660
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 720
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 780
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 840
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 900
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 960
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 1020
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1080
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1140
acatgtcctg caggcagctg cgcgctcgct cgctcactga ggccgcccgg gcaaagcccg 1200
ggcgtcgggc gacctttggt cgcccggcct cagtgagcga gcgagcgcgc agagaggag 1260
tggccaactc catcactagg ggttcctgcg gccgcacgcg taactgggca atgcgttaaa 1320
ctggcttttt tgacttccca gaacaatatc taattagcaa ataacacaat tcagtgacat 1380
tcagcaggat gcaaattcca gacactgcaa tcatgaacac tgtgaagaca gtcttctccg 1440
tgggccggga cacaaagcag tccacagtgt tgggacaagg ccaggcgttg cacttcacca 1500
gccgctgcat ggagaagccg tcgtacatga catagaagac gtacatgaag gcggcttcga 1560
agatgacccg gaagaagatg ctgcttgtgt aggtccacca cagggagcct tcgatgcgga 1620
ccttctgggt tttgatctcc tcgatgtcct taaattcact ctttatctcc cccttgatga 1680
acttcctctt cttctcatgt ctccggtagg ccacgtgcat ggccactagg agcgctggcg 1740
tggacacgaa gatcagctgc agggcccata gccggatgtg ggagatgggg aagtagtgat 1800
cgtagcacac gttcttgcag cctggctgca gggtgttgca gacaaagtcg gcctgctcat 1860
ctccccacac ctcctttgca gccacaacga ggatcataat gcgaaaaatg aagaggacgg 1920
tgagccagat ctttccaatg ctggtggagt gtttgttcac accccccagg atcgtctgca 1980
gcgtgcccca atccatcttc tactctgggc ggtttgctct ggaaaagacg aatgcacaca 2040
acacaggaat cactagctag gacagaacag ggagacttct ctgagtctgg gtaagcaagc 2100
atgcttaaat ctcttcctga gcaaacacca actcttacac aacctcacca aaacaggtga 2160
agacagaacc aacttagttt gtcattaatt aagacctcga aggggacttg gggggttcgg 2220
ggctttcggg ggcggtcggg ggttcgcgga cccgggaagc tctgaggacc cagaggccgg 2280
gcgcgctccg cccgcggcgc cgccccctcc gtaactttcc cagtctccga gggaagaggc 2340
ggggtgtggg gtgcggttaa aaggcgccac ggcgggagac aggtgttgcg gccccgcagc 2400
gcccgcgcgc tcctctcccc gactcggagc ccctcggcgg cgcccggccc aggacccgcc 2460
taggagcgca ggagccccag cgcagagacc ccaacgccga gacccccgcc ccggccccgc 2520
cgcgcttcct cccgacgcag agcaaaccgc ccagagtaga agccatggat tggggcacgc 2580
tgcagacgat cctggggggt gtgaacaaac actccaccag cattggaaag atctggctca 2640
ccgtcctctt catttttcgc attatgatcc tcgttgtggc tgcaaaggag gtgtggggag 2700
atgagcaggc cgactttgtc tgcaacaccc tgcagccagg ctgcaagaac gtggtgctacg 2760
atcactactt ccccatctcc cacatccggc tatgggccct gcagctgatc ttcgtgtcca 2820
cgccagcgct cctagtggcc atgcacgtgg cctaccggag acatgagaag aagaggaagt 2880
tcatcaaggg ggagataaag agtgaattta aggagacatcga ggagatcaaa acccagaagg 2940
tccgcatcga aggctccctg tggtggacct acacaagcag catcttcttc cgggtcatct 3000
tcgaagccgc cttcatgtac gtcttctatg tcatgtacga cggcttctcc atgcagcggc 3060
tggtgaagtg caacgcctgg ccttgtccca acactgtgga ctgctttgtg tcccggccca 3120
cggagaagac tgtcttcaca gtgttcatga ttgcagtgtc tggaatttgc atcctgctga 3180
atgtcactga attgtgttat ttgctaatta gatattgttc tgggaagtca aaaaagccag 3240
tttaaaggcg cgccacccct gcagggaatt ccgcattgcc cagttgttag attaagaaat 3300
agacagcatg agagggatga ggcaacccgt gctcagctgt caaggctcag tcgctagcat 3360
ttcccaacac aaagattctg accttaaatg caaccatttg aaacccctgt aggcctcagg 3420
tgaaactcca gatgccacaa tggagctctg ctcccctaaa gcctcaaaac aaaggcctaa 3480
ttctatgcct gtcttaattt tctttcactt aagttagttc cactgagacc ccaggctgtt 3540
aggggttat ggtgtaaggt actttcatat tttaaacaga ggatatcggc atttgtttct 3600
ttctctgagg acaagagaaa aaagccaggt tccacagagg acacagagaa ggtttgggtg 3660
tcctcctggg gttctttttg ccaactttcc ccacgttaaa ggtgaacatt ggttctttca 3720
tttgctttgg aagttttaat ctctaacagt ggacaaagtt accagtgcct taaactctgt 3780
tacacttttt ggaagtgaaa actttgtagt atgataggtt attttgatgt aaagatgttc 3840
tggataccat tatatgttcc ccctgtttca gaggctcaga ttgtaatatg taaatggtat 3900
gtcattcgct actatgattt aatttgaaat atggtctttt ggttatgaat actttgcagc 3960
acagctgaga ggctgtctgt tgtattcatt gtggtcatag cacctaacaa cattgtagcc 4020
tcaatcgagt gagacagact agaagttcct agtgatggct tatgatagca aatggcctca 4080
tgtcaaatat ttagatgtaa ttttgtgtaa gaaatacaga ctggatgtac caccaactac 4140
tacctgtaat gacaggcctg tccaacacat ctcccttttc catgactgtg gtagccagca 4200
tcggaaagaa cgctgattta aagaggtcgc ttgggaattt tattgacaca gtaccattta 4260
atggggagga caaaatgggg caggggaggg agaagtttct gtcgttaaaa acagatttgg 4320
aaagactgga ctctaaagtc tgttgattaa agatgagctt tgtctacttc aaaagtttgt 4380
ttgcttaccc cttcagcctc caatttttta agtgaaaata tagctaataa catgtgaaaa 4440
gaatagaagc taaggtttag ataaatattg agcagatcta taggaagatt gaacctgaat 4500
attgccatta tgcttgacat ggtttccaaa aaatggtact ccacatattt cagtgagggt 4560
aagtattttc ctgttgtcaa gaatagcatt gtaaaagcat tttgtaataa taaagaatag 4620
ctttaatgat atgcttgtaa ctaaaataat tttgtaatgt atcaaataca tttaaaacat 4680
taaaatataa tctctataat aatttaaaat ctaatatggt tttaatagaa cagcgatatc 4740
aagctttcg ataatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt 4800
aactatgttg ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct 4860
attgcttccc gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt 4920
tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac 4980
gcaaccccca ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct 5040
ttccccctcc ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca 5100
ggggctcggc tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt 5160
ccttggctgc tcgcctatgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc 5220
ccttcggccc tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct 5280
cttccgcgtc ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg 5340
cgaattcatc gataccgagc gctgctcgag agatctgtga tagcggccat caagctggct 5400
gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc cttgaccctg 5460
gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc gcattgtctg 5520
agtaggtgtc attctattct ggggggtggg gtggggcagg acagcaaggg ggaggattgg 5580
gaagacaata gcaggcatgc tggggacacg tgcggaccga gcggccgcag gaacccctag 5640
tgatggagtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcgaccaa 5700
aggtcgcccg acgcccgggc tttgcccggg cggcctcagt gagcgagcga gcgcgcagct 5760
gcctgcaggg gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc 5820
gcatacgtca aagcaaccat agtacgcgcc ctgtagcggc gcattaagcg cggcgggtgt 5880
ggtggttacg cgcagcgtga ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc 5940
tttcttccct tcctttctcg ccacgttcgc cggctttccc cgtcaagctc taaatcgggg 6000
gctcccttta gggttccgat ttagtgcttt acggcacctc gaccccaaaa aacttgattt 6060
gggtgatggt tcacgtagtg ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt 6120
ggagtccacg ttctttaata gtggactctt gttccaaact ggaacaacac tcaaccctat 6180
ctcgggctat tcttttgatt tataagggat tttgccgatt tcggcctatt ggttaaaaaa 6240
tgagctgatt taacaaaaat ttaacgcgaa ttttaacaaa atattaacgt ttacaatttt 6300
atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagc cccgacaccc 6360
gccaacccc gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg cttacagaca 6420
agctgtgacc gtctccggga gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg 6480
cgcgagacga aagggcctcg tgatacgcct atttttatag gttaatgtca tgataataat 6540
ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt 6600
atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct 6660
tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg cccttattcc 6720
cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa 6780
agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg 6840
taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt 6900
tctgctatgt ggcgcggtat tatcccgtat tgacgccggg caagagcaac tcggtcgccg 6960
catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac 7020
ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc 7080
ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt ttttgcacaa 7140
catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc 7200
Claims (76)
서열식별번호: 103에 대해 적어도 80% 동일한, 임의로 서열식별번호: 103에 대해 100% 동일한 제1 핵산 서열; 및/또는
서열식별번호: 104에 대해 적어도 80% 동일한, 임의로 서열식별번호: 104에 대해 100% 동일한 제2 핵산 서열
을 포함하는 것인 단리된 핵산.26. The method of claim 25, wherein the 5 'UTR is
a first nucleic acid sequence that is at least 80% identical to SEQ ID NO: 103, optionally 100% identical to SEQ ID NO: 103; and/or
A second nucleic acid sequence that is at least 80% identical to SEQ ID NO: 104, optionally 100% identical to SEQ ID NO: 104
An isolated nucleic acid comprising a.
서열식별번호: 106에 대해 적어도 80% 동일한, 임의로 서열식별번호: 106에 대해 100% 동일한 뉴클레오티드 서열을 갖는 5' ITR; 및/또는
서열식별번호: 107에 대해 적어도 80% 동일한, 임의로 서열식별번호: 107에 대해 100% 동일한 뉴클레오티드 서열을 갖는 3' ITR
을 포함하는 것인 단리된 핵산.34. The expression cassette of claim 32 or 33
a 5' ITR having a nucleotide sequence that is at least 80% identical to SEQ ID NO: 106, optionally 100% identical to SEQ ID NO: 106; and/or
A 3' ITR having a nucleotide sequence that is at least 80% identical to SEQ ID NO: 107, optionally 100% identical to SEQ ID NO: 107.
An isolated nucleic acid comprising a.
(a) 5' ITR;
(b) GJB2 프로모터, 또는 그의 기저 GJB2 프로모터 서열;
(c) GJB2 5' UTR;
(d) GJB2 단백질을 코딩하는 뉴클레오티드 서열;
(e) GJB2 3' UTR;
(f) 소 성장 호르몬 폴리 A 신호; 및
(g) 3' ITR.Vector containing 5' to 3':
(a) 5'ITR;
(b) a GJB2 promoter, or an underlying GJB2 promoter sequence thereof;
(c) GJB2 5'UTR;
(d) a nucleotide sequence encoding the GJB2 protein;
(e) GJB2 3'UTR;
(f) bovine growth hormone poly A signal; and
(g) 3' ITRs.
(a) 5' ITR;
(b) GJB2 인핸서;
(c) GJB2 프로모터, 또는 그의 기저 GJB2 프로모터 서열;
(d) GJB2 5' UTR;
(e) GJB2 단백질을 코딩하는 뉴클레오티드 서열;
(f) GJB2 3' UTR;
(g) 소 성장 호르몬 폴리 A 신호; 및
(h) 3' ITR.Vector containing 5' to 3':
(a) 5'ITR;
(b) a GJB2 enhancer;
(c) a GJB2 promoter, or an underlying GJB2 promoter sequence thereof;
(d) GJB2 5'UTR;
(e) a nucleotide sequence encoding the GJB2 protein;
(f) GJB2 3'UTR;
(g) bovine growth hormone poly A signal; and
(h) 3' ITRs.
(i) 캡시드 단백질; 및
(ii) 제1항 내지 제44항 중 어느 한 항의 단리된 핵산.Recombinant adeno-associated virus (rAAV) comprising:
(i) capsid proteins; and
(ii) the isolated nucleic acid of any one of claims 1-44.
(i) 캡시드 단백질; 및
(ii) (a) 5' ITR;
(b) GJB2 프로모터, 또는 그의 기저 GJB2 프로모터 서열;
(c) GJB2 5' UTR;
(d) GJB2 단백질을 코딩하는 뉴클레오티드 서열;
(e) GJB2 3' UTR;
(f) 소 성장 호르몬 폴리 A 신호; 및
(g) 3' ITR
을 포함하는 단리된 핵산.Recombinant adeno-associated virus (rAAV) comprising:
(i) capsid proteins; and
(ii) (a) 5'ITR;
(b) a GJB2 promoter, or an underlying GJB2 promoter sequence thereof;
(c) GJB2 5'UTR;
(d) a nucleotide sequence encoding the GJB2 protein;
(e) GJB2 3'UTR;
(f) bovine growth hormone poly A signal; and
(g) 3'ITR
An isolated nucleic acid comprising
(i) 캡시드 단백질; 및
(ii) (a) 5' ITR;
(b) GJB2 인핸서;
(c) GJB2 프로모터, 또는 그의 기저 GJB2 프로모터 서열;
(d) GJB2 5' UTR;
(e) GJB2 단백질을 코딩하는 뉴클레오티드 서열;
(f) GJB2 3' UTR;
(g) 소 성장 호르몬 폴리 A 신호; 및
(h) 3' ITR
을 포함하는 단리된 핵산.Recombinant adeno-associated virus (rAAV) comprising:
(i) capsid proteins; and
(ii) (a) 5'ITR;
(b) a GJB2 enhancer;
(c) a GJB2 promoter, or an underlying GJB2 promoter sequence thereof;
(d) GJB2 5'UTR;
(e) a nucleotide sequence encoding the GJB2 protein;
(f) GJB2 3'UTR;
(g) bovine growth hormone poly A signal; and
(h) 3'ITR
An isolated nucleic acid comprising
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202063078233P | 2020-09-14 | 2020-09-14 | |
US63/078,233 | 2020-09-14 | ||
US202163161619P | 2021-03-16 | 2021-03-16 | |
US63/161,619 | 2021-03-16 | ||
PCT/US2021/050205 WO2022056444A1 (en) | 2020-09-14 | 2021-09-14 | Recombinant adeno associated virus (raav) encoding gjb2 and uses thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20230069157A true KR20230069157A (en) | 2023-05-18 |
Family
ID=80631939
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020237012321A KR20230069157A (en) | 2020-09-14 | 2021-09-14 | Recombinant adeno-associated virus (rAAV) encoding GJB2 and uses thereof |
Country Status (10)
Country | Link |
---|---|
EP (1) | EP4211151A1 (en) |
JP (1) | JP2023541443A (en) |
KR (1) | KR20230069157A (en) |
AU (1) | AU2021339843A1 (en) |
BR (1) | BR112023004605A2 (en) |
CA (1) | CA3191533A1 (en) |
IL (1) | IL301057A (en) |
MX (1) | MX2023002978A (en) |
TW (1) | TW202227476A (en) |
WO (1) | WO2022056444A1 (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2022549380A (en) * | 2019-09-30 | 2022-11-24 | アプライド ジェネティック テクノロジーズ コーポレイション | Adeno-associated virus (AAV) system for the treatment of hereditary deafness |
WO2024011224A2 (en) * | 2022-07-08 | 2024-01-11 | The Trustees Of Columbia University In The City Of New York | Regulatory element for cell type specific expression of genes in spinal motor neurons |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1781813B1 (en) * | 2004-06-17 | 2010-01-27 | Epigenomics AG | Compositions and methods for preventing carry-over contamination in nucleic acid amplification reactions |
JP2009521933A (en) * | 2005-12-28 | 2009-06-11 | セントカー・インコーポレーテツド | Markers and methods for assessing and treating psoriasis and related disorders |
US20210079406A1 (en) * | 2018-04-10 | 2021-03-18 | President And Fellows Of Harvard College | Aav vectors encoding clarin-1 or gjb2 and uses thereof |
KR20210113160A (en) * | 2018-10-11 | 2021-09-15 | 데시벨 테라퓨틱스, 인크. | AAV1 vectors and their use for the treatment of otic indications |
WO2020097372A1 (en) * | 2018-11-07 | 2020-05-14 | Akouos, Inc. | Use of adeno-associated viral vectors to correct gene defects/ express proteins in hair cells and supporting cells in the inner ear |
JP2022549380A (en) * | 2019-09-30 | 2022-11-24 | アプライド ジェネティック テクノロジーズ コーポレイション | Adeno-associated virus (AAV) system for the treatment of hereditary deafness |
-
2021
- 2021-09-14 BR BR112023004605A patent/BR112023004605A2/en unknown
- 2021-09-14 WO PCT/US2021/050205 patent/WO2022056444A1/en active Application Filing
- 2021-09-14 KR KR1020237012321A patent/KR20230069157A/en unknown
- 2021-09-14 CA CA3191533A patent/CA3191533A1/en active Pending
- 2021-09-14 MX MX2023002978A patent/MX2023002978A/en unknown
- 2021-09-14 TW TW110134291A patent/TW202227476A/en unknown
- 2021-09-14 AU AU2021339843A patent/AU2021339843A1/en active Pending
- 2021-09-14 EP EP21867807.6A patent/EP4211151A1/en active Pending
- 2021-09-14 JP JP2023516689A patent/JP2023541443A/en active Pending
- 2021-09-14 IL IL301057A patent/IL301057A/en unknown
Also Published As
Publication number | Publication date |
---|---|
AU2021339843A1 (en) | 2023-04-06 |
IL301057A (en) | 2023-05-01 |
JP2023541443A (en) | 2023-10-02 |
MX2023002978A (en) | 2023-06-01 |
BR112023004605A2 (en) | 2023-04-11 |
EP4211151A1 (en) | 2023-07-19 |
CA3191533A1 (en) | 2022-03-17 |
WO2022056444A1 (en) | 2022-03-17 |
TW202227476A (en) | 2022-07-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102606174B1 (en) | An optimized strategy for exon skipping modification using CRISPR/CAS9 with triple guide sequences. | |
CN108753824B (en) | Viral vectors for the treatment of retinal dystrophy | |
KR20230022175A (en) | Orientation of AAV capsids | |
KR20200044793A (en) | Compositions and methods for delivery of AAV | |
KR20230057487A (en) | Methods and compositions for genomic manipulation | |
KR102604096B1 (en) | Gene therapy to treat Wilson's disease | |
CN110325199A (en) | For treating the gene therapy of phenylketonuria | |
JP2022137029A (en) | CpG-REDUCED FACTOR VIII VARIANT, COMPOSITION, AND METHOD AND USE FOR TREATING HEMOSTASIS DISORDER | |
CN112218882A (en) | FOXP3 in edited CD34+Expression in cells | |
KR20200018455A (en) | AADC Polynucleotides for the Treatment of Parkinson's Disease | |
AU2016343979A1 (en) | Delivery of central nervous system targeting polynucleotides | |
KR20200032174A (en) | Enhanced chimeric antigen receptors and uses thereof | |
KR20230053735A (en) | Improved methods and compositions for manipulation of genomes | |
KR20200116933A (en) | Compositions and methods for correcting dystrophin mutations in human cardiomyocytes | |
KR20200126997A (en) | Compositions and methods for the treatment of non-aging-related hearing impairment in human subjects | |
KR102628872B1 (en) | Tools and methods for using cell division loci to control proliferation of cells | |
KR20210005146A (en) | Expression of human FOXP3 in gene edited T cells | |
KR20210068068A (en) | Prataxin expression constructs with engineered promoters and methods of use thereof | |
KR20230069157A (en) | Recombinant adeno-associated virus (rAAV) encoding GJB2 and uses thereof | |
KR20200095462A (en) | Adeno-associated virus composition for restoring HBB gene function and method of use thereof | |
CN112912112A (en) | Liver-specific nucleic acid regulatory elements and methods and uses thereof | |
TW202221125A (en) | Compositions and methods for the treatment of neurological disorders related to glucosylceramidase beta deficiency | |
KR20230002681A (en) | Integration of large adenovirus payloads | |
CN115768890A (en) | Thermal control of T cell immunotherapy by molecular and physical initiation | |
KR20230023641A (en) | Compositions and methods for treating GJB2-associated hearing loss |