CN1454256A - 11q13.3的高骨量基因 - Google Patents
11q13.3的高骨量基因 Download PDFInfo
- Publication number
- CN1454256A CN1454256A CN00819619A CN00819619A CN1454256A CN 1454256 A CN1454256 A CN 1454256A CN 00819619 A CN00819619 A CN 00819619A CN 00819619 A CN00819619 A CN 00819619A CN 1454256 A CN1454256 A CN 1454256A
- Authority
- CN
- China
- Prior art keywords
- seq
- hbm
- gene
- bone
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 311
- 210000000988 bone and bone Anatomy 0.000 title claims abstract description 177
- 238000000034 method Methods 0.000 claims abstract description 238
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 63
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 51
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 38
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 38
- 239000013604 expression vector Substances 0.000 claims abstract description 17
- 210000001650 focal adhesion Anatomy 0.000 claims abstract description 16
- 239000000463 material Substances 0.000 claims abstract description 14
- 108020004414 DNA Proteins 0.000 claims description 191
- 210000004027 cell Anatomy 0.000 claims description 162
- 239000002299 complementary DNA Substances 0.000 claims description 150
- 210000004436 artificial bacterial chromosome Anatomy 0.000 claims description 132
- 239000002773 nucleotide Substances 0.000 claims description 103
- 125000003729 nucleotide group Chemical group 0.000 claims description 103
- 239000000523 sample Substances 0.000 claims description 97
- 101150103172 HBM gene Proteins 0.000 claims description 91
- 230000014509 gene expression Effects 0.000 claims description 66
- 238000004458 analytical method Methods 0.000 claims description 62
- 239000012634 fragment Substances 0.000 claims description 59
- 235000018102 proteins Nutrition 0.000 claims description 58
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 39
- 108020004999 messenger RNA Proteins 0.000 claims description 36
- 238000012216 screening Methods 0.000 claims description 32
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 28
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 26
- 241001465754 Metazoa Species 0.000 claims description 25
- 239000002253 acid Substances 0.000 claims description 25
- 150000001413 amino acids Chemical class 0.000 claims description 22
- 239000003153 chemical reaction reagent Substances 0.000 claims description 19
- 235000001014 amino acid Nutrition 0.000 claims description 17
- 230000001105 regulatory effect Effects 0.000 claims description 17
- 230000000295 complement effect Effects 0.000 claims description 16
- 208000022696 bone development disease Diseases 0.000 claims description 15
- 230000037182 bone density Effects 0.000 claims description 13
- 239000013599 cloning vector Substances 0.000 claims description 13
- 238000011156 evaluation Methods 0.000 claims description 12
- 239000003795 chemical substances by application Substances 0.000 claims description 10
- 239000000284 extract Substances 0.000 claims description 10
- 230000005764 inhibitory process Effects 0.000 claims description 9
- 230000001009 osteoporotic effect Effects 0.000 claims description 9
- 238000013518 transcription Methods 0.000 claims description 9
- 230000035897 transcription Effects 0.000 claims description 9
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 8
- 238000009509 drug development Methods 0.000 claims description 7
- 229920001184 polypeptide Polymers 0.000 claims description 7
- 210000001541 thymus gland Anatomy 0.000 claims description 7
- 238000011161 development Methods 0.000 claims description 6
- 230000018109 developmental process Effects 0.000 claims description 6
- 102000054765 polymorphisms of proteins Human genes 0.000 claims description 6
- 230000002441 reversible effect Effects 0.000 claims description 6
- 101001043594 Homo sapiens Low-density lipoprotein receptor-related protein 5 Proteins 0.000 claims description 5
- 102100021926 Low-density lipoprotein receptor-related protein 5 Human genes 0.000 claims description 5
- 208000003263 MASS syndrome Diseases 0.000 claims description 5
- 230000003834 intracellular effect Effects 0.000 claims description 5
- 230000008054 signal transmission Effects 0.000 claims description 5
- 102000013918 Apolipoproteins E Human genes 0.000 claims description 4
- 108010025628 Apolipoproteins E Proteins 0.000 claims description 4
- 102000004067 Osteocalcin Human genes 0.000 claims description 4
- 108090000573 Osteocalcin Proteins 0.000 claims description 4
- 230000001276 controlling effect Effects 0.000 claims description 4
- 238000002405 diagnostic procedure Methods 0.000 claims description 4
- 102000054766 genetic haplotypes Human genes 0.000 claims description 4
- 230000001568 sexual effect Effects 0.000 claims description 4
- 241000894007 species Species 0.000 claims description 4
- 230000002103 transcriptional effect Effects 0.000 claims description 4
- 208000020084 Bone disease Diseases 0.000 claims description 3
- 230000005540 biological transmission Effects 0.000 claims description 3
- 150000003230 pyrimidines Chemical class 0.000 claims description 3
- 241000283984 Rodentia Species 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 10
- 208000034826 Genetic Predisposition to Disease Diseases 0.000 claims 2
- 102000007365 Sialoglycoproteins Human genes 0.000 claims 1
- 108010032838 Sialoglycoproteins Proteins 0.000 claims 1
- 230000002708 enhancing effect Effects 0.000 claims 1
- 230000002401 inhibitory effect Effects 0.000 claims 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 54
- 201000010099 disease Diseases 0.000 abstract description 53
- 208000001132 Osteoporosis Diseases 0.000 abstract description 39
- 238000010367 cloning Methods 0.000 abstract description 25
- 230000014461 bone development Effects 0.000 abstract description 22
- 239000013598 vector Substances 0.000 abstract description 9
- -1 coding sequences Chemical class 0.000 abstract description 8
- 239000003155 DNA primer Substances 0.000 abstract 1
- 239000002751 oligonucleotide probe Substances 0.000 abstract 1
- 239000008194 pharmaceutical composition Substances 0.000 abstract 1
- 230000011664 signaling Effects 0.000 abstract 1
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 650
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical group SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 269
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 173
- 238000003752 polymerase chain reaction Methods 0.000 description 63
- 230000008859 change Effects 0.000 description 60
- 238000006243 chemical reaction Methods 0.000 description 49
- 239000000203 mixture Substances 0.000 description 49
- 102220023257 rs387907546 Human genes 0.000 description 40
- 108091034117 Oligonucleotide Proteins 0.000 description 39
- 239000000243 solution Substances 0.000 description 38
- 108091035242 Sequence-tagged site Proteins 0.000 description 35
- 238000012360 testing method Methods 0.000 description 34
- 210000001519 tissue Anatomy 0.000 description 34
- 239000000047 product Substances 0.000 description 33
- 102220023258 rs387907548 Human genes 0.000 description 33
- 241000894006 Bacteria Species 0.000 description 32
- 102220369447 c.1352G>A Human genes 0.000 description 32
- 239000003550 marker Substances 0.000 description 30
- 241000699666 Mus <mouse, genus> Species 0.000 description 29
- 230000000694 effects Effects 0.000 description 29
- 238000005516 engineering process Methods 0.000 description 29
- 238000009396 hybridization Methods 0.000 description 29
- 230000003321 amplification Effects 0.000 description 28
- 102220369446 c.1274G>A Human genes 0.000 description 28
- 230000002068 genetic effect Effects 0.000 description 28
- 238000003199 nucleic acid amplification method Methods 0.000 description 28
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 27
- 241000282414 Homo sapiens Species 0.000 description 27
- 230000000692 anti-sense effect Effects 0.000 description 27
- 239000012528 membrane Substances 0.000 description 27
- 108700024394 Exon Proteins 0.000 description 26
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 26
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 26
- 239000003814 drug Substances 0.000 description 26
- 238000011160 research Methods 0.000 description 26
- 230000000875 corresponding effect Effects 0.000 description 22
- 230000006870 function Effects 0.000 description 22
- 239000000499 gel Substances 0.000 description 22
- 238000002360 preparation method Methods 0.000 description 21
- 239000013612 plasmid Substances 0.000 description 19
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 18
- 238000001556 precipitation Methods 0.000 description 18
- 102220023256 rs387907547 Human genes 0.000 description 17
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 17
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 16
- 238000012408 PCR amplification Methods 0.000 description 16
- 150000001875 compounds Chemical class 0.000 description 16
- 230000003993 interaction Effects 0.000 description 16
- 230000008569 process Effects 0.000 description 16
- 238000012163 sequencing technique Methods 0.000 description 16
- 238000012546 transfer Methods 0.000 description 16
- 102000004190 Enzymes Human genes 0.000 description 15
- 108090000790 Enzymes Proteins 0.000 description 15
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 15
- 230000002759 chromosomal effect Effects 0.000 description 15
- 238000013016 damping Methods 0.000 description 15
- 229940088598 enzyme Drugs 0.000 description 15
- 239000012530 fluid Substances 0.000 description 15
- 239000006228 supernatant Substances 0.000 description 15
- 238000013461 design Methods 0.000 description 14
- 238000001962 electrophoresis Methods 0.000 description 14
- 238000007901 in situ hybridization Methods 0.000 description 14
- 208000010392 Bone Fractures Diseases 0.000 description 13
- 238000013459 approach Methods 0.000 description 13
- 230000012010 growth Effects 0.000 description 13
- 239000007788 liquid Substances 0.000 description 13
- 238000001712 DNA sequencing Methods 0.000 description 12
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 12
- 229920002684 Sepharose Polymers 0.000 description 12
- 241000700605 Viruses Species 0.000 description 12
- 210000004369 blood Anatomy 0.000 description 12
- 239000008280 blood Substances 0.000 description 12
- 230000029087 digestion Effects 0.000 description 12
- 239000008103 glucose Substances 0.000 description 12
- 238000013507 mapping Methods 0.000 description 12
- 230000008521 reorganization Effects 0.000 description 12
- 238000005406 washing Methods 0.000 description 12
- 210000005253 yeast cell Anatomy 0.000 description 12
- 206010017076 Fracture Diseases 0.000 description 11
- 239000004471 Glycine Substances 0.000 description 11
- 150000007513 acids Chemical class 0.000 description 11
- 210000000349 chromosome Anatomy 0.000 description 11
- 229940079593 drug Drugs 0.000 description 11
- 239000000975 dye Substances 0.000 description 11
- 229910052500 inorganic mineral Inorganic materials 0.000 description 11
- 239000011707 mineral Substances 0.000 description 11
- 238000010561 standard procedure Methods 0.000 description 11
- 108700028369 Alleles Proteins 0.000 description 10
- 108010001831 LDL receptors Proteins 0.000 description 10
- 102220369445 c.668T>C Human genes 0.000 description 10
- 230000002308 calcification Effects 0.000 description 10
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 10
- 238000001415 gene therapy Methods 0.000 description 10
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 10
- 238000003780 insertion Methods 0.000 description 10
- 230000037431 insertion Effects 0.000 description 10
- 238000010369 molecular cloning Methods 0.000 description 10
- 208000001685 postmenopausal osteoporosis Diseases 0.000 description 10
- 108091060211 Expressed sequence tag Proteins 0.000 description 9
- 102000000853 LDL receptors Human genes 0.000 description 9
- 239000004677 Nylon Substances 0.000 description 9
- 210000004556 brain Anatomy 0.000 description 9
- 238000001514 detection method Methods 0.000 description 9
- 229920001778 nylon Polymers 0.000 description 9
- 210000002997 osteoclast Anatomy 0.000 description 9
- 238000000746 purification Methods 0.000 description 9
- 230000006798 recombination Effects 0.000 description 9
- 238000005215 recombination Methods 0.000 description 9
- 102220004457 rs11567847 Human genes 0.000 description 9
- 210000002027 skeletal muscle Anatomy 0.000 description 9
- 239000011780 sodium chloride Substances 0.000 description 9
- 238000002560 therapeutic procedure Methods 0.000 description 9
- 238000001086 yeast two-hybrid system Methods 0.000 description 9
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 8
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 8
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 8
- 241000282326 Felis catus Species 0.000 description 8
- 238000002105 Southern blotting Methods 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 238000010168 coupling process Methods 0.000 description 8
- 235000011187 glycerol Nutrition 0.000 description 8
- 210000003734 kidney Anatomy 0.000 description 8
- 239000011159 matrix material Substances 0.000 description 8
- 230000035772 mutation Effects 0.000 description 8
- 108091008146 restriction endonucleases Proteins 0.000 description 8
- 230000009182 swimming Effects 0.000 description 8
- 230000001225 therapeutic effect Effects 0.000 description 8
- 230000027455 binding Effects 0.000 description 7
- 230000033228 biological regulation Effects 0.000 description 7
- 230000004087 circulation Effects 0.000 description 7
- 230000008878 coupling Effects 0.000 description 7
- 238000005859 coupling reaction Methods 0.000 description 7
- 238000007877 drug screening Methods 0.000 description 7
- 102000006495 integrins Human genes 0.000 description 7
- 108010044426 integrins Proteins 0.000 description 7
- 235000015097 nutrients Nutrition 0.000 description 7
- 210000000689 upper leg Anatomy 0.000 description 7
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 6
- 241000271566 Aves Species 0.000 description 6
- 241000588724 Escherichia coli Species 0.000 description 6
- 108091000080 Phosphotransferase Proteins 0.000 description 6
- 241000700159 Rattus Species 0.000 description 6
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 6
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 6
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 6
- 102100023895 Zyxin Human genes 0.000 description 6
- KLOHDWPABZXLGI-YWUHCJSESA-M ampicillin sodium Chemical compound [Na+].C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C([O-])=O)(C)C)=CC=CC=C1 KLOHDWPABZXLGI-YWUHCJSESA-M 0.000 description 6
- 239000000074 antisense oligonucleotide Substances 0.000 description 6
- 238000012230 antisense oligonucleotides Methods 0.000 description 6
- 230000001580 bacterial effect Effects 0.000 description 6
- 239000011324 bead Substances 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 210000001612 chondrocyte Anatomy 0.000 description 6
- 238000011534 incubation Methods 0.000 description 6
- 239000003112 inhibitor Substances 0.000 description 6
- 238000002347 injection Methods 0.000 description 6
- 239000007924 injection Substances 0.000 description 6
- 230000000968 intestinal effect Effects 0.000 description 6
- 238000002156 mixing Methods 0.000 description 6
- 238000005457 optimization Methods 0.000 description 6
- 102000020233 phosphotransferase Human genes 0.000 description 6
- 239000011541 reaction mixture Substances 0.000 description 6
- 210000003625 skull Anatomy 0.000 description 6
- 210000000130 stem cell Anatomy 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 241000701161 unidentified adenovirus Species 0.000 description 6
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 5
- NKDFYOWSKOHCCO-YPVLXUMRSA-N 20-hydroxyecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@](C)(O)[C@H](O)CCC(C)(O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 NKDFYOWSKOHCCO-YPVLXUMRSA-N 0.000 description 5
- 101150030271 AXIN1 gene Proteins 0.000 description 5
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 5
- 241000124008 Mammalia Species 0.000 description 5
- 206010028980 Neoplasm Diseases 0.000 description 5
- 208000010191 Osteitis Deformans Diseases 0.000 description 5
- 208000027868 Paget disease Diseases 0.000 description 5
- 241000656145 Thyrsites atun Species 0.000 description 5
- 108010005774 beta-Galactosidase Proteins 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 238000004364 calculation method Methods 0.000 description 5
- 230000030570 cellular localization Effects 0.000 description 5
- 210000004718 centriole Anatomy 0.000 description 5
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 5
- 235000012000 cholesterol Nutrition 0.000 description 5
- 239000012141 concentrate Substances 0.000 description 5
- 230000002950 deficient Effects 0.000 description 5
- 239000008367 deionised water Substances 0.000 description 5
- 229910021641 deionized water Inorganic materials 0.000 description 5
- 238000003745 diagnosis Methods 0.000 description 5
- 230000004069 differentiation Effects 0.000 description 5
- 238000010790 dilution Methods 0.000 description 5
- 239000012895 dilution Substances 0.000 description 5
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 5
- 229960005542 ethidium bromide Drugs 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 238000013467 fragmentation Methods 0.000 description 5
- 238000006062 fragmentation reaction Methods 0.000 description 5
- 238000007710 freezing Methods 0.000 description 5
- 230000036541 health Effects 0.000 description 5
- 238000011081 inoculation Methods 0.000 description 5
- 208000027202 mammary Paget disease Diseases 0.000 description 5
- 230000013011 mating Effects 0.000 description 5
- 230000011164 ossification Effects 0.000 description 5
- 229940049547 paraxin Drugs 0.000 description 5
- 239000012071 phase Substances 0.000 description 5
- 230000026731 phosphorylation Effects 0.000 description 5
- 238000006366 phosphorylation reaction Methods 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 238000000926 separation method Methods 0.000 description 5
- 208000024891 symptom Diseases 0.000 description 5
- 238000001890 transfection Methods 0.000 description 5
- 101150056293 AJUBA gene Proteins 0.000 description 4
- 229920001817 Agar Polymers 0.000 description 4
- 229920000936 Agarose Polymers 0.000 description 4
- 244000153158 Ammi visnaga Species 0.000 description 4
- 235000010585 Ammi visnaga Nutrition 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 206010006187 Breast cancer Diseases 0.000 description 4
- 208000026310 Breast neoplasm Diseases 0.000 description 4
- 108060001064 Calcitonin Proteins 0.000 description 4
- 102000055006 Calcitonin Human genes 0.000 description 4
- LTMHDMANZUZIPE-AMTYYWEZSA-N Digoxin Natural products O([C@H]1[C@H](C)O[C@H](O[C@@H]2C[C@@H]3[C@@](C)([C@@H]4[C@H]([C@]5(O)[C@](C)([C@H](O)C4)[C@H](C4=CC(=O)OC4)CC5)CC3)CC2)C[C@@H]1O)[C@H]1O[C@H](C)[C@@H](O[C@H]2O[C@@H](C)[C@H](O)[C@@H](O)C2)[C@@H](O)C1 LTMHDMANZUZIPE-AMTYYWEZSA-N 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 4
- 101001005101 Homo sapiens LIM domain-containing protein 1 Proteins 0.000 description 4
- 102100026033 LIM domain-containing protein 1 Human genes 0.000 description 4
- 241000283973 Oryctolagus cuniculus Species 0.000 description 4
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 4
- RJKFOVLPORLFTN-LEKSSAKUSA-N Progesterone Chemical compound C1CC2=CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H](C(=O)C)[C@@]1(C)CC2 RJKFOVLPORLFTN-LEKSSAKUSA-N 0.000 description 4
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 4
- 101710137500 T7 RNA polymerase Proteins 0.000 description 4
- MUMGGOZAMZWBJJ-DYKIIFRCSA-N Testostosterone Chemical compound O=C1CC[C@]2(C)[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CCC2=C1 MUMGGOZAMZWBJJ-DYKIIFRCSA-N 0.000 description 4
- 108010023249 Zyxin Proteins 0.000 description 4
- 230000004913 activation Effects 0.000 description 4
- 239000008272 agar Substances 0.000 description 4
- 239000011543 agarose gel Substances 0.000 description 4
- 238000009395 breeding Methods 0.000 description 4
- 230000001488 breeding effect Effects 0.000 description 4
- 239000001506 calcium phosphate Substances 0.000 description 4
- 229910000389 calcium phosphate Inorganic materials 0.000 description 4
- 235000011010 calcium phosphates Nutrition 0.000 description 4
- 239000000969 carrier Substances 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 238000005336 cracking Methods 0.000 description 4
- 230000009089 cytolysis Effects 0.000 description 4
- 210000003104 cytoplasmic structure Anatomy 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- LTMHDMANZUZIPE-PUGKRICDSA-N digoxin Chemical compound C1[C@H](O)[C@H](O)[C@@H](C)O[C@H]1O[C@@H]1[C@@H](C)O[C@@H](O[C@@H]2[C@H](O[C@@H](O[C@@H]3C[C@@H]4[C@]([C@@H]5[C@H]([C@]6(CC[C@@H]([C@@]6(C)[C@H](O)C5)C=5COC(=O)C=5)O)CC4)(C)CC3)C[C@@H]2O)C)C[C@@H]1O LTMHDMANZUZIPE-PUGKRICDSA-N 0.000 description 4
- 229960005156 digoxin Drugs 0.000 description 4
- LTMHDMANZUZIPE-UHFFFAOYSA-N digoxine Natural products C1C(O)C(O)C(C)OC1OC1C(C)OC(OC2C(OC(OC3CC4C(C5C(C6(CCC(C6(C)C(O)C5)C=5COC(=O)C=5)O)CC4)(C)CC3)CC2O)C)CC1O LTMHDMANZUZIPE-UHFFFAOYSA-N 0.000 description 4
- 238000004821 distillation Methods 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 238000004520 electroporation Methods 0.000 description 4
- 230000008014 freezing Effects 0.000 description 4
- 229940088597 hormone Drugs 0.000 description 4
- 239000005556 hormone Substances 0.000 description 4
- 230000008676 import Effects 0.000 description 4
- 230000006872 improvement Effects 0.000 description 4
- 230000001965 increasing effect Effects 0.000 description 4
- 239000002502 liposome Substances 0.000 description 4
- 238000005259 measurement Methods 0.000 description 4
- 230000002969 morbid Effects 0.000 description 4
- 208000002865 osteopetrosis Diseases 0.000 description 4
- 230000007170 pathology Effects 0.000 description 4
- 102000005962 receptors Human genes 0.000 description 4
- 108020003175 receptors Proteins 0.000 description 4
- 238000003757 reverse transcription PCR Methods 0.000 description 4
- 108091092562 ribozyme Proteins 0.000 description 4
- 238000007789 sealing Methods 0.000 description 4
- 210000002966 serum Anatomy 0.000 description 4
- 230000019491 signal transduction Effects 0.000 description 4
- 208000011580 syndromic disease Diseases 0.000 description 4
- 210000002303 tibia Anatomy 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 4
- 241001430294 unidentified retrovirus Species 0.000 description 4
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 3
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 3
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 3
- 241000972773 Aulopiformes Species 0.000 description 3
- 108700012045 Axin Proteins 0.000 description 3
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 3
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- 241000206602 Eukaryota Species 0.000 description 3
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 3
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 3
- 108050001049 Extracellular proteins Proteins 0.000 description 3
- 102100028043 Fibroblast growth factor 3 Human genes 0.000 description 3
- 101001065272 Homo sapiens EGF-containing fibulin-like extracellular matrix protein 1 Proteins 0.000 description 3
- 101001060280 Homo sapiens Fibroblast growth factor 3 Proteins 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- 239000006142 Luria-Bertani Agar Substances 0.000 description 3
- 208000002193 Pain Diseases 0.000 description 3
- 208000006735 Periostitis Diseases 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 206010039984 Senile osteoporosis Diseases 0.000 description 3
- 238000010521 absorption reaction Methods 0.000 description 3
- 238000000246 agarose gel electrophoresis Methods 0.000 description 3
- 239000000556 agonist Substances 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 238000000137 annealing Methods 0.000 description 3
- 239000005557 antagonist Substances 0.000 description 3
- 229940098773 bovine serum albumin Drugs 0.000 description 3
- 238000004422 calculation algorithm Methods 0.000 description 3
- 235000011089 carbon dioxide Nutrition 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 230000001332 colony forming effect Effects 0.000 description 3
- 230000000052 comparative effect Effects 0.000 description 3
- 238000005520 cutting process Methods 0.000 description 3
- 230000001351 cycling effect Effects 0.000 description 3
- 210000004292 cytoskeleton Anatomy 0.000 description 3
- 230000034994 death Effects 0.000 description 3
- 239000003085 diluting agent Substances 0.000 description 3
- 210000001840 diploid cell Anatomy 0.000 description 3
- 210000002745 epiphysis Anatomy 0.000 description 3
- 230000001076 estrogenic effect Effects 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 210000002744 extracellular matrix Anatomy 0.000 description 3
- 235000019688 fish Nutrition 0.000 description 3
- 238000001502 gel electrophoresis Methods 0.000 description 3
- 210000004349 growth plate Anatomy 0.000 description 3
- 238000010438 heat treatment Methods 0.000 description 3
- 230000001976 improved effect Effects 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 238000007689 inspection Methods 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 238000002386 leaching Methods 0.000 description 3
- 238000007834 ligase chain reaction Methods 0.000 description 3
- 230000004807 localization Effects 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 230000008774 maternal effect Effects 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 210000003460 periosteum Anatomy 0.000 description 3
- 239000013600 plasmid vector Substances 0.000 description 3
- 239000004033 plastic Substances 0.000 description 3
- 229920003023 plastic Polymers 0.000 description 3
- 239000002244 precipitate Substances 0.000 description 3
- 230000002062 proliferating effect Effects 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 230000003362 replicative effect Effects 0.000 description 3
- 238000010839 reverse transcription Methods 0.000 description 3
- 235000019515 salmon Nutrition 0.000 description 3
- 238000004062 sedimentation Methods 0.000 description 3
- 238000010008 shearing Methods 0.000 description 3
- 150000003384 small molecules Chemical class 0.000 description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 239000001117 sulphuric acid Substances 0.000 description 3
- 235000011149 sulphuric acid Nutrition 0.000 description 3
- 230000001629 suppression Effects 0.000 description 3
- 108091035539 telomere Proteins 0.000 description 3
- 102000055501 telomere Human genes 0.000 description 3
- 210000003411 telomere Anatomy 0.000 description 3
- 210000001550 testis Anatomy 0.000 description 3
- 230000014621 translational initiation Effects 0.000 description 3
- 230000002792 vascular Effects 0.000 description 3
- 102000009310 vitamin D receptors Human genes 0.000 description 3
- 108050000156 vitamin D receptors Proteins 0.000 description 3
- LZKJGAQEZMGEBN-YYTBSQRUSA-N (2s)-2-amino-3-(4-hydroxyphenyl)propanoic acid;(2s)-2-aminopentanedioic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O.OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LZKJGAQEZMGEBN-YYTBSQRUSA-N 0.000 description 2
- IMIZPWSVYADSCN-UHFFFAOYSA-N 4-methyl-2-[[4-methyl-2-[[4-methyl-2-(pyrrolidine-2-carbonylamino)pentanoyl]amino]pentanoyl]amino]pentanoic acid Chemical compound CC(C)CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(CC(C)C)NC(=O)C1CCCN1 IMIZPWSVYADSCN-UHFFFAOYSA-N 0.000 description 2
- FRXSZNDVFUDTIR-UHFFFAOYSA-N 6-methoxy-1,2,3,4-tetrahydroquinoline Chemical compound N1CCCC2=CC(OC)=CC=C21 FRXSZNDVFUDTIR-UHFFFAOYSA-N 0.000 description 2
- 206010000599 Acromegaly Diseases 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 2
- 102100034163 Alpha-actinin-1 Human genes 0.000 description 2
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 2
- 102000052583 Anaphase-Promoting Complex-Cyclosome Apc8 Subunit Human genes 0.000 description 2
- 101100467538 Arabidopsis thaliana RANBPM gene Proteins 0.000 description 2
- 102000004321 Atrophin-1 Human genes 0.000 description 2
- 108090000806 Atrophin-1 Proteins 0.000 description 2
- 102000051172 Axin Human genes 0.000 description 2
- 229940122361 Bisphosphonate Drugs 0.000 description 2
- 206010065687 Bone loss Diseases 0.000 description 2
- 101100268670 Caenorhabditis elegans acc-3 gene Proteins 0.000 description 2
- 208000009903 Camurati-Engelmann Syndrome Diseases 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108010035532 Collagen Proteins 0.000 description 2
- 102000008186 Collagen Human genes 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 238000007400 DNA extraction Methods 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 101150060155 Dcc gene Proteins 0.000 description 2
- 229920002307 Dextran Polymers 0.000 description 2
- 102100033334 E3 ubiquitin-protein ligase Itchy homolog Human genes 0.000 description 2
- 102100031814 EGF-containing fibulin-like extracellular matrix protein 1 Human genes 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 2
- 102100031758 Extracellular matrix protein 1 Human genes 0.000 description 2
- 108010067306 Fibronectins Proteins 0.000 description 2
- 102000016359 Fibronectins Human genes 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- 241000287828 Gallus gallus Species 0.000 description 2
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 2
- 208000032612 Glial tumor Diseases 0.000 description 2
- 206010018338 Glioma Diseases 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 2
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 2
- 206010020100 Hip fracture Diseases 0.000 description 2
- 101000799406 Homo sapiens Alpha-actinin-1 Proteins 0.000 description 2
- 101000912124 Homo sapiens Cell division cycle protein 23 homolog Proteins 0.000 description 2
- 101000997630 Homo sapiens E3 ubiquitin-protein ligase Itchy homolog Proteins 0.000 description 2
- 101000920078 Homo sapiens Elongation factor 1-alpha 1 Proteins 0.000 description 2
- 101000866526 Homo sapiens Extracellular matrix protein 1 Proteins 0.000 description 2
- 101001041145 Homo sapiens Homeobox protein Hox-B13 Proteins 0.000 description 2
- 206010020590 Hypercalciuria Diseases 0.000 description 2
- 206010020707 Hyperparathyroidism primary Diseases 0.000 description 2
- 208000000038 Hypoparathyroidism Diseases 0.000 description 2
- 102000007330 LDL Lipoproteins Human genes 0.000 description 2
- 108010007622 LDL Lipoproteins Proteins 0.000 description 2
- 244000178870 Lavandula angustifolia Species 0.000 description 2
- 235000010663 Lavandula angustifolia Nutrition 0.000 description 2
- 102000011965 Lipoprotein Receptors Human genes 0.000 description 2
- 108010061306 Lipoprotein Receptors Proteins 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 208000037196 Medullary thyroid carcinoma Diseases 0.000 description 2
- 108091092878 Microsatellite Proteins 0.000 description 2
- WGZDBVOTUVNQFP-UHFFFAOYSA-N N-(1-phthalazinylamino)carbamic acid ethyl ester Chemical group C1=CC=C2C(NNC(=O)OCC)=NN=CC2=C1 WGZDBVOTUVNQFP-UHFFFAOYSA-N 0.000 description 2
- WXOMTJVVIMOXJL-BOBFKVMVSA-A O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)OS(=O)(=O)OC[C@H]1O[C@@H](O[C@]2(COS(=O)(=O)O[Al](O)O)O[C@H](OS(=O)(=O)O[Al](O)O)[C@@H](OS(=O)(=O)O[Al](O)O)[C@@H]2OS(=O)(=O)O[Al](O)O)[C@H](OS(=O)(=O)O[Al](O)O)[C@@H](OS(=O)(=O)O[Al](O)O)[C@@H]1OS(=O)(=O)O[Al](O)O Chemical compound O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)OS(=O)(=O)OC[C@H]1O[C@@H](O[C@]2(COS(=O)(=O)O[Al](O)O)O[C@H](OS(=O)(=O)O[Al](O)O)[C@@H](OS(=O)(=O)O[Al](O)O)[C@@H]2OS(=O)(=O)O[Al](O)O)[C@H](OS(=O)(=O)O[Al](O)O)[C@@H](OS(=O)(=O)O[Al](O)O)[C@@H]1OS(=O)(=O)O[Al](O)O WXOMTJVVIMOXJL-BOBFKVMVSA-A 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 206010031243 Osteogenesis imperfecta Diseases 0.000 description 2
- 241000286209 Phasianidae Species 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 2
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 2
- 201000000981 Primary Hyperparathyroidism Diseases 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- RTQKBZIRDWZLDF-BZSNNMDCSA-N Pro-Pro-Trp Chemical compound C([C@H]1C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)O)CCN1C(=O)[C@@H]1CCCN1 RTQKBZIRDWZLDF-BZSNNMDCSA-N 0.000 description 2
- 101100142497 Prochlorococcus marinus (strain SARG / CCMP1375 / SS120) rplU gene Proteins 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 102100033982 Ran-binding protein 9 Human genes 0.000 description 2
- 101000702488 Rattus norvegicus High affinity cationic amino acid transporter 1 Proteins 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 2
- 101000828148 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) Transposon Ty3-G Gag polyprotein Proteins 0.000 description 2
- 101000790437 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) Transposon Ty3-I Gag polyprotein Proteins 0.000 description 2
- 208000005770 Secondary Hyperparathyroidism Diseases 0.000 description 2
- 102100030053 Secreted frizzled-related protein 3 Human genes 0.000 description 2
- 108091058545 Secretory proteins Proteins 0.000 description 2
- 102000040739 Secretory proteins Human genes 0.000 description 2
- 108010090804 Streptavidin Proteins 0.000 description 2
- NKANXQFJJICGDU-QPLCGJKRSA-N Tamoxifen Chemical compound C=1C=CC=CC=1C(/CC)=C(C=1C=CC(OCCN(C)C)=CC=1)/C1=CC=CC=C1 NKANXQFJJICGDU-QPLCGJKRSA-N 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- 102000040945 Transcription factor Human genes 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 2
- 229930003756 Vitamin B7 Natural products 0.000 description 2
- 108010020277 WD repeat containing planar cell polarity effector Proteins 0.000 description 2
- 230000032683 aging Effects 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- FPIPGXGPPPQFEQ-OVSJKPMPSA-N all-trans-retinol Chemical compound OC\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-OVSJKPMPSA-N 0.000 description 2
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 2
- 230000000890 antigenic effect Effects 0.000 description 2
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- WQZGKKKJIJFFOK-FPRJBGLDSA-N beta-D-galactose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-FPRJBGLDSA-N 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 150000004663 bisphosphonates Chemical class 0.000 description 2
- 230000008468 bone growth Effects 0.000 description 2
- 229940036811 bone meal Drugs 0.000 description 2
- 239000002374 bone meal Substances 0.000 description 2
- 210000003557 bones of lower extremity Anatomy 0.000 description 2
- GDTBXPJZTBHREO-UHFFFAOYSA-N bromine Chemical compound BrBr GDTBXPJZTBHREO-UHFFFAOYSA-N 0.000 description 2
- 229910052794 bromium Inorganic materials 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 239000006285 cell suspension Substances 0.000 description 2
- 210000003793 centrosome Anatomy 0.000 description 2
- 238000000975 co-precipitation Methods 0.000 description 2
- 229920001436 collagen Polymers 0.000 description 2
- 230000002860 competitive effect Effects 0.000 description 2
- 238000010205 computational analysis Methods 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 239000000287 crude extract Substances 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 230000008021 deposition Effects 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 230000008034 disappearance Effects 0.000 description 2
- 238000009510 drug design Methods 0.000 description 2
- 238000001035 drying Methods 0.000 description 2
- 238000004043 dyeing Methods 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 238000005194 fractionation Methods 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 2
- 210000002216 heart Anatomy 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 210000004754 hybrid cell Anatomy 0.000 description 2
- 201000005991 hyperphosphatemia Diseases 0.000 description 2
- 210000003559 hypertrophic chondrocyte Anatomy 0.000 description 2
- 238000003384 imaging method Methods 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 230000000155 isotopic effect Effects 0.000 description 2
- 239000001102 lavandula vera Substances 0.000 description 2
- 235000018219 lavender Nutrition 0.000 description 2
- 239000003446 ligand Substances 0.000 description 2
- 238000001638 lipofection Methods 0.000 description 2
- 239000012160 loading buffer Substances 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 230000005291 magnetic effect Effects 0.000 description 2
- 230000003211 malignant effect Effects 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 230000035800 maturation Effects 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 208000023356 medullary thyroid gland carcinoma Diseases 0.000 description 2
- 230000009245 menopause Effects 0.000 description 2
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 2
- 230000000394 mitotic effect Effects 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 208000008084 monostotic fibrous dysplasia Diseases 0.000 description 2
- 210000000214 mouth Anatomy 0.000 description 2
- 210000003205 muscle Anatomy 0.000 description 2
- 238000006386 neutralization reaction Methods 0.000 description 2
- 230000003472 neutralizing effect Effects 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 210000004409 osteocyte Anatomy 0.000 description 2
- 208000005368 osteomalacia Diseases 0.000 description 2
- 206010031281 osteopoikilosis Diseases 0.000 description 2
- 201000008968 osteosarcoma Diseases 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 229940067626 phosphatidylinositols Drugs 0.000 description 2
- 150000003905 phosphatidylinositols Chemical class 0.000 description 2
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 2
- 208000001061 polyostotic fibrous dysplasia Diseases 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- SCVFZCLFOSHCOH-UHFFFAOYSA-M potassium acetate Chemical compound [K+].CC([O-])=O SCVFZCLFOSHCOH-UHFFFAOYSA-M 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 239000000186 progesterone Substances 0.000 description 2
- 229960003387 progesterone Drugs 0.000 description 2
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 2
- 201000001444 pseudopseudohypoparathyroidism Diseases 0.000 description 2
- 201000010108 pycnodysostosis Diseases 0.000 description 2
- 239000011535 reaction buffer Substances 0.000 description 2
- 230000001177 retroviral effect Effects 0.000 description 2
- 230000000630 rising effect Effects 0.000 description 2
- FGDZQCVHDSGLHJ-UHFFFAOYSA-M rubidium chloride Chemical compound [Cl-].[Rb+] FGDZQCVHDSGLHJ-UHFFFAOYSA-M 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 230000003068 static effect Effects 0.000 description 2
- 238000007619 statistical method Methods 0.000 description 2
- 229960003604 testosterone Drugs 0.000 description 2
- 238000010257 thawing Methods 0.000 description 2
- 208000013818 thyroid gland medullary carcinoma Diseases 0.000 description 2
- 230000014616 translation Effects 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 238000010396 two-hybrid screening Methods 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 208000012991 uterine carcinoma Diseases 0.000 description 2
- 235000011912 vitamin B7 Nutrition 0.000 description 2
- 239000011735 vitamin B7 Substances 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- NWXMGUDVXFXRIG-WESIUVDSSA-N (4s,4as,5as,6s,12ar)-4-(dimethylamino)-1,6,10,11,12a-pentahydroxy-6-methyl-3,12-dioxo-4,4a,5,5a-tetrahydrotetracene-2-carboxamide Chemical compound C1=CC=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(=O)C(C(N)=O)=C(O)[C@@]4(O)C(=O)C3=C(O)C2=C1O NWXMGUDVXFXRIG-WESIUVDSSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- ZEDPQIJYJCPIRM-UHFFFAOYSA-N 2,3-dimethylbenzonitrile Chemical compound CC1=CC=CC(C#N)=C1C ZEDPQIJYJCPIRM-UHFFFAOYSA-N 0.000 description 1
- GTYWSWPCXBOSSY-UHFFFAOYSA-N 2-chloro-1H-indole phosphoric acid Chemical compound OP(O)(O)=O.Clc1cc2ccccc2[nH]1 GTYWSWPCXBOSSY-UHFFFAOYSA-N 0.000 description 1
- PJISLFCKHOHLLP-UHFFFAOYSA-N 2-diethoxyphosphorylsulfanyl-n,n-diethylethanamine Chemical compound CCOP(=O)(OCC)SCCN(CC)CC PJISLFCKHOHLLP-UHFFFAOYSA-N 0.000 description 1
- QCVGEOXPDFCNHA-UHFFFAOYSA-N 5,5-dimethyl-2,4-dioxo-1,3-oxazolidine-3-carboxamide Chemical compound CC1(C)OC(=O)N(C(N)=O)C1=O QCVGEOXPDFCNHA-UHFFFAOYSA-N 0.000 description 1
- 101150039504 6 gene Proteins 0.000 description 1
- ZCYVEMRRCGMTRW-UHFFFAOYSA-N 7553-56-2 Chemical compound [I] ZCYVEMRRCGMTRW-UHFFFAOYSA-N 0.000 description 1
- 102000015936 AP-1 transcription factor Human genes 0.000 description 1
- 108050004195 AP-1 transcription factor Proteins 0.000 description 1
- 240000006487 Aciphylla squarrosa Species 0.000 description 1
- 206010001497 Agitation Diseases 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 108091093088 Amplicon Proteins 0.000 description 1
- 241000272525 Anas platyrhynchos Species 0.000 description 1
- 208000019901 Anxiety disease Diseases 0.000 description 1
- 108010039627 Aprotinin Proteins 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- MMGCRPZQZWTZTA-IHRRRGAJSA-N Arg-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N MMGCRPZQZWTZTA-IHRRRGAJSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- 102000003916 Arrestin Human genes 0.000 description 1
- 108090000328 Arrestin Proteins 0.000 description 1
- 206010003210 Arteriosclerosis Diseases 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- FIAKNCXQFFKSSI-ZLUOBGJFSA-N Asp-Ser-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O FIAKNCXQFFKSSI-ZLUOBGJFSA-N 0.000 description 1
- 241000713842 Avian sarcoma virus Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 208000008035 Back Pain Diseases 0.000 description 1
- 102000015735 Beta-catenin Human genes 0.000 description 1
- 108060000903 Beta-catenin Proteins 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- 108010017384 Blood Proteins Proteins 0.000 description 1
- 206010005949 Bone cancer Diseases 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 241000829192 Bos taurus polyomavirus 1 Species 0.000 description 1
- 241000701822 Bovine papillomavirus Species 0.000 description 1
- 238000009010 Bradford assay Methods 0.000 description 1
- 208000003174 Brain Neoplasms Diseases 0.000 description 1
- 101150010856 CRT gene Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 102000005701 Calcium-Binding Proteins Human genes 0.000 description 1
- 108010045403 Calcium-Binding Proteins Proteins 0.000 description 1
- 108090000565 Capsid Proteins Proteins 0.000 description 1
- 101710099573 Casein kinase II subunit alpha Proteins 0.000 description 1
- 101710159482 Casein kinase II subunit alpha' Proteins 0.000 description 1
- 102100023321 Ceruloplasmin Human genes 0.000 description 1
- 208000017667 Chronic Disease Diseases 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 208000024779 Comminuted Fractures Diseases 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-UWTATZPHSA-N D-alanine Chemical compound C[C@@H](N)C(O)=O QNAYBMKLOCPYGJ-UWTATZPHSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-UHFFFAOYSA-N D-alpha-Ala Natural products CC([NH3+])C([O-])=O QNAYBMKLOCPYGJ-UHFFFAOYSA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 101100083446 Danio rerio plekhh1 gene Proteins 0.000 description 1
- 241001269238 Data Species 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 description 1
- 102100024746 Dihydrofolate reductase Human genes 0.000 description 1
- 108700003861 Dominant Genes Proteins 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 102100031780 Endonuclease Human genes 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 244000148064 Enicostema verticillatum Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 108700039887 Essential Genes Proteins 0.000 description 1
- 206010015548 Euthanasia Diseases 0.000 description 1
- 208000034454 F12-related hereditary angioedema with normal C1Inh Diseases 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 102000019432 Galanin Human genes 0.000 description 1
- 101800002068 Galanin Proteins 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- 230000010558 Gene Alterations Effects 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- OWOFCNWTMWOOJJ-WDSKDSINSA-N Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OWOFCNWTMWOOJJ-WDSKDSINSA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 229920002527 Glycogen Polymers 0.000 description 1
- 102000002254 Glycogen Synthase Kinase 3 Human genes 0.000 description 1
- 108010014905 Glycogen Synthase Kinase 3 Proteins 0.000 description 1
- 208000031886 HIV Infections Diseases 0.000 description 1
- 229940122440 HIV protease inhibitor Drugs 0.000 description 1
- 208000028782 Hereditary disease Diseases 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 102100021088 Homeobox protein Hox-B13 Human genes 0.000 description 1
- 102000009331 Homeodomain Proteins Human genes 0.000 description 1
- 108010048671 Homeodomain Proteins Proteins 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101001039199 Homo sapiens Low-density lipoprotein receptor-related protein 6 Proteins 0.000 description 1
- 208000037147 Hypercalcaemia Diseases 0.000 description 1
- 206010020850 Hyperthyroidism Diseases 0.000 description 1
- 206010058359 Hypogonadism Diseases 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- CCYGNFBYUNHFSC-MGHWNKPDSA-N Ile-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CCYGNFBYUNHFSC-MGHWNKPDSA-N 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 244000283207 Indigofera tinctoria Species 0.000 description 1
- 108090000723 Insulin-Like Growth Factor I Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 206010023204 Joint dislocation Diseases 0.000 description 1
- 244000285963 Kluyveromyces fragilis Species 0.000 description 1
- 235000014663 Kluyveromyces fragilis Nutrition 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 102100040704 Low-density lipoprotein receptor-related protein 6 Human genes 0.000 description 1
- 102000004857 Lymphoid enhancer-binding factor 1 Human genes 0.000 description 1
- 108090001093 Lymphoid enhancer-binding factor 1 Proteins 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- 206010025476 Malabsorption Diseases 0.000 description 1
- 208000004155 Malabsorption Syndromes Diseases 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 208000030136 Marchiafava-Bignami Disease Diseases 0.000 description 1
- 208000024556 Mendelian disease Diseases 0.000 description 1
- 206010027336 Menstruation delayed Diseases 0.000 description 1
- NTYQUVLERIHPMU-HRCADAONSA-N Met-Phe-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N NTYQUVLERIHPMU-HRCADAONSA-N 0.000 description 1
- 208000029725 Metabolic bone disease Diseases 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 206010027626 Milia Diseases 0.000 description 1
- 108020005196 Mitochondrial DNA Proteins 0.000 description 1
- 101150118570 Msx2 gene Proteins 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- 101100108487 Mus musculus Ajuba gene Proteins 0.000 description 1
- 101100344029 Mus musculus Lrp5 gene Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 102000003505 Myosin Human genes 0.000 description 1
- 108060008487 Myosin Proteins 0.000 description 1
- 108010064696 N,O-diacetylmuramidase Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 108091005461 Nucleic proteins Proteins 0.000 description 1
- 102100026073 Oligodendrocyte transcription factor 1 Human genes 0.000 description 1
- 101710195940 Oligodendrocyte transcription factor 1 Proteins 0.000 description 1
- 108010058846 Ovalbumin Proteins 0.000 description 1
- 229930040373 Paraformaldehyde Natural products 0.000 description 1
- 108010079855 Peptide Aptamers Proteins 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 1
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 1
- 108010065081 Phosphorylase b Proteins 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 1
- 108010039918 Polylysine Proteins 0.000 description 1
- 241001505332 Polyomavirus sp. Species 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 108020005067 RNA Splice Sites Proteins 0.000 description 1
- 108091008103 RNA aptamers Proteins 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 102000004879 Racemases and epimerases Human genes 0.000 description 1
- 108090001066 Racemases and epimerases Proteins 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 206010039203 Road traffic accident Diseases 0.000 description 1
- BFDMCHRDSYTOLE-UHFFFAOYSA-N SC#N.NC(N)=N.ClC(Cl)Cl.OC1=CC=CC=C1 Chemical compound SC#N.NC(N)=N.ClC(Cl)Cl.OC1=CC=CC=C1 BFDMCHRDSYTOLE-UHFFFAOYSA-N 0.000 description 1
- CWHJIJJSDGEHNS-MYLFLSLOSA-N Senegenin Chemical compound C1[C@H](O)[C@H](O)[C@@](C)(C(O)=O)[C@@H]2CC[C@@]3(C)C(CC[C@]4(CCC(C[C@H]44)(C)C)C(O)=O)=C4[C@@H](CCl)C[C@@H]3[C@]21C CWHJIJJSDGEHNS-MYLFLSLOSA-N 0.000 description 1
- 229920005654 Sephadex Polymers 0.000 description 1
- 239000012507 Sephadex™ Substances 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- 241000872198 Serjania polyphylla Species 0.000 description 1
- 241000854711 Shinkai Species 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 102000013275 Somatomedins Human genes 0.000 description 1
- 241000272534 Struthio camelus Species 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- 108700025695 Suppressor Genes Proteins 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 102000002248 Thyroxine-Binding Globulin Human genes 0.000 description 1
- 108010000259 Thyroxine-Binding Globulin Proteins 0.000 description 1
- 102000009488 Thyroxine-Binding Proteins Human genes 0.000 description 1
- 108010048889 Thyroxine-Binding Proteins Proteins 0.000 description 1
- 229940122618 Trypsin inhibitor Drugs 0.000 description 1
- 101710162629 Trypsin inhibitor Proteins 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- 108700005077 Viral Genes Proteins 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- YVNQAIFQFWTPLQ-UHFFFAOYSA-O [4-[[4-(4-ethoxyanilino)phenyl]-[4-[ethyl-[(3-sulfophenyl)methyl]amino]-2-methylphenyl]methylidene]-3-methylcyclohexa-2,5-dien-1-ylidene]-ethyl-[(3-sulfophenyl)methyl]azanium Chemical compound C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C(=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S(O)(=O)=O)C)C=2C(=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S(O)(=O)=O)C)C=C1 YVNQAIFQFWTPLQ-UHFFFAOYSA-O 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 208000037919 acquired disease Diseases 0.000 description 1
- 230000001464 adherent effect Effects 0.000 description 1
- 239000002390 adhesive tape Substances 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- DCSBSVSZJRSITC-UHFFFAOYSA-M alendronate sodium trihydrate Chemical compound O.O.O.[Na+].NCCCC(O)(P(O)(O)=O)P(O)([O-])=O DCSBSVSZJRSITC-UHFFFAOYSA-M 0.000 description 1
- 239000011717 all-trans-retinol Substances 0.000 description 1
- 235000019169 all-trans-retinol Nutrition 0.000 description 1
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 238000012197 amplification kit Methods 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 239000002246 antineoplastic agent Substances 0.000 description 1
- 210000000709 aorta Anatomy 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 229960004405 aprotinin Drugs 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 208000011775 arteriosclerosis disease Diseases 0.000 description 1
- 238000000889 atomisation Methods 0.000 description 1
- 238000000376 autoradiography Methods 0.000 description 1
- 208000019804 backache Diseases 0.000 description 1
- 108010028263 bacteriophage T3 RNA polymerase Proteins 0.000 description 1
- 241001609931 bacterium 20 Species 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000000035 biogenic effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000018678 bone mineralization Effects 0.000 description 1
- 230000010072 bone remodeling Effects 0.000 description 1
- UDSAIICHUKSCKT-UHFFFAOYSA-N bromophenol blue Chemical compound C1=C(Br)C(O)=C(Br)C=C1C1(C=2C=C(Br)C(O)=C(Br)C=2)C2=CC=CC=C2S(=O)(=O)O1 UDSAIICHUKSCKT-UHFFFAOYSA-N 0.000 description 1
- 230000003139 buffering effect Effects 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000021164 cell adhesion Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000008004 cell lysis buffer Substances 0.000 description 1
- 230000009087 cell motility Effects 0.000 description 1
- 229920002301 cellulose acetate Polymers 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000000739 chaotic effect Effects 0.000 description 1
- 238000012412 chemical coupling Methods 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 238000005660 chlorination reaction Methods 0.000 description 1
- 229940032122 claris Drugs 0.000 description 1
- 238000000749 co-immunoprecipitation Methods 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 230000001143 conditioned effect Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000000280 densification Methods 0.000 description 1
- 238000000432 density-gradient centrifugation Methods 0.000 description 1
- 210000004268 dentin Anatomy 0.000 description 1
- RPLCPCMSCLEKRS-BPIQYHPVSA-N desogestrel Chemical compound C1CC[C@@H]2[C@H]3C(=C)C[C@](CC)([C@](CC4)(O)C#C)[C@@H]4[C@@H]3CCC2=C1 RPLCPCMSCLEKRS-BPIQYHPVSA-N 0.000 description 1
- 229960004976 desogestrel Drugs 0.000 description 1
- 230000001066 destructive effect Effects 0.000 description 1
- 238000002050 diffraction method Methods 0.000 description 1
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 description 1
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 1
- 108020001096 dihydrofolate reductase Proteins 0.000 description 1
- KAKKHKRHCKCAGH-UHFFFAOYSA-L disodium;(4-nitrophenyl) phosphate;hexahydrate Chemical compound O.O.O.O.O.O.[Na+].[Na+].[O-][N+](=O)C1=CC=C(OP([O-])([O-])=O)C=C1 KAKKHKRHCKCAGH-UHFFFAOYSA-L 0.000 description 1
- UKWLRLAKGMZXJC-QIECWBMSSA-L disodium;[4-chloro-3-[(3r,5s)-1-chloro-3'-methoxyspiro[adamantane-4,4'-dioxetane]-3'-yl]phenyl] phosphate Chemical compound [Na+].[Na+].O1OC2([C@@H]3CC4C[C@H]2CC(Cl)(C4)C3)C1(OC)C1=CC(OP([O-])([O-])=O)=CC=C1Cl UKWLRLAKGMZXJC-QIECWBMSSA-L 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000007878 drug screening assay Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 235000014103 egg white Nutrition 0.000 description 1
- 210000000969 egg white Anatomy 0.000 description 1
- 235000013601 eggs Nutrition 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 210000001163 endosome Anatomy 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000009585 enzyme analysis Methods 0.000 description 1
- 210000003743 erythrocyte Anatomy 0.000 description 1
- 239000000262 estrogen Substances 0.000 description 1
- 239000000328 estrogen antagonist Substances 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000009123 feedback regulation Effects 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 210000000245 forearm Anatomy 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 210000000232 gallbladder Anatomy 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 238000012252 genetic analysis Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 230000004034 genetic regulation Effects 0.000 description 1
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 150000002333 glycines Chemical class 0.000 description 1
- 229940096919 glycogen Drugs 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 230000036449 good health Effects 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 230000005484 gravity Effects 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 210000003780 hair follicle Anatomy 0.000 description 1
- 230000012447 hatching Effects 0.000 description 1
- 208000014951 hematologic disease Diseases 0.000 description 1
- 230000011132 hemopoiesis Effects 0.000 description 1
- 208000016861 hereditary angioedema type 3 Diseases 0.000 description 1
- 235000009200 high fat diet Nutrition 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 239000012145 high-salt buffer Substances 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 239000004030 hiv protease inhibitor Substances 0.000 description 1
- 108091008039 hormone receptors Proteins 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 230000000148 hypercalcaemia Effects 0.000 description 1
- 208000030915 hypercalcemia disease Diseases 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 238000003119 immunoblot Methods 0.000 description 1
- 238000012151 immunohistochemical method Methods 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 239000012678 infectious agent Substances 0.000 description 1
- ZPNFWUPYTFPOJU-LPYSRVMUSA-N iniprol Chemical compound C([C@H]1C(=O)NCC(=O)NCC(=O)N[C@H]2CSSC[C@H]3C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC=4C=CC=CC=4)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC=4C=CC=CC=4)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC2=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]2N(CCC2)C(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N2[C@@H](CCC2)C(=O)N2[C@@H](CCC2)C(=O)N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N2[C@@H](CCC2)C(=O)N3)C(=O)NCC(=O)NCC(=O)N[C@@H](C)C(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@H](C(=O)N1)C(C)C)[C@@H](C)O)[C@@H](C)CC)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 ZPNFWUPYTFPOJU-LPYSRVMUSA-N 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000003426 interchromosomal effect Effects 0.000 description 1
- 230000016507 interphase Effects 0.000 description 1
- 238000010253 intravenous injection Methods 0.000 description 1
- 238000007852 inverse PCR Methods 0.000 description 1
- 239000011630 iodine Substances 0.000 description 1
- 229910052740 iodine Inorganic materials 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 101150109249 lacI gene Proteins 0.000 description 1
- 101150066555 lacZ gene Proteins 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 230000031700 light absorption Effects 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 230000033001 locomotion Effects 0.000 description 1
- 239000006210 lotion Substances 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 238000003754 machining Methods 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- 230000010534 mechanism of action Effects 0.000 description 1
- 230000005906 menstruation Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- SLZIZIJTGAYEKK-CIJSCKBQSA-N molport-023-220-247 Chemical compound C([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)CN)[C@@H](C)O)C1=CNC=N1 SLZIZIJTGAYEKK-CIJSCKBQSA-N 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- JPXMTWWFLBLUCD-UHFFFAOYSA-N nitro blue tetrazolium(2+) Chemical compound COC1=CC(C=2C=C(OC)C(=CC=2)[N+]=2N(N=C(N=2)C=2C=CC=CC=2)C=2C=CC(=CC=2)[N+]([O-])=O)=CC=C1[N+]1=NC(C=2C=CC=CC=2)=NN1C1=CC=C([N+]([O-])=O)C=C1 JPXMTWWFLBLUCD-UHFFFAOYSA-N 0.000 description 1
- 230000000474 nursing effect Effects 0.000 description 1
- 230000035764 nutrition Effects 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 230000000050 nutritive effect Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000000963 osteoblast Anatomy 0.000 description 1
- 229940092253 ovalbumin Drugs 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 230000008533 pain sensitivity Effects 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 239000012188 paraffin wax Substances 0.000 description 1
- 229920002866 paraformaldehyde Polymers 0.000 description 1
- 230000005298 paramagnetic effect Effects 0.000 description 1
- 238000002161 passivation Methods 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 239000002831 pharmacologic agent Substances 0.000 description 1
- NTGBUUXKGAZMSE-UHFFFAOYSA-N phenyl n-[4-[4-(4-methoxyphenyl)piperazin-1-yl]phenyl]carbamate Chemical compound C1=CC(OC)=CC=C1N1CCN(C=2C=CC(NC(=O)OC=3C=CC=CC=3)=CC=2)CC1 NTGBUUXKGAZMSE-UHFFFAOYSA-N 0.000 description 1
- 150000008300 phosphoramidites Chemical class 0.000 description 1
- 108091005981 phosphorylated proteins Proteins 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 238000013492 plasmid preparation Methods 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 230000003234 polygenic effect Effects 0.000 description 1
- 229920000656 polylysine Polymers 0.000 description 1
- 229920006327 polystyrene foam Polymers 0.000 description 1
- 230000008092 positive effect Effects 0.000 description 1
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 230000001376 precipitating effect Effects 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 101150031139 pth gene Proteins 0.000 description 1
- 239000012264 purified product Substances 0.000 description 1
- GZUITABIAKMVPG-UHFFFAOYSA-N raloxifene Chemical compound C1=CC(O)=CC=C1C1=C(C(=O)C=2C=CC(OCCN3CCCCC3)=CC=2)C2=CC=C(O)C=C2S1 GZUITABIAKMVPG-UHFFFAOYSA-N 0.000 description 1
- 229960004622 raloxifene Drugs 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000000979 retarding effect Effects 0.000 description 1
- 208000007442 rickets Diseases 0.000 description 1
- 229940102127 rubidium chloride Drugs 0.000 description 1
- 229920006298 saran Polymers 0.000 description 1
- 238000005204 segregation Methods 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 210000003765 sex chromosome Anatomy 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- 239000000741 silica gel Substances 0.000 description 1
- 229910002027 silica gel Inorganic materials 0.000 description 1
- 229960001866 silicon dioxide Drugs 0.000 description 1
- AWUCVROLDVIAJX-GSVOUGTGSA-N sn-glycerol 3-phosphate Chemical compound OC[C@@H](O)COP(O)(O)=O AWUCVROLDVIAJX-GSVOUGTGSA-N 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 210000004872 soft tissue Anatomy 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 210000004988 splenocyte Anatomy 0.000 description 1
- 238000013223 sprague-dawley female rat Methods 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 238000003892 spreading Methods 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 230000035882 stress Effects 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- IHCDKJZZFOUARO-UHFFFAOYSA-M sulfacetamide sodium Chemical compound O.[Na+].CC(=O)[N-]S(=O)(=O)C1=CC=C(N)C=C1 IHCDKJZZFOUARO-UHFFFAOYSA-M 0.000 description 1
- 239000011593 sulfur Substances 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 229960001603 tamoxifen Drugs 0.000 description 1
- 239000009871 tenuigenin Substances 0.000 description 1
- XUIIKFGFIJCVMT-UHFFFAOYSA-N thyroxine-binding globulin Natural products IC1=CC(CC([NH3+])C([O-])=O)=CC(I)=C1OC1=CC(I)=C(O)C(I)=C1 XUIIKFGFIJCVMT-UHFFFAOYSA-N 0.000 description 1
- 231100000167 toxic agent Toxicity 0.000 description 1
- 239000003440 toxic substance Substances 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 150000005691 triesters Chemical class 0.000 description 1
- PIEPQKCYPFFYMG-UHFFFAOYSA-N tris acetate Chemical compound CC(O)=O.OCC(N)(CO)CO PIEPQKCYPFFYMG-UHFFFAOYSA-N 0.000 description 1
- 239000002753 trypsin inhibitor Substances 0.000 description 1
- 108010046845 tryptones Proteins 0.000 description 1
- 238000000539 two dimensional gel electrophoresis Methods 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 238000003805 vibration mixing Methods 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 235000019166 vitamin D Nutrition 0.000 description 1
- 239000011710 vitamin D Substances 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
- 230000003313 weakening effect Effects 0.000 description 1
- 238000005303 weighing Methods 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P1/00—Drugs for disorders of the alimentary tract or the digestive system
- A61P1/14—Prodigestives, e.g. acids, enzymes, appetite stimulants, antidyspeptics, tonics, antiflatulents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P15/00—Drugs for genital or sexual disorders; Contraceptives
- A61P15/08—Drugs for genital or sexual disorders; Contraceptives for gonadal disorders or for enhancing fertility, e.g. inducers of ovulation or of spermatogenesis
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P19/00—Drugs for skeletal disorders
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P19/00—Drugs for skeletal disorders
- A61P19/08—Drugs for skeletal disorders for bone diseases, e.g. rachitism, Paget's disease
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P19/00—Drugs for skeletal disorders
- A61P19/08—Drugs for skeletal disorders for bone diseases, e.g. rachitism, Paget's disease
- A61P19/10—Drugs for skeletal disorders for bone diseases, e.g. rachitism, Paget's disease for osteoporosis
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
- A61P25/30—Drugs for disorders of the nervous system for treating abuse or dependence
- A61P25/32—Alcohol-abuse
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/12—Drugs for disorders of the metabolism for electrolyte homeostasis
- A61P3/14—Drugs for disorders of the metabolism for electrolyte homeostasis for calcium homeostasis
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P5/00—Drugs for disorders of the endocrine system
- A61P5/14—Drugs for disorders of the endocrine system of the thyroid hormones, e.g. T3, T4
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P7/00—Drugs for disorders of the blood or the extracellular fluid
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/775—Apolipopeptides
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/05—Animals comprising random inserted nucleic acids (transgenic)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/505—Medicinal preparations containing antigens or antibodies comprising antibodies
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Pharmacology & Pharmacy (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Engineering & Computer Science (AREA)
- Physical Education & Sports Medicine (AREA)
- Rheumatology (AREA)
- Gastroenterology & Hepatology (AREA)
- Zoology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Molecular Biology (AREA)
- Genetics & Genomics (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Diabetes (AREA)
- Endocrinology (AREA)
- Orthopedic Medicine & Surgery (AREA)
- Toxicology (AREA)
- Addiction (AREA)
- Reproductive Health (AREA)
- Hematology (AREA)
- Neurology (AREA)
- Psychiatry (AREA)
- Obesity (AREA)
- Nutrition Science (AREA)
- Neurosurgery (AREA)
- Pregnancy & Childbirth (AREA)
- Biomedical Technology (AREA)
- Gynecology & Obstetrics (AREA)
- Cell Biology (AREA)
Abstract
本发明涉及用于分离和检测一种高骨量基因和相应的野生型基因及其突变体的方法和材料。本发明也涉及高骨量基因、相应的野生型基因及其突变体。本发明中鉴定的基因参与骨发育和粘着斑信号传递。本发明也提供核酸,包括编码序列、寡核苷酸引物和探针,蛋白质,克隆载体,表达载体,转化宿主,开发药物组合物的方法,鉴定与骨发育有关分子的方法,和诊断与治疗与骨发育有关疾病的方法。在优选实施方案中,本发明涉及治疗、诊断和预防骨质疏松症的方法。
Description
发明领域
本发明一般地涉及遗传学、基因组学和分子生物学领域。更具体地,本发明涉及用于分离、检测和测序一种高骨量基因和相应的野生型基因及其突变体的方法和材料。本发明也涉及高骨量基因、相应的野生型基因及其突变体。本发明中鉴定的基因与骨发育本身及其生理学相关。本发明也提供核酸,蛋白质,克隆载体,表达载体,转化宿主,开发药物组合物的方法,鉴定与骨发育有关分子的方法,和诊断与治疗与骨发育有关疾病的方法。在优选实施方案中,本发明定位于治疗、诊断、预防和筛选骨骼的正常和异常状况,包括代谢性骨疾病如骨质疏松症的方法。
发明背景
最常见的骨质疏松类型中的两种是绝经后和老年骨质疏松症。骨质疏松同时影响男性和女性,并与其他骨骼异常一起使衰老人口的健康风险持续增加。最常见的骨质疏松类型与绝经相关。在月经停止后3-6年内,大多数女性丢失了骨小梁室中20-60%的骨量。这种快速丢失一般与骨重吸收和形成的增加相关,然而,重吸收循环更占优势,因此结果是骨量的净丢失。在绝经期女性中骨质疏松是常见且严重的疾病。仅在美国估计就有2千5百万女性受这种疾病的困扰。骨质疏松的结果对个人有害,而且由于它是慢性病且需要对疾病后遗症广泛及长期的支持(住院、护理及家庭护理),同时也造成大量的经济损失,对于年龄较大的患者尤其是这样。另外,尽管一般不认为骨质疏松是威胁生命的疾病,但在老年女性中20-30%的死亡率与髋骨骨折有关,这种高百分比的死亡率可能与绝经后骨质疏松直接相关。
最易受绝经后骨质疏松影响的骨组织是骨小梁。这种组织经常被称为骨松质,且在关节附近的骨末端处和脊椎骨中分布特别集中。小梁组织的特征是相互连接以及与组成外表面和骨干中心的更坚固和致密的皮质组织相连接的小结构。这种交错的小梁网络对外部皮质结构给予侧向支持,并且这对整体组织的生物机械强度很关键。在绝经后骨质疏松中,主要是净重吸收和小梁丢失导致骨衰弱和骨折。考虑到绝经后女性的小梁丢失,最常见的骨折与高度依赖小梁支持的骨有关,例如脊椎骨、股骨颈、和前臂,这不使人惊奇。事实上,髋骨骨折、Colle’s骨折和椎骨粉碎性骨折是绝经后骨质疏松的指征。
常规接受的治疗绝经后骨质疏松的最早方法之一是雌激素替代疗法。尽管这种疗法经常成功,但患者依从性低,这主要是因为长期雌激素治疗的不希望有的副作用。雌激素替代疗法常见的副作用包括月经重现、虚胖、抑郁和对乳腺癌和子宫癌的担心。为限制未接受子宫切除术女性的已知子宫癌威胁,经常使用雌激素和孕酮周期疗法的方案。这种方案与用于节育的方法相似,且经常由于孕酮的副作用特征而使女性不耐受。最近已发现,原本开发用于治疗乳腺癌的某些抗雌激素在绝经后骨质疏松实验模型中具有疗效。其中有雷洛昔芬(见美国专利5,393,763,和Black et al,J.Clin.Invest.,93:63-69(1994))。除此之外,他莫西芬,乳腺癌治疗广泛使用的临床药物,已显示对于患乳腺癌的绝经后女性可以增加骨矿物质密度(Love etal,N.Engl.J.Med.,326:852-856(1992))。
另一种治疗绝经后骨质疏松的疗法是使用降钙素。降钙素是天然存在的肽,它抑制骨重吸收并在许多国家已被允许使用(Overgaardet al,Br.Med.J.,305:556-561(1992)),然而降钙素的使用有一些限制。其增加骨矿密度的作用非常有限且治疗非常昂贵。另一种治疗绝经后骨质疏松的疗法是使用双膦酸酯。这些化合物原本开发用于佩吉特病和恶性高钙血症,已显示出它们可抑制骨重吸收。阿仑特罗,这类化合物中的一种,已被允许用于治疗绝经后骨质疏松。这些试剂可能有助于骨质疏松治疗,但这些试剂也有潜在的出现以下状况的倾向,包括软骨病、极长的骨半衰期(超过2年)以及可能的“冻骨综合征frozen bone syndrome)”,例如正常的骨重建停止。
老年骨质疏松与绝经后骨质疏松相似,其标志是骨矿密度丢失及骨折率、患病率和相关死亡率的随之增加。这一般出现于生命晚期,即70岁以后。从历史观点看,老年骨质疏松在女性中更常见,但随着更多老年男性人口的出现,这种疾病对两种性别的健康都成为主要因素。激素如睾酮或雌激素对此疾病有何作用还不清楚(如果有的话),且其病因仍不清楚。这种疾病的治疗不是非常满意。激素疗法,女性中雌激素和男性中睾酮,显示不明确的结果;降钙素和双膦酸酯可能有些用处。
成熟骨骼的峰值骨量在很大程度上受遗传控制。孪生研究已表明,成年同卵双胞胎间的骨量差异小于异卵双胞胎(Slemenda et al,J.Bone Miner.Res.,6:561-567(1991);Young et al,J.Bone Miner.Res.,6:561-567(1995);Pocock et al,J.Clin.Invest.,80:706-710(1987);Kelly et al,J.Bone Miner.Res.,8:11-17(1993)),并已估计60%或更多的骨量差异来自遗传(Krall et al,J.Bone Miner.Res.,10:S367(1993))。峰值骨量是老年骨量的影响最大的决定因素(Hui et al,Ann.Int.Med.,111:355-361(1989)),尽管成年和老年人中年龄相关的骨丢失速度也是一个重要的决定因素(Hui etal,Osteoporosis Int.,1:30-34(1995))。因为骨量是骨折风险的主要可测量决定因素,遗传造成的成熟峰值骨量是老年时个体骨折风险的重要决定因素。因而,骨量的遗传基础研究是骨质疏松导致骨折的病因学中值得关注的问题。
近来,在骨质疏松领域对峰值骨量的遗传调控引起强烈关注。这主要集中于具有适当的多态性以供测定与正常范围内骨量差异相关性的候选基因,或者集中于骨质疏松患者中发现的与低骨量相关的基因及基因座检查。在此工作中,维生素D受体基因座(VDR)(Morrison etal,Nature,367:284-287(1994))、PTH基因(Howard et al,J.Clin.Endocrinol.Metab.,80:2800-2805(1995);Johnson et al,J.BoneMiner.Res.,8:11-17(1995);Gong et al,J.Bone Miner.Res.,10:S462(1995))和雌激素受体基因(Hosoi et al,J.Bone Miner.Res.,10:S170(1995);Morrison et al,Nature,367:284-287(1994))最突出。这些研究较困难,因为骨量(表型)是连续、定量、多基因的性状,并受环境因素的影响,如营养、共存疾病、年龄、身体活动及其它因素。因而这种研究设计要求大量受试者。具体来说,现有的VDR研究结果混乱且相互矛盾(Garnero et al,J.Bone Miner.Res.,10:1283-1288(1995);Eisman et al,J.Bone Miner.Res.,10:1289-1293(1995);Peacock,J.Bone Miner.Res.,10:1294-1297(1995))。进一步说,这些工作远远没有阐明清楚遗传对骨量影响的机理。
众所周知遗传对峰值骨量的影响大大超过环境因素,然而确定与骨量差异连锁的基因座(最终是基因)的研究很困难且成本很高。利用连锁分析,例如同胞或大家系的研究设计,一般可得到比单纯相关性研究更多的信息,尽管后者也有其价值。然而,骨量遗传连锁研究受两个主要问题阻碍。第一个问题是表型,如上简述。骨量是连续、定量的性状,建立离散的表型困难。测量的每个解剖学位点可能受几个基因影响,而其中很多可能在位点之间又不相同。第二个问题是表型的年龄成分。等到可以鉴定个体为低骨量时,有很大可能其父母或其它先辈成员已经死亡并因此不能对其研究,而其后代可能甚至还没到峰值骨量,使之遗传分析时其表型分析不能确定。
无论如何,连锁分析可用于发现导致遗传性“疾病”的基因位置而不需要该疾病生化性质的任何知识,也即,不需要知道被认为引起该疾病的突变蛋白。传统方法基于有关疾病过程的假设,该过程可能暗示一种已知蛋白作为待评估的候选者。采用连锁分析的遗传学定位方法可用于首先发现缺陷基因所在的普遍染色体区域,然后逐渐缩小区域大小,以尽可能精确确定具体突变基因的位置。待在候选区内发现基因自身后,与DNA一起鉴定其信使RNA和蛋白,并检查其突变。
遗传学定位方法有其实践意义,因为疾病定位可用于产前诊断,甚至在引起疾病的改变基因被发现之前也可。连锁分析能使家庭知道他们是否携带某个疾病基因,并通过分子诊断估计未出生儿童的状况,即使其中很多没有患病儿童。然后,家族内疾病传播可用于发现缺陷基因。如本文所用,“高骨量”(HBM)类似于一种疾病状态,尽管从实际角度来看高骨量事实上有助于个体避免已知疾病如骨质疏松。
由于染色体从父母到子女的遗传特性,从而可以进行连锁分析。减数分裂期间,两个亲代的同源染色体配对并指导其正确分离到子细胞。当其排列并配对时,两个同源染色体交换其片段,这称为“交换”或“重组”。由此得到的染色体是嵌合体,也就是说,它们同时包含来自父本和母本同源染色体的部分。两个序列在染色体上越靠近,它们之间发生重组的可能性越低,因而它们连锁越紧密。在连锁分析实验中,跟踪从一代传递到下一代的染色体上的两个位置从而判断它们之间的重组频率。在遗传病研究中,通过检测个体是否表现出疾病症状来确定由疾病基因或其对应的正常基因标记的一个染色体位置,即染色体区域的遗传。另一个位置由一段DNA序列标记,它表现出群体中的天然变异,这样基于它们所具有的“标记”序列拷贝可区分两个同源染色体。每个家系中,遗传学标记序列的遗传与疾病状态的遗传相比较。如果在具有常染色体显性疾病如高骨量的家系内,每个受影响的个体具有相同形式的标记并且所有未受影响个体具有至少一个不同形式的标记,那么疾病基因和这个标记位置互相靠近有很大的可能。以此方式,可用已知标记系统性检查染色体并与疾病状态比较。合并来自不同家系的数据,并用统计学方法一起进行计算机分析。结果为表明遗传标记和疾病间的连锁可能性的信息,两者之间可以存在各种距离。阳性结果可能意味着疾病与标记非常近,而阴性结果表明在该染色体上二者距离很远,或者它们在完全不同的染色体上。
连锁分析实施方案如下,按给定标记基因座将受影响家系的所有成员分类,并用标记探针评估特定疾病状态的共遗传,由此确定在多大程度上其中二者共遗传。可用重组频率测量两个基因座间的遗传学距离。1%的重组频率等价于1图距、或1厘摩根(cM),大约等价于1,000kb DNA。这种关系成立可至约20%频率或20cM。
整个人基因组是3,300cM长。为了在标记基因座的5-10cM内发现疾病基因,可用大约330个间距约10cM的指示性标记基因座搜索整个基因组(Botstein et al,Am.J.Hum.Genet.,32:314-331(1980))。用多种统计学方法确定连锁结果的可靠性。人连锁分析中最常用的方法是LOD评分法(Morton,Prog.Clin.Biol.Res.,147:245-265(1984),Morton et al,Am.J.Hum.Genet.,38:868-883(1986)),并由Ott,Am.J.Hum.Genet.,28:528-529(1976)引入计算机程序LIPED中。LOD评分是,在最大至不连锁(距离>50cM)的给定距离上,两个基因座连锁可能性比例的对数。使用对数值的优点是在具有相同疾病的家系间可以加和,在人类家系规模相对较小时这变得很必要。
根据惯例,总LOD评分大于+3.0(也就是说,在特定重组频率下,连锁几率比不连锁几率大1000倍)被认为是在该特定重组频率下连锁的显著证据。总LOD评分小于-2.0(也就是说,在特定重组频率下,不连锁几率比连锁几率大100倍)被认为是在该特定重组频率下,检验的两个基因座不连锁的强有力证据。直到最近,大多数连锁分析的实施基于两点数据,它表示待检验疾病与特定遗传学标记的关系。然而,作为近几年人类基因组图谱迅速进展的结果,以及伴随的计算机方法学的进步,用多点数据进行连锁分析已变得可行。多点分析提供疾病和几个连锁的遗传学标记间的同步连锁分析,而标记间的重组距离是已知的。
多点分析更优有两个原因。第一,家系所提供的信息通常增加。根据对标记基因座为杂合的亲代数目和家系中受影响个体的数目,每个家系有一定量的潜在信息。然而,为使所有个体都提供信息,少量标记即有充分的多态性。如果同时考虑多个标记,那么个体对至少一个标记为杂合的可能性大大增加。第二,可以确定疾病基因在标记间位置的指标。这允许鉴定侧翼的标记,并因而最终允许分离疾病基因所在的小区域。Lathrop et al,Proc.Natl.Acad.Sci.USA,81:3443-3446(1984)已经编写了最广泛使用的计算机程序包,LINKAGE,供多点分析。
本领域需要鉴定与高骨量表型相关的基因,本发明正是针对于这个,以及其它重要的目的。
发明简述
本发明通过遗传学连锁和突变分析描述位于染色体11q13.3上的Zmax1基因和HBM基因。与这些基因连锁的另外的遗传学标记的使用帮助了这个发现。通过使用连锁分析和突变分析,容易鉴定倾向于HBM的人。使用细菌人工染色体的克隆方法使发明人能集中于11q13.3的染色体区域并加速常染色体显性基因的测序。此外,本发明鉴定了Zmax1基因和HBM基因,并鉴定了Zmax1基因582位上的鸟嘌呤至胸腺嘧啶的多态性突变,该突变产生HBM基因和HBM表型。
本发明鉴定了Zmax1基因和HBM基因,它们可用于确定人是否倾向于HBM,并因此不易患以骨密度降低为特征的疾病,包括,例如骨质疏松,或倾向于和易患以异常高骨密度为特征的疾病,例如骨质疏松。携带HBM基因的老年个体表达HBM蛋白,并因此不发生骨质疏松。这种体内观察是强有力的证据,表明使用HBM基因或蛋白或其片段处理正常个体将改善骨质疏松。
进一步,这种处理将适用于骨病变特别是骨折的治疗,以帮助这种损伤痊愈中的骨重建。例如,倾向于或易患压力性骨折(即压力诱导的微骨折的积累,最终导致贯穿骨皮质的真实性骨折)的人可用本发明的方法被鉴定和/或治疗。进一步,本发明的方法和组合物在继发性骨质疏松中有用,其中治疗过程涉及骨重建,如伴随皮质类固醇给药、甲状腺机能亢进、性腺机能减退、恶性血液病、吸收不良和酒精中毒的内分泌状况,以及与维生素D和/或磷酸代谢相关的疾病,如软骨病和佝偻病,和以异常或紊乱的骨重建为特征的疾病,如佩吉特病,以及良性或恶性骨肿瘤。
在各种实施方案中,本发明涉及HBM和Zmax1的核酸、蛋白质、载体和被转化宿主。
此外,本发明涉及本发明上述实施方案的应用,包括例如,骨发育疾病的基因疗法、药物开发和诊断测定。在优选实施方案中,本发明涉及治疗、诊断、预防和筛选骨质疏松的方法。
下面将详细描述本发明的这些和其它方面。
附图简述
图1表示用于遗传学连锁研究的个体家系。每个个体下面是识别号、脊椎BMD的z评分以及11号染色体上的关键标记的等位基因。实心符号代表“受影响”的个体,含“N”的符号是“未受影响”的个体。对来自37个个体的DNA进行了基因型分析。问号表示未知的基因型或未进行基因型分析的个体。
图2表示11q13.3中HBM区域的BAC/STS成分物理图。STS标记来源于基因、EST、微卫星、随机序列和BAC末端序列,标注于长水平线上面。GDB上出现的标记使用相同的命名规则。如果有基因座名(D11S####)则列于主名称(primary name)后的括号内。来自BAC末端序列的STS与在前的BAC名一起列出,其后并分别用L或R表示克隆的左和右端。两个大箭头指出确定HBM关键区的遗传学标记。STS下面的水平线指出用PCR筛选9倍覆盖BAC文库鉴定的BAC克隆。开环指出在文库筛选时标记不扩增对应的BAC文库地址。克隆名用以下规则:B指BAC,板,行和列号,随后用-H表明HBM项目(即B36F16-H)。
图3A-3F表示Zmax1及侧翼内含子序列的基因组结构。翻译起始于1号外显子内有下划线的“ATG”。HBM基因的多态性位点在3号外显子内并用下划线的“G”表示,这个核苷酸在HBM基因中是“T”,mRNA的3’未翻译区在23号外显子中用下划线标出(1号外显子,SEQID NO:40;2号外显子,SEQ ID NO:41;3号外显子,SEQ ID NO:42;4号外显子,SEQ ID NO:43;5号外显子,SEQ ID NO:44;6号外显子,SEQ ID NO:45;7号外显子,SEQ ID NO:46;8号外显子,SEQ ID NO:47;9号外显子,SEQ ID NO:48;10号外显子,SEQID NO:49;11号外显子,SEQ ID NO:50;12号外显子,SEQ ID NO:51;13号外显子,SEQ ID NO:52;14号外显子,SEQ ID NO:53;15号外显子,SEQ ID NO:54;16号外显子,SEQ ID NO:55;17号外显子,SEQ ID NO:56;18号外显子,SEQ ID NO:57;19号外显子,SEQ ID NO:58;20号外显子,SEQ ID NO:59;21号外显子,SEQ ID NO:60;22号外显子,SEQ ID NO:61;23号外显子,SEQ IDNO:62)。
图4表示Zmax1的结构域结构,包括YWTD间隔区、细胞外附着位点、LDL和钙结合位点、富含半胱氨酸的生长因子重复、跨膜区、含CK-II磷酸化位点的理想PEST区和内化结构域。图4也标出HBM蛋白内甘氨酸变为缬氨酸的位点。信号肽位于1-22位氨基酸,胞外结构域位于23-1385位氨基酸,跨膜片段位于1386-1413位氨基酸,胞质结构域位于1414-1615位氨基酸。
图5是与HBM基因有关的BAC重叠群B527D12和B200E21的结构示意图。
图6A-6E是野生型基因Zmax1的核苷酸和氨基酸序列。582位核苷酸的碱基对取代,鸟嘌呤变为胸腺嘧啶,的位置用下划线标出。这种等位基因变体是HBM基因。HBM基因编码一种蛋白,其171位氨基酸由甘氨酸变为缬氨酸。5’非翻译区(UTR)边界碱基1-70,3’UTR边界碱基4916-5120。
图7A和7B是DNA印迹分析,表示Zmax1在多种组织中的表达。
图8是PCR产物分析。
图9是Zmax1外显子3突变的等位基因特异性寡核苷酸检测。
图10是通过用有义和反义探针原位杂交在100×放大下的小鼠Zmax1细胞定位。
图11是通过用有义和反义探针原位杂交在400×放大下的小鼠Zmax1细胞定位。
图12是通过用有义和反义探针对骨内膜的成骨细胞原位杂交在400×放大下的小鼠Zmax1细胞定位。
图13表示MC-3T3细胞中Zmax1表达的反义抑制。
图14表示Zmax1外显子3等位基因特异性寡核苷酸(ASO)测定,这证明了相比于野生型Zmax1等位基因(左框,G-特异性寡核苷酸测定;55℃洗涤),HBM1等位基因的稀有性(右框;T-特异性寡核苷酸测定;58℃洗涤)。右框中的阳性点是阳性对照。
图15表示代表Zmax1在粘着斑信号传递中可能作用的模型。
发明详述
为帮助理解本说明书和权利要求,提供以下定义。
“基因”指一段DNA序列,通过其模板或信使RNA编码特定肽的特征性氨基酸序列。名词“基因”包括间插的非编码区,以及调节区,还可以包括5’和3’末端。
“基因序列”指DNA分子,包括包含非转录或非翻译序列的DNA分子。该名词也意欲包括存在于同一个DNA分子中的基因、基因片段、非转录序列或非翻译序列的任意组合。
本发明的序列可有多种来源,包括DNA、cDNA、合成DNA、合成RNA或其组合。这种序列可含有包括或不包括天然存在内含子的DNA。进一步,这种基因组DNA可与启动子区域或多聚A序列一起获得。序列、基因组DNA或cDNA可由几种方式中的任一种获得。可从适用的细胞通过本领域公知的方法提取和纯化基因组DNA。此外,可从细胞中分离mRNA并通过逆转录或其它方法用于产生cDNA。
“cDNA”指以RNA为模板通过RNA-依赖的DNA聚合酶(逆转录酶)作用产生的互补或拷贝DNA。因此,“cDNA克隆”意指克隆载体上携带或由PCR扩增的,与感兴趣的RNA分子互补的双链DNA序列。这个名词包括从中已去除间插序列的基因。
“重组DNA”意指通过体外拼接cDNA或基因组DNA序列而已被重新结合的分子。
“克隆”指使用体外重组技术插入特定基因或其它DNA序列到载体分子中。为成功克隆所需的基因,有必要用产生DNA片段、连接片段到载体分子、将复合DNA分子引入可将其复制的宿主细胞、以及从接受体宿主细胞中选择含目的基因的克隆的方法。
“cDNA文库”指重组DNA分子的集合,其中含cDNA插入序列并一起包含生物体的完整基因组。这种cDNA文库可通过本领域已知的方法制备,并已有描述,例如Cowell and Austin,“cDNA LibraryProtocols,”Methods in Molecular Biology(1997)。一般来说,首先从生物体细胞中分离RNA,期望从其基因组中克隆特定基因。
“克隆载体”指质粒或噬菌体DNA或能在宿主细胞中复制的其它DNA序列。克隆载体的特征在于一或多个内切酶识别位点,在此位点该DNA序列可按预定的方式被切断而不损失DNA的基本生物学功能,其中可能包含适用于识别转化细胞的标记。
“表达调控序列”指当可操作性连接到基因上时控制或调节结构基因表达的核苷酸序列。它们包括,例如,lac系统,trp系统,lambda噬菌体的主要操纵基因和启动子区域,fd衣壳蛋白的调控区及已知的调控原核或真核细胞基因表达的其它序列。表达调控序列将根据载体设计是在原核还是真核宿主中表达可操作性连接的基因的情况改变,并且可以包含转录元件如增强子元件、终止序列、组织特异性元件和/或翻译起始和终止位点。
“表达载体”与克隆载体相似但在转化入宿主后,可以表达克隆入其中的基因。克隆基因通常置于表达调控序列的控制之下(即与其可操作性连接)。
“操纵基因”指能与特异性阻遏物相互作用的DNA序列,因此控制相邻基因的转录。
“启动子”指能被RNA聚合酶识别的DNA序列。这种序列的存在允许RNA聚合酶结合并起始可操作性连接的基因序列的转录。
“启动子区”意欲包括启动子以及转录起始必需的其它基因序列。启动子区的存在足以引起可操作性连接基因序列的表达。
“可操作性连接”意指启动子控制基因表达的起始。如果导入宿主细胞后启动子决定最邻近的DNA序列转录成一或多种RNA,那么这个启动子被可操作性连接到最邻近的DNA序列。如果启动子能起始DNA序列的转录,那么这个启动子被可操作性连接到该DNA序列。
“原核生物”指没有真正细胞核的所有生物,包括细菌。
“真核生物”指具有真正细胞核的生物和细胞,包括哺乳动物细胞。
“宿主”包括原核生物和真核生物,如酵母和丝状真菌,以及植物和动物细胞。该名词包括作为可复制表达载体的接受体的生物或细胞。
基因“片段”指具有该基因生物活性的基因的任何变体。
“变体”指在结构和生物活性或免疫学特性上与完整基因或与其基因片段基本相似的基因。如果两个基因具有相似的活性,此处可认为它们是变体,即使其氨基酸残基序列不等同。
“核酸扩增”指方法如聚合酶链式反应(PCR)、连接扩增(或连接酶链式反应,LCR)和基于使用Q-beta复制酶的扩增方法。这些方法本领域公知,并描述于,例如,美国专利4,683,195和4,683,202。进行PCR的试剂和硬件可由市场获得。用于扩增来自HBM区的序列的引物优选互补于HBM区域的序列或目标区域侧翼区域的序列并与之特异性杂交。由扩增产生的HBM序列可直接测序。此外,可在测序分析之前克隆扩增序列。
“抗体”可指多克隆和/或单克隆抗体及其片段,以及免疫结合等同物,它可以结合到HBM蛋白及其片段上或来自HBM区域,特别是来自HBM基因座或其一部分的序列上。名词抗体即用于指同源的分子实体,也指混合物如由多种不同的分子实体组成的血清产品。可用蛋白质合成仪合成制备蛋白质并偶联到载体分子上,然后在几个月内注射给兔子。测定兔血清对HBM蛋白或其片段的免疫反应性。可通过向小鼠注射蛋白或其片段来制备单克隆抗体。然后用ELISA筛选单克隆抗体并测定对HBM蛋白或其片段的特异性免疫反应性。Harlow et al,Antibodies:A Laboratory Manual,Cold SpringHarbor Laboratory,Cold Spring Harbor,NY(1998)。这些抗体将用于测定及制药。
“HBM”指高骨量。
“HBM蛋白”指除了包含171位甘氨酸转变为缬氨酸以外,与Zmax1蛋白等同的蛋白。HBM蛋白定义适用于任何编码Zmax1真正同源物的生物。例如,小鼠HBM蛋白指170位甘氨酸由缬氨酸取代的小鼠Zmax1蛋白。
“HBM基因”指在表现HBM特征或表型的个体中发现的基因组DNA序列,其中该序列编码由SEQ ID NO:4表示的蛋白。HBM基因和Zmax1基因是等位基因。HBM基因编码的蛋白具有引起骨量升高的特性,而Zmax1基因编码的蛋白则不是这样。HBM基因和Zmax1基因的区别在于HBM基因在582位有一个胸腺嘧啶,而Zmax1基因在582位有一个鸟嘌呤。HBM基因含有如SEQ ID NO:2所示的核酸序列。HBM基因也可称为“HBM多态性”。
“正常”、“野生型”、“不受影响的”和“Zmax1”都指编码由SEQID NO:3所表示蛋白的基因组DNA序列。Zmax1基因在582位有鸟嘌呤。Zmax1基因含有如SEQ ID NO:1所示的核酸序列。“正常”、“野生型”、“不受影响的”和“Zmax1”也指编码不使骨量升高的蛋白的基因组序列的等位基因变体。Zmax1基因在人群中常见,而HBM基因稀有。
“5YWT+EGF”指在Zmax1蛋白中发现的重复单元,由5个YWT重复及跟随的一个EGF重复组成。
“骨发育”一般指任何涉及随时间而发生骨改变的过程,包括,例如,正常发育,疾病状态中出现的改变以及衰老过程中出现的改变。“骨发育疾病”具体指骨发育中的任何疾病,包括,例如,疾病状态中出现的改变以及衰老过程出现的改变。骨发育本质上可以是进展性的或周期性的。发育中可以改变的骨的方面包括,例如,矿化,特异性解剖学特征的形成,以及多种细胞类型的相对或绝对数目。
“骨调节”或“骨形成调节”指影响涉及骨重建的任何生理过程的能力,如本领域一般技术人员理解,包括,例如但不限于,通过破骨和成骨细胞活性的骨重吸收和骨外向性生长,还可包括如本文所用的一些或全部骨形成和发育。
“正常骨密度”指在Z评分值为0的两个标准偏差内的骨密度。
“Zmax1系统”指纯化蛋白、细胞提取物、细胞、动物、人或任何其它物质的组合物,其中Zmax1以正常或突变形式出现。
“替代标记”指诊断指征,症状、体征或可在细胞、组织、人或动物中观察到的其它特征,它们与HBM基因或骨量升高相关或与二者同时相关,但比骨密度更易测量。替代标记的一般概念在诊断医学中被广泛接受。
本发明包括分别以SEQ ID NO:1和3表示的形式存在的Zmax1基因和Zmax1蛋白,和其它密切相关的变体,以及其精确表达Zmax1所必需的邻近Zmax1的染色体区域。在优选实施方案中,本发明涉及SEQ ID NO:1的核酸序列的至少15个连续核苷酸。
本发明也包括分别以SEQ ID NO:2和4表示的形式存在的HBM基因和HBM蛋白,和其它密切相关的变体,以及精确表达HBM基因所必需的邻近HBM基因的染色体区域。在优选实施方案中,本发明涉及SEQ ID NO:2的核酸序列的至少15个连续核苷酸。更优选的,本发明涉 SEQ ID NO:2的核酸序列的至少15个连续核苷酸,其中15个连续核苷酸之一是582位的胸腺嘧啶。
本发明也涉及Zmax1基因区域的核苷酸序列,以及HBM基因区域的核苷酸序列。更具体地,一种优选实施方案是包含Zmax1基因区域B200E21-H和B527D12-H的片段的BAC克隆。一种优选实施方案是由SEQ ID NO:5-12组成的BAC克隆的核苷酸序列。
本发明也涉及使用核苷酸序列鉴定Zmax1基因和HBM基因的DNA探针,扩增Zmax1基因和HBM基因的PCR引物,Zmax1基因和HBM基因中的核苷酸多态性,和Zmax1基因和HBM基因的调节元件。
本发明描述在染色体11q13.3上遗传学标记D11S987和SNP_CONTIG033-6之间的Zmax1基因和HBM基因的染色体位置的进一步定位,以及Zmax1基因和HBM基因的DNA序列。通过向用于基因作图的图谱上加入更多的遗传学标记,并通过家系扩展以包括更多个体,对染色体位置精确定位。家系扩展很关键,因为已作基因型分析的新个体具有使区域缩窄的关键重组事件。为鉴定11q13.3区域的基因,鉴定了一组包含此染色体区域的BAC克隆。BAC克隆用作基因组DNA测序的模板,也用作通过直接cDNA选择鉴定编码序列的试剂。基因组测序和直接cDNA选择用于表征来自11q13.3的超过1.5M bp的DNA。在此区域内鉴定了Zmax1基因,然后通过受影响和未受影响个体的突变分析发现了HBM基因。
当基因被遗传学定位于特定染色体区域时,可通过一系列的步骤在分子水平表征此区域的基因,所述步骤包括:将整个DNA区域克隆至一组重叠克隆(物理作图),通过直接cDNA选择、外显子捕获和DNA测序(基因鉴定)的结合表征这些克隆编码的基因,和通过对受影响和未受影响HBM家族成员的比较性DNA测序(突变分析)鉴定这些基因的突变。
采用设计用于扩增感兴趣染色体区域的独特分子界标的PCR测定,筛选克隆到在大肠杆菌或啤酒糖酵母中增殖的载体中的人DNA文库,从而实现物理作图。为产生HBM候选区域的物理图谱,用一组序列标志位点(STS)标记筛选克隆到细菌人工染色体(BAC)上的人DNA文库,这些STS已由人类基因组计划预先作图于染色体11q13.3上。
STS是可用PCR测定的人基因组中的独特分子界标。通过人类基因组计划的共同努力,22条常染色体和两条性染色体上的成千上万个STS的位置已经确定。对于定位克隆的工作,物理图谱与遗传图谱有密切联系,因为用于遗传学作图的标记也可用作物理作图的STS。通过用来自于遗传标记、基因和随机DNA片段的STS组合筛选BAC文库,可以汇编含有代表感兴趣染色体区域中所有DNA的重叠克隆的物理图谱。
BAC是用于人及其它DNA大片段(80kb到200kb)的克隆载体,在大肠杆菌中增殖。要用BAC构建物理图谱,需要筛选BAC克隆文库,这样携带对应给定STS或STS组的DNA序列的单个克隆可被鉴定。贯穿大部分人基因组,STS标记相隔大约20-50kb,因此单个BAC克隆典型包含至少两个STS标记。另外,筛选的BAC文库包含的克隆DNA足以覆盖人基因组6次。因此,典型情况下单个STS鉴定一个以上BAC克隆。通过用间隔约50kb的一系列STS标记筛选6倍覆盖的BAC文库,对于人基因组的任何区域可以汇编由一系列重叠BAC克隆,即BAC重叠群,组成的物理图谱。该图谱与遗传图谱密切相联,因为用于制作物理图谱的许多STS标记也是遗传学标记。
构建物理图谱时,在基因组的STS图上经常有空位,这将导致不能鉴定在给定位置重叠的BAC克隆。典型的,首先用从公开文献和万维网资源中已被鉴定的一组STS构建物理图谱。起始图由几个分离的BAC重叠群组成,它们由未知分子距离的空位隔开。为鉴定填充这些空位的BAC克隆,有必要从位于缺口任意一边的克隆末端开发新STS标记。这通过对空位侧翼的BAC进行末端200到300碱基对测序,并开发扩增100以上碱基对序列的PCR测定来完成。如果末端序列表明在人基因组中唯一,则新STS可用于筛选BAC文库以鉴定包含物理图谱中空位处DNA的其它BAC。为汇编覆盖大小为HBM候选区域(2,000,000或更多碱基对)的区域的BAC重叠群,经常有必要从几个克隆的末端开发新STS标记。
建立BAC重叠群后,这组重叠克隆用作鉴定在该染色体区域编码的基因的模板。可通过很多方法实现基因鉴定。常用三种方法:(1)可以对选自代表整个染色体区域的BAC重叠群的一组BAC进行测序,并可用计算机方法鉴定所有的基因,(2)来自BAC重叠群的BAC可用作克隆对应于在该区域编码的基因的cDNA的试剂,它通过称为直接cDNA选择的方法进行,或(3)在称为外显子捕获的方法中通过选择特异DNA序列基序,来自BAC重叠群的BAC可用于鉴定编码序列。本发明包括通过前两种方法鉴定的基因。
为测序代表HBM候选区域的整个BAC重叠群,选择了一组BAC,将其亚克隆到质粒载体中,随后对这些亚克隆进行DNA测序。因为克隆到BAC中的DNA代表基因组DNA,这种测序被称为基因组测序以与cDNA测序相区别。为起始感兴趣染色体区域的基因组测序,选择了几个不重叠的BAC克隆。制备每个BAC克隆的DNA,将克隆剪切成随机小片段,然后将其克隆到标准质粒载体如pUC18中。使质粒克隆生长以增殖小片段,并将其作为测序模板。为确保适度覆盖以及BAC DNA序列的序列质量,对足够的质粒克隆进行测序,以获得对BAC克隆的6倍覆盖。例如,如果BAC长100kb,那么噬菌粒要测序产生600kb的序列。因为在克隆到噬菌粒载体之前将BAC DNA随机剪切,600kb的原始DNA序列可通过计算方法汇编成重叠的DNA序列,称为序列重叠群。对于用计算方法进行初步基因鉴定,每个BAC的6倍覆盖足以产生10到20个1000bp到20,000bp的序列重叠群。
本发明使用的测序策略是初步测序来自HBM候选区域BAC重叠群的“种子”BAC。然后用“种子”BAC的序列鉴定来自重叠群中的最小重叠BAC,随后对其测序。以此方式,对整个候选区域进行测序,在每个BAC上有几个小序列空位。该序列用作计算机基因鉴定的模板。计算机基因鉴定的一种方法是将BAC重叠群序列与cDNA和基因组序列的公共数据库,例如,unigene,dbEST,genbank进行比较。这种比较一般使用BLAST系列的计算机算法和程序(Altschul et al,J.Mol.Biol.,215:403-410(1990))。BAC序列也可翻译成蛋白序列,可用设计用于分析蛋白序列的BLAST版本,用蛋白序列搜索公共蛋白数据库(Altschul et al,Nucl.Acids Res.,25:3389-3402(1997))。另一种方法是用计算机算法如MZEF(Zhang,Proc.Natl.Acad.Sci.,94:565-568(1997))和GRAIL(Uberbacher et al,Methods Enzymol.,266:259-281(1996)),它基于所有外显子中常见的特异性DNA序列基序的存在,以及人蛋白编码序列中典型的密码使用的存在,来预测序列中外显子的位置。
除通过计算方法鉴定基因外,也可通过直接cDNA选择鉴定基因(Del Mastro et al,Genome Res.5(2):185-194(1995))。在直接cDNA选择中,制备感兴趣组织的cDNA池,并用来自候选区域的BAC进行液体杂交测定以捕获与BAC中编码区碱基互补的cDNA。在描述于此的方法中,通过用多聚A RNA随机引发第一链cDNA,用标准方法合成第二链cDNA,再加接头到cDNA片段的末端,从而从几种不同的组织创建cDNA池。接头用于扩增cDNA池。BAC克隆用作体外DNA合成的模板,以创建生物素标记的BAC DNA拷贝。然后将生物素标记的BAC DNA拷贝变性并与过量的PCR扩增的、带接头的也已变性的cDNA池一起孵育。使BAC DNA和cDNA在溶液中退火,并用链亲合素包被的磁珠分离BAC和cDNA间的杂合双链。然后用与接头序列互补的引物扩增被BAC捕获的cDNA,并重复进行第二轮杂交/选择过程。经两轮直接cDNA选择后,克隆cDNA片段,并创建这些直接选择片段的文库。
用两种方法分析由直接选择分离的cDNA克隆。由于来自HBM候选区域的BAC池用于提供基因组DNA序列,cDNA必须作图到各个BAC上。这通过在微滴定板中排列BAC,并在高密度格栅中复制其DNA来实现。然后将各个cDNA克隆与格栅杂交,确保它们与来自直接选择所用组的各个BAC有一致的序列,并确定该BAC的特定特征。对已确保对应各个BAC的cDNA克隆进行测序。为确证由直接选择分离的cDNA克隆是否具有与已鉴定基因相同或相似的序列,用BLAST系列程序将DNA和蛋白编码序列与公共数据库比较。
由BAC测序和直接cDNA选择提供的基因组DNA序列和cDNA序列的组合产生区域内推测基因的初步清单。该区域中的基因都是HBM基因座的候选基因。为进一步表征每个基因,进行DNA印迹以确定对应于每个基因的转录物的大小,并确定哪些假定的外显子被一起转录,以形成独立的基因。对于每个基因的DNA印迹分析,由直接选择的cDNA克隆或通过PCR扩增基因组DNA的特异性片段或编码感兴趣推测基因的BAC的特异性片段来制备探针。DNA印迹给出转录物大小的信息以及其表达组织。对于非高表达的转录物,有时有必要用感兴趣组织的RNA作反应模板进行逆转录PCR测定。
通过计算方法和直接cDNA选择进行基因鉴定提供关于染色体上基因的独特信息。鉴定基因后,可以检测不同个体在每个基因的突变。I.用DXA测量进行表型分析
脊椎骨矿物质含量(BMC)和骨矿质密度(BMD)测量在克来登大学(Omaha,Nebraska))由使用Norland Instruments密度测量仪(Norland XR2600密度测量仪,双能量X线吸光分析,DXA)的DXA进行。另一位置的脊椎BMC和BMD使用可用的机器。估计是当前在美国运行的800 DXA机。大多数大城市有具DXA能力的办公室或成像中心,通常是Lunar或Hologic机。提供脊椎BMC和BMD数据的每个位置包括多个拷贝的机器打印清单,以证明进行BMD测定的感兴趣区域已被适当选择。已获得完整的临床史和骨骼放射照片。
用下列标准定义HBM表型:极高脊椎BMD;缺乏任何已知高骨量综合征的临床史;和表示四肢骨骼正常外形的骨骼放射照片。II.微卫星标记的基因型分析
为将遗传间隔缩小到比Johnson et al,Am.J.Hum.Genet.,60:1326-1332(1997)原始报道中更小的区域,在染色体11q12-13上分型更多的微卫星标记。新标记包括:D11S4191,D11S1883,D11S1785,D11S4113,D11S4136,D11S4139,(Dib,et al,Nature,380:152-154(1996),FGF3(Polymeropolous,et al,Nucl.Acid Res.,18:7468(1990)),以及GTC_HBM_标记_1,GTC_HBM_标记_2,GTC_HBM_标记_3,GTC_HBM_标记_4,GTC_HBM_标记_5,GTC_HBM_标记_6,GTC_HBM_标记_7(见图2)。
由注册抽血师抽血(20ml)转至有淡紫色帽(含EDTA)的试管中,血液冻存直至DNA提取。从冰箱中保存最多7天的血液中提取DNA的质量和数量都没有降低。对于在远处抽血的受试者,运输方法成功适用于很多场合。通过隔夜快递运输血样,血样保存于带冷冻包装的聚苯乙烯泡沫容器中以提供冷却。将有淡紫色帽的试管放入独立的塑料运输管并放入带拉链锁的生物危险性盒中。当样品第二天到达时,立即处理提取其DNA。
DNA提取方法使用购自Gentra Systems,Inc.(Minneapolis,Minnesota)的试剂盒。简单地说,程序包括向全血中加入3倍体积的红细胞裂解缓冲液。室温孵育10分钟后,在Beckman台式离心机中以2,000Xg离心溶液10分钟。将白细胞沉淀重悬于细胞裂解缓冲液中。等沉淀完全重悬并且无细胞团时,立即用RNA酶A于37℃消化溶液15分钟。加入提供的蛋白沉淀溶液将蛋白沉淀并离心除去。向上清液中加入异丙醇以沉淀DNA。这种方法简单快速,只需要1-2小时,并可以同时处理几十个样品。用20ml全血样品一般可得到>8mg的DNA,且分子量>50kb。以乙醇沉淀形式保存编号的50μg等分DNA以归档。
用一个荧光标记的寡核苷酸引物和一个未标记的寡核苷酸引物对DNA进行基因型分析。标记和未标记寡核苷酸由Integrated DNATechnologies,Inc.(Coralville,Iowa)获得。微卫星基因型分析使用的所有其它试剂购自Perkin Elmer-Applied Biosystems,Inc.(“PE-ABI”)(Norwalk,Connecticut)。如PE-ABI所述用AmpliTaq DNA聚合酶对每个标记分别进行单独PCR反应。向反应中加入3.5μl上样缓冲液,其中含有去离子甲酰胺、蓝色葡聚糖和TAMRA350大小标准(PE-ABI)。于95℃加热5分钟以变性DNA后,按377型DNA测序仪(PE-ABI,Foster City,California)操作手册所述加样并电泳。凝胶电泳后,数据用PE-ABI GENESCANTM和GENOTYPERTM软件分析。首先,在GENESCANTM软件内,在第一步分析之前手工优化泳道轨迹。取出凝胶泳道数据后,检查每条泳道的标准曲线图并验证其线性度和大小访问(size calling)。其中任何参数有问题的泳道再重新追踪轨迹并验证。所有泳道追踪完且大小标准正确鉴定后,数据输入GENOTYPERTM进行等位基因鉴定。为加速等位基因访问(allele calling(起始)),使用了Guy Van Camp博士的因特网站点(http://alt.www.uia.ac.be/u/dnalab/ld.html)的程序LinkageDesigner。该程序大大加快了由GENOTYPERTM产生的数据进入家系绘图程序(pedigree drawing program)Cyrillic(2.0版,CherwellScientific Publishing Limited,Oxford,Great Britain)及其随后用程序LINKAGE(Lathrop et al,Am.J.Hum.Genet.,37:482-498(1985))进行连锁分析的速度。III.连锁分析
图1表示用于本发明遗传连锁研究的个体家系。特别地,用程序LINKAGE(Lathrop et al,Am.J.Hum.Genet.,37:482-498(1985))的MLINK和LINKMAP组件进行两点连锁分析。作为前文档从Cyrillic输出家系/标记数据进入Makeped程序并转换成合适的ped文档进行连锁分析。
原始连锁分析使用三种模型:(i)常染色体显性,全贯通模型(fully penetrant model),(ii)贯通降低的常染色体显性模型,和(iii)定量性状模型。通过分析来自一个大家系的22个成员的DNA的连锁标记将HBM基因座作图于染色体11q12-13。其中使用了高度自动化的技术,以及一组345个荧光标记,它们分布于22个常染色体上,间隔约6-22cM。只有来自11号染色体该区域的标记表现出明显连锁(LOD评分~3.0)。两点和多点分析中获得的最高LOD评分(5.74)是D11S987(图2中图谱位置55)。95%置信区间将HBM基因座定位于标记DS905和D11S937之间(图2中图谱位置41-71)。单倍型分析也将Zmax1基因定位于相同区域。标记D11S987、D11S905、和D11S937的进一步描述可见Gyapay et al,Nature Genetics,Vol.7,(1994)。
本发明中,发明人报道了将HBM间隔缩小到标记D11S987和GTC_HBM_标记_5之间的区域。这两个标记位于原始分析的定界标记之间(D11S11S905和D11S937)且大约相距3cM。使用来自标记D11S4191、D11S1883、D11S1785、D11S4136、D11S4139、(Dib et al,Nature,380:152-154(1996))、FGF3(Polymeropolous et al,Nucl.Acid Res.,18:7468(1990))(有关遗传标记的信息可见基因组数据库因特网站点,http://gdbwww.gdb.org/)、以及来自标记GTC_HBM_标记_1、GTC_HBM_标记_2、GTC_HBM_标记_3、GTC_HBM_标记_4、GTC_HBM_标记_5、GTC_HBM_标记_6和GTC_HBM_标记_7的基因型数据,成功缩小了间距。
如图1所示,用上述遗传标记进行的单倍型分析鉴定了个体9019和9020中的重组事件(交换),这显著提高了Zmax1基因所在的11号染色体间隔的精确性。个体9019是受HBM影响的个体,他从母本染色体中继承了带HBM基因的部分11号染色体,并继承了部分11号染色体的同源染色体。从携带HBM基因的染色体继承的部分包含标记D11S935、D11S1313、GTC_HBM_标记_4、D11S987、D11S1296、GTC_HBM_标记_6、GTC_HBM_标记_2、D11S970、GTC_HBM_标记_3、D11S4113、GTC_HBM_标记_1、GTC_HBM_标记_7和GTC_HBM_标记_5。从D11S4136向端粒方向延伸的部分来源自非HBM染色体。该数据将Zmax1基因定位于标记GTC_HBM_标记_5的靠近中心粒侧位置。个体9020是未受影响的个体,也表现出关键重组事件。此个体继承了重组的亲代11号染色体,其中包括来自其父亲(个体0115)的携带HBM基因的11号染色体同源染色体的标记D11S935、D11S1313、GTC_HBM_标记_4、D11S987、D11S1296和GTC_HBM_标记_6,和来自其父亲的不携带HBM基因的11号染色体的标记GTC_HBM_标记_2、D11S970、GTC_HBM_标记_3、GTC_HBM_标记_1、GTC_HBM_标记_7、GTC_HBM_标记_5、D11S4136、D11S4139、D11S1314和D11S937。由于个体0115的纯合特性,标记D11S4113未给出信息。这种重组事件将HBM区域的中心粒侧边界定位于标记D11S1296和D11S987之间。
两点连锁分析也用于确证Zmax1基因在11号染色体上的位置。全贯通模型下两点连锁分析的连锁结果示于下表1。该表第一列列出遗传标记,表上部横向给出重组比率。各列的每个框内表示在第一行所示重组比率下测定的与Zmax1基因连锁的每个标记的LOD评分。例如,峰值LOD评分7.66出现于标记D11S970,它在单倍型分析所定义的间隔内。
表1
标记 | 0.0 | 0.05 | 0.1 | 0.15 | 0.2 | 0.25 | 0.3 | 0.35 | 0.4 |
D11S935 | -无穷大 | 0.39 | 0.49 | 0.47 | 0.41 | 0.33 | 0.25 | 0.17 | 0.10 |
D11S1313 | -无穷大 | 2.64 | 2.86 | 2.80 | 2.59 | 2.30 | 1.93 | 1.49 | 1.00 |
D11S987 | -无穷大 | 5.49 | 5.18 | 4.70 | 4.13 | 3.49 | 2.79 | 2.03 | 1.26 |
D11S4113 | 4.35 | 3.99 | 3.62 | 3.24 | 2.83 | 2.40 | 1.94 | 1.46 | 0.97 |
D11S1337 | 2.29 | 2.06 | 1.81 | 1.55 | 1.27 | 0.99 | 0.70 | 0.42 | 0.18 |
D11S970 | 7.66 | 6.99 | 6.29 | 5.56 | 4.79 | 3.99 | 3.15 | 2.30 | 1.44 |
D11S4136 | 6.34 | 5.79 | 5.22 | 4.61 | 3.98 | 3.30 | 2.59 | 1.85 | 1.11 |
D11S4139 | 6.80 | 6.28 | 5.73 | 5.13 | 4.50 | 3.84 | 3.13 | 2.38 | 1.59 |
FGF3 | 0.59 | 3.23 | 3.15 | 2.91 | 2.61 | 2.25 | 1.84 | 1.40 | 0.92 |
D11S1314 | 6.96 | 6.49 | 5.94 | 5.34 | 4.69 | 4.01 | 3.27 | 2.49 | 1.67 |
D11S937 | -无穷大 | 4.98 | 4.86 | 4.52 | 4.06 | 3.51 | 2.88 | 2.20 | 1.47 |
单核苷酸多态性(SNP)进一步定义HBM区域。该SNP被命名为SNP_Contig033_6,定位于遗传标记GTC_HBM_标记_5的中心粒侧25kb。这个SNP在遗传标记GTC_BM_标记_7的端粒侧。SNP_Contig033_6出现于受HBM影响的个体0113中。然而,受HBM影响的个体9019是0113的儿子,不携带此SNP。因此,这表明交换发生于该SNP的中心粒侧。遗传标记GTC_HBM_标记_5和GTC_HBM_标记_7的引物序列见下表2。
表2
标记 | 引物(正向) | 引物(反向) |
GTC_HBM_标记_5 | TTTTGGGTACACAATTCAGTCG | AAAACTGTGGGTGCTTCTGG |
GTC_HBM_标记_7 | GTGATTGAGCCAATCCTGAGA | TGAGCCAAATAAACCCCTTCT |
所述家系有几个很有兴趣的特征,最重要的是他们的骨骼尽管非常致密,但具有绝对正常的形状。受HBM影响个体的骨骼外径正常,并且也有髓腔,不影响造血。HBM影响成员似乎对骨折有抗性,并且在检查的成员中没有神经症状,也没有任何器官或系统功能损伤的症状。家系中HBM影响成员生活到老年时没有不适的疾病和残疾。而且,HBM表型不与其它骨疾病相配,如骨质疏松、骨质疏松假胶质瘤、恩格尔曼病、Ribbing’s病、高磷酸盐血症、范布伦病、肢骨纹状肥厚、骨硬化病、致密性成骨不全症、硬化性狭窄、脆弱性骨硬化、肢端肥大症、佩吉特病、纤维结构不良、小梁狭窄、成骨不全、甲状旁腺功能减退、假性甲状旁腺功能减退、假假性甲状旁腺功能减退、原发性和继发性甲状旁腺功能亢进以及相关综合征、高钙尿、甲状腺髓样癌、骨软化及其它疾病。很清楚,HBM基因座在该家系中对于调节骨密度具有非常强大和实质性的作用,并且其鉴定对于理解调节骨密度的途径和疾病如骨质疏松的病因是重要的一步。
另外,携带HBM基因的老年个体因而表达HBM蛋白,不表现正常个体骨量丢失的特性。也就是说,HBM基因是骨质疏松的抑制因子。实质上,携带HBM基因的个体被给予HBM蛋白,并因而不产生骨质疏松。这种体内观察是强有力的证据,证明用HBM基因或蛋白或其片段处理正常个体,将改善骨质疏松。IV.物理作图
为提供克隆和表征HBM基因座的试剂,上述遗传图数据用于构建染色体11q13.3上含Zmax1区域的物理图谱。物理图由一组有序的分子界标、和一组来自染色体11q13.3的含Zmax1区域的BAC克隆组成。
各种公开可用的图谱资源被用于鉴定HBM区域的已有STS标记(0lson et al,Science,245:1434-1435(1989))。资源包括GDB,怀特海德研究所基因组中心,dbSTS和dbEST(NCBI),11db,得克萨斯大学西南GESTEC,斯坦福人类基因组中心,和几篇参考文献(Courseaux et al,Genomics,40:13-23(1997),Courseaux et al,Genomics,37:354-365(1996),Guru et al,Genomics,42:436-445(1997),Hosoda et al,Genes Cells,2:345-357(1997),James etal,Nat.Genet.,8:70-76(1994),Kitamura et al,DNA Research,4:281-289(1997),Lemens et al,Genomics,44:94-100(1997),Smith et al,Genome Res.,7:835-842(1997))。对图谱手工整合以鉴定作图于含Zmax1的区域的标记。
已有STS的引物由GDB或参考文献获得,列于下表3。因而,表3表示用于制作Zmax1基因区域物理图谱的STS标记。表3:HBM STS表表3:HBM STS表表3:HBM STS表表3:HBM STS表表3:HBM STS表
新STS产生自公开可用的基因组序列或序列起源的BAC插入末端。引物挑选的程序采用Cross_match(P.Green,U.of Washington)自动执行载体和重复序列掩盖,以及用Primer3自动执行随后的引物挑选(Rozen,Skaletsky(1996,1997))。Primer3可在以下网址找到
www.genome.wi.mit.edu/genome_software/other/primer3.html。
每对引物的聚合酶链式反应(PCR)条件针对MgCl2浓度进行初步优化。标准缓冲液是10mM Tris-HCl(pH8.3),50mM KCl,MgCl2,0.2mM每种dNTP,0.2μM每种引物,2.7ng/μl人DNA,0.25单位AmpliTaq(Perkin Elmer)和MgCl2浓度1.0mM,1.5mM,2.0mM或2.4mM。循环条件包括94℃初次变性2分钟,随后40个循环94℃15秒、55℃25秒和72℃25秒,然后72℃最后延伸3分钟。根据第一轮优化的结果必要时进一步优化条件。变量包括升高退火温度到58℃或60℃,增加循环数到42和退火及延伸时间到30秒,以及用AmpliTaqGold(Perkin Elmer)。
含感兴趣STS标记的BAC克隆(Kim et al,Genomics,32:213-218(1996),Shizuya et al,Proc.Natl.Acad.Sci.USA,89:8794-8797(1992))通过基于PCR的对来自总人BAC文库(购自ResearchGenetics)的DNA池进行筛选的方法获得。来自文库板1-596的DNA池用于对应人DNA的9个基因组等价物。初步筛选过程包括对超级池(superpool)的各个标记,也即对起源自8个384孔文库板的所有BAC克隆的DNA混合物的PCR反应。对每个阳性超级池,对板(8),行(16)和列(24)池进行筛选,以鉴定独特的文库地址。用2%琼脂糖凝胶(Sigma)将PCR产物在150V电泳45分钟,凝胶中在1×TBE中含0.5μg/ml溴化乙锭。所用电泳单元是来自Owl Scientific Products的A3-1系统。典型地,凝胶含10排泳道,每排50个孔。分子量标记(100bp序列梯,Life Technologies,Bethesda,MD)加于凝胶的两端。用Kodak DC40 CCD相机对凝胶成像并用Kodak 1D软件处理。凝胶数据输出为表格分隔的文本文件;文件名包括关于所筛选文库的信息,凝胶图像文件和筛选标记。用定制的Perl程序将这些数据自动输入到FilemakerTM PRO(Claris Corp.)数据库中供数据储存和分析。在所获得克隆地址信息不完全或模糊的情况下,另做试验以恢复独特、完整的文库地址。
从文库中回收克隆化BAC培养物包括从文库孔中取出样品在含12.5μg.ml氯霉素(Sigma)的LB琼脂上划线培养(Maniatis et al,Molecular Cloning:A Laboratory Manual.,Cold Spring HarborLaboratory,Cold Spring Harbor,NY(1982))。两个单菌落和一部分原始划线用合适的STS标记通过菌落PCR测定验证。阳性克隆于-70℃保存于含12.5μg.ml氯霉素和15%甘油的LB培养液中。
几种不同类型的DNA制备方法用于分离BAC DNA。下面列出的手工碱裂解小量制备方法(Maniatis et al,Molecular Cloning:ALaboratory Manual.,Cold Spring Harbor Laboratory,Cold SpringHarbor,NY(1982))成功用于大多数情况,即限制性图谱分析,CHEF凝胶分析,FISH图谱分析,但不能成功地可重复的用于末端测序。Autogen和Qiagen方法特别用于制备BAC DNA供末端测序用。
用50ml锥形瓶在15ml含12.5μg.ml氯霉素的Terrific培养液中以300rpm摇动在37℃培养细菌20小时。培养物在Sorvall RT6000D中以3000rpm(约1800g)在4℃离心15分钟。然后尽可能完全的抽去上清液。有时在此步将细胞沉淀冻存于-20℃最多两周。然后对沉淀进行混旋以使细胞均匀并尽可能消除团块。加入250μl P1溶液(50mM葡萄糖,15mM Tris-HCl,pH8,10mM EDTA和100μg/mlRNA酶A),上下抽吸混合物以混匀。将混合物转入2ml Eppendorf管。然后加入350μl P2溶液(0.2N NaOH,1%SDS),小心混合混合物并在室温孵育5分钟。加入350μl P3溶液(3 M KOAc,pH 5.5),小心混合混合物直至形成白色沉淀。溶液在冰上孵育5分钟,然后在微离心机中4℃离心10分钟。上清液仔细转移(避免白色沉淀)到新的2ml Eppendorf管中,加入0.9ml异丙醇,混合溶液并置冰上5分钟。样品离心10分钟,仔细除去上清液。用70%乙醇洗涤沉淀并空气干燥5分钟。沉淀重悬于200μl TE8(10mM Tris-HCl,pH8.0,1.0mM EDTA)中,并加入RNA酶A到100μg/ml。样品在37℃孵育30分钟,然后通过加入C2H3O2Na 3H2O到0.5M和2体积乙醇使样品沉淀。将样品离心10分钟,70%乙醇洗涤沉淀并空气干燥,然后溶于50μl TE8中。这种DNA制备方法的典型产率是3-5μg/15ml细菌培养物。10-15μl用于HindIII限制酶分析;5μl用于NotI消化并通过CHEF凝胶电泳确定克隆插入片段大小。
BAC接种于50ml锥形管中的15ml含12.5μg.ml氯霉素的2×LB培养液中。每个克隆接种4管。培养物激烈摇动下(>300rpm)于37℃生长过夜(约16小时)。BAC DNA分离的标准条件按照Autogen 740制造商推荐的方法。3ml培养物样品放入Autogen管中,共60ml或每个克隆20管。根据Autogen方法振荡15秒,使样品最终溶于100μlTE8中,之后将DNA溶液从每支管中转移出来并集中到一个2mlEppendorf管中。避免管子中留有大量碎片(来自碎片沉积步骤)。然后用0.5ml TE8漂洗管子并将此溶液加到合并物中。DNA溶液保存于4℃,-20℃冻存时常会出现块状物。此DNA或者直接用于限制酶作图、CHEF凝胶分析或FISH作图,或者按下面进一步纯化用于末端测序反应。
用TE8将DNA溶液体积调到2ml,轻轻混合样品并在65℃加热10分钟。4℃离心DNA溶液5分钟,上清液转移到15ml锥形管。调整NaCl浓度到0.75M(加入~0.3ml 5M NaCl到2ml样品中)。用Qiagen柱平衡缓冲液(QBT缓冲液)将总体积调到6ml,将含DNA的上清液加到柱上并使之以重力流进入。用10ml Qiagen缓冲液QC洗柱两次。随后分别用保存于65℃的缓冲液QF洗脱结合DNA 4次,每次1ml。用0.7体积异丙醇(约2.8ml)沉淀DNA。分别将每个样品转移入4个2.2ml Eppendorf管中,并在室温孵育2小时或过夜。用微离心机在4℃离心样品10分钟,仔细除去上清液,再加1ml 70%乙醇再次离心样品。因为在此步DNA沉淀常是松散的,所以要仔细除去上清液。再次离心样品以浓缩剩下的液体,用微吸头将液体除去,在干燥器中干燥DNA沉淀10分钟。每管中加入20μl无菌蒸馏去离子水,然后置4℃过夜。合并每个克隆的4个20μl样品,再用20μl无菌蒸馏去离子水漂洗使终体积到100μl。65℃加热样品5分钟,然后轻轻混合。用NotI消化并与未切的lambda DNA比较来分析,典型产量是2-5μg/60ml培养物。
3ml含12.5μg/ml氯霉素的LB培养液分装入高压灭菌Autogen管中,每个克隆用一支管。为进行接种,将甘油原液从-70℃取出放在干冰上。用无菌牙签从原始管中挑一小部分甘油原液转入Autogen管;牙签至少在Autogen管中保留两分钟再弃掉。接种后用胶带盖住管口确保密封严密。所有样品被接种后,将管子转入Autogen托架并放于37℃旋转摇床,在250rpm下摇16-17小时。生长后,按制造商给定的BAC DNA制备标准条件,来进行Autogen。程序中不将样品溶于TE8中,而保持DNA沉淀干燥。程序完成后,从输出盘取出管子,直接向管底加入30μl无菌蒸馏去离子水,轻轻摇动2-5秒,用parafilm膜覆盖并在室温孵育1-3小时。将DNA样品转入Eppendorf管中,或者直接用于测序或4℃保存供以后使用。V.用于物理作图的BAC克隆表征
手工碱裂解或Autogen方法制备的DNA样品用HindIII消化供限制性片段大小分析。该数据用于比较克隆之间的重叠程度。典型地每反应用1-2μg。反应混合物包括:1×缓冲液2(New EnglandBiolabs),0.1mg/ml牛血清白蛋白(New England Biolabs),50μg/mlRNA酶A(Boehringer Mannheim),和20单位HindIII(New EnglandBiolabs),最终体积25μl。消化物在37℃孵育4-6小时,BAC DNA也用NotI消化以通过CHEF凝胶分析(见下)估计插入片段的大小。除了使用20单位NotI外,反应条件与HindIII的相同。电泳之前加入含溴酚蓝和二甲苯腈蓝的6μl 6×Ficoll上样缓冲液。
在含0.5μg/ml溴化乙锭的1×TBE中用0.6%琼脂糖(Seakem,FMC Bioproducts)分析HindIII消化物。在A4型电泳仪(OwlScientific)中进行凝胶(20cm×25cm)电泳,电压50伏持续20-24小时。分子量大小标记包括未消化的lambda DNA、HindIII消化的lambda DNA和HaeIII消化的_×174 DNA。上样前将分子量标记在65℃加热2分钟。用Kodak DC40 CCD照相机成像并用Kodak 1D软件分析。
根据制造商推荐,在CHEF DRIl(BioRad)电泳仪上分析NotI消化物。简而言之,在0.5×TBE中制备1%琼脂糖凝胶(BioRad脉冲场级),在电泳仪中14℃平衡30分钟,以6伏特/cm循环电泳14小时。转换时间从10秒到20秒。电泳后在0.5μg/ml溴化乙锭中对凝胶染色。分子量标记包括lambda DNA、HindIII消化的lambda DNA、lambda序列梯、PFG序列梯和低范围PFG标记(全部来自New EnglandBiolabs)。
根据制造商推荐并有少许修改,用Bioprime标记试剂盒(BioRad)标记手工碱裂解或Autogen方法制备的BAC DNA。每50μl反应用约200ng DNA,3μl用2%琼脂糖凝胶分析以确定标记程度。原位杂交之前用Sephadex G50旋转离心柱纯化反应体系。按文献所述(Ma etal,Cytogenet.Cell Genet.,74:266-271(1996))进行中期FISH。VI.BAC末端测序
BAC插入片段末端测序所用的DNA是用上述两种方法之一来进行制备的。用于测序的动态能量转移引物和动态直接循环测序试剂盒购自Amersham。包括M13-40正向测序引物(目录号#US79730)的预制测序混合体系用于T7 BAC载体末端;预制的测序混合体系(目录号#US79530)与M13-28的反向测序引物(目录号#US79339)混合,用于SP6 BAC载体末端。测序反应混合体系包括:四种荧光标记的染料引物的一种,四种双脱氧终止混合物中的一种,dNTPS,反应缓冲液以及热测序酶(Thermosequenase)。对每一个BAC DNA样品来说,3μl BAC DNA样品等分到四个PCR条形管中,然后加入2μl四种染料引物中的一种/反应终止混合物的组合到每一个管中。将管密封,在进行PCR以前短时离心。热循环条件如下:95℃变性1分钟,45℃退火15秒,70℃延伸1分钟,共循环35次,循环结束后反应板进行短时间离心,将所有液体收集到管底。然后每管加入5μl无菌蒸馏去离子水,将管密封再次短时间离心。然后将每一个BAC的四个样品汇合到一起。接着每一管中加入1.5μl 7.5M NH4OAc和100μl-20℃的100%乙醇,进行DNA沉淀。上下抽吸一次将样品混匀。然后密封板,冰上孵育10分钟。用Haraeus台式离心机在4℃以4000rpm(3290g)将板离心30分钟以回收DNA。弃上清液,多余的液体印到纸巾上吸去。每管加入100μl-20℃ 70%乙醇洗涤沉淀,4℃下4000rpm(3,290g)再离心10分钟。弃上清液,多余的液体再印到纸巾上吸去。将板正面朝下扣在纸巾上离心,当转速达到800rpm时即停,以去除残余的痕量液体。样品在室温下空气干燥30分钟。给管加盖,在电泳以前以干燥状态存放在-20℃。电泳前将DNA迅速溶解到1.5μlAmersham上样染液中。将板密封并以2000rpm(825g)离心,在平板振荡器上涡旋1-2分钟后将样品以2000rpm(825g)短时间再离心。样品在65℃加热2分钟,然后迅速转移到冰上。根据制造商推荐,标准凝胶电泳在ABI 377荧光测序仪上进行。VII.HBM BAC DNA的亚克隆和测序
Zmax基因区的物理图提供了一组BAC克隆,其中包含Zmax1基因和HBM基因。来自这个区域的一些BAC的DNA测序已经完成。DNA测序数据是一个独特的因素,它包括了这种数据,该领域的技术人员能用它们鉴定Zmax1基因和HBM基因,或用于制备探针来识别这些基因,或通过识别DNA序列多态性来鉴定这些基因。
BAC DNA有两种分离方法:Qiagen BAC DNA纯化法(Qiagen,Inc.根据产品说明)或手工纯化方法,后者是标准碱裂解/氯化铯质粒DNA制备方法的改进(见例如:Ausubel et al,Current Protocols inMolecular Biology,John Wiley & Sons(1997))。就手工方法简单来说,沉淀细胞,将其重悬于GTE(50mM葡萄糖,25mM Tris-c1(pH8),10mM EDTA)和溶菌酶(50mg/ml溶液)。随后加入NaOH/SDS(1%SDS/0.2N NaOH),然后再加入冰冷的3M KOAc(pH4.5-4.8)。RNA酶A加入到过滤的上清液中,再加入蛋白酶K和20%的SDS。DNA用异丙醇沉淀,干燥,然后重悬于TE(10mM Tris,1mM EDTA(pH 8.0))中。进一步用氯化铯密度梯度离心法纯化BAC DNA(Ausubel et al,CurrentProtocols in Molecular Biology,John Wiley & Sons(1997))。
分离后,用HPLC水压(Hengen,Trends in Biochem.Sci.,22:273-274(1997))剪切BAC DNA达到2000-3000bp的插入片段大小。剪切后将DNA浓缩并用标准1%琼脂糖凝胶分离。将对应适当大小的单个级分从凝胶上切下并用电洗脱纯化(Sambrook et al,Molecular Cloning:A Laboratory Manual,Cold Spring HarborLaboratory,Cold Spring,NY(1989))。
用T4 DNA聚合酶将纯化DNA片段末端钝化,然后将钝端DNA连接到独特的BstXI接头上(5’GTCTTCACCACGGGG和5’GTGGTGAAGAC,摩尔量过量100-1000倍)。这些接头与BstXI切割的pMPX载体(由发明人构建)互补,而突出端自身不互补。因此,接头将不会形成串联体,切开的载体自身也不易重新连接。用1%琼脂糖凝胶将连接了接头的插入片段与未插入的接头分开并用GeneClean(BIO 101,Inc.)纯化。然后将连接了接头的插入片段连接到改进的pBlueScript载体上以构建“鸟枪”亚克隆文库。当接头二聚体被克隆进去之后,载体克隆位点上所含框外lacZ基因变为框内,允许由于其蓝色使其被避免。
以下所有步骤基于ABI377自动DNA测序方法测序,只有对方法的较大修改才强调。简而言之,把文库转化至DH5α感受态细胞(LifeTechnologies,Bethesda,MD,DH5α转化方法),将其涂到含氨苄青霉素和IPTG/Xgal的抗生素平板上对其进行评估,平板37℃孵育过夜。用成功转化体平铺克隆并挑选出来进行测序。培养物37℃生长过夜。用硅胶珠DNA制备方法(Ng et al,Nucl.Acids Res.,24:5045-5047(1996))纯化DNA。该方法每个克隆可得到25μg DNA。
纯化DNA样品用ABI染料终止子化学测序。在ABI377机器上进行ABI染料终止子序列阅读,对凝胶进行泳道跟踪后将数据直接转入UNIX机器。用PHARP(P.Green,Abstracts of DOE Human Genome ProgramContractor-Grantee Workshop V,Jan.1996,p.157)以默认参数和质量评分汇编所有阅读结果。以6倍覆盖进行初步汇编产生平均8-15个重叠群。初步汇编后,鉴定未配上的序列(只给出一条链阅读结果的克隆的序列)并用ABI技术测序以鉴定其它重叠群。用GenomeTherapeutics程序Pick_primer在克隆末端选择步行(walking)引物以帮助填充空位。用所选克隆和引物进行步行测序,再次用PHARP将数据重新汇编至序列重叠群。VIII.由计算方法鉴定基因
将BAC序列汇编入重叠群后,对重叠群进行计算分析以鉴定编码区和具有与已知基因相似的DNA序列的区域。该方法包括以下步骤:
1.去除重叠群空位:序列重叠群经常包含一些标志(以句号表
示),它们代表各个ABI序列阅读结果有插入或缺失的地方。在
对重叠群自动化计算分析之前将句号除去。保留原始数据供以后
参考。
2.通过使用交叉匹配程序(Phil Green,
http:\\chimera.biotech.Washington.edu\UWGC)将BAC载体序
列“掩盖”在序列内。因为前面详述的鸟枪文库构建留下一些BAC
载体在鸟枪文库内。该程序用于比较BAC重叠群和BAC载体序列
并在进行下一步之前掩盖任何载体序列。在序列文档中的掩盖序
列以“X”表示,且在随后的分析中保持惰性。
3.通过将BAC重叠群与全部大肠杆菌DNA序列比较,掩盖污染
BAC序列的大肠杆菌序列。
4.用交叉匹配掩盖人基因组中常见的已知重复元件。执行交叉
匹配时,将BAC序列与人重复元件数据库(Jerzy Jerka,Genetic
Information Research Institute,Palo Alto,CA)比较。掩
盖的重复序列以“X”表示且在随后的分析中保持惰性。
5.用MZEF计算机程序(Zhang,Proc.Natl.Acad.Sci.,
94:565-568(1997))预测序列内外显子的位置。
6.用blastn2算法(Altschul et al,Nucl.Acids Res.,
25:3389-3402(1997))将序列与公共单基因数据库(National
Center for Biotechnology Information,National Library of
Medicine,38A,8N905,8600 Rockville Pike,Bethesda,MD20894;
www.ncbi.nlm.nih.gov)比较。该搜索参数是:E=0.05,v=50,
B=50(其中E是期望概率评分截止点,V是结果报告中返回的数
据库项目数,B是结果报告中返回的序列比对数(Altschul et al,
J.Mol.Biol.,215:403-410(1990))。
7.将序列的所有6个读码框翻译成蛋白质,并将蛋白序列与
Genpept Swissprot PIR编辑的非冗余蛋白质数据库(National
Center for Biotechnology Information,National Library of
Medicine,38A,8N905,8600 Rockville Pike,Bethesda,MD 20894;
www.ncbi.nlm.nih.gov))比较。该搜索参数是E=0.05,V=50,
B=50,其中E、V和B如上定义。
8.用blastn2(Altschul et al,Nucl.Acids Res.,25:3389-3402
(1997))将BAC DNA序列与直接选择实验(下述)产生的cDNA
克隆数据库比较。该搜索参数是E=0.05,V=250,B=250,其中E、
V和B如上定义。
9.用blastn2(Altschul et al,Nucl.Acids Res.,25:3389-3402
(1997))将BAC序列与染色体11q12-13上的HBM区域的所有其
它BAC序列比较。该搜索参数是E=0.05,V=50,B=50,其中E、
V和B如上定义。
10.用blastn2(Altschul et al,Nucl.Acids Res.,25:3389-3402
(1997))将BAC序列与染色体11q12-13上的HBM区域的BAC末
端来源的序列比较。该搜索参数是E=0.05,V=50,B=50,其中E、
V和B如上定义。
11.用blastn2(Altschul et al,Nucl.Acids Res.,25:3389-3402
(1997))将BAC序列与Genbank数据库(National Center for
Biotechnology Information,National Library of Medicine,
38A,8N905,8600 Rockville Pike,Bethesda,MD 20894;
www.ncbi.nlm.nih.gov))比较。该搜索参数是E=0.05,V=50,
B=50,其中E、V和B如上定义。
12.用blastn2(Altschul et al,Nucl.Acids Res.,25:3389-3402
(1997))将BAC序列与Genbank数据库(National Center for
Biotechnology Information,National Library of Medicine,
38A,8N905,8600 Rockville Pike,Bethesda,MD 20894;
www.ncbi.nlm.nih.gov))的STS分区比较。该搜索参数是
E=0.05,V=50,B=50,其中E、V和B如上定义。
13.用blastn2(Altschul et al,Nucl.Acids Res.,25:3389-3402
(1997))将BAC序列与表达序列标签(EST)Genbank数据库
(National Center for Biotechnology Information,National
Library of Medicine,38A,8N905,8600 Rockville Pike,
Bethesda,MD 20894;www.ncbi.nlm.nih.gov))比较。该搜索
参数是E=0.05,V=250,B=250,其中E、V和B如上定义。IX.通过直接cDNA选择鉴定基因
从骨髓、颅骨、股骨、肾、骨骼肌、睾丸和全脑中制备初步带接头的(primary linkered)cDNA库。由颅骨和股骨组织制备poly(A)+RNA(Chomczynski et al,Anal.Biochem.,162:156-159(1987);D’Alessio et al,Focus,9:1-4(1987)),其它mRNA购自Clontech(Palo Alto,California)。为从相同组织中产生oligo(dT)和随机引发的cDNA库,在一个反应中混合2.5μg mRNA和oligo(dT)引物,在另一反应中混合2.5μg mRNA和随机六聚体,根据制造商推荐(LifeTechnologies,Bethesda,MD)将它们都转为第一链和第二链cDNA。通过1∶1(每种10μg)混合,在65℃孵育5分钟并使之冷却到室温,使配对的磷酸化cDNA接头(见下面序列)一起退火。
配对接头oligo1/2
OLIGO 1:5’CTG AGC GGA ATT CGT GAG ACC3’(SEQ ID NO:12)
OLIGO 2:5’TTG GTC TCA CGT ATT CCG CTC GA3’(SEQ ID NO:13)配对接头oligo3/4
OLIGO 3:5’CTC GAG AAT TCT GGA TCC TC3’(SEQ ID NO:14)
OLIGO 4:5’TTG AGG ATC CAG AAT TCT CGA G3’(SEQ ID NO:15)配对接头oligo5/6
OLIGO 5:5’TGT ATG CGA ATT CGC TGC GCG3’(SEQ ID NO:16)
OLIGO 6:5’TTC GCG CAG CGA ATT CGC ATA CA3’(SEQ ID NO:17)配对接头oligo7/8
OLIGO 7:5’GTC CAC TGA ATT CTC AGT GAG3’(SEQ ID NO:18)
OLIGO 8:5’TTG TCA CTG AGA ATT CAG TGG AC3’(SEQ ID NO:19)配对接头oligo11/12
OLIGO 11:5’GAA TCC GAA TTC CTG GTC AGC3’(SEQ ID NO:20)
OLIGO 12:5’TTG CTG ACC AGG AAT TCG GAT TC3’(SEQ ID NO:21)根据制造商的说明(Life Technologies,Bethesda,MD)将接头连接到所有oligo(dT)和随机引发的cDNA库(见下)。
将oligo1/2连接到由骨髓制备的oligo(dT)和随机引发的cDNA库上,oligo3/4连接到由颅骨制备的oligo(dT)和随机引发的cDNA库上,oligo5/6连接到由骨骼肌制备的oligo(dT)和随机引发的cDNA库上,oligo7/8连接到由肾制备的oligo(dT)和随机引发的cDNA库上,oligo11/12连接到由股骨制备的oligo(dT)和随机引发的cDNA库上。
分别用1μl连接反应的1∶1、1∶10和1∶100稀释物进行PCR扩增以估计cDNA库的长度分布。PCR反应在Perkin Elmer 9600上进行,每25μl反应体系中含1μl DNA、10mM Tris-HCl(pH8.3)、50mMKCl、1.5mM MgCl2、0.001%明胶、200mM每种dNTP、10μM引物和1单位Taq DNA聚合酶(Perkin Elmer),并在以下条件下扩增:94℃30秒、60℃30秒和72℃2分钟,30个循环。扩增cDNA库的长度分布用1%琼脂糖凝胶电泳评估。将能最好代表随机引发和oligo(dT)引发的cDNA库的PCR反应放大规模,由此得到约2-3μg的每种cDNA库。直接选择反应的起始cDNA含有0.5μg随机引发的cDNA和0.5μgoligo(dT)引发的cDNA。
直接cDNA选择程序中使用的54个BAC来源的DNA使用NucleobondAX柱按照制造商所述(The Nest Group,Inc.)进行分离。
以等摩尔量合并BAC,根据制造商说明(Boehringer Mannheim)以切口平移方法用生物素16-UTP标记1μg分离的基因组DNA。按本领域技术人员掌握的方法(Del Mastro and Lovett,Methods inMolecular Biology,Humana Press Inc.,NJ(1996))检测生物素结合。
用本领域技术人员掌握的方法(Del Mastro and Lovett,Methodsin Molecular Biology,Humana Press Inc.,NJ(1996))进行直接cDNA选择。简言之就是在两个独立的反应中扩增cDNA库:在一个反应中混合来自骨髓、颅骨、脑和睾丸的cDNA库,在另一个反应中混合来自骨骼肌、肾和股骨的cDNA库。CDNA库中重复序列、酵母序列和质粒的抑制进行到Cot为20。100ng生物素化的BAC DNA与抑制后的cDNA混合并在溶液中杂交到Cot为200。用包被链亲合素的顺磁珠捕获生物素化的DNA和同源cDNA。清洗磁珠并洗脱初步选择的cDNA。将这些cDNA进行PCR扩增并进行第二轮直接选择。第二轮直接选择的产物称为二次选择物质。Galanin cDNA克隆已显示作图在11q12-13上(Evans,Genomics,18:473-477(1993)),用其监测第二轮选择的富集情况。
根据制造商推荐,来自骨髓、颅骨、股骨、肾、骨骼肌、睾丸和全脑的二次选择物质用如下修饰引物oligo1,3,5,7和11作PCR扩增,并克隆到UDG载体pAMP10(Life Technologies,Bethesda,MD)上。修饰引物序列:oligo1-CUA:5’CUA CUA CUA CUA CTG AGC GGA ATT CGT GAG ACC3’(SEQID NO:22)oligo3-CUA:5’CUA CUA CUA CUA CTC GAG AAT TCT GGA TCC TC3’(SEQID NO:23)oligo5-CUA:5’CUA CUA CUA CUA TGT ATG CGA ATT CGC TGC GCG3’(SEQID NO:24)oligo7-CUA:5’CUA CUA CUA CUA GTC CAC TGA ATT CTC AGT GAG3’(SEQID NO:25)oligo11-CUA:5’CUA CUA CUA CUA GAA TCC GAA TTC CTG GTC AGC3’(SEQID NO:26)
根据制造商推荐,将由每种组织来源的克隆的二次选择物质转化入最大效率的DH5α感受态细胞(Life Technologies,Bethesda,MD)。从每个转化源挑384个菌落并转入4个96孔微滴板。用M13染料引物终止循环测序试剂盒(Applied Biosystems)将所有二次选择的cDNA克隆测序,并用ABI 377自动荧光测序仪(Applied Biosystems)收集数据。用BLASTN、BLASTX和FASTA程序(Altschul et al,J.Mol.Biol.,215:403-410(1990),Altschul et al,Nucl.Acids.Res.,25:3389-3402(1997))分析所有序列。将cDNA序列与包含来源于人重复序列、线粒体DNA、核糖体RNA和大肠杆菌DNA的数据库比较,以用交叉匹配程序从数据组中除去背景克隆。用BLASTN2程序对已知基因(Genbank)和HBM区的BAC序列进行再一轮比较。根据结果将与这些序列的同源性大于90%的cDNA存档,将数据保存于数据库中供进一步分析。已鉴定但与HBM区域BAC序列没有显著相似性的cDNA序列或由交叉匹配删除的cDNA序列与含HBM区BAC的尼龙膜杂交,以确认它们是否与靶序列杂交。
杂交分析用于将cDNA克隆作图于选择它们的BAC靶上。已鉴定来自HBM区的BAC排成阵列并在96孔微滴板上生长。含25μg/ml卡那霉素的LB琼脂倒入96孔微滴板的盖子上,一旦琼脂固化就立即将切好的Hybond N+尼龙膜(Amersham)放到琼脂上面,用手持96孔平板复制仪(V&P Scientific,Inc.)将BAC以复制方式印到膜上。平板37℃培养过夜。根据制造商推荐处理膜。
用能够扩增该克隆的相关引物(oligo1,3,5,7和11)PCR扩增需要进行杂交图谱分析的cDNA。在该PCR扩增中,修饰引物使之在寡核苷酸5’端包含一个有接头的地高辛分子。按制备cDNA库时所用方法(如上)在相同条件进行PCR扩增。加5μl PCR反应体系,用1%琼脂糖凝胶电泳评估PCR产物的质和量。含BAC印的尼龙膜每张单独在含10ml杂交液(5×SSPE,0.5×Blotto,2.5%SDS和1mMEDTA(pH8.0))的50ml锥形管中预杂交。将50ml锥形管放入65℃电烤箱(Robbins Scientific)中2小时。每种cDNA探针中的25ng变性,并加入含尼龙膜和杂交液的独立50ml锥形管中。65℃杂交过夜。在以下每种溶液中65℃洗滤膜20分钟:3x SSPE,0.1% SDS;1xSSPE,0.1% SDS和0.1x SSPE,0.1% SDS。
从50ml锥形管中取出膜并放入盘中。在每张膜之间放入醋酸纤维素片防止膜之间互相粘住。根据制造商推荐(Boehringer Mannheim)用Anti-DIG-AP和CDP-Star孵育膜。用Saran包裹膜并对KodakBio-Max X-射线胶片曝光1小时。X.cDNA克隆和表达分析
为表征由直接cDNA选择鉴定的基因表达以及与公共数据库比较基因组DNA测序结果,进行一系列试验进一步表征HBM区域的基因。首先,设计寡核苷酸引物用于聚合酶链式反应(PCR),由此可从DNA分子库(cDNA文库)或RNA群体(RT-PCR和RACE)中扩增部分cDNA、EST或基因组DNA。在含基因组DNA的反应体系中使用PCR引物以验证它们可产生根据基因组(BAC)序列预测大小的产物。然后检查cDNA文库中是否存在特异性cDNA或EST。特定cDNA文库中转录单位片段的存在表明有很大可能性,相同转录单位的其它部分也存在。
表征新基因需要的关键数据是加工后转录物或信使RNA(mRNA)的核苷酸长度。本领域技术人员主要通过DNA印迹杂交(Sambrook etal,Molecular Cloning:A Laboratory Manual,Cold Spring HarborLaboratory,Cold Spring Harbor NY(1989))确定mRNA的长度。为了方便,将与测序的关键区BAC表现出有显著序列相似性的多组EST和直接选择cDNA克隆,分成大约30kb的单位。每个30kb单位内有1到50个EST和直接选择的cDNA克隆,其中包含一或多个独立的转录单位。根据制造商推荐的条件,采用商品试剂(Multiple Tissue DNAblot;Clontech,Palo Alto,California),用一或多个EST或直接选择的cDNA作为杂交探针确定多种组织中的mRNA长度。
通过本领域技术人员熟悉的方法(例如,Soares,in Automated DNASequencing and Analysis,Adams,Fields and Venter,Eds.,AcademicPress,NY,pages 110-114(1994)),从股骨和颅骨组织中构建定向克隆的cDNA文库。先用锤子将骨初步破碎成碎片,在液氮中冷冻小片并在组织粉碎机(Spectrum Laboratory Products)中使之成为粉末。用标准酸性硫氰酸胍-酚-氯仿提取缓冲液(例如,Chomczynskiand Sacchi,AnaI.Biochem.,162:156-159(1987)),通过用polytron匀浆器(Brinkman Instruments)将骨粉匀浆而从骨粉中提取RNA。此外,人脑和肺总RNA购自Clontech。根据制造商推荐(Dynal,Inc.)用dynabeads-dT从总RNA中分离polyA RNA。
用以下序列的寡核苷酸引物起始第一链cDNA合成:5’-AACTGGAAGAATTC
GCGGCCGCAGGAATTTTTTTTTTTTTTTTTT-3’(SEQ IDNO:27)。该引物在cDNA的3’端引入一个NotI限制性位点(下划线)。根据制造商(Life Technologies,Bethesda,MD)所述,用“一管”cDNA合成试剂盒进行第一链和第二链合成。用T4多核苷酸激酶处理双链cDNA以确保分子末端为钝端(Soares in Automated DNASequencing and Analysis,Adams,Fields and Venter,Eds.,AcademicPress,NY,pages 110-114(1994)),然后用Biogel柱(Huynh et alin DNA cloning,Vol.1,Glover,Ed.,IRL Press,0xford,pages49-78(1985))或用size-sep 400琼脂糖柱(Pharmacia,catalog #27-5105-01)根据大小选择钝端cDNA。只有400bp或更长的cDNA用于随后步骤。然后通过本领域技术人员熟悉的方法(Soares 1994)将EcoRI接头(序列:5’OH-AATTCGGCACGAG-OH 3’(SEQ ID NO:28),和5’p-CTCGTGCCG-OH 3’(SEQ ID NO:29))连接到双链cDNA上,然后通过NotI消化从cDNA的3’端除去EcoRI接头(Soares,1994)。将cDNA连接到质粒载体pBluescript II KS+(Stratagene,La Jolla,California)上,由本领域技术人员熟悉的电穿孔方法(Soares,1994)将连接物转化到大肠杆菌宿主DH10B或DH12S。37℃生长过夜,按Mega-prep试剂盒(Qiagen,Chatsworth,California)给出的方法刮平板从大肠杆菌菌落中回收DNA。通过计数一部分原始转化体总数并确定平均插入片段大小以及无cDNA插入片段质粒的百分比,来估计cDNA文库的质量。其它cDNA文库(人全脑、心、肾、白细胞和胎脑)购买自Life Technologies,Bethesda,MD。
由oligo(dT)和随机六聚体(N6)引发的cDNA文库都用于分离HBM区域内转录的cDNA克隆:人骨、人脑、人肾和人骨骼肌(除了骨骼肌(dT)和肾(dT)cDNA文库外,所有cDNA文库由发明人制造)。按如下方法制备每个cDNA文库的4个10×10阵列:用原始转化体将cDNA文库滴到2.5×106。用适当体积的冷冻原液接种2L LB/氨苄青霉素(100mg/ml)。将接种的液体培养物等分成400管,每管4ml,每管中大约含5000cfu。温和搅拌下30℃孵育试管过夜,培养物生长到OD为0.7-0.9。等分100μl培养物和300μl 80%甘油从而制备每种培养物的冷冻原液。将原液冻于干冰/乙醇浴并保存于-70℃。根据制造商说明用Qiagen(Chatsworth,CA)旋转离心小量制备试剂盒,将剩余的培养物用于制备DNA。来自400个培养物的DNA合并成80个行和列。通过PCR鉴定cDNA文库是否含感兴趣的HBM cDNA克隆。设计标记用于扩增推测的外显子。一旦标准PCR优化完成及判断出特异性cDNA文库是否含有感兴趣的cDNA克隆,就用标记筛选文库阵列。表明其中存在cDNA克隆的阳性地址用相同的标记做第二轮PCR进行确证。
一旦鉴定cDNA文库可能含有对应来自HBM区的感兴趣的特异性转录物的cDNA克隆,则分离含与EST或直接选择的感兴趣cDNA相同的cDNA插入片段的克隆。这通过标准“菌落筛选”的改进方法完成(Sambrook et al,Molecular Cloning:A Laboratory Manual,ColdSpring Harbor Laboratory,Cold Spring Harbor NY(1989))。具体来说,20个150mm的LB+氨苄青霉素琼脂平板上分布有20,000个菌落形成单位(cfu)的cDNA文库,让菌落37℃生长过夜。将菌落转移到尼龙滤膜(Hybond from Amersham,或等价用品)上,基本按所述(Sambrook et al,Molecular Cloning:A Laboratory Manual,Cold Spring Harbor Laboratory,Cold Spring Harbor NY(1989))将两张滤膜按压在一起从而得到复制品。“主”平板再孵育6-8小时使菌落再长起来。按以下方法将细菌菌落DNA固定到尼龙滤膜上:滤膜依次用变性溶液(0.5 N NaOH,1.5 M NaCl)处理两分钟、中和溶液(0.5 M Tris-C1 pH8.0,1.5 M NaCl)处理两分钟(两次)。纸巾擦除时用2×SSC/0.1%SDS溶液洗1分钟,从而从滤膜上除去细菌菌落。空气干燥滤膜并在真空下80℃烤1-2小时。
由随机六聚体标记(Fineberg and Vogelstein,Anal.Biochem.,132:6-13(1983))或在反应中包括基因特异性引物和非随机六聚体(对小片段)来制备cDNA杂交探针。计算比活性,大于5×108cpm/108μg cDNA。在55℃的10mM Tris-C1 pH8.0,1M NaCl,1mM EDTA,0.1%SDS中预洗涤菌落膜30分钟,之后每张滤膜在大于2ml的6×SSC、50%去离子甲酰胺、2%SDS、5×Denhardt’s溶液和100mg/ml变性鲑鱼精子DNA中,在42℃下预杂交30分钟。而后将滤膜转入杂交液(6×SSC、2%SDS、5×Denhardt’s溶液和100mg/ml变性鲑鱼精子精DNA)中,其中含有变性的α32P-dCTP标记的cDNA探针,并在42℃孵育16-18小时。
孵育16-18小时后,恒速搅拌下在2×SSC、2%SDS中室温洗滤膜20分钟,随后在65℃洗两次,每次15分钟。再在0.5×SSC、0.5%SDS中65℃洗15分钟。将滤膜包裹于塑料包中对放射胶片曝光几小时到过夜。胶片显影后,将平板上的单个菌落与放射自显影匹配,从而可以将其挑入1ml含氨苄青霉素的LB培养液中。37℃振荡1-2小时,将溶液等分涂150mm平板进行再次筛选。二次筛选与初筛(上述)相同,但平板上含约250个菌落,这样单个菌落可清楚鉴定供挑选。
用放射性标记探针的菌落筛选获得cDNA克隆后,通过限制性内切酶切割、PCR和直接测序来表征克隆,以确保原始探针和分离克隆之间的序列一致性。为获得全长cDNA,再用已鉴定克隆末端的新序列探测文库。重复此过程,直至克隆cDNA的长度与DNA印迹分析中估计的全长匹配。
RT-PCR被用作另一种分离全长克隆的方法。用“Superscript OneStep RT-PCR”试剂盒(Life Technologies,Gaithersburg,MD)合成并扩增cDNA。方法包括加1.5μl RNA到以下体系中:25μl反应混合物,它是一种含MgSO4和dNTP的独家享有的缓冲混合物,1μl有义引物(10μM)和1μl反义引物(10μM),1μl逆转录酶和Taq DNA聚合酶混合物,和高压灭菌水,加到总反应混合物50μl。然后将反应体系放到热循环仪中进行一轮,50℃下15-30分钟,然后94℃下15秒,55-60℃下30秒和68-72℃下1分钟/kb预期产物,最后一轮72℃下5-10分钟。用琼脂糖凝胶分析样品。从凝胶上切出产物并从凝胶中纯化(GeneClean,Bio 101)。纯化产物克隆到pCTNR(GeneralContractor DNA Cloning System,5 Prime-3 Prime,Inc.)中并测序,以验证克隆特异于感兴趣的基因。
用Marathon cDNA扩增试剂盒(Clontech,Palo Alto,CA),按制造商说明进行cDNA末端快速扩增(RACE),作为克隆候选基因5’和3’末端的方法。首先进行第一链合成,其中样品总RNA与修饰的oligo(dT)引物混合,加热到70℃,在冰上冷却,随后加入:5×第一链缓冲液、10mM dNTP混合物和AMV逆转录酶(20U/μl),由此制备cDNA库。管子在42℃孵育1小时,然后将反应管放于冰上。对于第二链合成,直接向反应管中加入以下成分:5×第二链缓冲液、10mMdNTP混合物、无菌水、20×第二链酶混合物,反应管在16孵育1.5小时。向反应管中加入T4 DNA聚合酶并在16℃孵育45分钟。加入EDTA/糖元混合物使第二链合成终止。酚/氯仿提取样品并用醋酸铵沉淀。琼脂糖凝胶分析其大小分布从而检查cDNA库质量。然后将MarathoncDNA接头(Clontech)连接到cDNA末端。根据所选基因特异性引物(GSP)的方向,含引发位点的特异性接头允许扩增5’或3’末端。加一等分双链cDNA到以下试剂中:10μM Marathon cDNA接头、5×DNA连接缓冲液、T4DNA连接酶。反应于16℃孵育过夜,加热灭活以终止反应。加入以下体系至稀释的双链cDNA库进行PCR扩增:10×cDNAPCR反应缓冲液、10μm dNTP混合物、10μm GSP、10μm AP1引物(试剂盒)、50×Advantage cDNA聚合酶混合物。热循环条件如下:94℃30秒,5个循环的94℃5秒、72℃4分钟,5个循环的94℃5秒、70℃4分钟,23个循环的94℃5秒、68℃4分钟。第一轮PCR完成后,用GSP延伸到接头末端以产生接头引物结合位点,可观察到感兴趣的特异性cDNA呈指数扩增。通常再进行一轮巢式PCR以确证特异性cDNA。用琼脂糖凝胶分析RACE产物并纯化(GeneClean,BIO101)。然后将RACE产物克隆进pCTNR(General Contractor DNA CloningSystem,5’-3’,Inc.)中,鉴定DNA序列以验证克隆特异于感兴趣基因。XI.突变分析
用上述方法鉴定比较性基因并对每个基因的外显子进行突变检测分析。比较性DNA测序用于鉴定染色体11q12-13中HBM候选基因的多态性。从患者类淋巴母细胞系中扩增候选基因的DNA序列。
发明人开发了一种基于从候选区域扩增的PCR产物的直接DNA测序分析的方法,以寻找作为成因的多态性。该方法由三个阶段组成,其中使用不同亚组的HBM家系以发现分离的多态性和使用群体组评估多态性的频率。家系资源由单个建立者(founder)产生,导致一种假设,所有受影响个体将有相同的成因性多态性。
首先在HBM家系的一个亚组中筛选候选区域,它由先证者、女儿、和其母亲、父亲和兄弟组成。同时产生单染色体参考序列并用于比较。在此核心家系中母亲和女儿携带HBM多态性,从而能监测多态性传播。最终结果是筛选了两个HBM染色体和6个非HBM染色体。这样就可排除很多经常发生的等位基因,而只有在受影响个体上出现的等位基因通过到下一步分析。
在由另外两个核心家族组成的更大HBM家系中再次检验该原始家族中只随HBM表型分离的多态性。这些家系由5个HBM和3个未受影响个体组成。该组中HBM个体包括两个关键性交换个体,由此提供关键区域的中心粒侧和端粒侧边界。追踪这些个体和其受影响父母的多态性遗传可进一步提高关键区的精确度。最终该组共筛选出7个HBM染色体和17个非HBM染色体。
当在扩展组中给定多态性继续只随HBM表型分离时,再检验群体组(population panel)。这个84人的组由42个已知有正常骨矿密度的个体和42个已知不相关但未对骨矿密度分类的个体组成。正常骨矿质密度在BMD Z评分0的两个标准差内。第二组来自广泛使用的CEPH个体组。随后在完整HBM家系和更大群体中检查在此群体中发现很少见的任何分离多态性。
用聚合酶链式反应(PCR)从HBM家系DNA和单染色体对照中产生测序模板。染色体11q12-13上HBM区域内基因的酶促扩增用PCR完成,其中使用每个外显子的侧翼寡核苷酸以及每个基因推测的5’调控元件。选择引物来扩增每个外显子和在剪切点两侧的每个内含子内死亡15或更多个碱基对。所有PCR引物都做成嵌合体以促进染料引物测序。合成期间,将M13-21F(5’-GTAA CGA CGG CCA GT-3’)(SEQ ID NO:30)和-28REV(5’-AAC AGC TAT GAC CAT G-3’)(SEQID NO:31)引物结合位点分别加入每个正向和反向PCR引物的5’末端。50μl PCR中使用150ng基因组DNA和2U AmpliTaq、500nM引物和125μM dNTP。缓冲液和循环条件专门对应于每个引物组。TaqStart抗体(Clontech)用于热启动PCR使形成最少的引物二聚体。取10%产物用于琼脂糖凝胶检验。测序前用去离子水对适当样品1∶25稀释。
根据标准能量转移引物(Amersham)方法对每个PCR产物测序。所有反应在96孔板中进行。每个模板进行4个独立的反应,分别对应A、C、G和T。每反应包括2μl测序反应混合物和3μl稀释模板。用箔带热封板,放入热循环仪根据制造商推荐进行循环。循环后合并4个反应。转移3μl合并产物到新96孔板,并在每个孔中加入1μl制造商的加样染料。所有96孔加样过程在Hydra96孔加样仪上进行(Robbins Scientific,USA)。直接将1μl合并物加到48泳道凝胶上在ABI 377 DNA测序仪上以2.4kV跑10小时。
用Polyphred(华盛顿大学)汇编序列组并用Consed(华盛顿大学)观察。序列汇编成组,对一个特定靶区域这些组代表所有相关家系成员和对照。对三个阶段的每个阶段分别进行汇编。每个个体包括正向和反向阅读结果,以及单染色体模板和彩色注释参考序列阅读结果。Polyphred用紫旗标出潜在的多态位点。两个阅读器独立观察每次汇编并评价紫旗位点的有效性。
对核心家系的两个HBM影响个体和两个未受影响个体,评估了成熟mRNA上存在的共23个外显子和原始转录物的几个其它部分的杂合性。鉴定了25个SNP,如下表所示:
表4:Zmax1基因及其环境的单核苷酸多态性
外显子名称 | 位置 | 碱基改变 |
b200e21-h_Contig1_1.nt | 69169(309G) | C/A |
b200e21-h_Contig4_12.nt | 27402(309G) | A/G |
b200e21-h_Contig4_13.nt | 27841(309G) | T/C |
b200e21-h_Contig4_16.nt | 35600(309G) | A/G |
b200e21-h_Contig4_21.nt | 45619(309G) | G/A |
b200e21-h_Contig4_22.nt-a | 46018(309G) | T/G |
b200e21-h_Contig4_22.nt-b | 46093(309G) | T/G |
b200e21-h_Contig4_22.nt-c | 46190(309G) | A/G |
b200e21-h_Contig4_24.nt-a | 50993(309G) | T/C |
b200e21-h_Contig4_24.nt-b | 51124(309G) | C/T |
b200e21-h_Contig4_25.nt | 55461(309G) | C/T |
b200e21-h_Contig4_33.nt-a | 63645(309G) | C/A |
b200e21-h_Contig4_33.nt-b | 63646(309G) | A/C |
b200e21-h_Contig4_61.nt | 24809(309G) | T/G |
b200e21-h_Contig4_62.nt | 27837(309G) | T/C |
外显子名称 | 位置 | 碱基改变 |
b200e21-h_Contig4_63.nt-a | 31485(309G) | C/T |
b200e21-h_Contig4_63.nt-b | 31683(309G) | A/G |
b200e21-h_Contig4_9.nt | 24808(309G) | T/G |
b527d12-h_Contig030g_1.nt-a | 31340(308G) | T/C |
b527d12-h_Contig030g_1.nt-b | 32538(308G) | A/G |
b527d12-h_Contig080C_2.nt | 13224(308G) | A/G |
b527d12-h_Contig087C_1.nt | 21119(308G) | C/A |
b527d12-h_Contig087C_4.nt | 30497(308G) | G/A |
b527d12-h_Contig088C_4.nt | 24811(309G) | A/C |
b527d12-h_Contig89_1HP.nt | 68280(309G) | G/A |
除表4所示的多态性外另两个多态性也能出现于SEQ ID NO:2。它们是SEQ ID NO:2中位点2002的变化。在此位点可出现鸟嘌呤或腺嘌呤。该多态性表现沉默,不与氨基酸序列的任何改变相关。第二个变化在SEQ ID NO:2上的位点4059,对应胞嘧啶(C)变为胸腺嘧啶(T)。该多态性导致缬氨酸(V)向丙氨酸(A)的相应改变。其它多态性发现于候选基因外显子及相邻内含子序列。当出现于SEQ IDNO:2上时,表4中列出或上面讨论两个的任一个多态性或其组合也可以对骨量有轻微影响。
本发明涉及核酸序列,它是具有上面鉴定的点突变的SEQ ID NO:1的核酸序列。
优选地,本发明包括SEQ ID NO:2的核酸序列。具体来说,在所有HBM个体中Zmax1(HBM基因)的编码序列中582位上由G到T的碱基对取代被鉴定为杂合状态,而在未受影响个体(即,b527d12-h_Contig087C_1.nt)中没有发现。图5表示B527D12中重叠群的顺序。HBM基因的转录方向为从左到右。B527D12的重叠群308G的序列是HBM基因编码区的反向互补序列。因此,表4中所示重叠群308G中的相对多态性,作为由C到A的碱基取代互补于HBM基因中的由G到T的取代。该突变引起甘氨酸171取代为缬氨酸(G171V)。
通过检验不同组个体的DNA序列确证HBM多态性。HBM家系的所有成员中(38个个体),只在受影响(即骨量增加)个体(N=18)中观察到HBM多态性是杂合形式。在未受影响个体中(N=20)(BMDZ<2.0)从未观察到HBM多态性。为确定是否该多态性在HBM家系以外的个体中也总能观察到,对297个表型分析个体的HBM基因位点进行表征。在HBM多态性位点上没有杂合性。在一个未进行表型分析的对照组中,在64个个体中未观察到582位的杂合性。总的来说,这些数据证明,在表现高骨量表型的家系中观察到的多态性与Zmax1的582位G→T多态性强烈相关。进一步,这些结果结合下面所述的ASO结果,确定了HBM多态性与HBM表型遗传学分离,并且HBM多态性及其表型在一般群体中都很少见。XII.等位基因特异的寡核苷酸(ASO)分析
用特异于感兴趣外显子的引物对含HBM1多态性的扩增子进行PCR扩增。如下在96孔微滴板中对适当群体的个体进行PCR扩增。制备PCR反应体系(20μl),其中含1×Promega PCR缓冲液(Cat.# M1883,含1.5mM MgCl2),100mM dNTP,200nM PCR引物(1863F:CCAAGTTCTGAGAAGTCC和1864R:AATACCTGAAACCATACCTG),1UAmplitaq,和20ng基因组DNA,并在以下PCR条件下扩增:94℃,1分钟,(94℃30秒;58℃,30秒;72℃,1分钟)×35个循环),72℃,5分钟,4℃保持。然后加入加样染料和10μl产物在含1μg/ml溴化乙锭的1.5%琼脂糖凝胶中100-150V电泳5-10分钟。凝胶在变性溶液(1.5M NaCl,0.5N NaOH)中处理20分钟,用水简单漂洗。然后在1M Tris-HCl,pH7.5,1.5M NaCl中中和凝胶20分钟,并用水漂洗。在10×SSC中浸泡凝胶20分钟,并在10×SSC中过夜印迹到尼龙转移膜(Hybond N+- Amersham)上。用6×SSC漂洗滤膜10分钟并经UV交联。
等位基因特异的寡核苷酸(ASO)设计为在中间带多态性。寡核苷酸5’端无磷酸,购自Gibco BRL。寡核苷酸序列是
2326 Zmax1.ASO.g:AGACTGGG
GTGAGACGC
2327 Zmax1.ASO.t:CAGACTGGG
TTGAGACGCC
多态性核苷酸用下划线标出。为标记寡核苷酸,混合1.5μl 1μg/ml ASO oligo(2326.Zmax1.ASO.g或2327.Zmax1.ASO.t)、11μlddH2O、2μl 10×激酶正向缓冲液(kinase forward buffer)、5μlγ32P-ATP(6000Ci/mMol)和1μl T4多核苷酸激酶(10U/μl),反应体系在37℃孵育30-60分钟。然后将反应体系在95℃放置2分钟,再加30ml水。用G25微旋柱(Pharmacia)纯化探针。
在10ml 5×SSPE、5×Denhardt’s、2%SDS和100μg/ml变性的、超声处理的鲑鱼精子DNA中40℃对印迹预杂交2小时。激酶处理的oligo的全部反应混合物加到10ml新鲜杂交缓冲液(5×SSPE,5×Denhardt’s,2%SDS)中并在40℃杂交至少4小时到过夜。
所有洗涤都采用5×SSPE、0.1%SDS。第一次洗涤在45℃下15分钟;然后更换溶液并在50℃洗涤滤膜15分钟。用两个增感屏将滤膜对Kodak biomax胶片在-70℃曝光15分钟到1小时。必要时55℃洗涤滤膜15分钟再对滤膜曝光。在煮沸的0.1×SSC、0.1%SDS中洗涤10分钟至少3次以解吸滤膜。
将最佳捕获两个ASO等位基因特异性测定的两张胶片扫描进AdobePhotoshop,从而转成数字图像。在Graphic Converter中将图像互相重叠,然后评分并存储于FileMaker Pro 4.0(见图9)。
为确定种族多样性群体中HBM1等位基因频率,来自各种族的672个随机个体通过等位基因特异的寡核苷酸(ASO)方法进行分型。该群体包括96个CEPH祖父母(主要是高加索人),192个高加索人,192个非洲裔美国人,96个西班牙人和96个亚洲人个体。在任何这些个体中没有得到存在HBM1多态性的证据。总的来说,共911个个体用直接测序或ASO杂交分型;在HBM多态性位点都是纯合的GG(图14)。该信息表明HBM1等位基因在各种族群体中都很少见。
因而本发明提供一种快速鉴定带HBM1等位基因个体的方法。该方法能用于诊断和筛选个体对骨质疏松或其它骨疾病的易患性的领域。该测定也可用于鉴定其他带HBM1等位基因或此处所述其他多态性的个体。XIII.Zmax1细胞定位A. 非同位素原位杂交法检测大鼠胫骨中基因表达
原位杂交由Pathology Associates International(PAI),Frederick,MD进行。本研究目的是检测大鼠骨中表达Zmax1基因的特异性细胞类型,特别强调骨生长和重建领域。本研究所用Zmax1探针由人(HuZmax1)和小鼠(MsZmax1)cDNA产生,它们的序列一致性为87%。人和小鼠Zmax1与大鼠Zmax1的同源性未知。
例如,非同位素原位杂交检测基因表达按如下进行,但其它方法也是本领域技术人员已知的。从通过二氧化碳窒息法实施安乐死的6-8周龄的雌性Sprague Dawley大鼠中采集胫骨。死亡后立即去除远端,近端胫骨用液氮迅速冷冻于OCT包埋介质中。组织保存于-80℃冰箱。
按以下方法制备从cDNA扩增PCR产物的探针。用人LRP5(Genbank识别号ABO17498)和小鼠LRP5(Genbank识别号AFO64984)的公开序列,选择从cDNA克隆扩增PCR产物的引物。为最小化与LDL受体家族其它基因的交叉反应,从蛋白编码区的细胞内部分衍生PCR产物。用cDNA克隆作模板在50μl反应体积中进行PCR。PCR反应体系含1.5mM MgCl2、1单位Amplitaq、200μM dNTP和2μM每种引物。PCR循环条件是94℃1分钟,随后35个循环的94℃30秒、55℃30秒、72℃30秒;72℃延伸5分钟。然后使反应体系跑1.5%琼脂糖Tris-乙酸凝胶。DNA从琼脂糖中洗脱,乙醇沉淀并重悬于10mM Tris,pH8.0。对小鼠和人cDNA都制备凝胶纯化的PCR产物并供给PathologyAssociates International作原位杂交。
人和小鼠PCR引物及其产物的序列如下:人Zmax1有义引物(HBM1253)CCCGTGTGCTCCGCCGCCCAGTTC人Zmax1反义引物(HBM1465)GGCTCACGGAGCTCATCATGGACTT人Zmax1 PCR产物CCCGTGTGCTCCGCCGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGACCTGCGCCTGCGCTGCGACGGCGAGGCAGACTGTCAGGACCGCTCAGACGAGGTGGACTGTGACGCCATCTGCCTGCCCAACCAGTTCCGGTGTGCGAGCGGCCAGTGTGTCCTCATCAAACAGCAGTGCGACTCCTTCCCCGACTGTATCGACGGCTCCGACGAGCTCATGTGTGAAATCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACAGCAGTGCCATCGGGCCCGTCATTGGCATCATCCTCTCTCTCTTCGTCATGGGTGGTGTCTATTTTGTGTGCCAGCGCGTGGTGTGCCAGCGCTATGCGGGGGCCAACGGGCCCTTCCCGCACGAGTATGTCAGCGGGACCCCGCACGTGCCCCTCAATTTCATAGCCCCGGGCGGTTCCCAGCATGGCCCCTTCACAGGCATCGCATGCGGAAAGTCCATGATGAGCTCCGTGAGCC小鼠Zmax1有义引物(HBM1655)AGCGAGGCCACCATCCACAGG小鼠Zmax1反义引物(HBM1656)TCGCTGGTCGGCATAATCAAT小鼠Zmax1 PCR物AGCAGAGCCACCATCCACAGGATCTCCCTGGAGACTAACAACAACGATGTGGCTATCCCACTCACGGGTGTCAAAGAGGCCTCTGCACTGGACTTTGATGTGTCCAACAATCACATCTACTGGACTGATGTTAGCCTCAAGACGATCAGCCGAGCCTTCATGAATGGGAGCTCAGTGGAGCACGTGATTGAGTTTGGCCTCGACTACCCTGAAGGAATGGCTGTGGACTGGATGGGCAAGAACCTCTATTGGGCGGACACAGGGACCAACAGGATTGAGGTGGCCCGGCTGGATGGGCAGTTCCGGCAGGTGCTTGTGTGGAGAGACCTTGACAACCCCAGGTCTCTGGCTCTGGATCCTACTAAAGGCTACATCTACTGGACTGAGTGGGGTGGCAAGCCAAGGATTGTGCGGGCCTTCATGGATGGGACCAATTGTATGACACTGGTAGACAAGGTGGGCCGGGCCAACGACCTCACCATTGATTATGCCGACCAGCGA
如下合成Ribo探针(Riboprobe)。用嵌合引物再扩增PCR产物,其中嵌合引物设计成掺入再扩增产物的或者T3启动子上游,或者T7启动子下游。所得PCR产物用作通过体外转录(IVT)合成地高辛标记的Ribo探针的模板。用MAXIscript IVT试剂盒(Ambion),根据制造商的说明,在存在地高辛-11-UTP(Boehringer-Mannheim)的条件下,分别用T7和T3 RNA聚合酶合成反义和有义Ribo探针。然后用DNA酶I降解DNA,并通过超滤除去未掺入的地高辛。用变性聚丙烯酰胺凝胶电泳评价Ribo探针的完整性。分子量大小与100-1000碱基对(bp)的RNA序列梯(Ambion)的电泳迁移率比较。由印迹免疫化学评估探针产率和标记。Ribo探针分成5μl等份保存于-80℃。
如下进行原位杂交。在Jung CM3000低温恒温器(Leica)上将冷冻大鼠骨切成5μM切片并包埋于粘合剂玻片上(Instrumedics)。切片在低温恒温器中保存于-20℃,直至所有玻片都制备好,以免在4%多聚甲醛中后固定15分钟之前的mRNA降解。后固定后,切片与1ng/μl反义或有义RNA探针在Pathology AssociatesInternational(PAI)客户定制杂交缓冲液中一起于58℃孵育约40个小时。杂交后玻片经一系列杂交后严格洗涤以减少非特异探针结合。用偶联碱性磷酸酶的抗地高辛抗体(FAB片段)以免疫组织化学法对杂交物显色。氯化硝基蓝四唑/溴氯吲哚磷酸酯(BoehringMannheim)是一种沉淀性碱性磷酸酶底物,用其作为色素原将杂交细胞染成紫色到近黑色,这取决于染色程度。用核牢固红(nuclearfast red)对组织切片负染。试验对照包括不加探针、和不加探针和抗地高辛抗体。
对特异性细胞类型进行评价,证明与反义探针杂交,其中由显示紫到黑色的细胞质和/或核周染色指示mRNA的阳性杂交信号。将复制切片与对应的有义探针杂交,从而将每种细胞类型与复制切片比较。如果采用反义探针时观察到染色而用有义探针无染色或弱本底,则认为结果为阳性。
每种研究探针的杂交信号的细胞定位总结于表5。对Zmax1的杂交主要在重建涉及的骨区域中被检测到,它包括干骺端内的骨内膜和骨小梁。在骨膜和骨骺的所选骨衬细胞中也观察到杂交。骨生长板内的软骨细胞中,特别是增殖性软骨细胞中,也有阳性信号。原位杂交结果的代表性显微照片见图10、11和12。表5大鼠胫骨Zmax1原位杂交总结
图标:“+”=检测到杂交信号,“-”=未检测到杂交信号,“ISH”=原位杂交
探针 | 位点 | ISH信号 |
人Zmax1 | 骺 | |
成骨细胞 | + | |
破骨细胞 | - | |
生长板 | ||
静止软骨细胞 | - | |
增殖性软骨细胞 | + | |
肥大软骨细胞 | - | |
干骺端 | ||
成骨细胞 | + | |
破骨细胞 | + | |
骨干 | - |
骨内膜 | ||
成骨细胞 | + | |
破骨细胞 | + | |
骨膜 | - | |
鼠Zmax1 | 骺 | |
成骨细胞 | + | |
破骨细胞 | - | |
生长板 | ||
静止软骨细胞 | - | |
增殖性软骨细胞 | + | |
肥大软骨细胞 | + | |
干骺端 | ||
成骨细胞 | + | |
破骨细胞 | + | |
骨干 | - | |
骨内膜 | ||
成骨细胞 | + | |
破骨细胞 | + | |
骨膜 | + |
这些研究确证了涉及骨重建和骨形成的细胞中Zmax1的表达定位。Zmax1表达于增殖区和邻近干骺端的成骨细胞和破骨细胞中,这提示Zmax1基因与骨生长和矿化过程有关。在骨形成发育期间和生长期间以及骨经历连续重建的成年人中,成骨细胞和破骨细胞的活性和分化密切协作。内部骨结构形成和骨重建产生于活化破骨细胞的骨重吸收及随后由成骨细胞沉积新物质的偶联。Zmax1与LDL受体基因相关,并因此可能是与骨重建过程中机械感觉和随后信号传递有关的一种受体。因此,该基因表达水平的改变可对骨重建速度和矿化度产生影响。XIV.反义
反义寡核苷酸是短的合成核酸,其中包含靶RNA的互补碱基序列。活细胞中RNA与反义寡核苷酸的杂交会干扰RNA功能并最终阻滞蛋白表达。因此,只要知道任何基因的部分序列,就可以用反义寡核苷酸对其进行导向。
反义技术正成为一种广泛运用的研究工具,并将在对基因组测序所鉴定的治疗靶位进行确认和阐明上发挥日益重要的作用。
开发反义技术是为了用与编码靶基因的mRNA互补的寡核苷酸抑制基因的表达。反义寡核苷酸的抑制效应有几种可能的机制。其中,RNA酶H导致的mRNA降解被认为是抑制蛋白功能的主要机制。这种技术最初用于阐明目的基因的功能,如果经仔细和合适的设计也可用于治疗用途。
制备反义寡核苷酸的材料和方法的一种实例如下。与Sequiter(Natick,MA)协作,用反义技术在鼠成骨细胞样细胞系MC3T3中进行了初步研究。这些细胞可被激发沿骨分化顺序发育。初始增殖阶段的特征为分化标记极少量表达并起始合成胶原类细胞外基质。胶原基质的合成是其后诱导分化标记必需的。一旦基质合成开始,成骨细胞标记基因就以一种明确的时间顺序被激活:碱性磷酸酶在早期被诱导产生,而骨唾液酸蛋白和骨钙蛋白在分化过程中晚一些时间产生。这个基因表达的时间顺序可用于监测骨的成熟和矿化过程。基质矿化是在成熟起始的几天后才开始的,包括矿物在胶原纤丝内和表面沉积并深入到细胞层-培养皿接触面附近的基质内。由培养的成骨细胞形成的胶原纤丝相关矿物质与体内编织骨中发现的相类似,因此,常用它作为研究物。
根据产品说明书(美国专利5,849,902),分化第一周MC3T3细胞用反义寡核苷酸转染。
Zmax1的寡核苷酸设计如下:10875:AGUACAGCUUCUUGCCAACCCAGUC10876:UCCUCCAGGUCGAUGGUCAGCCCAU10877:GUCUGAGUCCGAGUUCAAAUCCAGG图13给出了MC3T3细胞中Zmax1的反义抑制结果,上面三个寡核苷酸转染到MC3T3细胞中,RNA用标准方法进行分离。DNA分析清楚显示Zmax1转录物的状态明显较不稳定,而对照基因GAPDH保持不变。这样,使用上述引物的反义技术可用于骨生物学中Zmax1表达功能的研究。XV.酵母双杂交
为鉴定Zmax1参与的、调节骨密度的信号传递途径,而使用酵母双杂交蛋白相互作用技术。通过将待测蛋白偶联到酵母转录系统的成分中,该技术帮助鉴定与另一个蛋白相互作用的蛋白(Fields andSong,1989,Nature 340:245-246;Fields and Song,美国专利5,283,173;Johnston,1987,Microbiol.Rev.51:458-476;Keeganet al,1986,Science 231:699-704;Durfee et al,1993,Genes Dev.7:555-569;Chien et al,1991,Proc.Natl.Acad.Sci.USA88:9578-9582;Fields et al.,1994,Trends in Genetics 10:286-292;和Gyuris et al.,1993,Cell 75:791-803)。首先,将一种用于寻找其相互作用蛋白的“诱饵”蛋白,融合到酵母转录因子的DNA结合结构域。其次,构建cDNA文库,其中cDNA融合到同一酵母转录因子的转录活化结构域中,该文库被称为猎物文库。诱饵构建体和猎物文库被转化到酵母细胞然后交配以产生二倍体细胞。如果诱饵与cDNA文库中的特异性猎物相互作用,通过这种相互作用,活化结构域被带到启动子附近。而后启动转录,通过可选择标记和在选择性培养基上的生长来指示相互作用蛋白的存在。
用于此处讨论的酵母双杂交试验的氨基酸序列由完整细胞质结构域和部分跨膜结构域组成并表示如下(氨基向羧基方向):RVVCQRYAGA NGPFPHEYVS GTPHVPLNFI APGGSQHGPF TGIACGKSMM
SSVSLMGGRG GVPLYDRNHV TGASSSSSSS TKATLYPPI
L NPPPSPATDP
SLYNMDMFYS SNIPATVRPY RPYIIRG
MAP PTTPCSTDVC DSDYSASRWK
ASKYYLDLNS DSDP
YPPPPT PHSQYLSAE
D SCPPSPATER SYFHL
FPPPP
SPCTDSS
推定的跨膜结构域的最后6个氨基酸用粗体表示。推定的SH3结构域用下划线表示。由Zmax1或HBM等位基因编码蛋白的50个或更多氨基酸的其它氨基酸序列也可用作诱饵。所用诱饵多肽大小的上限仅受完整跨膜结构域存在的限制(见图4),在一种酵母双杂交系统中它将使诱饵失去功能。这些其它诱饵蛋白可用于鉴定,粘着斑信号传导途径中、或HBM或Zmax1蛋白可能参与的其它途径中,与HBM或Zmax1编码蛋白相互作用的蛋白。一旦鉴定,鉴定调节粘着斑信号传导途径或HBM参与的其它途径中蛋白的方法可以按此处所述进行,用于HBM和Zmax1蛋白。
为鉴定细胞质Zmax1信号传导途径,Zmax1胞质结构域被亚克隆到两个诱饵载体上。第一个载体是pDBleu,它用于筛选克隆到pPC86(Clontech)载体上的脑和Hela猎物cDNA文库。所用的第二个诱饵载体是pDBtrp,它用于筛选克隆于载体pOP46中的来源于TE85骨肉瘤细胞系的cDNA猎物文库。使用本领域技术人员已知的标准技术,它们描述于Fields and Song,1989,Nature 340:245-246;Fields andSong的美国专利5,283,173;Johnston,1987,Microbiol.Rev.51:458-476;Keegan et al,1986,Science 231:699-704;Durfee etal,1993,Genes Dev.7:555-569;Chien et al,1991,Proc.Natl.Acad.Sci.USA 88:9578-9582;Fields et al.,1994,Trends inGenetics 10:286-292;和Gyuris et al.,1993,Cel l75:791-803。用标准方法将诱饵构建体和猎物cDNA文库转化入酵母中。
为进行蛋白相互作用筛选,诱饵酵母株的过夜培养物生长于含2%葡萄糖的20ml SD选择培养基(pDBLeu,SD-Leu培养基,pDBtrp,SD-trp培养基)中。培养物于30℃剧烈振荡过夜。用完全培养基(含2%葡萄糖的YEPD)1∶10稀释培养物,然后将培养物于30℃振荡孵育2小时。
解冻冷冻猎物文库,通过在含2%葡萄糖的150ml YEPD培养基中于30℃生长2小时来再次活化酵母细胞。用70%乙醇对过滤器灭菌并用无菌水洗涤以除去乙醇。通过检测600nm处OD值测量诱饵和猎物培养物的细胞密度。将适当体积的酵母细胞、诱饵和猎物(文库)放入50ml Falcon管,酵母细胞的数目相当于每种酵母菌株细胞数为1ml OD 600=4。用无菌滤器过滤混合物。然后将滤膜转到预热的YEPD琼脂板,细胞面向上,除去滤膜下面所有气泡。板子在30℃孵育6小时。将一张滤膜转入50ml Falcon管,并加10ml含2%葡萄糖的SD;旋涡振荡10秒以重悬细胞。
测定原代二倍体细胞数(生长于SD-Leu,-trp板上)与只在SD-Trp和SD-Leu板上生长的菌落形成单位数。涂不同稀释度的平板并在30℃孵育两天。计算菌落形成单位数。二倍体菌落数(生长于SD-Leu-Trp板上的菌落)允许计算是否全部猎物构建体文库都与表达诱饵的酵母交配。此信息对判断筛选质量很重要。A. 间接选择
然后合并来自5个滤膜交配体的重悬细胞,在50ml Falcon管中离心沉淀细胞。然后将细胞重悬于16ml含2%葡萄糖的SD培养基中。对每种细胞(SD-Leu,-Trp)用无菌玻璃珠将2ml该细胞悬液涂于8个方形平板上,在30℃孵育18-20小时以选择二倍体细胞。
然后将细胞从方形平板上刮下,离心细胞并合并进一只50mlFalcon管。然后将细胞沉淀重悬于25ml含2%葡萄糖的SD培养基。用Neugebauer小室对适当稀释液(通常1∶100到1∶1000)计数从而确定细胞数。约5×107二倍体细胞平铺于选择培养基上。观察与不相关猎物载体一起生长的诱饵株有助于确定将必须选哪个选择性平板供文库筛选。一般来说,所有筛选物涂于一块方形平板上,每种分别是SD-Leu,-Trp,-His;推荐SD-Leu,-Trp,His,5mM 3AT,和SD-Leu,-Trp,-His,-Ade。
酵母细胞用无菌玻璃珠均匀铺展,并在30℃孵育4天。通过将刮下细胞悬液的不同稀释液涂于SD-Leu,-Trp平板上来计算形成菌落的酵母细胞数。通常涂100μl 10-3和10-4稀释液,每个平板产生100-1000个菌落。B. 直接选择
将5张带交配酵母细胞的滤膜每张分别转入一个50ml Falcon管,然后每管用10ml含2%葡萄糖的SD培养基重悬细胞,随后涡旋振荡10秒。合并重悬细胞并在Beckman离心机上以3000rpm离心。弃上清液,细胞重悬于6ml含2%葡萄糖的SD培养基。分别将两ml悬液铺展于选择性方形平板上并在30℃孵育4-5天。C. 分离单菌落
用无菌牙签从分离菌落中挑酵母细胞并分别转入96孔板的各个孔中。细胞重悬于50μl SD-Leu,-Trp,-His培养基中并在30℃孵育1天。然后将酵母细胞以96孔形式印于一块SD-Leu,-Trp,-His平板上并在30℃孵育2天。也将酵母细胞印于尼龙膜上,并覆盖于YEPD平板上在30℃孵育1天。尼龙滤膜上的细胞用于β-Gal报告基因活性分析。
用无菌牙签从SD-Leu,-Trp,-His平板上刮下酵母菌落,并在必要时根据β-Gal活性重新设置,然后重悬于20%甘油中。将其作为母平板保存于-80℃。
要制备DNA,将甘油保存的酵母细胞印于SD-Trp平板上并在30℃孵育2天。两天后,酵母菌落用于菌落PCR和测序。使用标准菌落PCR条件扩增从相互作用筛选中回收的猎物插入片段。用标准测序反应和ABI377(Perkin Elmer)荧光测序仪测序。D. 验证诱饵/猎物相互作用
解冻感兴趣猎物的甘油保存物并接种至10ml含葡萄糖的SD-Trp中培养过夜。生长过夜后,用BIO 101 RPM酵母质粒分离试剂盒从10ml培养物中制备质粒DNA。离心培养物并转入1.5ml微离心管。向沉淀中加入酵母裂解基质,再加250μl碱裂解溶液。样品涡旋5分钟,加入250μl中和溶液并将样品混匀。室温下在微离心机中离心样品2分钟,上清液转入旋转离心滤器,避免碎片和裂解基质混入。加入250μl Glassmilk旋转离心缓冲液,振荡混匀。样品离心1分钟,弃掉收集管中的液体。加入500μl洗涤液,离心样品1分钟,弃掉洗涤液。重复洗涤一次,随后干式离心1分钟以从旋转离心滤器中除去剩余的液体。滤器转入新收集管,加100μl无菌水;短时涡旋样品以重悬,并离心30秒在收集管底部收集DNA。
用标准方法和制备的甘油保存物将5μl DNA转化入DH10BElectromax细胞。用Qiagen QIAprep旋转离心小量制备试剂盒小量制备DNA。最后用30μl Qiagen EB缓冲液洗脱DNA。用标准方法将1μl质粒DNA样品转化酵母细胞,在SD-Trp培养基上生长两天后,挑选菌落并点斑于新鲜培养基上。与之相似,将诱饵菌落点斑于SD-Leu培养基上。两者30℃过夜培养。
要进行交配,将来自诱饵和猎物斑点的细胞一起铺展于YAPD培养基上并30℃孵育12小时。复制平板于SD琼脂-Leu-Trp板,30℃生长两天。要测定相互作用强度,将这些平板复制于SD琼脂-Leu-Trp-His、含5mM3AT和10mM 3AT的SD琼脂-Leu-Trp-His、SD琼脂-Leu-Trp-His-Ade和SD琼脂-Leu-Trp-Ura培养基上,30℃生长两天。E. Galacton Star β-半乳糖苷酶活性测定
对选择平板上的阳性相互作用子划线及复制铺板后,将菌落置入含200μl SD培养基的96孔皿,留1和96号孔为空白。从第一个96孔盘取10μl放入另一个含100μl SD培养基的平底96孔皿。对照包括阴性对照和极弱的阳性对照。在OD600测量细胞密度(用96孔光密度计,OD600=1时对应1×107个细胞)。OD通常在0.03到0.10之间。用光度计专用微板,将50μl反应混合物吸入每孔,再加50μl培养物并用洗液管上下吹吸两次。反应在室温孵育30分钟,随后用光度计测量相对光单位。
表6列出从3个测定的猎物文库中由酵母双杂交筛选鉴定的基因。发现两个基因,斑联蛋白和axin基因,在所有三组筛选中与Zmax1的胞质结构域有相互作用。三个基因,α-辅肌动蛋白、TCB和S1-5基因在三组筛选的两组中有相互作用。
在细胞-细胞和细胞-基质接触(粘着斑)位点发现多种蛋白,它们表现出与Zmax1的胞质结构域相互作用,包括:α-辅肌动蛋白、Trio、Pinch样蛋白和斑联蛋白。PINCH是一种含LIM结构域的蛋白,已知可与整联蛋白相关激酶相互作用,整联蛋白相关激酶是整联蛋白和生长因子信号传递途径中的一种早期信号传导蛋白。在酵母双杂交筛选中发现密切相关的基因,这提高了与从细胞外基质信号的整联蛋白信号传导相关的新传导途径的可能性。Trio也已知定位于粘着斑,被认为在细胞-基质相互作用和细胞运动相关的细胞骨架重排的协同中起关键作用。斑联蛋白是另一种含LIM结构域的蛋白,也定位于粘着斑,并被认为与经由整联蛋白信号传递途径的信号传递引发的细胞骨架重新组织相关。斑联蛋白也与α-辅肌动蛋白相互作用,我们已鉴定α-辅肌动蛋白与Zmax1相互作用。鉴定的其它含LIM结构域的蛋白包括小鼠ajuba的人同源物、LIMD1和一种新LIMD1样蛋白。
axin也在双杂交试验中被鉴定,该蛋白参与Wnt信号途径的抑制,并与肿瘤抑制物APC相互作用。此处与上述粘着斑信号传递有联系:两种途径的一个共同步骤涉及糖原合成酶激酶3的抑制,它接着导致β-连环蛋白/Lef-1和AP-1转录因子的活化。Axin/APC与此以及整联蛋白相关激酶相关。Wnt途径在胚胎发生中参与细胞命运决定。如果被不适当的活化,Wnt途径也可以导致癌症。Wnt途径也可能参与细胞骨架重排。描述Zmax1参与粘着斑信号传递的模型见图15。
结合其它研究,该数据提示整联蛋白信号传递途径参与细胞对机械压力和粘附的应答。这提供了关于在骨生物学中Zmax1作用机理的一种吸引人的模型。有可能Zmax1参与直接感觉机械压力或结合细胞外基质中的一种与机械感知有关的分子。由于对骨细胞的细胞形态、细胞粘附、迁移、增殖、分化和凋亡的影响,经由随后途径的信号传递可能参与骨重建。
表6:酵母双杂交结果
基因代码 | 基因 | Genbank识别号 | 核苷酸SEQ IDNO: | 氨基酸SEQ IDNO: |
ACTN1 | α辅肌动蛋白 | NM_001102 | 63 | |
AES | 氨基端增强子 | NM_001130.3 | 64 | |
AIP4 | Atrophin-1相互作用蛋白 | AF038564.1 | 65 | |
Nove1 | Ajuba | 66 | ||
AXIN | Wnt信号传递 | AF009674.1 | 67 | |
CDC23 | 细胞分裂周期23,酵母,同源物 | NM_004661.1 | 68 | |
HSM800944 | 与TRIO相似 | AL117435.1 | 69 | |
HSM800936 | AL117427.1 | 70 | ||
Nove1 | 与含LIM结构域的蛋白1相似 | 71 | ||
DEEPSET | 有丝分裂纺锤体卷曲相关蛋白 | NM_006461.1 | 72 | |
ECM1 | 细胞外基质蛋白1 | U65932.1 | 73 |
EF1A | 延长因子1-α | X16869.1 | 74 | |
FN | 纤连蛋白 | X02761.1 | 75 | |
HOXB13 | 同源域蛋白 | U81599.1 | 76 | |
Nove1 | 富含谷氨酸-酪氨酸的蛋白 | 77 | ||
LIMD1 | 含LIM结构域的蛋白1 | NM_014240.1 | 78 | |
Nove1 | PINCH样 | 79 | ||
RANBPM | 中心体蛋白 | NM_005493.1 | 80 | |
S1-5 | 细胞外蛋白 | U03877.1 | 81 | |
TCB | 编码胞质甲状腺素结合蛋白的基因 | M26252.1 | 82 | |
TID | 肿瘤器官芽 | NM_005147.1 | 83 | |
ZYX | 斑联蛋白 | NM_003461.1 | 84 | |
TRIO | GTP酶 | U42390.1 | 85 | |
HUMPITPB | 磷脂酰肌醇转移蛋白 | D30037.1 | 86 | |
ACTN1 | α辅肌动蛋白 | NP_001093.1 | 87 | |
AES | 氨基端增强子 | NP_001121.2 | 88 | |
AIP4 | Atrophin-1相互作用蛋白 | AAC04845.1 | 89 | |
Nove1 | Ajuba | 90 | ||
AXIN | Wnt信号传递 | AAC51624.1 | 91 | |
CDC23 | 细胞分裂周期23,酵母,同源物 | NP_004652.1 | 92 | |
Nove1 | 与TRIO相似(此处将第二列中的CAB55923.1移到第3列的同一排空格中) | 93 | ||
Nove1 | 与含LIM结构域的蛋白1相似 | 94 | ||
DEEPEST | 有丝分裂纺锤体卷曲相关蛋白 | NP_006452.1 | 95 | |
ECM1 | 细胞外基质蛋白1 | AAB05933.1 | 96 | |
EF1A | 延长因子1-α | CAA34756.1 | 97 | |
FN | 纤连蛋白 | CAA26536.1 | 98 | |
Nove1 | 同源域蛋白B13 | 99 | ||
HOXB13 | 富含谷氨酸-酪氨酸的蛋白 | AAB39863.1 | 100 | |
LIMD1 | 含LIM结构域的蛋白1 | NP_055055.1 | 101 | |
Nove1 | PINCH样 | 102 |
RANBPM | 中心体蛋白 | NP_005484.1 | 103 | |
S1-5 | 细胞外蛋白 | AAA65590.1 | 104 | |
TCB | 胞质甲状腺激素结合蛋白 | AAA36672.1 | 105 | |
TID | 肿瘤器官芽 | NP_005138.1 | 106 | |
ZYX | 斑联蛋白 | NP_003452.1 | 107 | |
TRIO | GTP酶 | AAC34245.1 | 108 | |
PTDINSTP | 磷脂酰肌醇转移蛋白 | P48739 | 109 |
根据图15所述模型和表6所示结果,本发明涉及的另一方面将是通过调节粘着斑信号传递来调节骨密度和骨量疾病。由酵母双杂交系统鉴定参与粘着斑信号传递途径的成员,可通过调节其中任何一个DNA、mRNA转录物或由其任何一个所编码的蛋白,从而调节骨密度和骨量疾病。
本发明也涉及由HBM酵母双杂交鉴定的新核酸及其蛋白,它们包括但不限于SEQ ID NO:66(Ajuba)、SEQ ID NO:71(与编码含LIM结构域的蛋白1的基因相似的一个基因)、SEQ ID NO:77(富含谷氨酸-赖氨酸的蛋白)、SEQ ID NO:79(PINCH样基因)、SEQ ID NO:90(Ajuba蛋白)、SEQ ID NO:93(与TRIO相似的蛋白)、SEQ ID NO:94()、SEQID NO:99(富含谷氨酸-赖氨酸的蛋白)和SEQ ID NO:102(PINCH样蛋白)。XVI.潜在功能
Zmax1编码的蛋白与低密度脂蛋白受体(LDL受体)相关。见Goldstein et al,Ann.Rev.Cell Biology,1:1-39(1985);Brown etal,Science,232:34-47(1986)。低密度脂蛋白受体负责摄取低密度脂蛋白,包含胆固醇的脂蛋白集合体。有低脂蛋白受体缺陷的个体胆固醇清除障碍,容易发生动脉硬化。此外,低脂蛋白受体缺陷的细胞表现出胆固醇合成增加,这一方面是因为胆固醇合成酶的反馈调节发生改变,另一方面是因为增加了这些酶的基因的转录。在有的细胞类型中,胆固醇是合成类固醇激素的前体。
这样,低密度脂蛋白受体可能直接或间接发挥信号转导蛋白的作用并可能调节基因的表达。因为Zmax1与低密度脂蛋白受体相关,这个蛋白也可能参与了骨重塑过程中细胞间的信号传导。
第171位甘氨酸很可能对Zmax1的功能十分重要,因为这个氨基酸在Zmax1的鼠同源基因中也有发现。紧密相关的LRP6蛋白在相应位点也有甘氨酸(Brown et al,Biochemical and Biophysical ResearchComm.,248:879-888(1998))。在蛋白结构或功能上具有重要意义的氨基酸在种间倾向于保守。因为自然选择阻止了重要位置氨基酸改变突变的增加。
此外,Zmax1的胞外结构域包括了四个重复,该重复由五个YWTD基序和一个紧接的EFG基序组成。该5YWTD+EFG重复和可能形成了一个独特的折叠蛋白结构域,因为这个重复在低密度脂蛋白受体和其它低密度脂蛋白受体相关蛋白中也能找到。前三个5YWTD+EFG重复在结构上十分相似,而第四个变化非常大。在Zmax1中第171位甘氨酸位于第一个5YWTD+EFG重复的YWTD基序中心。和低密度脂蛋白受体蛋白的5YWTD+EFG重复一样,Zmax1的其它两个相似的5YWTD+EFG重复在相应位点也有甘氨酸。然而,在Zmax1的前三个5YWTD+EFG重复和低密度脂蛋白受体的单个重复间仅有17.6%的氨基酸是相同的。这些发现表明,第171位甘氨酸对于这个重复的功能是必需的,第171位甘氨酸的突变会引起Zmax1的功能改变。CDNA和肽序列如图6A-6E所示。582位核苷酸的关键碱基用粗体和下划线标出。
DNA印迹分析(图7A-B)显示Zmax1在人的骨组织和许多其他组织中表达。多组织DNA印迹(Clontech,Palo Al to,CA)用来自于Zmax1的外显子进行标记。如图7A所示,5.5kb的Zmax1转录物在心、肾、肺、肝和胰腺高表达,在骨骼肌和脑中水平较低。第二个DNA印迹,如图7B所示,进一步证实转录物大小为5.5kb,而且显示Zmax1在骨、骨髓、颅盖和人成骨细胞系中表达。
总的来说,这些结果结合酵母双杂交结果表明:Zmax1基因中的HBM多态性是造成HBM表型的原因,Zmax1基因在骨发育中有重要作用。此外,由于Zmax1的突变会改变骨的矿化和发育,很可能与Zmax1结合的分子可有效地改变骨发育。这样的分子可能包括,例如:小分子、蛋白、RNA aptamers,肽aptamers等等。XVII.核酸制备、载体和宿主细胞转化
本发明中的大量核酸可以在适当的宿主细胞中通过复制产生。编码所需片段的天然或合成的核酸片段插入到重组核酸构建体中,通常为DNA构建体,它能够被插入原核或真核细胞或在其中复制。通常核酸构建体应适于在单细胞宿主,比如酵母或细菌中复制,但也可以将其导入(整合或不整合到基因组中)到培养的哺乳动物或植物或其它真核细胞中。核酸的纯化用当前发明所描述的方法进行,例如,Sambrooket al,Molecular Cloning,A Laboratory Manual,2nd Ed.(Cold SpringHarbor Laboratory,Cold Spring Harbor,NY(1989)or Ausubel etal,Current Protocols in Molecular Biology,J.Wiley and Sons,NY(1992)。
本发明中的核酸也可以通过化学合成,如:通过亚磷酰胺法,如Beaucage et al,Tetra.Letts.,22:1859-1862(1981)中所述,或三酯法,如Matteucci,et al,J.Am.Chem.Soc.,103:3185(1981)中所述,也可以通过商品化的自动寡核苷酸合成仪合成获得。双链片段可由化学合成的单链产物通过合成互补链并在适当的温度下一起退火获得,也可通过适当的引物序列用DNA聚合酶加入互补链。
导入原核或真核宿主的核酸构建体的制备可以包含宿主识别的复制系统,包括编码所需蛋白的核酸片段,优选也包括可操作地连接于蛋白编码片段的转录和翻译起始调控序列。表达载体可以包括如:复制起点或自主复制序列(ARS)和表达调控序列,启动子,增强子和必需的加工信息位点,如核糖体结合位点,RNA剪接位点,聚腺苷酸化位点,转录终止序列以及mRNA稳定序列。适合的时候也可包括分泌信号,它可来源于天然HBM或Zmax1蛋白或来自其它受体或来自相同或相关物种的分泌蛋白,它能允许蛋白穿过或停靠在细胞膜上,从而获得其功能构象,或分泌到细胞外。这样的载体可通过在本领域内公知和讨论的标准重组技术制备。例如,Sambrook et al,MolecularCloning,A Laboratory Manual,2nd Ed.(Cold Spring HarborLaboratory,Cold Spring Harbor,NY(1989)或Ausubel et al,CurrentProtocols in Molecular Biology,J.Wiley and Sons,NY(1992).
合适的启动子和其它必需的载体序列应进行选择使之在宿主中具有功能,在合适时可以包括那些在天然状态下与Zmax1和HBM基因相关的序列。细胞系的可用重组和表达载体的例子可见Sambrook et al,Molecular Cloning,A Laboratory Manual,2nd Ed.(Cold SpringHarbor Laboratory,Cold Spring Harbor,NY(1989)or Ausubel etal,Current Protocols in Molecular Biology,J.Wiley and Sons,NY(1992).许多本领域公知的载体可从Stratagene,New EnglandBioLabs,Promega Biotech和其它公司购得。启动子,比如trp,lac和嗜菌体启动子,tRNA启动子和糖分解酶启动子也可用于原核宿主。有效的酵母启动子包括金属硫蛋白启动子区,3-磷酸甘油激酶或其它糖分解酶,比如烯醇酶或3-磷酸甘油醛脱氢酶,以及负责麦芽糖和半乳糖利用的酶,以及其它。适合于酵母表达的载体和启动子在EP73,675A中有更深入的阐述。恰当的非天然的哺乳动物启动子应该包括来自于SV40的早期和晚期启动子(Fiers et al,Nature,273:113(1978))或从鼠莫洛尼氏白血病毒、小鼠瘤病毒、鸟类肉瘤病毒、腺病毒II型、牛乳头瘤病毒或多瘤病毒衍生的启动子。此外,构建体应与一个可扩增的基因(如:DHFR)相连,这样可以产生多个拷贝数的该基因。对于合适的增强子和其它表达调控序列可参考Enhancers and Eukaryotic Gene Expression,Cold Spring HarborPress,Cold Spring Harbor,NY(1983).
虽然这样的表达载体可以进行自主复制,它们也可以插入到宿主细胞基因组中进行复制,所用的方法是本领域内所公知的。
表达和克隆载体可含有可选择标记,它是一个编码蛋白的基因,对于用该载体转化的宿主细胞的存活和生长来说该蛋白是必需的。该基因的存在保证只有那些表达插入片段的宿主细胞能生长。典型的选择基因编码的蛋白a)提供对抗生素或其它有毒物质的抗性,比如氨苄青霉素、新霉素、氨甲蝶呤等;b)补充营养缺陷,或c)提供培养基中所没有的关键的营养物质,比如编码细菌D-丙氨酸消旋酶的基因。挑选适当的选择标记是根据宿主细胞来进行的,本领域内适合不同宿主的选择标记是公知的。
含有感兴趣核酸的载体可在体外进行转录,产生的RNA用公知的方法导入细胞,比如注射(见,Kubo et al,FEBS Letts.241:119(1988)),或用公知的方法将载体导入宿主细胞,所述方法依据宿主细胞的不同而有变化,包括电穿孔;用氯化钙、氯化铷、磷酸钙、DEAE-右旋糖苷和其它物质进行转染;微粒轰击;脂质体转染;感染(此时载体是一种感染剂,比如逆转录病毒基因组);以及其它方法。通常可参照:Sambrook et al.,1989和Ausubel et al.,1992.用本领域内公知的任何一种将核酸导入宿主细胞的方法,包括上面所提到的方法,在此都称为“转化”。上述导入了核酸的细胞在此也包括这些细胞的子代细胞。
本发明中大量的核酸和蛋白可通过在与原核或真核宿主细胞相容的载体或其它表达媒介物质中表达Zmax1或HBM或其部分来进行制备。虽然其它的原核生物,如枯草杆菌或假单胞菌也可选用,但最常用的原核表达宿主是大肠杆菌菌株。
哺乳动物或其它的真核宿主细胞,如酵母,丝状真菌,植物,昆虫或两栖类或鸟类均可用于获取本发明中的蛋白。通过培养来繁殖哺乳动物细胞就是大家所公知的。可参考Jakoby and Pastan(eds.),Cell Culture Methods in Enzymology,volume 58,Academic Pres ,Inc.,Harcourt Brace Jovanovich,NY,(1979)。常用的哺乳动物宿主细胞系有VERO和HeLa细胞、中国仓鼠卵巢(CHO)细胞、WI38、BHK和COS细胞系。本领域技术人员可以理解,其它细胞系也可能适合,比如提供所所需的糖基化或其它特征的更高表达。
根据载体构建的方式可用标记对克隆进行选择。标记可位于相同或不同的DNA分子上,优选位于相同的DNA分子上。在原核宿主中,转化体可通过对氨苄青霉素、四环素或其它抗生素的抗性进行选择。特定产物的产生基于对温度的敏感性,这一特征也可作为一种适当的标记。
本发明中用核酸对原核或真核细胞的转化不仅对本发明中核酸和蛋白的生成有用,而且例如也可用于研究Zmax1和HBM蛋白的特性。
本领域技术人员可以理解,反义核酸序列可用于抑制或减弱Zmax1和HBM的表达。比如,包含Zmax1或HBM基因全部或部分,或来自Zmax1或HBM区域的其它序列的核酸载体可以以反义方向置于一个启动子控制下并导入细胞。在细胞中表达这样的反义构建体将干扰Zmax1或HBM的转录和/或翻译和/或复制。
在此公开的根据Zmax1和HBM基因序列设计的探针和引物可用于识别其它物种中同源的Zmax1和HBM的基因序列和蛋白。这些Zmax1和HBM基因序列和蛋白可在其被分离的物种中用于这里描述的诊断/预后、治疗和药物筛选方法。XVIII.蛋白表达和纯化
本发明的HBM蛋白的表达和纯化基本上可用下述方法进行。为便于从HBM基因克隆、表达和纯化膜蛋白和分泌蛋白,选择了pET系统(Novagen)基因表达系统在大肠杆菌中进行重组蛋白的克隆和表达。同样,将编码肽标签的DNA序列His-Tap融合到感兴趣的DNA序列的3’末端以便于纯化重组蛋白产物。选择3’末端进行融合,以避免任何5’端信号序列的改变。
例如,从SEQ ID NOS:1,3和5-12所表示的核酸中选择用于克隆HBM的核酸是通过聚合酶链式反应(PCR)进行。HBM核苷酸序列5’和3’端特异的合成寡核苷酸引物设计并购自Life Technologies(Gaithersburg,MD)。所有正向引物(特异于序列的5’端)经设计在5’端包括一个Nco I克隆位点。这些引物设计为允许在Nco I位点中编码的蛋氨酸残基处起始蛋白翻译,紧接是一个缬氨酸残基以及由HBM序列编码的蛋白。所有的反向引物(特异于序列的3’端)在5’端包含一个EcoR I位点,从而可使HBM序列克隆到pET-28b的读码框中。pET-28b载体可提供编码额外的20个C-羧基端氨基酸的序列,这些氨基酸包括6个组氨酸残基(位于C-末端),构成了组氨酸亲和标签。
从HBM基因制备的基因组DNA用作PCR扩增的模板DNA来源(Ausubel et al,Current Protocols in Molecular Biology,J.Wileyand Sons,NY(1994))。要扩增一个包含HBM核苷酸序列的DNA序列,将50ng基因组DNA加入到含有2mM MgCl2;1μM位于确定的HBM基因侧面且与之互补的合成寡核苷酸引物(上游和下游引物);0.2mMdNTP,即dATP,dGTP,dCTP,dTTP;2.5单位热稳定的DNA聚合酶(Amplitaq,Roche Molecular System,Inc.,Branchburg,NJ)的反应管中,使终体积达到100μl。
热循环反应结束后,每一个扩增的DNA样品用Qiaquick Spin PCR纯化试剂盒(Qiagen,Gaithersburg,MD)进行纯化。所有扩增的DNA样品均用限制性内切酶,如:Nco I和EcoR I(New England BioLabs,Beverly,MA)进行消化(Ausubel et al,Current Protocols inMolecular Biology,J.Wiley and Sons,NY(1994))。然后DNA样品在1.0%NuSeiVe(FMC BioProducts,Rockland,ME)的琼脂糖凝胶中进行电泳。DNA暴露于溴化乙锭和长波紫外线下进行显影。包含在从琼脂糖凝胶中分离的切片中的DNA按Bio 101 GeneClean试剂盒方案(Bi 101,Vista,CA)进行纯化。
要进行克隆的pET-28b载体用限制性内切酶,如:NcoI和EcoRI(New England BioLabs,Beverly,MA)进行消化(Ausubel et al,Current Protocols in Molecular Biology,J.Wiley and Sons,NY(1994))。编码能够融合到插入基因的5’端的组氨酸亲和标签的pET-28a载体可用适合的限制性内切酶消化而进行制备。
消化后,DNA插入片段克隆入预先消化好的pET-28b表达载体中(Ausubel et al,Current Protocols in Molecular Biology,J.Wileyand Sons,NY(1994))。连接产物按下述方法用于转化BL21大肠杆菌菌株(Ausubel et al,Current Protocols in Molecular Biology,J.Wiley and Sons,NY(1994))。
感受态细菌,大肠杆菌BL21株或大肠杆菌BL21(DE3)株用带有克隆的HBM序列的重组pET表达质粒按标准方法(Ausubel et al,Current Protocols in Molecular Biology,J.Wiley and Sons,NY(1994))进行转化。简言之,1μl连接反应物加入到50μl电转感受态细胞中混匀,予高电压脉冲,然后样品在0.45ml SOC培养基(0.5%酵母提取物,2.0%胰蛋白胨,10mM NaCl,2.5mM KCl,10mM MgCl2,10mM MgSO4和20mM葡萄糖)37℃震摇1小时。然后将样品涂于含有25μg/ml硫酸卡那霉素的LB琼脂平板上培养过夜。转化的BL21克隆按下述方法挑出、分析,从而按如下方法评估克隆的插入片段。
单个的用重组的pET-28b HBM核苷酸序列转化的BL21克隆用PCR扩增法进行分析,使用与最初的PCR扩增克隆反应相同的特异于HBM的正向和反向引物。成功的扩增验证了HBM序列已整合到表达载体中(Ausubel et al,Current Protocols in Molecular Biology,J.Wileyand Sons,NY(1994))。
挑出各个带有正确克隆的HBM核苷酸序列的重组pET-28b载体,在加入25μg/ml硫酸卡那霉素的5ml LB培养液中孵育过夜。第二天,用Qiagene质粒纯化方案(Qiagene Inc.,Chatsworth,CA)分离并纯化质粒DNA。
PET载体能在任何大肠杆菌K-12菌株中进行繁殖,如:HMS 174,HB 101,JM 109,DH5等,根据克隆的目的或质粒制备进行选择。用于表达的宿主细胞包括含有一个染色体拷贝的T7 RNA聚合酶基因的大肠杆菌菌株。这些宿主是嗜菌体DE3的溶原体,它是一种λ衍生体,带有lacI基因,lacUV5启动子和T7 RNA聚合酶基因。T7 RNA聚合酶在加入异丙基--D-硫代半乳糖苷(IPTG)后被诱导,它转录任何一种带有功能性的T7启动子的目的质粒,比如带有感兴趣基因的pET-28b。菌株包括BL21(DE3)(Studier et al,Meth.Enzymol.,185:60-89(1990)).
要表达重组的HBM序列,用上述方法分离50ng质粒DNA,转化感受态BL21(DE3)菌株(由Novagen作为pET表达试剂盒的一部分提供)。采用用于HBM重组构建体的方法使LacZ基因(β-半乳糖苷酶)在pET系统中的表达。转化的细胞在SOC培养基中培养1小时,培养物铺到含有25μg/ml硫酸卡那霉素的LB平板上。第二天,合并所有克隆接种于含硫酸卡那霉素(25μg/m)的LB培养液中,生长至理想浓度,使其在600nm处吸光值为0.5-1.0 O.D.单位,在该点时,往培养基中加入1mM IPTG培养3小时,诱导HBM重组DNA构建体的基因表达。
用IPTG诱导基因表达后,用Sorvall RC-3B离心机以3500×g在4℃下离心15分钟,收集细菌。沉淀物用50ml冰冷的mMTris-HCl,pH8.0,0.1M NaCl和0.1 mM EDTA(STE缓冲液)重悬。然后细胞以2000×g在4℃离心20分钟。称量沉淀物净重,冻存于-80℃直到准备进行蛋白纯化。
许多本领域公知的方法均可用于蛋白的纯化和分离(Coligan etal,Current Protocols in Protein Science,John Wiley &Sons(1995))。例如;冻存的细胞解冻后,重悬于缓冲液中,多次通过一个小体积的微液流器(microfluidizer)(M110S型,MicrofluidicsInternational Corp.,Newton,MA)使细胞破碎。所得匀浆物进行离心得到澄清的上清(粗提物)接下来进行过滤,粗提物过柱后得到了级分分离,这些级分在O.D280处检测吸光值,峰级分用SDS-PAGE进行分析。
纯化所得蛋白的浓度用从氨基酸成分计算所得的吸光系数进行分光光度定量(Perkin,Eur.J.Biochem.,157:169-180(1986))。蛋白浓度也以牛血清白蛋白作为标准试剂用Bradford法进行测定,Anal.Biochem.,72:248-254(1976)和Lowry et al,J.Biol.Chem.,193:265-275(1951)。
不同浓度的SDS-聚丙烯酰胺凝胶购自BioRAD(Hercules,CA),用考马斯亮蓝进行染色。分子量标记包括兔骨骼肌肌球蛋白(200KD),大肠杆菌β-半乳糖苷酶(116KD),兔骨骼肌磷酸化酶B(97.4KD),牛血清白蛋白(66.2KD),卵清蛋白(45KD),牛碳酸酐酶(31KD),大豆胰蛋白酶抑制剂(21.5KD),鸡卵清溶菌酶(14.4KD)和牛抑蛋白酶肽(6.5KD)。
一旦获得足够量的所需蛋白,它便可用于许多目的。最典型是用于产生特异结合的抗体。这些抗体可以是多克隆也可是单克隆,可用本领域公知的体内或体外技术来产生。根据所述方法识别和分离的任何肽的表位的单克隆抗体可从鼠杂交瘤制备(Kohler,Nature,256:495(1975))。概言之,在两周的时间间隔内对小鼠接种几个微克的HBM蛋白。然后将小鼠杀死。产生抗体的细胞从小鼠的脾脏取出。这些脾细胞用聚乙二醇与小鼠的骨髓瘤细胞融合。融合成功的细胞在微量滴定板中稀释,继续使培养物生长。每一孔中的抗体量用免疫方法如ELISA(Engvall,Meth.Enzymol.,70:419(1980))进行测量。产生抗体的克隆可被扩充并进一步增殖以产生HBM抗体。其它适合的技术包括在体外将淋巴细胞暴露于抗原多肽,或者,在噬菌体或相似载体中选择抗体库,见Huse et al,Science,246:1275-1281(1989)。其它关于抗体制备的信息见Dayis et al,Basic Methods in MolecularBiology,Elsevier,NY,Section 21-2(1989)。XIX.方法运用:基因治疗
近年来,在遗传和获得性疾病的基因治疗领域取得了显著的进展(Kay et al,Proc.Natl.Acad.Sci.USA,94:12744-12746(1997))。基因治疗可定义为因治疗目的而进行的有意的DNA转移。基因转移方法的进步使对各类疾病的基因治疗方法得到了发展。基因治疗也得益于近来在识别新的治疗基因方面的进展,病毒和非病毒基因递送系统的发展,对基因调节的更好理解以及在细胞分离和移植方面的进展。
前述实验鉴定出HBM基因是一个引起增加骨量的显性突变。该突变为显性实际上是指HBM蛋白的表达引起骨量的增加。年老的带有HBM基因的个体,可以表达HBM蛋白,因此不会患骨质疏松。这些个体与接受HBM蛋白治疗的个体是等效的。这些观察资料是HBM蛋白能预防骨质疏松的强有力的实验证据。HBM基因使骨量增加的活性称为“HBM功能”。
因此,根据本发明,可提供一种赋予间充质干细胞HBM功能的方法(Onyia et al,J.Bone Miner.Res.,13:20-30(1998);Ko et al,Cancer Res.,56:4614-4619(1996))。提供该种功能可为骨质疏松提供保护作用。HBM基因或该基因的一部分可在载体中被导入细胞,这样基因可存在于染色体外。在这种情况下,基因可在染色体外的位置被细胞表达。
既可用于重组也可用于染色体外维持的导入基因的载体在本领域是公知的,且任何合适的载体均可使用。将DNA导入细胞的方法如:电穿孔,磷酸钙共沉淀以及病毒转导在本领域都是公知的,对方法的选择在本领域技术人员的能力范围内(Robbins,Ed.,Gene TherapyProtocols,Human Press,NJ(1997))。用HBM基因转化的细胞可作为研究骨质疏松和促进骨生长的药物治疗的模式系统。
如上所述,一般来说,在合适的情况下,HBM基因或片段可用于基因治疗方法用以增加这些基因产物在间充质干细胞中的表达量。它对于增加给定的HBM蛋白或它的一个片段的表达水平也是有用的,甚至在那些野生型基因正常表达的细胞中也是如此。基因治疗可以根据被普遍接受的方法来加以实施,正如Friedman在Therapy for GeneticDiseases,Friedman,Ed.,Oxford University Press,Pages 105-121(1991)所描述的那样。
已制备好包含一个拷贝的HBM基因的病毒或质粒载体,它与表达调控元件相连并能在间充质干细胞内复制。已知有描述的合适的载体,例如:美国专利5,252,479和WO93/07282,其公开内容在此全部引入作为参考。然后将载体注射到病人体内,可在骨髓局部或全身(为到达定位于其它位点,例如血液的任何间充质干细胞)。如果被转染的基因没有永久地掺入每一个靶细胞的基因组中,那么治疗需要定期重复。
本领域内公知的基因转移系统在本发明的基因治疗方法实施中可能有用。这些包括了病毒和非病毒的转移方法。许多病毒已被用作基因转移载体包括,多瘤病毒,例如:SV40(Madzak et al,J.Gen.Virol.,73:1533-1536(1992)),腺病毒(Berkner,Curr.Top.Microbiol.Immunol.,158:39-61(1992);Berkner et al,BioTechniques,6:616-629(1988);Gorziglia et al,J.Virol.,66:4407-4412(1992);Quantin et al,Proc.Natl.Acad.Sci.USA,89:2581-2584(1992);Rosenfeld et al,Ce11,68:143-155(1995);Wilkinson et al,Nucl.Acids Res.,20:2233-2239(1992);Stratford-Perricaudet et al,Hum.Gene Ther.,1:241-256(1990)),痘苗病毒(Mackett et al,Biotechnology,24:495-499(1992)),腺伴随病毒(Muzyczka,Curr.Top.Microbiol.Immunol.,158:91-123(1992);Ohi et al,Gene,89:279-282(1990)),疱疹病毒,包括HSV和EBV(Margolskee,Curr.Top.Microbiol.Immunol.,158:67-90(1992);Johnson et al,J.Virol.,66:2952-2965(1992);Fink et al,Hum.Gene Ther.,3:11-19(1992);Breakfield et al,Mol.Neurobiol.,1:337-371(1987);Fresse et al,Biochem.Pharmacol.,40:2189-2199(1990)),鸟类逆转录病毒(Brandyopadhyayet al,Mol.Cell Biol.,4:749-754(1984);Petropouplos et al,J.Virol.,66:3391-3397(1992)),鼠类逆转录病毒(Miller,Curr.Top.Microbiol.Immunol.,158:1-24(1992);Miller,et al,Mol.Cell Biol.,5:431-437(1985);Sorge et al,Mol.Cell Biol.,4:1730-1737(1984);Mann et al,J.Virol.,54:401-407(1985)),以及人类逆转录病毒(Page et al,64:5370-5276(1990);Buchschalcher et al,J.Virol.,66:2731-2739(1992))。大多数人类基因治疗方法是基于缺陷的鼠类逆转录病毒。
在本领域内公知的非病毒的基因转移方法包括化学技术,例如:磷酸钙共沉淀(Graham et al,Virology,52:456-467(1973);Pelliceret al,Science,209:1414-1422(1980)),机械方法,如:微注射法(Anderson et al,Proc.Natl.Acad.Sci.USA,5399-5403(1980);Gordon et al,Proc.Natl.Acad.Sci.USA,77:7380-7384(1980);Brinster et al,Cell,27:223-231(1981);Constantinin et al,Nature,294:92-94 !1981)),通过脂质体的膜融合介导转移(Felgneret al,Proc.Natl.Acad.Sci.USA,84:7413-7417(1987);Wang etal,Biochemistry,28:9508-9514(1989);Kaneda et al,J.Biol.Chem.,264:12126-12129(1989);Stewart et al,Hum.Gene Ther.,3:267-275(1992);Nabel et al,Science,249:1285-1288(1990);Lim et al,Circulation,83:2007-2011(1992)),以及直接的DNA摄入和受体介导的DNA转移(Wolff et al,Science,247:1465-1468(1990);Wu et al,BioTechniques,11:474-485(1991);Zenke etal,Proc.Natl.Acad.Sci.USA,87:3655-3659(1990);Wu et al,J.Biol.Chem.,264:16985-16987(1989);Wolff et al,BioTechniques,11:474-485(1991);Wagner et al,1990;Wagner etal,Proc.Natl.Acad.Sci.USA,88:4255-4259(1991);Cotton etal,Proc.Natl.Acad.Sci.USA,87:4033-4037(1990);Curiel etal,Proc.Natl.Acad.Sci.USA,88:8850-8854(1991);Curiel etal,Hum.Gene Ther.,3:147-154(1991))。病毒介导的基因转移可与直接的体内载体相结合进入间充质干细胞内而不到周围的细胞中。(Romano et al,In Vivo,12(1):59-67(1998);Gonez et al,Hum.Mol.Genetics,7(12):1913-9(1998))。可选择将产生逆转录病毒载体的细胞系注射入骨髓(Culver et al,Science,256:1550-1552(1992))。注射该细胞将提供一个连续的载体颗粒来源。这种技术已被批准用于人类无法经手术治疗的脑部肿瘤。
将生物和物理的基因转移法结合在一起的方法中,任何大小的质粒DNA与一个多聚赖氨酸缀合的抗体结合,该抗体特异针对于腺病毒的六角体蛋白,由此产生的复合物与腺病毒载体结合。这个三分子复合物即可用于感染细胞。腺病毒载体允许在偶联的DNA被破坏以前对内涵体进行有效地结合、内化以及降解。
脂质体/DNA复合物显示能直接介导体内基因转移。然而在标准的脂质体制备中基因转移过程是非特异性的,例如,已有报道直接原位给药后,肿块局部有其吸收和表达(Nabel,Hum.Gene Ther.,3:399-410(1992))。XX.使用方法:转化宿主,药物开发和研究工具
携带HBM基因的细胞和动物可用作研究和测试可能具有治疗作用物质的模型系统(Onyia et al,J.Bone Miner.Res.,13:20-30(1998);Broder et al,Bone,21:225-235(1997))。典型细胞是培养的间充质干细胞,它们可从体细胞或生殖细胞中带HBM基因的个体中分离。另外,可按上述将细胞系工程化使之携带HBM基因。将测试物质施加到细胞后,检测细胞的转化表型。可评测转化细胞的任何特性,包括培养物中的骨基质形成(Broder et al,Bone,21:225-235(1997)),机械特性(Kizer et al,Proc.Natl.Acad.Sci.USA,94:1013-1018(1997)),和对推测治疗剂的反应。
可在处理生殖细胞或受精卵后选择用于测试治疗剂的动物。这种处理包括插入Zmax1基因、插入HBM基因和破坏的同源基因。此外,可用常规方法通过其它遗传学变化的插入或缺失突变来破坏插入动物中的Zmax1基因和/或HBM基因,如下文所述,例如,Capechi,Science,244:1288(1989);Valancuis et al,Mol.Cell Biol.,11:1402(1991);Hasty et al,Nature,350:243(1991);Shinkai etal,Cell,68:855(1992);Mombaerts et al,Cell,68:869(1992);Philpott et al,Science,256:1448(1992);Snouwaert et al,Science,257:1083(1992);Donehower et al,Nature,356:215(1992)。将测试物给予给动物后,必须评价骨生长。如果测试物增强骨生长,则测试物是候选的治疗剂。这些动物模型为潜在的治疗产品提供非常重要的载体。
携带HBM基因的个体骨量增加。通过改变参与骨发育的其它分子的活性、水平、表达模式和修饰状态,HBM基因引起该表型。用多种已有技术有可能鉴定这些分子,优选蛋白或mRNA,其活性、水平、表达模式和修饰状态在含Zmax1基因的系统和含HBM基因的系统间存在差异。这种系统可以是,例如,无细胞提取物、细胞、组织、或活有机体,如小鼠或人。对于突变型Zmax1,可以使用完全缺失的Zmax1、缺乏该蛋白细胞外或细胞内部分的突变、或其它Zmax1的任何突变。也可用表达反义Zmax1 RNA或寡核苷酸抑制Zmax1蛋白生成。对于突变型HBM,可以使用完全缺失的HBM、缺乏该蛋白细胞外或细胞内部分的突变、或其它HBM的任何突变。也可用表达反义HBM RNA或寡核苷酸抑制HBM蛋白生成。
通过比较Zmax1系统和HBM系统而鉴定的分子可用作药物开发或人和动物骨疾病诊断中的替代标志。另外,这些分子可用于治疗骨疾病。见,Schena et al,Science,270:467-470(1995)。
例如,在同系鼠中构建携带HBM基因的转基因小鼠。基因型HBM/+的小鼠可成活,健康且骨量增加。为鉴定骨量增加的替代标记,处死HBM/+(即杂合子)和纯合+/+(即野生型)小鼠。从每种动物中提取骨组织mRNA,并构建对应表达于+/+个体中表达的mRNA的“基因芯片”。从每种基因型动物中分离不同组织的mRNA,逆转录,荧光标记,然后与连接于固相支持物上的基因片段杂交。两个群体间荧光强度的比值表示+/+和HBM/+动物中特异性mRNA的相对丰度。编码相对野生型对照高或低表达的mRNA的基因是由HBM基因同时调节的候选基因。
正如一种已发现的蛋白,作为相同信号级联一部分的新蛋白,其鉴定的标准程序如下。用放射性磷处理细胞,操作已发现蛋白使其活性升高或降低。通过聚丙烯酰胺凝胶电泳和放射自显影或类似方法检测细胞中其它蛋白的磷酸化状态。可通过许多方法操作已知蛋白的活性水平,包括,例如,简单地加入或不加入一种已知细胞外蛋白,用特异性抑制剂如药物或抗体,或用表达该已知蛋白的反义抑制,来比较野生型突变蛋白(Tamura et al,Science,280(5369):1614-7(1998);Meng,EMBO J.,17(15):4391-403(1998);Cooper et al,Cell,1:263-73(1982))。
另一实例中,在表达Zmax1的有义或反义cDNA的TE85骨肉瘤细胞中鉴定了不同磷酸化水平的蛋白。正常TE85细胞表达高水平Zmax1(Dong et al,Biochem. & Biophys.Res.Comm.,251:784-790(1998))。含有义构建体的细胞表达更高水平的Zmax1,而表达反义构建体的细胞其表达水平降低。细胞生长于32P环境中,收获,裂解,裂解物跑SDS聚丙烯酰胺凝胶以分离蛋白,凝胶放射自显影(Ausubelet al,Current Protocols in Molecular Biology,John Wiley & Sons(1997))。有义和反义细胞系之间有强度差异的带代表其磷酸化状态或绝对水平随Zmax1水平变化的磷酸化蛋白。代替32P标记,可用SDS-PAGE分离未标记蛋白并用商品抗磷酸化酪氨酸抗体作为探针进行免疫印迹(Thomas et al,Nature,376(6537):267-71(1995))。代替表达反义RNA,可用化学修饰的反义寡核苷酸转染(Woolf et al,Nucleic Acids Res.,18(7):1763-9(1990))。
许多骨疾病,如骨质疏松,发作缓慢且对治疗的反应也慢。因此,开发骨发育和矿化的替代标记很有用。这种标记可用于开发骨疾病的治疗方法,以及用于诊断在将来可能有发生骨疾病危险的患者。优选标记的实例是一些N-和C-端端肽标记,例如,描述于美国专利5,455,179,5,641,837和5,652,112中的那些,在此引入其完整内容作为参考。在HIV疾病领域,CD4数和病毒载量是疾病进展的有用替代标记(Vlahov et al,JAMA,279(1):35-40(1998))。在骨疾病领域也需要类似的替代标记。
替代标记可以是容易检测且对非特异性影响相对不敏感的任何特征。例如,替代标记可以是组织或血清中的分子如蛋白或mRNA。另外,替代标记可以是诊断性体征,如对疼痛敏感、反射应答等。
在又一个实例中,用携带HBM基因的人家系鉴定骨量增加的替代标记。从携带HBM基因的三个个体中,以及三个没有该基因的近亲属个体中抽取血样。对这些个体血清中的蛋白进行双向凝胶电泳,其中一向根据大小分离蛋白,另一向根据等电点分离蛋白(Epstein et al,Electrophoresis,17(11):1655-70(1996))。鉴定蛋白对应点。与其正常亲属比较,预期在HBM个体中若干点有量的差异或轻度位置差异。这些蛋白对应点是候选的替代标记。通过微测序鉴定蛋白,可通过诊断测试中所用的标准方法产生这些蛋白的抗体。HBM蛋白或其它候选替代标记的诊断测试包括用本发明所述抗体和报告分子检测人体液、膜、骨、细胞、组织或其提取物中的HBM。通过共价或非共价连接提供可检测信号的物质可以标记抗体。在许多科学和专利文献中描述了多种报告分子或标记,包括放射性核素、酶、荧光剂、化学发光剂或显色剂(美国专利3,817,837;3,850,752;3,939,350;3,996,345;4,277,437;4,275,149;和4,366,241)。
用这些抗体,可以检测正常个体和骨疾病患者中候选替代标记的水平,如骨质疏松、骨质疏松假胶质瘤、恩格尔曼病、Ribbing’s病、高磷酸盐血症、范布伦病、肢骨纹状肥厚、骨硬化病、致密性成骨不全症、硬化性狭窄、脆弱性骨硬化、肢端肥大症、佩吉特病、纤维结构不良、小梁狭窄、成骨不全、甲状旁腺功能减退、假性甲状旁腺功能减退、假假性甲状旁腺功能减退、原发性和继发性甲状旁腺功能亢进以及相关综合征、高钙尿、甲状腺髓样癌、骨软化及其它疾病。用抗体在临床上检测血清中蛋白水平的技术已很成熟。在携带特定疾病或疾病类型的个体中持续高水平或低水平存在的蛋白是有用的替代标记。
替代标记可用于诊断骨疾病。例如,求医的有高频骨折的儿童,其深层原因可能是虐待儿童,儿童的不当行为或骨疾病。要快速检测骨疾病,可用上述抗体检测替代标记蛋白的水平。
也可测量替代标记的修饰状态水平作为待开发药物可能有效性的指标。用替代标记在建立骨疾病的治疗方案中尤其方便,因为骨发育或矿化的改变可能需要长期观察。例如,如上述发现一组骨mRNA,称为“HBM诱导的mRNA组”,与+/+小鼠相比,在HBM/+小鼠中过表达。它们的表达可用作替代标记。具体地,如果用化合物处理+/+小鼠导致HBM诱导的mRNA组的过表达,那么可认为这种化合物是很有前景的候选物供进一步开发。
本发明对于通过用Zmax1或HBM蛋白或其结合片段以任何一种药物筛选技术筛选化合物特别有用。
在此测试中使用的Zmax1或HBM蛋白或其片段可以是游离于溶液中、连接于固相支持物上或负载于细胞表面。一种药物筛选方法使用真核或原核宿主细胞,它们用表达该蛋白或其片段的重组核酸稳定转化,优选在竞争性结合测试中使用。这种细胞可以是活的或固定形式,可用于标准结合测试。例如,可测试Zmax1或HBM蛋白或其片段与待测药物间复合物的形成,或检验待测药物对Zmax1或HBM蛋白或其片段与已知配体间复合物形成的干扰程度。
因而,本发明提供筛选药物的方法,包括将这种试剂与Zmax1或HBM蛋白或其片段接触,并测定(1)该试剂与Zmax1或HBM蛋白或其片段间复合物的存在,或(2)Zmax1或HBM或其片段与配体间复合物的存在,这些方法本领域公知。在这种竞争结合测定中典型情况下标记Zmax1或HBM蛋白或其片段。游离的Zmax1或HBM蛋白或其片段与存在于蛋白:蛋白复合物中的那些分离,游离标记物(即未形成复合物)的量分别是待测药物与Zmax1或HBM结合、或干扰Zmax1或HBM:配体结合的指标。
另一种药物筛选技术可以高通量筛选对Zmax1或HBM蛋白有合适结合亲和力的化合物,它详细描述于WO 84/03564。简单来说,在固相基质,如塑料针或其它表面,上合成大量不同的小肽测试化合物肽测试化合物与Zmax1或HBM蛋白反应并洗涤,然后用本领域公知的方法检测结合的Zmax1或HBM蛋白。可将纯化的Zmax1或HBM直接包被于板上,用于上述药物筛选技术。然而,该蛋白的未中和抗体可用于捕获将Zmax1或HBM蛋白固定于固相上的抗体。
本发明也涉及竞争性药物筛选测定的用途,其中可特异性结合Zmax1或HBM蛋白的中和抗体与待测化合物竞争Zmax1或HBM或其片段。以此方式,抗体可用于检测是否存在共同具有Zmax1或HBM蛋白一或多个抗原决定簇的肽。
进一步的药物筛选技术涉及使用含无功能Zmax1或HBM基因的宿主真核细胞系或细胞(如上述)。这些宿主细胞系或细胞在存在药物化合物的环境中生长。检测宿主细胞的生长速率以确定是否该化合物可调节Zmax1或HBM缺陷细胞的生长。
合理药物设计的目的是生产感兴趣生物活性蛋白的结构类似物,或相互作用(如激动剂、拮抗剂、抑制剂)小分子的结构类似物,以设计,例如,蛋白更有活性或更稳定的药物,或者在体内增强或干扰蛋白功能。见,例如,Hodgon,Bio/Technology,9:19-21(1991)。在一种方法中,首先通过X-射线晶体图像,通过计算机建模或最典型的通过组合方法,确定感兴趣蛋白(例如Zmax1或HBM蛋白)、或例如,Zmax1或HBM-受体或配体复合物的三维结构。不太常见的是,有关蛋白结构的有用信息可通过基于同源蛋白结构的建模获得。合理药物设计的实例是开发HIV蛋白酶抑制剂(Erickson et al,Science,249:527-533(1990))。此外,可通过丙氨酸扫描(Wells,Methods inEnzymol.,202:390-411(1991))分析肽(Zmax1或HBM蛋白)。在此技术中用丙氨酸代替氨基酸残基,并确定这对肽活性的影响。以此方式分析该肽的每个氨基酸残基以确定该肽的重要区域。
也可分离靶特异性抗体,通过功能测试选择,然后解析其晶体结构。从原理来说,该方法产生药物核心,这可以是随后药物设计的基础。通过产生对功能性、有药理活性的抗体的抗独特型抗体(anti-ids),可以完全绕开蛋白晶体学。作为镜相的镜相,预期anti-ids的结合位点是起始受体的类似物。然后可用anti-id从化学或生物产生的肽库中鉴定和分离肽,所选肽用作药物核心。
因而,可设计,例如,具有提高Zmax1或HBM蛋白活性或稳定性的药物,或用作Zmax1或HBM蛋白活性抑制剂、激动剂、拮抗剂等的药物。由于可以得到克隆Zmax1或HBM序列,可以产生足够量的Zmax1或HBM蛋白供这种分析研究,如X-射线晶体衍射。此外,此处有关Zmax1或HBM蛋白序列的知识将可辅助计算机建模技术,以代替或补充X-射线晶体衍射。XXI.使用方法:鸟和哺乳动物饲养
Zmax1 DNA和Zmax1蛋白和/或HBM DNA和HBM蛋白可用于脊椎动物、优选人的治疗剂,以及鸟类和哺乳动物兽用药,包括供家畜饲养用。鸟类,包括,例如,小鸡、公鸡、母鸡、火鸡、鸵鸟、鸭、野鸡和鹌鹑,可从该基因及高骨量途径的鉴定中受益。在文献引用的许多实例中(例如,McCoy et al,Res.Vet.Sci.,60(2):185-186(1996)),因饲养条件导致的骨弱化引起笼层疲劳、骨质疏松和高死亡率。治疗鸟类骨质疏松或其它骨疾病的补充治疗剂对鸟类福利和饲养工业,包括,例如,肉和蛋生产的经济条件有相当积极的作用。XXII.使用方法:用Zmax1特异的寡核苷酸检测影响骨发育的遗传改变的诊断试验
在怀疑骨发育改变或疾病与Zmax1基因或HBM基因改变有关的情况下,可以构建特异性寡核苷酸并分别用于评价骨组织或影响骨发育的其它组织中Zmax1 mRNA或HBM mRNA水平。
例如,要检测一个人是否带影响骨密度的HBM基因,可以使用聚合酶链式反应。用标准方法合成或从定制寡核苷酸的商业供应商处获得两个寡核苷酸。用Oligo 4.0引物选择程序(Wojchich Rychlik,1992)按标准条件确定其长度和碱基组成。其中一条寡核苷酸设计成在所用PCR条件下只与HBM DNA杂交。另一条寡核苷酸设计成与Zmax1基因组DNA的片段杂交,这样用这些寡核苷酸引物进行DNA扩增产生容易鉴定的DNA片段。例如,在下述条件下引物对CCAAGTTCTGAGAAGTCC(SEQ ID NO:32)和AATACCTGAAACCATACCTG(SEQ ID NO:33)将从DNA样品中扩增530bp的DNA片段:第1步95℃120秒;第2步95℃30秒;第3步58℃30秒;第4步72℃120秒;其中第2-4步重复35次。可从毛囊、全血或口腔中获取组织样品。
用标准方法测序由以上程序产生的片段。在171位甘氨酸密码子的第二位上,HBM基因杂合的个体将表现出相等量的G和T。正常或纯合的野生型个体在此位置仅出现G。除PCR外其它扩增技术也可使用,如连接介导的PCR或涉及Q-β复制酶的方法(Cahill et at al,Clin.Chem.,37(9):1482-5(1991))。例如,寡核苷酸AGCTGCTCGTAGCTGTCTCTCCCTGGATCACGGGTACATGTACTGGACAGACTGGGT( SEQ ID NO:34 ) 和TGAGACGCCCCGGATTGAGCGGGCAGGGATAGCTTATTCCCTGTGCCGCATTACGGC(SEQ ID NO:35)可与变性的人DNA样品杂交,DNA连接酶处理,然后用以下寡核苷酸引物进行PCR扩增:AGCTGCTCGTAGCTGTCTCTCCCTGGA ( SEQ ID NO:36 ) 和GCCGTAATGCGGCACAGGGAATAAGCT (SEQ ID NO:37)。在前两个寡核苷酸中,外侧27个碱基是对应引物结合位点的随机序列,内侧30个碱基对应Zmx1基因的序列。第一个寡核苷酸末端的T对应HBM基因。前两个寡核苷酸仅在与携带HBM基因的DNA杂交时被连接,导致形成可扩增的114bp DNA片段。
扩增产物的检测可用琼脂糖凝胶电泳、定量杂交或分子生物学领域技术人员已知的等价核酸检测技术(Sambrook et al,MolecularCloning:A Laboratory Manual,Cold Spring Harbor Laboratory,Cold Spring,NY(1989))。
Zmax1基因或HBM基因的其它改变的诊断可通过相同类型的扩增检测方法,使用设计用于鉴定这些改变的寡核苷酸。这些方法可用于动物及人,鉴定影响骨发育的Zmax1或HBM的改变。
骨组织中Zmax1或HBM的表达可通过分别将Zmax1或HBM的cDNA融合到载体中骨特异性启动子上,用于遗传工程化脊椎动物细胞。通过将DNA包装入病毒衣壳,使用阳离子脂质体、电穿孔,或通过磷酸钙转染,将DNA构建体引入细胞。转染细胞,优选成骨细胞,可用于培养物研究,或通过直接注射入骨,或静脉注射成骨细胞、随后掺入骨组织的方法,导入动物的骨组织(Ko et al,Cancer Research,56(20):4614-9(1996))。例如,在成骨细胞中具有特异性活性的骨钙蛋白启动子,可用于直接转录Zmax1基因或HBM基因。可以使用几种载体和转染方法中的任一个,如逆转录病毒载体,腺病毒载体,或用阳离子脂质体转染后维持的载体,或此处所述的其它方法和载体。
功能性Zmax1蛋白或HBM蛋白水平的改变影响骨矿化水平。通过操作Zmax1蛋白或HBM蛋白的水平,有可能影响骨发育并增加或降低骨矿化水平。例如,可用于增加骨质疏松患者的骨矿化水平。此外,可用于降低骨硬化病或佩吉特病患者的骨矿化水平。Zmax1水平或HBM水平的改变也可用作研究工具。具体来说,它可以鉴定蛋白、mRNA以及水平或修饰状态对Zmax1或HBM功能水平改变而变化的其它分子。骨疾病的病理学和病原学已知,并描述于,例如,Rubin and Farber(Eds.),Pathology,2nd Ed.,S.B.Lippincott Co.,Philadelphia,PA(1994)。
多种方法可用于改变功能性Zmax1或HBM的水平。例如,静脉内或骨内注射Zmax1或其突变体的细胞外部分、或HBM或其突变体的细胞外部分,将分别改变处理人、动物或鸟体中Zmax1活性或HBM活性水平。也可注射截短的Zmax1蛋白或HBM蛋白来分别改变功能性Zmax1蛋白或HBM蛋白水平。某种形式的Zmax1或HBM增强内源性蛋白的活性,而其它形式则被抑制。
在优选实施方案中,HBM蛋白用于治疗骨质疏松。在进一步优选的实施方案中,使用HBM蛋白的细胞外部分。该HBM蛋白可通过加入引起蛋白粘附到细胞表面的部分进行修饰。蛋白溶于药物允许的溶液中,并通过注射或达到合适药代动力学及分布的其它方法来给药。
在该方法的另一个实施方案中,通过基因治疗技术增加或降低Zmax1或HBM水平。要增加Zmax1或HBM水平,按上述方法遗传工程化成骨细胞或另一种有用的细胞类型以表达高水平的Zmax1或HBM。此外,要降低Zmax1或HBM水平,可使用特异性减少可翻译Zmax1或HBM mRNA水平的反义构建体。一般来说,可使用无组织特异性的启动子,如CMV启动子或另一种可在表达载体中发现的商品启动子(Wu etal,Toxicol.Appl.Pharmacol.,141(1):330-9(1996))。在优选实施方案中,用骨特异性启动子,如骨钙蛋白或另一种启动子转录Zmax1 cDNA或其反义形式,以在骨组织中特异性表达。以此方式,如果表达Zmax1的DNA构建体或表达HBM的构建体导入非骨组织,将不被表达。
在该方法的第三个实施方案中,使用抗Zmax1或HBM的抗体以抑制其功能。这种抗体已在此处鉴定。
在本方法的第四个实施方案中,使用抑制Zmax1功能或HBM功能的药物。这些药物在此处描述并根据药物开发领域技术人员熟知的药物化学方法优化。
Zmax1和HBM与几种蛋白相互作用,如ApoE。预期抑制Zmax1或HBM与ApoE或另一种结合成分相互作用的分子可改变骨发育和矿化。这种抑制剂可用作治疗骨质疏松、骨硬化病或其它骨矿化疾病的药物。这种抑制剂可以是低分子量化合物、蛋白或其它类型的分子。见:Kim et al,J.Biochem.(Tokyo),124(6):1072-1076(1998)。
可通过标准药物筛选方法分离Zmax1或HBM与相互作用蛋白间相互作用的抑制剂。例如,可将Zmax1蛋白,(或其片段)或HBM蛋白(或其片段)固定于固相支持物上,如微量滴定板孔底上。另一种蛋白或蛋白片段,如ApoE用衍生荧光素、碘或生物素等进行衍生化以辅助检测,然后在候选化合物存在下加到Zmax1或HBM中,这些候选化合物分别可以特异性抑制Zmax1或HBM的该蛋白-蛋白结构域,并因而避免了与其跨膜片段相关的问题。药物开发领域的技术人员熟知这种药物筛选方法。
因为Zmax1和HBM参与骨发育,预计结合Zmax1和HBM的蛋白也与参与骨发育。可用标准方法鉴定这些蛋白,如免疫共沉淀,共分级分离(co-fractionation),或双杂交筛选(Ausubel et al,CurrentProtocols in Molecular Biology,John Wiley & Sons(1997))。例如,要用双杂交系统鉴定与Zmax1相互作用的蛋白或与HBM相互作用的蛋白,将Zmax1或HBM的细胞外结构域与LexA融合并用酵母载体pEG202表达(“诱饵”)于酵母株EGY48中。该酵母株用适当载体中的“猎物”文库转化,该文库编码融合到候选相互作用蛋白的上半乳糖诱导的转录活化序列。分子生物学领域的技术人员熟知通过该方法初步选择及随后验证相互作用蛋白的技术(Ausubel et al,CurrentProtocols in Molecular Biology,John Wiley & Sons(1997))。
在优选实施方案中,用以上方法的变化方法(Xu et al,Proc.Natl.Acad.Sci.USA,94(23):12473-8(Nov.1997))鉴定与HBM相互作用但不与Zmax1相互作用的蛋白。这种双杂交系统的变化方法使用两个诱饵,且Zmax1和HBM分别融合到LexA和TetR上。此外,也分离出与HBM但不与Zmax1相互作用的蛋白。分子生物学领域的技术人员熟知这些方法,以及标准双杂交方法的简单变化方法。
作为分离Zmax1或HBM相互作用蛋白的替代性方法,可使用生化方法。将Zmax1蛋白或其片段,如细胞外结构域,或HBM蛋白或其片段,如细胞外结构域,化学偶联到Sepharose珠上。将偶联Zmax1或HBM的珠子倒入柱子中。蛋白提取物,如血清蛋白、骨活组织样品上清中的蛋白、或温和裂解的TE85成骨细胞系的细胞内蛋白,也加入柱子中。洗脱非特异结合的蛋白,用低盐缓冲液冲洗柱子几次,然后用高盐缓冲液洗脱紧密结合的蛋白。它们是结合Zmax1或HBM的候选蛋白,可用标准测试和对照实验检测特异性结合。用于偶联蛋白的Sepharose珠及偶联方法可以购得(Sigma),此处所述方法对蛋白质生化领域的技术人员熟知。
作为上述方法的变化方案,随后将从Zmax1-或HBM-Sepharose柱子上高盐洗脱的蛋白加到HBM-或Zmax1-Sepharose柱子上。无保留地流过的蛋白是结合Zmax1但不结合HBM的蛋白。另外,以相反的顺序使用柱子,可以分离结合HBM蛋白但不结合Zmax1蛋白的蛋白。XXIII.使用方法:转化相关的重组(TAR)克隆
鉴定Zmax1等位基因变体的关键是检测个体中两拷贝基因序列的能力。要完成此任务,在基因组序列内鉴定两个“钩”、或显著相似性的区域,由此在待克隆DNA部分的两侧。最优选的是这两个钩的第一个来源于感兴趣外显子5’侧的序列,第二个来源于最后一个外显子的3’侧序列。这两个“钩”克隆进细菌/酵母穿梭质粒载体中,正如Larionov et al,Proc.Natl.Acad.Sci.USA,94:7384-7387(1997)所述。也可使用其它类似载体系统。要回收Zmax1基因的完整基因组拷贝,含两个“钩”的质粒用限制性内切酶线性化,或由另外的方法如PCR产生。这种线性DNA片段与人基因组DNA一起导入酵母细胞。典型地,用啤酒糖酵母作为宿主细胞,尽管Larionov et al(待发表)报道用鸡宿主细胞也很好。转化过程中及之后,内源性宿主细胞通过重组事件将线性质粒转成环状,在此期间与“钩”同源的人基因组DNA区域插入质粒中。可回收该质粒并用本领域技术人员熟知的方法分析。很明显,该反应的特异性要求宿主细胞系统识别与“钩”相似的线性片段上的序列。然而,并不要求100%的序列一致性,正如下文所示:Kouprina et al,Genomics,53(1):21-28(October 1998),其中作者描述用人基因组中常见的简并重复序列从啮齿动物/人杂交细胞系中回收人DNA片段。
另一实施例中,只需要一个“钩”,描述于Larionov et al,Proc.Natl.Acad.Sci.USA,95(8):4469-74(April 1998)。此种试验称为“辐射状TAR克隆”,对此类试验,驱动重组的序列相似性的其它区域来源于基因组的重复序列。以此方式,可以回收邻近Zmax1基因编码区的DNA区域,并检验可能影响功能的改变。XXIV.使用方法:基因组筛选
使用与HBM基因或Zmax1基因连锁的多态性遗传标记,对于预测骨质疏松或其它骨病的易感性十分有用。Koller et al,Amer.J.BoneMin.Res.,13:1903-1908(1998)一文中证明多态性遗传标记的运用在连锁分析中非常有用。与之相似,鉴定高骨量基因中的多态性遗传标记将有可能鉴定特异性等位基因变体,它们与影响骨发育的其它遗传性疾病为连锁不平衡。用BAC文库的DNA序列,鉴定出一个二核苷酸重复CAn,并以此设计了用于扩增包含该重复的基因组DNA的如下两个特殊引物:
B200E21C16_L: GAGAGGCTATATCCCTGGGC(SEQ ID NO:38)
B200E21C16_R: ACAGCACGTGTTTAAAGGGG(SEQ ID NO:39)并将其用于遗传图谱研究。
该方法已被本领域其他技术人员成功运用(例如:Sheffield etal,Genet.,4:1837-1844(1995);LeBlanc-Straceski et al,Genomics,19:341-9(1994);Chen et al,Genomics,25:1-8(1995))。在人群或个体中使用这些试剂能预测他们罹患骨质疏松的风险。与之相似,单核苷酸多态性(SNPs),如上表4中所示,也可用于预测在HBM基因的病例中发生骨疾病的风险或对骨质疏松的抵抗性。XXV.使用方法:组织钙化调节物质
人体组织钙化已有较多了解。Towler et al,J.Boil.Chem.,273:30427-34(1998)证明在模型系统中多个已知调节发育中颅骨钙化的蛋白在钙化大动脉中有表达。Msx2是一个在骨祖先细胞中转录的基因,它在钙化的血管组织中的表达表明,在骨发育中起重要作用的基因参与其它组织的钙化。由于其已证明的在骨矿质密度方面的作用,用HBM蛋白、激动剂或拮抗剂治疗可能改善钙化(例如脉管系统,牙质和颅盖骨)。在证明组织钙化的实验系统中,过表达或抑制Zmax1活性可以鉴定直接被Zmax1基因调节的分子。这些基因是针对调节组织钙化的潜在治疗靶位。例如,一种动物,如LDLR-/-小鼠,以高脂饮食喂养,观察到有组织钙化标记表达,其中包括Zmax1。然后这些动物用抗Zmax1或HBM蛋白的抗体、或直接针对Zmax1或HBM cDNA的反义寡核苷酸、或用已知能与Zmax1或HBM蛋白或其结合配偶体或配体结合的化合物处理。从血管组织中提取RNA或蛋白,并用本领域公知的方法检测基因在该组织中的相对表达水平。在该组织中被调节的基因作为组织钙化调节物质,在药物开发中是潜在治疗目标。
将给予个体的本发明中的核酸、蛋白、肽、氨基酸、小分子或其它药用化合物,可以以组合物的形式,与本领域公知的药学允许的载体、赋形剂、稀释剂一起给药。个体可以是哺乳动物或鸟,优选人,大鼠,小鼠或鸟。可对个体给予药学上有效量的这种组合物。给药剂量根据治疗的状况和治疗的病人而变化。这种组合物可单独给药或与其它治疗方法相结合。
实施例
参考以下实施例描述本发明。它们只是提供举例说明,并非是以任何方式限制本发明。其中使用了本领域公知的标准方法或下面具体描述的方法。实施例1
先证者被她的医生推荐到Creighton骨质疏松中心以检查是什么表现出不正常的骨密度。她18岁,两年前因为背痛引起医生注意,这来自一场交通事故,她乘坐的小汽车被从后面撞击。她只是伤到了下背部的软组织,表现为疼痛和肌肉触痛。X光片上看不出骨折或半脱位。疼痛持续两年,尽管她可以全时上学。到她到中心的时候,疼痛几乎缓解,作为一个高中生她回到了正常生活中。身体检查显示一个正常健康的年轻女性,高66英寸重128磅。完整骨骼的X光片显示看起来较浓的骨骼,皮质较厚。所有骨头都是如此。最重要的是,所有骨头的形状完全正常。L1-4的脊椎BMC是94.48g,L1-4的脊椎BMD是1.667gm/cm2。BMD比女性峰值骨量高5.62个标准差(SD)。这是由DXA用Hologic 2000测量。然后她母亲也作了扫描,发现腰椎BMC 58.05g,BMD 1.500gm/cm2。她母亲的值比峰值量高4.12个SD,比其同龄人高4.98个SD。她母亲51岁,高65英寸,重140磅。她母亲健康极好,无肌肉骨骼或其它症状的历史。她父亲腰椎BMC是75.33g,BMD是1.118gm/cm2。这些值比男性峰值骨量高0.25 SD。他健康良好,高72英寸,重187磅。
这些临床数据提示,先证者从其母亲遗传了一个性状,导致其非常高的骨量,但是有正常的骨骼,由此注意力集中于其母亲家系。在U美国5,691,153中,由DXA测量了这些成员中22个的骨量。在一个病例中,先证者的母系祖父已经死亡,但得到了他的病历、临死前骨骼X光片和供DNA基因型分析的胆囊石蜡标本。其X光片显示他所有检查的骨头包括股骨和脊骨具有明显极高的密度,因而他被包括在受影响成员中。本发明中,该家系扩展到包括37个提供信息的个体。这些增加是对原始家族关系的明显改进(Johnson et al,Am.J.Hum.Genet.,60:1326-1332(1997)),因为在原始研究后又加入的14个个体中,两个个体具有关键交换。由于存在从个体12到14和15的男性-男性传递,可以排除X连锁。实施例2
本发明描述来源于HBM基因区两个BAC克隆的DNA序列,如下表7中所示这些克隆的总结。克隆b200e21-h(ATCC No.980812;SEQ IDNOS:10-11)于1997年12月30日保藏于美国典型培养物保藏中心(ATCC),10801 University Blvd.,Manassas,VA 20110-2209 U.S.A.。克隆b527d12-h(ATCC No.980720;SEQ ID NOS:5-9)于1998年10月2日保藏于美国典型培养物保藏中心(ATCC),10801 UniversityBlvd.,Manassas,VA 20110-2209 U.S.A.。这些序列是独特试剂,本领域技术人员可用它们鉴定Zmax1基因的DNA探针,扩增该基因的PCR引物,Zmax1基因中的核苷酸多态性,或Zmax1基因中的调节元件。
表7
基序 | ATCC编号 | SEQ ID NO. | 长度(碱基对) |
b527d12-h_contig302G | 980720 | 5 | 3096 |
b527d12-h_contig306G | 980720 | 6 | 26928 |
b527d12-h_contig307G | 980720 | 7 | 29430 |
b527d12-h_contig308G | 980720 | 8 | 33769 |
b527d12-h_contig309G | 980720 | 9 | 72049 |
b200e21-h_contig1 | 980812 | 10 | 8705 |
b200e21-h_contig4 | 980812 | 11 | 66933 |
对于说明书中引用的专利、专利申请以及出版物,在此引入其完整内容作为参考。
尽管已详细阐明本发明,本领域的技术人员可认识到对其可作许多变化和改进,并且这些变化和改进并没有偏离本发明的精神和范畴。
本申请要求以下申请的优先权:于2000年4月5日递交的美国申请09/543,771和09/544,398,它们是1999年1月13日递交的申请09/229,319的连续申请,该申请要求以下申请的权益:1998年1月13日递交的美国临时申请60/071,449,和1998年10月23日递交的美国临时申请60/105,511,所有这些都在此引入其完整内容作为参考。
序列表
序列表
<110>J.P.卡鲁利等
<120>11q13.3的高骨量基因
<130>032796-021
<150>US 09/544,398
<151>2000-04-05
<150>US 09/543,771
<151>2000-04-05
<150>US 09/229,319
<151>1999-01-13
<150>US 60/071,449
<151>1998-01-13
<150>US 60/105,511
<151>1998-10-23
<160>109
<210>1
<211>5120
<212>DNA
<213>人(Homo sapiens)
<400>1actaaagcgc cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc 60gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 109
Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu
1 5 10ctg ctg ctg ctg ctg ctg ctg gcg ctg tgc ggc tgc ccg gcc ccc gcc 157Leu Leu Leu Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala
15 20 25gcg gcc tcg ccg ctc ctg cta ttt gcc aac cgc cgg gac gta cgg ctg 205Ala Ala Ser Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu30 35 40 45gtg gac gcc ggc gga gtc aag ctg gag tcc acc atc gtg gtc agc ggc 253Val Asp Ala Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly
50 55 60ctg gag gat gcg gcc gca gtg gac ttc cag ttt tcc aag gga gcc gtg 301Leu Glu Asp Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val
65 70 75tac tgg aca gac gtg agc gag gag gcc atc aag cag acc tac ctg aac 349Tyr Trp Thr Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn
80 85 90cag acg ggg gcc gcc gtg cag aac gtg gtc atc tcc ggc ctg gtc tct 397Gln Thr Gly Ala Ala Val Gln Asn Val Val Ile Ser Gly Leu Val Ser
95 100 105ccc gac ggc ctc gcc tgc gac tgg gtg ggc aag aag ctg tac tgg acg 445Pro Asp Gly Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr110 115 120 125gac tca gag acc aac cgc atc gag gtg gcc aac ctc aat ggc aca tcc 493Asp Ser Glu Thr Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser
130 135 140cgg aag gtg ctc ttc tgg cag gac ctt gac cag ccg agg gcc atc gcc 541Arg Lys Val Leu Phe Trp Gln Asp Leu Asp Gln Pro Arg Ala Ile Ala
145 150 155ttg gac ccc gct cac ggg tac atg tac tgg aca gac tgg ggt gag acg 589Leu Asp Pro Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Thr
160 165 170ccc cgg att gag cgg gca ggg atg gat ggc agc acc cgg aag atc att 637Pro Arg Ile Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile
175 180 185gtg gac tcg gac att tac tgg ccc aat gga ctg acc atc gac ctg gag 685Val Asp Ser Asp Ile Tyr Trp Pro Asn Gly Leu Thr Ile Asp Leu Glu190 195 200 205gag cag aag ctc tac tgg gct gac gcc aag ctc agc ttc atc cac cgt 733Glu Gln Lys Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ile His Arg
210 215 220gcc aac ctg gac ggc tcg ttc cgg cag aag gtg gtg gag ggc agc ctg 78lAla Asn Leu Asp Gly Ser Phe Arg Gln Lys Val Val Glu Gly Ser Leu
225 230 235acg cac ccc ttc gcc ctg acg ctc tcc ggg gac act ctg tac tgg aca 829Thr His Pro Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr
240 245 250gac tgg cag acc cgc tcc atc cat gcc tgc aac aag cgc act ggg ggg 877Asp Trp Gln Thr Arg Ser Ile His Ala Cys Asn Lys Arg Thr Gly Gly
255 260 265aag agg aag gag atc ctg agt gcc ctc tac tca ccc atg gac atc cag 925Lys Arg Lys Glu Ile Leu Ser Ala Leu Tyr Ser Pro Met Asp Ile Gln270 275 280 285gtg ctg agc cag gag cgg cag cct ttc ttc cac act cgc tgt gag gag 973Val Leu Ser Gln Glu Arg Gln Pro Phe Phe His Thr Arg Cys Glu Glu
290 295 300gac aat ggc ggc tgc tcc cac ctg tgc ctg ctg tcc cca agc gag cct 1021Asp Asn Gly Gly Cys Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro
305 310 315ttc tac aca tgc gcc tgc ccc acg ggt gtg cag ctg cag gac aac ggc 1069Phe Tyr Thr Cys Ala Cys Pro Thr Gly Val Gln Leu Gln Asp Asn Gly
320 325 330agg acg tgt aag gca gga gcc gag gag gtg ctg ctg ctg gcc cgg cgg 1117Arg Thr Cys Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg
335 340 345acg gac cta cgg agg atc tcg ctg gac acg ccg gac ttc acc gac atc 1165Thr Asp Leu Arg Arg Ile Ser Leu Asp Thr Pro Asp Phe Thr Asp Ile350 355 360 365gtg ctg cag gtg gac gac atc cgg cac gcc att gcc atc gac tac gac 1213Val Leu Gln Val Asp Asp Ile Arg His Ala Ile Ala Ile Asp Tyr Asp
370 375 380ccg cta gag ggc tat gtc tac tgg aca gat gac gag gtg cgg gcc atc 1261Pro Leu Glu Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Ile
385 390 395cgc agg gcg tac ctg gac ggg tct ggg gcg cag acg ctg gtc aac acc 1309Arg Arg Ala Tyr Leu Asp Gly Ser Gly Ala Gln Thr Leu Val Asn Thr
400 405 410gag atc aac gac ccc gat ggc atc gcg gtc gac tgg gtg gcc cga aac 1357Glu Ile Asn Asp Pro Asp Gly Ile Ala Val Asp Trp Val Ala Arg Asn
415 420 425ctc tac tgg acc gac acg ggc acg gac cgc atc gag gtg acg cgc ctc 1405Leu Tyr Trp Thr Asp Thr Gly Thr Asp Arg Ile Glu Val Thr Arg Leu430 435 440 445aac ggc acc tcc cgc aag atc ctg gtg tcg gag gac ctg gac gag ccc 1453Asn Gly Thr Ser Arg Lys Ile Leu Val Ser Glu Asp Leu Asp Glu Pro
450 455 460cga gcc atc gca ctg cac ccc gtg atg ggc ctc atg tac tgg aca gac 1501Arg Ala Ile Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp
465 470 475tgg gga gag aac cct aaa atc gag tgt gcc aac ttg gat ggg cag gag 1549Trp Gly Glu Asn Pro Lys Ile Glu Cys Ala Asn Leu Asp Gly Gln Glu
480 485 490cgg cgt gtg ctg gtc aat gcc tcc ctc ggg tgg ccc aac ggc ctg gcc 1597Arg Arg Val Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala
495 500 505ctg gac ctg cag gag ggg aag ctc tac tgg gga gac gcc aag aca gac 1645Leu Asp Leu Gln Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp510 515 520 525aag atc gag gtg atc aat gtt gat ggg acg aag agg cgg acc ctc ctg 1693Lys Ile Glu Val Ile Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu
530 535 540gag gac aag ctc ccg cac att ttc ggg ttc acg ctg ctg ggg gac ttc 1741Glu Asp Lys Leu Pro His Ile Phe Gly Phe Thr Leu Leu Gly Asp Phe
545 550 555atc tac tgg act gac tgg cag cgc cgc agc atc gag cgg gtg cac aag 1789Ile Tyr Trp Thr Asp Trp Gln Arg Arg Ser Ile Glu Arg Val His Lys
560 565 570gtc aag gcc agc cgg gac gtc atc att gac cag ctg ccc gac ctg atg 1837Val Lys Ala Ser Arg Asp Val Ile Ile Asp Gln Leu Pro Asp Leu Met
575 580 585ggg ctc aaa gct gtg aat gtg gcc aag gtc gtc gga acc aac ccg tgt 1885Gly Leu Lys Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys590 595 600 605gcg gac agg aac ggg ggg tgc agc cac ctg tgc ttc ttc aca ccc cac 1933Ala Asp Arg Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His
610 615 620gca acc cgg tgt ggc tgc ccc atc ggc ctg gag ctg ctg agt gac atg 1981Ala Thr Arg Cys Gly Cys Pro Ile Gly Leu Glu Leu Leu Ser Asp Met
625 630 635aag acc tgc atc gtg cct gag gcc ttc ttg gtc ttc acc agc aga gcc 2029Lys Thr Cys Ile Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala
640 645 650gcc atc cac agg atc tcc ctc gag acc aat aac aac gac gtg gcc atc 2077Ala Ile His Arg Ile Ser Leu Glu Thr Asn Asn Asn Asp Val Ala Ile
655 660 665ccg ctc acg ggc gtc aag gag gcc tca gcc ctg gac ttt gat gtg tcc 2125Pro Leu Thr Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser670 675 680 685aac aac cac atc tac tgg aca gac gtc agc ctg aag acc atc agc cgc 2173Asn Asn His Ile Tyr Trp Thr Asp Val Ser Leu Lys Thr Ile Ser Arg
690 695 700gcc ttc atg aac ggg agc tcg gtg gag cac gtg gtg gag ttt ggc ctt 2221Ala Phe Met Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu
705 710 715gac tac ccc gag ggc atg gcc gtt gac tgg atg ggc aag aac ctc tac 2269Asp Tyr Pro Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr
720 725 730tgg gcc gac act ggg acc aac aga atc gaa gtg gcg cgg ctg gac ggg 2317Trp Ala Asp Thr Gly Thr Asn Arg Ile Glu Val Ala Arg Leu Asp Gly
735 740 745cag ttc cgg caa gtc ctc gtg tgg agg gac ttg gac aac ccg agg tcg 2365Gln Phe Arg Gln Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser750 755 760 765ctg gcc ctg gat ccc acc aag ggc tac atc tac tgg acc gag tgg ggc 2413Leu Ala Leu Asp Pro Thr Lys Gly Tyr Ile Tyr Trp Thr Glu Trp Gly
770 775 780ggc aag ccg agg atc gtg cgg gcc ttc atg gac ggg acc aac tgc atg 2461Gly Lys Pro Arg Ile Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met
785 790 795acg ctg gtg gac aag gtg ggc cgg gcc aac gac ctc acc att gac tac 2509Thr Leu Val Asp Lys Val Gly Arg Ala Asn Asp Leu Thr Ile Asp Tyr
800 805 810gct gac cag cgc ctc tac tgg acc gac ctg gac acc aac atg atc gag 2557Ala Asp Gln Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met Ile Glu
815 820 825tcg tcc aac atg ctg ggt cag gag cgg gtc gtg att gcc gac gat ctc 2605Ser Ser Asn Met Leu Gly Gln Glu Arg Val Val Ile Ala Asp Asp Leu830 835 840 845ccg cac ccg ttc ggt ctg acg cag tac agc gat tat atc tac tgg aca 2653Pro His Pro Phe Gly Leu Thr Gln Tyr Ser Asp Tyr Ile Tyr Trp Thr
850 855 860gac tgg aat ctg cac agc att gag cgg gcc gac aag act agc ggc cgg 2701Asp Trp Asn Leu His Ser Ile Glu Arg Ala Asp Lys Thr Ser Gly Arg
865 870 875aac cgc acc ctc atc cag ggc cac ctg gac ttc gtg atg gac atc ctg 2749Asn Arg Thr Leu Ile Gln Gly His Leu Asp Phe Val Met Asp Ile Leu
880 885 890gtg ttc cac tcc tcc cgc cag gat ggc ctc aat gac tgt atg cac aac 2797Val Phe His Ser Ser Arg Gln Asp Gly Leu Asn Asp Cys Met His Asn
895 900 905aac ggg cag tgt ggg cag ctg tgc ctt gcc atc ccc ggc ggc cac cgc 2845Asn Gly Gln Cys Gly Gln Leu Cys Leu Ala Ile Pro Gly Gly His Arg910 915 920 925tgc ggc tgc gcc tca cac tac acc ctg gac ccc agc agc cgc aac tgc 2893Cys Gly Cys Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys
930 935 940agc ccg ccc acc acc ttc ttg ctg ttc agc cag aaa tct gcc atc agt 2941Ser Pro Pro Thr Thr Phe Leu Leu Phe Ser Gln Lys Ser Ala Ile Ser
945 950 955cgg atg atc ccg gac gac cag cac agc ccg gat ctc atc ctg ccc ctg 2989Arg Met Ile Pro Asp Asp Gln His Ser Pro Asp Leu Ile Leu Pro Leu
960 965 970cat gga ctg agg aac gtc aaa gcc atc gac tat gac cca ctg gac aag 3037His Gly Leu Arg Asn Val Lys Ala Ile Asp Tyr Asp Pro Leu Asp Lys
975 980 985ttc atc tac tgg gtg gat ggg cgc cag aac atc aag cga gcc aag gac 3085Phe Ile Tyr Trp Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp990 995 1000 1005gac ggg acc cag ccc ttt gtt ttg acc tct ctg agc caa ggc caa aac 3133Asp Gly Thr Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn
1010 1015 1020cca gac agg cag ccc cac gac ctc agc atc gac atc tac agc cgg aca 3181Pro Asp Arg Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr
1025 1030 1035ctg ttc tgg acg tgc gag gcc acc aat acc atc aac gtc cac agg ctg 3229Leu Phe Trp Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu
1040 1045 1050agc ggg gaa gcc atg ggg gtg gtg ctg cgt ggg gac cgc gac aag ccc 3277Ser Gly Glu Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro
1055 1060 1065agg gcc atc gtc gtc aac gcg gag cga ggg tac ctg tac ttc acc aac 3325Arg Ala Ile Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn1070 1075 1080 1085atg cag gac cgg gca gcc aag atc gaa cgc gca gcc ctg gac ggc acc 3373Met Gln Asp Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr
1090 1095 1100gag cgc gag gtc ctc ttc acc acc ggc ctc atc cgc cct gtg gcc ctg 3421Glu Arg Glu Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu
1105 1110 1115gtg gtg gac aac aca ctg ggc aag ctg ttc tgg gtg gac gcg gac ctg 3469Val Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu
1120 1125 1130aag cgc att gag agc tgt gac ctg tca ggg gcc aac cgc ctg acc ctg 3517Lys Arg Ile Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu
1135 1140 1145gag gac gcc aac atc gtg cag cct ctg ggc ctg acc atc ctt ggc aag 3565Glu Asp Ala Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys1150 1155 1160 1165cat ctc tac tgg atc gac cgc cag cag cag atg atc gag cgt gtg gag 3613His Leu Tyr Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu
1170 1175 1180aag acc acc ggg gac aag cgg act cgc atc cag ggc cgt gtc gcc cac 3661Lys Thr Thr Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His
1185 1190 1195ctc act ggc atc cat gca gtg gag gaa gtc agc ctg gag gag ttc tca 3709Leu Thr Gly Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser
1200 1205 1210gcc cac cca tgt gcc cgt gac aat ggt ggc tgc tcc cac atc tgt att 3757Ala His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile1215 1220 1225gcc aag ggt gat ggg aca cca cgg tgc tca tgc cca gtc cac ctc gtg 3805Ala Lys Gly Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val1230 1235 1240 1245ctc ctg cag aac ctg ctg acc tgt gga gag ccg ccc acc tgc tcc ccg 3853Leu Leu Gln Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro
1250 1255 1260gac cag ttt gca tgt gcc aca ggg gag atc gac tgt atc ccc ggg gcc 3901Asp Gln Phe Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala
1265 1270 1275tgg cgc tgt gac ggc ttt ccc gag tgc gat gac cag agc gac gag gag 3949Trp Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu
1280 1285 1290ggc tgc ccc gtg tgc tcc gcc gcc cag ttc ccc tgc gcg cgg ggt cag 3997Gly Cys Pro Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln
1295 1300 1305tgt gtg gac ctg cgc ctg cgc tgc gac ggc gag gca gac tgt cag gac 4045Cys Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp1310 1315 1320 1325cgc tca gac gag gtg gac tgt gac gcc atc tgc ctg ccc aac cag ttc 4093Arg Ser Asp Glu Val Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe
1330 1335 1340cgg tgt gcg agc ggc cag tgt gtc ctc atc aaa cag cag tgc gac tcc 4141Arg Cys Ala Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser
1345 1350 1355ttc ccc gac tgt atc gac ggc tcc gac gag ctc atg tgt gaa atc acc 4189Phe Pro Asp Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr
1360 1365 1370aag ccg ccc tca gac gac agc ccg gcc cac agc agt gcc atc ggg ccc 4237Lys Pro Pro Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro
1375 1380 1385gtc att ggc atc atc ctc tct ctc ttc gtc atg ggt ggt gtc tat ttt 4285Val Ile Gly Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe1390 1395 1400 1405gtg tgc cag cgc gtg gtg tgc cag cgc tat gcg ggg gcc aac ggg ccc 4333Val Cys Gln Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro
1410 1415 1420ttc ccg cac gag tat gtc agc ggg acc ccg cac gtg ccc ctc aat ttc 4381Phe Pro His Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe
1425 1430 1435ata gcc ccg ggc ggt tcc cag cat ggc ccc ttc aca ggc atc gca tgc 4429Ile Ala Pro Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys
1440 1445 1450gga aag tcc atg atg agc tcc gtg agc ctg atg ggg ggc cgg ggc ggg 4477Gly Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly
1455 1460 1465gtg ccc ctc tac gac cgg aac cac gtc aca ggg gcc tcg tcc agc agc 4525Val Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser1470 1475 1480 1485tcg tcc agc acg aag gcc acg ctg tac ccg ccg atc ctg aac ccg ccg 4573Ser Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro
1490 1495 1500ccc tcc ccg gcc acg gac ccc tcc ctg tac aac atg gac atg ttc tac 4621Pro Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr
1505 1510 1515tct tca aac att ccg gcc act gcg aga ccg tac agg ccc tac atc att 4669Ser Ser Asn Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile
1520 1525 1530cga gga atg gcg ccc ccg acg acg ccc tgc agc acc gac gtg tgt gac 4717Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp
1535 1540 1545agc gac tac agc gcc agc cgc tgg aag gcc agc aag tac tac ctg gat 4765Ser Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp1550 1555 1560 1565ttg aac tcg gac tca gac ccc tat cca ccc cca ccc acg ccc cac agc 4813Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser
1570 1575 1580cag tac ctg tcg gcg gag gac agc tgc ccg ccc tcg ccc gcc acc gag 4861Gln Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu
1585 1590 1595agg agc tac ttc cat ctc ttc ccg ccc cct ccg tcc ccc tgc acg gac 4909Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp
1600 1605 1610tca tcc tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 4965Ser Ser
1615aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 5025taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag 5085tggagaaata tttataaact taattttgta aaaca 5120<210>2<211>5120<212>DNA<213>人(Homo sapiens)<400>2actaaagcgc cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc 60gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 109
Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu
1 5 10ctg ctg ctg ctg ctg ctg ctg gcg ctg tgc ggc tgc ccg gcc ccc gcc 157Leu Leu Leu Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala
15 20 25gcg gcc tcg ccg ctc ctg cta ttt gcc aac cgc cgg gac gta cgg ctg 205Ala Ala Ser Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu30 35 40 45gtg gac gcc ggc gga gtc aag ctg gag tcc acc atc gtg gtc agc ggc 253Val Asp Ala Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly
50 55 60ctg gag gat gcg gcc gca gtg gac ttc cag ttt tcc aag gga gcc gtg 301Leu Glu Asp Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val
65 70 75tac tgg aca gac gtg agc gag gag gcc atc aag cag acc tac ctg aac 349Tyr Trp Thr Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn
80 85 90cag acg ggg gcc gcc gtg cag aac gtg gtc atc tcc ggc ctg gtc tct 397Gln Thr Gly Ala Ala Val Gln Asn Val Val Ile Ser Gly Leu Val Ser
95 100 105ccc gac ggc ctc gcc tgc gac tgg gtg ggc aag aag ctg tac tgg acg 445Pro Asp Gly Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr110 115 120 125gac tca gag acc aac cgc atc gag gtg gcc aac ctc aat ggc aca tcc 493Asp Ser Glu Thr Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser
130 135 140cgg aag gtg ctc ttc tgg cag gac ctt gac cag ccg agg gcc atc gcc 541Arg Lys Val Leu Phe Trp Gln Asp Leu Asp Gln Pro Arg Ala Ile Ala
145 150 155ttg gac ccc gct cac ggg tac atg tac tgg aca gac tgg gtt gag acg 589Leu Asp Pro Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Val Glu Thr
160 165 170ccc cgg att gag cgg gca ggg atg gat ggc agc acc cgg aag atc att 637Pro Arg Ile Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile
175 180 185gtg gac tcg gac att tac tgg ccc aat gga ctg acc atc gac ctg gag 685Val Asp Ser Asp Ile Tyr Trp Pro Asn Gly Leu Thr Ile Asp Leu Glu190 195 200 205gag cag aag ctc tac tgg gct gac gcc aag ctc agc ttc atc cac cgt 733Glu Gln Lys Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ile His Arg
210 215 220gcc aac ctg gac ggc tcg ttc cgg cag aag gtg gtg gag ggc agc ctg 781Ala Asn Leu Asp Gly Ser Phe Arg Gln Lys Val Val Glu Gly Ser Leu
225 230 235acg cac ccc ttc gcc ctg acg ctc tcc ggg gac act ctg tac tgg aca 829Thr His Pro Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr
240 245 250gac tgg cag acc cgc tcc atc cat gcc tgc aac aag cgc act ggg ggg 877Asp Trp Gln Thr Arg Ser Ile His Ala Cys Asn Lys Arg Thr Gly Gly
255 260 265aag agg aag gag atc ctg agt gcc ctc tac tca ccc atg gac atc cag 925Lys Arg Lys Glu Ile Leu Ser Ala Leu Tyr Ser Pro Met Asp Ile Gln270 275 280 285gtg ctg agc cag gag cgg cag cct ttc ttc cac act cgc tgt gag gag 973Val Leu Ser Gln Glu Arg Gln Pro Phe Phe His Thr Arg Cys Glu Glu
290 295 300gac aat ggc ggc tgc tcc cac ctg tgc ctg ctg tcc cca agc gag cct 1021Asp Asn Gly Gly Cys Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro
305 310 315ttc tac aca tgc gcc tgc ccc acg ggt gtg cag ctg cag gac aac ggc 1069Phe Tyr Thr Cys Ala Cys Pro Thr Gly Val Gln Leu Gln Asp Asn Gly
320 325 330agg acg tgt aag gca gga gcc gag gag gtg ctg ctg ctg gcc cgg cgg 1117Arg Thr Cys Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg
335 340 345acg gac cta cgg agg atc tcg ctg gac acg ccg gac ttc acc gac atc 1165Thr Asp Leu Arg Arg Ile Ser Leu Asp Thr Pro Asp Phe Thr Asp Ile350 355 360 365gtg ctg cag gtg gac gac atc cgg cac gcc att gcc atc gac tac gac 1213Val Leu Gln Val Asp Asp Ile Arg His Ala Ile Ala Ile Asp Tyr Asp
370 375 380ccg cta gag ggc tat gtc tac tgg aca gat gac gag gtg cgg gcc atc 1261Pro Leu Glu Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Ile
385 390 395cgc agg gcg tac ctg gac ggg tct ggg gcg cag acg ctg gtc aac acc 1309Arg Arg Ala Tyr Leu Asp Gly Ser Gly Ala Gln Thr Leu Val Asn Thr
400 405 410gag atc aac gac ccc gat ggc atc gcg gtc gac tgg gtg gcc cga aac 1357Glu Ile Asn Asp Pro Asp Gly Ile Ala Val Asp Trp Val Ala Arg Asn
415 420 425ctc tac tgg acc gac acg ggc acg gac cgc atc gag gtg acg cgc ctc 1405Leu Tyr Trp Thr Asp Thr Gly Thr Asp Arg Ile Glu Val Thr Arg Leu430 435 440 445aac ggc acc tcc cgc aag atc ctg gtg tcg gag gac ctg gac gag ccc 1453Asn Gly Thr Ser Arg Lys Ile Leu Val Ser Glu Asp Leu Asp Glu Pro
450 455 460cga gcc atc gca ctg cac ccc gtg atg ggc ctc atg tac tgg aca gac 1501Arg Ala Ile Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp
465 470 475tgg gga gag aac cct aaa atc gag tgt gcc aac ttg gat ggg cag gag 1549Trp Gly Glu Asn Pro Lys Ile Glu Cys Ala Asn Leu Asp Gly Gln Glu
480 485 490cgg cgt gtg ctg gtc aat gcc tcc ctc ggg tgg ccc aac ggc ctg gcc 1597Arg Arg Val Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala
495 500 505ctg gac ctg cag gag ggg aag ctc tac tgg gga gac gcc aag aca gac 1645Leu Asp Leu Gln Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp510 515 520 525aag atc gag gtg atc aat gtt gat ggg acg aag agg cgg acc ctc ctg 1693Lys Ile Glu Val Ile Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu
530 535 540gag gac aag ctc ccg cac att ttc ggg ttc acg ctg ctg ggg gac ttc 1741Glu Asp Lys Leu Pro His Ile Phe Gly Phe Thr Leu Leu Gly Asp Phe
545 550 555atc tac tgg act gac tgg cag cgc cgc agc atc gag cgg gtg cac aag 1789Ile Tyr Trp Thr Asp Trp Gln Arg Arg Ser Ile Glu Arg Val His Lys
560 565 570gtc aag gcc agc cgg gac gtc atc att gac cag ctg ccc gac ctg atg 1837Val Lys Ala Ser Arg Asp Val Ile Ile Asp Gln Leu Pro Asp Leu Met
575 580 585ggg ctc aaa gct gtg aat gtg gcc aag gtc gtc gga acc aac ccg tgt 1885Gly Leu Lys Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys590 595 600 605gcg gac agg aac ggg ggg tgc agc cac ctg tgc ttc ttc aca ccc cac 1933Ala Asp Arg Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His
610 615 620gca acc cgg tgt ggc tgc ccc atc ggc ctg gag ctg ctg agt gac atg 1981Ala Thr Arg Cys Gly Cys Pro Ile Gly Leu Glu Leu Leu Ser Asp Met
625 630 635aag acc tgc atc gtg cct gag gcc ttc ttg gtc ttc acc agc aga gcc 2029Lys Thr Cys Ile Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala
640 645 650gcc atc cac agg atc tcc ctc gag acc aat aac aac gac gtg gcc atc 2077Ala Ile His Arg Ile Ser Leu Glu Thr Asn Asn Asn Asp Val Ala Ile
655 660 665ccg ctc acg ggc gtc aag gag gcc tca gcc ctg gac ttt gat gtg tcc 2125Pro Leu Thr Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser670 675 680 685aac aac cac atc tac tgg aca gac gtc agc ctg aag acc atc agc cgc 2173Asn Asn His Ile Tyr Trp Thr Asp Val Ser Leu Lys Thr Ile Ser Arg
690 695 700gcc ttc atg aac ggg agc tcg gtg gag cac gtg gtg gag ttt ggc ctt 2221Ala Phe Met Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu
705 710 715gac tac ccc gag ggc atg gcc gtt gac tgg atg ggc aag aac ctc tac 2269Asp Tyr Pro Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr
720 725 730tgg gcc gac act ggg acc aac aga atc gaa gtg gcg cgg ctg gac ggg 2317Trp Ala Asp Thr Gly Thr Asn Arg Ile Glu Val Ala Arg Leu Asp Gly
735 740 745cag ttc cgg caa gtc ctc gtg tgg agg gac ttg gac aac ccg agg tcg 2365Gln Phe Arg Gln Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser750 755 760 765ctg gcc ctg gat ccc acc aag ggc tac atc tac tgg acc gag tgg ggc 2413Leu Ala Leu Asp Pro Thr Lys Gly Tyr Ile Tyr Trp Thr Glu Trp Gly
770 775 780ggc aag ccg agg atc gtg cgg gcc ttc atg gac ggg acc aac tgc atg 2461Gly Lys Pro Arg Ile Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met
785 790 795acg ctg gtg gac aag gtg ggc cgg gcc aac gac ctc acc att gac tac 2509Thr Leu Val Asp Lys Val Gly Arg Ala Asn Asp Leu Thr Ile Asp Tyr
800 805 810gct gac cag cgc ctc tac tgg acc gac ctg gac acc aac atg atc gag 2557Ala Asp Gln Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met Ile Glu
815 820 825tcg tcc aac atg ctg ggt cag gag cgg gtc gtg att gcc gac gat ctc 2605Ser Ser Asn Met Leu Gly Gln Glu Arg Val Val Ile Ala Asp Asp Leu830 835 840 845ccg cac ccg ttc ggt ctg acg cag tac agc gat tat atc tac tgg aca 2653Pro His Pro Phe Gly Leu Thr Gln Tyr Ser Asp Tyr Ile Tyr Trp Thr
850 855 860gac tgg aat ctg cac agc att gag cgg gcc gac aag act agc ggc cgg 2701Asp Trp Asn Leu His Ser Ile Glu Arg Ala Asp Lys Thr Ser Gly Arg
865 870 875aac cgc acc ctc atc cag ggc cac ctg gac ttc gtg atg gac atc ctg 2749Asn Arg Thr Leu Ile Gln Gly His Leu Asp Phe Val Met Asp Ile Leu
880 885 890gtg ttc cac tcc tcc cgc cag gat ggc ctc aat gac tgt atg cac aac 2797Val Phe His Ser Ser Arg Gln Asp Gly Leu Asn Asp Cys Met His Asn
895 900 905aac ggg cag tgt ggg cag ctg tgc ctt gcc atc ccc ggc ggc cac cgc 2845Asn Gly Gln Cys Gly Gln Leu Cys Leu Ala Ile Pro Gly Gly His Arg910 915 920 925tgc ggc tgc gcc tca cac tac acc ctg gac ccc agc agc cgc aac tgc 2893Cys Gly Cys Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys
930 935 940agc ccg ccc acc acc ttc ttg ctg ttc agc cag aaa tct gcc atc agt 2941Ser Pro Pro Thr Thr Phe Leu Leu Phe Ser Gln Lys Ser Ala Ile Ser
945 950 955cgg atg atc ccg gac gac cag cac agc ccg gat ctc atc ctg ccc ctg 2989Arg Met Ile Pro Asp Asp Gln His Ser Pro Asp Leu Ile Leu Pro Leu
960 965 970cat gga ctg agg aac gtc aaa gcc atc gac tat gac cca ctg gac aag 3037His Gly Leu Arg Asn Val Lys Ala Ile Asp Tyr Asp Pro Leu Asp Lys
975 980 985ttc atc tac tgg gtg gat ggg cgc cag aac atc aag cga gcc aag gac 3085Phe Ile Tyr Trp Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp990 995 1000 1005gac ggg acc cag ccc ttt gtt ttg acc tct ctg agc caa ggc caa aac 3133Asp Gly Thr Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn
1010 1015 1020cca gac agg cag ccc cac gac ctc agc atc gac atc tac agc cgg aca 3181Pro Asp Arg Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr
1025 1030 1035ctg ttc tgg acg tgc gag gcc acc aat acc atc aac gtc cac agg ctg 3229Leu Phe Trp Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu
1040 1045 1050agc ggg gaa gcc atg ggg gtg gtg ctg cgt ggg gac cgc gac aag ccc 3277Ser Gly Glu Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro
1055 1060 1065agg gcc atc gtc gtc aac gcg gag cga ggg tac ctg tac ttc acc aac 3325Arg Ala Ile Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn1070 1075 1080 1085atg cag gac cgg gca gcc aag atc gaa cgc gca gcc ctg gac ggc acc 3373Met Gln Asp Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr
1090 1095 1100gag cgc gag gtc ctc ttc acc acc ggc ctc atc cgc cct gtg gcc ctg 3421Glu Arg Glu Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu
1105 1110 1115gtg gtg gac aac aca ctg ggc aag ctg ttc tgg gtg gac gcg gac ctg 3469Val Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu
1120 1125 1130aag cgc att gag agc tgt gac ctg tca ggg gcc aac cgc ctg acc ctg 3517Lys Arg Ile Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu
1135 1140 1145gag gac gcc aac atc gtg cag cct ctg ggc ctg acc atc ctt ggc aag 3565Glu Asp Ala Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys1150 1155 1160 1165cat ctc tac tgg atc gac cgc cag cag cag atg atc gag cgt gtg gag 3613His Leu Tyr Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu
1170 1175 1180aag acc acc ggg gac aag cgg act cgc atc cag ggc cgt gtc gcc cac 3661Lys Thr Thr Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His
1185 1190 1195ctc act ggc atc cat gca gtg gag gaa gtc agc ctg gag gag ttc tca 3709Leu Thr Gly Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser
1200 1205 1210gcc cac cca tgt gcc cgt gac aat ggt ggc tgc tcc cac atc tgt att 3757Ala His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile
1215 1220 1225gcc aag ggt gat ggg aca cca cgg tgc tca tgc cca gtc cac ctc gtg 3805Ala Lys Gly Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val1230 1235 1240 1245ctc ctg cag aac ctg ctg acc tgt gga gag ccg ccc acc tgc tcc ccg 3853Leu Leu Gln Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro
1250 1255 1260gac cag ttt gca tgt gcc aca ggg gag atc gac tgt atc ccc ggg gcc 3901Asp Gln Phe Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala
1265 1270 1275tgg cgc tgt gac ggc ttt ccc gag tgc gat gac cag agc gac gag gag 3949Trp Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu
1280 1285 1290ggc tgc ccc gtg tgc tcc gcc gcc cag ttc ccc tgc gcg cgg ggt cag 3997Gly Cys Pro Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln
1295 1300 1305tgt gtg gac ctg cgc ctg cgc tgc gac ggc gag gca gac tgt cag gac 4045Cys Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp1310 1315 1320 1325cgc tca gac gag gtg gac tgt gac gcc atc tgc ctg ccc aac cag ttc 4093Arg Ser Asp Glu Val Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe
1330 1335 1340cgg tgt gcg agc ggc cag tgt gtc ctc atc aaa cag cag tgc gac tcc 4141Arg Cys Ala Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser
1345 1350 1355ttc ccc gac tgt atc gac ggc tcc gac gag ctc atg tgt gaa atc acc 4189Phe Pro Asp Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr
1360 1365 1370aag ccg ccc tca gac gac agc ccg gcc cac agc agt gcc atc ggg ccc 4237Lys Pro Pro Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro
1375 1380 1385gtc att ggc atc atc ctc tct ctc ttc gtc atg ggt ggt gtc tat ttt 4285Val Ile Gly Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe1390 1395 1400 1405gtg tgc cag cgc gtg gtg tgc cag cgc tat gcg ggg gcc aac ggg ccc 4333Val Cys Gln Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro
1410 1415 1420ttc ccg cac gag tat gtc agc ggg acc ccg cac gtg ccc ctc aat ttc 4381Phe Pro His Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe
1425 1430 1435ata gcc ccg ggc ggt tcc cag cat ggc ccc ttc aca ggc atc gca tgc 4429Ile Ala Pro Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys
1440 1445 1450gga aag tcc atg atg agc tcc gtg agc ctg atg ggg ggc cgg ggc ggg 4477Gly Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly
1455 1460 1465gtg ccc ctc tac gac cgg aac cac gtc aca ggg gcc tcg tcc agc agc 4525Val Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser1470 1475 1480 1485tcg tcc agc acg aag gcc acg ctg tac ccg ccg atc ctg aac ccg ccg 4573Ser Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro
1490 1495 1500ccc tcc ccg gcc acg gac ccc tcc ctg tac aac atg gac atg ttc tac 4621Pro Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr
1505 1510 1515tct tca aac att ccg gcc act gcg aga ccg tac agg ccc tac atc att 4669Ser Ser Asn Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile
1520 1525 1530cga gga atg gcg ccc ccg acg acg ccc tgc agc acc gac gtg tgt gac 4717Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp
1535 1540 1545agc gac tac agc gcc agc cgc tgg aag gcc agc aag tac tac ctg gat 4765Ser Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp1550 1555 1560 1565ttg aac tcg gac tca gac ccc tat cca ccc cca ccc acg ccc cac agc 4813Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser
1570 1575 1580cag tac ctg tcg gcg gag gac agc tgc ccg ccc tcg ccc gcc acc gag 4861Gln Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu
1585 1590 1595agg agc tac ttc cat ctc ttc ccg ccc cct ccg tcc ccc tgc acg gac 4909Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp
1600 1605 1610tca tcc tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 4965Ser Ser
1615aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 5025taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag 5085tggagaaata tttataaact taattttgta aaaca 5120<210>3<211>1615<212>PRT<213>人(Homo sapiens)<400>3Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu1 5 10 15Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser
20 25 30Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala
35 40 45Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly Leu Glu Asp
50 55 60Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val Tyr Trp Thr65 70 75 80Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn Gln Thr Gly
85 90 95Ala Ala Val Gln Asn Val Val Ile Ser Gly Leu Val Ser Pro Asp Gly
100 105 110Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu
115 120 125Thr Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val
130 135 140Leu Phe Trp Gln Asp Leu Asp Gln Pro Lys Ala Ile Ala Leu Asp Pro145 150 155 160Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Thr Pro Arg Ile
165 170 175Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile Val Asp Ser
180 185 190Asp Ile Tyr Trp Pro Asn Gly Leu Thr Ile Asp Leu Glu Glu Gln Lys
195 200 205Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ile His Arg Ala Asn Leu
210 215 220Asp Gly Ser Phe Arg Gln Lys Val Val Glu Gly Ser Leu Thr His Pro225 230 235 240Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gln
245 250 255Thr Arg Ser Ile His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys
260 265 270Glu Ile Leu Ser Ala Leu Tyr Ser Pro Met Asp Ile Gln Val Leu Ser
275 280 285Gln Glu Arg Gln Pro Phe Phe His Thr Arg Cys Glu Glu Asp Asn Gly
290 295 300Gly Trp Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro Phe Tyr Thr305 310 315 320Cys Ala Cys Pro Thr Gly Val Gln Met Gln Asp Asn Gly Arg Thr Cys
325 330 335Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg Thr Asp Leu
340 345 350Arg Arg Ile Ser Leu Asp Thr Pro Asp Phe Thr Asp Ile Val Leu Gln
355 360 365Val Asp Asp Ile Arg His Ala Ile Ala Ile Asp Tyr Asp Pro Leu Glu
370 375 380Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Ile ArgArg Ala385 390 395 400Tyr Leu Asp Gly Ser Gly Ala Gln Thr Leu Val Asn Thr Glu Ile Asn
405 410 415Asp Pro Asp Gly Ile Ala Val Asp Trp Val Ala Arg Asn Leu Tyr Trp
420 425 430Thr Asp Thr Gly Thr Asp Arg Ile Glu Val Thr Arg Leu Asn Gly Thr
435 440 445Ser Arg Lys Ile Leu Val Ser Glu Asp Leu Asp Glu Pro Arg Ala Ile
450 455 460Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp Trp Gly Glu465 470 475 480Asn Pro Lys Ile Glu Cys Ala Asn Leu Asp Gly Gln Glu Arg Arg Val
485 490 495Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala Leu Asp Leu
500 505 510Gln Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp Lys Ile Glu
515 520 525Val Ile Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys
530 535 540Leu Pro His Ile Phe Gly Phe Thr Leu Leu Gly Asp Phe Ile Tyr Trp545 550 555 560Thr Asp Trp Gln Arg Arg Ser Ile Glu Arg Val His Lys Val Lys Ala
565 570 575Ser Arg Asp Val Ile Ile Asp Gln Leu Pro Asp Leu Met Gly Leu Lys
580 585 590Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Arg
595 600 605Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His Ala Thr Arg
610 615 620Cys Gly Cys Pro Ile Gly Leu Glu Leu Leu Ser Asp Met Lys Thr Cys625 630 635 640Ile Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala Ala Ile His
645 650 655Arg Ile Ser Leu Glu Thr Asn Asn Asn Asp Val Ala Ile Pro Leu Thr
660 665 670Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser Asn Asn His
675 680 685Ile Tyr Trp Thr Asp Val Ser Leu Lys Asn Ile Ser Arg Ala Phe Met
690 695 700Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu Asp Tyr Pro705 710 715 720Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr Trp Ala Asp
725 730 735Thr Gly Thr Asn Arg Ile Glu Val Ala Arg Leu Asp Gly Gln Phe Arg
740 745 750Gln Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser Leu Ala Leu
755 760 765Asp Pro Thr Lys Gly Tyr Ile Tyr Trp Thr Glu Trp Gly Gly Lys Pro
770 775 780Arg Ile Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met Thr Leu Val785 790 795 800Asp Lys Val Gly Arg Ala Asn Asp Leu Thr Ile Asp Tyr Ala Asp Gln
805 810 815Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met Ile Glu Ser Ser Asn
820 825 830Met Leu Gly Gln Glu Arg Val Val Ile Ala Asp Asp Leu Pro His Pro
835 840 845Phe Gly Leu Thr Gln Tyr Ser Asp Tyr Ile Tyr Trp Thr Asp Trp Asn
850 855 860Leu His Ser Ile Glu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr865 870 875 880Leu Ile Gln Gly His Leu Asp Phe Val Met Asp Ile Leu Val Phe His
885 890 895Ser Ser Arg Gln Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gln
900 905 910Cys Gly Gln Leu Cys Leu Ala Ile Pro Gly Gly His Arg Cys Gly Cys
915 920 925Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro
930 935 940Thr Thr Phe Leu Leu Phe Ser Gln Lys Ser Ala Ile Ser Arg Met Ile945 950 955 960Pro Asp Asp Gln His Ser Pro Asp Leu Ile Leu Pro Leu His Gly Leu
965 970 975Arg Asn Val Lys Ala Ile Asp Tyr Asp Pro Leu Asp Lys Phe Ile Tyr
980 985 990Trp Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp Asp Gly Thr
995 1000 1005Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn Pro Asp Arg
1010 1015 1020Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr Leu Phe Trp1025 1030 1035 1040Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu Ser Gly Glu
1045 1050 1055Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala Ile
1060 1065 1070Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gln Asp
1075 1080 1085Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu
1090 1095 1100Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu Val Val Asp1105 1110 1115 1120Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg Ile
1125 1130 1135Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala
1140 1145 1150Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys His Leu Tyr
1155 1160 1165Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu Lys Thr Thr
1170 1175 1180Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His Leu Thr Gly1185 1190 1195 1200Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro
1205 1210 1215Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile Ala Lys Gly
1220 1225 1230Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gln
1235 1240 1245Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gln Phe
1250 1255 1260Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala Trp Arg Cys1265 1270 1275 1280Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu Gly Cys Pro
1285 1290 1295Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln Cys Val Asp
1300 1305 1310Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp Arg Ser Asp
1315 1320 1325Glu Val Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe Arg Cys Ala
1330 1335 1340Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser Phe Pro Asp1345 1350 1355 1360Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr Lys Pro Pro
1365 1370 1375Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro Val Ile Gly
1380 1385 1390Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gln
1395 1400 1405Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His
1410 1415 1420Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe Ile Ala Pro1425 1430 1435 1440Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys Gly Lys Ser
1445 1450 1455Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu
1460 1465 1470Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser
1475 1480 1485Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro Pro Ser Pro
1490 1495 1500Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn1505 1510 1515 1520Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile Arg Gly Met
1525 1530 1535Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr
1540 1545 1550Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser
1555 1560 1565Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gln Tyr Leu
1570 1575 1580Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr1585 1590 1595 1600Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser
1605 1610 1615<210>4<211>1615<212>PRT<213>人(Homo sapiens)<400>4Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu1 5 10 15Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser
20 25 30Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala
35 40 45Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly Leu Glu Asp
50 55 60Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val Tyr Trp Thr65 70 75 80Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn Gln Thr Gly
85 90 95Ala Ala Val Gln Asn Val Val Ile Ser Gly Leu Val Ser Pro Asp Gly
100 105 110Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu
115 120 125Thr Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val
130 135 140Leu Phe Trp Gln Asp Leu Asp Gln Pro Lys Ala Ile Ala Leu Asp Pro145 150 155 160Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Val Glu Thr Pro Arg Ile
165 170 175Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile Val Asp Ser
180 185 190Asp Ile Tyr Trp Pro Asn Gly Leu Thr Ile Asp Leu Glu Glu Gln Lys
195 200 205Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ile His Arg Ala Asn Leu
210 215 220Asp Gly Ser Phe Arg Gln Lys Val Val Glu Gly Ser Leu Thr His Pro225 230 235 240Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gln
245 250 255Thr Arg Ser Ile His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys
260 265 270Glu Ile Leu Ser Ala Leu Tyr Ser Pro Met Asp Ile Gln Val Leu Ser
275 280 285Gln Glu Arg Gln Pro Phe Phe His Thr Arg Cys Glu Glu Asp Asn Gly
290 295 300Gly Trp Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro Phe Tyr Thr305 310 315 320Cys Ala Cys Pro Thr Gly Val Gln Met Gln Asp Asn Gly Arg Thr Cys
325 330 335Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg Thr Asp Leu
340 345 350Arg Arg Ile Ser Leu Asp Thr Pro Asp Phe Thr Asp Ile Val Leu Gln
355 360 365Val Asp Asp Ile Arg His Ala Ile Ala Ile Asp Tyr Asp Pro Leu Glu
370 375 380Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Ile Arg Arg Ala385 390 395 400Tyr Leu Asp Gly Ser Gly Ala Gln Thr Leu Val Asn Thr Glu Ile Asn
405 410 415Asp Pro Asp Gly Ile Ala Val Asp Trp Val Ala Arg Asn Leu Tyr Trp
420 425 430Thr Asp Thr Gly Thr Asp Arg Ile Glu Val Thr Arg Leu Asn Gly Thr
435 440 445Ser Arg Lys Ile Leu Val Ser Glu Asp Leu Asp Glu Pro Arg Ala Ile
450 455 460Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp Trp Gly Glu465 470 475 480Asn Pro Lys Ile Glu Cys Ala Asn Leu Asp Gly Gln Glu Arg Arg Val
485 490 495Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala Leu Asp Leu
500 505 510Gln Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp Lys Ile Glu
515 520 525Val Ile Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys
530 535 540Leu Pro His Ile Phe Gly Phe Thr Leu Leu Gly Asp Phe Ile Tyr Trp545 550 555 560Thr Asp Trp Gln Arg Arg Ser Ile Glu Arg Val His Lys Val Lys Ala
565 570 575Ser Arg Asp Val Ile Ile Asp Gln Leu Pro Asp Leu Met Gly Leu Lys
580 585 590Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Arg
595 600 605Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His Ala Thr Arg
610 615 620Cys Gly Cys Pro Ile Gly Leu Glu Leu Leu Ser Asp Met Lys Thr Cys625 630 635 640Ile Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala Ala Ile His
645 650 655Arg Ile Ser Leu Glu Thr Asn Asn Asn Asp Val Ala Ile Pro Leu Thr
660 665 670Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser Asn Asn His
675 680 685Ile Tyr Trp Thr Asp Val Ser Leu Lys Asn Ile Ser Arg Ala Phe Met
690 695 700Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu Asp Tyr Pro705 710 715 720Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr Trp Ala Asp
725 730 735Thr Gly Thr Asn Arg Ile Glu Val Ala Arg Leu Asp Gly Gln Phe Arg
740 745 750Gln Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser Leu Ala Leu
755 760 765Asp Pro Thr Lys Gly Tyr Ile Tyr Trp Thr Glu Trp Gly Gly Lys Pro
770 775 780Arg Ile Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met Thr Leu Val785 790 795 800Asp Lys Val Gly Arg Ala Asn Asp Leu Thr Ile Asp Tyr Ala Asp Gln
805 810 815Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met Ile Glu Ser Ser Asn
820 825 830Met Leu Gly Gln Glu Arg Val Val Ile Ala Asp Asp Leu Pro His Pro
835 840 845Phe Gly Leu Thr Gln Tyr Ser Asp Tyr Ile Tyr Trp Thr Asp Trp Asn
850 855 860Leu His Ser Ile Glu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr865 870 875 880Leu Ile Gln Gly His Leu Asp Phe Val Met Asp Ile Leu Val Phe His
885 890 895Ser Ser Arg Gln Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gln
900 905 910Cys Gly Gln Leu Cys Leu Ala Ile Pro Gly Gly His Arg Cys Gly Cys
915 920 925Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro
930 935 940Thr Thr Phe Leu Leu Phe Ser Gln Lys Ser Ala Ile Ser Arg Met Ile945 950 955 960Pro Asp Asp Gln His Ser Pro Asp Leu Ile Leu Pro Leu His Gly Leu
965 970 975Arg Asn Val Lys Ala Ile Asp Tyr Asp Pro Leu Asp Lys Phe Ile Tyr
980 985 990Trp Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp Asp Gly Thr
995 1000 1005Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn Pro Asp Arg
1010 1015 1020Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr Leu Phe Trp1025 1030 1035 1040Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu Ser Gly Glu
1045 1050 1055Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala Ile
1060 1065 1070Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gln Asp
1075 1080 1085Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu
1090 1095 1100Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu Val Val Asp1105 1110 1115 1120Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg Ile
1125 1130 1135Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala
1140 1145 1150Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys His Leu Tyr
1155 1160 1165Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu Lys Thr Thr
1170 1175 1180Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His Leu Thr Gly1185 1190 1195 1200Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro
1205 1210 1215Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile Ala Lys Gly
1220 1225 1230Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gln
1235 1240 1245Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gln Phe
1250 1255 1260Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala Trp Arg Cys1265 1270 1275 1280Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu Gly Cys Pro
1285 1290 1295Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln Cys Val Asp
1300 1305 1310Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp Arg Ser Asp
1315 1320 1325Glu Val Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe Arg Cys Ala
1330 1335 1340Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser Phe Pro Asp1345 1350 1355 1360Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr Lys Pro Pro
1365 1370 1375Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro Val Ile Gly
1380 1385 1390Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gln
1395 1400 1405Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His
1410 1415 1420Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe Ile Ala Pro1425 1430 1435 1440Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys Gly Lys Ser
1445 1450 1455Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu
1460 1465 1470Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser
1475 1480 1485Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro Pro Ser Pro
1490 1495 1500Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn1505 1510 1515 1520Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile Arg Gly Met
1525 1530 1535Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr
1540 1545 1550Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser
1555 1560 1565Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gln Tyr Leu
1570 1575 1580Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr1585 1590 1595 1600Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser
1605 1610 1615<210>5<211>3096<212>DNA<213>人(Homo sapiens)<400>5catcttctca cacgatctct cgcttcgcac tccttccttt gattggtttt caccatttac 60tcagacgacg gtccttcttc gatctttgca cattcttcta tcatctacta ccttcatacc 120cagctccgtc ccctaatatt catgcgcgga tggcccattc cgtggtgaaa attcccttct 180actctgctaa tctgctgttc tctctccctc ccgtcgggtt ctgctcctgc cacgttctcc 240cctctcccca ccaaaggctg ggttttcttt gtcagggctc ctttcccctt tggaagaagg 300ggggctgtat ggccttggtg cgaggccctc cagtgacagg atcccccatc acccagagtt 360ccacaggccc tggtagggag gagggggagc agaagaggag gtgccatctt tgcctgctgg 420ggaagggcag gggccaccca cacagagctc tcccatttgc tgtggaccct ggggccactg 480cccagttcct tccaaaggaa agccagctcc ccaggtggtg ggagagtgat atggcttcct 540cttaaactta gggaattgag tgtgtggttg cttctaagtg ccttagaagc cgggagcggc 600tcctggaaag agcctgcctg ccacagcggg ccttaccctg gctgtgccca cagatgtccc 660tggggcctgc cgctcctgcc cggctctcct ggcctccccc ggtgtgggtt gggaaaagca 720cagcaaatta aaaaacacct ccatctctgg cctttgaaga atgcatctga acagccgaga 780gtgtaaaccg tggtgaaatg tggtctttcc agtttgggga gaagcagggc agagctgggg 840cttttgtacc cagggtttcc aagagctcct gcctccctcg gctgggctgg ccagggcccc 900ccgctgggac ctccagctgt aatagggaag gttttactgg gttgctggcc actgtggact 960gcccctaagg gcaggtatgc ctgcctttac ccgggttccc ctcctgcctg gaagatacag 1020cccatgggag gcctgttgtc tgtgggatcc tccagcatca gagacactgg ggccagcgtc 1080tgcctggtga ggtgcaggcc tggcaggccc ggtcccccac ctgcttgagc acccacggtg 1140gtgggggctc gctgcctccc gagacaatct atgtcattgt tgtccaagga agctaattta 1200gagtagaaag ttccgtgtcc agtcccactc tgtgcgtgtg ttagcagggg actctcgggc 1260cggagctggg tccaccctgg tagggggact tcatggggcc tgggcgacag cactgtgtat 1320ttgtgtgtgt gtgtgtttgt gtgtgtgtgt gtctgaggag gtggaccagt ttctcaaaag 1380gcctgtgacc ccaagaacca aggaatttca gcctgggtgg atcacacctt cactggtgag 1440tgggacaagc tgggggccct cgccacagga gcagccaggg catggggcac agttggcctc 1500attcacaaaa tgggagtata agtgatccct gctctggcgg ccaggacgat gagtgggaac 1560acaccgtgtg ggggctgcct ggcctgggtg tgccgcgggt gtccttgttg gtgatggttc 1620cacctgcttg tgccaccagt gccctctggg tctcacacac aactctcttc ccagcgaagg 1680cccctcctgc cctcaggcct cagtgctgct tccgtctcgg aaggccccag gagctcctgc 1740atcctgggcg tgattcctgt gtgcctgcag accccctcgc ggctgccatc tcatcctttg 1800gtgcacctgt tggccagacc tcctggtagc gggtgctgca ctcccctgaa tgtgccgggg 1860cctgggggca gggacctggg ctcctccctc actgagtgga gggaactcag tgtcttggag 1920ttggggtgcc tgcaggctgg gtggtgcagg tgaaatgcag acctctcagc tggtgttcca 1980gagcagctgc cttcccccgc ccgagggact tcacccgcag cccagtcagg ggtggcgcct 2040gggtgcatcg cccgcaggct gggtaggggt ggagcctggg tggccctgcc tgtgagctgc 2100atagttgtcg cctttgaccc tgagttttct tcgttatctg tttggacctg tttggggcag 2160gcaggggatg agatctgaag ataaatgcct tagctgtgac catctccttt tgtgagaggt 2220caatgtccag ttccgctgca gttataacat cccatttttt gatttctttt tattttttcc 2280tttttctttt tgagatggag tctcgctctg tcacccaggc tggagtgcaa tggggtgacc 2340tcagctcact gcaacctcca cttctcgggt tcaagtgatt ctcctgcctc agcctcctga 2400ctagcagggg ttacaggcgt gagccaccac gcccagctaa tttttgtatt tttagtagag 2460gcaaggtttc gtcatgttgg ccaggctggt ctcaaactcc tggccttaag tgatctgccc 2520gcctcggcct cccaaagtgc tgagatgaca ggtgtgagcc accgtgcccg gcccagaact 2580ctttaattcc cacctgaaac ttgccgcctt aagcaggtcc ccagtctccc tcccctagtc 2640cctggtccca ccattctgct ttctgtctca atgaatttgc ctaccgtaag tacctcatat 2700aaattgaatc ataaagtatt tgtcttttta tatctggctt atttcactta gcataacatt 2760cttaagtttc atccatgttg tagcatgtgt cagaatctct ctcttttttt tttttttttt 2820tttttttttt ttttgcagac agagtctcgc tctgtcatct agactggagt tcagtggcac 2880gatctcggtt cactgcaaca tctgcctcct gggtccaagc aattctcctg cctcagcctc 2940cttagcagct ggaactacag gcgcgtgcca ccatgccttg ctaatttttg tatttttatg 3000tggaggcagg gtttcaccat cttggccagg ctggtctcga attcctggtc ttcaccacgg 3060gggcccgaag gacccgggca aagcgtggag gggagg 3096<210>6<211>26928<212>DNA<213>人(Homo sapiens)<220><221>未确定<222>(12044),(12489),(26433),(26434),(26435),(26436),(26439),(26441)<223>在上述位置的核苷酸序列特性未知。<400>6gaagaccaag ggcacacagc gaggcagttt cagggcgggc agcctggggc cccacggggc 60ggccccggac acttgttctc acctgtggag ggcagagaag ggaacaggga gagaagtggc 120cggctgggag tggaggtggg tttgaggttt tactgtaaac taaatgtgta ccctctacct 180tagttatgaa ttatgagaca cgaagactgc gaaacagaca cactcctcta aaagtgcctc 240taggctgaca gggagaaagt cccgccaggc tcccagacgc cacctttgag tccttcaaca 300agcccgccag ggcctcttgc ccaccggtgt cagctcagcc actgaaccct ccaggaagaa 360gacgtgctgg taggagaaga atctcaccca ggcacagcct ggaaggggca cagaaggggc 420tccggaacca gcaagcccaa gttggaactc ccagtctgct actttctaga acgactgtgc 480ccttggcggg tctaagtaga acctctccgc gcactctttc ctcctttgta aagtggggac 540agcaatggcc accttgcagg ttcagagagg gcttgcagta cctcacagaa ctgagtgccc 600gtgaacgtgt gtgttcctcc agatttgtga cagctttgcc aggctggagt caggctgaac 660gcctctgccc tcatggggtt tatattctag gaagaccaac aaaaacaaga agacggaaaa 720ttaaaacaac aaaagcccca ttgacaggcc gtgaagaatg ccatgaaaaa tgaatggcgt 780tgtgctgcag tctttgggga aacgggctta cggaaagaag gacacttgag ctgctaccaa 840tgagcagccg tccggtggga gggcagttca ggaagagcag acatccactg aggaggcgct 900ggggcagagg gcagcctggt cgctggattc gggggaggaa ccacatcagg ccatgagctg 960gagctggtgg tagaatgtac aggagaggcc agccagggcc agctcatgtc agacctcaag 1020cggggaagat gaatcgagaa tgcaccccac gagcaatggg aagccagtct acgatttaag 1080cagcaaaaat attttccctt cttccaccct gcatccagct ctaccagcac agcctggggt 1140tctattttca agatagaata gacccagact cccagctctt cttacacttc tactactgcc 1200acctgtcacc cactcatgcg tccccacttg cagcctcgac ccccttccac ctgatctcat 1260ggcagccagg gaagctccag ggctcgtgag ggctgccatc tcaggaaaga agcaaaagcc 1320ttcggcacct gcagggcctg ctccaaccac acttcttcct tgacctctca gcttccttag 1380ccactccctt cccacatctc accctgctcc agccacagtg gtgtctctgt gggttctcaa 1440acacaccagg tgcactcctg cctcagggcc tttgtgcttg ctgttctctg ctgggactct 1500tttttttttt tttttttttg agacagggtc tcactctgtg gcccaggctg gagtgtagtg 1560gtgtgatcgt agctcattgc aacctcaaac tcctgggctc aagcaatcct cccacctcag 1620cctctcaagt agttagcttt tgttgttttg ttttgagatg ggatctcact ctgttgccca 1680ggctggagtg cagtggggca atcttggctc accacaacct ctgcctccca ggctcaagca 1740attctcctgc ctcagcctcc caagtagctg ggattacagg catgtgccac cacgcccagc 1800ttatttttgt atttttagta gagacagggt ttcaccatgt tggtctggct ggtcttgaac 1860tcctggcctc agatgatcca cctgcctcgg cctcccaaag tgctgggatg acaggcatga 1920gcctgtctct agtagttagg actacagaga ggggccatca tgcctggtga tcctcccacc 1980ttttctgctc caactctttc accccactta gcctcgtggc tcactctctt acctcttcag 2040ctcctcagtc aggcctgagg acccctgttg aaaattgcaa accacacccc ccaccaccac 2100cacccactat tgccagcact ttctactcca tttctctgct ttacttttct cctttgtact 2160catcaccacc tgactcatta catgtttacg tatctttctt ctctccacta gcatggaagc 2220tccaggagag cagagagtgt agttttattc cctgatgtgt ttcctgtgcc cgtaccaggg 2280cctagcacac agtaggtgct cagtaaatgt gtgttggatg aacaaataca gtgaaaggat 2340ctgatctaca tttataaaga aggcactctg gctgctgagt ggggatgaga ctgtcaggag 2400gaaagaggcc cctgtggggg cctggccagc aggtgggtac aatggtagca gccaggagag 2460agggcctctt ggactcaagt ggatggggcc tgctcagggc tccggccaca ggaacaaagg 2520gaagggggcc caggatggcc tgtcatagag gacacattac aactggccca aagttcaagt 2580caggtttcta aatttgggaa gggatacaga aaaactaaag actctactgg acagtcagtt 2640attgaaatga ttacatagaa aatgtaccaa gaattaaaaa aaaaaaaaaa aagcattatg 2700aaggggccac cagagactcc cagagaggaa agggactatg ggctggatgc ggtgactcac 2760acctataatc ccagcacttt gggaggccga ggagggtgga tcacgaggtc aggagttcaa 2820aaccagccta ggcaacatgg taaaaccccc gtttctacta aaaatacaaa aaattagctg 2880ggcatggcag catgtgcctg taatcccagc tactcgggag gctgaggcag gagagttgct 2940agaacccagg aggcagaggt tgcagtgagc cgagattgag ccactatgct ccagcttggg 3000cgacagagca agactccgtc tctaaaaaaa agaaaaaaaa ggccagatga ggtggctcat 3060gcctgtaatc ccagcacttt gggaggccga ggtgggtgga tcacgaggtc aggagatcga 3120gaccatcctg gctaacatgg tgaaactcca tctctactta aaatacaaaa aattagccgg 3180gcgtggtggc gggcacctgt agtcccagct acttgggagg ctgaggcagg agaatggcgt 3240gaacctggga ggcggagctt gcagtgagcc gagattgcgc cactgcactc catccagcct 3300gggcgacaga gttagactcc gtctcaaaaa aaaaaaaaaa aaaaaaatta gctgattagt 3360tgggcttggt ggcgggcgcc tgtaatccca actactcggg aggctgaggc gggagaatca 3420cttgaacccg ggaggcagag gttgcaatga gccgatatca cgccactaca ctccagcctg 3480ggcgacagag caagactcca tctcaaaaaa gaaaaaaaaa aagaaagggg ctgtgctgtg 3540gcctgggacc caaagcacac tactgcaagg tcccagggtg cctgactcca accggagcct 3600tgagaacatt catttgcaaa gaatgaatta aaattcagca ctattttatt ctgcaggatt 3660ccagcacccc aaggacagtc atttttagac ccttcagtaa cgtaataagt aaccggagga 3720tgtgctgagc ttccacttcc ccagacggtt gcctgtcaca gctcatcagg ccaacaaact 3780tttcttaggc ctcaaatttg gaaatgttca ctctcagttc gttccttaga tgcaagtcca 3840tcccaatgaa gtaacagggg ctcagcacct gtccaatctc attgcttccg gggacagggg 3900cccatgagga tgtcgtttca gcccggtgac acttgggcaa agtgcctttt ggtttccctc 3960ccaggctgga acgtgctggc tctgtgaagt tacgctgggc acaagagccc cccccaaccc 4020ggcaggactg actgctgtgg tcagaggcgc ccctggggct ttgggagcca cagaatcttc 4080ctgagggcag cgccggagga ggccccagtg agagtgccca ctgccaggct cattcctcag 4140gctgccgcag gcctctcccc aaaacaggca atgcttctca gcaacctgcc ccaggagcag 4200gccagggaag gccgccatcg gcctacagtg ctgggctctg gagggcttgg ttggtaacag 4260gccatggttt ctatgagcca gctggggtgt gaaggacaca ggctggattc acctctctgg 4320gcctcagttt ctgcattcaa aaagtgggaa tcatgatatc tgctctattt cttatctctc 4380agtgctgatg tgaacctcca ataagacttt taaaaatact ctttctacct tacttttatt 4440tttcatttat tttaagataa tgtctagctg tctcacccag gctggagtgc agtggtgtga 4500ttacggctca ctacagcctt aacctcccag gctcaagtga tcctcctacc acagcctccc 4560aagtagctgg aactacaggc atgcaccacc gcacctggat aattttttct tttgagacaa 4620ggtttcactc tgttgcccag gctggagtgc agtggtgcac tcttggctca ctgcagcctc 4680aacctccctg ggcttaggtg atcctcacac ttcagtctcc caagtagctg ggactacagg 4740tatgtgccag tacacccagc taatattttt gaaggatggg gtttcactat attgcccagg 4800ctggtcttga actccagggt ttaagcaatc taccttcctc agcctgccaa agtgctagga 4860ttataggtat gagccacccc ccggcctata atcctaccac tttaaaaaag cctgtaattt 4920tagcacttta aaaaattttt ctaaattttt tatagagatg ggggacagct gtggtctcac 4980tgtgttgccc aggctggtct tgaactccta ggatcaagcc atcctcctgg cctggcctcc 5040caaagtgttg ggattataag cataagcctt accttacctt ttttttttga gttgcagttt 5100tgttcttgtt gctcaggctg gagtgcaatg gcaagatctt ggctcactgc aacctccacc 5160tcccgggttc aagcaattct cctgcctcag cctcccgagt agctgggatt acaggcatgc 5220gccaccacac ccagctaatt ttgtattttt agtagagatg gggtttctct atatacctta 5280attttaaagc actgcattca tgtaaattgt gattaacatg gattcaagag agggagtgag 5340gatgaatgag ccaggcagtc acctcggctg tcaccctcca cttctctcct ccttctgaca 5400gtcatcgtcc atccgtttct gcagctgttt gtttgactct cctgatcatt ttgcttgcca 5460cataacttgc ctcctgggaa agaatgccct gggcaggccc acatgagtag tgaaaaataa 5520tctgcagtga aaaataaaac taagtagtct ggtccacaga gcagtcttat tttttcactg 5580cagatgaagg agttgacatt caggcttcat tctcatttat aagtgtttta aagacacata 5640cagtggattg aacagtggcc ttcaaaaaga tgtatctaca tcctaatccc tgggacctgt 5700gaatgttaac caagttagga aaagggtctt cccgggtgtc attaagttag agatcttgag 5760atgaggagct catcgtggat tatccaggtg gaccctgcat ccaaggacaa atggtcctta 5820gaaaagaaaa gcagaggctg ggcacagtgg ctcaagcctg taatcccagc actttgagag 5880gccgaggtgg gtggatcacc taaggtcatg agttcgagag cagcctggcc aacatgatga 5940aatcccatct ctactaaaaa tacaaaaatt agcaaggcat ggtggcgggt gcctataatc 6000ccagctactc aggaagctga ggcaggagaa tggcttgcac ctgggaggcg gaggttgcag 6060tgagccaaga tcgcgccact gcactccagc ctgagggaga aaagtgaaac tctgtctcat 6120aaaagaaaag aaaagcagac agagatctga gacagaagag gagagtgaag gaaaaaaggc 6180catgtgaaga tgaggcagag gttggagcca tgcagccaca agccaaggaa tacctggagc 6240cccagaagtt gcaagaggta ggaagaagcc tcccctagag cctccagacg gagcacagcc 6300ctgccaacac ctccacctca gacttctggc ctccagcact gtgagataat caactgctgt 6360tgttttaagc caccagattt gtggtaattt gttatggcag ccacaggaaa ctaatacagt 6420acctaatctt cacaaaccca tcttacagaa aaggaaactg aagtcagaga ggtagtggct 6480tgtgcagtgt gttaggccat tcttgtatta ctataaagaa atacctgagg ccgggcatgg 6540tggctcacgc ctgtaatccc agcactttgg gaggccaagg tgagtggatc acttgaggtc 6600aggagttcaa gaccagcctg gacaacatgg tgaaacccca tttctactga aaatatgaaa 6660attagccagg catggtggcg tgcatctgta gtcccagcta ctcaggaggc tgaggcagga 6720gaatcacttg cgcccgggag gaggaggttg tagtgagcca agattgtgcc actgcactcc 6780agcctgggag acaagagaga aaccctgtct caaaataaat aaaaaacaaa taaacacctg 6840agactgggta gtttataaag aaaggggtta actggctccc ggttctgcag gctgtacaag 6900catggtgccg gcatctgctt ggttgctggg aaggcttcag ggagttttac tcatcgtgga 6960aggcagagcc agagcaggtg catcacacag caaaagcagg agcgagagag agagagagca 7020gggaggtgtg cacactttta aatgagcaga tctcacgaga actcaccatt gcaaggacag 7080caccaagcca cgaggggtct gcccccatga cccaaacctc ccactaggcc ccacccccaa 7140cattgggaat tacagttcaa catgaggttt ggggggacaa atatccaaac tatatcattc 7200cacccctggc cccccagatc tcatgttctt ctcacattgc aaaatatagt catgccttcc 7260cagtagcccc ccaaagtctt aactcatccc agcattaact caaaaatccc attcccaagt 7320ccaacgtctc atctgaagat gagttccttt cacctacaag actgtaaaaa tgaaaacagt 7380tatttactgc tgagatacaa tgggggcata ggcattaggt aaacattcct gttccaaaag 7440ggagaaatcg gtcaaaagaa aggggctata ggccccaagc aagtccaaaa cccagcagag 7500caatcattca atcttaaagc tccaaaataa cctccttaaa ctccatgtcc catagccagg 7560gcacactggt gcaaggggca ggctcccaag gccttgggca gctctattcc tgcggctttg 7620cagaattcag tccccatggc tgctcttaca gattggagat gagggcctgc ggcttttcca 7680ggtgcagggt gcaagctgct ggtgatctac cattctgggg tgtggatggt ggcggccccg 7740tcccgcagct ccactaggca ttgtcccagt ggggactcta tgtggggcct ccaaccccac 7800atttcccctc caatgggaag gctctgcccc tgcagcagcc ttcttcctgg gctcccaggc 7860tttctcatac atcctctgac atctaggtgg atggtgtcaa gcttccttca ctcttgcact 7920ctgcacacct acaggcttaa caccacatgg aagctgccaa ggtgtatggc tggaaccctc 7980tgaagcagca gcctgagctg tgactatggc cctttgagcc aaggctggag ctggaacagt 8040ctagatgcag gcagggagca gtgtcctgag gctgtgcaga gcagcagggc cctgtgcctg 8100gacaatgaaa ccattctttc ctcctcatcc tctgggcctg tgatgggagg gttgtggaag 8160atctctgaaa tgcctttgag gcctttttgc ctctgaggcc tatttcctat tgtctcagtt 8220attggcagtc ggctcctttt tagttatgca aatcctctag caagaggtta ctccactgcc 8280ggcttgaact cctctcctga aaaagctttt tctttctttg tcacatggcc aggctgcaaa 8340ttttccaaac ttttatgctc tgttttacct ttaaatataa cttctaactt taattcattt 8400atttgctcct gcatttgagc atagggaatt caaagaagct gggccacatc ttgaatgctt 8460tgctgcttca aaatttatgg ccacgcttgg tggctcacac ctgtaatccc agcactttgg 8520gaggcctagg tgggcagatc acgagatcag gagatcgaga ccatcctggt caacatggtg 8580aaacccatct ctactaaaaa tacaaaaaaa ttagcttggt gtggtggcgc agacctgtag 8640tcccagctac tggagaggct gaggcaggag aattacttga acctgggagg cagaggttgc 8700agtgagccca gatcatgcca ctgcactcca gcctggtgac agaataagat ttgatctcga 8760aaggaaggaa ggaaggagga agggaagaaa tgtcttcccc ccagatgtcc tgggtcatcc 8820ctcttatgtt caaacttcaa cagatcccta gggcatgaaa ataatacagc caaattattt 8880gctaaggcat aacgaaagtg acctttgctc cagttcccaa taagttcctc atttccatct 8940gagactcatc accctggcct tggcttgtcc atatcactgt cagcattttg gtcacaatca 9000tttaaccagc taatcgggag gctgaggcaa gaggatcact tgaacccagg aggttgaggc 9060tgcagtgagc tgtgatcaca tcactgcagt ccagcttggg caacagagca agatcctgtc 9120tcaataaata aataaataaa tacataaata acttaagttt atttaaagct gcatctttgc 9180caccatggag aaaggccagg ccagctcctt ctctctttct gcacgtgttc ctcccacctc 9240agctgcctct gctcctcaag gaggaacaga gggagtagga aaggccatcc caggaggccc 9300agcaccccat gacctggctc tggggccttg tgggtttatg gattcccagt gctgagtcat 9360ccctcacagg ctcttgtggg caccttggac attggtcaga agcatgtggt ccccgggaac 9420acaccttttc ctgatcatct gggaagggca gcttgtgcca gcgaggccac ctgttcagcg 9480ccacggcccg ccagacagct gcagccacag ccttgccttt gatcagagca aacaccagac 9540atgtgtgtca tgcccccaac ccatctccag gggacacatg tcctttcttg ccaggcctga 9600gatgaacaag agagggacaa gtccccaagc ctctctctcc ttcctgcctc acccactccg 9660ctgttagatt ctcaaggtgg atggtgggct aactagggca accgaccatc ctggtttacc 9720tagaactgag ggggcatttt caggaataaa actgcaaaag tctggagcaa acaggagcaa 9780gttggtcact ctggggctgg tggagtcagg tttccttctg caggccccct ccccgcaagc 9840atgggtggaa cccaggacag gaacacagag caggccccag gaccgggctt gtcacttaca 9900agtctttttt tttttttttt ttttgagatg gagtcttgct ctgtcatcag ggctggagta 9960cagtggtgcc atcttagctc actgcaacct ctgccttctg ggttcaagtg atccccctgc 10020ctcagcctcc tgagtagctg ggactacagg tggcaccacc acgcccagct aattttttgt 10080atttctagta gagatgagat ggccaggctg gtcttgaact cctgacctca agtgatctgc 10140ccgccttggc ctcccaaagt gctgggatta caggtgtgag ccactgtgcc tggccccact 10200cacaagtctt aaaccatgcc tcagcacatc aatgccattt acaaaaaggt agagggattt 10260tccaggcaaa aatagatgaa agacatagga tgattgatca tgtcctgctt aaacataggt 10320ctgatgctat taagaattga gggctgggag cggtggctca cgcctgtaat cccagcactt 10380tgggaggccg aggcgggcgg atcacgaggt caggagatcg agaccatcct ggctaacacg 10440gtgaaacccc atctctacta aaaatacaaa aaatggccgc gcgcggtgac tcacgcctgt 10500aatcccagca ctttgggagg ccaaggcggg cggatcacga ggtcaggaga tcgagaccat 10560cctggctaac acagtgaagc cccgtctcta ctaaaaaata caaaaaaaat tagccaggca 10620tggtggcggg cgcctgtagt cccagcaact tgggaggctg aggcaggaga agaatggtgt 10680gaacctggga ggtggagctt ccagtgagcc gagatcacac cactgcactc cagcctgggc 10740gacagagtga aactccatct caaaaaaaaa ataaataaat aaataagaat tgttagtatt 10800ttgcaggtgt gacaaatgat tctgtttctg tggcagaatg ttctcaggag atctcttttg 10860aactctcatg gaaagcatca tgctgttggc aacatcacat ttatttttat ttatttatta 10920ttttttagag acagggtctt gctctgttgc ccaggctgga gtgcagtggc acaatcacag 10980ctcactgcag cctcaacctc ctgggctcaa gcaatcctcc tgcctcagcc tcccaaagta 11040gctgggacca caggcgtgag ccactgcact cagcccaatg taccttcaat atttacattt 11100ctggcaaagg tagcaaaacc ttaacaaatt ttgaatctag ataataaaat tatgaggctg 11160ggtgcagtgg ccctgacagg gatggctcac atctgtaatc tcaacatttt gggaggccaa 11220ggtaggcgga tcacctgagg ccaggagttt gagaccagcc tggccaacat ggtgtaaccc 11280tgtctctaac aaaaatacaa aaaaattagc cagacgtggt ggtgcacgtc tgtcatccca 11340gctactaggg aggctgaggc aggagaattg cttgaacccg agaggcagag gttgtgatga 11400gccgagatcg cgtcattgca ctccagcctg ggcaaaagca agagcgaaac tctctctcca 11460aaaaataaaa aaaaaataaa ttaatgaatt aattaaaata aaataaaata atggatagtc 11520actgtaaaga aaaaataaat gtatatatca gccaacaagt gatggaatag agcaccccat 11580ctccctggct ggacagatac atcccacaac acctggaagg cggctccatg tagaactttc 11640tggactgctt gaggtgctgt gctggagcac ggtgacagag gagctggacc atggacctcc 11700cccggccccc accaagggcg aggtccccct gtggtgggtc tgagggaggc atccgtatgg 11760cctctgcggc ttgggcaggg aatttggggt ccaagtactt ggtgcaaagc ctggaaagag 11820ggtttgggtg ctgagggcat atcccctggg ccacatgggg gcagaagtgg ggccccctga 11880agcttggagt cctgggcagg ggcatctatt ttgctgtctg aggccttcag tacttgaagc 11940aaaatggagg cagaatgtcc caccttaatg cccctgattc ctccaaacca attccagaga 12000cagcaagggc cagaacaggg atggccctgc ccagggtcat gcancgagga agtggccagg 12060ctgggatctg aacccaggct aatcccctcc cttgtcctcc tccaggccct cacccctgca 12120tagagccctc cagctcactc atcctcggcc agctccatct cctcagcttg taaacccccc 12180cgggattttc ctttcttaaa aaacaaaggc ttggccaggc acggtggctc acgcctgtac 12240tttgggggtg gctcccagca ctttgggagg ccaaggtggg cggatcatga ggtcaagaga 12300ttgagaccat tctggccagc atggtgaaac cctgtattta ctaaaaaaaa aaaaattaac 12360tgggcatggt ggctagctac ttaggaggct gaggcaggag aatcgcttga acctgggaga 12420aagaggttgc agtgagccaa gatcgcgcca ctccacttta acctggcaac agaacaagat 12480tccgtttcna aaaacaaaca aacaaacaaa taaacaaaaa aaggcggagc gcgatggctc 12540gcgcctgcaa tcccagcact ttgggaggct gaggcgggcg gatcacttga ggttaggagt 12600ttgagaccag cttggccaac atggtgaaac cccatttcca ctaaaagtac aaaaatcagc 12660caggtgtggt ggtgggtgcc tgtaatccca gctactcagg aggctgaggc aggagaatcg 12720cttgaaccca tgacctggag gctacagtga gctgagattg cgccactgta ctccagcttg 12780ggcaacaaga tttgtttctc taaaaaaaaa aaaaaaaaga ctggcccttc cccttcagct 12840cttcctcagg gtccctgagc actctacacc cccgtctaca ctgagcactc caccctgctg 12900tctacactga gcactccacc ctgccatcta cactgaggac tccaccccac tgtctacact 12960ggctgcctcc cgccctcacc tcctgctaag gccattcccc gctgcatctg tcttctagat 13020tctgcagcct tcagcacgct gggcccctcc tttgtcccct tgagccacct ccagcctccc 13080cctgagctgc tactcctctc ccagcagcct ccacccaagc ccctccagtc cccaagctgt 13140cccttgcatc cagcactgcc cttccacgtg ccccttccct ccagcttcac agcagggtgg 13200ggcctccagg ccctgcccac tgtgcccatc cacaagttgt ggtgggagct ccgaggggag 13260gcaggggtgt gcatggactt gggacgtcca agtctgggac caggggcagc tggttggtgg 13320agtgtggagg gggataggga ctttcaggta gagaggctgt aggggcaaga tcgggacggc 13380ggatgtccct aaggagggct ctgacctggg aaatattgtg cagcttcctc tttgccattc 13440ctggagctca gacactggcc ggctctcacc ccgcccttcc tgcaggacac agctccatcc 13500cagtgagttc ctagtgtaga catctccagc agcacggatg ggaaaggaag tcatcaaagg 13560tgcccaggac cggaggcttt ttctggaggt ggcagaggag ggtgtgggtc tcagggctct 13620ggctgagggc aagcgtggga ggtcttaggt ctgcaccagc cccgtgaagg cccctcctgc 13680tccctggtgg agtcctagag ggaacagcag cccctaggct ctagcaggag tgggtagggg 13740cttttctggc ttcctactgt gccagcagga tagctgggcc tggcactgag cccaaagatc 13800acatgccggg gcattggcgc agtgaggaac agacccttgc caaagctggc aaagaagacc 13860ccatggggtg cagctggtga agctgagagc tcaatgtttg ggggagcctg gcaaaagggg 13920tcctcccctc cctctgcagg ccaggatcgc aggttttccc tacatgttgg taattctcaa 13980acaatcccat ggccactgga gcaaagatca cagtgggcgg cggcctcggg agcagtggac 14040agggcacgca gtgcctttga tgccagagcc ctcgccccaa agtcaacaaa ctctgcagcg 14100gactttgcac ccggactttg ttttcaccat acaaggaaag ggacagatca caggccctct 14160cgctgccctc gctgagccgg aagctgcagc gtgagctctc tcaagcccca tttctaggtt 14220ccccaggcgc acccctgagc ccctactcgc ctattaagtt ctcctaatag cccttcaagg 14280tcttaatgta tgtccattag acagagggga aaactgaggc gagggcaagt gacttgaccg 14340aggttcctcg gcgagcaggg cgtggagctg agaacctcgt tattactgct ccccacacaa 14400ccctctggcc gttcttggaa gaaggctgag ccccgggggg gccagagtga cccaaacacc 14460atgggccgcc tgcggtaaca cgtgcggcca cgaaggggca gcagtttccc gcccggccgg 14520gctctctccg gcgctcagta tccgtcccag gccaagaaga agaaactcgg ggaggagggc 14580ggagggggct gcgtgggagg gcgtggaaga tggacgtggc caggggagtg gcagctgcac 14640acagtggatg ctgttaagat gaagggaaag aacgtgggct ccgagatcac tggacacggt 14700tccacctttc ttcccgctca ctgcatggcc ctgggcgggt tgttgaaccc ttggaaacct 14760gtttttcctt ttttcctttt tttttgagac agggtcttgc tctgtggccc agactggagt 14820gccgtggcac gatcttggct cactgctgcc tcccaggttc aagtgatcct cccagctcag 14880cctcctgcgt agctgggacc ccaggtatgt gtcaccacag ccggctaatt tttgtatttt 14940tttgtagaga cgggatttcg ccgtattgcc caggctggtc tcaaactcct gagttcaccg 15000gatcttcctg cctcagcctc ccaaagtgct gggattactg gcatgagcca ccgcacccag 15060cagagacctc agttttctaa cctgtgccag caggaataat gatagctgcc tagcttggct 15120gtgctgggaa ttaagtaaga tgaccgggta gcaaatatga agtattactg gacacagagg 15180gccccaggct gggttagcag cggtggtcag ggctgctgct tcctggcctg agctcgaagg 15240agggccctca ttaccacctg ggtgagtcct cgtccaagcc tggcactgct gcgtgggaat 15300aacttctgcc acccaagttg gcagattgtg tgcaaagtta agtcctgact ctgtggggtg 15360gacttcgagg cctcttcatc ggacctgctt ccggtgactg cattcgcacc tcctcctgtt 15420cctggtttaa cacagcccag ctttcctcct gctgagccct ccctgggcct gctgtcaccc 15480tcgtgccgct gtgcctcgca gtgccactcc ctgtaccctg aatactttgc cctgcctctc 15540cacccagctg agagtcaggg cccctgtgag gctctgccca gcccgtcctc cgggtttctg 15600cctctgctga gcacttccct gcatgattgc ttctgagagt ccccccagcc tgtgagcttc 15660tcaggactgg gacagcttct caggaccgag gcttcctggt ctgcttgcaa ttttacaggc 15720gggcacattt tcccttggcc aacatcagag actggacatc tgcagatctg tgctagccac 15780tgagcaccca ggcaccccag caggtagctc tgtaaccaac ccattctgta aagctgaggc 15840tcagagaggt gaagcgcctg gcctggggcc acagcctgcg tcagctgcag agccaggagc 15900tgagatatgc acctgcggct ctgctcacag ggtcctgcac agactgctgc tggagccacc 15960tatgtagagt caagagagtt catgttaact ccctctcaca tccctcagcc agggtggggg 16020ctgacgatag acactcaggg atggcctacc ctccccaaca acccccgtca ggtttgccgg 16080atctccttgg aagaaaagtt ctgggcagaa ttccaccgtt ggcctggcct acactctcct 16140tagtggctta ggaccctcag cggtggataa gttgtgggca gaagagatgc aatcaggatt 16200ctcacccact caccccttgc cagccccaat aagctcaata agctgggctc ggtctgagga 16260agtgtccagg aaatgtgcaa atggcctggg acagccctgt gttcctttca gtaaggttgc 16320tgaaggtgag gctgaaagtt ggagaaacag aagccagtgc ttatggtttt aattaagata 16380atggaatgta tgtatgtatg tatgtatgta tgtatgtatt tatgtattta tctttagaga 16440tagagtctca ctctgttgcc caggctggaa tgcggtgaca caatcatagc tccttgcagc 16500ctcgacttcc tatgcccaaa tgatcctcct acctcagcct cctgagtagc tgggactaca 16560gacacacgcc aactatgcct agctaatttt tatttctatt ttttgtggag actgggttct 16620cactttgttg cccaggctgg tcttgaaccc ctagcttcaa gcaatcctcc tgcctcagcc 16680tcccaaagtg gagggattac aggtgtgagc caccacacct ggcctggaat ttatttgtat 16740tctgcttata aaattaatac attcttattg cagaaaagtt tgaaaataaa agaaaggaca 16800aagaacaaaa agcgtatata atttcacagc tcagatctca ctgctattaa catttttatt 16860tactttcagg cttttttctt tctaggtaca tatgcagaga ttattttatt ttatttattt 16920tattttatat tttattttat attttttatt tcattatttt attttatttt attttattat 16980ttttagagac agggcctcac tctgtcaccc aggctggagt acaatggagt gatcatagct 17040cactgcagcc tcaaacacct gggctcaagc aatcccccca ctcagccttc tgagtagttg 17100ggactaaagt gtgagtctgg ctaatttttt ttactttttg tattgacaga ggtctcacta 17160tgttgcccag gctgatctca aactcctggg ttcaagcgat cctcccacct tggactccca 17220aagtgctggg attacaggca tgagccacca tgcctggcct aaaatgccac tttttgtcat 17280ttactaaaat cccatggaca ctttgacatg tctgtattct atgctattga tctgactgtt 17340ggcatctaca tcattatggc catctatcat ctatcataat ccattttaac attaaaattg 17400tgctgctgct tagatttttc tggcctgtct cctatttgta ttcttccaga taaattttag 17460aatcatttta tcaaattccc cttgcagaaa aagccctatt ggattttggt tgaaaaatac 17520tgaattttta cattaactta ggaaagggct gggcacggtg gctcacgcct gtaatcccta 17580cacttttcga ggccaaggca ggtggatcac ttgaggttgg gagtttgaga ccagcctggc 17640caacatggtg aaactcggtc tttactaaaa atacaaaaat tgccaggcgc attggctcac 17700ctgtaatccc agcactttgg gaggccgagg tgggtggatc acgaggtcag gagatagaga 17760ccatcctggc taacacggtg caaccccgtc tctcctaaaa atacaaaaaa ttagccaggc 17820gtggtggtgg gcgcctgtgg tctcagctac ttaggaggct gaggcaggag aatggtgtga 17880acccaggagg cggagcttgc agtgagccaa gatcgcgcca ctgcactcca gcctgggcga 17940cagagtgaga ctccatctca aaaaaaaata ataataataa tacaaaaatt agccgggggt 18000cgtggcgtgc acctataatc ccagttactt gggaggctga ggcaggagaa tcgcttgaat 18060ccaggaggtg gaggttgcaa tgagcagaga tcgtgccact gtactccagc ctgggtgaca 18120gagtgacact ctgtgaaaaa aaaaaaaaaa ttctgaagga ttgagactct tagactctta 18180ggtcttccta tccaagagca caatatagct tttcatgtat tcaagccttt ttcaatgcat 18240caacagaatt ttacagtttt tttcatgata tcctgctatt tcttataaaa tgtattccta 18300gatattctgc atgttttccg gttgtttgtt aataaatatt tttcatttgt cattatttcc 18360taattggctg ttatttgtat atatgacatc tgttgaattt tttgattact ttgaaaatgg 18420ccattctttt gtgttttttt ttaactttct attttgagat aattttgact tacagaagat 18480ttgcaaaaat agtacagaga gttcctgttt cccccttatg ttaacccagt ttctccttat 18540gttaacatct tacataacta cagaacaatt gtcaaatcta agaatcaacc tgggcacaat 18600gctattaact aaactgcaga agctgttcag atctcaccag ttcttctact gctccccttt 18660tctcttccag tgttcaatcc ggaatcctac attatattta gttgtcattt ctctttggtg 18720tcttccaatc tgtgacagtt cctcagtctt tctttgtctt tcatgacttt cattttttta 18780tacttttgaa aaatactggc cggttgtttt gtagaacgcc ctcagtttgg gtttgcctga 18840agttttttgt gattagatcg aggtcatgca ttattggaga gggtgccacc gcctcgatgt 18900gcaagctcaa tgcatcatat cagagggttt gtaatgtcag tttataccgc cggagaccct 18960aacctggagc atttcgtgaa ggtgctgtct gccaggattc tccactagaa agttactatt 19020tttccctttt taattactga atgtctgagg ggaaatactt tgagactatg caaatatcct 19080gtttctgctt taacttcggc tcactaagtt tagcattcat ctatggatct cgcttatagc 19140aagtattact gtggagttct aatggtaatt ttctgtttct ctcattcctt caacctttat 19200taatatgctt cttcctcact tattcatttt gtttcagttg tttataccaa catggatttg 19260tggatattgg ttttattctt tgggttgcaa ttgaatccta tcattatttt gttagtcagt 19320tgttccatcc gaccttggtc attaggagcc cttgaaattt ggctcccatg cctttttttt 19380tttttttgag accgagtctc actctgtcac ccaggtttga gtgcagtggc atgatcttgg 19440cttcctgcaa cctccgcctc ccaggttcaa gcaattctcc tgcctcagcc tcctgagtag 19500ctggtattat aggcgctcca ccaccttgcc cggctaattt tttgtatttt tagtagagat 19560ggggttttat tatgttggcc aggctggtct caaactcctg acctcaggtg atctgcccgc 19620ctcggcctcc caaagtgctg ggactacagg cgtgagccac cacacctggc ctcctatgcc 19680attttaacat gcccgtcttt tctttttctt tcctactttc tgtgactgta agaagctcca 19740ggatacattt ttgctgccct agacttagcc tcaatcagtt ctcagaaaag ctctggttct 19800ttttatggga tacttagaaa actagctctg tatggcctgg cgcggtggct cacgcctgta 19860atcccagtac tttgggaggc cgaggtgggc agatcacaga tcacgaagtc aggagatcaa 19920gaccatcctg gctaacatgg tgaaactctg tctctactaa acatacaaaa aattagtcca 19980ggcgcggtgg cgggcgcctg tagtcccagc tactcaggag gctgaggcag gagaacggca 20040tgaacccggg aggcggagct tgcagtgagc cgagatcggc agccactgca ctccagcctg 20100ggccacagag cgagactccg tctcaaaaaa aaaaaaagga aaaagaaaaa agaaaactag 20160ctctgtatgc tagttttttt tttaagacag ggtctctctt gccccagctg gagtgtagca 20220gcacgatcac agctcactgt agcctcaacc ttctgggctc aagcaatcct cctgcctcag 20280tctcctaagt agctgggtct acaggcatgc accaccgtac gtggcaattt ttaaaaactg 20340tttgtagaga tggagtctcc ctatgttgcc tggtctggaa ctcctggcct caagtgatcc 20400tcctgcctcg gcctcccaaa gtgctgagat tacaggcatg agccactgta cctggcctgg 20460ccaaggtctg tcttttttta aaagaagttg ttgtatagtt gttttttttt ttattttttt 20520ttctgagacg gagtctcgct ctgtcgccca ggctggagtg cagtggtgcg atctcggctc 20580actgcaagct ccgcctccca ggttcacgcc attctcctgc ctcagcctcc cgagtagctg 20640ggcctacagg cgcccgctac cacgcccggc taattttttg catttttagt agagacgggg 20700tttcaccgtg ttagccagga tggtctcgat ctcctgacct cgtgatccgc ccgcctcggc 20760ctcccaaagt gctgggatta caggcgtgag ccaccgcgcc cggcctgttg tatagttttt 20820atctcgagtt ttctagcgat ttaatcatat tggttacaaa aaaggatgat tttactacct 20880cctttccaat gtttctacat attttttcat tttatctaac tgcattttaa aataaacttt 20940taattttaga atggtttcat atttacagaa aatgtgcaaa gatagtacag agagttcctg 21000tgtactccac acccggtttc cttattatta tcttaacgtg atacacaatt aataaaccag 21060taacattatt attcactgaa gtccacactt tctttttttt tttttctgag acggagtcta 21120cttctgtcac ccaggctgga gtgcagtggc gcaatctcgg ctcactgcaa cctccacctc 21180ctgggttcag gcaattctgt ggctcagcat cccaagtagc tgggaataca ggtgcccgcc 21240accacgcccg gctaattttt tgtattttta gtagagatgg ggtttcacca tgttagccag 21300gatggtcttg aactcctgac ctcgtgatct gcctgcctca gcctcccaaa gtgctgggat 21360tacaggcgtg agccaccgcg cccggcgtcc atactttctt tagatatcct tcctttttac 21420ctaacgtcct tcttctggtt caggatccca tccagaaagc aacattaccc ctcgccatca 21480cgtcttcaca ggctcccctt gacgggaaga gttcctcaga ctttccttgt ttttgttgac 21540cttgacagtt ttgaggagga ctggtatctt agtctgtttt gtgctgctat cacagactag 21600ctgagaccga tacatgatac atgaaaaaaa atgtattctt acagttgtgg aggctgggaa 21660gttcaagacg aagttgctgg ttggtttggt ctctggtttc aagatggcgc cttgctgctg 21720catcctctgg agaagaagaa tgcggtgtcc tctcactgca gaagatggaa gcgctaaaag 21780gaatgaactc cctttgccaa gccattttat aatgggcatt aatccacaaa ggatgaaacc 21840ctgagaaaca tcaagcttta aagcactggt tctcaacctt tttggtctca ggagcccttt 21900atactcttaa aacgttttga ggatcccaaa aaaaggcttc tacaggttcc atcttttaat 21960atttaccata tcaaaaatta aactgaaaaa attttaaatt atttattcat ttaaaataac 22020aaggataaac ccattacatg ctaacataaa tcatgtattt tatgaaaaat agctatattt 22080atcaaaacaa aaattagtga gaagagtggc atgtataatt ttttttgttt attttttgtt 22140tttagatgga atcttattct gtcgcccagg ctggagtgca gtggtgtgat ctcggctcac 22200tgcaagctct gcctcccagg ttcacaccat tctcctgcct cagcctcctg agtagctggg 22260actgcaggtg cctgccacca cgcccggcta attttttgta tttttagtag agatggagtt 22320tcaccgtgtt agccaggatg gtcttgatct cctgaccttg tgatccaccc gcctcagcct 22380cccaaagtgc tgggattaca ggcttgagcc actgcgtctg gcctaaattt ttgtgaatgt 22440ctttaatgcc tgccttctca tatttgtttc tgcattcaag ttattgcaaa atgttgtgtt 22500ggttgaagtt tgtaaagaaa atgtggcctc atacagttgt gtagttggaa aggcaagagt 22560attttgattc tctcttcaaa caactatgga caacctgctg ttacaaaacc agaatgcaaa 22620aagttgtagt aaatacaggt taggtgtagt gtggaatctg aaagcatgtg aatgaacttt 22680ctgagttttg taacattaaa gtccagttgc gttaagctac tgtgatagca tatagcattg 22740tcctaatact ggaattagta tcagaagtgg ggtgctactg ttaataaata aaaagaaata 22800aataaatcat gtgatactgg ctcagaagtc aggcagtagg ctgtgtggaa cctgacatca 22860cgccatgtaa tacattggca accatttgat ccagctgtct gtcatgatga cttggaaagt 22920caaccacata cttacagagc ctgtagacat aggggaaaat agtataaaac agaatactaa 22980cagtggacct tggttcttgc cagttgcatt tagccaaata ttaaacaaaa gagatattct 23040tgggcagcaa ctggaccatc ttcaagtaaa agtgaaaggt aataaacaga gtccagacat 23100ttgtgcccat gcgggttaag aaaaatccag ttgcttctag acaccgtata tgaaaacaac 23160gctgaaaaca agcctttgag tggtaaaggc cgattaacac tcagcgcggt aacaaagacc 23220aggtgggcta acccgaaatg aaatgagaag cctgtggtga tgaggaggca gagaagtaaa 23280atcaagtttg agcatttcgt ttaggagagt ttgggctctg attacttgca catgcaaacg 23340aactggaaac aaacagatca gatgtctacc acttcttcga gggaattgca ttgccaaaga 23400agtcatgaaa gcagactcta tactgattag gcattaaaac aaaaacaatc tttaggcccc 23460taaacttgca tgggcaggaa gtgggctgtc aaagctgttc atcctctaag gtggacctag 23520ttcctagtcc ccagtataca cttcagatgt ggccctggag gacactggac atggaggacc 23580tcccagagga tgaggctagg gcttcatttc tccaatgacc tcagctgcct ctatttcccc 23640ttcttcctct ggaagtccta tcatcgttat tattattatt atcatcattt ttattttgag 23700ataaggtctc gctctgttgc ccaggctgga gtgcagtgac atgatcatgg ctcactgcag 23760ccctcccagg ctcaagtgat cctcctgcct cagcctcctg agtagctggg agtacaggca 23820catgccacca tgcttggcta tttttttttt cagtagagat agggctctca ctatgttgcc 23880agggctgatc tcaacctcct gggttcaaga gatcctccta cctcagctcc tgagtagctg 23940ggattcgggt gcacaccacc atgccaacta atttttaatt tttttttgta tggacaggat 24000gtacagtgtt agaaatggat tgcttgcaga ggcaggagga tcacttgagc ccaggagttt 24060gatcacactg tgaaccatga tcgcacccct gcactccaat ctgggcaaca gagtgagacc 24120ttgtctcaaa aaaaaaaaaa aagagagaga gagagagact caaagatagg caaaaaagtg 24180ggaaagcttt atagtggaca aaaaggaacg ctctaagtct gccctattgg catggtgctg 24240aaggtgggct aactagagat agggggtact atgtggttga ctatgggtgc atctttggct 24300ttccctgggt gatcctaagt tggaagcagg gacaaaaatt agggaagctg ttagttattc 24360atcacgttct ggcagtagtg gactggttgt gatagaagtt attgttttgg ccaggtgcgg 24420tggctcatgc ctgtaatcct agccctttca gagttcaacg tgggtggatc aggaaggagg 24480gaggatttgg gaggtcagga gttagcctgg ctaacctggc gaaatcccat ctctactaaa 24540aatacaaaaa ttagctgggc gtggtggtgc atgcctataa tcccagctac tcgggacgct 24600gaggcaggag aatcagttga acctggggag gcggaggttg cagtgagcca agatcgtgcc 24660caatttcatc tcaaaaaaaa aaaaaaagtt atcgtttagc ttcctcgatt gttactggac 24720gtagtaatct ggcttcctgc aagtctaact ttcagcagac tggctacatg ggctgtgtac 24780tgtagataag gcagtaagta aagcaaaaat tgatagagca tcaaggataa atagaaaatc 24840cgtaatcaag cagaagattt gaacacttca ctttcagtaa ctgataaaac aagtagacaa 24900aaaaaatcag taaggatgta gaagatttga acaacgtaat taacaaactt gacttgattt 24960acacgtctag aaccctgcag aacacacact ttttcaagca tactcagaac atttatataa 25020agtgaccata tggtggacca taaagcagtt tcaacaaatc tcacaggagt aaaataacag 25080accgtgtttt ctgaccgtaa gtacagttaa cctagaaatt gaaaacaaaa agctagaaaa 25140accccatgta tctggaaatt ttaatataca ctttgaaata acaaatggat cagagattaa 25200ttcaaatagg aatttagaaa taccttgaac tgaaaaataa tgagaatact ataccccaaa 25260actgtggggt gcagctgaac agtatataga cgaaaagtat actcatatgt gcatacctta 25320aggagcgggg aggattgaaa gttaatggga ggcaaaagca ggtggatcac ttgaggttag 25380gagttcaaga tcagcctggc taacagggtg aaaccccatc tctactaaaa atacaaaaaa 25440ttatccaggc gtagtgaggc tgaggcaaga gaatcgttgg aacccaggag gcagaggttg 25500cagtgagccg cgattgcgcc actgcacccc agcctgggag acagagcgag actccatctc 25560aagaaagaaa aaaaaaaaag aaaaggccag gcgcggtggc tcatgcctgt aatcccagca 25620ttttgggagg ccgaggtggg cggatcacga ggtcaggaga tcgagactat cctggctagc 25680acggtgaaac cccgcctcta ctaaaaatac aaaaaaatta gccaggcgtg gtggcgggtg 25740cctgtagtcc cagctactca ggaggctgag gcaggagaat gtcatgaacc caggaggcag 25800agcttgcagt gagccgagat cgcgccactg tactccagcc tgggcaacag agagagactc 25860tgtctcaaaa aaaaaaaaaa gttaatggga taaacatcca tctcaagaag ttagaaagga 25920atgacaaata aaccaaaaaa aaaaaaatca aaagaagaaa atcataaggt caagactata 25980aagagagtgg ctgggtgcag tggctcaggc ctgtaatctc agcattttgg gaagcagagg 26040tgggcagatc acttgagccc aggagttcaa gaccagcctg agtaacatag agagacctca 26100tctttgctga aaataaaaat aaaaaattag ccaggcatgg tggtactgag gtgggaggat 26160cacttgagcc taggaggttg aggctgcagt aagccatgat tgtgccactg cacttcagcc 26220tgggtgacag agtgggaccc tgtctctaaa aaactaaaat aaggctgggc gcggtggctc 26280aaatctgtaa tcccaccact ttgggaggcc aaggctgagg tcagcagttt gagaacagct 26340tggccaacaa gatgaaacct catctctact aaaaatacaa aaaattagtt gggtgtggtg 26400gcatgtgcct gtaatcccag ctacttagga ggnnnnctnt ngattatatt ttctccttcc 26460tacgtcgtta ttggactgaa ttcagaatga tgactctcat tggagctctt cctgtctcct 26520aactacagtg gcttccgacc ccactctggt tttcacttca cccctctgct gctcatacga 26580gtagatactt ccttccttct ttctcacttg ttgctcttcc tcaacccccc ccgttggtgt 26640cccctcctct ttatcttttt ctcgcgacac ctgcgttctc ttgccctctt atcatccctt 26700tctcgaggcg gtcctttcct ttatccagct taaatacctt ctcctctgtt tatttggggg 26760ttgggttttt atctctcacc ctccctctaa tttctttcct ctttccgcac ccatcaagcc 26820tctcgtggtt tctcttcctc tactctcggg tcccccccct ctccccttct ttttttcttc 26880acccccccaa gcgctttgcc ttttttttct ttgcccttta ttcccccc 26928<210>7<211>29430<212>DNA<213>人(Homo sapiens)<220><221>未确定<222>(4336),(4345),(4349),(4392),(4447),(4490)<223>在上述位置的核苷酸序列特性未知。<400>7aggggaaggg ccggctccgt agctcacacc tataatccca gcactttccg aggagagagg 60atcatctcag gccaggagtt caagaccagc ctgggcaaca cagcaagacc gcatctctac 120aaaaacttct tttaaagctt aaaaaaaaaa aaaaaagcaa agaggacagt tcaggagaaa 180agcctgtaga ggcagcacac taaggaggag acgcagccca ggcaccagga ggggctggcc 240atgggcactc actcctccag caggcgagtg cccagcacca gctggcccac ccagacaccc 300aggacacggc ctgaatggct ccgtattcac gtgggtggta ataaacaagc aatacacata 360gccaataagg acaccttagt aatgttacat cataaacgct gcagatcagg gaaatggtgc 420agggtgaagt gggttggggg gctgcatgct acatgagaag tgggtcgggg ggctgcatgc 480tacctgagac agagcaggcc ttgctgggaa agaaggagcc ggcaggcctg ggcaaaggtc 540ctggggtggg agcacactgg agcagagtgt gggggtagca tggcgggtgc tggtcctctg 600ggcgccttcc caccacgtca tgtgcccatg tgcccaaggt ctctcgtttc acagccccct 660gaagctcagg ggtcacagct acacagcccc cagatacctt ggcctgcccc aggtcattcc 720atccagtgat ggacctgctg acctctagcc tgacctctgg gcagcgtaat ttgagaagga 780ggagaaggga gggcaacaga cctggggcga tgagggatgc acagggtggc agacacctga 840ggctgcacct tggagcctca gttctgggtg tgggtggggg atggacaggc tgagggctga 900agcagctggg cccggccacc atcacacccc aggacccacc agatcaccat gaaaaaccga 960atgtcaactg gcagcccaga gtgcagaaca aacctttcag aaacacggtg gtgactgccg 1020catcatgaac ataaaataat tacgccctct ccccagggat cacccctgca ggagtttgtc 1080ccaagaaaca ccagaaagaa ggaaaacgtc tgagtcacaa tatttgctga ggccttattt 1140gtaatagcaa aaaaaaaaaa aaaaaaagaa caatctccag cggcaggggt aactagacta 1200ttgtctccgt ggaaaggtag caccaattaa ctagtaacaa aatgactgcg gtaacaacaa 1260aacgttcgac atgtcaacac caaaaaccac acacccagca taaccgtgaa ccatgatttc 1320tactagaatg aatggcagtt atgagaaagc accagcggag acaaagattg aaaaagtaaa 1380ggtggcctca ttagggagac aagtctctgg gtaatatatt gtaatactgg taaatatata 1440gtttttaata tattttttaa ttccaaattc catatatgtt cctatgaagc tatttctgca 1500aatatttttt tcaggaccgt acatcacaaa ggcaaaaggg ccaggtcagc tctccagctg 1560agagtgacca cttcagagca gacggcagac tccagggtta gcaagcctgg ctgagacctg 1620gcccatgaca atcactcaac ccctctgacc tcaacatcct gtctgtgaaa tggggataat 1680tactgcacct ccacatcaca gagtgcgagg cttaaacagg atgcttcata gaaaagcgct 1740caagaggtaa cagccgggag ggggtagtgg ttttcattaa ttaaatgttg ccttcatcca 1800gccctgggcc agctccaaca caaagcacac accatccact cagactcagt tgcctggatt 1860caaagcccgg cctggcctcc agctgtgaga ttccgggcag gatttcccat ctcccagagc 1920ctcagtttcc tcattcatga aacaggaagt gatcattcct tttattttta tttttatttt 1980tattttgaga cggagtttca ctctagttgc ccaggctgga gtatgatggc gcaatctcag 2040ctcactgcaa cctcggcctc ccagtttcaa gcgattctcc cacctcagtc tcctgagtag 2100ctgggattac aggcacacgc caccacgccc agctaatttt gtatttttag tagagacggg 2160gttttgccat gttggtcagg ctggtctcga actcctgacc tcaggtgatc cgcccgcctt 2220ggcatcccaa agtgctggga ttacaggtgt gagccaccaa gcccagttga caactgcttt 2280taaagacacc tctggctgct gtggaaaaca gcctggtagt gcctcaaaaa gttacacata 2340gaatgatcct atgaccagta attccactcc tacatatata cccaaaagaa ctgaacccct 2400ctactcatgt atgtacacat acaggtacac gcatgttaac agcagtgttc acaaagccaa 2460aacatggaaa cagctcaaat gtccataacc gatgaacgga taaatgaaac gtagtctatt 2520caccacctga cggaggtgag aggggccata aaaaggaatg atgcataaaa acgaatatta 2580tggccaggta tggtggctca cgcctgtaat cccaggactt tgggaggctg aggcgggcgg 2640atcacgaggt aaggagttcg agaccagcct ggccaacacg gtgaaacccc atctctacta 2700aaaatacaca aattagctgg gcatggtgga gggcgcctgt aataccagct actccggagg 2760ctgaggcaag agaatccctt gaacctggga aacagaggtt gcagtgagct gagattgcac 2820cactgcactc cagcctgggc gacagaccaa aactccgttt cggaaaaaaa agaaaaaatt 2880agccaggtgt ggtggcgggt gggtccctgt aatcccagct ctacttggga tactgaggca 2940ggagaaccac ttgaacccgg gaggtggagg tagcggtgag ctgagattgt gccactgcgc 3000tccagcctgt gtgacagaag gagactctgt ctctaaaaaa caaaaacaaa aaaggcccga 3060cgcggtgtct tacacctgta atgccaacac tttgggaagc caaggcaggc agatcatctg 3120aggtcaggag tttgagagca gcctgggcaa cacggtgaaa ccccatctct actaaaaata 3180cagaaattag ccaggtgtgg tggcacatgc ctgtaatccc agctactcgg gaggctgagg 3240caggagaatc gcttgaaccc aggaagcgga ggttgcagtg agccgacatt gcaccattat 3300actccagcct gggtgacaga gtgagattct gtctcaaaaa aaaaaaaaaa aaaaaaaaaa 3360ctaaacaaaa gcaaaaaaac caatgagtaa tgttgtcaag tgaacttcat cccaatggga 3420atgcagataa tttgtttaaa aggcaccatg cacactgggc aggctggctt cccctgggaa 3480cgtcttcttt tgcctggatt cccagttggt ttaatcgggc gtagaacact ttcttcaatc 3540cgggattcag gcacccctgc tcagcacaaa ctcagtacac cccgcactct gctgtgggtt 3600cttggcacta ttaggagaat gtgagggggt gattcagatc tatctctagt gggtgcatgt 3660ctgccactcc caggaacgcc cacttctggc aagtcagtgt cagagaaagg ccagctcgtg 3720gcccctcctg ccttgagtcc caggacccgt gatcagtcct acccggagca gaatcaggag 3780tttgaaaacc caagtgccaa caatctcatt ttaacccatg taagcatatc caatatttat 3840atatagaatt cataacagat gtctgggctt ccattccaat agcctatatt ttacactgtt 3900tatttacatg gttacaccaa acaagactca attcaaggta acccaatcct ttgctactat 3960accaaaataa gcaacatttt cagtccatgc cttatatata ttcaccaagc attacactag 4020gcctccaact gctcatcgga gcaagctgca gcctggacac aagctagaga ttaatcagtc 4080aggaatgatc ctgcgtccag tgccagcatg atggaagaga cagagaaaca gaagacatca 4140gggctccaga gtcaaggagc ctgcaggtta gttgggcagg atatacacac atacacacac 4200acacgcacac acaaaaccac ccaagaagaa aaggtgggat gaatgcatgg acaggtaatg 4260cctggagcct ggggatggat aagctgactg caggtggccc aggcaggctt cctggaggaa 4320gaagacctgg ctgtangtgg ggtangcang ctttctaaat ggggaaaatc tggctgtggg 4380tggagttggc angtttccga aaagaagaaa agctgactat gggtacacct ggctgttggt 4440ggaacangca ggcttcttgg aagaagaaaa tctggctgtg ggtggatcan gcaagcttct 4500tggaagaagt aaacctgact atgggtggac caggcaggct tcctagagga agaagaccgg 4560ctgtgggtga accaggcagg cttcctagac agaggaagat ctggctgcgg ttagagtggg 4620caggcttcta agaagaggaa gggctgactg tgggtagacc tggctgtggg tagactgggc 4680aggcttcctg gaggaggaag agctggagca ttgaaaaaca aacatgactt ggtgaatgtt 4740gagcatgccc aggcctgatc cccagaggca attacgcact caagttactt aattctactc 4800acaatgcctc acaaacaact tctctgacac ctaacacagc tctgggcacc ttctagcttc 4860agctcctcaa agcagttatt cacgctacta ccctgcacac ctcctcacac cccaacccca 4920gggacaggag ttctgccaga tgccaaagct cctgatgcca aagcctgggt ctgcttccgg 4980gctcctcttg gtctaactgt ccaccccgca tcggcatgat gtgcaaaaac aaggctttgc 5040aatctgccct gatgcctggc ggagcgagtc cctcccgatt cgtctccttc agaaacacct 5100gggctgccct ggtcctgtta tacccccaac acattctaca gtcagctccg caagttccac 5160aaagatcaac gctggcgttt ttatggcatt ttatttacag tttttacaat ataaaaaagg 5220aaggatgcca cagctcagcc agcaggacag acagagatct atgatgcttc tgctgcacca 5280ttgtttgtgg tcaagaaagt ctgttttcaa tgatttatta aattgtggtg ggagatggat 5340ggtggcagtg gttaccagca acatgaatgt tcttaatgcc actgaacttc acacttacaa 5400atggttacga cgataagtgt tatatgtatt ttaccacaat taaaaacagg taaatgcagg 5460ccgggcacgg tggctcacga ctgtaatctc agcactttgg gaggccaagg caggcagatc 5520acctgaggtc aggggttcga gaccagtctc gccaacacgg tgaaactctg tctctattaa 5580aaatacaaaa attagccaga tgtggtggtg catgcctgta atcccagctt ctcaggaggc 5640tgaggcagga aaatagcttg aaaccgggag gcagaggttg ccatgagctg agattgtacc 5700attgcactcc agcctgggtg acaaaagcaa aactctgtct caaaaaaata aaataaaata 5760aaaataggta aatgcaaaca tatggtatag taatattatg ggctattatg agctacaaaa 5820aagaatgact tgggactaca gttacagccc tcattcagga atttgtttta aatgtgggtt 5880ggtcgctaag gcatgtacac aacattttga cgttcaaata ttcctagatt tggacagtga 5940gcacccctct aagctggctc ttctgtccca gaggtcccca ccagtcctcc agaacttctt 6000tgctttctta cacaataaga tgccccatgc tcggcttgta cctttccttg ccccagccct 6060agaaccagct tcttcgtgga caagctctga ctcctttggg tggagaatgg tattcagaaa 6120cccagacctg ggctctggtg tgctcactgc tacttggggt cattgcttct aggcctctct 6180gctgatggag gtaggatata cacgtacagt cttccctctt cccagattcc gtacttgagc 6240tcgcctactt gctaacattt atttatatcc cccaaattaa acctcacagc acttctgcaa 6300tcactcactg acttgcagag tgtgaaaaaa ctgagtcacc atcacacgtt ccaaactgag 6360gtcaactgag gccacaacgc cccatcttct tgctccggct gtcgagatgt aagcaagtgt 6420ccttctctcg gtctagctag tgccatgctt tccacatcac tgtgcttttt gtgggcaatt 6480ttgctgtata aaatgtcccc tgcacatatg ctgctgtgta gtgctcctag gtgcatgagg 6540ctgccccacg ccttacagag agaatatgca tgagaggctt tattcaggta tgagttatag 6600cgtagttggc catgaattca atgttaatga atcaacaata tacagtaaat aaggtgcttt 6660ttagagacag ggtctcactc tgtcacccag gctttagagt ccagtggtgt gaccttggct 6720cactgccgcc tcaacctcct gggctcaagt gatcctccca cctcagcctc ccaaactgtt 6780gggattacag gcgtgagcta ctgcactcag cctaaataag gtgtcttaga aacacacata 6840agacaaggtt atgggctgag tgcggtggct catgcctgta atcccaacac tttgggaggc 6900caaggtggga ggttcacttg aggccagaag tttgagacta gcctgggcaa catggcaaga 6960cctcatctgt atattttttt aaatcagaca ggtgtggtgg tgcatgccta tagtcccagc 7020tactggagag gctgaggcag gaaaatggcc tgagcccagg aggtcaaggc tgcagtgacc 7080catgattgta ccactgcatt ccagcctggg gtgacacagc aagacgctgt cttaaaaaaa 7140aaaaaaaaaa aagccaggtc aggtatcgaa cagttggcaa aaacgttgtg acctgaggct 7200cacaggaacc tagcccgatg tttcccctag gagcaatggt tcagtattca ataattcagg 7260gttcccagtg actttatgga gcataacttt caagaataac aagaaccaac tgtacgtgtg 7320tatgtatact cacactttta ttttatttta ttttattttt tgagacagag tctcactctg 7380tcacccaggc tggagtaaaa tggcgtgatc tcgactcact gcaacctccg cctcccaggt 7440tcaagtgatt ctcagcctcc caagtagctg ggattacagg tgtgccccca caaccggcta 7500atttctgtat ttttagtaga gacggagttt cgccacattg gccacgctgg tctcaaactc 7560ctaacctcaa gtgatccacc cacctcagcc tcccaaagtg ctggaattac aggcatgagc 7620tgccgtgcct agcctacata cacttttata cacacatgca tctatgacta tttctctatt 7680tctgtgcatg tgtgcgtggc agtacctaca gtttcagcta tgtgtctggg tactgtctcg 7740tccaagtttg taagcacctt ctccaaagtg caaagcctgg cttgtgttac tatccatatg 7800tttacttatt tgctcaatca atttacttat tagctccata accagcttcc catctgctcc 7860agtagcctct gctgtcagtc acctctgcac cctaccccac cttgcttccg gatgctggat 7920gccaatcacc cccgacacct ctacatagca ccaccctcga catgctgctt ctttatttct 7980tatttatttg tttgagatgg agtcttactc tgttgcccag gctggagtgc agtggcacga 8040tccaggctca ctgcaacgtc cgcctcctgg gttcaagtga ttctcctgcc tcagcttctc 8100aaatagctgg gattacaggt gcccaccacc acgcccagct aatttttgta tttttagtag 8160agatggggtt tcaccatgtt ggccaggctg gtctcgaact cctgacctca agtgatccac 8220cttggcctct caaagtgctg ggattacagg tgtgagccac cgcgcctggt ctgcttcttt 8280aaatgccagg caccaacatt tgtgcaatgg ggtgggagga aagaacaggg aggagagcac 8340actgccggcc cctgcactga atccactgat caatctgggg gcaactgcca tctccatctc 8400ctgtcttcct atccgtgaac atctactgca gtcctctcca atgtccttct gtaaagttgt 8460attatgtttt gcatacaggc cttgcatatt agttctcaga tataatccat atactttata 8520taaaattcaa accacattta aaaaaataaa actagcatga ctataacgga gtctgcaaca 8580ttctcacaga ctttatgata aaacatgaaa cttcaaagat acttagggtg gggcagggac 8640aatgtttaag gctgcctgga agcctcccca tccctgagcc agaaagtcct atctcccctt 8700caaggggaaa tgcttgaaaa agcactgatc aggctaaaat gacagggatc agggagtaat 8760caaagtacaa gtgagctggt ctcctccatt ctgagcacag caaagttcag tctctccaag 8820tccaagaatc atacacctgt ttgccaagaa tgaagttcag gtgtctacaa gtggctgaaa 8880atattcattg ctgggccatt aacaacattc ttggcaaaac cataccttag cttctcgtgg 8940aaatttctta aggtagaaga aacaggaaac acccaggctc gcttttatgt agacagttcc 9000atgaagccag ggaccttccc cacatccacg tttcaattac ctgcacgcag ctcacagtgt 9060attcaacatc tacgcgtctc tcctactggg gtggcggtgg ccactcaaac cctcatgcag 9120ctacgatgac cgcaattttg gcaacataat ttcatgtttt tccttgggct tttacccaag 9180tcagtgacac aattctgcag ttgtctaaag attcaaaatg agggacttga catttacaac 9240aataataaaa tcttgggttt cctttaacca agcacatgtt ctgcctttta gagaaagctc 9300tgcaaactca agctggagtg ggatacttgc tgacatcttc aagcacccca ggaatagctc 9360tactccccca tttccacctt ggctgaacca tctatatccc accaattccc ccaacatccc 9420tccatccgtc catccatcca cccaaggacc tgctaagcca ggaggtctct cccatctacc 9480ccacagcctg gcctcagccc acaagggctc tctctacatg aatcccaccg caccagagta 9540gaccaagtct cccgtagact ccaccctgac cacctccatg cctccagcca ttcccacccc 9600taaaaaccct ccctggtctc tacacccagc tgatgaatac ttggctgaat gtgacctggc 9660ctcctggacc caggtgaagc ccacgtcctc cgtaagcccg ccagctcacc ctgcctctgc 9720accttcactg gagagagccc gcacttcacc tcctcagggc aggcatggct gatgccaccc 9780agtggaatct ggtgcaaagc agggcccggt gcagagcagg gctgcctgca gagcaaggcc 9840ctggtgctgg ggccgagcac ctccaatgct ggccgtggaa ccatccctcc cattccaggt 9900gctgtctcca tcaagaatga gcgagctgct gacatttgca tgacaataat gaataaatac 9960catattttgc ttcaaatcca gaatagatgt ggccagggtt ggcatatgac tgttgggaaa 10020ggacagtttg cctcttccca aaccaacttg gattataaaa agcttttctt aacgaccaca 10080agagcggagg agctcagggg cagacaaaag gaaggctggc tgcagaaggc gggagagtgg 10140ggccttcagg ggcgggtggg gagagagaaa gcctggagct gcacccccaa ggtctgtgta 10200catcaggtgc tacagaataa caccacctct tccagcttgg cccccacctg ccctctccca 10260gcccagtcac ccagacagca ccccactccc cacacacacc tcacatctgc ccgcctcaca 10320ctcaccagct tcggctctca atgcaacctg gaacctgccc ttggcctctc agctcagcca 10380cccccattcc tgttggcccc tggcccccca tcgaattctc tctaatccta atgcacacac 10440ttgcacactc aaacacacac acacacacac acacacacag cccagaggaa aaccataatt 10500gactgaggtc caggcaagtt tcccgagcag ggaccacatt tcaaaggtca gggaagcagg 10560cgaacaggaa acatacaggg ggcacgtttg ggggtggagc aggaaataag aaatcacttg 10620caaaagataa aaagaaaatg aggtagctgg tttcagacac ctcggagcac acagaacagg 10680acaggcgcct ccgggtcttc cctcaacagg gagatgggcc aggcaggtcc ctgctgctcc 10740accgcagagc tgggggctat ggccctgaca ccaaggccct ggggcaggcg gggaggcagc 10800tgttctcctg cctgtgctcc cgggcagggc ctggccccac aagggaactg gccgaaggct 10860ctgcttggct actccggaaa gtcctgggag acaagcaaag gacttgctag gtcactccaa 10920acggcccaga tgtgacaact gtgaagaagc cacaccaaag caaggtgaca gaacaatgtt 10980ggtgacgtca ggttatcagc ttacgctcaa ctccacttac ccggactcac ccgtaacctg 11040ccgtctcttc ccaaccagta aaggatgcct aggtagaggg gcacaaggcc tggagcataa 11100ttaccatttt aaaggctctg agaagtcctg cggtgaggaa gcctagttca ctttctctcc 11160cctaggattt cccaactgcg cctgatcaca gaacattttt tcatttccac tcaggaaaca 11220tattttgaaa aacactggcc tagaggcaga agtgaaatgg aaaacacaaa agtaaaactg 11280aacaggaggc actgggcaga gaacggtcag aggcgccctg aatcctggac cggtggagat 11340ccccagcttg gcatgctccc ctccctgggc ccagaccgcc tccccccatt tcctggataa 11400gaaggctaat gcgcatcagg gtgaagggct tgcctgggct acacccccag gctcgcccca 11460caccaatcgc gctcctgcga gagccagtga ctttcttgat ttggctactg tggaattgtt 11520tgcaactaac caccccagat acagatacaa atgacaggat gatcagatgt aaaggaccca 11580caggtctctg tgatacggct tcatgcagcc agcatggcta gtgccgtgca gaatgagaat 11640gaccccaggc aagtccttgc ctcccagacc cagaacccca tggagcccac cagggctggt 11700tcacaagcac tgtctgggtc gggcagagat tccagcaaga ggagggaaca tccatgcacc 11760ggagccagtt accagaagca aatcgcctct tccaaaaccc aggctattaa tggagtccac 11820tgttgagtgg agctggggtc tagctatgga atactgcaca gcagagatct tcctgagaga 11880aagcagtttt ccctgaaagc catgtgtcct ccactaactg tgttttaatt gggcgaacgt 11940ctgtatctca ttgcagtggc cgcgcatgtg ctgacaaggg gctgggggcg gggtggggag 12000cagaagctca ggggcctggg agggaaggaa acaggccacc agggctcccc agaaggcatg 12060tatctctctc acaaacacac gcatgcacac acacgtgcac acatactctg caagccctga 12120gttagcaact gtggaatgtg accagctcag tgatcccagg acaagctgct agggaatatg 12180acatttgatt gatgtctgca aatgtgcgtt ttcactaatt agaaggttta gggcagagca 12240gagaaaaata tgtatttcag agtcccagtt tgacctgcca gaaaccagcc cattactaac 12300attcttattt tcaacaaaat atagcattct gattacatac catcttggtt ccacgcctcc 12360tgccttgcca agcccccgga agcggcccaa ggccatggca aatagtgaga gaaacagttc 12420cagggtggag actgactcag gggtgtcagt cagtggggcg ctgatggccg gtgggaggcc 12480agcagtcatc accctctcct tgggacagtt gagtagctct cccccagggt catgtggcca 12540ctcaggttca tatgggaggc gagaggagtg gcagagtcca ggagagtggc tccgaagtca 12600ctgttccctc caggcctcag tgtcttcatc cattaaatgg gtaggctgag gtctgggatg 12660acaaggaggg cttgcactta ctgaaaccca tgggaggctg ttcgccgatt tcttttattg 12720atggaagaaa acactcgtat aattcaagta ccaattaaaa ggcaggcact ggaaccaccg 12780tctgccaatt cctagttttg cctataccaa atttgagcaa gttaattgac ctctcccagc 12840ctcagtttct tcgtctgtaa aatgagggta gggatggccc ccagcccaca gggcagctgg 12900aaggattaaa gaaatcaaac atctcttaga gcccacctgg cacactgtga tacacaacaa 12960atgttagcta tttttgtcta tgaagtctag attttatatc ttgggtgttc taaagcagga 13020tacatttatt taaaaacaag gattttcatt aaacacgtac cccacagaca gcaaccccat 13080ggagactgct cttaattcag gccagtatcg aaacgactct aactacaagc tttatacagg 13140tctcttggct gtccttcaaa tccaactaag gtggtacttc tgaagcactg tgcacatgtg 13200tgtgtgcatg cacacgtgtg ggaagggcgg gctcacggat ccctcaggta ccccacccac 13260gcagtctcaa gtcacaaagc gacagagcag ccgaggaagg tctgtgcccc actggaccct 13320cgtgaagcca ccaactctac ctctgcgccg tgtcctgcag actgggctac cctttgggtg 13380gggaccagca tttgatgcaa gaaaggcaga cagaaaagga aaagggcaag ttcgactcca 13440gataacacag acagtaccaa gccccagggt ccataaatgc cacgcagatg gaagcattta 13500ctgcgaggcc acacagcaaa cgcacggatc cagggacgga ggtgcagact gcggtgcccc 13560tgagccatga ccctgcaaat taccaccatg ggaaaggagg ctgccaaacc ccccgacagt 13620cggctgggct ggcacagact cgtggtttcc atcgaggtgg gaggaggtgg gacgtcccag 13680cccctccccc atgcccactg cagagggaag cggccgtttc ccctgtgtgg ttacaaaggt 13740ctcattgttc ttcctcacag ggaggaaact ggaggaccga gctcagaacg cattttagaa 13800ctggcagaaa agaacatctg gggaaggaaa cacatttcag aaacaaacat acctttgtac 13860cagcttttat tttctttaag tgttgaaaaa ataataataa taaagacatg ccaaatttat 13920catcgctcta caaaatccct ttattgagca aaacgtggca gctctacttt caaatgatta 13980ctgttcctgg aaaattgcag caacgtggat gccaaggccc gaaggccgcc atcagcagcc 14040aaacaaaaga tgccacctcg ggctccgcga cactgtacca tgccagggaa ctggacagat 14100ttggggaatg ccacggtttg cctttaaccc cttgcctcct ggtctcctga tgcatctcag 14160aggctaacat tctttgagga actggcattt cttagttgta aatatgcatg tgggtttggg 14220agctgcctgc aaagtccagt gttgacgatc agctttgatt tccttggaat caagtttacg 14280tgtcgagtct ggaagttaag aagaatttgg agaagctgag cactatggtg ttgcaggccc 14340tgggtgaact cttccaccaa gcattcattg tggactgaca gcgtgcgagg ggctctgcag 14400gcaggtgcac aggacgaaac acattccgtc cgggggaaac ctgcaggaaa gctccctctt 14460cttcctaagg tgccgggcct agcttcatgg gtccctaccc tccacgcctg tcacactttc 14520tgagtctcat gtgggagctg cttctggttc ctgacttcac tcagtcctca taggaggtgg 14580aactactgtc accccatttt acagatgggg agactgggca caaggggacc aagaaaccaa 14640tgcaaagtca cacttgtggg atcagtgaca ggggagatca attcccaggt tctttctgca 14700agagttaaat tgttttcatg ctgcctaagg gggggcaact gaaagaccac tgcatatctt 14760tgccaaaagg gtcaagcaca ggagccgcag ccagtgggtc agatccgcag aggcgctggg 14820gtgaccctcc ccatacctgg agggatgctt gtcccctcct ggccttcact gggtcccctc 14880atgaccgtgg cctcccagga cctcagcaca atcccggtcc tgtgctccag gacaagccct 14940ccgtccccaa gactgtgagg aaatggaacg aagaggggct cgctgcagcc cagcacccac 15000actgcccctt ctcaggggca agaaccgtcc tggaggactt ggctttggag ggggagcctg 15060ggaggccagt aagtcaacaa gcctctactg ctcatgggtg ggatcccacc gcaggccccc 15120acctgctggg gcgggcaggg acgggcggca cagcttggcc agggcagata acccccacct 15180tggccagggc gaaggcagga cacgtgggct ccagcctggc cccaccatcc ctgcacaaca 15240ctgggcaaag tccacgtttt cctcaactgg gtgttgacat ctgcaggaca ggggcatgga 15300ggtacagagc gctgaagcca cacagcaacc taggagcgag actccatgcc tccccgggga 15360cccctcccca ccatgaggac catgaaggct tcccatgtgc cgcaaggact ctggtgtgga 15420gacacacgtc tcctacacag ccaggcctaa cgctcttgta actgggtggt cccacctggg 15480ctcacagctg gagggccagg agctcaaggc ttcgcagggt ctgctctcat cccagaggcg 15540atggggagcc acagcaggct gcaggagaga gggtgggccc cctccacttc agaggcccca 15600tctggcccac agactggaga gcacatctct cagcaaccac ggagcgccaa ctgcgcacag 15660ggcctggtcg tcagagcggg gcaaaggcac tgaccgtcac ggccagggcg agggaagacg 15720ggtgggcagg gaccttgggc agagggggaa gaacctggtg cccaggctgg ccctgccttc 15780agcagtgaag ctgagtgggg aggcgctgat gcagggggcc agaaagggct gctggtcagc 15840cgggaggagc cccccacaga ggaagcagcc agcccagacg cagatggcag ggtcccctca 15900acaatgtcct ctgaaaagga gaggcgggga ctgctctggt gacacctaca aatagatagt 15960cagccctcag ccccctgcca tacttctgac aaagcagagg cccccagggg aggcgcaccc 16020gaaggtacct gcacctgtcc cccagactcc tagagcccac ctgaccccat cccaccaggg 16080ctccagctac aaaataaatg ccgaggccag ctaggcaagg acgcacactc ggtaccgact 16140gaataggctc cacgttgtca tgagcgcaac ccacaggcca ccaggccaca ctatgcagag 16200ctgagatggt ttcggccaag cagcctctca gctgagctga acaagtccag agtccccggg 16260gggtcgtcac tatggagtaa caattgcgat gcgatggtaa ccctaacagc taaccgtcac 16320tgagccaggc cctgagctag gtacttttca acgctgcctc tctgcagcct caggacgagc 16380ctgtgggagc ataaagatca ttccctatca cggatgggga aactgagctc tgaagcagtt 16440aacgtgcttg tcccagaccg cagagctagg agcaggacac aacagcaggt caggcaggaa 16500cgggtgaggg gggcctgcat gggcttctct ggaggctgcg catacacgca acccccagga 16560ccccgaccct gcacctgcag ctcgctactg ccccctcagt gactccagca aacctcgggg 16620taggggaagg aggctgggaa tacctcgggt gtccgaaaca gcagcttctg cttggaggcc 16680actgctgcat aatggttgct gcccagcaca ccccaagcca cctgtgccac ctgtggtgac 16740cttccagcat gccttggtga ccaagctggc cttaggtgct gtgggcagcc aagaatagaa 16800cagggcccac ccctcctctt cacactaaca caaagcaaga ggcgggcact tcgactgagt 16860gcatccctct agctcaaggg cctcacggat cacaggggtc agggcaagat cccaattctg 16920cattcccgtc tgcctttcat cctgctctgc caacaacagc cagtgaggct ggggacatcc 16980ctgaacctgt ttctcacctg aaacacatca taccattgga ccccagccct ccgggagagg 17040ccctaatccc tgactgtggt gagatcagat cactggttaa gtacccagaa gggccttggt 17100caggggctcc aggggtgggg ggtgatgggc gtggtggtat cccgctctgg gctatagtcc 17160accctgatgg aggaggtctg tggtcagaac cgggctgtgc agggcacagg agcccagagg 17220gacccccaga gctcacctgg tggtctctga gcagggctcc ctcaaccctc agagaaaagc 17280acagcaagga ggccgcccag agcccagcgc ctagcaccca gtggcgtgcc agacctgcct 17340ggatcctgga gatctctcat caccctccaa gtcagtcatg cccaacccag ggacccacag 17400cccacggggc cgtgaaggtg tgctgagtcc aagaaggcct tcgacactgg gaagccaagt 17460ggcacctcct ggtgtggagc aggcggaatc ccaccagcct ctgctctgcc agtgggcaca 17520gctggacgat gagcagaagg ggctgttgct taataaacgt catttcctta agaggataaa 17580acctttcaaa acagatggaa attttttttt aattaaaact ggtggccaaa gagatggaaa 17640gcaccccttg tgcctccctc ccatcgtgac ccatcctctg cacacctcaa gctgttcgct 17700gcccaggtgt ctcctgaggc actgggggcg ggtgagaatc cgtgagccct cggccagccg 17760tggctctctg gagctctgcc ccaggccatc agggcacacg ccgggcaccc tgggggccac 17820acagggcaga gcccagctgg gtcagcacac agggccacac tgggcacaca agtctctgag 17880cctcccctgt ggacgcagct ctcactatcc caccccacta ggtcccgggg atctgtccca 17940cagggtgata tgctgtcaca gaccactacc agagccatgg cctgctgttc cgcccgcagc 18000caggtagtca cttgctccac agggacaggc aacgccgcac ttgggggctg ctctgcggca 18060ggactagagc tccagcagct cagccctcct gagaaggaga actccatgct ctaagaggca 18120gacgcagcgg acggcaccaa agccaccaca agcccacggg gccctgcatg gcaggtcagg 18180agtccctgac cactcgctct ttgtaaccag agctgcagtg gagtctacga ggcaaggact 18240gtgggcggca gtggccacag caaatgaatg agtgtcccaa gggagcaggc ggctgcgggg 18300aggcacagcc gggacccagg agtcctccgg cactgcagca aactccctgg gccccctgag 18360cagcgaccag gtggcaagtg catgaactcc cgggggcata acctgggagg gtgacactct 18420cttcgtgttc aaattcttga gaacgcatta aaaatatcac tcagtcacct actctatagt 18480tttaactcaa aagtaccaaa gtagccaggc gcggtggctc acgcctataa tcccagtact 18540ttgggaagct gaggcaagag gatcacttaa gcccaggagt tccaaatgaa cctgggcaac 18600atggagggac cccatttcta caaaaaaagt gttttaaaaa attacctggg cctggtggtg 18660tgtgcctgta gtcccagcta ctcaggaggc tgaggcggga gaaccacatg aacccagggg 18720aggtagaggc tgcagtaggc tgtgatggca ccactgcact ccagcctggg taacagagtc 18780agactctatc tcaaaataaa tttaaaaagc accaagccag gcttggtggc tcacacctgt 18840aatcccagca ctcagggagg ctgaggcaag tggatcacct gagtcagaag ttcgagacca 18900gcccagccaa catggtgaaa ctccatctcc actaaaaata caaaaattac ccaggcgtgg 18960tggcgggtgc ctgtaatccc agctactcag gaagctgagg caggagaact gcttgaaccc 19020aggaggcaga ggttgcagtg agccaagact gtgctactgc actcaagcct gggagacaga 19080acgagactcc atctcaaaaa ataaataaat caatcaaaac caccaagact ttttaatata 19140aacatttatt attccataat tccttttttg catgattaaa aatgtttata taaagtttcc 19200tgaaaatggt aagaatgcca agtgaaggct gcaaatgccc aagcccccac cgtggcatct 19260cacggagtct gggccctagg aggctggtgg gtaccacgtg gacccgagac ttcacagtca 19320agtccctttg gggtacactg ggtttcccac accccagaaa tatgggctct tactgcagga 19380ccatgggggt cctcacactt ggcccagaag ctgtcacata gccagacagg tgttctacaa 19440cctaggctag agggagctca tgctccagca gaattcgagc cagaggaggt aaaagatggg 19500taagatctgc tccctggaca gatgaggcct tggcctcaga acagttactg atcatctacc 19560agacatcaca ctagaggcag aggggcgcag acgaagacag cccctgtcct caaggccctc 19620ccaggttggg tggaccatgg aaggttccag acagatctgg caagagaagt gcccacacca 19680ggggcagaag atgggcaggt ctgctcaggg cggcacggcc tgccaggcca aaaagttcca 19740acttcagatg ctggagaatg ggcacgactg tctgagaaag ggaaggatgt gatgaaaact 19800acttggagaa aaattaatct ggccagagca taagataaat gggcaaaggg gaggttccag 19860aaagcaagga gaccaagtaa aagctgatgt cattggctct gaatctaggc tttcactgaa 19920tatgcaccgc agggcctgta ggtaaagcct cagagcccag ggagtctgag tggaggagag 19980ggcaggggac agagctgggg cctgtgtcta cagtgctcag gaggaatagg catggacgtc 20040agctcggagg ctccagctga agtgaggagg cggccagggc agcacggcca cgcccggatc 20100cagactcctt ttgggaagca agttcgctct gggggaaagt ttggagaaat ggcctttacc 20160cgcagaagca agccccagaa catatcttgc tccaaaacta tctcgtacag tgaggacgtt 20220aagcttcagg tcccctagag gagacagtct gctccttcct ggggcagaac ccaaggtggc 20280cagagcctgg aaggcaccca gcacccaggc tggtgtgttc cagcccaggc cacacgctca 20340gatagctatt aatgccccgt tgagcaattt cctgagagct ttgccaggca ggtaccgcct 20400ccccatctga actaatacag gggtacatcc caaggaagaa atgaaaggtg cccacatttt 20460gctctgggat taactaggga ggggagtgat aattaactca gtaattatat ttgccatcgg 20520gctaatgcta aaattagtgt gcattagaat ttctttcctg agcagacacc ggagtgagtt 20580gggcagcagg agtggctcgg gcaagtcggc acaaagggca cctccagagc cttccacaaa 20640tgtcagcaaa acccacaaat gtcaaggccg gctccactgc acccagcaga tgaattcact 20700tccacagcct gagaccgcca gctcatcgga ggccatttaa aatccagccc tctgacacct 20760gctggatatc accatttacc gtccccagat caagagatca aagggtggaa cctgatagga 20820cggctctgaa gttcaccaca aaagcataaa cgtgcaagca gagccaatac gtcttttgaa 20880aaggacaatg aggtgggaat ttacataact gatcttaaaa tatgttctga tgcttcagag 20940atggagacag cagcattccg gtacacaaag acactcacag gcagtggagc acagtgaagg 21000gtctggaatc aggacccagg tgtctgtgga cactacacat aaaagagcag catttacaat 21060gaatggatag gatggaccat cccaccaagg tgttggacaa ctccctattc actggccaga 21120cccctacctc ataccatata caaaaaaaaa aaaaaaaaaa aaacccagac agaataatgt 21180ctgaatgtaa aacataaaac agtaacagtc ctggaagaaa ataatggagg atatatttat 21240aatctggaga tggagtaaca agggatagga aaaaagccat agggaaaaag tagagttatg 21300attatatgaa gcttcttaat atctttatga taatgtacca ccagaaacaa ggatgaagga 21360ctagctacag accagcagtg aaacctgaaa caaacagaac aaagaattaa agtccatacc 21420aaataaagac ctcccacaaa tctataagaa aaagataaac aggctggcac cgtggcttat 21480gtctgtaatc ccagcacttt gggaggcgga gatgggtagg tcacttgagg tcaggagttc 21540gagaccagcc tggccaacat ggtgaaaccc tgtctctacc aaaaatacaa aaattagcca 21600ggcgtggtgg cgcatgcctg tagtcccagc tacttgggag gctgagccag gagaacagct 21660ggaacccggg aggcagaggt tgcagtgaac caagatggca atcgcgccac tgcactccag 21720cctggaggac acagcgagac tctgtctcaa aaaaaaaaaa aaaagaagaa gaagaaaaaa 21780gaaaagaaaa agacaacaga aaaatgggcc aaggataagt gtaggcaatt tgcagaaaag 21840taaataccaa taaaccagaa atgagggttg tgcaaatcaa aaggtgttat aatttttaac 21900caaactggac caaagaaaac accaaaaacc aaaatcttgt aattgccagc atcagagagg 21960atataggaaa gtgtgtgttc tcgtagatgc ttgcaggtat gaactgctac agccttttag 22020gagttatgta tgtatgtatg cttgtatgta tgtatttgag acagggtctc gctctgttgc 22080ccaggctaga tctgttgcag tgctgtgatc atggcttact gcagccttga cctcctgagc 22140tcaatagatt ttcccacctc agcctttcaa gtagctgaga ctacaggagt gtgcaatcat 22200actcagctaa ttttttaaat tttttgtaga catggggggt ctcccaattt tgcccaggct 22260ggtctcgaac tcctggactc aagtgatcct cctgcctcaa cctcccaaag tgctgggatt 22320acctggatga gccactgtgc ccggcctcaa tatctttaaa aacagaaatg gacacactct 22380ttgactagga atgtatccta taaaaacact tatacacatg cagagacaca cgagcaagca 22440tgctttgtaa tagcaatgaa ggctggaaaa actcctcaat caggtaaatg ctgtcaagtg 22500cacctgtgta ctatgaaatg gcacttggct tttaacaaga gcaaagacag aaaagcaaaa 22560gtacaaagta gggtgtgatg gcacatgcct gcagtcccag ctactcagga ggctgaggca 22620ggaagatcct ttgagcccag gagttggagg ccaggagctg ggcaatagtg agaaaaaata 22680aaattaaata ataataataa taaaataggc tgggcacagc ggctcatgcc tgtaatccca 22740acactttggg aggctgaggt gggaggatcg cttgatccca ggagttcaag gccagcctgg 22800gcagcaaagc aagacaccca tctcaacgac aaattttaaa aaatcagcca ggcaggctgg 22860gcatggtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcagg cagatcactt 22920gaggtcagga gttcgagacc agcctggcca acgtggcaaa accctgtctc tactaaaaat 22980acaaaaatta gctgggcatg gtggcagatg cctgtagtcc cagctactga ggcacaagaa 23040tcgcttgaac cagggtggca gaagttacag tgagccgaga tcgtgccacc gcactccatc 23100ctgggcgtga gtgagactcc tgtctcaaaa aaaaaaaaaa aaaaaaaaca aggagccagg 23160cacggtgggg tgagggaggg cacagaagca gcgcctcttc tgggggcacc cccaatctct 23220agcgatccag aggcctcagg atcctgaagg gagaaaaaac gtgaagctcc gtgctagaag 23280agaccataga gattggaatc agctggttct attttacaaa aaaaggaaac tgaggccctc 23340agaaggtgag tgcctctcaa tgccccacag ggaggcaggg agagggctct gagccctgca 23400gggccctgga ttcttgcaat ggggtggagt ggagcctgtg ccgcccccac caggcacctt 23460ctcaggagag gagccgttgt catatccttg aaggggtcct tgagcccctc aaaaggctaa 23520aaaccacttt cctccttgag tgaaccttca cctcagttta accacaagaa aaactacatt 23580aaggcccagc gcagtggctc atgtctgtaa tcccagcact ttgggaggct gaggtgggtg 23640gatcgcttga gcccaggagt tcaagaccag cctgggcaac atagtgaaac cctgtctcta 23700caaaaaacaa caaaatcagc tgggcgtggt ggtgcacacc tgaggtccca actacttgcg 23760ggctgaggtg agaggattgc ttcagcccag gaggtagagg ctgcagtaag cggtgactga 23820atcactgcac tccagcctca gcaacagagc aagactcaaa aaaaaaaaaa aaagcaggcc 23880gggtgtggtg gctcacgcct gtaatcccag caccttggga ggccgagcgg gaggatcagg 23940agatggagac catcctggct aacacggtga aaccccgtct ctactaaaaa tgcaaaaaat 24000tagccgggcg tggtggcggg tgcctgtagt tccagctact caggaggctg aggcaggaga 24060aaggcgtgac cctgggaggt ggagcttgca gtgagctgag atcacaccgc tgcactccag 24120cctgggcgac agagcaagac tccatctcaa aaaaaaaaaa attaaatctc aaaaaaaatt 24180acattaaggc aaactaaaag atgtttaaaa tatatatatt aaattaaata cactccaata 24240gagcaaatac gaaaataccc agaaaacaca atccccgcac ccccaggaca acctcccagg 24300gggtccacag caagagaccc caagcacgag agacagagaa cagtgtccct gtggcggaac 24360ctctggccca tcaggctcta ttagaaaata aggctcttgc cactgagaga aagaggcaca 24420gtcgcccagc agccacgggc tctggcacac cacgagtcag gccagcaaag tgtcaactgc 24480cccctacaag gtgacaaact aggacaaact ggaaaccaga ggctggacct ggagcacagg 24540gaccaccaca tggggctggg gaatgggcag ggacctcaga gcgccaccca catgcctaag 24600agcagcgcgt atgcgcatgc ctctgcatgg cttagggaca cagggagctc cccccacccc 24660caacccagga aggcagcccc cactacccag gtagggaacg gataggacca gcaccccgtt 24720ctgctcgtaa ctcagggctc caggccccct cgggggcaac cagcacagag ctcagacccc 24780aaatatcttc acccacctcc tggtccccat ctggacaagg gtgctgggga ctggctctca 24840gtcacaccct cggggtactc ttcaaaggac agctggatgc cccagggcag gagcttttgg 24900cccccagctc cctcacccca gacaccagct cttgggaccc caccagcatg ggcaaggtgg 24960acaccatcgt cccgattttg cagatgagga aactgaggct gagggctggc acacggctct 25020ccagagctga agagaatgca gagagcagcc ggagccagcc ggtgggtccc tgaggccggc 25080tcgtagcaag ccacagctgc ctccgcccat cacacttgga cctcactggc cccaggacag 25140ccctccaggg cggcctggca cagagcccac accctgctgc ttcctgaaca aataagtgaa 25200caaggccacc aagccgagga cctggatgta gccccggctc ccgccagggc ctccccaaca 25260gactccccat ttggagagcg cattaagtgt ttccaaagcc tcacaaacca cagatgtccg 25320gctgtctcac ggcttctgta acctgaactt ggccctcact ctgccctccc agcactcctc 25380tcagggccca ggcccctcct ctgagatgcc agcactgact ccccaacttg tccccatcac 25440ctggctcgtt cctgaacctc ggcaggagag tctcaggcca gatcctccca ccagccacct 25500ccaccaggat gcaggaggca tgagacctgc tcgtgccggc tgggagatgc aaccaaccaa 25560gatcaatcca atcagcggat gaactgacaa atataatgtg gtccctccac acaatggaat 25620attattcagc cacaaaaagg gctgaaatag gccgggcgtg atggctcaca cctgtaatcc 25680cagcactttg ggaggccgag gccggcagct cacttgaggt caggagttca agaccagcct 25740ggccaacatg gtgaaatccc gtctctacta aaaatacaaa aattagctgg gcgtggtggc 25800gggcacctgt aatgcaagct acttgggagc ctgaggcagg agaatcactt aaacccagga 25860ggcagaagtt gcagtgagcc aagatcgcac caccgcactc caacctgggc aacagagcaa 25920gactccattt caaaaaaaaaataaaaggct gaaacaccca tacgtggtac tacttggatg 25980actcctgaaa acgttacagt aaccaaggaa gtcagccacg aagacgcatt gtaagattcc 26040cttcatgcaa aatgcccaga acaggcagaa ccacagaggc agaaagtcga ctggtgttca 26100ccaggggatc cggggagagg gaacgggaag tcaccgtgta atgggtatgg gttttatttt 26160ggggtgatgg aaatctctta taacttgata gaagagaggg ttgtaaacac tgtgaatgta 26220ccaaatgcct gccttctata ctttaatatt ttatattata taagtttcac ctcaatttaa 26280aaaaaaaaca actcgacacc tttcacctag gaaagatctg gctttagctt gcatttcctg 26340taactcctgc ctaaagcctt ccagaagctt ccgctgcctt gtggatcaca accagactcc 26400acaccatgat ctggcctcta agggcctctc gcaggacacc ccgagggtga aggagcaccc 26460gtgggcccac ctctgcatag ctgcaaagct tctttccctg tcctcccctc tacatgggaa 26520gctctgcccg caggggcggg gccttatctg ccattctatc gcactcaacc ctagcacttc 26580actcggtagc agacaccaaa gcaaaacagc aacagcatta taccgggcca ggtgcacgtt 26640aactcactga attcatggta ggaaggattc tattcccatt ttacaggtga gaaaactgag 26700gcacacaaag gtagcatcag cttcctaagc ctcccagcac aggaagcggc caggctggaa 26760tcagaccctg ggcgcagggg ctctgtccac agtgctaact aactactcct gcccccgagg 26820gctgcagcgg tgagtgagtg agtttgtcag tggactggat gtccaaggtc atacaggaaa 26880aatccagact attgtaataa cagcctctag accggctggg gccagaaaga tcgaggacgc 26940tgacacacaa ctgcgctcac tgcagctctg ccagggatgg ggctaaaggt ctcacacagg 27000gcagttaggg ctccccatag cctgggagag gaacggggtg agataacaga aactaggtat 27060ggtgcccgaa gtcaaacagc cactgagcat gtaaacccag gtgggtctga ccccaaaccc 27120ctccaccccc atcagccctg caacccgtcg ctgcaaggga gaaagcaact cagaggcctc 27180acctgcctac atcccccacc cgtgtgtgtg agttctacta aatgcctgag cagtgacaca 27240gcacggctga aattaaacgg gttccaaaaa cgacaggaag cacgaagtga atctccccag 27300gaaagtgctg aacaaatgct ggatcgggtt caccggcgaa tttcttggaa ctgaagaggg 27360gagctaaaca cacggggccc tgctttggag gggactctct cagggtgctc cacacagcac 27420ttggttaacc ccactcagcc cttctgggct ctcccagagg gcccggcctt ggccttgggc 27480atctacagga ggaacctcca gggggagagg gggtgcctgg acaggccggc cctggaacaa 27540gcacttgggc cccgaggaga gaggactagg gcttgggagc tggggaagtt ctcagcactg 27600ggaccactag aacaaagcca tttccgtgcg ttcacagctt ccaattgcaa caggaagcaa 27660tcaggaaaaa taattagcgg cccacttact ggcttcgctg aggtccgagg catgtatttc 27720acacagtaaa accagggata taacatcaaa accgttctgc agaaagattc ctccctttcc 27780ttccatttta ggcctggatc accacattca ctggggctcc caggccttgc tgcctaatgt 27840taaaataatc aactctattt ttgcctcaca cacaactgaa ctctacagct ataattcttt 27900ctcctcaggg gctcgaacca catggacgac aggcatttga ctccagcaac atcaccccaa 27960aacgtgcaca aaacccaaaa ctgcaatgag gtgaaaggca acgcggtcgg cctagaaacc 28020ccccctttaa aacaaacagt ttccccaaaa ccccttttgc ctccttgacc caggcatttc 28080cggaaaaagg agcggcgctg gcctgtactc cccagatact gtcgctgttt tgtcttcacc 28140ttgttttgct agctccagac aaggccccac aatgtaaaca cgctcctgaa agaggcagat 28200ttggggtgaa actgtccata gaatctctag gcttgggtca gaggcaggag gacgtgaaac 28260aaactccaag ctcctcctgt tccccgctgt cccccacacc tccaagcaga ggctgcagcc 28320tgggggatct gactacaggg ccaccccgct gcaccattca cactggaaat attcagggag 28380acagctgttt gccttaagga ggcccagaca aaggggcccg aggtcctccc cgctaaactg 28440ccacaaacag aacaggagcc gcggcgtgca caggcacttg cggccgtgcc acttggccag 28500ccatactcca gaaaaacaaa acacgcacat ccgaagagaa tgatttaggt agcaagaggc 28560ttgcttgaaa aaccacatgg caatctccaa attaaaagaa catgtgtagc gtttcacgac 28620tgcttaagtt tcctgagtcc tcctgacctc aactccaccc cctgggaaac accaaaagtt 28680ggatgagaaa gttcccccgc cctacctctc cccacgggag tgtacaactg aggcacaagc 28740ctgcctcccc cactgccccg cgatctggga ccacgtctcc tccgcgtagc cgacccgggg 28800atggacacta tctggggacc cggcggccac acggggcatt cgggtcgccc gggcacctgg 28860caggtgtcag tccgcttgga aacccacagc cacgcggctc acaggagcag cgccaccggc 28920taggccgccc cgcgcccggg ctcagaactt tctcgctgcc acttcagccc gtcctcggag 28980cacgcggggc ggccgcgcgg ccgctggaaa caggcttgcg aaccggctcc ccgggccagg 29040cccgcctccg cgccccaagt ccccgctcgg tgcccggccc gggccacacg ggcccagcgc 29100gggctcggct cggctcccgg cttcccgcgg gctcgggcag gtgaggaccc gcccgcgccg 29160cacctggcgg agcgggcgcc ctcctcgcca gcccgggacg cagcgtcccc ggggagggcc 29220cgggtgggga gacaaagggc ccgcgcgtgg cggggacgcc ggggacggca gggggatccc 29280gggcgcgcgc cccaactcgc tcccaactcg ccaagtcgct tccgagacgg cggcggcgcc 29340cgcgcacttg gccgcggggc cgcccgggcc attgtccgag caacccgcgg cccgtcttac 29400acgccgggcg cgggaaggta tcgaatcagg 29430<210>8<211>33769<212>DNA<213>人(Homo sapiens)<220><221>未确定<222>(33739),(33749),(33758)<223>在上述位置的核苷酸序列特性未知。<400>8cttcccctta cactggtcct tcgacccgcc tcggatgaaa actgaatggg tttagcctta 60gaggctctcg gtctctaagg gaggtgggtc aggatgccgg ggacagggtc ctcttcctgg 120ggcaacgtgg gggaacgagc cacctacccc tccactgaat tgccctgggg tgtgggtacc 180gacggctcat tcggtgtcca gggtctgaga tgtgttgaca ggaagaatga aaggggatgg 240gagggatggg gcgaaagaag ccacctgcag ccccaggaac tatctggcca gcacaccgtc 300acccagcggc ctgagccacc cctgccagag ccaggaggag accctgccaa tgggtcacca 360gtgtgcagga actcagaagg tcatcacagt taataccctc catgccccaa tgtgggaaaa 420caggtttttt cacaacaaac aagataattt ttgttatttt ggcaaaagga ggcagggcag 480ccccggacac ctccatccca cctcatcacc cagccgcagg gccccggcca tccctgcaga 540cagagtggat gtcacaacct ccctgcaccg aaccaagtgc agctcccagg ccacaggcca 600cccaggaaag gtccagtggc ccccggaggc tcccaccgca ggcctcccac cacagccggc 660accaacccag gatagctgtg ttctcctggc ttcttttcac acgggtagca gaaagctgag 720atccggggaa agctgagatc cagggaaagc tgagaatcgg cctctgctgc ccggacgccc 780acccccagct ctgctcccag ctccagggcc tccttctcag gtgcccttac aggaggcaga 840gggcttgagc cacctcctgg gcctggggca cgcaggatga acggggtcac ggtgcaggcc 900actgtccact gcgcagatcc caaggccata aacagcctgg ccacagtggc ttcccagctg 960gcaggcggcc agattatttt tgttgtttag caattgatta agtttctccg ctgcccccag 1020gggtaagtgg tggggcaaat gccgcaaccg cagcatttga cccgggatcc tgtgccaagt 1080gaccataggg tcacaaagca caagggaagt ggctgggccc gatgctggct ctgctggaac 1140ctgaggccgg ccactgtcac ctgcacggtg cctgggacct tccagcaagc acagagaagc 1200tatggccctc caggagcagc tggcaggcac cttggcctgc agtcaggggc tctgtctgct 1260cagctctaaa acaggaaagt cgctgctctg cctggggtca gggcagccag agagtgacca 1320agtcagtgcc ggcctcagga agggacctgc aggcgggtcc cttcctctcc catccctcgg 1380tgccagccag cccctcctgt ggccccccac tgcctgcctc tgcccccatg ccccaccaca 1440acctcaggcc catggctgca tggccactcc ccaggcaggc agtggggatg ggatttcacc 1500atgttggcca ggctggtctc gaactcctga cctcaggtga ggagttccta aagtgctggg 1560attacaggcg tgagccaccg cgccagccct ccctgtggta ctaaacactc acaccccctt 1620gctggggacc ctggtgaggg aacacagcct cacaagtgaa gtgtggtttt gttgagcaaa 1680tgacgcctgg gcagccctct catctttgcc taaaactgaa gaatttaggg gcgtggatgt 1740ataaaacagt tggtgactta aatgaaaaag aaggccacac tccccccttt aggcaggcgg 1800cctaattctt taaaagccag cacagggtgc ctttctgaac ccaggcacac agtaggtgtt 1860caatggacag cagcggttac ttgtactgct catgacaccc tgtctgtggc ctctgcagct 1920ggctccagcc tgacgcatgg ctgcgcccct ccgcaaggcc accccggtat acatggaaac 1980tctgtggaga aggccttggg ggccggccag gacgccaggc ccagatccca tctgcgccct 2040tcctccatag acctcagcga gctctcggca ccatgtgcct caggcccatt taagaagtag 2100ggccggccag gcatggtggc tcatgcctgt aatcccagca ctttgggagg cccaaggtgg 2160gtggatcacg agatggtcag gagatcgaga ccatcctggc taacacggtg aaaccccatc 2220tctactaaaa atacaaaaaa taagccgagt gtggtggcgg gtgcctatag tccaagctac 2280tcgggaggct gaggcaggat aatcgcttga gctcagcagg cagaggttgc agttagcgga 2340gatcgcgcca ttgcactcca gcctaggtga cagagagaga ctctgtctca attaaaaaaa 2400aaaaaaataa aaaaaagaag cagggccagc cacggacgac ccctcacaca gctcccagga 2460cgcgtgcctg ggtatagggc tcaggaccat gaccgctgca gtggccccca agaaacgtta 2520cttttgtcac ccaccccgcc tcagtggcag tagccaaaat aacggattag aatggaacca 2580tgtgacaatg ccactgcccc aactgacaga agatggctat cagcagttca cgcggcccca 2640cctatcacaa gtgcagggca ctctacaact tatgcatcct tccccagaca ccgtcctttc 2700gaccctccca ggtcagcaag gcacacaggg cctacatttc acagccacac agcagagggc 2760tgaggctgga actcggatgc tctgatttcc gttcaatcac atccccagag gtggcacaga 2820gacggggggc ttctcttgac aaagtcaaga aagtcactgc cagctccact gaagaccaaa 2880gaacctcagc tctcaaaccc tcttgaaggt gttaccgaac tctcccagcc tgtttcctgg 2940gtcccgatgt tggtcccgtg ggacacagga agaggaagaa gctccctaga gcagagcctg 3000gtgcacctgc cacactctca gagggctgcg cacgggcgga ggagccgtgt gcaggagtgg 3060ggtctggatg gaggggcgct gtggccgggg gcagggggca ggggaagggt gctccaggtg 3120gtgggcacag cacgagcagg ggcagggagg tccacactca gatgtgcaca gggagaaaca 3180aatcgtgcat ttccattgga ataggcggta aaaggtagaa aaacagagtg ggggccagga 3240agggagtcgg agccttctag tgtctctctg caggtgagcg gcagcccgag gtgtcagctc 3300agcagacttg gggtccaggg gccgtgtctt ctatcactga ccccagggca cacggaactg 3360gggagggaga gcagaggcac agggcacggt cagtgaaacg aaacaaggag tcatcaccaa 3420atgcggaaag ggcaaggagt gcccgcagcc gcacaagggt tctgtctggg caacgtgggc 3480gtcccaccag gccccgcacc ctgcaagcgc aaagctcgcc actgaagata aagggaagct 3540gttggagctg cggagctggt ctggggtccg catggagctg ggcttatgct gcagtcacaa 3600gggggacatg gaagaggctg caggggacaa aaccagtgac cacagtctaa ctctgagcct 3660gtggaaaggc gcccacagca ttcacccatc ccagagatgc cattccccct gtgcccccgc 3720tccacggtga cagcgttctc caggaatatg atgcgcccct ctcctcttgc atcagccctg 3780acagtgagta ttcaggccaa aaagcagaag agcacagctg cgtggttcca tttccatgta 3840gttctggaac aggcaacgct aatccaaggt gatagaagtc aggagagtgg tggagggggc 3900gggggttgag gatggcaaag gggcaccggg aactttccca gtggtagaaa tgttctctgt 3960ctggaccgtg tggtagttat gcagacatat gcagctgtca aagttaatcc aaatgtacac 4020gttaaaatgt gtgcgtttta ttgcctgcaa gttatacctc aattaaaaaa ataaagttag 4080cactcaggct tcttccacaa cttcctgaac cgtgtgagct gattttcttg ctattaaaaa 4140ttcacggtcc atggctgaga acagcagctg ccttctgttt gcaaagtcaa cgccaatcac 4200tgcccggccg cggcagactc ggccccacag gacctccttt cttttttccc tttgacctac 4260ttccctgata agtgacaaga cagccagact ctgggaacaa acgcccgtta ttcggccccg 4320agctgagcgg gccctgcttc ctgagctaat ccgcccggac agacggaggg acgtgagggg 4380ctttgccgtc ggctccagct gtcagtctgc ccgtcagact cgacagtggc cccctctgtt 4440cctcccgctg cccccactcc atccccgact tctttttgtt tcctgtccct gacagacgaa 4500catctgttaa aactctgtct gggtgagctg tggccagcgg cccacaaatc cccaagccgc 4560accccagcct catctgggcg ctgccgggag cactgcctgg ccaccctctg gacatagctc 4620tgagagccac cggccagggc acgtgtggcc cgagtggcat ggtgcacgcc gctaagccca 4680ctgcccaaag gcccccaagc aggagggatg tgcaggagac aaaagtcaaa agaacagggg 4740cacgttccac agaggatggg gctggagggg tggcagtgag gaacagcagc ttccgaggat 4800ggcggtggca actcccaaat aaggcctcac tcctgctgtt tttagctcat tccacataat 4860tggaaaaaca tggcagaaac cgaagccagc tgctgccttg gtcctggggc tgtgtggagg 4920gggtggggag gccggaggcc caggctctgc actcgactgc tggggatgag agtgactctg 4980agctgcagag agcagcatcg cagccgccat ggtcccattg agccccggcc acgctgggcg 5040gcagaggctc gtgggatata cctgccctgt ctcatggggg tcacttcagg aggggcgggg 5100gagccaggac acagcccagg gctagcggtc accctgcagc tcaggggcca cgtaaatagt 5160gccaccttga aggcacacag cagtgcgggg ccccccccgc caccaacgca tccctacctc 5220taggaggccg cctgtgtgcc cctgggaacg ctgctccctg tcccttgggg tcctggtgtg 5280accaccctct cagccccttc cttggggaag gcacctgact ccctacaccc agctggcttt 5340catttgctca aaatcaggaa aaagcagaat tcaagacatc acagaaatgt cttcgcctgt 5400aactccatga aagataaacg gtcagacacc caggagggag tcccagggac ccttgagtct 5460cacctgaggc tctggcttca aacctcgaga tgtttccagc catgctagcg ccgcccccca 5520caacctgccc cacacagtcc tcccttggga actcacagat ttggccccca cctgccccgt 5580ttcttctggt ggagtgggtg cgttgggttg gggtggggct ggggactctg gatgtgtctt 5640aagagtctga gtgattctga cacagccagg ccctgccccc ctcctgacct tcgccccaca 5700ggaaagggag ccacacgcct gaagcgccca gcacaccccc ctccgtcctc cccaggtcac 5760ccgctggccg tgtgagccgt gctccccact gccccttcac ccaccccagc tcctcctggc 5820agcacccagc cttggaagct acttctgatt acaaccgccg aaggaagact cgctccctcg 5880gcactgaccc agacagcctg caccatcacg ctgctcagca caacccacac agccttcctc 5940caaaccccat ggagcgggga gtataatcac cccctttcta ccaacggaca aactgaagca 6000cagagaggtt aagtcacttt cctaagctcc caacacgatg acaaaaaata gaaggtcagc 6060ccgcaagtgg aactaggtgc tccaagtccc cggtctgcct gacactgcac ctcctcgccg 6120ccacggtccc gggtccgcct gacactgcac ctcctcgccg ccacggtccc gggtccgcct 6180gacactgcac ctcctcgccg ccacggtccc gggtccgcct gacactgcac ctcctcgccg 6240ccacggtccc gggtccgcct gacactgcac ctcctcgccg ccacggtccc gggtccgcct 6300gacactgcac ctcctcgccg ccacggtccc gggtccgcct gacactgcac ctcctcgccg 6360ccacggtccc gggtctgcct gacactgcac ctcctcaaca ccaccacggt cccgggtctg 6420cctgacactg cacctcctca ccaccaccac agtcccgggt ctgcctgaca ctgcatttcc 6480tcatcaccac agtcccgggt ctgcctgaca ctgcatttcc tcatcaccac ggtcccgggt 6540ctgcctgaca ctgcacctcc tcaccgccac ggtcccgggt ctgcctgaca ctgcactttc 6600tcaacaccac tccttggccg gctcccaact acaaaccaag ccatgtcttc catcctgaat 6660cctcttggcc taaacatcac tcacaatgcc tccctcggga acaggcacaa gtcccaccag 6720cacagcctcc ttcgttacct gcgtttccgc tagcccaggg ccagctccag agccctcacc 6780acagagcctc tatccttcac ccccggacac tggacctcac caacccatag cctggaggag 6840atccctgtgt gaccccaggg cctcctctgc ccgactctga atttcactgc ccaacgtgac 6900acctcggaag gctctctggg cactggcagc cctccatggg caccgctcct tctggccagc 6960tctgacatcc cggctggtga ggtgccctgc acgaggcctc tgcccactgg gacctcacag 7020ccgtgctgtc agctgcaaca agcgacagaa tttcacgttt tcttcacgtt gcccctgggt 7080gagcagctcc aggtagtttt cagtcgaggc gaggcgtccc gtcagcagcc aggcggcaca 7140gctaattcat gcccgccggg cgcacggccg caataccaat gggcacctgc agcctggaaa 7200gccacagagg aaccgagaac agcgactgtg ctcaggtgac aggactgtgg tcttttaaca 7260aaacattttc ctttaacgtg atattttacg gcaaggaatg aaacctggag ggcaggacat 7320ttggatacta aagccccagg ctgccgcgtg gtctgctttg tgaagtctga agcccgcgcc 7380ccattctggc cccgctcaca ggtccggctc tgactcacca gcttcaatgc taggccgtgc 7440ctgtcctcca accagaacat gacttcctta aggacaaagc cgtttctcgc ccatccccat 7500ctccctctgg attaagaaat atgggaagat cttctagaac cacctcaaat ttgcagagag 7560ccatcctggt gacaaaccct tgaaatgctt ctaagaagag tttaggtttc ttctcaactc 7620taaaacctct agaaaactct atttccacac cagctgcccc tggaacactt cagcttcaaa 7680agggcccagg gcagggagac ggaggagcca gcatccacac cgagcaccag cctgttaatt 7740aacgggaagc gggtggggcc catctccagg cagctctgag gtcagactgg ggaaccatgc 7800ttacaaaaaa aagtgaactg aaacgctcac gtcctcatgc aaaaccagac tcccagttgc 7860atctttctgt ctcattgagg agctttttcc tccctttgac agaacaccct acacacggca 7920tctggaacca aagcagaaag attcaggctc agagtaaaac agtccccaca ctggctgcat 7980gtggacgttc ccggcccaga gtctcgccca agcagggcct ataaatgaca caaaatgttt 8040ttctcctgcg tgccagtcat gctccaactg agttatgtgt aaaagtgcct ctcacggctg 8100agggcaaaaa cagttcccac aagactagag aaaggtgacc cctgacggct gagtctctag 8160ggagcgtgga gctgcgtgct cagccctgcg gccctgacgg ctctggaatg gaaaagctat 8220ccaactggaa gggcagggct cgctgctagt ccagcggtcc aaccccacag gtgtctgtgg 8280tgtcagctcc atgccacaga gcccagggct ggggccagag ccaccaggcc ccctgccagc 8340ctgcaggggc ctcctcctct gggtagccta accaccccct gtgagcgcag gcagcctcct 8400ctaatcacca cagggcctgt ccccccctct cccccgcttg caggaaaatg agccctgagg 8460actccccagg gctgctctgg gcctggacat ggagactggg aattacattt gcagaaggag 8520cgcaatgccc ttgaagggct cagccacgag cagccagtcc ccagggctca gaaggcccag 8580ctgttagaac cctgggagcc agcaaagagc caggggctcc acctaagtct atagcccctg 8640cctcttctgg ttgggaaaga aatcaacgcc cctttactgg ctcccactga cagcccactc 8700ccccaggtat gggaggattc tgggacgatg caggcaaacc tggaccctga gtgaacctgc 8760cccagctctc acgggcctgg caccagccac agcacctaag gcgccggtca tggtgacaac 8820atgaaggtga taagggcatg gacagtggac atggcagctg gacactgggc acccactgga 8880tgccaggcac ccagcacggc tccgtcaccc ctggatgagc agtggccctt tgcaagccag 8940ggtagcctgg gcaagttatt tgggggtctc caagcttgtc cagctgtgcg acttcactga 9000gccatgagtc tgggatttta tcagggccca cacccgttcc tggaactctg atacgtgagg 9060gagccacaca gggaccctta acaaaagctc ccagggcaac atgttctctt gcctcagtct 9120cccaaatagc tgggattaca ggcgcacgac taccgcccgg ctaatttttg tatttttagt 9180agagacaggg tttcaccatg ttggccaggc tggtcttgaa cccctgacct caaatgatcc 9240ttccactgtt agggcaaggc acctgacagg cacgactgca cgatctgctt gttgggggct 9300gtgtccattc cccactcctt cgacaaatgt ccacacccag ccttgctttg acaccccaag 9360aacagagatg gtgacacctg cttcctacat gcccattgct ctcccaaggc agacatcccc 9420agcagatgca acacagtgtt taggcagaca tcaccaatcg atggtggcaa cagacaccag 9480gccctgctcc ctctaactcc agtggccagg ccccaagcca gctctcacct gcccactccc 9540aacccacagc agcaagactc agaaatggca aaaacacaaa gagaacagaa acgccccata 9600gcgggaggat gactaaaaga catgtcttga taagatattg ttcaggcata ggccaggcac 9660agtggctcat gcctgtgatc ctagaacttt aggaggctga ggtaggtgga tcacctgagg 9720ttaggagttc aagaccagcc tagccaacat ggtgaaaccc catctctact aaacatacaa 9780aaattagcca gacatagtag cgggcgcctg taatcccagc tgcttgggag gctgaggcag 9840gagaattgct tgaacctggg aggtggaagc tgctgtgagc cactgtactc caacctggac 9900aacagagcaa gactctgtct caaaaaaaaa aaaaaaaaaa gatatccttc actaaaactc 9960atgtctttga tacatattta cctcctgcaa tcgcaaatgc ttctgcagtg cataaagtga 10020aataaatagc aggaagcctt acggttcgat cacccacaca gacacacagt cacatacagg 10080aaaaacgcag ggagggctgg ggaacaaaaa aacagaagat aaaatgtgga gacagacaca 10140ccaagagagt aagagaccac ctccagacct cccttcagct tctcaaacac acgagccggg 10200cccgttacag aatttgcggg gaccgctgca aaatggaagt gcagacagcc ccttactcaa 10260aaggtaggaa tttcaggtca acaacagagc tcacctcata tgactacaca ggtcacacag 10320cccgtgaagt cggtcccaac accagcatgc tcctgcctca aagccgctgc acgtgctgtt 10380ccttctcgcc tttccctctt ttagtccttc agatctcagg cctcctgaga gagacctctg 10440acctgccggc tcaggcggcc acacccccag tacaggagtc tccggctcag cccctgctgt 10500gttccgtacc cgatccaggt ctgtcctatg tccatctgtg tgccggcttg cttcctgaca 10560tggcccccac cacacgtgtg cctcggggca ggggaacagg cccgtctcat taactgcttt 10620cttctcagat attttctgga atatttgtgg atattgggca acatatatgc tccacctttt 10680tcagactagc caggacgagc tgcatttttt tttttttttt tttgagacag ggtctcactc 10740tgttgcccag gctggagtat agcggcatga tcttggctca gtgcaacctc cgcctcctag 10800gctcaagcaa ttctcctgcc tcagtctccc aagtagctgg gattacaggc ccgtgccact 10860actgcccagc taatttttat atttttagta gagatggagt ttcaccatgt tggccaggct 10920ggtcttgaac tcctgacctc aaatgatcca cctgccttgg actcccaaat tgttgggatt 10980acaggcgtga gccactgcgc ccggcccgag ctgcctgttt tacacctttg ccatattccg 11040gtgattctct ctcccctccg tcccccggcc ctgactgtgg tggccactcc ctgccgtcat 11100gagcccgtat gtcctcactc tttccctttc cgccaggact tcaaccaaca ctgcagagcg 11160cagggtccag ctccagcact gagttcagcc tcttctcacc aacagacagg caggaaagaa 11220aacaaactct gagaaggcca aggttcccgg gcagccagca agccaagcat ccttctccgc 11280tgaggcttgt gcagccgagg caccccctcc tccagggagc aggcagcgtc ctggggcagt 11340ctgcgaggga gaccagggcc cttgctccac cagggcccca ggtatggggg cagcagcaaa 11400ctcatggctc tgggagccag accccacctg ctagaaccta ctatgccacc tgctgtgggc 11460aaccccaggc tggtgacttg ccctggcctc ctctgtaaac aaagggctca tccaacctgg 11520tcaaaccact cctccccttc aagggtctat aatcctccct taacctgctt ggtccaaacc 11580cctggtgtcg ccaggtcact caggaggcag ctcatctgga ctccttccct gggtccagtt 11640tctctctcaa cattgccttt gaggccgagg tgaacggtca acagcgaagg gccccagagg 11700tgatggagga gcgggtgtcc aagacactca ccctttctaa tgcactgact ccctcgtgga 11760ctcacttgtg ccgtctcccc cacccaccca gccccagagc ccagagtgcg agcgccagag 11820gcccgggatt ctgtctgcac cgcggggtcc ccagtgcctc ggagcaatgc cagcacccgg 11880caagtgttcg acaaatgcct gctgaatgag caaatggatg gatgaacgaa tgaatgagca 11940agcagatgaa tgaatggggt gctgtccaga gccgtgagga ctaggccgcc caagtcccca 12000tttctcaaat tctccttctc ccgacttggg aaacaagatg cttggtcggg gaggctctcc 12060aaccatcccc tgcagcagcc ggcacagcgg acagaccctt tgatgtaaca gccatgtctt 12120cattaaagat gccctgctct cagaaagaga aagacaaata caaacctgga aaatcctcac 12180caaacgcagg acccctgcca gggagcagag aaaagaccca cacgccacgg gcgccacgac 12240cacacacaca ccccagccgc tgcacacaaa cacagaccct agccagcaag aacaggggga 12300ccaggaaact gttcctaaag tcaggacccc catgtgctca gacagcagtg agagcaagga 12360cacttctcca tccaccggat gccaggagag tccttttagg gggccccaca ccgagactct 12420gcccttagga ctgttcctga gtgtggaagc cagcccactt ggaagccccc tgccctcccg 12480agtgggacac cggcacagga agcaggccct gtcccccacc actttctgca agctgggccc 12540catcacgcta cagaaacggg gaggactggt cccagggatg gcgctttcct gacacctctc 12600gttaccccct cgcttgccag gccccagggt cagccccaga ggccagactg gctatcccag 12660gcccgggagc atccccgaag gcgagctgca tcctgaacgt gtgtgatttc ccgaagggcc 12720cgccccgaac cgacacctgg aaagaaagat cctcagccgg tgccccagag gagaagagcc 12780atgcctcact gcaacacagt cccaggaagc accaagtgcc tgaggaccaa ggcggagagt 12840aaaaaagtgg aaaatatctg gggcaaaaat aaaacaaaac aaaacaggat tgacctcctg 12900ggctcaagca atcctcccaa ctcagcttcc cgagtagctg ggaccacaga cttgaatcac 12960cacacccgcc aagtggatca tttcgaacgg gtttgccgag gttccttctg gggcaccccc 13020ggcggccgca acccattccc gccaggcccc gccccgcccg cccgccccgt cccgtcccac 13080cgcctcacct gccttacacg tcctgccgtt gtcctgcagc tgcacacccg tggggcaggc 13140gcatgtgtag aaaggctcgc ttggggacag caggcacagg tgggagcagc cgccattgtc 13200ctcctcacag cgagtgtgga ctgagaaaac caggacagac tgagagaagg ttccagaaga 13260ggaccgtcac ttgtttctga atgagtcaca tcctgcctcg tcccccgtga cagcctccag 13320tgtgtccctc tgcccaaaca tcggcctcaa gtggcatcag ggacctcccc gcgggcacca 13380ttccacctgc ctcatcgctg gccccgtcca catggggccc tcagcctggc cagacggcct 13440gcaatttccc caaaaccagc cgtgaccttc ctggccaccc tcacacccag atgtgacctg 13500cccatggagt gacatcctcc ccatctgctt cctcccacca agctcctatg actagaacac 13560cctccccagc tcctcggagc ccccaaagga cacccctctg caaaggctgc cccccacgct 13620ccaatggccg gggtcaggac ctgcctgtgt ggtagtgacg ggaaccccag agacaatggg 13680ctcctgggca aaaggcttgt cttgtctttg tgctatgtgt ggacccagca gcttccatag 13740gaacactgtc cttcttgctg ggatggccaa gcttgtcact ctcccaagcc ctcctatgac 13800caacagcaat tgaacggaac tcgataaatg cttccagcac ctcattcaaa ccaggggaaa 13860gctgggtgta gcagccccaa aatacggata taactggaac aacaaactca tcaaaatgaa 13920cctctccctc cctcatgctg ccccaagtgt agatgggttt tgtgaccacg actttctcac 13980caggaaacag ctccagagag ccccaccctc ctgtgtcctg ctctgggaac agctggcacc 14040cctaggcccc acatttcaat tcaaagtcca aaccttccat aatggcctgg ccagaaatct 14100ccatccctgg tccctgtggg agtgggccac tgtccccaga gccgcagccc cactgtcaca 14160gaagctggtg catttcccca tcagggacct ctgtcacaac ccagcgtggc ccccaggctg 14220agaactgctg attctgggca gattattcat tgataaatac gcgacttgca gggccaagca 14280tggtggctca tacctgtgac cccagcactt tgggaagtca aggtgtgagg atcactggag 14340cccacgagtt tgagacaagc ctgggcaacg tggcaaaatc tctcatctct attaaaaata 14400catacacaca cacacacaca cacacacaca cacatatata tgtatatata aataaccata 14460tatatatata cacacatacg tgtatgtgta tataaataca tatacacaca cacacagaca 14520acttcttctg ggccttgaaa acgaggcaac cttccttgga aatccccttg ccactgctga 14580gcctgaaata gcccccatga gctctgcaga ggggtcctct gcaggcccgt gtcccccagc 14640cagccacaca cctccctcca ttgcagcagg taccccttta gagagggggc cccccagagc 14700atgggcttct gcagggaggg gtcacctgcc cccccacccc acccacgccc gcgcaccccc 14760acgcccccgc atcctcccac tcccctgccc cgcgcccccg ctccccccag ccccctcacc 14820ctctcccccg tgccccaacc ggcactcaca aaaaggctgc cgctcctggc tcagcacctg 14880gatgtccatg ggtgagtata gggcactcag gatctccttc ctcttccccc cagtgcgctt 14940gttgcaggca tggatggagc gggtctgcca gtctgtccag tacagagtgt ccccggagag 15000cgtcagggcg aaggggtgcg tcaggctgcc ctccaccacc ttctgcctgc agtcagggaa 15060gcggggtgga ggagccatca ggagggtccc ccgacagtca ttgctgctga cccaattaat 15120ttcttttttt ttttttgaga tggagtctcg gtctgtcgcc caggctggag tgcagtgatg 15180taatctcagc tcactgcaac ctccgcctcc cgggttcaag caattatcct gcctcagcct 15240cccgagtagc tgggatcact gatgcccacc actacgccca gatgattttt gtatttttag 15300tagagacagg gtttcatcat gttggcaagg ctggtctcga actcctgacc tcaggtgatc 15360cacccacctc agcctctcaa agcgctggga ttacaggcgt gcgccaccat gccaggcttc 15420ccatttgctt tcaaccagac aagtgaggcc aggtcaagag ccccaggagc tggcgccctc 15480gtacatttct cccggcgtgc acagggcacc tcccaaacac agcctgtgat ggtgacacac 15540gggctccccc aggtcaagtg gcaaagtctc ccccagggaa gaaaggagga agccatgcct 15600ggcaaaaagc acacctctcc tgcccaacgc tttaacctct gtatacaaat caggccatgt 15660gcactcgctc cttcttacaa tgctcataat ttatactttc agagtaaatg aaacttggca 15720tcaacccgag aaacagctat tcttttctag atgcttacag tgcccagcaa atgaggactc 15780gggtgtaatg agattatgga cactggaaac aggatcataa tgtgacgtgg tcggtaatgt 15840gcagttttat ttgcttaatg accctcgccc cgtgacaggc tccctgaggg tgggcctggg 15900ggcagaggtc cccgccacgt ccccagccct cagcacagtt gccaggagag ggtgacactc 15960atgaagtggc acagggaaga tgggagctgt gggctctgca gatccaccac ctcttctgtt 16020catttttgtt gatgctgttt tttaagaaaa ttattgaagt aaaattcaca ggacatacgt 16080ttactttttt tttttttttt ggagatgggg tctcactctg tcacccaggt tggagtgcag 16140tggtgtgatc tcagctcact gcaacctctg cctcccaggt tcaagcgatt ctcccacctc 16200cgcctccaga gtagctggga ccacaggcgt gcaccaccac acccagctaa tttttggggg 16260gtatcttttt ggtagagaca gggtttcgcc atgttgccca aggctggtct tgaagccctg 16320agctcaggcg atccacccgc cttggcctct caaagtgctg ggattacagg cataagccac 16380tgcacccagc ctaaatttac cactttaaag tgaatagtgt tacctagtgc attcgcaagg 16440cggtgcagcc tccacttctg tctagttcca aagcacttcc attgccccac aggcaaaccc 16500cacacccggc agcagtcatg ccccagtccc cgcccccagc cccggcaaac acttttgatg 16560gacttaacta cacacattct caacatctca tataaacgga atcacaatat acagcctctg 16620atgtctgtct tctttgactt ggcaccatgt tttcgaggtt catccaggct gtagcatgtc 16680agtgcttcat cccgttttag gggtgaacca tattccagtg tgcagacaga aaccaatctg 16740tgcatccatt cacccactgg gggacctttg tgtcatttcc accctcggct gttgtgcaca 16800gtgctgctac ggacattact gtccattcac attttgtgtg aagacctgtt ttcgattctt 16860aagagtatac agctaggagc ggaattgctg ggtcatacgt aaatcaatgt ttacgtctca 16920aggaatcaac aaactgtttt ccacaatgtt gtcttttttg tttgttttct gagacagggt 16980cttgctctgt cacccaggct ggagtgcggt ggtgtgatca tggctcactg cagcctcaat 17040ctcctaagct caatccatcc tcctgcctca gcctcctgag tagctgggaa cacaggtatg 17100taccaccatg gccagctaat tttctaattt tatttttttt tgtttttgtt tttttgagac 17160agagtctcgc tctgtcgccc aggctggagt gcagtggtgc catctcagct cactgcaagc 17220tctgcctccc gggttcacac cattctcctg cctcagcctc ccgagtggct gggactatag 17280tcaccggcca ccacgcctgg ctaatttttt tgtattttta gtagagatgg ggtttcaccg 17340tgttacccag gatggtctcg atctcctaac ttcatgatcc acctgccttg gcctcccaaa 17400gttctgggat tacaggcgtg agccaccacg cccgacctta cttttaattt tttaatttta 17460ttattttatt ttattttttt tttttttgag acagagtctc gctctgtagc ccaggctgga 17520gtgcagtggc gggatctcag ctcactgcaa gctccacctc ccaggttcac gccattctcc 17580tgcctcagcc tcccgagtag ctgggactac aggtgcccac cacgatgccc ggctaatttt 17640ttgtattttt agtagagaca gggtttcact gtgttagcca ggatgatctc aatctcctga 17700cctcgtgatc cgcccgtctc agcctcccaa agtgctggga ttacaggcgt gagccaccgc 17760gcccagcctt tttttttttt tttttttttt ttttgagata gagtcttgct ctgtcgccca 17820ggctggagtg cagtggcggg atctcagctc actgcaagct ccgcctccca ggttcacgcc 17880attctcctgc ctcagcctcc cgagtagctg ggactacagg cacccaccac cacacctggc 17940taatgttttg tatttttagt agagacgagg tttcaccgtg ttagccagga tggtctcgat 18000ctcctgacct cgtaatccgc ccgcctcggc ctcccaaagt gctgggatta cacgcgtaag 18060ccatggcgcc cagcccatgt ggccattttt cagtgagaga agccagaggc ccatcactct 18120cggttgctcc ctgggccatg ctctgcctca gccagaagca ctgagggaag gtcagcctcg 18180gcccttgccc cagccacagt cacagataaa ggggcctgca caggtctgtg tggctccaga 18240gctcgtcacc caacacacga cgcttccatg tgaatagccc caggtgcatc atgaagagcg 18300atggccgctg cagaggcaga agaatcccgc ggggaagcag gtgggagaga ggctgagaac 18360agaccagacc ctggagctac agaccctatg ttccaaccct ggctgggact agctgtgtgg 18420ctctgggcaa attcacatgc ttctctgtgc acaggggatc aaaatagcaa acacaggcta 18480ggcacagtgg ttcacaccta taatcccagt gctttgagag gccgaggtgg acacatggct 18540taagctcagg agtttgagac cagcctgggc aacatggtga aacctcgtct ctacaaaaaa 18600aataccaaat aaattagcca ggcgtggtgg tacgtgcctg tggtctcagc tacttggaag 18660gctgaggcgg gaggaacact tgagcccaag aagtcaaggc tgtggccgcg tgtggtggct 18720cacgcctgta atcccagcac tttgagaggc tcaggtgggt ggatcacttg tgatcaggag 18780ttcaagacca gcctggccaa catggtgaaa ccccgtccct actaaaaaaa tacaacaatt 18840tgccaggcgt ggtggcgggc acctgtaatc ccagctactt gggaggctga ggcaggagaa 18900tagttagaac ttgggaggtg gaggttgtag ttagccaaga tggtgccgct gcactccagc 18960cagggggaca gagcaagact ccatcccaaa aaaaaaaaaa acaaacaaac aaacaaaaaa 19020agaggtcaag gctgcagtga accatgattg tgccaatgca ctccagcctg ggtgacaaag 19080tgagaccctg cctcaaaaca ataaaaatat aaataaaaat aaaacataat agcaaacgtt 19140tcatagaggt ggtatgagca ttaaatgaac tgataaacgt ccctggaaaa cagtaagtgc 19200tatggaagga ttcgctgccg ccaccgccac caccattagc atgtttcaac ctccatcacc 19260ctcactgtcc cctgtcacca tcctttgacc agggcactcc cagctgcagc ctttctatcc 19320tcttgtccac ccttcataac tgtaagatca ctcagctccc aagaaccaca gtctacaggg 19380taaccacatt tccaaatctc aaaccagacc cgctggtctg cacttccagg gacaacagga 19440tattttcaaa ccagcccaaa agagatgtgt ggctcagcat aagaggaaca ggagaaactg 19500aggcctcttg ccctgagaat gagcttggaa gtggatgtcc cggcctcact caaaccttca 19560gatgactgag gcccagccag gagcttgagt gtaccctcag gtcataccct gagccagaag 19620cacccagcta atccactcct catcactgac tccctcccca taaaaaacct gtttgctgtt 19680tcaggctgtt aagttgtggg ctgttttgtt acacagcaat ggataactaa cacacgaggc 19740ctggcaagtg tggagcaaag ctgcccaagc cctcaagtct gttcatgtgg gtgttggcct 19800gtgtttgcag aaatccagcc actgagtcct cccatgcagt cactactgcc ctctgcacag 19860acacctgcca catccctgcc tgggccagga gctccactag tgcaggaatg gggtctgccg 19920tcccaggagg atccctgaca cctagcacag ggctagcagc aggcagcact tggttagtga 19980ataaactgcc cttcacctgt acacagaagg gatgtttcta taaggggtaa ttaagtacag 20040agctgggaag ctatgctgac cagaaggctc taaaagcaat taaccaacga ggggaaaacc 20100cttcctactc attctcggcc cattttattg agcactgacc atgtggaagg ccccctggtg 20160agactgggga atgcaccaat aactgagaca gcttccggct gttgccctca ggatgcctga 20220gctgggatag ggccagggtg ggggtggtgc gtgtgacagg gttactgttc acaaccctgc 20280cgggccataa gccctcccca acaattccaa aatccaaaac gctctgaaga tggaaagctt 20340ttgttgctca tctggtgaca aaacctcatt tggtgcatgg gccgggtgcg gtggctcacg 20400cctgtaatcc cagcactctg ggagccgagg ggaaggatcc cttgagctta ggagtttgag 20460accagcctga gcaacatgtg agaccccgtc tctaccaaaa atacaaaaat tagccaggtg 20520tggtggcgca ctcctgtagt cccagctact cgggaggctg aggcgggagg atcgcttgag 20580cctgggaggt gggggctgca gtgagctgag attatgacat tgcactccag cctgggtgaa 20640agagtgagac tctgtctcaa aaaaacaaag ttaaaaaaaa aaaaactgtg catgggtgtg 20700ggctacagat agtcttttct gccctactta gaatgaacgt gccacatttg ctatagaaat 20760attcaagggc tggtggcaaa tgccacacag accctgacgc tgttccaagt tctgagaagt 20820cctgcattcc tcagggcccc agagtttcag agaagagtct gtaggcctga gttaagaagg 20880aacgccttca aaagccctgg ggacaaaggg gaaaggggtg ccccaggact gcgtgggtac 20940ctaccggaac gagccgtcca ggttggcacg gtggatgaag ctgagcttgg cgtcagccca 21000gtagagcttc tgctcctcca ggtcgatggt cagtccattg ggccagtaaa tgtccgagtc 21060cacaatgatc ttccgggtgc tgccatccat ccctgcccgc tcaatccggg gcgtctcacc 21120ccagtctgtc cagtacatgt acctgtgacg ggggcagggc aagagaagca gctaacacag 21180atctgttttt tgtttttgtc tgcatagatg cagacatgaa acaacagaca gtgaacttgc 21240cctaaaatct cacccatcgg aaataaccaa caggtatggt ttcaggtatt cctgccttaa 21300gctgggcaat caaaatatac tatttccaac ttgttctcag ttaacagtaa attctgggca 21360ccttcccttc ttgtggatag aaagattcct tgttcttttg atgattgcct agtgtactct 21420gctgtaagtt ttttaaagaa cttcaggtta tttctgattt ttttgctacc atgaaaatgc 21480tgtaaatgaa cctctaaaag gcaattcaaa acactcagga tggaatatta tttagtggta 21540taaagaaatg agctatcggc tgggcccagt ggctcacacc tctaatccca gcactttggg 21600aggccaaggc gggtggatca cgaggtcggg agatcaagac catcctggct aacacagtga 21660aaccccgtcc ctactaaaaa tacaaaacat tagccaggcg tggtagtgag cacctgtagt 21720cccagctact taggaggctg aggcaggaga atcatttgaa cccgggaggg ggaggttgca 21780gtgagcagaa atcgcaccat tgcactccat cctgggcgac agagcgagac tccatctcaa 21840aaaaaaaaaa aagaaaagaa aagaaatgat ctatcaagcc atgaaaagac atggaggaaa 21900cttaaatgca tgttagtagg tgaaagagcc aatctgtatg agtccagttc taaacactct 21960ggaaaaagca aatacacaga gacagtaaag catcagtggt tgccaggagt tggagaggag 22020agggatgaat gagtggagca cagaaaatca gggcagtgga actatcctgt atgacatgga 22080atggtgggtg catgtcctta ctcatctgtc taaaccaaga atgtacaaat caagggcgaa 22140ccctcgtgta aacgtggatt ttgggtgatg gtgcgtcagc cagctttcat cagttgtaac 22200aaatgtacca ccctgcacag gatgctgaca gttgggaagg ctgtgtgggt gtgaggacag 22260ggatgtatag gaactcagta cctgctgctc atcaattttg ctgtgaacct acaactgttt 22320gaaaaaatta agtctattta aaaacaacaa aacatggcca ggcacgatgg cttgcacctg 22380taattccagt acttcgggag gctgaggtgg gtgggtcact tgagccaccc tgggcaacat 22440ggcaaaatcc cacctctaca aaaaataaaa attaaaaaaa agttagctgg gcatggtggc 22500acactcttgt agtcccagct acttgggagg ctgacgtggg aggatccctt cagccctggg 22560aggtcgaggc tgcagtgagc tgtgactgta ccactgcact ccagcctgga tgacagagtg 22620agaccctgcc taaaaaaaaa aaaaaaaagg ctgggtgcgg tggctcatgc ctgtaattcc 22680agcgctttgg gaggccgaga tgggcggatc acgaggtcag gagatcgaga ccatcctggc 22740taacacggtg aaaccccgtc tctactaaaa gtacaaaaaa aaaaattagc cgggcatggt 22800ggcggacacc tgtagtcaca gctactcggg aggctgaggc aggagaatgg cgtgaacccg 22860ggaggcggag cttgcagtga gccaagatca caccactgca ctctcagcct gggagacagc 22920aacactccgt ctcaaaaaaa aaagaataaa acccatggct gggatggacc ctgaacctgc 22980agctgcagct gttcctgggt aggtctgtgg gcgacgtggc tttgcttctc catgttccca 23040agagacaagc atcacccatc catgagaaac aagcacatcc tcagggcgcc cttacgtgat 23100ctctggccaa tgaaccaaga caaagtgagc agacaccagg tctgggatgg caggtcccac 23160ccccaccagt gcccagtgtg ccctgtttgg aggtgaccac agggtgtgtg cccagaggct 23220gggcgtgact ctcagcggag accagagggg aaccacacca gcttggagga ctcagttccc 23280atcccagcca gctgggatga gccacaggac acaagggctg gcagacctat tgtgttttgt 23340ccacccttca cagcagagaa aggggacagt gcccagaatg tcctctgagg agcctcctcc 23400cactcttggt ccttgtaaaa tggtgctgac tcccttgctc ccttcttcct ggggtgggcg 23460gcaaacccca ttcccctcag ccttagcaag tgatttagaa acaggcagct cgcccaagcc 23520aggcatgaga gtgatcccgg gacacaggga gaacaagccc cgctttgccc tctgggggtc 23580tccattcagc agaagaggca aatgacagac acacagccgc ctcctccccc accatggtgc 23640tctgcagcct caggagcctc aggtgcacca agggccaccc catccagggg gccatgcttc 23700cttgagtggt atcgttcctg agcgagtacc atctccacct tccagagggg ctgtgacaag 23760atcaacaaga atgagggcat aggagcctcg aaccaaacat gccctcttcc ctgcagaggc 23820tgactgcgcc cagctgctat caccaagccc ctgctcctcc ggccccgtgg ggacagggta 23880agaggggtgt cacatggaac agctctccaa acagtccctc tcaagctgct gtctcctgtg 23940catctagtga gaacccaacc aacaaaggga aggtgggaat tgctattccc attaggcaga 24000tgagaaaact gaggccccga aaggctggcc tgttccaggt tacaggcgct gagcggctgc 24060tctgggaaca cacttggtgt ctgctgaggg cccgagcccg gccatcatat gactcaccct 24120tcgccagcaa agcccgggtg tgggtgaact tttcctggca gcctgggact ccaaggtgct 24180ggcagccagc ccagggaagg ctcccgcgtg cctgcggcag acgccttgct ttacctgcac 24240gtccccaccc ctaggagcct ggacagagcc cagaccctcc gccacctcct gagaaggtat 24300caggggcatc agtctggact tgggggggaa tccacacagg ccttccccaa atgctccacc 24360gtggcccatg gaaaaggctg gaaaacgtgc aggagcagga gcctccgcat ggagcataat 24420tcacattcct tccccgagtt tcataacaga ggcctgctgg tttccttaaa tggggaattt 24480gcgagccagt cggtgaccag agactggttg gcgtggacgt gctcttgcag agtctcaaac 24540gctaccacaa gcccagccaa attccacgga ggaaaatcga cttccgaaga aaagagctgc 24600agcatggcct tcgtgcagag ccagctgcgg ttgtggttgt gtgttatttt agggaagggc 24660cattttgcat tttaaagagg gggttgggtt tcaccctggc tttaatttga gacccggggg 24720ccactgcagc cccttgtcag gctggtacag gccggggact cctcccatgc taagccagtg 24780tctttctggc cccagatcct caggggccag agggtcatcc ccagagcccg ctctgccacc 24840cacatgggta ccctgggcct gggagggatg tgccttccct caaccctgcc tggatgtccg 24900cacggggcca cctgcattgc tgaaactgca acgaagtcga gtctcaggag gggcccccct 24960ggctgcaggg ctcttgatcc ttttggccac gtgcacactg aggtggacgc tcggacccag 25020agaccccctt catgatgatg gccggggcag gaaccccctc ctctgaggaa ggaccctggt 25080gggggacagc actgcaggag ggcacaggag atgacggggg ctctagcagg gccgggagga 25140aggccaagat gctcctcgca accgtgtgcc tgtggccagg acagaggaca aacccaccct 25200ccactgtccc cactctcagg acagcagtcc tgccccagga ctcagcgccc acacttatgc 25260ctgaggacca ctattcaagt cagtatttgg cgagcagggg ttgctgccgc gggcgctgtg 25320acaggctgga atcctctccc tctccctctc cctctccgga gacatggagc ctacagggac 25380agagtcagca cctcagggta ggaccatggc tggcgtcatc agcatcactg gatctgatga 25440gtgggagccg gcatctcact gttttcactc tctcattcaa atgactggag caaagggaag 25500gtgtggggag aggcccagga atcaacacta aggtcaactt tgcccccagg ggcaggggtg 25560ggagtgaaca gccacaggtg tgatcctggg gagggcttct gggagagaat tcagaggcaa 25620gcatgtagag gaaccatttc aaatagttaa gaaaagccag agccaaacag ggacagttgg 25680ctcgcagaga tgatgcaggc aaagccagct cagatctgag catgggaaag actactccca 25740accaagggcc cagcatctcc caaccaagca ccaagtacct cccaaccaaa tgccaagcac 25800ctcccaatca aatacctccc aaccaagcac ctagcacctc tcaactggac accaactact 25860cccaaccagg caccaagtac ctcccaacca agtgccaagc acctcccaac caagtaccaa 25920ttacctccca accaagcgcc tagcacctcc caactgagca tcatgcacct cccaacagag 25980catctagcac ctcccaactg atcacctccc aacctagcac cgagcacctc ccaaccaagt 26040gcagagcacc tcccaaccaa gtgccaagca cctcccaatc aaatacctcc caaccaagca 26100cctagcacct ctcaactgga caccaacaac tcccaaccaa gcgccaagca cctcctaaca 26160aagtaccaat caccttccaa ccgagcacct agcacctccc aactgagcat catgcacctc 26220ccaacaaatc acctagcacc tcccgactga tcacctccca acctagcact gagcacttcc 26280caaccaacat agcaaaagcc ataaagaagt aaaaagacaa aaccacgtag gcatggagac 26340tggacttctg gtggcgagga aagggcattt ttattataac gacagctaac atttgttgaa 26400ctcacaaact gttcttggtg ttttcctcat gacatgcagc atggtcacgc ctctgtacag 26460acaaggatac tgaggcacag agtggcaccg tgccaacctt gtctcatctt tttatcgaac 26520ctacatgcag agtgccagca aatccagctg tcttttctct tcagaacaga tcccaaatct 26580cgccactcct tacccccaca agtgaggtgt ccccgctgct gctttctgtc gccaggatcc 26640cggtaataac cgtggagagg gctcctgccc ccacgccacc caccccacag ctcactctcg 26700ctccagccac caggggatgc cttccagcac gagtcagagc tggcacctcc tctgctcgag 26760acctcatgtg tcctctcctc acaccttggg ccctgtttcc ctacattctg ctacagcccc 26820tcaaacaggc cccgccccaa accagcccag ggcctttgca ctggctgatc cctctgcctg 26880gaccgcgctg cccccagaca gccacacggt tctcagcctc atctgcttcc agtctcgact 26940caaaagtcac caagaggcct tcccagcacc tgagctccga cggaagcccc tcgccacagc 27000acccaagcac tgctttatcc ccctacgcac acgtcccttt caaatactat tcatttacca 27060tctcctccca ctcactgaaa gggccagaga ctgggctata cccgctgcgt ggggagcagg 27120accaggcgca agggctcaca aatgcagtgg atgcctggtt gggaggtgag ggagctgcag 27180cgacccacgc tgggagggaa cgcaatgaca ggaggagcgc aggtcctggc gacacgatgg 27240ccatggcagc cgctggtgag caaccgcagg ccggccctgg gagagggctt ctagcaagct 27300gctatcttca gcctctccga ctactgcaga tgccccctcc tagccagaga cactgctaca 27360ccagccgacc cttccaaaaa gaaggtcagt aaccccgcga ctcctggagc cacagtgcag 27420ggggagaggg ctgagagggc aacagttcac caagcggaac agaggctgcc ccggaggtca 27480gctggctccc cggcagctgc aggggtggct agcccactcg gagggcagcg agggcatacg 27540aggggctcca gggatgagtg gttgcccagc acagcacccc tgggaggccg ggggcacttc 27600tcaggtagtg ggggcacgag gctgctctgg cctgacctca gggactcaaa atactttggc 27660gataaattcc accgtgtccc acccctgctg gtaccccata cttacacaca gactggttca 27720gatgcagaca ctctcgcgca catactcgct cacacgggca catacacgtg cacacacagt 27780cacatgcgca cactcataca cacacaaata tccactcaca cgcatgcatg cacacacacg 27840gacacacaca ggctcacacg tatgcacgca tatgcgtgca cacgcacaca cacacacaca 27900cgctcacatc ctcccactcc cacactcagt tgctcagaca cacacacgcc tggctctcac 27960acaaacctgt tgggctctga aaggctccag cccttcccat gctcgtcaga agccagtcaa 28020tggcttccta agtcaccaca cagatcaaag aggtgaactt ggccacatgg cactctgctt 28080cctgagctcc caaacaccag ccttggtgag gacagaccct caccccacac cctcattccc 28140actaccctgg gcaggcccag aggaggggca tctgcaggat ctggcaacca gcccctcccg 28200cccggctcct gcagccggca ccatgggagt cagggggagg tcactgcaaa gggcaacagc 28260aagttggtgg ccccaggact agagcccagg ggtcttcagt cctactccag agcttggaca 28320ctgtcccaca gggcatggcc aagggaaggg cttccagagc cctgacttca gggaggaggg 28380caggcgggct cctgtggcag gcctggatgc atggccgccc actcctggga ctttctaacc 28440tagaatatct aggtcaggct gggtgcagtg gctcacgcct gcaatcccaa cactttggga 28500ggccgaggag ggtggatcac ttgaggttag gagtttgaga ccagcctggc caacatggcg 28560aaaccctgtg tctactaaaa atacaaaacc tagccaggtg tggtagtgca cgcctgtaat 28620cacagctact caggaggctg aggcaggaga atcacttgaa ctcgggaggt ggaggttgca 28680gtgagctgag atcgtgccat tgcgcaaaga agatctaggc cggcccctca accggtgagg 28740tccaggctgg gagtgctgag agactgtggt gacactgaat gaactaacag gcaaagggct 28800tccaactgag cctgggggtg gtgggaaatg gctcttgtgt tctagtcaag acctctgcca 28860accagttctg acactgaccc agcacagaac ctgacaggtc agcaagggcc agggcttagc 28920acagcccagg taagggtgtg tgtacggccc ccagagtcac tcccaggctg caagaaaagg 28980gacaaaggag ggacaagggg tggccaagca aactgttccc tctgctcggg agtctggggt 29040gacctggcct agctggccag tggagctggg ccacctcccc ttaaactctc caccccggac 29100ttcgactcca aagctttcct gccacccacg ctctccccac ctgggatcac ggccaggccc 29160tgagccttca agggcccagg tgaactcagc cagactagga gctgaggagg acacagggca 29220gcttccagaa cggacccgag aaccactccc agcaggttct gcttccagac aaggagctgc 29280actttttcag ccaatgcaat tagaaagcca ggagaaggtg caaattccac ctgcctgagc 29340gtccgcactt cccaggccgc ccaccataca cacagcaaag atgtgtttaa ccattcaaac 29400ccatggccaa ccacatcggt tgcctcagac atgcaagttt taaaaaggaa cataactatg 29460ggccaggcac ggtggttcac gtctgtaatc ccagcacttt gggaggccga ggtgggtgga 29520tcacctgagg tcaggagttc gagaccagcc tagacaccat ggtgaaaccc catctgtacc 29580aaaactacaa aaattagctg ggcgtggtgg tgggcgcctg taatcccagc tacttgggaa 29640gctgaggcag gagaatcact tgaacccggg aggcgaaggt tgcagtgagc cgagattgtg 29700ccactgcact ccagcctggg caacaaggga gactccatct caattaaaaa aaaaaaaaaa 29760aaaaaggaac ataactatgg agtctcaagg ggaagtaatt ccttcaacaa taacaaatct 29820tgaaagctga gctctttttt ttttttgaga caggatctcc tcactttgtc gcccaggctg 29880gagtgcagtg gtgggatcac agctcactgc agcctcgatc tcccaggctc aaatgatcct 29940cctacctcag cctcccaaga agctgggatt acaggtgcat accatcacac ccgattcatt 30000tttgtatact ttgaagagat ggggtctcac catgttgccc agtgtggtct tgaattcctg 30060gactcaggtg atctgcccgc cttggcctcc cagagtgctg ggattacagg cctgagccaa 30120cacccccacg ggttcatttt cagagtcgca ccgagtgctg gggttacagg cctgagccaa 30180cccccccacg ggttcatttt aagagtgaca ccgagtgctg gggttacagg cctgaaccaa 30240cccccccacg agttcatttt cagagtcgca ccgagtgctg gggttacagg cctgagccaa 30300cccccccacg ggttcatttt aagagtgaca ccgagtgctg gggttacagg cctgagccaa 30360cacccccacg ggttcatttt cagagtcaca ccgagtgctg gggttacagg cctgagccaa 30420cccccccacg ggttcatttt cagagtcaca ccctttttct gaaaaacaac ttgggctcat 30480gcaaattcga gagagagatg gtgacactcc ccgccccctg gacccaggtg gagtcgcagc 30540agggtttacc cgtgagcggg gtccaaggcg atggccctcg gctggtcaag gtcctgccag 30600aagagcacct tccgggatgt gccattgagg ttggccacct cgatgcggtt ggtctctgag 30660tccgtccagt acagcttctt gcccacccag tcgcaggcga ggccgtcggg agagaccagg 30720ccggagatga ccacgttctg cacggcggcc cccgtctggt tcaggtaggt ctgcttgatg 30780gcctcctcgc tcacgtctgt ccagtacacg gctcccttgg aaaactggaa gtccactgcg 30840gccgcatcct ccaggccgct gaccacgatg gtggactcca gcttgactcc gccggcgtcc 30900accagccgta cgtcccggcg gttggcaaat agcaggagcg gcgaggctgt ggggcagaag 30960caaaccgtga gggccactgg ctaagccagc aagatacaca gccctgggat ggagcactat 31020gcccagagca ctcctggtac tgccctgccc atgcccaaga cctccagttc cttcctccca 31080cccctaaggc gttgtcagga agttgcctgg gcagccccgg cccgcatcat tcagaggctc 31140ctgcagcgca gcaaacagcc ttcttcccac attcggtgac agcacctgtt tgtttaccaa 31200ctgttacgtc tgttccccca gatatgggtg acccttcctg ccatgcccaa aacctcccac 31260atcgtcctcc agaggctaca ggggccctgt cctgttctgc agagaagcca catccccttt 31320gttggcctga cacaggggat ggggacatgc aggcacagca ctggccatgc tgctcgctac 31380agacccagcc acagggccac attttttgag gggttcagag cccaggccag acagagcctc 31440aagattccct tacaagtctt tgaccactgt ccaagctcag gcccgtttcc ttggccgtgg 31500catcagcttc ccatccaccc ctgtattcca tgtttctccc accctgcttc tggacattcc 31560tacatttaaa gggtcactct ggaatgccac cccttggctc agacaccttc cacagctccc 31620tgtgccagtg ccatgcagaa caaggtcaga ccccctagcc tggcctccaa ggccttggcc 31680tctggcctca cctacacttc tctccaccac cccaccccaa gcattcctga tctgcctgcg 31740gccaggctgg ctccctcacc tccctgtgca ccgcagccct cagccccttc tgcctgtgca 31800agaagcctca tctcacagac aacggtctca ttcccacaac gggctcaatg agaaatcagg 31860agaggccttc agaccatcac cccaccagac acctcagacg tcggaccagg agggtccagc 31920aacccccaac acagactcag agggactaag aagccacatg aggagtgaac acaagatgtg 31980gacaggagga ggttaagggc ctccagggag ctccatcagt ccgtgttctg ctgtcagcag 32040ggttaggctg ggctggccac aaacaccccc aaaaaacatc tgaagccttg gcttgaaaca 32100gctgacattc ctcatgaaaa ctgcagaccc ctgggtcctc ctgcgcagat gggggagccc 32160agccaacccc acactcccac cttcaccaag aaagagaaag ccaaaacaaa ctcaactcag 32220ccaatgacaa tcacagaact gaatcctgta gttagttcag ttggtttcat ttcagcaggg 32280gaaagatttg cagcctctat gagggtagct gggaacacaa agggccagag catggcccag 32340gagaccccag cgcagtgggg tagatggttc cgagcacagg cctccctgcc aagacaagca 32400ctggctcaaa tcctggcccc tcccattccc aggagacatg ctccacagga tgggaggaca 32460cacagaggac ctgaggccag gaaaatgaca gcggcgcctc cgccgcccca cccgtgctgt 32520catcatctta ggtctacagt tctttgtggc aacgagggac actgtgaaag tcaaacaaca 32580ggaaggcata ggccacaaat aaagacaaac gggacttcat gggaagctaa agattttgtg 32640catcaaaaga cactatcgag agagtaaaaa ggcaacccac agaatgagag aaaatatttc 32700caaatcatag atctactaag agattaatat ccatgaaata cagagaactc ctaaaactca 32760acaatgagaa aacaactaag ccaactcaaa aatgggcaaa caacttgaac agacatttct 32820ccaaagatga catataaatg gccaataaac acatcaaaac aggcttaata tatccctaat 32880catcagggaa atgcaaatca aaactacaat aagataccat cttgcaccaa ttaggacggc 32940tactatcaaa aaaacaaaat agcaagtgtt ggtgaggatc tggagcaact ggaacccttg 33000tgcaccactg gcaaaaatgt gaaatggtgc agctactatg gaaaacagca tggcagttcc 33060ccaaaaactt aaacacagaa ttaccatatg acccagcaat ttcgctttgg gttatatacc 33120caaaagaact gaaaacaggg acacaatcag atatgcatac accttggatc acagcagcat 33180ccttcccaac agctaaaaca tggaggcagc caggcatggt ggctcacgcc tgtaatccca 33240gcactttggg aggctgaggc gggtggatca cctgaggtca ggagttcgag accagcctgg 33300ccaacatggt gaaaccccgt ctctactaaa atacaaaaat tagctgggcg tagtgacggg 33360cacctgtaat cccagctact cacaagtctg aggcaggaga atcacttgaa ccctggaagt 33420ggacgttgca gtgagccaag attgcgccac tgcattccag cctgggtgac acagcgagac 33480tctgtctcaa aaaacagcaa aacaaaaaca aaaaaacaaa caaacatgga agcaacccaa 33540gcgtccctct actgagggat gaatagcggg gcaaaatctg ctccatccac acaatggagt 33600actattcagt ctcaaaaagg aaaaagattc tggtcaggca cggtggctca tgcctgtaat 33660cccagcactt ggggaggctg aggcgggtgg atcacctgaa gtcaggaatt caaggcccgc 33720ctggccaaga ctggcaccna gctacacana aagtatangg ccccggaaa 33769<210>9<211>72049<212>DNA<213>人(Homo sapiens)<220><221>未确定<222>(8356),(8385),(38585)<223>在上述位置的核苷酸序列特性未知。<400>9tataccttgc gcggaccttc ggctcctgtg gtgaagacaa tatgaagaaa atagaaatta 60cccataattt tgccacacag acttagttgt gtccatgtat cttgtgcacc ttttttctgt 120ttacggatca aaatcgactt ttagggtcag gcgcggtggc tcacacctgt aatcccaaca 180ctttgggagg ctggagttgg ggttgggggg tggatcactg aagatcagga gtttgagacc 240agcctggcca acatggcgaa actccatctc tactaaaaat aaaagattag ccaggcgtgg 300tggtgggtgc ctctaatccc agctactccg gaggctgagg caggagaatc gcttgaaccc 360aggagacaga ggttgcagtg agccaggatc acgccactgc actccagcct ggcaacagag 420cgagactctg tctcaaaaaa aaaaataaaa ataaaataaa taaatacata aattgacttt 480taggagattg gttcaaacaa tgtgtgtaat gttgtgtctg agtgtttttc atttatcgtt 540catgcaaatt ccgacatcat tcactcttct ccagagtgtg ctgttttcct gcctgtgtca 600tcacccgtca ccttgaatgc cctcgtttag gtaaaataag tacattttat tcaaaaatat 660ttgaggacat ttgggttgtc tccaggttct tggtcttgag ttttgctgtt cttgtggagc 720catggtggtg tctggttgca ggaacctcca tgcgttccag ctgctgcttc tgcctgtgtt 780cttagagagg aaatgctggg gtccgcggtt cccgggctgc tgaccaggaa gcctgcggtg 840ctttacggcc cttccagaag cgggagatgc ccccacttaa gtgtcagaca ggcctttcca 900cctcactggc agctctgagc ggctcccttc tatttgcaga tgactgagaa gttaccaatt 960tccacgttta ctgactgctg tttctcctgt taatttgtat ttatagtctt cgctaattta 1020ttgctagggt tttggtgttg tccctattga cttgtatgcc ttttaatttt ttaaacaaca 1080ttaatatact tcattttttt agagcagttt taagtttaca ggaaaattaa gggacaagta 1140cagagagttc cttccacctg ctgtcctcct ctcctcctcc ccaccttccc tccttcccct 1200attgtaactt tctttctgat attataaaag tcactcatgg ctgggcgtgg tggctcacgc 1260ctgtaatccc agcacgttgg gaggcagagg caggcagatc acctgaggtc aggagttcca 1320gaccagcctg gccaacatgg tgaaaccccg tctctactaa aaacacaaaa agttagccag 1380gcgtggtggc gggcacctgt aatcccagct actcaggagg ctgaggcagg agaatggcgt 1440gaacctggga ggcagaggtt acagtgagtc gagatcgcgc cactgcactc cagcctgggc 1500aataagagtg aagcttcgtc tcaaaaacaa agtcacacac gcttcttgta cgagggtcat 1560ttggccgagg ggccagatgg ctcaccatct agttgggaca ggccatgagc tcggaatgct 1620ttttacatat ttacatggtt gagaagaaaa tcaggagaat aatgttttgg gacatgggaa 1680aatgacatgg aatttgcatt ttagtgtcca taaatgaagt tttgtttgct cccagctgtg 1740ttgactgagg caggctggct tcctacagct gcggcagagc tgaggaggcg ggaaggagac 1800cgtgcaggcc gcagcaccga aaatatttgc tctctggccc ttcccagagt gcttgccgac 1860ctctgtccga cagctagaag gaaggatagg acccgtccga cgataaccac tgttgacatt 1920tgagcgcgtt tccttcccgg cttttgtgtg agagtggcag tctgtttgct tttgtggtcg 1980ggatctgctg cacgcacggc gggctgtttg catgaggctt cctggaggat agggctgggc 2040tcggagctgc acgcagtggg gcgtgtcctg catgcagtgg ggcctcagaa gagagctgtg 2100gtgggcgggg cagtgccaac gctggtgggt gccaggcctc cacgctcaga tcagccccgg 2160cgacaggttt gggccaccct ctctctggcc tctgtgcagt ggcccaggcc gtctgctctg 2220cctggcacac ttgcctctgt ccttccactg aagcgctcct cttaccctct gctcccggct 2280gggtacgttg aattgtgtcc ctcaaggaga tatgctaaag gtctaacccc aggaacctgt 2340gtatgtgatc taatttggaa acagggtctt ggctgatgta atcaagcgag gatgaggtca 2400ccctagagta gggggcctat atccacggtg ctggtgtcct catgagagca ggtgagcaga 2460cactgacact caggggtgaa ggctgcatgg agtcagaaca gggcttagtg cgatggcggc 2520cacaagccaa ggaactccaa gtatttcctg caacaccaga agctggaaga tgccaggaag 2580gatcctgccc tggagccttc ggagggagtc tgtccctgca gacgtcttga cttttgattg 2640cagggatgca tgtcttaggg tgtgtggggg ggtgcatttc tgatgttaga agccacctgg 2700ttggtggcga tgtgtcacgg gagccctctg caggttctgc gtgtccatgt ggtcggggac 2760agaggtgggc agggacggac ggtgtcgagc tggacatgtc catgacgtcg gccatccctt 2820gggatggctt ttttgttttg aggataaggc tgcctgccag gaagctgtgc cctgcctggc 2880ccttgcccca agcccctggc ctgtgcttgg cctcgcggaa gggatgtcgc ccttctctcc 2940tgcatgcgtg cagggaggaa ggggagaggt cagcagcccg cctggaggag gctcgggcga 3000ggggaaggtt tcactttcag gcaatgttgt ggggctgttt aaacaacccc aaagaaaacc 3060atttggccaa actgttagtt tccaaacatt ttacttcctt ggtgtttaaa taaattccta 3120ccaagactct gtagctggtc ccagggaagg agttggcctc tcttctttat agcccggcac 3180agtcagtccc ctgcacctgc ccctcccaac cccaggcctg cttccccgtg gccatggctg 3240ctgcccggac ctctctacac acagaacctc ctggaggcca gctgtgggca ccagccttgg 3300cagggctgtg gcggagccca ggctgctggt actctctctg cagctgctcc ctgctggcct 3360ggctggacag cgtccccacc accactgggg tcacctctgt gctggtcaca gctcactcag 3420accttcaggc aaatgggttg gatcctgcct ctctcccagg tgtctcagtc tctgcaaaac 3480tcaaaaacct cagaggcctt gcagcctgag gggtgtcaga gacacctcct tcgaatcagt 3540aaacacctac agattcaccc cagcagtgaa aggactgctt cgccacagag gtttgattta 3600ctcctaagta attggaaggg atgccgagaa taggttcctc atggtgggac tagaggccct 3660ctgctgacct agttaacaga gggctagggc tgggtgtgct cagcccctga aggttctagg 3720cccatttggg acaccccgcc agaacctgcc acaacctgcc atgtggtgac agctacctaa 3780atcccagagg ctcttgagct ggagagcaga cctctcaatc tcagcaggcc ccccacacag 3840accccataac cctagtctgc cttcacagta cagttcgtgg ctatgtgttc acggatggtg 3900ttgttcacct aaggtctctg ccctgtgacc ccaagggcgt cctgagggca gattccaagt 3960ctgtttcgtc cacccctcct tccctagcag cgggtccagg gcctggcctg aactagcttc 4020ccacagagat actggtggga tgatgaaggc agccaggcgg caagtgaaaa acgcacttcc 4080tgcatgtgct ggctcctggg attgaagtgt ttgaggaagc aaagtgaagt gagctttcct 4140cttgcggctg tgtgtccttg ggccgggagc ctaccctctc tgagcgttgg ggtccttgtc 4200agtagaatgg ggcatcctca tagctcaagg ggtggtgtgt gaaaattgtg ctattgtgtt 4260actttaatga tttttttttt ttcgagacaa agtctcaccc caacgcgcag gctggagtgc 4320agtggcgcga tctcagctca ttgcaacctc tgcctcctgg gttcaagtga ttctcctgcc 4380tcagcctccc aagtagctgg aattacagga gtgcgccacc aggcccggca tatttttcta 4440tttttagtag agagggggtt ttaccatgtt ggctaggctg gtcttgaact cctgacctca 4500ggtgatccac ctgcctcggc ctcccaaagt gctgggatta caagcatgag ccaccgcgcc 4560cggcctactt tagtgatttc ttaggaggac agagggaacg ggctggcaag acaggcttgg 4620aatgtgtttt gggatcaagt gccggtttct gtctggcact ggcgttctct gtggggccat 4680gatggacaca ctgctgaggt caagcgtgat tcgtcttgcg ctgtgcctgg cagtctcatt 4740ggaaagttct gtagacatcg tgtggatggg gctcttcccg gccaagccct tggggacctt 4800ccaggactgt gatctcccca cagtggctgt taagcaggga cctttcgtga agtggagtct 4860ctggtcccct ccaagtcata gctagacagg gactcgggca tcgccaagcc tggctgatta 4920ttcactggat gaggagacag gcccagagag gggcaggaac ctgcccgagg tcacccagca 4980ggccccagag gtttcggtct cggattctcc ctgctcatcc ctggatgtag tgctgctgtg 5040gatgtggttc tgtgctgggg gctgtggaga gcagggggct tgtgccagga ccccagtgag 5100ggtggcgccc tcgccatgag gccgactgtt ggtatggggc ggccatccac tggggtgtgg 5160ggaggaacag ctttcctgag gaggaggtgg cgggaggaac agcttccctg aggaggaggt 5220ggcggtgctg tgtgacctgg gccttgaagg acaggtccat tgtcaacaga acattttggg 5280agtggagcct agagggagaa aatttgttga aattcagatt cccctccccc taccaataca 5340caccaaatca gatgcccctg accagatcta aatttggctc tcagagattt ccattgtagc 5400tgggcacttg gggaaccttc taagtgctgc ctctgcctct ccccagcctg cctgcctcag 5460tttccccagc cctgggcccg tgtcgctgtt gccatcacgt gggcgccctc tagtggagga 5520atcagattat gcactccggg gcttggagca ggagtcagga ggggctcctg tctttccttg 5580aaacgttgga tgccgggatc ctggaacagt ctctgcattc ctcctggcga gaaccagagc 5640ctgggcacag gggaccatct gttgtttgaa ggctgcagcc tggcagggca ctcaggagat 5700ctggcagttg gctgcagggc caggtctagg ggccagggca tcagggaggc tctgggctgg 5760ttcagccccg ggcccctttg cagattgtga cctgggcccc tgtgcagggg catggccaca 5820ggatgctggg aggggtctct gaccctgacc ttcttggctc tgtgcatcct tgagaccaga 5880aaggtctgga acaaatgagt agacgatgcc ctaacctggg gagggagcca catcctgatc 5940ccagcaacct cgggaaggat ctgtcaggat tatggggcac cctgggggcc ccaagtctgc 6000atgggtctcc acttgcaatt tctgtaggaa gctctgataa atccaaactg ggggtcctag 6060gacacagtca gaaatgctga taccgttgtg tgtggagcct cgggccctgg gggtcaggag 6120catgtggagg gtgggccacg ggggttcaga agagaatcct gtaacccccc accccccaaa 6180ctgaagccca cttgagggcc atggctgaaa ggttgggggg tctccgtgcg tcctgtggag 6240tgggtggtga ggagtccttg ggtttgcacg cctctgggcc tgagcggcgg gaccccgtcc 6300acagcggatc cctgggccct gttgctcaga tgctctcaga gtgttgctgt ggccacggag 6360ggagcctgag ttaagcttct cttgtgccgg ttgtacgctg tcaggtcaca ctggtgagtt 6420aggcagggca cagatgccca gagcagaggg aactttcctt ggggattcaa cacgtgcaag 6480tcttaggggc tggcaaatcc tgccctcagc tagagagggg gcttttattt gagaccagaa 6540tcacctgagc atcctcctgt ccccagctgt gtccagcctg tctgcaggga catcctgaga 6600ggaccaggct ctcccctcat ccacctgcct aagtgccact ctgaaccctg tccacctgtg 6660ccgtggaggg gcgtgacctc aagctgctca gccagcagca ggcttggccc tggggggcag 6720cagagaccca ggtggctgtg gggtgggtgc ttcgtggcgt ggttctgaaa cttcgttgga 6780agtgtgtgga cagtgccttg cctgttctct gtgggaccct atttagaaac gaggtctgag 6840ttactggggg tcatcactgt gttctgatgg cccagctgtg tggaggccgc ggtgcagccc 6900catccaagga gccagggccc tgggtctagc cgtgaccaga atgcatgccc cggaggtgtt 6960tctcatctcg cacctgtgtt gcctggtgtg tcaagtggtc gtgaaactct gtgttagctc 7020ttggtgttcc tgaaagtgcc cccgggtctc aggcctcaga accagggttt cccttcatct 7080cggtggcctg ggagcatctg ggcagttgag caaagagggc gattcacttg aaggatgtgt 7140ctggccctgc ctaggagccc cccggcacgg tgctggggcc tgaagctgcc ctcgggtggt 7200ggagaggagg gagcgatgaa gtggcgtcga gctgggcagg aagggtgagc ccctgcaagg 7260tgggcatgct ggggacgctg agcagcatgg ccagcagctg ggtctgcagc ctggtacccg 7320gcgggacttg tggttggggc tggtttgtgg ccaggagagg ggctggcagg agacaagggg 7380gactgtgagg cagctcccac ccagcagctg aagcccaatg gcctggctgt gtggctctca 7440gctgcgtgca taacctctca gtgcttcagt tctctcattt gtaaaatgag gaaacaaaca 7500gtgccagcct cccagaggtg tcatgaggat gaacgagtga ccatgtagca tgggctgggt 7560gcgtgtcacc taacatcacc agcctttgca aggagagccc tgggggcctg gctgagtatt 7620tcccttgccc ggcccacccc aggcctagac ttgtgcctgc tgcaggccct tgacccctga 7680ccccattgca cctgtctcca caggagccga ggaggtgctg ctgctggccc ggcggacgga 7740cctacggagg atctcgctgg acacgccgga cttcaccgac atcgtgctgc aggtggacga 7800catccggcac gccattgcca tcgactacga cccgctagag ggctatgtct actggacaga 7860tgacgaggtg cgggccatcc gcagggcgta cctggacggg tctggggcgc agacgctggt 7920caacaccgag atcaacgacc ccgatggcat cgcggtcgac tgggtggccc gaaacctcta 7980ctggaccgac acgggcacgg accgcatcga ggtgacgcgc ctcaacggca cctcccgcaa 8040gatcctggtg tcggaggacc tggacgagcc ccgagccatc gcactgcacc ccgtgatggg 8100gtaagacggg cgggggctgg ggcctggagc cagggccagg ccaagcacag gcgagaggga 8160gattgacctg gacctgtcat tctgggacac tgtcttgcat cagaacccgg aggagggctt 8220gttaaaacac cggcagctgg gccccacccc cagagcggtg attcaggagc tccagggcgg 8280ggctgaagac ttgggtttct aacaagcacc ccagtggtcc ggtgctgctg ctgggtccat 8340gcgtagaaag ccctgnaaac tggagggagc cctttgtccc cctgncttca gtttcctcat 8400ctgtagaatg gaacggtcca tctgggtgat ttccaggatg acagtagtga cagtaagggc 8460agcctctgtg acactgacca cagtacaggc caggcctctt tttttctttt tttttttgag 8520atggagtctc actctgtcgc ccaggctgga gtgcagtggt gtgatctcag ctcactacaa 8580cctctgcctc ctgggctcaa gtgattctcc tgcctcagcc tcctgagtag ctgggattac 8640aggtgcctgc cactgtgctt ggctaatgtt tgtatttttg gtagagatgg ggtttcaccg 8700tcttggccag gctggtcgca aactcctgac ctcaggtgat ccacctgcct cagcctccca 8760aagtgctggg attacaggca tgagccacca cgcccggtca ggccaggcct cttttgaaca 8820ctttgcacac catgggtctt ttcatccagg ggggtaggta cagttgtaca gttgaggaca 8880ctgaagccca gagaggctca gggacttgcc cagggtcaca cagcaggatg tggcaggtgt 8940ggggctgggc ctggcagcgt ggctccagct ttccagcata gaaatctgtg aaagcagata 9000gtttgtcggt cggtagggga gactttctga gacccgcccc agcggctcag agggtagtag 9060ccaggggcct tcctgggggc tcataaccca gaacactgaa tgggaaaacc ctgatggagg 9120aggcgcagtg gagctgtggg tgccgatggg aagtcccaga ggagctggga ggtcagtagc 9180ggtgctgccc tctgtggagc acttagtggg caccaggtgt gtttccaggt tcatggccct 9240gggacctgaa gctcagaagg tgaagtaact tgcccagggc acccgtcggg cagcggcggg 9300cagaggattt gtgggctgtg gagcctgtgc tcgtggccca gccctggggg ttgtgagtgt 9360gctggccggg gagcttttcc tgcaagtgga ctggtgtcta ggagccagca tgtcaggcag 9420caggcagcgg gagtgcagca ggcagcggga gcacagcagg cagagggcgg ggctcgagca 9480gccatccgtg gaccctgggg cacggaggca tgtgggagag ggctgctcca tggcagtggc 9540tgaagggctg ggttgtgccc cgaggagggt ggatgagggt aagaagtggg gtccccaggg 9600gctttagcaa gaggaggccc aggaactggt tgccagctac agtgaaggga acacggccct 9660gaggtcagga gcttggtcaa gtcactgtct acatgggcct cggtgtcctc atctgtgaaa 9720aaggaaggga tggggaagct gactccaagg cccctcctag ccctggtttc atgagtctga 9780ggatcccagg gacatgggct tggcagtctg acctgtgagg tcgtggggtc cagggagggg 9840caccgagctg gaagcgggag gcagaggggc tggccggctg ggtcagacac agctgaagca 9900gaggctgtga cttggggcct cagaaccttc acccctgagc tgccacccca ggatctgggt 9960tccctccttg gggggcccca gggaacaagt cacctgtcct ttgcataggg gagcccttca 10020gctatgtgca gaaggttctg ctctgcccct tcctccctct aggtgctcag ctcctccagc 10080ccactagtca gatgtgaggc tgccccagac cctgggcagg gtcatttctg tccactgacc 10140tttgggatgg gagatgagct cttggcccct gagagtccaa gggctggtgt ggtgaaaccc 10200gcacagggtg gaagtgggca tccctgtccc aggggagccc ccagggactc tggtcactgg 10260gcttgccgct ggcatgctca gtcctccagc acttactgac accagcatct actgacacca 10320acatttacaa acaccgacat tgaccgacac cgacatttac cgacactgac atttaccaac 10380actgtttacc aacactgaca tctactgaca ctggcatcta ccaacactga catttaccga 10440cactgacatt taccaacact atttaccaac actgacatct actgacattg gcatctacca 10500acaccaacat ttaccgacac caacatttac caacactgaa atttaccgac accgacattt 10560accgacaccg tttaccaaca ccgacgttta ccgacaccga catttaccga cactgatatt 10620taccaacact gacatctact gacgctggca tctactgaca ccgatgccag catctaccaa 10680caccgacatt taccaacact gacatttacc aacactgaca tttaccgaca ttgacattta 10740ctgacactga catctactga cactggcatc tactgacact gacgtttacc gacactagca 10800tctactgaca ctgacattta ccaacaccag catctaccaa caccgacatt taccaacact 10860gacatttact gacactgata tctactgaca ctggcatcta ctgacaccaa catttaccaa 10920caccagcatc taccaacacc gacatttacc aacaccagca tttaccaaca ccgatgttta 10980ccaacgccga cgtttaccga cgccagcatc taccaacact gacatttacc gacaccgaca 11040tttaccgaca ctgacattta ctgacactga catctactga tactggcatc taccgacact 11100gatatttacc aacgccagca tctactgaca ctgatgttta ccaacaccga catttacgag 11160caccgacatt tactgacacc aatatttact gacatcaaca tttagccatg tgatgggggc 11220cggcttgggg gcaggccttg ctcttggcac tggggatgct gcagagacca gacagactca 11280tggggtcatg gacttctgct tcttctccag cctcatgtac tggacagact ggggagagaa 11340ccctaaaatc gagtgtgcca acttggatgg gcaggagcgg cgtgtgctgg tcaatgcctc 11400cctcgggtgg cccaacggcc tggccctgga cctgcaggag gggaagctct actggggaga 11460cgccaagaca gacaagatcg aggtgaggct cctgtggaca tgtttgatcc aggaggccag 11520gcccagccac cccctgcagc cagatgtacg tattggcgag gcaccgatgg gtgcctgtgc 11580tctgctattt ggccacatgg aatgcttgag aaaatagtta caatactttc tgacaaaaac 11640gccttgagag ggtagcgcta tacaacgtcc tgtggttacg taagatgtta tcattcggcc 11700aggtgcctgt agacacagct acttggagac tgaggtggga ggatcgctgg agtccaagag 11760tttgaggcca gcccgggcaa aggggacaca ggaatcctct gcactgcttt tgccacttac 11820tgtgagattt aaattatttc acaatacaaa attaagacaa aaagttaatc acatatccac 11880tgccctgctt aagacagaaa acatgggtgt tgttgaagcc agaggcagct gctggcctga 11940gtttggtgat tggttcctaa gcagttgaag gcagttttgt ttttccatag atgtctgttc 12000tccctttgct gggtgcagcc tcgccctgct gctgtggtcg ggtttcagtg gcctcgtccc 12060gtggacgcag cctcgccctg ccgctgtggt cgggtttcag tggcctcgtc ccgtggacgc 12120agcctcgccc tgccgctgtg gtcgggtttc agtggcctcg tcccgtggac gcagcctcgc 12180cctgccgctg tggtcgggtt tcagtggcct cgtcccgtgg acgcagcctc gccctgccgc 12240tgtggtcggg tttcagtggc ctcgtcctgt ggacgcagcc tcgccctgcc gctgtggtcg 12300ggtttcagtg gcctcgtccc atgggcgtgc tttggcagct ttttgctcac ctgtggagcc 12360tctcttgagc ttttttgttt gttgtttgtt tttgtttgat tttgtttgat tgtttgtttt 12420tgttgtcgtt gttgttgccc aggctggagt gcagtggcgc gatctcagct cactgaaacc 12480tctgcctcct tgggttcatg ccattctcct gcctcagcct cccacatagc tgggattaca 12540agtgcccgcc accacgcctg gctaaatttt gtatttttag tagacagggg gtttcaccat 12600gttggtcagg ctggtctgga actcctggtc tcacatgatc cacctgcctc ggcctcccaa 12660agtgttggga ttacaggcgt gagccaccgc gcccagccct ctgttgagca tattttgagg 12720ttctcttggt gccagtgata tgtacatgtg tccccatcgc accatcgtca cccattgagg 12780tgacattggt gcctctcctc ggggtggatg cctccctctg tttccagcaa cttctgaagg 12840attttcctga gctgcatcag tccttgttga cgtcaccatc ggggtcacct ttgctctcct 12900cagggctccc aggggaggcc cgaatcaggc agcttgcagg gcagggcagg atggagaaca 12960cgagtgtgtg tctgtgttgc aggatttcag accctgcttc tgagcgggag gagtttcagc 13020accttcaggg tggggaaccc agggatgggg gaggctgagt ggacgccctt cccacgaaaa 13080ccctaggagc tgcaggtgtg gccatttcct gctggagctc cttgtaaatg ttttgttttt 13140ggcaaggccc atgtttgcgg gccgctgagg atgatttgcc ttcacgcatc cccgctaccc 13200gtgggagcag gtcagggact cgcgtgtctg tggcacacca ggcctgtgac aggcgttgtt 13260ccatgtactg tctcagcagt ggttttcttg agacagggtc tcgctcgctc acccaggcga 13320gagtgcagtg gcgcaatcac ggctcgctgt agcctcaatc tccctgggct caggtgatcc 13380tcctgcctca ccctctgagt agctgggact acagacacat accaccacac ccagctagtt 13440tttgtgtatt ttttgtgggg ggagatgggg tttcgctgtg gtgcccaagc tgatctcaaa 13500ctcctgaggc acaagcgatc cacctgcctc ggcctcccaa agtgctggga tgacaggcat 13560cagccgtcac acgcagctca atgattttat tgtggtaaaa taaacatagc acaaaattga 13620tgattttaac cattttaaag tgaacagttc aggctgggcg tggtggctta tgcttgtaat 13680cccagtactt tgagaggctg aggtgggcag atcacctgag gtcaggagtt tgagaccagc 13740ctggccaaca tgatgaaatc cagtctctac taaaaataca aaaattagcc gggcatggtg 13800gcaggtgcct gtaatcccag ctactcggga ggctgaggca ggagaatcgc ttgagcccgg 13860gaggtggagg ttgcagtgat ctgagatcat gccactgcac tccaatctgt gtgacagagc 13920aagactctgt cttgaaaaat aaataaataa aaaaaatttt aaaaagtgaa caattcaggg 13980catttagtat gaggacaatg tggtgcaggt atctctgcta ctatctactt ctagaacact 14040ttcttctgcc ctgaaggaaa ccccatgccc accggcactc acgcccattc tcccctctct 14100cccagcctct gtcaaccact aatctacttt ctgtctctgg gggttcactt cttctggacg 14160ttttgtgtga ctggaatcct gcaatatgtg gtccctgcgt gtggcttctt tccatagcat 14220tgtgttttcc agattcaccc acacattgtc gcacgttatc agaatctcat tcctgactgg 14280gtgcagtggg ttaggcctgt aatcctaaca ttctgggagg ccaaggcggg acgatcactt 14340gaggcaggag tttgagacca gcctggccag cctagcaaga ccccagctac caaaaaattt 14400taaaagttaa ctgaacgtgg tggtggtggg cacttgtggt tcccagctac ctgggaggct 14460gaggttggag gatcgcttaa gcccaggagg tcaaggctgc agtgagctat gatcgcacca 14520ctgcactcca gcctggacaa cagagcaaga ccctgtctga aaaaaaaaac aaaaaaaaaa 14580gttcctttct ttttgtggct ggatgacatc ccattgtatg gccacagcac attttgtttg 14640tctgtttatc gggtggtggg cagtggtttc caccttttgt ctcctgtgaa taatgctgct 14700gtgaacattt gaattcaagt ttttgtttga acacctgttg tgaattattt ggatatatgt 14760gtaggggtag gattgctgag tcctatggta atgttaggtt tgacttactg aggaaccatt 14820aaactgtttt caacagtggc tgcgccgttc tgcatcccca ccggcagtgt gtgagggttc 14880tgactttacc tcctcacaaa cgcttctttt ccatttaaaa aaatattcag ccaggtgctc 14940tggctcacgc ctgtaatccc agcactttgg gaggccgtgg cgggcggatc acctgaggtc 15000aggagttcga gacgagcctg gccaacatgg tgtaacccca tctctaccaa aaatataaaa 15060attagccggg tgtggcagcg ggcgcctgta atcccagcta cttgggaggc tgaggcagga 15120gaatcacttg aacccgggag gcagaggttg cagtgagcca agatcgcgcc actacactcc 15180agcctgggtg acaagagtga aactccatct aaaataaaac aaaaataaaa ataaataaaa 15240atttattaaa acattcatca cagccagcct agtgggtgtc ccatgtggct ttgcctcgca 15300tttccctgat aactaggatg ctgagcgtct tgtcccaggc ttgccacacc tcagcacttt 15360gagatacgtc gcacagtccc catttgcgaa cgagaaatga ggtttaggga acagcagctg 15420tgtcatgtca cacagcgagc agggggtctc tgagccgtct gaccccacag ccgaccaagc 15480tccaatcctt accgcctcct agtgttgtgg atgtagccca gggtgctccc acatttttca 15540gatgagaaca ccgaagctca aaacaggagc gttttgtcca cattggatac acgatgtctg 15600tggtttggtc ctgaagtcac tttatatctc agtggtccag actggagtag gacagggggt 15660tctggggaat ggggaaggtg tctcaggtga aaggaaggaa ttccagattc tccatactgt 15720ccttgggaag ttagaagact cagagggtct ggcaaagtca gacaaagcaa gagaaatgca 15780gtcaggagga agcggagctg tccaggaaca ggggggtcgc aggagctcac ccccaggaac 15840tacacttgct ggggccttcg tgtcacaatg acgtgagcac tgcgtgttga ttacccactt 15900tttttttttt tttgaggtgg agtctcgctc tcttgcccag tctggagtgc agtggcacga 15960tctcggctca ctgcaagctc tgcctcccgg gttcatgcca ttctcctgcc tcagcctccc 16020gcgtagctgg gactacaggc gcctgccacc gcgcccggct aatttttgta tttttagtag 16080agatgggatt tcactacatt agccaggatg gtctcgatct cctgacctca tgatccgccc 16140gtctcggcct cccaaagtgc tgggattaca ggcgtgagcc accgcgcccg gcccgatttc 16200ccactttaag aatctgtctg tacatcctca aagccctata cacagtgctg ggttgctata 16260gggaatatga ggcttacagg ccatggtgct ggacacacag aagggacgga ggtcaggagg 16320tagaagggcg gagagaggga acaggcggag gtcacatcct tggctttcaa aatgggccag 16380ggagagacac cctctgagca tggtaggaca ggaaagcaag attggaacac attgagagca 16440accgaggtgg ctgggcgtgg tggcttacgc ctgtaatccc aacactttgg aaagctgagg 16500tgggtggatt gcttgaggcc aggagttcaa gaccagcctg gccaacatgg tgagaccccg 16560tctctactaa atatacaaaa attagccagg cgtgatggtg catacctgta atcccagctg 16620cttgggaggc tgaggcagga gaattgctta aacctgggag gcggaggttg cagtgagccg 16680agatcccgcc actgcactcc agcctgggcc acagagtgag actccatctc aaaaaaaaaa 16740aaaaaaaaga taaaaagacc aaccgaggaa ttgaagtggg ggggcgtcac agtagcagaa 16800gggggatcgt ggagcaggcc accctgtggt catgcactgg aagctcatta cctgacgatt 16860tggagctcat cactgggggc ctaaggagaa tagatactga aggatgagga gtgatggcgc 16920ggggcacggg tgtctttggt ggccagaact tggggactgc tggggtgcct cactgcaggc 16980cttctcagcg ccctttatat gcttacacag gctgtttcta agagggggat acattgcata 17040agcgttttca gactacctca tcatgggtcc ctttctttac cctctgtggc cctggtggcg 17100cactctctgg gaaggtgcag gtggatgccc agacccgccc tgccatccac ctgcacgtcc 17160agagctgact tagcctcgag attgctgctg gcacctcctg ccccgggaca cctcggatgt 17220gcccgtggag atgctggctc tgtgttttct gctggagttt ggtgcgtctt ttcctcctgc 17280aagtggccac cgctcttggg tatgtcctca ggcttctgcg agtcatggct gcttctcagg 17340tccttgccca gcgccaggag caaaccctcc tggcactttg ttcaggggtg gatgcgccag 17400tgttcctgct gtggaccgcc atctcacatg agggtcttgg gcctgcaggc tcgttcagga 17460aacacccgct gagtatgcag tgtgtgccag ctgtgtccca ggcaatggcg gggacagtgg 17520ctgctgctgg ggttgtggtg gcttctgggg actctgggga cagctgaggt gcaaggagcc 17580acggctcctt gaggatgcag ttggactcca ggtggaaggg atggttgggg gaggtataaa 17640tggggtcagg gaggagacac atttggaaca atgggaacat ttttaagatg ctatgtcggg 17700aggcaacaag gtggccaacc caggtgctga ggagcccaca ccagccctgg acgtgttttg 17760ccgctcacct ttgctgggga gtggtgggag agaggattcc gttccacgtg gtggtgtgcg 17820cagctgggct gtgtggagct gggcgctagg aggaaggtgc tttctgcggg gctagccggg 17880ctctgccttt gaacacaatc aggctccagg ttttcagcat ccagtgcatg agaggacttc 17940acgggcagct gtggctgatc ccttgatgaa ttgggagaag aacaaaggtc tatgaaatga 18000ggtttcatgt agatggcatt agagacgccc acaacagatt tacagagtgg agcggagacg 18060gcggatgggt ctgggaggcc cctcctgctg gccttgactg tgacagctgt cctgggaatc 18120agcttccagg ccgccccagc agcctgactg acacacacag gggttttagc cccatcctgc 18180gaccagctgt tgccatcatc agtgacagct gggagtggcg gtggttccag ccctgggcac 18240cctccccacc tgctggggcc cacccagggc agtcctgaca cctacaggtt gcttggagcc 18300gcatccgagt cctgccccac cacgtgtgaa gcccgagtgg tcgtgggctg aggtcccctg 18360attgcatccc cacttccctt ctgcttcaca tagctgcctc ttctcaccgt ttttccagcc 18420tcctgggcta ggaattccag tgttgtgctg gctttgcccc aggacacctc cttagccctc 18480ttcctgagtc tagagccccg ggggttggaa gtcctggccc ctgggacacc tgcagccaca 18540ctcagcttct cctgtgagcc tccagcatgt cccctcagga ccaagccctc acgttcttgc 18600ctccccgccc acctgggctc agccagggga aggcctggct gggagcgtct cccctctgcc 18660ctgcccttct cccctcctac cctgcccttc tctcctctgc cccgccatgg cttttatatc 18720ctgtgccaca agacatggct gtgtgtgaaa gtggcagggt ctggcatctc tgtgggtctc 18780tgaggcccac gctccagtgc cactcttccc acccgctggc cgtgccctca tgctggaggg 18840acagcccagc cctctcccga accccagccc catgtgccca gctgcccccg gccctctccc 18900ctggaagccg gggtcactcc agccgtatgc catggtgggg acatcctgct tccttggcct 18960tccagggaag gtcctctttc caaatggcga cacctggtcc ctgcctggag gctggaagct 19020gtggcccttg tatgcccctc cagggtctgt gcgctcggtt ggcccgagtt cccatcaccg 19080tcatcatcac catcatcatt gtcatttcgc ttgtctgtga gccggcctgg tctcccagag 19140cagagaccct ctgaggtcca gcctgagttg gggtctccgt gctgacccct gacggggact 19200caggacgtac caggtctggg tcaggagtga cccccaaacc tcgtgccctt tgacaggcac 19260ccctgacttt tgctaagtgg gtggaggtga catcacttac agcgggagtg atgggacagg 19320gtctgttggc tgcactgtgc tcccagggat ctggggagag gctatatccc tgggctttgg 19380cactgcagag ctgtgtgtgt ttgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 19440gtgtgtgttt gcgtgcgcgc acatgtgtat aagatctttt tttattacat gaagcaagat 19500aactgttgct gtttcctttt gggttttgtg ttcaacagag tggggtactt cttccctcag 19560acaacagaac tctccccttt aaacacgtgc tgtcagaggg tgggtcttgg gctcatgtct 19620gtttgcacag ccgagtcaga ggaaacacag ggttcttcat aaaaacactg cacagcaggc 19680gactgtccag agtcagcctg caggacggca gcagccctgc ccctcagagc acagctaggg 19740tgggctgctt tgggatctcc cgtcattccc tcccagctgg cagccggcgg ccggcccatt 19800ccttggtgtg ctggtcaggg gggcgtgcgc ctgctctgct caccctggga atgggacaga 19860agctggcagc tcggagagga cagggctgga cccttgggtg gcctctggct ggaccatctc 19920attgtcctca gacacagcct ctcgggtcta gtttcatttc ctgaaaaaca agtgcacaga 19980actagagcag gagtcgagag ctacggcccc cgggccagat ccagccctgc cacctgtttt 20040cacaccatgc tcaagctgag tgggttttac attttttaat tacttgaaaa aaaaaaagcc 20100aaaggaggtt tcatgaccca tgaaaattat atggaattca aaaaaaaaaa attatatgga 20160attcaaattt cagtgtccat aaataatttc ttgagacagg gtctcgctct gtcacccagg 20220ctggagtgca gtgctatggc atggctcgct gtacccttga cctcccaggc tcaagcgatc 20280ctcctgtctc agcctcctga gtagctggga ctacgggtgt gtgccaccaa gcccggctaa 20340ttttttttta attttagtaa agacagggtc tttctatgtt gcccaggctt ttctggaact 20400ccatcttggc ctcccaaagt gctgggatta caggctcgag ccacggagcc cagcctgttt 20460ttgttttttc actgataaag ttttgccggg tgtggtagtg tgtgcctcta gcgatttggg 20520aggctgaggt gggaggatcg cttaagccca ggagtttgag gctgggctca agtgatcagg 20580aggtgaacta tgatcatgtc attgcattcc agcctgggtg acagagcaag aacctatctc 20640ttaaaaatat atatttaaaa agtattgggt gtggtggctc acgcctgtgg tcccagctac 20700ttaggcatct gaggtgggag gatggcttga gcccaggagt ttgaggttgc agcgagccaa 20760gatcgtgtca ctacactcta gcctgggtga cagagcccag accctgcctc tttaaaaaaa 20820aaaaccaaaa aacatgtatt ggaacacagc catgcctgtt cagtcacgtg ctctccatgc 20880tgctttctgc tccagagacc cttatggcct gaaagctgaa aatattttct atcctttaca 20940aaaaagtttg ctgacctctg tcctggaaaa ttcatctccc aagttctctt ccggcactgg 21000cgttcctggg tgtcctaaat ttggcccctg ttatttctga actctgtttt ggctctgttc 21060cctcccagga gccaggacag gcacgttctc tgcatcttgt cccctgacgc ccagaggctt 21120ggctcggctc aggcattctt ggaaatatct ggctccagga aaggcagagg cctcctgagt 21180cggcccagag ggaacctgcc ccaggtctgg gggaggcctg acccagcaga gtggcttttg 21240ccgatgggtt gggccggtca agatgtgctg aaagttgtcc tcagaaggcc actttgggat 21300tccttcctcc agtattagag caactgagag ctgctcattg caagcctgat gttttcccag 21360ttggccgggt ccaccgggtg ccctgggatt ctgggatctg ggtggaaagt agggggcttg 21420ggggagtgtc ctgggttctg gaatccaggt ggcaagtggt gaggttcagg gagtggcttc 21480tgagccacca taggggtctc tgtgggaggc tctgcccatc caggagattc cgcaggccct 21540gccggcccag agccagcgtc ttgcgcttgc cgaggctaca gccagcccca gccgggtgga 21600acagcccgtc gcctcctctc actttgtttt ggggccacct gggagtgtgg agcaagggta 21660gagagggagg aagtggctgc cggccgctgc ccagcaccct tgtttgcctt gggccctctg 21720tgggctcctt tttattgctc ttcaatgaag ccagggaaat ggacttcctt gcctcacttc 21780agttcaacat gtctggaagt ttggtattaa aattaagaaa gtgtggaaat agagcaagaa 21840gagaaaaatc tctccaagag ataatagtga cctctgagct gggcgcggtg gctcacgcct 21900gtaaatccca gtactttggg aggctgaggc gggcagatca cctgaggtcg ggagtttgtg 21960accggcctga ccaagatgga gaaaccccgt ctctactaaa aataaataaa taaataaata 22020aataaataca aaattagcca ggcatggtgg cgcctgccta taatcccagc taaggcagga 22080gaatcgcttg aacctgggag gcaaaggttg cagtgagcca agatcacgcc attgcactct 22140agtctgggca acaagagtga aactccgtct caaaaaaaat aaataaataa aaaataaaaa 22200tagtgacctc tggccaggtg tggcagctca tacccgtaat cccagcactt tggaaggaag 22260gccgagatgg gcagattgct ttagcacagg agtttgagac cagcctggcc aacatggtgg 22320aaccccatct ctacaaaaat agaataaaat ttaagaggta atagtgacct tttggtagat 22380cgaaacctgg attgctttct ttttctaaat gctgattctt ttctttgtgg tgtttgtgtt 22440ctgtgccgat gtccctcccc cagccctgtt attgtgagtg gaagaagggg aaagggttcg 22500cccgctactg tgagcccctc ctctcacgct gggtgtcctt ggagaagcct gcacttcttc 22560attgtacgcc agggctgggt ccctccctgg agtggttctg tgctgctggg atggggccaa 22620cccctcagat gttttctgag tgtcacacac aggtgtgtgc attcatggcc tttgcgtgtc 22680ttcctgttgt ggaggcaaaa atgtgaagaa ccctagatga ttttgggacc agggctccat 22740cacctgctgt tcattgcaca ccggagcatc caggcatggg tggagagctc agacttccag 22800gcacggtcgc aggggctggt ctaaccatgt tcccgcccgc ctgctcgtca gaaccgcctg 22860ttgggagctg ttatcatgat accatacctg ggccctgggc tatccgattc tgacttaatt 22920gctccaggtt ggggccaggc cgttgtttgc tgttttgttg tttcttctgt gacgttagcc 22980actgggctaa tctgagcccc tcagttacag gtggagaaac tgagacccat gggggtgcaa 23040ggacttgccg aggacccaga gccccttggg ggcagagctg aggcggggcc tggctttggg 23100tcccagagct tccagtcccc ttcccgctct cctaacagct tttttttttg agacaagatc 23160tcaccctgtc acccaggctg gagtgcaatg gcatgatctc ggctcactgc aatcttcgct 23220agctgcgttc cagcgattct cctgcctcag cctcccgagc agctgggatt acaggtgtgt 23280gccgccatgc ccagctcgtt tttttttgta cttttagtag agatagggtt tcaccatgtt 23340ggccaggctg atctcgaact cctgacctca aatgatccgc ctgcctcggc ctcccaaagt 23400gctaggatta caggctggga tcacactgtg cctggcccta gcagctttgt cctgtgccat 23460ccaacaacag atgaccgaag tctttgtttc ttaacatgca ttccatctgc cttacagttt 23520tgccacctgc aaaacagagg acttgtcgct tttctggtaa gctggaaatg taatctggta 23580gcaggaggcc tgtggaagct tgcctttaat ggccttgtgt ctctttcatc ctgtcctgag 23640agccggagaa cttggatgtt gcacctaact caaccttcct gttaacatac agttctgcag 23700gctcatggat catcagaacc acgtcctatc tcacgcggct gtatgcttcc gttggttcag 23760gtgtttttac cttgacagta ttttctcctc ggtggctttt gcggtggttg cttttaatca 23820gcattgactc ttcaagaaaa atatttagct gctacatctc agaggagaca gggtggaaag 23880catctgagac ctgcaggctc agacttagaa ccagaagtgc cctcagagtt catccggccc 23940tgacccagcg ggaaatgagt tcacagagaa gcgggagaac tttgccccag gccctgccgt 24000tgctcataac tgccccaggt ccttacattt gctccaggtc ctgccccagg ccctgcagtt 24060gctcataact gccccaggtc cttatatttg ctccaggtcc tgccccaggt cctgcagttg 24120ctctgtgtgg tgggtgtgat ctggagccct ccgcccattg ctgcacctgg ggcaggcatt 24180gctaattgat cccaggactc cttcctgcgg agcacgccct ggttctccag gcagccgctg 24240cctgtcagcc tgcagtggtt cgggagagga cacctgcttg cctggtctgt tccaaatctt 24300gcttctcatc ccagcacagg tagggggtgc tatgggaaag ggatcctcag ttggccctgt 24360cactgctcta tcagctgggg acgtggcatc ctagtgaaaa catcatggcc gggcgcggtg 24420gctcacgcct ggaatcccag cactttggga ggctgaggag ggtggatcac ttgaggtcag 24480aagttcgaga ccagcctggt caacatggtg aaacccatct ctactaaaaa tacaaaaatt 24540cgccaggtgt ggtggcgggt acctgtaatc cgagctactc gggaggctga ggcaggagaa 24600tcgcttgaac ctgggaggtg gagcttgcag tgagccgaga tcttgccact gcactccagc 24660ctgggcaaca gagtgagacg ctgtctcaaa atctcaaaca aacaaacaaa caaaaaacaa 24720acaaacaaag cgtcatttat ccagcacccc tggggaacca tgctacctgg tgttttatgg 24780tacctggcaa ggtgcaggtg aagttgctgc tcttgggcat tgaacccgtc ttgtttgggg 24840cagctcaggc cccaggcagg gtccgggttg gctctcgttg gtgtggccct ggcccatcca 24900gacctatatt tctgccgtcc tgcaggtgat caatgttgat gggacgaaga ggcggaccct 24960cctggaggac aagctcccgc acattttcgg gttcacgctg ctgggggact tcatctactg 25020gactgactgg cagcgccgca gcatcgagcg ggtgcacaag gtcaaggcca gccgggacgt 25080catcattgac cagctgcccg acctgatggg gctcaaagct gtgaatgtgg ccaaggtcgt 25140cggtgagtcc ggggggtccc aagccatggc tcagccatgc agacttgcat gaggaggaag 25200tgacgggtcc atgcctgggc ataagtgttg agctcaggtg ccccgacctg gggaagggca 25260ggacaggaaa ggtgacagta tctggccaag gacagatggg aagggaccaa gggagctgat 25320tagggagtgg ttatggacta ggaatgtcgg taacaatggt tagaaagtga ctaacatttg 25380ttgagcacct gctgtgtgcc cggccctggc cgggagcctt cgtgcccaca gtgaccccgt 25440ctgcaaatgt agttccttgc cctactcgca ctggggagca ggacgcagag ccgtgcaact 25500cacaggtgcc aagctcagga ctccctcctg ggtctgcctg ggctgggctg tgcttgttgc 25560ccctgtggcc cacgcatgtg caccttccac ctgaaagcca ggatcttcag gacgctcccc 25620gaggaggtcg ttgtctggca caatgatttg tctcttcctg aaaaggtgac agagttacac 25680tggagagagc agcatccagg tgcggcaggg acaggcctgg ggctcgcggg cagggactct 25740gtgtcctgcc ggggtcccac actgcacctg cttgtcagag gcactcagtc aatctttgct 25800gatgaaggat gagaggacag aggacgtgat gcttgctgct gcattgcctg cagtcctggg 25860tgagatgccc gggttgactc tgctgcccgt cgggtggatg tgatgtcaga tccccggctt 25920taaaatacga gggagctggg aattgaggga gcaggttggg gcagaaagca cagccccgtg 25980gaagcctgga gctgaggcag tgtgggcgac ccctggagca gtgagtgctt ccttcatggc 26040cttcatcgca ccctgcagtc ctcatgtagg ggatgccatc catgaattta gttttcccag 26100cctcctttaa aaacgcgttc atgctggggc cggggcagtg cagtggctca catctgaaat 26160cccaccactt tgggaggccg aggcgggtgg atcatgaggt caggagatcg agaccatcct 26220ggctaacaag gtgaaacccc gtctctacta aaaatacaaa aaattagccg ggtgcggtgg 26280cgggcgcctg tagtcccagc tactcgggag gctgaggcag gagaatggcg tgaacccggg 26340aagcggagct tgcagtgagc cgagattgcg ccactgcagt ccgcagtccg gcctgggcga 26400cagagcgaga ctccgtctca aaaaaaaaaa aaaaagtaca aaaaaaaaaa aattagtctg 26460ggtgtggtat cacgcgccta taatctcact actcgagagg ctgaggcgga gaattgcttg 26520aacccaggag gtagaggttg tagtgagccc gtatcgtacc actgccctcc acctgggcaa 26580tagagcgaga ctctgtctca aaaagaaaaa aaaaaaaaga acatttatgc caggtgtggt 26640ggctcatgcc tgaaatccca gaactttgga agactgaggc aggaggatca cttgagccca 26700gaaatttgag agtgtcttcc ctgggcaaca tagagagacc tcatctctac cagaaaaaaa 26760aaaattagcc cggcatggtg gcatatccct gtggtcccag ctacttaggg ggctgacgtg 26820gcaggatcac ctgagtctgg aggcagaggt tgaagtgagc tgagatcatg ccactgcact 26880ccagcctggg tgacagacag agaccctgtc tcaaaaaaaa aaaaaaaaaa aagcatttac 26940tatccaccat ggaaggtgag actgacctgt gagtgattgt tcaaagaaca aaaaataaac 27000cccagagata agacaaaagg gtgcctccat gggggtgtga tttaaagctg agaaattggg 27060cttcttcccc ctcccctctc accccgtggt ttgctaaagg agatgggaaa aaggattctt 27120tttttggctg aaatatttaa cactaaatta aagccaattt taacagcact ttggttgatg 27180agtgaaatta acagactggc caaaaataaa cgaacggtct gtactatgtg aaaaagaggc 27240agctttggcc atgctgggcc aatgtgagtt ttcagggttg ctgggaatgt ctgtgaatcg 27300gaggaagggc ctagctggga ctctcaggag ccaaggccct gaggggcaac ttgcctggtc 27360cctgccctga ggcgttcact gctttcttcc tgggccagat cacaggcccg gaggctggac 27420cactgggctg gcactcttgc cgagctgctc cctgacttcc tgaccatgct cctttcagca 27480gccttgctgc actttagttt ccttgaatga aaaatgggga tgagaatagc tcctacctcc 27540aaggtgaatg gagtgagttc ggacaggtga ctccctggga ccagtgcctg gcgcctgaca 27600aggtccagtc agagcccgca ctgctgttac tgataccctt ggctgtacca ggggagaact 27660tggttgccat tgccaggtgt tctcccacca cccccactac tgtccctgtt tgatgtgtgg 27720cgggaataaa gctgtgcaca ttggagcttt tggcacatcc tggctttcag gtgaaaggtg 27780cgtgtgtgtt tgagggttta gcctggccaa cccagccatg aggtcggacc tgacctgggg 27840gtgagtcctg agctcggcac ccctgagctg tgtggctcac ggcagcattc attgtgtggc 27900ttgggccgca cccctttccc tgctgggctg ttgatgttta gactggagcc tctgtgttcg 27960cttccaggaa ccaacccgtg tgcggacagg aacggggggt gcagccacct gtgcttctgc 28020acaccccacg caacccggtg tggctgcccc atcggcctgg agctgctgag tgacatgaag 28080acctgcatcg tgcctgaggc cttcttggtc ttcaccagca gagccgccat ccacaggatc 28140tccctcgaga ccaataacaa cgacgtggcc atcccgctca cgggcgtcaa ggaggcctca 28200gccctggact ttgatgtgtc caacaaccac atctactgga cagacgtcag cctgaaggta 28260gcgtgggcca gaacgtgcac acaggcagcc tttatgggaa aaccttgcct ctgttcctgc 28320ctcaaaggct tcagacactt ttcttaaagc actatcgtat ttattgtaac gcagttcaag 28380ctaatcaaat atgagcaagc ctatttaaaa aaaaaaaaga tgattataat gagcaagtcc 28440ggtagacaca cataagggct tttgtgaaat gcttgtgtga atgtgaaata tttgttgtcc 28500gttgagcttg acttcagaca ccccacccac tcccttgtcg gtgcccgttt gctcagcaga 28560ctctttcttc atttatagtg caaatgtaaa catccaggac aaatacagga agactttttt 28620tttttttttt tgagacagag tcttactctg ttgcccaggc tggagtaccg tagcgtgagc 28680tcagctcact gcaacctccg cctcccaggt tcaagcgatt cttctgcctc agcctcctga 28740gtagctggga ctacagacat gcaccaccac acccagctaa ttttttttat atttttagta 28800gagacagggt ttcatcatgt tggccaggct ggtcttgaac tcctgacctc aggggaacag 28860acggggttgg cctcccaaag ggcggaaata acaggggtga gccaccgttc ccggcctagg 28920aaaacttttt gccttctaaa gaagagttta gcaaactagt ctgtgggctg gccttctgat 28980tctgtaaaga aagtttgatt ggtggctggg tgcggtggct cacacctgta atcccagcac 29040tttgggaggc cgaggtgggc agatcacctg aggtcgggag ttcgagacca gcctcaccaa 29100cgtggagaaa ccccgtctct actaaaaata caaaaaaaaa attaaccggg catggcggcg 29160cctgcctgta atcgcagcta ctcaggaggc tgaagcagga gaattgcttg aacctgggag 29220gcggaggttg tggtgagctg agatggcacc attgcactcc agcctgggca acaaaagtga 29280aactccgtct cagaaaaaaa aaagtttgat tggtgtaacc aaagcgcatt tgtttatgga 29340ttgtctgtgg cagcttttgt tctgccgaga tgagttgtga cagatctgta tgggctctaa 29400agcctaaaac atgtgccatc cgccccttta cagaaaaagt gtgctgacct ctgttctaaa 29460gtattggaca actacaatgt ttgctcattt attattctat gatttgtttt ctgctttttg 29520ttgttgttgt tgttgttgag atagggtttc cctctgtcac tcaggctgga gtgcagtggt 29580gtaatttcag ctcactgcag cctcgacctc ctgggctcta gtgatcctct catctcagcc 29640tccctagtag ctgggactac aggcacacac caccactcct ggctgatttt tttttttttt 29700tttttttttt gtggagacag ggtttccgca tgttgcccag gctggtttca aactcctagg 29760ctcaaacacc cacctcagcc tcccaaagtg ctgggattac aggcgtgagc caccatgccc 29820agcctattct actgtttgta ttacatagct ttaaaagatt ttttatgact ttaagtcaca 29880agggttcttt gtagaaaaaa atatatatat aggaaagtat aaaaagaaag taaaaattgt 29940ccataacctc tccagccaga gacgaccgtt gctgacacct cagcatattg cctttaagtc 30000ttttttctct aagatagcat ttctcttcat cacagtcata tgctacgcag aattctgtat 30060cctgattttt tcacttgaca ttacaacagg tatttgatgg cgctgtgaca aactctttgg 30120cacaatcttt taaatgtatg aaatactcca ctgcacagat gtttgctttt aggcttaact 30180gttcttttat tttgcgtgtg ctggttacag ccgggcacag tggctcatgc ctgtaatcac 30240aacactttga gagggtgagg caggaggatc acttgagccc agaagtttga gaccggcctg 30300ggcaacatag tgagacccca tctctacaaa aaactttttt aataagtcgg gcgtagtggt 30360gcatagctgt agtcccagcc accaaggagg ctgagttggg aggattgctt gagccccagg 30420aggttgatgc tgcagtgacc tgagattact ccactgtact ccaacctgag cgacagagca 30480agacttgtct ggggaaaaaa aaaaaaaaaa tatatatata tatatatata tatatacata 30540tatacataca cgcacacaca cataatataa aaatatatat ttataaatat ataatatata 30600atataaaaat atatatttat aaataaaatt tataaattat atttataagt aaatatataa 30660tatataatat aaaaatatat attatataat atataataaa atatataata taaaaatata 30720tatttataaa taatatataa tacatactta taagtatata tttaaaatat atgtaatgta 30780tattttttaa tgtatgatat ataatataca tttataaata cacatttata ttattttata 30840taaaatatat ataaaatctc caagttgctt tttccaaaaa ggtgtcttgc tgcatttcaa 30900acattcattt aaaaacttga atgctggtga tctggtccag aatgtgttca gtagctgctg 30960ccagtggcca agcatctcgg gagatgtcta caaaacacgc tggttctggc ctggcgtggt 31020ggctcacgcc tgtaatctca gcactttggg aggctgaggc aggtggatca actgaggtct 31080ggatttcgag accagccttg ccagcttggt gaaaccccat ctctactaat aatacaaaaa 31140aattagccag gcgtggtggc atgtgcctgt aatcccacct acttgggagg ctaaggctgg 31200agaatcgctt gaacccaggg ggcagaggtt gcagtgagcc gagatcgcac cattgcactc 31260caggctgggc aagaagagcg aaactccgtc tcaaaaaaaa aaaaaaagat gctggttcct 31320aaaatgtggc ccttttcctc ctcacctgct gccagaccat cagccgcgcc ttcatgaacg 31380ggagctcggt ggagcacgtg gtggagtttg gccttgacta ccccgagggc atggccgttg 31440actggatggg caagaacctc tactgggccg acactgggac caacagaatc gaagtggcgc 31500ggctggacgg gcagttccgg caagtcctcg tgtggaggga cttggacaac ccgaggtcgc 31560tggccctgga tcccaccaag gggtaagtgt ttgcctgtcc cgtgcgtcct tgtgttcacc 31620tcgtatgaga cagtgcgggg gtgccaactg ggcaaggtgg caggctgtcc gtgtggccct 31680cagtgattag agctgtactg atgtcattag ccttgatggt ggccaggact ggtagggccc 31740tcagaggtca tggagttcct tcgtggagcg ggtgctgagg ctgtatcagg cacagtgctg 31800gctgctttca cctgggccgt ctcaccgaag tgtccatgga gcctgcgtag ggtgggtatc 31860tgtgtcgatt ttacagatgc agaaacaggc tcagagaaac cgagtgactt ccctaaggtc 31920acatacccag ttagagcaga gctgggccag gaagtgctgt ctcaggctcc tgaccaggtc 31980tccttgcttt gcactcttgc caaaaccatg atccagaact gactttgagg tccccggacc 32040tcaggctcct ccgaaatggc ctcttggagg ctgctgagcc acagcttagg acccacctcg 32100agaggcaaat gtgctttgag ctgccaggcg tcctgggggc cctgccttgg gcacggggtt 32160cagacaggcc ccagatgtgt ggggcgtctt tctggacttg agttttcttt tctgtgtggt 32220ggacacagtg ctcacccctt aaagcacctg tgatgtgtgc agcagcccaa tccctgcctg 32280tcgcctgttc tgctagggaa ggaaggaata cttcaggatg gcaggacaac agaaagaggt 32340ccaggtttta gagcaagggc aggtcaaact tagaaaattc tggaatgagg atgtgcattt 32400cctcttctgg atctgctaaa agaagaggga aggaggggct gctgggggag gagcccagag 32460ccgagtttac atccggatcc cgcaaggcct cccctgccct gaggtcttgt tttgtgatgt 32520gcttgtgtcc atcctggttt ctgccgtgtc cccaacatcc ggccaagctt aggtggatgt 32580tccagcacac actcaccctg tctgtgcacc tgtttttgtg tccgtaagtg ggtatttact 32640caccttacga gtgagccact gtgggaattc agggaggtgg cgcagtgacc acccctggag 32700ggatatgtgt gtggcagggg tcgagggtct cgcccttccc tgcttcctgc gcgtggcttt 32760ctccaggacg gggagggctg agctgaagag gtggggacag ttgcgtcccc ccgccaccca 32820ctgtcctgcg gtgagagcag actcactgag cctgcccttc tcccttgtgc cttccagcta 32880catctactgg accgagtggg gcggcaagcc gaggatcgtg cgggccttca tggacgggac 32940caactgcatg acgctggtgg acaaggtggg ccgggccaac gacctcacca ttgactacgc 33000tgaccagcgc ctctactgga ccgacctgga caccaacatg atcgagtcgt ccaacatgct 33060gggtgagggc cgggctgggg ccttctggtc atggagggcg gggcagccgg gcgttggcca 33120cctcccagcc tcgccgcacg taccctgtgg cctgcaagtt ccccaacctg gcaggagctg 33180tggccacacc cacgactgcc cagcagcctc accctctgct gtgggagttg tccccgtcca 33240cccctgggtg cctttgctgc agttatgtcg ggagaggctc tggtgacagc tgtttcctgt 33300gcacctgctg ggcactaggt cccagctaat ccctgtgcca ggactctaat ttcaccctaa 33360cacacatggt ggttttcatt gctggggaag ctgaggcctg agcacatgac ttgccttagg 33420tcacatagct ggtgagttca ggatccccca gagataccag ggccagcact cgatccccac 33480ccagccctga accccaccat gtgctgggat tgtgctggga gtgtccacac gcctgggacc 33540ccagggctgg tgctctcatc tcctttttcc agatcatgag aatgaggctc agggaagttt 33600gaaaaaaacc tatcccaagt cacacagcaa caggagcagg atttgaaccc agaaaagggg 33660accgcacact ctgttctgct agagtagtta gctgtcctgg gtgatatggc aggtgacagg 33720ggcaactgtg cttaacaaag gaacccccat cccccctgcc aagttgggag actagaaggt 33780caggggcaga agctctgaag ggccaggtgc agtggctgac acctctaatc ccagcacttt 33840gtgaggccaa ggcgggcaga tgatttgagc ccaggagttc aagatcagcc tgggtaatgt 33900agtgagacgc catctctaca aaaaaatttt ttaaaaatta gctgggcatg gtggttcatg 33960cctgtagtcc aagctacttg ggaggctcag gtgggaggat tgcttgagcc caggaggttg 34020aggttgtggt gagctgtgat catgccactg cactccagcc tgggcaatag agtgagaccg 34080tctccaaaaa aaaaaaaaga agaagaaaaa gaagctctga ggctccaagt ccccaggcac 34140cccttggctt gagggcagac aagggaggag agggtcacct gggcagccct gacttttgtc 34200ccctggcaaa gggaccttca gtgaccttgg ccctaggaga gcctctgagc acgtcagcca 34260tgtcgaaccg ctcaggaagg gcagcaagaa tttggcttct gacctctgcc tctcctactc 34320gccatctgca ctgggtgtgg ttgtgcccat tttacagatg aggaggctgg ggcatcgacc 34380agctgaatgc cttgtcccag gtactgcgta ggcagagctg gcagttgaac cccgtgtcct 34440ggttgtcgct gggggtgggc tgcaccctga cttgtgaggc cagtagcaag gtttgcacgt 34500gacttcgtga ccgtcaccca gctctgcagc acatcccgtg acccagctca tccaggccgc 34560atgcaaacct gttgccaggc gagaaaccag tcaccgcaca gctgtggttg cctgaaatga 34620ttaagctcat taatcacccc ggagtgagga cagactcaga tgaaaaccag caaaagccct 34680ggaaactcat gtgaccctgc caatgagggc ggccatgtgc attgcagcct ggccgtcact 34740cctcggtacg tgttttggac ttaaacgctc cggatgttta ctgagtgctt gattaataac 34800atggaaggcc tggtctcatt gctgtgggag tgaaggatgc acagccaggc ctgacatgat 34860gagaacaaga acctggagtc tcgctgcctg ggtggtaatc ctggccctgc cacttagcaa 34920ctgtgtgact gtagccaggt cacttaattt tgctagatcc tgcctgcgct tcagtggatc 34980ttgctggttt tccaaggtgg ccaaacactt taaggcattc atgtggtcgc taggctgcag 35040ggttgaaccc tggctcaccc cgcagggcgc cgtgtgctct gtggcctggc tgtgcctttg 35100ctgacaccgt gcccgtgtgt gttcatgcag gtcaggagcg ggtcgtgatt gccgacgatc 35160tcccgcaccc gttcggtctg acgcagtaca gcgattatat ctactggaca gactggaatc 35220tgcacagcat tgagcgggcc gacaagacta gcggccggaa ccgcaccctc atccagggcc 35280acctggactt cgtgatggac atcctggtgt tccactcctc ccgccaggat ggcctcaatg 35340actgtatgca caacaacggg cagtgtgggc agctgtgcct tgccatcccc ggcggccacc 35400gctgcggctg cgcctcacac tacaccctgg accccagcag ccgcaactgc agccgtaagt 35460gcctcatggt cccccgcacc tcactccctc gttagatcag gctggttctg ggagctgacg 35520ctgaaaggag cttctcatct ggggttcctg ggtgtacata gatggttggg taggttgtgc 35580actgcacaag ctgcatgatg ctacctgggg gtccaggtcc aggctggatg gacttgttgc 35640ttcatcagga catagataaa tggccaaaac tcctcagctg gaaggtcctg ggcaggatct 35700ttgggtgtga aaaccagtca caggggaagg gtgcttgctc atactgccag cacagtgctg 35760agtgctttcc atagcgctcg tttactcctc aagcctggag ggtggggagt agcatggtcc 35820catttcacgt acaaggaacc cgatgcacag agaggtgtgg caacccatcc aaggccatac 35880aactggggtg ggttgagccg gggttgactg tggcaggctg gctcaagagt ccctgctcct 35940gaacccttgc caggcagcct ggcatcagct cggggaattt ttgccctgac ccttggaagc 36000aagtgggcct ctttgttctc atgtcagtga tgagaagagt gactttccta tggcccctct 36060ggagtacagg tgtttcctgt tggcgggctc ttcccccatg acatcagcag cgagctggtt 36120atgattccct acgcagaact tgatagttta taaagctctt tgtcatccag gccccgttgg 36180agtctcacgc agacctggtc gcaggcgggg ctggtcttgc ctgtcccagc tgcatggatg 36240gggaacttga ggcttgcaaa ggttaagggg ctgttcgagg cccacgctgg caggagatgg 36300gcctgggcca gagtctggga cttcccatgc ctgggctgtc tttggtcctg ttgctcacca 36360tccctccctg gggccatgac cttagagagc caaatggagg tgcaggtaac ccacggcaag 36420gaggggttgc catgactcag agtccccgtc ctgtggccgg cagtacctgg tgcaacgact 36480tggatttcag accagccact gtagcccgct gacggtgcgc tcgaagtgcc acagcttctg 36540aagccaggca ggactcaggc caggagactc tgttagctgt tgagagggag aggccaacgg 36600atgttctggt tctgctagag agctggttct tcggatcctg gtaccagtgc actgagagga 36660ggcccagctt gattctgggg ctgccttgtg gtggcatgtg ctgctcactg acaccctcga 36720ggagtgtctt ctctcgggct tgttgactgt gcccggtttt ccgcagttca ctggtgcaca 36780cataggcaca tagcaaaccg cacacacagt cgtgggtatg agtttcacta cattccacca 36840ccagtgttca ctaccattac ctgccttccg tcttaagtgt tcatcattta aaaataaatt 36900tattgggctg gacgcggtgg ctcatgactg ttatcccagc actttgggag gctgaggcgg 36960gcagatcacc tgaggtcagg agttcaagac cagcctggcc aatatggtga aactccatct 37020ctactaaaaa tacaaaatta gctgggcatg gtggggcatg cctataatcc cagctactca 37080ggaggctgag gcaggagaat ggcgtgaacc cgagaggcag agcttacagt gagcccagat 37140agcaccactg cagtccagcg tgggcaacag tgcgagactc catctcaaaa aaaaaataaa 37200taaataaaag aaaaataaat ttatgatcta tttcaaaaat aacacatgta ctttgaaaca 37260gcagagacac atatgacacg gagaatgaaa ttccccatag cgcaccccca agagacagcc 37320ctggtccccc cgtctttccc gtggacctcc agcggggcag atgctgagcc gcctgttgtc 37380gagtggcatg ctatcccgtc ctccagctcc tctgtggctt acagacaccc acctgcagcc 37440ctgtctttgc ctcctctagc gcccaccacc ttcttgctgt tcagccagaa atctgccatc 37500agtcggatga tcccggacga ccagcacagc ccggatctca tcctgcccct gcatggactg 37560aggaacgtca aagccatcga ctatgaccca ctggacaagt tcatctactg ggtggatggg 37620cgccagaaca tcaagcgagc caaggacgac gggacccagg caggtgccct gtgggaaggg 37680tgcggggtgt gcttcccaag gcgctcctct tgctggtttc caggctgctg cccctgtcct 37740tagcagaggg aggaaacaga ggatggctct gggtgaatga tgacttgggc ttcgattatg 37800tagtcacagg gtatgaccct gagatgcgtg gaaccccgag actgtgatta tatgtagaaa 37860ctgggtttcc ccgttgttta agtagtcatg gtggggtcag accccacagg acttttgtct 37920tttcaagaaa gaaaatggtc gtgtgtcatg caggggtagt tggtactggt taatccaggt 37980ttatccttta ttttgtggga actgtacagt catttctgct acaatgctgt atatgctctt 38040ctgaaagaca cctatgcaaa atcgcacagt aaaaatgaca caactcatag ggaaagcggg 38100gccagggcac agccctcaaa atctccatca atgacatgta agaaaagaga ggaacctggg 38160aaatagcaaa gtgccttttg cacattaaat ggttagctat atcccacaat actgtgcatt 38220cgtaaacgtt aatgctgcaa taaatacggc acttcacctt gggaagatct ggagttggct 38280tatgagtgtg gaagggtgta gcgcatgagt ttttgtgaaa cactggaagg aggattgtgg 38340gaaatcaaat ggaaagttct caccccaggc gtggagaaga gtgggtcatg gccccagcag 38400tgagcccagg gaggtcagag acggaggtgt gtgtgtgggt gtgaccctgc gcagttccct 38460gccggctgta gttttttgca ttcgcttaat gtttctcgtg gaggaaattg tgcatgagca 38520aatgtgaaac cgtgctgtgc tcaaattgtc ctaatacatc attgcattgg aacagattgg 38580ctttnttttt tttttttttt tttttttttt tttgaaatgg agtctcactc tgtcaccagc 38640ctggagtgca gtggcatgat cttggctcac tgcaaccttt gcctcctatg ttcaagtgat 38700tttcctgcct cagcctcctg agtaactggg attacagggc atgagccacc gcggccggcc 38760agatttgcat ttttgaaaca actgctaggc tgggcgcggt ggctcacacc tgtaatccca 38820gcactgtggg aggccgaggc aggtggatca cctgaggtca ggggttcgag accagcctgg 38880ccaacatggt gaaaccccgt ctctactgaa tatacaaaaa tcagctgggt gtggtggcgg 38940gtgcctgtaa tcccagctac tcaggaggct gaggcaggag aattgcttga acccaggagg 39000cagaggttgc ggtgagccga gatcacacca ttgcactcca gcctgggcaa caagagcaaa 39060actccatctc aaaaaataaa aaatagaaaa acaagtgctg tagcggaagt gagcactttg 39120cggagtcagg cttgtgtggc ctgttccaca aatgatgtgc tcacggtggc ctcaggccca 39180cctggagtct gcagcatggg gcacaacagg ttcattagtg tagaattcca ggacaggcct 39240ggctcctaag cagccttctt ttacaaaaac tgcagagccc gcctgtatcg tagcactttg 39300ggaggccgaa gtgggtggat cacgaggtca ggagttcaag accagcctgg ccaacatggt 39360gaaaccccat ctctactaaa tatacgaaaa ttagctgggt gtggtggcac gcgcctgtag 39420tcccagctac tcgggaggct gaggcagaat tgcttgaacc tgggaggtgg aggttgcagg 39480gatctgagac catgtcattg cactccagcc tgggcaacag agcgagacgc catctcaaaa 39540aaaaaaaacc tacagagcca cacggcctct ttctccaccg agtgttggtg tgggagcttg 39600tgttattgtg gtgaaatctt ggtactttct tgaggcagag agaggctgag cgcctggaga 39660gactttcaca tgggtcgcca tgtccgccgt cggtttcgct gttgtgctcc ccatctgaag 39720gctggtgccg tccagacagg ctggacgccc ctttccacca gatccttcct cccgcagcag 39780tttctagtta cgttgtactg tgaggtctgt gtccttggtt gatggcaaaa gtcagccgaa 39840ttgaaattca gagccatgcc tggctccctg gagcttctct cctgggcagc tgtgatcatt 39900gcctctgctg tggtgtgggt ggtggaaatg gattcctttc atcttgcttg ctacaggtga 39960ctgtcacgtg gagtcctttg gagagaggga cgtgttaatt gatggatgtg gctcccatgc 40020tgagaaagct cctgggcgta cattgcctta gagtttcatt ggagctgcgt tcttttatgg 40080tgtctgctag gcagaagtga tgaagacttg gaagaaaacc cagaaggttt tccacttaat 40140ttggaaaatg tgcttttccc ctcctgtgtc ttttgctaag gtccagcctc ctgcagcctc 40200cccgctctgt ggactctggc tttgattctt tattaggagt ccccctgctc ccccaaaaga 40260tggtgtctaa attatcatcc aattggccga ggttttgttt tctattaatt gtttttattt 40320tttattgtgg taaatttata taacataaaa tttgccattt taattgtttt gttattgttg 40380tttttgagac agggtctcac cccagtgccc aggctggagt gcagtggtgc gatcatggct 40440cactgcagcc tcagcctcca gggctccagt gatcctctca cctcagcctc tctagtagcc 40500gggactacag gcatacacta ccacatctgg ctgatttttt gtattttttt tttattgtag 40560agacccgcta tgttgcccag gctggtctca actcctggac tcaagccatc ctcccacctc 40620accctcccaa agtgctggga ttacaggcat gagccacaac acccagccat tttaattttt 40680tttttttttt ttgagatgga gtctcactct atcgcccagg ctggagtgca gtggcgtggt 40740atcaactcac tgcaacctct gcctcccagg ttcaagcgac tctcctgcct cagcctcctc 40800ccgagtagct gggattacag gtgcccatca ctatgcctgg ctaatttttg tattttttag 40860cagagacggg gtttcaccat gttggccagg ctggtcttga actcctaacc tggtgatccg 40920cccgcctcgg cctcccaaaa tgctgagatt acaggtgtga gccaccgtgc ccggcctttt 40980tttgtttttg agacagggtc ttgccctgtc acccagactg gagtgcaatg gtgggctctt 41040ggctcactgc agcctccgcc tcccaggctc aagttgtgca cctccacacc tggctaactg 41100tattttatgt agagacagat ttcaccatgt tgcccaggct gggcttgaaa tggactcaag 41160cagtccaccc acctcagcct cccaaagtgc tgagattaca ggcgcgagcc accgcaccca 41220gcccatttta cctattctgc agttgacagt tcagtggcat tcagtcagtt cacgaggtaa 41280ccatcactgc cattcatctc cagactactt caccttctcg gcagatgtcc gaaactgtcc 41340gcattgaaca cactcctcat ctccctctga cagccaccat tctactttgt atctctctct 41400gccttctcta ggtacctcat gtaagtggaa ttataccaat atttgccctt gtgtgactgg 41460cttctttcat gtgacatggt gtcctcaagg ttcatctgtg ttatagcctg tgtcagaatt 41520tccttcctta aagcctgaat aataacccgt tgtaaaggct gggcgcggtg gctcacaccc 41580tctaatccca gcattttggg agtccgaggt gggcagatca cttgaggtca ggagtttgag 41640accagcctgg ccaacatagt gaaaccctgg ctctactaaa agtacaaaat tagctgggtg 41700tggtggcgcg cacctgtaat cccagttact caggaggctg aggcaggaga atcgcttgta 41760cccgggaggc agaggttgca atgaaccaag attgtgcctc tgcagtccag cctgggtaac 41820agagtgagac ttcctgtctc aaaaaaaaaa aaaatcatcg gatggatgga cggaccactt 41880cttgttattt atccatccac gggtgctagg tttcttccac ctttggttgt cgtgaataag 41940gccactatga acatttcctt ccgtggtgaa ggttttgtac tagtgaggaa aaggcgtgtt 42000tgtggtgttg cataggattc tggtaagaaa gtttgcacta accataagta tttgtactac 42060attaaaatga aagctcaggg gccgggcgcg gtggctcacg cctgtaatcc cagcactttg 42120ggaggccagg gcgggcggat catgaggtca ggagatcaag accatcctgg ccaacatggt 42180gaaaccccgt ctctactaaa aataccaaaa aactagccag gtgtggtggc gggcacctgt 42240agtcccagct acttgggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt 42300gcggtgagcc gagatcgctt cactgcactc gagcctgggc aacagagcaa gactccgtct 42360cacgcaaaac tctgtctcac gcaagactcc gtctcaaaaa aaaaaagagt tcagggttta 42420tgaaactggc cagccgcgta aagtttgctg tgttgttttt gtgcccggga ggagtgtggc 42480cagggtgtca cgtcacacag tacacgtttc tcagatggtg gttctccaga ctgctgtccc 42540aaagtctgtt tttgcatctg gttcccacag acccaccctc cacggtgagc ctgattttgg 42600ccagggtagc tggaatcttg cttgtctttc agcccggcag ctgtaccagt ccagggtcca 42660cagctagtgg cttttaggaa ggaatttgtt cagttggctt tgacacatgg ccccctaggg 42720tccacagctc tgtagtgatg tggatgttgt tatctacaaa gacacatgat ccttcgtgtc 42780cagatgaaag tgatgatgtc tttgcagctg cccagcaagg ctgtgtgtgt gtgtgtgtgt 42840gtgtgtgtgt gtgtgtgtgg tgtgtgtgtg gtgtgtgtgt gtgtatgggg gagggaggca 42900ccctttccat ctgggggtgt gtgtgtgtgg ggtgtgtgtg tgtgtgtgcg cgtgtgtgtg 42960gtgtgtggtg tgtgtgtgtg tatgggggag gcaccctttc catctgggtc caagagactg 43020ggcctgggga agacgcttct ttttatctac ttagagactt tgttttattt gtattttttt 43080gagacagggt ctcactctgt cacccaggct ggggtatggt gatatgagca tagctcactg 43140cagcctcggc ctcccaggct gaagcgatcc tcccacctca gccttctgaa tagctgggac 43200tgtaggcgtg cgtcaccata ctgagctatt gttttttttg tttggttggt ttaatttttt 43260ttgatacaga tggagtcttg ctatgttgcc cagactagtc tcaaactcct gaactcaagt 43320gattctccca cctcagtttc ccgacattct gggatcacag gtgtgagcca ctgctgtctc 43380cctgttttat taactgctga aagacctaga taaagaaagt ctgaaaagac ttactatcag 43440agcaccatcc taagatgatt ccctctgact caatggagag ggaggggagc ttttccttca 43500ggcctgggtg gcaggagccc aggtgctcca ggccccattt gccccaggcc aaatcactcg 43560ggaacttgga tgcagctgtc tttcagggta acccaaagga accagatccc cgcaggcagt 43620aggcttctgg gctgtcctct cctcctacgt cagctcagta agagcccttc gaagggatgc 43680tgtgtcggag gccccaaaag cccaggctca tccctgagat gcacagggtg ggctgggctt 43740aggcagcgct cgagcatctc ctggacggtg accccagaga gtgtggagac ggagagtcct 43800tgagagtcac tgagagacgt ggctgccctg ccttcccaag aggggctctg agtcattccc 43860cacactcacc tgcccctacc caccctcacc tggcccccag cctcacctac ccccacatct 43920gtaccgatcc ctttacccgc accttcccta cccaccctca cctcccctgt accttcacct 43980cccccactca cccgcccctg caccctcacc tgtcccccac cttcacctaa cccccaccct 44040cacctgccct cccctcacct ggcctccttc cgttggggaa ggggttgtaa ggggcggccc 44100ccaaactgtc tgtcctggtg ccctgcagag aaaacagtac gtgagggccg cagtccaaaa 44160gcttgagtcc tggaaggtgg aggagacagg gatgtgttgg gaagggcccc atggtcttgg 44220atcccttctc gactgtcaat ggggccttca tgggagcgcc agtctagtga tgcacagctg 44280ggtgcccggc gggtggctga ggaggcctaa agtccgaggc ggcaagagct cttccagagg 44340ctgttgtcct aatcgctctg gcatactcag gcgggcacgt agttaggagc tgattggaga 44400ggagagaccc ccacaccaat actgggattt gactttcagg ctaaacttga gaagtgtggc 44460ctctgctgtc ctgccagagc tctccagcca gtgcccaggg ctctccagcc agtgcccggg 44520ggtctccacc agtgcccggg ggtctccgcc agtgccaggg gtctccgcca gtgcccaggg 44580gtctccgcca gtgctcagga gtcttggttt ctttgtctta cagccctttg ttttgacctc 44640tctgagccaa ggccaaaacc cagacaggca gccccacgac ctcagcatcg acatctacag 44700ccggacactg ttctggacgt gcgaggccac caataccatc aacgtccaca ggctgagcgg 44760ggaagccatg ggggtggtgc tgcgtgggga ccgcgacaag cccagggcca tcgtcgtcaa 44820cgcggagcga gggtaggagg ccaacgggtg ggtgggggtg ctgcccgtcc aggcgtgccc 44880gccgtgtctt ctgccgaatg ccagcctctc acaggctggg gagactttcc accctgggga 44940tccaatgggt ggctttccag ggtcccaaaa gcaaacacag gctctttcac agcccctcca 45000ggaaagcaga aagccccaag ggctggaagg gaagggggag ctctgctgag aggttacaag 45060gcagcgctgg ccgacgggag ttgcagttga taggttttgt atcatccttg ttaaacttga 45120accctgtgca gaaatccctt ccacggcatg ggggctgcct gttgactcgc tcctgttcca 45180ccacagggag ctcctgggct tcttcctccc agaggccccc gacgctccca cctgttggtc 45240gtcagagctt ctggttggtg ggaaggcacc caggaccttg aggtctccag agagaaaagc 45300cagggaaaga gggagaccga aacccatgtg acatgaaact caggctccaa actgagcacg 45360ggaacgtttg gggacaggag cgcgatggcc ttcctcagat agctgggggg ctggcatgaa 45420gacgggagct acagccagca caggtcctgg gccgggagcc cagagattga gccctgactc 45480tgtcacttac tggccacgtg accttgggcg ggtggcatag cctcttggag actcagtttc 45540ctcattggta ggagtgacgg ccacagtggt gcggcctctg cagcacacgg ggggctcggt 45600gggcggaagc cccgggtcta taaggcggct gtgcaggagc cagccgagct ggtctcccaa 45660cagccagggc tccggggtcc ttagcagctg tggggggcct gcacctgttt cccatggctg 45720ctgtcagaaa ttaccagaag ccaggtggct gagagtaatg gacacttgtt ctctcacagt 45780tcctgagggc tgaagcccga gatcgaggtg tgggcagggc cctgcgccct ctgaaggctc 45840tgagggaacc tttgggcttc tggtggctcc aggcacccct tgacttgtgg tcctgtcact 45900ccagtctctc tgtctggctg cacatggcgt ggcctcttct gtaccattga aggacacttc 45960agttggattt agggcctacc ctcacccatt gtggtcgtat cttgatcctt catgacattt 46020gtaaagaccc tgcttccaaa taagctcaca ttctgaggtt ctggggtgag cgggaatttg 46080gagagcattg ttcaactagt atagaatgtg acctgtcagc ctcgggcagc cctgagaggc 46140aggggctttc cacagcccag ctgggtgccc tgggctccgt gctgtccgag gagacgccat 46200ccccacaccc gtccttcacc cgccaccctc ccgcaggtac ctgtacttca ccaacatgca 46260ggaccgggca gccaagatcg aacgcgcagc cctggacggc accgagcgcg aggtcctctt 46320caccaccggc ctcatccgcc ctgtggccct ggtggtggac aacacactgg gcaagctgtt 46380ctgggtggac gcggacctga agcgcattga gagctgtgac ctgtcaggta cgcgccccgg 46440ggcctgccct aaccgcagac acccggcctt cattgtcagt aatggcagca gctgccacat 46500tgtccgagac ctgccgtgag cccagtgccg cgccaggggc tttgtgtgta gcgtgttttg 46560tcctcacact gacagctgta ggctggggtt ctgagtgagc cccacagggc agaggcagaa 46620aatgagtctc agagagggtg agcgagctgc ttggggcccc acagcaggag atggagcagg 46680actgcagcct agcctctgcc cccagcacct gcgcaagaag ctgctctgct ctggactgtg 46740ttaggctgcg agggctggag agaaatgaga gttggtgctt agagaggggg cgcaggtccc 46800catggctttt cctcttatga tgaggtagat gggtgaaggg aggggccatg cttgcagggg 46860ccagtgaccg aggcccgccg ttggaactga tggccttcat cccgagccca gcccaggtgg 46920gagcagggct ttccgagggc ttgtcttggg tcggcctgct tccagggact ctgctgcagc 46980tcccacccct gtccaaagca tggaatcccc caggctccct ggcagtcctg tcaacctctg 47040tcctcccaag ctgagtgtgg ggcaagttct ggaggtcagc actgctcagg ggggcccacg 47100ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 47160gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 47220gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 47280ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggctggcag 47340tggggtgggc agggtggcct ctaaacccga cccctggagg aggctggagg ccagtgcaag 47400atcctgtgtg gcctcagcca ggcggtggtc tctgccagat gccaactgtt gcccgctggg 47460gttcagcgac atgtccgaat gtcccgaggc ctctgaggtt gttttctttt gccgcagaac 47520aaatcaccac gaacagcgtt ttaagacaac accaactctt tttttttttt ttttttttga 47580gtcaggatct tgctctgttg cccaggctgg ggtgccctgg tgcaaacaca gttcactgca 47640gcctcgacct ctgggcttaa ttaagtgaac accttgcctc agcctcccag gtagctggga 47700ctacaggtgg gcaccaccac acctggctaa tttttttttg tagagacggg gtttccccat 47760gttgcccagg ctggtctgca actcctgggc acaagctatc tgcctgctgt ggcctcccaa 47820agtgctagga ttataggtgt gagccactgg cctgacaaca cccacggatt gtctctcagt 47880tctgtaaggc aaagtccagg cacagcgtgg ctcacctggg ttctctgctc agggtctcac 47940ggggccagaa tcaaggtgtc aggaacgctg ggccctcagc ggaggctctg tggagaaatt 48000agcttccttg ctcactcagc aggtagcagt tgtgggatcg aggttctgtt ttctctctgg 48060ttattggtcg gggaccactc tcagctccta gaggccaccc caggtccttg ccccgtggcc 48120ctctctgcct cagcagtggg ggctccctgc gtcagtccct cccgcacctt gagtctctct 48180gatttgcttc taaagggccc tgtgattcgg ctcagccacc tttagattag gttagcctcc 48240cctttgatag actccaagtc ggctgattaa taaccttact cacatctgca gaatcccttc 48300tgccacataa ggtcatgacg ccgtgctggg gactggggtg ggaaattacg gggtcattta 48360ggattctgcc tgccactgcc ttgctgtgtc ccagggcttg ggggaggggc ctccacagct 48420gggaccacag tccttcctcc cctccatggt aaccatctga ggattacttg agaccagcct 48480gggcaacatg gtgagaaccc atccctacaa aaaatacaaa caaaaaggga ccaggctggg 48540cttggtggct catgcctata atcccagcac tttgggagac caaggtgggc tgatcacttg 48600aggttgggag ttcgagacca gcctgcccaa catagtgaaa tcccgtctct actaaaaata 48660caaaaattag ctgggtgtgg tggcaggcgc ctgtattccc agctactggg gaggctgagg 48720tgggagaatt acttgaacct gggaggcgga agttgcagtg agccaaaatt acgccactgc 48780actccagcct aggcaataga gtgagactcc gtctcaaaaa aaaaaaaggg ccaggggtgg 48840tagtgacaaa gagaccctat cccaaaaaaa ccgaacactg aatccttgag actgagtaag 48900gacactgtga aatttttctg ggtggggcag ggaacagagc gtcttctgtc atttcttcca 48960cctgggtgtg gtcagctctc cctccaagct gcctcctctt cttctcattg tccgggtgtt 49020ggacacattt ggttaactgg atagaataac gcgagttccc agggacttgg tccatttgct 49080attttatttt attttatttt attttatttt atttatttat ttatttattt atttatttat 49140tgagatggag tttcgttttt gtcgcccagg ctggagtgca gtggcgcgat ctcggttcac 49200tgcaacctct gcctcccagg ttcaagtgat tctcctacct cagccttcca agtaactggg 49260attacaggca cccaccacca taccaggcta atttttttgt atttttagta gagacgggtt 49320ttcgccattt tgcccaggct ggtcttcaac tcctagcctc aggtgatcca cgcacctcgg 49380cctcccaaag tgctgggatt acaggcatga gccaccacgc ctggcaccat ttgctatttt 49440aattcccatg tgtattagtg tcccacggct gctgtaacaa atgaccacaa actggatggc 49500ttaaagcaac agaaatggat tcccccaatg tgctggagac cagaagcctg cgaccaaact 49560gttgggaggg ctgtgcttcc tctgggggct ccagggagga tctatttgtt ggcccttcca 49620gtgctgtggg tgccagcgtt ccacacttgt ggatgcgccg cctcaacctc tgcccatctt 49680catgtgtcca tctcctttgt gtctgcgtct ttacctcttc ttcttgtctg tgttgcctct 49740tataaggacg tttgtcattg ggtttagggc ccacccaaat catccgagat gacctcgtct 49800tgagatcctt aacctgcaaa gacccttttt ccaaaaaaag gttatgctca cagattctag 49860gccttaagac atgggtgtat ctttctgggg ggcactatcc aaccccttat acaatgaaag 49920acgggaagag ggccaggtgt ggtagttcac gcctgtaatc tcagcacttt aggaagctga 49980agcgggagga tcacttgagc ccaggagttt acaagtagct aggcaacatg atgagacccc 50040atttctacaa aaagtaaaaa aaaaaaaaaa aaaaaaaaag ccaggtgtgg tggctcacac 50100ctgtaatccc agcactttgg gaggctgagg caggcagatc acgaggtcag gagattgaga 50160ccatcctggc taacacggtg aaaccccgtc tctactaaaa atacaaaaaa ttatggccgg 50220gcgcagtggc tcccgcctgt aatcccagca ctttgggagg ccgaggtggg tgaattacaa 50280ggtcaagaga tcgagaccat cttggctaac acggtgaaac cccatcaaga tcacaaggtc 50340aagagatgga gaccatcctg gctaacacgg tgaaaccccg tctctactaa aaatacaaaa 50400aattagccgg gcatggtagc gggcgcctgt agtcccagct gctcgggagg ctgaggcagg 50460agaatggcgt gaacccggga ggcggagctt gcggtgagcc gagatcgctc catgccattg 50520cactccagcc tgggtgacag agtgagactc cgtctcaaaa aaaaaaaaaa aaagaaaatt 50580agccaggcac agtggcaggt gcctattgtc ccagctactt gggaggctaa ggcaggagaa 50640tggcatgaac ccgggaggtg gagtttgcag tgagccgaga tcatgccact gcgctccagc 50700ctgggcgata gagcaagact ctgtctcaaa aaaaaaagcc aggcatggtg gtgcatgcct 50760gtagtcccag ctactcaaga ggctgaggca ggagggttgt tcgacccacg gagatcaagg 50820ctacagtgag ccatgatcgc accactgccc tccagcctgg gtgacagagt gtgaccctgt 50880ctcaaagtaa gtaaatagga ggagagacaa gtgggcagtt cagactgatg gtatgggcac 50940agtagagact ggtgcagaca ggctggcctg tgatgtcaag caacttctgt aactgtttcc 51000ggcatccatt tgtgtgtcaa tttccgtgtc agtaggaaga ctctgtaggc tgccaagagg 51060aataagtggg aggatcctcc cagagaggcc gggcctgcag gagggccagt tctcatgagt 51120tcttatttgg cccctaccct ccaggctgtg gttctgaggt gggagacaga gcctgacctc 51180tgtttgtctt gttttgtctt tgcagcagcc cacccatgtg cccgtgacaa tggtggctgc 51240tcccacatct gtattgccaa gggtgatggg acaccacggt gctcatgccc agtccacctc 51300gtgctcctgc agaacctgct gacctgtgga ggtaggtgtg acctaggtgc tcctttgggg 51360tgatggacag gtacctgatt ctctgcctgc taggctgctg cctggcatcc ttttaaaatc 51420acagtccctg tggcatccag tttccaaagc tgattgtgtc ttcctttgcc ctcctttctt 51480ttctactatg tgcattcggt gctatgaatt ttcctctaag tactgcgttt cctgcatctc 51540acaaattttg ttacattttc attttcaggt agtttgaata tttttacact tctcctgaga 51600tgacatcttt ggctcatgtg ttatttagaa gtgttgctta gtttctaaag agttggggct 51660tttccagctg tctctctgca actgatttct aatttaattc tactgtagtc tgagagctta 51720ttttatatga tttctgttat tttaaatgtg ttgggtgtgg tgtttttgtt gttattgttt 51780ttgtgtcttt ttgttttgtt ttgcttcgtt tgttttgttt ttgagacagt gtcttgctct 51840gtcactcagg ctggagtgca atggcgcgat ctcagctcac cgcaacctct gcctcccggg 51900ttcaagtgat cctcttgcct cagcctcctg agtagctggg attacaggtg cacgccacca 51960tacccagcta atttttgtat ttttagtaga gacggggttt caccatgttg gtcaggctgg 52020tctcgaactc ctgacctcgt gatccgccca cctcggcctc ccaaagtgct gggattatag 52080gcgtgagcca ctgtgcctgg ccattaggtg tgttttatca cccagcatca tgcagtttat 52140cttggtgaat gttctgtgta ctcttgaaaa gaatgtggat tctgctgttg ttgggtggag 52200tgttccagaa acatcaatta gatccagttg gttaatagtg ctcatcaggt tgtctctatc 52260cttccttcct gactgcctgc ttgagctgtc agttattgac aggggtgtgg agtctccaac 52320tctaatggtg gatttgttta tttctcctag tagttctatc tttttctctc cttctaccct 52380tgatcctctt ctccccctag ggcttcctgg tgttggtggt gggagagtgg ggtagtgaag 52440aacctggact ttagggccaa agaggccagg gttcaaatcc tggctctgtc acttcccagt 52500tgagtgaccc tggctggtgc ctgaatctct gtgagcctcc acttcctcct ctgtgaaatt 52560gagagcactt acctggcagg ctgtcatggg catcaagtaa cagggcactc cacctggacc 52620ctgacacgtg atgcacagga atgccagctg ctatgccatg ggtgtggcag tagtaataaa 52680gtgaccatct gtatcctcac cacagtgaag cctgtccagg gctttctctc ctatgccccc 52740atgcctccag gtggccttgg atcctgttgg ttctgtgctc tgctcagcga cctttctccc 52800gtgggagttc ctgggggttc agcttcatcc tacagacagc agcacacact ggctgtgcac 52860cctttttttt tttttttttt ttttttttga gatggagtct cgcttttttc gcgcaggctg 52920aagtgcagtg gtgtgatctt ggctcactgc aacctctacc tcctgggttc aagtgatttt 52980cctgcctcac cctcccaagt agctgggatt acaggctccc accaccacgc ccggctaatt 53040tttgtatttt cagtagagat ggtgtttcac catgttggcc aggatggtct tgaactcctg 53100acctcaggtg atccgcccac ctcagcctcc caaagtgcag ggattacagg cgtgagccac 53160cacacccgga gtgccggttg tttttagcag tttgtcttgt tcctggagag actggctcct 53220gcccaggagc tcggggagta gggccgcggg gtgctgcctc acacctcgag tttggccgta 53280agcagagggg acattttgtg actgtccccc tcctgagctt cccagcagct tttctccaag 53340ttacagccca aaagctcagg tggatttgca acccaacggt gtctgtgcac ctcccactga 53400tgcccgaact gccctggcca agaaacgggg ccgtcagaac gctgcactaa ctgcagcctt 53460gggcctccat gccagaggcc atgcccttcc atccaccacc ccctggcctg ggccctggcc 53520ctcctggctc gggaactcca ggccccttcc tcacggatcg agagacgtgt atttaccgca 53580caggtgcttg tcattctctt gtggcctctt ctccagggag atcacagaag gacagggcct 53640cactgaggtc tcggacatgg accctttgat agtggcagga gccaggctgg gcaagaggcg 53700gccacagtca cctcagcagt gccatcacca ccgccattca gcccttccct gagccgggcg 53760cgcccctggc tctggcccca gtgtcccagt tacagctcac aggagcttgt ggtgcccagc 53820ggctgcttct gattgagagt cgaggtcgga ggctttggga ggctgagagg ctgctcggtt 53880tcacaactgc tgagggagac ttgggctcca tctcaggtct gccccatgtc gccctcaacc 53940tccagccacc ggtcctccgt gtcccccatg gccaggcacg gcttgcagac atctgtcgtt 54000ggctcctctc agccgtcgtg ggctgaccct ggcacgtcct cctgtggctg agcccagtgg 54060ggacagctgc ttccttttat taccctagaa ctctcgtctt tgatcaggcc ccctccccta 54120tgccacacag tccctgtcac tcgggtgagc ccagtagtca tggggaaggc ctgcgggttc 54180caaacatcca aaggcttgcg tgcagcatga cagcttgaaa ccgatgtttt ttaccttgat 54240cagatttcag cttggcgggg gctttgctca gctttcagtg aggcctgggc cgatttccca 54300gcatcccctc ctgaggccag cctctgtttc ctgtgatttt ctgcacaaag tgggagggag 54360gagtcttagg aaatgggggg ccacctcgaa acctaggcct cctctggctt ctctgtgcca 54420gtgcccccac gctttgtgtc tgtgtcccca gcccatggga ctgtgttatt ccctgagtgc 54480tgccgcatgc ccagcccgca ctgaggacgt ggagccccga ggggcaggat ggcctccatg 54540gtcacacgta ggaagtggcc tccaccctcc gatgatcctc tccccccctc cctttcagcg 54600ccttccccgg gggtgtcatc agccctcctg cctgtgcttt gtcccgtctt ctgcaggcgc 54660atgggacgtg ctgacaggtc ctctgccggg ttcctgcctt gctatgcgca cgctggtcac 54720cacagaggcc tggcccttct tctgtagcag tcccacaccc gcaacaggtg tggctgctga 54780ccacctgctt tctgcccctc tggtcctgag gagggcgcag tgggcactca ggcgtggctg 54840agcagatgtg tgttgccggg aggaggaagg actgctccag tcagggctga atttcccacc 54900cggagcattt ctgctgtatt tggtgtagcg cctgctgctt aaagctctga ttcccagttg 54960gcaccctttc ccttctgcat tgaaaaacat acggatgcat gtcttcttgc agtgaatgtg 55020tattctccca gcctctcttc tgggttgggg ctggaggtgg agcggcacac aggagccgca 55080gcgatggagg atgtgcgggt gcagcacccc gtacagcagg gatgccaaac ccgcgctgag 55140tccctctcaa cttctgcttt gaagcccagt cacgccattg cctgggtttt gctgggcggg 55200gctgcatgtg atgttctcct ctgtccctcc cccagagccg cccacctgct ccccggacca 55260gtttgcatgt gccacagggg agatcgactg tatccccggg gcctggcgct gtgacggctt 55320tcccgagtgc gatgaccaga gcgacgagga gggctgcccc gtgtgctccg ccgcccagtt 55380cccctgcgcg cggggtcagt gtgtggacct gcgcctgcgc tgcgacggcg aggcagactg 55440tcaggaccgc tcagacgagg tggactgtga cggtgaggcc ctccccgtca aggctctgcc 55500aagaccctgg ccctgccctc cgggatacga gcttggggct gcctccggcc tcacaggagt 55560aggggctctg aaaacctttg cttgcaggga gattgccaag tctgtctttt aggcccaaca 55620aggaaaactc tgcagttcca cccatcctgt cccaccaggt agtgtggctt gaaggcagac 55680tgtgagggtc tatctcacct tcctgcatta ggtcaggagt ttcacagaaa cctgaggcac 55740attcaggggt gggctgcaga ggtccatggc tcacaccctg gaaaatccgc ccccaaaaga 55800cagtgctgtc tccactgacc agtctgtggg atagtgctta agcctgagtg gtttctatca 55860acatgtagaa tcaggaggta taaagagatt tgctcaggca tcctgggccc tctctgacca 55920gcaggatctt cctttagatc ttgacagtga aacacatctc ttctgtgccc cctgtgagtt 55980ttctttcatt cattcattca ttcattcatt cattcattca ttcgagacag agtcttgctc 56040tgtcacccag gctggagtgc cctggtgtaa tctcggctca ctgcaacctc tgcctccagg 56100gttcaatcga ttctcctgcc tcagcctccc gagtagctgg gatgacaggt gcgcaccacc 56160atgcctggct aatttttgta tttttagtag agacagggtt tcaccatgtt ggccaggctg 56220gtctcgaact cctgacctca ggtgatccgc ccgcctcagc ctcccaaagt gctgggatta 56280caggcatgag ccaccgcgcc cggcctgagt tttcctttta tgaaggacct gcttggttgg 56340ttgcctgcca catgttgtca gcaccatggg cccaggactg ctgaggagct gttgatgccc 56400tcgctctccc agagccaccg gctctgttag ataattcaca tgcagtctgg ccactgtcct 56460acgtcctcat tcacaaagag cagacatttc gtagaagatg agggcctggg agtaacctcc 56520ctgcatgttt ttctataaag gcatagtggt taagtccttc cagctcattg accattggag 56580aattttatgg aggctgtaga ctaggggctg gtaaactaag ggcccagggg ccaaatccag 56640cctgccacct acttttgtaa ataaagtttt cttggtgcac agccatgccc attcattcat 56700ttgcacaatg tctgtggctg ctttcatgcc aaaagcagga gaactgagtg gttatgctgg 56760agacctacgg ccttcaaagc cccagacctc acgtctggcc cttgacagac agagcttccc 56820cagccctgct gcgcatcctg gcccagcatg tgctgtgtgt gtgatttcag cttgcaggag 56880ccgtggttag gaattgtccc tgtgttggtc cattttgcat tgctatgaag gagcacctga 56940ggccgggtag attatgaagg aaagaggtct gtctggctca tggttctgta ggcagcacca 57000gtatggcacc cgcatctgct cagcttctag tgaggtctca ggaagctttg actcatggtg 57060gaagtcgaag cgggagcagg tgcatcacat ggtgagagag ggagcaacgg agagagagag 57120agagagagag agagcgcctc tccctcttgc cctcaccttg agaggagatg ccaggctcct 57180ttaagtaacc agctcccatg tgaactcaca gtgagagccc atttgctact gcggagaggg 57240caccaggcat ctgctcccat gacccaaaca ctgcccacca ggccctacct ccaaccttgg 57300ggtcatattt tattctgttc tatgctatgc tatgctatgc catgccatgc catgccatgc 57360tattcctatt ctattatttg agacagaatc tcgctctgtt gcccaggctg gagtgcagtg 57420gcatgatctt ggctcactgc aacctccacc tcccaggttc aagcgattct cctgcctcag 57480cctcccgagt agctgggatt acaggcacac accaccacac ccgggtaatt tttgtatttt 57540caatagagat ggggtttcac catgttggcc aggctggtct caaactcctg gcctcaagtg 57600atccacttac ctcggcctcc caaagtgcca tgattacaga tgtgagtcac tgcgcccagt 57660gagggtcaca tttccgttga gatttggagg ggcagacgtt ggagccatct gagccccctc 57720gtcccgctct agcttctcct cccgtgtgcc ccgcggtgct ggtggcaggc ccttacgccg 57780gttctggctg cacgctctgt tccagaagct ttcttccctg cttggttacc agaaaatcat 57840cccatccatt acaaggacag ggtcccctta tctcccattc ccagggcagg acaccggggg 57900cagggcaggt ggggaactga gcaagttctc tgggggcagg cgtggctatg gctccctctg 57960ggtgggcgtc tggggagggg tggaggcagc cgtcagcgcc ctggcttgct cttcctccct 58020ggccagagac tgtggccttg tgctgctccc gtgtgggctg cctgcacctc cagtgggttg 58080tgctccctcc cctcccctcc cctcaagctc tgctgagcac cactgccttc cacagccccc 58140actctcggga ggcgaggctc ctcgtggcca ttcctgtcct tggcacccac ccccccacca 58200acctggtaga gccttgggcg gggtctgtta ctccttgcat ggcgtagacc tccccacagt 58260aggcacctga cacatacctc ctggggggca ggcaggaggt gcgttgaggt ctcagccctg 58320gcagtccctc ccctgcgtgg cataggcctc gccacagggt catcgagggt gggtggagac 58380tgtactagac cactccccgc tggtcctaga aagggtccca tctgtctgct ctctgtttgg 58440agtccagacc ttggttgctg tgccctgcat ggtgggctgg ggggcaccct ccagcctctc 58500tgagtgcatg gcctctcctt gcagccatct gcctgcccaa ccagttccgg tgtgcgagcg 58560gccagtgtgt cctcatcaaa cagcagtgcg actccttccc cgactgtatc gacggctccg 58620acgagctcat gtgtggtgag ccagcttctg gcacggggaa ggggcgtccg ggctgggttc 58680ccccaggaac gtggagttta ggggaggaga cgtgcctttc cagcggggct gggggctgtg 58740tgggagactc aggcggctgg gaggctcctt gcgggaggca gggaagcctt tcccagggca 58800gcggccagga ggacagactg tgagctgtgg gctcggcggc tacagagtct gcctcagtgg 58860gcggggctga tggtgtccag gtgcctgcag cacgcaccca cccacgggac cttgctgagc 58920agcgtctgtc aggcagcaag attacccgag ggctgcagtg gtcctgttcc ctggcagctt 58980actgtctggc tgaggaggag tgatgttcac atatgcacac atgtcatgtg cacacacatg 59040tacatgacaa catcccacat gctcctcaaa tagcatgacc tgtacagtca cggatatagg 59100gcctagggga taggaggcca agacagtcag ggaagacttt ccagaggcag tggctcctga 59160aaggctgtct gattcaggca ggaagggagc tgagttcaga taggaagtag caatgagtca 59220ttgtgtctgg ggacatggcc actccttcgc tgcagaggga cctgggctga gagctcctct 59280cttatggctg cagtcgggag agaagtctgt tggggggaga agggggcttc ctcaagggac 59340tccctgtgcc ctttggcacc ttcgtgccag gtcaggcttg aggcctgaag gcagtggtgg 59400gggccaccaa gggtcgcctc ctctgctggg caagttccca gtctgacggg cctgtgccgt 59460gggccccagc tgtgggggcg ctgttgatgc gcagccaggc ctcgccgcca gagcccgcac 59520gcttccattc cgctgacttc atcgacgccc tcaggatcgc tgggccggcc ctgtgggaga 59580gtgaatgtgg cttttgccaa agttgagtct ggagcctgga aacttcccta tgggcagcct 59640tgatagtgga gtggcccaag gagcccaccc agccgaccct gcccctcccg tggctggtgg 59700gcggcaccag gggctgcctg gctttgctcg ttcaccaaca tcacccgggc tggccagggc 59760gcgctcactt ctgccaccac cgagggccct gggcgaagga gtgaatacca ggctgccttg 59820gcagggatgt gttgagggct gtggggagtc ggacagcggc gggggtcaga ggaggaggag 59880ggtgcaccgt gcaggctgaa gggccacgtt accctgaggt tggccaggct ccccaggcct 59940agcctcccag ctcccccact ttctccccac cctccaccag tggcaaagcc agccccttca 60000gggcgcacgg tgtctgcccc caaggagggc ccattccgtt ggggttaatg ttggccacct 60060ctttctgttt gtctctggca gaaatcacca agccgccctc agacgacagc ccggcccaca 60120gcagtgccat cgggcccgtc attggcatca tcctctctct cttcgtcatg ggtggtgtct 60180attttgtgtg ccagcgcgtg gtgtgccagc gctatgcggg ggccaacggg cccttcccgc 60240acgagtatgt cagcgggacc ccgcacgtgc ccctcaattt catagccccg ggcggttccc 60300agcatggccc cttcacaggt aaggagcctg agatatggaa tgatctggag gaggcaggag 60360agtagtctgg gcagctttgg ggagtggagc agggatgtgc taccccaggc cctcttgcac 60420atgtggcaga cattgctaat cgatcacagc attcagcctt tcccactgag cctgtgcttg 60480gcatcagaat ccttcaacac agaggcctgc atggctgtag caacccaccc tttggcactg 60540taggtgtgga gaaagctcct tggacttgac cttcatattc tagtaggaca tgtgctgtgt 60600tgtccacaaa tcctcatgta ccctagaaat gaatgtgggg gcggctgggc tctctccaga 60660gctgaaggaa tcactctgta ccatacagca gctttgtctt gagtgcagct gggatttgtg 60720gctgagcagt tacaattcct acgtggccca ggcaccagga acgcaggctg tgtttgtaga 60780tggctgggca gccgcaccgc agagctgcac catgctggtt tgtatcacat gggtgaccat 60840ggtatgtcta agaaggtgga gtccctgtga ggtctgcagg tgcccccaca gctccaggcc 60900accttgagga ttgcctctgc ctgcccagcc ctgagttccc tctcccctgt cctgtcccac 60960tgtcacccca agccggcctc attgggagcc tgttggatgg cagggtatag atgtaacctg 61020attctctctg gggagcgggg ttatctggct tctcaagagc tcctaggagc ccacagtggt 61080ggcaccatca cagtcgcagc agcccccaga gaacgcggcc ctgtctgttc ctggcgtgct 61140ctgtgctgcc ccgcctgggt tccctgcccc agtcgcaggc cccttggagg aggtaccatg 61200tgtctcccgt ttcacagatg agccccgggg agctcactct agtagtggcc agagaggcct 61260gcggctcagg gagcggggca catttccaac aggacacacc gccctggtct gagtctcgtg 61320ggtagtggga gcagaggaga gcgccctatg tctgtggggc ggcttggctg agcctggaag 61380ccacctgacc tcccccgtcc cttccctgcc aggcatcgca tgcggaaagt ccatgatgag 61440ctccgtgagc ctgatggggg gccggggcgg ggtgcccctc tacgaccgga accacgtcac 61500aggggcctcg tccagcagct cgtccagcac gaaggccacg ctgtacccgc cggtgagggg 61560cggggccggg gaggggcggg gcgggatggg gctgtgggcc cctcccaccg tcagtgctgg 61620ccaccggagg cttcccgggt tcctgggggc tgtgccaccg cctctgaggc atgcttgctt 61680tcttcccttt tcaaaccctt ctgcttcctt ctttaatgac attgttgatt gtggataatc 61740tgaaaactac acaaaaatat aaagagccaa aatctcaccc aaatccacct cctagagtgg 61800ctgttgggct ccgtcagcat ccaggcggcc gtctgtgttc cgcacggccc agcccatcga 61860tagccgcctg caccaggcct gtctgccctc tgtgagcctc cccacagggt tccctccaca 61920aacaccctgt tctcccaccc agggctggct gcttcctgga aaacagctgg atggttttgt 61980gcatgacaga caaacacagg gtgattttcg tggctaaaat actccctgga gcttttggca 62040gggtgagggg ctggctccag ctgagccacg ccttgagtga aatgactgtg aggagaataa 62100actgccgctg ccctccagga tcactggggc tggctgggga gaacccccgt ttctgggagc 62160acagtcccag gatgccaagg cgagcttggt gccgagatgt gaactcctga gtgtaaacag 62220cgggggctga cttgacatgc tttgtatgct tttcatttgt tcctgcagct gtatgcccct 62280aaggtgagtc cagccccctt ctgcttcctc tggggcctcg ccagtgagcc ccaccttgct 62340ggggctggtt cctcctgccc ttctgggtat ccctcacatc tggggtcttg tcttcttgtt 62400ttatttttct tttttttttg agacggagtt tcacttttgt tgcccaggct tcagtgcaat 62460ggtgtgatct ctaggctcac cgcaacctct gcctcccagg ttcaagcagt tctcctgcct 62520cagcctccct agtagctggg attacaggca tgtgccacca cgcccagcta attttgtatt 62580tttagtagag atggggtttc tccatgttgg tcaggctgat cttgaactcc ctacctcagg 62640tgatccgccc accttggcct cccaaagtgc tgggattaca ggcgtgagcc accgcacctg 62700gcctttttct tttcttttct tttctttttt ctgagacagg gtctcgctct gtcacccagg 62760ctggagtgca atggtgtcat catggctaac tgcagcctct accttctagg ctcaagcaat 62820cctcccatct cagcccctaa gtagctagga ctgcacgcat gcatccccat gcccagctaa 62880tatttacatt ttttgtagag atgaagtttc actatattgc ccaggctggt ctccaactcc 62940tggactcgag cgatcctcct gcctcggcct ccccaggtgc tgggattaca ggcgtgagcc 63000accgtgcctg gcctggggta ttgtcttctt atggcacctg actgtggtgg gccctgggaa 63060ggaagtagca gaagagggtt cttcttggtt tcctggacag taactgagtg ttctggaggc 63120cccagggcct ggctttgttt agggacaaag ggaactggta accagaagcc gagagtttaa 63180acacccactg cccttcttcc ctgctcctgc tgctgcaacc cagcttaacc agccaggagt 63240gctaggaacc caagcagggc ccccgagcac acagcaggca gctcacgaat tctcttttcc 63300tgttctccct tgggagctgg gaggatctta atcaggcaat aagagatggc actgagcagc 63360cagctaattt tttaaatcac tttattgttt aaccatatga ctcacccact taaaaaaggg 63420tacagttcag tgggttttag tgtattcaca gatgtgtgca accctcacca cagttaattt 63480tagaacattt tcctgcccct aaaagaaact ctgcatgaag ccagctgttt ttaaattagc 63540aaagttattt tgcatccttt aaatatatgt tcatggtaca aaattcaaaa gatacagaag 63600agtctgcagt ccaaagagac tccgccccca tgacgccaag caggactccc tgggaggcat 63660ggcctcctgc agtgtgtttc ttctatgtcc ccccaggggt catctgtaca tatgcaagca 63720tacaagagcg tggactttgt tttccaagcc agaagataat tgtagattta tgtgcagttg 63780tgagaaagag cacagaccca tttatcctct gcctggtttc ccccagtgct gcctgccatc 63840ttgcatgact tccattccta tcataagcaa gacactgata acgattcttt caccttattc 63900agattgacat aagtgttttt tgtttgttct tgagacaaac ttcctctgtc acccagtggg 63960agtgcagtgg cacaatcaca gctcactgca gcctcaaact cctgggctca agcgattctc 64020ctgcctcagt cccctcaagt agctcagatg gcaggtgtgc accatcatgc caggctaatt 64080tttaaatttt ttgtggaggt gaggcctcac taaatttcct gggctagtct tgaactcctg 64140agctaaagtg atcctcctgc ctcagcctcc caaagtggta ggattacagg catgagccac 64200tgcgcctggg ctgacatatg tgttttcgta agcccgaaag atagcatctg aagagtcaac 64260attgagcctt gccttttgct gctaatgatg tataaaagct gctgttctga gcatttcgga 64320ggctcccagc tgccgtgtgc accctgccta gagctctacc gtaacccatc tccgggagga 64380ggtgctattg ttttcctcat tttgcaacaa ggaggctgaa gaactgagca tgaaccactg 64440gcctgggtcg ttcggttggt aggcagtggg gccaggccat ccaactcaca accaccttct 64500actctgcttc ccccgcaccc tgaagtttgt tctgttttga ggacacagcc gtcacattct 64560tggtggctga acagcactcc ttgtcaggtg tggctgggcc cccactggag ggcatcatgg 64620tcctctctcc tgctgcggtt gaaccttggc tgtttcaacc actcctgcca agtggccctc 64680tgaaagggac agtccatctt ttctcagcag agggccacac tggcaaaacg gtccctggca 64740ccctttctct ccacctgtct aatatagagt aaaaatggta tcatgttaag atcttcattt 64800atatttattt tatcatgaat gatgtaagca tcattttgtg tgtttaagaa cctttgggcc 64860cagcgtgatg gcttgcagct gtaatctcag cactttagga ggctgagatg agcggatcac 64920ttgaggccgg gagtttgaga ccagcctggc caacatggag aaaccccgtc tctagtaaaa 64980atttaaaaat tagccgggta tggtgatccc agctacttgg gagtctgaag catgagaatt 65040gcttgaacat gggaggcgga ggttgcagtg agccgagatc gcgccattgc actccagcct 65100gggcgacaga gcgagactct gtctcacaaa aaaaaaaaaa aaagaaaaga aaagaaatta 65160tcaatctcct cttttatggc atatatatat atatatatat atatatatat ttatttccct 65220ttcttggtta tgttcataaa ggcctcccct gctctgatca taaaaaacaa cttattttca 65280cactctctct cttttttttt tgagacagag ttttgctcct gttgcccagg ctggagtgca 65340gtggcgcaat ctcagctcac tgtaacctcc gcctcccggg ttggagtgat tctcctgcct 65400taccttcccg agtagctggg attataggca tgcaccacca tgcctggcta attttgtact 65460tttagtagag acgggggttt ctccatgttg gtcaggctgg tctcgaactc gcgacctcag 65520gtgatccacc cacctcggcc tcccaaagtg ctgggattac agacgtgagc caccatgccc 65580agcccacact ctctttctta acgtcctcct cctttcgttt tacgttcaca tctttaattc 65640ttctgggatg taattagatt tgatgagcaa ggtgggcatc cagcttgttt cttggctgat 65700ggcttatggg tggcgtgaat tagtcggggt ctatcaggag gcagaaactc tatgagaatt 65760tgaacagaga aagttccgtc tacaggctta ttaccaggga ctggaatagc agaaattgaa 65820cagtgagatg tacagagaac tctaagaatg caggaatagg ccaggcatgg tggctcacac 65880ctgtcatccc agcactttgg gagaccaagg cgggtggatc acctgaggtc aggagttcga 65940gaccagcctg gccaacatag tgaaacccca tctctactaa aaatacaaaa aaattagctg 66000ggtgtggtgg cgcatgcctg taatcccagc ttctcgggag tctgaggctg gagaatcact 66060tgaacctggg aggcagaggt tgtagtgagc cgagatcatg ccattgtact ccagcctggg 66120caacaagagc gagactcagt caaaacaaca acaacgcagg aatagcagat gagccgaggt 66180ggggcctccc cagcccccac cccccacccc gcaccctggg ccgagatcca gtcctctttg 66240aatagggcct gggcgtggtt cacgggacat ctgagacatt gccgaggcgc tgcactggtg 66300gatcttgcca gaagtctgcc cagtgcagat ttgggcagaa tctcaaactg ccttgggatg 66360taggagagaa accaggcctg gtcaagttca tgggaagagg tggaaacaga ccccataggc 66420tggggcttgg gcagctgtag gaagccctct ctgctgcctc cctgcctgct ctctgctttg 66480aagcatcttc cccagtgccc ccagtctcat gccctctcaa cgttggggtc aaatcctgag 66540gaatacccag actggctctc tgggccaaag aggaccctct ccagaaagag cagggcccag 66600tgcggcttcc taaagggcag gggaagggcc tggccactcc ccagaggcta ctcaccagcc 66660atcaggatag ccccaggaag caggccttct cgagcccatt ttattacttt attttattat 66720tttatttaat tttaaattta ttttttgaga cagagtctca ctctgttgcc caggctggag 66780tgcagtggtg cgatctcaac ccactgcagc ctctgcctcc agggttcaag ggattctccc 66840acctcagcct cccaagtagc tgggattaca ggtgcccgcc accacacccg gctaattttc 66900atatttttag tagagacgag gtttcaccat gttggccagg ctggtctcga actcctgacc 66960tcaagtgatc cgcccgcctc ggcctcccaa agtgctaggt caagcccatt ttaaagttga 67020agaaactgag gctgaggtaa attccctccc cagggatcct gctgcagcca gaaggtggta 67080aaacaggact tcacccgggt ctgtctggcg tgaaaggcag tgttcttgta ccaccctagg 67140gggcctgaga gaactgagtc cctcgggcat aactgacagt tctgttccca ttattccgca 67200ggggctcgga tctggctgta tgctttccag gatggccttg gagacccaca taagccctac 67260accctttggg aagctgcatg ttgggttggg gtgccgtcag tggcacttgt ggaaggtgca 67320gacctgtgtg ggtgtgtggg cccagggccc ctggtccctt cctccctttg tagggctggt 67380tgtgtgctgc ctggacctgg ggggcacgtt cacgtggtga atttgtctat ttactatccc 67440cgctttgggg ctggtgccag cacaggccct tgtgaagggg gtgcctttgt ctggagtggg 67500actgtggccc ctccctcagc gtggtgactt ctgtgtcagg gcttcagcag ggacgcagag 67560cccctgagtg ttcggaacaa gggcgtcatt gcaggagtta gactgtgtgt gatggaggga 67620ggaggggcag gaggaaaggt cagaaggaga gttcctggga aggtccctga ggagcctggt 67680gaggtgctaa ctggtgtgga ggacactcag ggcctgtggg gacatctcct actgctgggg 67740gccagccaca aagggaactg gccgaagtcc tgtccccgcc ttcacagccc agcatctggt 67800cacaaggcag gtacttggaa gggcgcgggc acctgggcca aaagtgcctg ggttcccttt 67860gcctttcact gagatgacct tcggggcagg tggctgctgc ctcccctcct gtccccaggt 67920tttgccaact ggccagagga aggggtcctg ggaagcaggg gggccagaag ccctctctgc 67980aaggaaagcc cgaggggtgt gggaggaagg aaggaatgcc caggctggcg aggctctaag 68040tcaccctggc ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc 68100tccctgtaca acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac 68160aggtaggaca tcccctgcag ccctccatgg ccattgggtt cccgccagcc cgtggtggag 68220gggcctaatc cccatgccac tgatgagggg aggtattctg ggtgctagtg ggcaggtgcc 68280gggcccagcc ctgcctccct ctgctctgcc aaccacacta ggctgcctcc ccagacaagc 68340tcagcgggca ctgcatgttg ggttcagaaa tcagcagaac tccacgttct gagctgctct 68400tcaagttgct cctatggggg ttacttttaa gctgggaaat ggctgtggcg tcgaggggcc 68460gggggcttgg gctccaaact ctgactgtgt gtttgagtcc ggctgtggaa acctagccat 68520tgagatgccc cctcttggtg gctctgtcct cttaggatgg gacaagtctg tgaaggctgc 68580tgcagcaccc accgtagacc cctaatcgtg tgacgtcacc aggatggtcc gggctgctca 68640cttgccacag tggcctgttt gagcccggga agccaacggg gctgctcagc tggacaccag 68700ccccccgagc tgcccatgtt ggggtcacag gccccacctc cctggttggg gaggggcaac 68760tgagagtgtg gagaggtggg acccaggtgt gctggtctcc gcaggggctg gatcagagcc 68820tgggatgggc agggtgagcc tcctgacctt taacccagtg gtgtcaggca acgtggccca 68880cccgccagcc gcaccaggcc ccacccccgc aggtgaaggg gtgggatagg ctgggcctgg 68940gccaggacac ctctggacca cgcattcctc attgcttggg tccctggagc agcagggcct 69000cccgagtgtg gtgccgcctg ccacctagtg gccatttcca cgaactccca ggcctggctg 69060gggagccgga actgcagcct ccatttccac cccactccgg gtcgggccac ctccctgatg 69120cctcagtatt atatcaaact gtcacagtct gtcccacagc cttacagacc actgtctcca 69180gaatggtcac atccacactg ggcagcccag tctcgctagt tcctcgtccc acctcctgcc 69240tttgctcatg cccgtcctgc tctgggccca ccgcggacac atcttccccc cgcccgccgt 69300ctgacctcac agcagctggg ccccaagagg agtatcctgt cctgctgcac ttttctcaac 69360acccggtgtt ggctgcacct tcccacccat tgcaggcccc tctgtgacag gacgggggct 69420cctaaacaca ccacagttcc gagtctgaac tcacacagtg ggatgcggcg tttctgggcc 69480acagttgggt gcaggtagcc tctgggagga tgggaggtca ggagccatct tgcgagtcag 69540gttgcttgaa ctcaggatgg aagtgttccg ggcccattgg ttgctgtatt agcctgttct 69600cacgctgcta ataaagacat acccaagact gggtaattgt aaaggaaaga ggtttaacgg 69660actcacagtt ccacctgcct ggggtggcct cacaatcatg gtagaagaca aggaggagca 69720agtcacatct tacatggctt cagggaacag acagcatgag aaccaagcga aaggggtttc 69780cccttgtaaa accatcaagt ctagtgagat ttattcacta ccacgagaac agtatggggg 69840gaaccacccc catgattcaa tcatctccca ctgggtccct cccacagcac gtgggaatta 69900tgggagtaca attcaagatg agatttgggt ggggacacag ccaaacccta tcggttgcca 69960acatttacag taacagtgtt aggtgaacag ttgtccagtc tcctgttttg tcggacactg 70020tttctagcac cttccaggca gaatctcatg tatccttcac tttcgaaatg ggtactattt 70080catccccact tttatcaatg agaaactaaa gctcgaagag gtcaagtaag ttcctggcca 70140aggtcagcta gcaggctcta gaggcctcgt tctccttaga ggcagccttg ccagggccca 70200ggcttggcag gctgcagggc aggtgcgggc atgcccatgg tagaggtggg accattgagg 70260ctcagagagg gtaagtgatg agccctggcg acacagcggg gtgggtccag agtccggcct 70320gcatcttctg gagctggcca gtggacaggc ctttcccgtt cacagccccg gggctgctgt 70380gcccaccagg gcggatgtgc ctaccgaatc ccactcctct gtgtgtgtcc ctttcaggcc 70440ctacatcatt cgaggaatgg cgcccccgac gacgccctgc agcaccgacg tgtgtgacag 70500cgactacagc gccagccgct ggaaggccag caagtactac ctggatttga actcggactc 70560agacccctat ccacccccac ccacgcccca cagccagtac ctgtcggcgg aggacagctg 70620cccgccctcg cccgccaccg agaggagcta cttccatctc ttcccgcccc ctccgtcccc 70680ctgcacggac tcatcctgac ctcggccggg ccactctggc ttctctgtgc ccctgtaaat 70740agttttaaat atgaacaaag aaaaaaatat attttatgat ttaaaaaata aatataattg 70800ggattttaaa aacatgagaa atgtgaactg tgatggggtg ggcagggctg ggagaacttt 70860gtacagtgga gaaatattta taaacttaat tttgtaaaac agaactgcca ttcttttgtg 70920ccctgtgtgc atttgagttg tgtgtccccg tggagggaat gccgaccccc ggaccaccat 70980gagagtcctc ctgcacccgg gcgtccctct gtccggctcc tgcagggaag ggctggggcc 71040ttgggcagag gtggatatct cccctgggat gcatccctga gctgcaggcc gggccggctt 71100tatgtgcgtg tggcctgtgc cgtcagaaag ggccctgggc ttcatcacgc tgttgctgtt 71160cgtcttcctc agattcttag tctttttttt tttttttttt ttttgagacg gagtctttct 71220ctgtcatcca ggctggagtg cagtggtaca atctcagctc actgcaagct ccgactccca 71280ggttcaagtg agtctcctgc ctcagcctcc cgagtagctg ggactacagg tgcgcgccac 71340cacacccgcc cagctaattt ttgtattttt agtagagatg gggtttcacc atgttggcca 71400ggatgatctc gatctcttga cctcgtgatc cgcccacctc ggcctcccaa agtgctggga 71460ttataggcat gagccactgt acccagctga ctcttagtca cttttaagaa ggggactgtg 71520ccttcatttt tcactgggcc ctgcagaata tatgcctggg ctctgggctc ttctgaacct 71580gtgttggctt ccatctgacc tctctgtgcc agcccaaggc tgctgctctt cctgagggca 71640aggagcccca tgactgcgtg ttgactcgct ggatggggct gctgagccca ctctgccaca 71700ccacgtgccc ctggcaggga gggaatccct gggtcctcac aggaacagtc agcaagccac 71760acctgacgcc tgctgtgggc ccatccctgc ggtgctggag aagacagaca aggcctggtc 71820actgcctctg cagggtcccc agtccgtgga aggagacagt aatctaggca ttttcggtgg 71880ggaagctgag ctgttctcgt gtcctgaagg ccaggcggga acagccgtct tcagagggaa 71940gggagaaaat gcacatcgca tcagtggaga agggcctgac ttccctcagc atggtggagg 72000gaggtcagaa aacagtcaag cttgagtatt ctatagtgtc acctaaata 72049<210>10<211>8705<212>DNA<213>人(Homo sapiens)<400>10ggactcaggg gcagcaggga ggtacaccca tggttagtgg gcggaccata gggggtaatg 60agagggtgaa tcgatggaac ctgggggaca caatcgaagt ggttccagag tcgggctgta 120ctaattaaag agacggggca gtggacaggc attttcagtt gactgcccag ggagtgttct 180gcccaacagg gaggatatgc gtacagaatc atactcgatc agcatgagtc caattcagac 240cgtacatcag tggagatatg ggtcccccga tgactccgtg gaacactgat gtttgtgaca 300ggggagtaca gcaccagcca tcagcaggcc agtaaatcat accggcctgc gaaattggac 360tcagacccgg atccaccctg accgacgtcc caagccccca ccccccaccc cccaccatgg 420gccgagatcc agtcctcttt gaatagggcc tggccgtggt tcacgggaca tctgagacat 480tgccgaggcg ctgcattggt ggatcttgcc agaagtttgc ccagtgcaga tttgggcaga 540atctcaaact gccttgggat gtaggagaga aaccaggcct ggtcaagttc atgggaagag 600gtggaaacag accccatagg ctggggcttg ggcagctgta ggaagccctc tctgctgcct 660ccctgcctgc tctctgcttt gaagcatctt ccccagtgcc cccagtctca tgccctctca 720acgttggggt caaatcctga ggaataccca gactggctct ctgggccaaa gaggaccctc 780tccagaaaga gcagggccca gtgcggcttc ctaaagggca ggggaagggc ctggccactc 840cccagaggct actcaccagc catcaggata gccccaggaa gcaggccttc tcgagcccat 900tttattactt tattttatta ttttatttaa ttttaaattt attttttgag acagagtctc 960actctgttgc ccaggctgga gtgcagtggt gcgatctcaa cccactgcag cctctgcctc 1020cagggttcaa gggattctcc cacctcagcc tcccaagtag ctgggattac aggtgcccgc 1080caccacaccc ggctaatttt catattttta gtagagatga ggtttcacca tgttggccag 1140gctggtctcg aactcctgac ctcaagtgat ccgcccgcct cggcctccca aagtgctagg 1200tcaagcccat tttaaagttg aagaaactga ggctgaggta aattccctcc ccagggatcc 1260tgctgcagcc agaaggtggt aaaacaggac ttcacccggg tctgtctggc gtgaaaggca 1320gtgttcttgt accaccctag ggggcctgag agaactgagt ccctcgggca taactgacag 1380ttctgttccc attattccgc aggggctcgg atctggctgt atgctttcca ggatggcctt 1440ggagacccac ataagcccta caccctttgg gaagctgcat gttgggttgg ggtgccgtca 1500gtggcacttg tggaaggtgc agacctgtgt gggtgtgtgg gcccagggcc cctggtccct 1560tcctcccttt gtagggctgg ttgtgtgctg cctggacctg gggggcacgt tcacgtggtg 1620aatttgtcta tttactatcc ccgctttggg gctggtgcca gcacaggccc ttgtgaaggg 1680ggtgcctttg tctggagtgg gactgtggcc cctccctcag cgtggtgact tctgtgtcag 1740ggcttcagca gggacgcaga gcccctgagt gttcggaaca agggcgtcat tgcaggagtt 1800agactgtgtg tgatggaggg aggaggggca ggaggaaagg tcagaaggag agttcctggg 1860aaggtccctg aggagcctgg tgaggtgcta actggtgtgg aggacactca gggcctgtgg 1920ggacatctcc tactgctggg ggccagccac aaagggaact ggccgaagtc ctgtccccgc 1980cttcacagcc cagcatctgg tcacaaggca ggtacttgga agggcgcggg cacctgggcc 2040aaaagtgcct gggttccctt tgcctttcac tgagatgacc ttcggggcag gtggctgctg 2100cctcccctcc tgtccccagg ttttgccaac tggccagagg aaggggtcct gggaagcagg 2160ggggccagaa gccctctctg caaggaaagc ccgaggggtg tgggaggaag gaaggaatgc 2220ccaggctggc gaggctctaa gtcaccctgg cttggctctc ctcagatcct gaacccgccg 2280ccctccccgg ccacggaccc ctccctgtac aacatggaca tgttctactc ttcaaacatt 2340ccggccactg cgagaccgta caggtaggac atcccctgca gccctccatg gccattgggt 2400tcccgccagc ccgtggtgga ggggcctaat ccccatgcca ctgatgaggg gaggtattct 2460gggtgctaat gggcaggtgc cgggcccagc cctgcctccc tctgctctgc caaccacact 2520aggctgcctc cccagacaag ctcagcgggc actgcatgtt gggttcagaa atcagcagaa 2580ctccacgttc tgagctgctc ttcaagttgc tcctatgggg gttactttta agctgggaaa 2640tggctgtggc gtcgaggggc cgggggcttg ggctccagag tctgactgtg tgtttgagtc 2700cggctgtgga aacctagcca ttgagatgcc ccctcttggt ggctctgtcc tcttaggatg 2760ggacaagtct gtgaaggctg ctgcagcacc caccgtagac ccctaatcgt gtgacgtcac 2820caggatggtc cgggctgctc acttgccaca gtggcctgtt tgagcccggg aagccaacgg 2880ggctgctcag ctggacacca gccccccgag ctgcccatgt tggggtcaca ggccccacct 2940ccctggttgg ggaggggcaa ctgagagtgt ggagaggtgg gacccaggtg tgctggtctc 3000cgcaggggct ggatcagagc ctgggatggg cagggtgagc ctcctgacct ttaacccagt 3060ggtgtcaggc aacgtggccc acccgccagc cgcaccaggc cccacccccg caggtgaagg 3120ggtgggatag gctgggcctg ggccaggaca cctctggacc acgcattcct cattgcttgg 3180gtccctggag cagcagggcc tcccgagtgt ggtgccgcct gccacctagt ggccatttcc 3240acgaactccc aggcctggct ggggagccgg aactgcagcc tccatttcca ccccactccg 3300ggtcgggcca cctccctgat gcctcagtat tatatcaaac tgtcacagtc tgtcccacag 3360ccttacagac cactgtctcc agaatggtca catccacact gggcagccca gtctcgctag 3420ttcctcgtcc cacctcctgc ctttgctcat gcccgtcctg ctctgggccc accgcggaca 3480catcttcccc ccgcccgccg tctgacctca cagcagctgg gccccaagag gagtatcctg 3540tcctgctgca cttttctcaa cacccggtgt tggctgcacc ttcccaccca ttgcaggccc 3600ctctgtgaca ggacgggggc tcctaaacac accacagttc cgagtctgaa ctcacacagt 3660gggatgcggc gtttctgggc cacagttggg tgcaggtagc ctctgggagg atgggaggtc 3720aggagccatc ttgcgagtca ggttgcttga actcaggatg gaagtgttcc gggcccattg 3780gttgctgtat tagcctgttc tcacgctgct aataaagaca tacccaagac tgggtaattg 3840taaaggaaag aggtttaacg gactcacagt tccacctgcc tggggtggcc tcacaatcat 3900ggtagaagac aaggaggagc aagtcacatc ttacatggct tcagggaaca gacagcatga 3960gaaccaagcg aaaggggttt ccccttgtaa aaccatcaag tctagtgaga tttattcact 4020accacgagaa cagtatgggg ggaaccaccc ccatgattca atcatctccc actgggtccc 4080tcccacagca cgtgggaatt atgggagtac aattcaagat gagatttggg tggggacaca 4140gccaaaccct atcggttgcc aacatttaca gtaacagtgt taggtgaaca gttgtccagt 4200ctcctgtttt gtcggacact gtttctagca ccttccaggc agaatctcat gtatccttca 4260ctttcgaaat gggtactatt tcatccccac ttttatcaat gagaaactaa agctcgaaga 4320ggtcaagtaa gttcctggcc aaggtcagct agcaggctct agaggcctcg ttctccttag 4380aggcagcctt gccagggccc aggcttggca ggctgcaggg caggtgcggg catgcccatg 4440gtagaggtgg gaccattgag gctcagagag ggtaagtgat gagccctggc gacacagcgg 4500ggtgggtcca gagtccggcc tgcatcttct ggagctggcc agtggacagg cctttcccgt 4560tcacagcccc ggggctgctg tgcccaccag ggcggatgtg cctaccgaat cccactcctc 4620tgtgtgtgtc cctttcaggc cctacatcat tcgaggaatg gcgcccccga cgacgccctg 4680cagcaccgac gtgtgtgaca gcgactacag cgccagccgc tggaaggcca gcaagtacta 4740cctggatttg aactcggact cagaccccta tccaccccca cccacgcccc acagccagta 4800cctgtcggcg gaggacagct gcccgccctc gcccgccacc gagaggagct acttccatct 4860cttcccgccc cctccgtccc cctgcacgga ctcatcctga cctcggccgg gccactctgg 4920cttctctgtg cccctgtaaa tagttttaaa tatgaacaaa gaaaaaaata tattttatga 4980tttaaaaaat aaatataatt gggattttaa aaacatgaga aatgtgaact gtgatggggt 5040gggcagggct gggagaactt tgtacagtgg agaaatattt ataaacttaa ttttgtaaaa 5100cagaactgcc attctttcgt gccctgtgtg catttgagtt gtgtgtcccc gtggagggaa 5160tgccgacccc cggaccacca tgagagtcct cctgcacccg ggcgtccctc tgtccggctc 5220ctgcagggaa gggctggggc cttgggcaga ggtggatatc tcccctggga tgcatccctg 5280agctgcaggc cgggccggct ttatgtgcgt gtggcctgtg ccgtcagaaa gggccctggg 5340cttcatcacg ctgttgctgt tcgtcttcct cagattctta gtcttttttt tttttttttt 5400ttttttgaga cggagtcttt ctctgtcatc caggctggag tgcagtggta caatctcagc 5460tcactgcaag ctccgactcc caggttcaag tgagtctcct gcctcagcct cccgagtagc 5520tgggactaca ggtgcgcgcc accacacccg cccagctaat ttttgtattt ttagtagaga 5580tggggtttca ccatgttggc caggatgatc tcgatctctt gacctcgtga tccgcccacc 5640tcggcctccc aaagtgctgg gattataggc atgagccact gtacccagct gactcttagt 5700cacttttaag aaggggactg tgccttcatt tttcactggg ccctgcagaa tatatgcctg 5760ggctctgggc tcttctgaac ctgtgttggc ttccatctga cctctctgtg ccagcccaag 5820gctgctgctc ttcctgaggg caaggagccc catgactgcg tgttgactcg ctggatgggg 5880ctgctgagcc cactctgcca caccacgtgc ccctggcagg gagggaatcc ctgggtcctc 5940acaggaacag tcagcaagcc acacctgacg cctgctgtgg gcccatccct gcggtgctgg 6000agaagacaga caaggcctgg tcactgcctc tgcagggtcc ccagtccgtg gaaggagaca 6060gtaatctagg cattttcggt ggggaagctg agctgttctc gtgtcctgaa ggccaggcgg 6120gaacagccgt cttcagaggg aagggagaaa atgcacatcg catcagtgga gaagggcctg 6180acttccctca gcatggtgga gggaggtcag aaaacagtca agcttgttgc tgggtgacag 6240tgcatttaat aatcaaaata taggctgggt acggtggctc atgcctgtaa tcccagcact 6300ttgggaggct gaggcaggtg gatcacttga ggccaggagt ttgagaccgg cctggccaac 6360atggcaaaac ctcaactact aaaatacaaa aactagccgg gcgtggtggt gcacgcctgt 6420aatcccagct acttgggagg ctgaggcagg agaattgctt gaacctggga ggcggaggct 6480gcagtgagcc gagattgtgc cactgcactc cagcctgggc aacagagcaa gactctgtct 6540caaaaaaaaa aaaaaaaaaa gcaatacaaa atacaaatat cactttcact aaaagaaggg 6600atggaagacc caaaacaaac agaaaacaac aaaatggcag gagtaagtcc ccacttatca 6660ataataacat tgactgtaaa taggctaagc tctgcaatca aaagagtggg ccaggagcgg 6720tggctcacgc ctgtaattcc aacgctttgg gaggctgagg cggatggatc atttgatgtc 6780acgagtttta agaccagcct ggccaacaag gtgaaacccc atctgtacta aaaatacaaa 6840aattagccag gcggtagtgg cacgcacctg taatcccagc tacttgtgag gctgaggcag 6900gagaatcact ggaggctggg aagcggaggt tgctgtgagc caagatggag ccactgcact 6960cccacctggg cgacagagtg agatcctgtc ttaagaaaaa aaagagtgga tgaatggatc 7020aaaaaacaag acccaaccat ctcttgcata caagaaacac actttaccta taaaaacaca 7080ctaggccagg tgtggtggct cacacctgta atcccagccc tttgggaggc ctgactggca 7140gatcacctga ggccaggagt ttcagaccag cttgaccgac atggcaaaac cccatctctc 7200ctaaaaatac aaaaaaacaa aaaaaagaaa aaggctggaa gtagtgatgt gtgcctgtag 7260ccccagctac ttgggaggct gaggcaggag aattgcttga atccgggaag tggaggttgc 7320agtgagccag gatggtgcca ctgcactcca gcctgggtga cagagcgaga ccctgtcata 7380aaaaaaaaaa gaaaagaaaa agaaaaacgag aaaacaaac acaaaattag tagaagaaaa 7440gaaataataa agatcagaac aggccaggct catgggcaca gtggctcaac tcctacctgc 7500tcaggagttt gagaccagtc tggccaacat ggcaaaaccc catctctcct aaaaatatga 7560aaaaaaaaaa ataggctgga tgtggtgatg tgtgtgtgcc tgtagcccca gctacttggg 7620aggctgaggt gggagaatca cttgagccca ggaagtggag gctgcagcga gtcatgaatg 7680caccctgcac tctagctggg taactggagt gagattctgt ctcaaaaaag caaagaccag 7740agcagaaata aatgaaatgg aaatgaagga aacaatgcaa aatgatacaa aaagtttttt 7800cgaaaagata aacaaaatca acaaaccttt agccagatta agaaaaaaag agagaagacc 7860caaataaata aaatccgaga ttaaaaagga gacattacca ctgataccac agaaattcaa 7920aggatcatta gaggcaacta tgtgcaacta tatgctaatg aactggaaaa cctagaagaa 7980ctgggtaaat ttctagacac atacaaccta tcaagattga accatgaaga aatccaaaac 8040ctgaacaggc cgggcacggt ggcttacgcc tgtaatccca gcactttgga aggcctgaga 8100tcaggagttc gagaccagcc tggccaacat ggtgaaaccc catctctact gaaaaaatat 8160aaaaattagc cgggcgtggt ggcgggtgcc tctaatgtca gccactcggg aggctgaggc 8220aggaaaatca cttgaacctg ggaggcatag gttgcagcga gccgaggttg caccactgca 8280ctccagcctt ggcgacagag ccagactcca tctcaaaaaa attaaaataa caaaaacctg 8340aacagaccaa taacaagtaa tgcgatgaaa actgtaataa aatgtttccc aacaaagaaa 8400gcccaggaac aaatggcttc actgctgaat tttaccaaac attttttttt ttttgagacg 8460gagtctcgct ctgtcgccca ggctggagtg cagtggtgta acctcggttc gctggtaact 8520tatgcctctc aggctgcaag tgattttcct gcttcaggcc ccccgagtgg ctggaaatta 8580gatggtactt gtcaaacaag gcctggctaa atttctatat ttccttcaag tagaagatgt 8640gcttccaaca aaggttgggt tacggctggc ttctgaaaat cttggatttc aaggctcccc 8700aaaag 8705<210>11<211>66933<212>DNA<213>人(Homo sapiens)<400>11tataatcaag cgcgttccgt ccagtccggt gggaagattt tcgatatgct tcgtgatctg 60ctcaagaacg ttgatcttaa agggttcgag cctgatgtac gtattttgct taccaaatac 120agcaatagta atggctctca gtccccgtgg atggaggagc aaattcggga tgcctgggga 180agcatggttc taaaaaatgt tgtacgtgaa acggatgaag ttggtaaagg tcagatccgg 240atgagaactg tttttgaaca ggccattgat caacgctctt caactggtgc ctggagaaat 300gctctttcta tttgggaacc tgtctgcaat gaaattttcg atcgtctgat taaaccacgc 360tgggagatta gataatgaag cgtgcgcctg ttattccaaa acatacgctc aatactcaac 420cggttgaaga tacttcgtta tcgacaccag ctgccccgat ggtggattcg ttaattgcgc 480gcgtaggagt aatggctcgc ggtaatgcca ttactttgcc tgtatgtggt cgggatgtga 540agtttactct tgaagtgctc cggggtgata gtgttgagaa gacctctcgg gtatggtcag 600gtaatgaacg tgaccaggag ctgcttactg aggacgcact ggatgatctc atcccttctt 660ttctactgac tggtcaacag acaccggcgt tcggtcgaag agtatctggt gtcatagaaa 720ttgccgatgg gagtcgccgt cgtaaagctg ctgcacttac cgaaagtgat tatcgtgttc 780tggttggcga gctggatgat gagcagatgg ctgcattatc cagattgggt aacgattatc 840gcccaacaag tgcttatgaa cgtggtcagc gttatgcaag ccgattgcag aatgaatttg 900ctggaaatat ttctgcgctg gctgatgcgg aaaatatttc acgtaagatt attacccgct 960gtatcaacac cgccaaattg cctaaatcag ttgttgctct tttttctcac cccggtgaac 1020tatctgcccg gtcaggtgat gcacttcaaa aagcctttac agataaagag gaattactta 1080agcagcaggc atctaacctt catgagcaga aaaaagctgg ggtgatattt gaagctgaag 1140aagttatcac tcttttaact tctgtgctta aaacgtcatc tgcatcaaga actagtttaa 1200gctcacgaca tcagtttgct cctggagcga cagtattgta taagggcgat aaaatggtgc 1260ttaacctgga caggtctcgt gttccaactg agtgtataga gaaaattgag gccattctta 1320aggaacttga aaagccagca ccctgatgcg accacgtttt agtctacgtt tatctgtctt 1380tacttaatgt cctttgttac aggccagaaa gcataactgg cctgaatatt ctctctgggc 1440ccactgttcc acttgtatcg tcggtctgat aatcagactg ggaccacggt cccactcgta 1500tcgtcggtct gattattagt ctgggaccac ggtcccactc gtatcgtcgg tctgattatt 1560agtctgggac cacggtccca ctcgtatcgt cggtctgata atcagactgg gaccacggtc 1620ccactcgtat cgtcggtctg attattagtc tgggaccatg gtcccactcg tatcgtcggt 1680ctgattatta gtctgggacc acggtcccac tcgtatcgtc ggtctgatta ttagtctgga 1740accacggtcc cactcgtatc gtcggtctga ttattagtct gggaccacgg tcccactcgt 1800atcgtcggtc tgattattag tctgggacca cgatcccact cgtgttgtcg gtctgattat 1860cggtctggga ccacggtccc acttgtattg tcgatcagac tatcagcgtg agactacgat 1920tccatcaatg cctgtcaagg gcaagtattg acatgtcgtc gtaacctgta gaacggagta 1980acctcggtgt gcggttgtat gcctgctgtg gattgctgct gtgtcctgct tatccacaac 2040attttgcgca cggttatgtg gacaaaatac ctggttaccc aggccgtgcc ggcacgttaa 2100ccgggctgca tccgatgcaa gtgtgtcgct gtcgacgagc tcgcgagctc ggacatgagg 2160ttgccccgta ttcagtgtcg ctgatttgta ttgtctgaag ttgtttttac gttaagttga 2220tgcagatcaa ttaatacgat acctgcgtca taattgatta tttgacgtgg tttgatggcc 2280tccacgcacg ttgtgatatg tagatgataa tcattatcac tttacgggtc ctttccggtg 2340atccgacagg ttacggggcg gcgacctcgc gggttttcgc tatttatgaa aattttccgg 2400tttaaggcgt ttccgttctt cttcgtcata acttaatgtt tttatttaaa ataccctctg 2460aaaagaaagg aaacgacagg tgctgaaagc gagctttttg gcctctgtcg tttcctttct 2520ctgtttttgt ccgtggaatg aacaatggaa gtccgagctc atcgctaata acttcgtata 2580gcatacatta tacgaagtta tattcgatgc ggccgcaagg ggttcgcgtc agcgggtgtt 2640ggcgggtgtc ggggctggct taactatgcg gcatcagagc agattgtact gagagtgcac 2700catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgccat 2760tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta 2820cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt 2880tcccagtcac gacgttgtaa aacgacggcc agtgaattgt aatacgactc actatagggc 2940gaattcgagc tcggtacccg gggatcctct agagtcgacc tgcaggcatg caagcttctc 3000ttgtgccggt tgtacgctgt caggtcacac tggtgagtta ggcagggcac agatgcccag 3060agcagaggga actttccttg gggattcaac acgtgcaagt cttaggggct ggcaaatcct 3120gccctcagct agagaggggg cttttatttg agaccagaat cacctgagca tcctcctgtc 3180cccagctgtg tccagcctgt ctgcagggac atcctgagag gaccaggctc tcccctcatc 3240cacctgccta agtgccactc tgaaccctgt ccacctgtgc cgtggagggg cgtgacctca 3300agctgctcag ccagcagcag gcttggccct ggggggcagc agagacccag gtggctgtgg 3360ggtgggtgct tcgtggcgtg gttctgaaac ttcgttggaa gtgtgtggac agtgccttgc 3420ctgttctctg tgggacccta tttagaaacg aggtctgagt tactgggggt catcactgtg 3480ttctgatggc ccagctgtgt ggaggccgcg gtgcagcccc atccaaggag ccagggccct 3540gggtctagcc gtgaccagaa tgcatgcccc ggaggtgttt ctcatctcgc acctgtgttg 3600cctggtgtgt caagtggtcg tgaaactctg tgttagctct tggtgttcct gaaagtgccc 3660ccgggtctca ggcctcagaa ccagggtttc ccttcatctc ggtggcctgg gagcatctgg 3720gcagttgagc aaagagggcg attcacttga aggatgtgtc tggccctgcc taggagcccc 3780ccggcacggt gctggggcct gaagctgccc tcgggtggtg gagaggaggg agcgatgaag 3840tggcgtcgag ctgggcagga agggtgagcc cctgcaaggt gggcatgctg gggacgctga 3900gcagcatggc cagcagctgg gtctgcagcc tggtacccgg cgggacttgt ggttggggct 3960ggtttgtggc caggagaggg gctggcagga gacaaggggg actgtgaggc agctcccacc 4020cagcagctga agcccaatgg cctggctgtg tggctctcag ctgcgtgcat aacctctcag 4080tgcttcagtt ctctcatttg taaaatgagg aaacaaacag tgccagcctc ccagaggtgt 4140catgaggatg aacgagtgac catgtagcat gggctgggtg cgtgtcacct aacatcacca 4200gcctttgcaa ggagagccct gggggcctgg ctgagtattt cccttgcccg gcccacccca 4260ggcctagact tgtgcctgct gcaggccctt gacccctgac cccattgcac ctgtctccac 4320aggagccgag gaggtgctgc tgctggcccg gcggacggac ctacggagga tctcgctgga 4380cacgccggac ttcaccgaca tcgtgctgca ggtggacgac atccggcacg ccattgccat 4440cgactacgac ccgctagagg gctatgtcta ctggacagat gacgaggtgc gggccatccg 4500cagggcgtac ctggacgggt ctggggcgca gacgctggtc aacaccgaga tcaacgaccc 4560cgatggcatc gcggtcgact gggtggcccg aaacctctac tggaccgaca cgggcacgga 4620ccgcatcgag gtgacgcgcc tcaacggcac ctcccgcaag atcctggtgt cggaggacct 4680ggacgagccc cgagccatcg cactgcaccc cgtgatgggg taagacgggc gggggctggg 4740gcctggagcc agggccaggc caagcacagg cgagagggag attgacctgg acctgtcatt 4800ctgggacact gtcttgcatc agaacccgga ggagggcttg ttaaaacacc ggcagctggg 4860ccccaccccc agagcggtga ttcaggagct ccagggcggg gctgaagact tgggtttcta 4920acaagcaccc cagtggtccg gtgctgctgc tgggtccatg cgtagaaagc cctggagacc 4980tggagggagc cctttgttcc cctggcttca gtttcctcat ctgtagaatg gaacggtcca 5040tctgggtgat ttccaggatg acagtagtga cagtaagggc agcctctgtg acactgacca 5100cagtacaggc caggcctctt tttttctttt tttttttttg agatggagtc tcactctgtc 5160gcccaggctg gagtgcagtg gtgtgatctc agctcactac aacctctgcc tcctgggctc 5220aagtgattct cctgcctcag cctcctgagt agctgggatt acaggtgcct gccactgtgc 5280ttggctaatg tttgtatttt tggtagagat ggggtttcac cgtcttggcc aggctggtcg 5340caaactcctg acctcaggtg atccacctgc ctcagcctcc caaagtgctg ggattacagg 5400catgagccac cacgcccggt caggccaggc ctcttttgaa cactttgcac accatgggtc 5460ttttcatcca ggggggtagg tacagttgta cagttgagga cactgaagcc cagagaggct 5520cagggacttg cccagggtca cacagcagga tgtggcaggt gtggggctgg gcctggcagc 5580gtggctccag ctttccagca tagaaatctg tgaaagcaga tagtttgtcg gtcggtaggg 5640gagactttct gagacccgcc ccagcggctc agagggtagt agccaggggc cttcctgggg 5700gctcataacc cagaacactg aatgggaaaa ccctgatgga ggaggcgcag tggagctgtg 5760ggtgccgatg ggaagtccca gaggagctgg gaggtcagta gcggtgctgc cctctgtgga 5820gcacttagtg ggcaccaggt gtgtttccag gttcatggcc ctgggacctg aagctcagaa 5880ggtgaagtaa cttgcccagg gcacccgtcg ggcagcggcg ggcagaggat ttgtgggctg 5940tggagcctgt gctcgtggcc cagccctggg ggttgtgagt gtgctggccg gggagctttt 6000cctgcaagtg gactggtgtc taggagccag catgtcaggc agcaggcagc gggagtgcag 6060caggcagcgg gagcacagca ggcagagggc ggggctcgag cagccatccg tggaccctgg 6120ggcacggagg catgtgggag agggctgctc catggcagtg gctgaagggc tgggttgtgc 6180cccgaggagg gtggatgagg gtaagaagtg gggtccccag gggctttagc aagaggaggc 6240ccaggaactg gttgccagct acagtgaagg gaacacggcc ctgaggtcag gagcttggtc 6300aagtcactgt ctacatgggc ctcggtgtcc tcatctgtga aaaaggaagg gatggggaag 6360ctgactccaa ggcccctcct agccctggtt tcatgagtct gaggatccca gggacatggg 6420cttggcagtc tgacctgtga ggtcgtgggg tccagggagg ggcaccgagc tggaagcggg 6480aggcagaggg gctggccggc tgggtcagac acagctgaag cagaggctgt gacttggggc 6540ctcagaacct tcacccctga gctgccaccc caggatctgg gttccctcct tggggggccc 6600cagggaacaa gtcacctgtc ctttgcatag gggagccctt cagctatgtg cagaaggttc 6660tgctctgccc cttcctccct ctaggtgctc agctcctcca gcccactagt cagatgtgag 6720gctgccccag accctgggca gggtcatttc tgtccactga cctttgggat gggagatgag 6780ctcttggccc ctgagagtcc aagggctggt gtggtgaaac ccgcacaggg tggaagtggg 6840catccctgtc ccaggggagc ccccagggac tctggtcact gggcttgccg ctggcatgct 6900cagtcctcca gcacttactg acaccagcat ctactgacac caacatttac aaacaccgac 6960attgaccgac accgacattt accgacactg acatttacca acactgttta ccaacactga 7020catctactga cactggcatc taccaacact gacatttacc gacactgaca tttaccaaca 7080ctatttacca acactgacat ctactgacat tggcatctac caacaccaac atttaccgac 7140accaacattt accaacactg aaatttaccg acaccgacat ttaccgacac cgtttaccaa 7200caccgacgtt taccgacacc gacatttacc gacactgata tttaccaaca ctgacatcta 7260ctgacgctgg catctactga caccgatgcc agcatctacc aacaccgaca tttaccaaca 7320ctgacattta ctgacactga tatctactga cactggcatc tactgacacc aacatttacc 7380aacaccagca tctaccaaca ccgacattta ccaacaccag catttaccaa caccgatgtt 7440taccaacgcc gacgtttacc gacgccagca tctaccaaca ctgacattta ccgacaccga 7500catttaccga cactgacatt tactgacact gacatctact gatactggca tctaccgaca 7560ctgatattta ccaacgccag catctactga cactgatgtt taccaacacc gacatttacg 7620agcaccgaca tttactgaca ccaatattta ctgacatcaa catttagcca tgtgatgggg 7680gccggcttgg gggcaggcct tgctcttggc actggggatg ctgcagagac cagacagact 7740catggggtca tggacttctg cttcttctcc agcctcatgt actggacaga ctggggagag 7800aaccctaaaa tcgagtgtgc caacttggat gggcaggagc ggcgtgtgct ggtcaatgcc 7860tccctcgggt ggcccaacgg cctggccctg gacctgcagg aggggaagct ctactgggga 7920gacgccaaga cagacaagat cgaggtgagg ctcctgtgga catgtttgat ccaggaggcc 7980aggcccagcc accccctgca gccagatgta cgtattggcg aggcaccgat gggtgcctgt 8040gctctgctat ttggccacat ggaatgcttg agaaaatagt tacaatactt tctgacaaaa 8100acgccttgag agggtagcgc tatacaacgt cctgtggtta cgtaagatgt tatcattcgg 8160ccaggtgcct gtagacacag ctacttggag actgaggtgg gaggatcgct ggagtccaag 8220agtttgaggc cagcccgggc aaaggggaca caggaatcct ctgcactgct tttgccactt 8280actgtgagat ttaaattatt tcacaataca aaattaagac aaaaagttaa tcacatatcc 8340actgccctgc ttaagacaga aaacatgggt gttgttgaag ccagaggcag ctgctggcct 8400gagtttggtg attggttcct aagcagttga aggcagtttt gtttttccat agatgtctgt 8460tctccctttg ctgggtgcag cctcgccctg ctgctgtggt cgggtttcag tggcctcgtc 8520ccgtggacgc agcctcgccc tgccgctgtg gtcgggtttc agtggcctcg tcccgtggac 8580gcagcctcgc cctgctgctg tggtcgggtt tcagtggcct cgtcccgtgg acgcagcctc 8640gccctgccgc tgtggtcggg tttcagtggc ctcgtcccgt ggacgcagcc tcgccctgcc 8700gctgtggtcg ggtttcagtg gcctcgtccc atgggcgtgc tttggcagct ttttgctcac 8760ctgtggagcc tctcttgagc ttttttgttt gttgtttgtt tttgtttgat tttgtttgat 8820tgtttgtttt tgttgtcgtt gttgttgccc aggctggagt gcagtggcgc gatctcagct 8880cactgaaacc tctgcctcct tgggttcatg ccattctcct gcctcagcct cccacatagc 8940tgggattaca agtgcccgcc accacgcctg gctaaatttt gtatttttag tagacagggg 9000gtttcaccat gttggtcagg ctggtctgga actcctggtc tcacatgatc cacctgcctc 9060ggcctcccaa agtgttggga ttacaggcgt gagccaccgc gcccagcctc tgttgagcat 9120attttgaggt tctcttggtg ccagtgatat gtacatgtgt ccccatcgca ccatcgtcac 9180ccattgaggt gacattggtg cctctcctcg gggtggatgc ctccctctgt ttccagcaac 9240ttctgaagga ttttcctgag ctgcatcagt ccttgttgac gtcaccatcg gggtcacctt 9300tgctctcctc agggctccca ggggaggccc gaatcaggca gcttgcaggg cagggcagga 9360tggagaacac gagtgtgtgt ctgtgttgca ggatttcaga ccctgcttct gagcgggagg 9420agtctcagca ccttcagggt ggggaaccca gggatggggg aggctgagtg gacgcccttc 9480ccacgaaaac cctaggagct gcaggtgtgg ccatttcctg ctggagctcc ttgtaaatgt 9540tttgtttttg gcaaggccca tgtttgcggg ccgctgagga tgatttgcct tcacgcatcc 9600ccgctacccg tgggagcagg tcagggactc gcgtgtctgt ggcacaccag gcctgtgaca 9660ggcgttgttc catgtactgt ctcagcagtg gttttcttga gacagggtct cgctcgctca 9720cccaggcgag agtgcagtgg cgcaatcacg gctcgctgta gcctcaatct ccctgggctc 9780aggtgatcct cctgcctcac cctctgagta gctgggacta cagacacata ccaccacacc 9840cagctagttt ttgtgtattt tttgtggggg gagatggggt ttcgctgtgg tgcccaagct 9900gatctcaaac tcctgaggca caagcgatcc acctgcctcg gcctcccaaa gtgctgggat 9960gacaggcatc agccgtcaca cgcagctcaa tgattttatt gtggtaaaat aaacatagca 10020caaaattgat gattttaacc attttaaagt gaacagttca ggctgggcgt ggtggcttat 10080gcttgtaatc ccagtacttt gagaggctga ggtgggcaga tcacctgagg tcaggagttt 10140gagaccagcc tggccaacat gatgaaatcc agtctctact aaaaatacaa aaattagccg 10200ggcatggtgg caggtgcctg taatcccagc tactcgggag gctgaggcag gagaatcgct 10260tgagcccggg aggtggaggt tgcagtgatc tgagatcatg ccactgcact ccaatctgtg 10320tgacagagca agactctgtc ttgaaaaata aataaataaa aaaaatttta aaaagtgaac 10380aattcagggc atttagtatg aggacaatgt ggtgcaggta tctctgctac tatctacttc 10440tagaacactt tcttctgccc tgaaggaaac cccatgccca ccggcactca cgcccattct 10500cccctctctc ccagcctctg tcaaccacta atctactttc tgtctctggg ggttcacttc 10560ttctggacgt tttgtgtgac tggaatcctg caatatgtgg tccctgcgtg tggcttcttt 10620ccatagcatt gtgttttcca gattcaccca cacattgtcg cacgttatca gaatctcatt 10680cctgactggg tgcagtgggt taggcctgta atcctaacat tctgggaggc caaggcggga 10740cgatcacttg aggcaggagt ttgagaccag cctggccagc ctagcaagac cccagctacc 10800aaaaaatttt aaaagttaac tgaacgtggt ggtggtgggc acttgtggtt cccagctacc 10860tgggaggctg aggtgggagg atcgcttaag cccaggaggt caaggctgca gtgagctatg 10920atcgcaccac tgcactccag cctggacaac agagcaagac cctgtctgaa aaaaaaaaca 10980aaaaaaaaag ttcctttctt tttgtggctg gatgacatcc cattgtatgg ccacagcaca 11040ttttgtttgt ctgtttatcg ggtggtgggc agtggtttcc accttttgtc tcctgtgaat 11100aatgctgctg tgaacatttg aattcaagtt tttgtttgaa cacctgttgt gaattatttg 11160gatatatgtg taggggtagg attgctgagt cctatggtaa tgttaggttt gacttactga 11220ggaaccatta aactgttttc aacagtggct gcgccgttct gcatccccac cggcagtgtg 11280tgagggttct gactttacct cctcacaaac gcttcttttc catttaaaaa aatattcagc 11340caggtgctct ggctcacgcc tgtaatccca gcactttggg aggccgtggc gggcggatca 11400cctgaggtca ggagttcgag acgagcctgg ccaacatggt gtaaccccat ctctaccaaa 11460aatataaaaa ttagccgggt gtggcagcgg gcgcctgtaa tcccagctac ttgggaggct 11520gaggcaggag aatcacttga acccgggagg cagaggttgc agtgagccaa gatcgcgcca 11580ctacactcca gcctgggtga caagagtgaa actccatcta aaataaaaca aaaataaaaa 11640taaataaaaa tttattaaaa cattcatcac agccagccta gtgggtgtcc catgtggctt 11700tgcctcgcat ttccctgata actaggatgc tgagcgtctt gtcccaggct tgccacacct 11760cagcactttg agatacgtcg cacagtcccc atttgcgaac gagaaatgag gtttagggaa 11820cagcagctgt gtcatgtcac acagcgagca gggggtctct gagccgtctg accccacagc 11880cgaccaagct ccaatcctta ccgcctccta gtgttgtgga tgtagcccag ggtgctccca 11940catttttcag atgagaacac cgaagctcaa aacaggagcg ttttgtccac attggataca 12000cgatgtctgt ggtttggtcc tgaagtcact ttatatctca gtggtccaga ctggagtagg 12060acagggggtt ctggggaatg gggaaggtgt ctcaggtgaa aggaaggaat tccagattct 12120ccatactgtc cttgggaagt tagaagactc agagggtctg gcaaagtcag acaaagcaag 12180agaaatgcag tcaggaggaa gcggagctgt ccaggaacag gggggtcgca ggagctcacc 12240cccaggaact acacttgctg gggccttcgt gtcacaatga cgtgagcact gcgtgttgat 12300tacccacttt tttttttttt ttgaggtgga gtctcgctct cttgcccagt ctggagtgca 12360gtggcacgat ctcggctcac tgcaagctct gcctcccggg ttcatgccat tctcctgcct 12420cagcctcccg cgtagctggg actacaggcg cctgccaccg cgcccggcta atttttgtat 12480ttttagtaga gatgggattt cactacatta gccaggatgg tctcgatctc ctgacctcat 12540gatccgcccg tctcggcctc ccaaagtgct gggattacag gcgtgagcca ccgcgcccgg 12600cccgatttcc cactttaaga atctgtctgt acatcctcaa agccctatac acagtgctgg 12660gttgctatag ggaatatgag gcttacaggc catggtgctg gacacacaga agggacggag 12720gtcaggaggt agaagggcgg agagagggaa caggcggagg tcacatcctt ggctttcaaa 12780atgggccagg gagagacacc ctctgagcat ggtaggacag gaaagcaaga ttggaacaca 12840ttgagagcaa ccgaggtggc tgggcgtggt ggcttacgcc tgtaatccca acactttgga 12900aagctgaggt gggtggattg cttgaggcca ggagttcaag accagcctgg ccaacatggt 12960gagaccccgt ctctactaaa tatacaaaaa ttagccaggc gtgatggtgc atacctgtaa 13020tcccagctgc ttgggaggct gaggcaggag aattgcttaa acctgggagg cggaggttgc 13080agtgagccga gatcccgcca ctgcactcca gcctgggcca cagagtgaga ctccatctca 13140aaaaaaaaaa aaaaaaaaga taaaaagacc aaccgaggaa ttgaagtggg ggggcgtcac 13200agtagcagaa gggggatcgt ggagcaggcc accctgtggt catgcactgg aagctcatta 13260cctgacgatt tggagctcat cactgggggc ctaaggagaa tagatactga aggatgagga 13320gtgatggcgc ggggcacggg tgtctttggt ggccagaact tggggactgc tggggtgcct 13380cactgcaggc cttctcagcg ccctttatat gcttacacag gctgtttcta agagggggat 13440acattgcata agcgttttca gactacctca tcatgggtcc ctttctttac cctctgtggc 13500cctggtggcg cactctctgg gaaggtgcag gtggatgccc agacccgccc tgccatccac 13560ctgcacgtcc agagctgact tagcctcgag attgctgctg gcacctcctg ccccgggaca 13620cctcggatgt gcccgtggag atgctggctc tgtgttttct gctggagttt ggtgcgtctt 13680ttcctcctgc aagtggccac cgctcttggg tatgtcctca ggcttctgcg agtcatggct 13740gcttctcagg tccttgccca gcgccaggag caaaccctcc tggcactttg ttcaggggtg 13800gatgcgccag tgttcctgct gtggaccccc atctcacatg agggtcttgg gcctgcaggc 13860tcgttcagga aacacccgct gagtacgcag tgtgtgccag ctgtgtccca ggcaatggcg 13920gggacagtgg ctgctgctgg ggttgtggtg gcttctgggg actctgggga cagctgaggt 13980gcaaggagcc acggctcctt gaggatgcag ttggactcca ggtggaaggg atggttgggg 14040gaggtataaa tggggtcagg gaggagacac atttggaaca atgggaacat ttttaagatg 14100ctatgtcggg aggcaacaag gtggccaacc caggtgctga ggagcccaca ccagccctgg 14160acgtgttttg ccgctcacct ttgctgggga gtggtgggag agaggattcc gttccacgtg 14220gtggtgtgcg cagctgggct gtgtggagct gggcgctagg aggaaggtgc tttctgcggg 14280gctagccggg ctctgccttt gaacacaatc aggctccagg ttttcagcat ccagtgcatg 14340agaggacttc acgggcagct gtggctgatc ccttgatgaa ttgggagaag aacaaaggtc 14400tatgaaatga ggtttcatgt agatggcatt agagacgccc acaacagatt tacagagtgg 14460agcggagacg gcggatgggt ctgggaggcc cctcctgctg gccttgactg tgacagctgt 14520cctgggaatc agcttccagg ccgccccagc agcctgactg acacacacag gggttttagc 14580cccatcctgc gaccagctgt tgccatcatc agtgacagct gggagtggcg gtggttccag 14640ccctgggcac cctccccacc tgctggggcc cacccagggc agtcctgaca cctacaggtt 14700gcttggagcc gcatccgagt cctgccccac cacgtgtgaa gcccgagtgg tcgtgggctg 14760aggtcccctg attgcatccc cacttccctt ctgcttcaca tagctgcctc ttctcaccgt 14820ttttccagcc tcctgggcta ggaattccag tgttgtgctg gctttgcccc aggacacctc 14880cttagccctc ttcctgagtc tagagccccg ggggttggaa gttctggccc ctgggacacc 14940tgcagccaca ctcagcttct cctgtgagcc tccagcatgt cccctcagga ccaagccctc 15000acgttcttgc ctccccgccc acctgggctc agccagggga aggcctggct gggagcgtct 15060cccctctgcc ctgcccttct cccctctacc ctgcccttct ctcctctgcc ccgccatggc 15120ttttatatcc tgtgccacaa gacatggctg tgtgtgaaag tggcagggtc tggcatctct 15180gtgggtctct gaggcccacg ctccagtgcc actcttccca cccgctggcc gtgccctcat 15240gctggaggga cagcccagcc ctctcccgaa ccccagcccc atgtgcccag ctgcccccgg 15300ccctctcccc tggaagccgg ggtcactcca gccgtatgcc atggtgggga catcctgctt 15360ccttggcctt ccagggaagg tcctctttcc aaatggcgac acctggtccc tgcctggagg 15420ctggaagctg tggcccttgt atgcccctcc agggtctgtg cgctcggttg gcccgagttc 15480ccatcaccgt catcatcacc atcatcattg tcatttcgct tgtctgtgag ccggcctggt 15540ctcccagagc agagaccctc tgaggtccag cctgagttgg ggtctccgtg ctgacccctg 15600acggggactc aggacgtacc aggtctgggt caggagtgac ccccaaacct cgtgcccttt 15660gacaggcacc cctgactttt gctaagtggg tggaggtgac atcacttaca gcgggagtga 15720tgggacaggg tctgttggct gcactgtgct cccagggatc tggggagagg ctatatccct 15780gggctttggc actgcagagc tgtgtgtgtt tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 15840tgtgtgtgtg tgtgtgtgtg tttgcgtgcg cgcacatgtg tataagatct ttttttatta 15900catgaagcaa gataactgtt gctgtttcct tttgggtttt gtgttcaaca gagtggggta 15960cttcttccct cagacaacag aactctcccc tttaaacacg tgctgtcaga gggtgggtct 16020tgggctcatg tctgtttgca cagccgagtc agaggaaaca cagggttctt cataaaaaca 16080ctgcacagca ggcgactgtc cagagtcagc ctgcaggacg gcagcagccc tgcccctcag 16140agcacagcta gggtgggctg ctttgggatc tcccgtcatt ccctcccagc tggcagccgg 16200cggccggccc attccttggt gtgctggtca ggggggcgtg cgcctgctct gctcaccctg 16260ggaatgggac agaagctggc agctcggaga ggacagggct ggacccttgg gtggcctctg 16320gctggaccat ctcattgtcc tcagacacag cctctcgggt ctagtttcat ttcctgaaaa 16380acaagtgcac agaactagag caggagtcga gagctacggc ccccgggcca gatccagccc 16440tgccacctgt tttcacacca tgctcaagct gagtgggttt tacatttttt aattacttga 16500aaaaaaaaaa gccaaaggag gtttcatgac ccatgaaaat tatatggaat tcaaaaaaaa 16560aaaattatat ggaattcaaa tttcagtgtc cataaataat ttcttgagac agggtctcgc 16620tctgtcaccc aggctggagt gcagtgctat ggcatggctc gctgtaccct tgacctccca 16680ggctcaagcg atcctcctgt ctcagcctcc tgagtagctg ggactacggg tgtgtgccac 16740caagcccggc taattttttt ttaattttag taaagacagg gtctttctat gttgcccagg 16800cttttctgga actccatctt ggcctcccaa agtgctggga ttacaggctc gagccacgga 16860gcccagcctg tttttgtttt ttcactgata aagttttgcc gggtgtggta gtgtgtgcct 16920ctagcgattt gggaggctga ggtgggagga tcgcttaagc ccaggagttt gaggctgggc 16980tcaagtgatc aggaggtgaa ctatgatcat gtcattgcat tccagcctgg gtgacagagc 17040aagaacctat ctcttaaaaa tatatattta aaaagtattg ggtgtggtgg ctcacgcctg 17100tggtcccagc tacttaggca tctgaggtgg gaggatggct tgagcccagg agtttgaggt 17160tgcagcgagc caagatcgtg tcactacact ctagcctggg tgacagagcc cagaccctgc 17220ctctttaaaa aaaaaaacca aaaaacatgt attggaacac agccatgcct gttcagtcac 17280gtgctctcca tgctgctttc tgctccagag acccttatgg cctgaaagct gaaaatattt 17340tctatccttt acaaaaaagt ttgctgacct ctgtcctgga aaattcatct cccaagttct 17400cttccggcac tggcgttcct gggtgtccta aatttggccc ctgttatttc tgaactctgt 17460tttggctctg ttccctccca ggagccagga caggcacgtt ctctgcatct tgtcccctga 17520cgcccagagg cttggctcgg ctcaggcatt cttggaaata tctggctcca ggaaaggcag 17580aggcctcctg agtcagccca gagggaacct gccccaggtc tgggggaggc ctgacccagc 17640agagtggctt ttgccgatgg gttgggccgg tcaagatgtg ctgaaagttg tcctcagaag 17700gccactttgg gattccttcc tccagtatta gagcaactga gagctgctca ttgcaagcct 17760gatgttttcc cagttggccg ggtccaccgg gtgccctggg attctgggat ctgggtggaa 17820agtagggggc ttgggggagt gtcctgggtt ctggaatcca ggtggcaagt ggtgaggttc 17880agggagtggc ttctgagcca ccataggggt ctctgtggga ggctctgccc atccaggaga 17940ttccgcaggc cctgccggcc cagagccagc gtcttgcgct tgccgaggct acagccagcc 18000ccagccgggt ggaacagccc gtcgcctcct ctcactttgt tttggggcca cctgggagtg 18060tggagcaagg gtagagaggg aggaagtggc tgccggccgc tgcccagcac ccttgtttgc 18120cttgggccct ctgtgggctc ctttttattg ctcttcaatg aagccaggga aatggacttc 18180cttgcctcac ttcagttcaa catgtctgga agtttggtat taaaattaag aaagtgtgga 18240aatagagcaa gaagagaaaa atctctccaa gagataatag tgacctctga gctgggcgcg 18300gtggctcacg cctgtaaatc ccagtacttt gggaggctga ggcgggcaga tcacctgagg 18360tcgggagttt gtgaccggcc tgaccaagat ggagaaaccc cgtctctact aaaaataaat 18420aaataaataa ataaataaat acaaaattag ccaggcatgg tggcgcctgc ctataatccc 18480agctaaggca ggagaatcgc ttgaacctgg gaggcaaagg ttgcagtgag ccaagatcac 18540gccattgcac tctagtctgg gcaacaagag tgaaactccg tctcaaaaaa aataaataaa 18600taaaaaataa aaatagtgac ctctggccag gtgtggcagc tcatacccgt aatcccagca 18660ctttggaagg aaggccgaga tgggcagatt gctttagcac aggagtttga gaccagcctg 18720gccaacatgg tggaacccca tctctacaaa aatagaataa aatttaagag gtaatagtga 18780ccttttggta gatcgaaacc tggattgctt tctttttcta aatgctgatt cttttctttg 18840tggtgtttgt gttctgtgcc gatgtccctc ccccagccct gttattgtga gtggaagaag 18900gggaaagggt tcgcccgcta ctgtgagccc ctcctctcac gctgggtgtc cttggagaag 18960cctgcacttc ttcattgtac gccagggctg ggtccctccc tggagtggtt ctgtgctgct 19020gggatggggc caacccctca gatgttttct gagtgtcaca cacaggtgtg tgcattcatg 19080gcctttgcgt gtcttcctgt tgtggaggca aaaatgtgaa gaaccctaga tgattttggg 19140accagggctc catcacctgc tgttcattgc acaccggagc atccaggcat gggtggagag 19200ctcagacttc caggcacggt cgcaggggct ggtctaacca tgttcccgcc cgcctgctcg 19260tcagaaccgc ctgttgggag ctgttatcat gataccatac ctgggccctg ggctatccga 19320ttctgactta attgctccag gttggggcca ggccgttgtt tgctgttttg ttgtttcttc 19380tgtgacgtta gccactgggc taatctgagc ccctcagtta caggtggaga aactgagacc 19440catgggggtg caaggacttg ccgaggaccc agagcccctt gggggcagag ctgaggcggg 19500gcctggcttt gggtcccaga gcttccagtc cccttcccgc tctcctaaca gctttttttt 19560ttgagacaag atctcaccct gtcacccagg ctggagtgca atggcatgat ctcggctcac 19620tgcaatcttc gctagctgcg ttccagcgat tctcctgcct cagcctcccg agcagctggg 19680attacaggtg tgtgccgcca tgcccagctc gttttttttt gtacttttag tagagatagg 19740gtttcaccat gttggccagg ctgatctcga actcctgacc tcaaatgatc cgcctgcctc 19800ggcctcccaa agtgctagga ttacaggctg ggatcacact gtgcctggcc ctagcagctt 19860tgtcctgtgc catccaacaa cagatgaccg aagtctttgt ttcttaacat gcattccatc 19920tgccttacag ttttgccacc tgcaaaacag aggacttgtc gcttttctgg taagctggaa 19980atgtaatctg gtagcaggag gcctgtggaa gcttgccttt aatggccttg tgtctctttc 20040atcctgtcct gagagccgga gaacttggat gttgcaccta actcaacctt cctgttaaca 20100tacagttctg caggctcatg gatcatcaga accacgtcct atctcacgcg gctgtatgct 20160tccgttggtt caggtgtttt taccttgaca gtattttctc ctcggtggct tttgcggtgg 20220ttgcttttaa tcagcattga ctcttcaaga aaaatattta gctgctacat ctcagaggag 20280acagggtgga aagcatctga gacctgcagg ctcagactta gaaccagaag tgccctcaga 20340gttcatccgg ccctgaccca gcgggaaatg agttcacaga gaagcgggag aactttgccc 20400caggccctgc cgttgctcat aactgcccca ggtccttaca tttgctccag gtcctgcccc 20460aggccctgca gttgctcata actgccccag gtccttatat ttgctccagg tcctgcccca 20520ggtcctgcag ttgctctgtg tggtgggtgt gatctggagc cctccgccca ttgctgcacc 20580tggggcaggc attgctaatt gatcccagga ctccttcctg cggagcacgc cctggttctc 20640caggcagccg ctgcctgtca gcctgcagtg gttcgggaga ggacacctgc ttgcctggtc 20700tgttccaaat cttgcttctc atcccagcac aggtaggggg tgctatggga aagggatcct 20760cagttggccc tgtcactgct ctatcagctg gggacgtggc atcctagtga aaacatcatg 20820gccgggcgcg gtggctcacg cctggaatcc cagcactttg ggaggctgag gagggtggat 20880cacttgaggt cagaagttcg agaccagcct ggtcaacatg gtgaaaccca tctctactaa 20940aaatacaaaa attcgccagg tgtggtggcg ggtacctgta atccgagcta ctcgggaggc 21000tgaggcagga gaatcgcttg aacctgggag gtggagcttg cagtgagccg agatcttgcc 21060actgcactcc agcctgggca acagagtgag acgctgtctc aaaatctcaa acaaacaaac 21120aaacaaaaaa caaacaaaca aagcgtcatt tatccagcac ccctggggaa ccatgctacc 21180tggtgtttta tggtacctgg caaggtgcag gtgaagttgc tgctcttggg cattgaaccc 21240gtcttgtttg gggcagctca ggccccaggc agggtccggg ttggctctcg ttggtgtggc 21300cctggcccat ccagacctat atttctgccg tcctgcaggt gatcaatgtt gatgggacga 21360agaggcggac cctcctggag gacaagctcc cgcacatttt cgggttcacg ctgctggggg 21420acttcatcta ctggactgac tggcagcgcc gcagcatcga gcgggtgcac aaggtcaagg 21480ccagccggga cgtcatcatt gaccagctgc ccgacctgat ggggctcaaa gctgtgaatg 21540tggccaaggt cgtcggtgag tccggggggt cccaagccat ggctcagcca tgcagacttg 21600catgaggagg aagtgacggg tccatgcctg ggcataagtg ttgagctcag gtgccccgac 21660ctggggaagg gcaggacagg aaaggtgaca gtatctggcc aaggacagat gggaagggac 21720caagggagct gattagggag tggttatgga ctaggaatgt cggtaacaat ggttagaaag 21780tgactaacat ttgttgagca cctgctgtgt gcccggccct ggccgggagc cttcgtgccc 21840acagtgaccc cgtctgcaaa tgtagttcct tgccctactc gcactgggga gcaggacgca 21900gagccgtgca tctcacaggt gccaagctca ggactccctc ctgggtctgc ctgggctggg 21960ctgtgcttgt tgcccctgtg gcccacgcat gtgcaccttc cacctgaaag ccaggatctt 22020caggacgctc cccgaggagg tcgttgtctg gcacaatgat ttgtctcttc ctgaaaaggt 22080gacagagtta cactggagag agcagcatcc aggtgcggca gggacaggcc tggggctcgc 22140gggcagggac tctgtgtcct gccggggtcc cacactgcac ctgcttgtca gaggcactca 22200gtcaatcttt gctgatgaag gatgagagga cagaggacgt gatgcttgct gctgcattgc 22260ctgcagtcct gggtgagatg cccgggttga ctctgctgcc cgtcgggtgg atgtgatgtc 22320agatccccgg ctttaaaata cgagggagct gggaattgag ggagcaggtt ggggcagaaa 22380gcacagcccc gtggaagcct ggagctgagg cagtgtgggc gacccctgga gcagtgagtg 22440cttccttcat ggccttcatc gcaccctgca gtcctcatgt aggggatgcc atccatgaat 22500ttagttttcc cagcctcctt taaaaacgcg ttcatgctgg ggccggggca gtgcagtggc 22560tcacatctga aatcccacca ctttgggagg ccgaggcggg tggatcatga ggtcaggaga 22620tcgagaccat cctggctaac aaggtgaaac cccgtctcta ctaaaaatac aaaaaattag 22680ccgggtgcgg tggcgggcgc ctgtagtccc agctactcgg gaggctgagg caggagaatg 22740gcgtgaaccc gggaagcgga gcttgcagtg agccgagatt gcgccactgc agtccgcagt 22800ccggcctggg cgacagagcg agactccgtc tcaaaaaaaa aaaaaaaagt acaaaaaaaa 22860aaaaattagt ctgggtgtgg tatcacgcgc ctataatctc actactcgag aggctgaggc 22920ggagaattgc ttgaacccag gaggtagagg ttgtagtgag cccgtatcgt accactgccc 22980tccacctggg caatagagcg agactctgtc tcaaaaagaa aaaaaaaaaa agaacattta 23040tgccaggtgt ggtggctcat gcctgaaatc ccagaacttt ggaagactga ggcaggagga 23100tcacttgagc ccagaaattt gagagtgtct tccctgggca acatagagag acctcatctc 23160taccagaaaa aaaaaaatta gcccggcatg gtggcatatc cctgtggtcc cagctactta 23220gggggctgac gtggcaggat cacctgagtc tggaggcaga ggttgaagtg agctgagatc 23280atgccactgc actccagcct gggtgacaga cagagaccct gtctcaaaaa aaaaaaaaaa 23340aaaaagcatt tactatccac catggaaggt gagactgacc tgtgagtgat tgttcaaaga 23400acaaaaaata aaccccagag ataagacaaa agggtgcctc catgggggtg tgatttaaag 23460ctgagaaatt gggcttcttc cccctcccct ctcaccccgt ggtttgctaa aggagatggg 23520aaaaaggatt ctttttttgg ctgaaatatt taacactaaa ttaaagccaa ttttaacagc 23580actttggttg atgagtgaaa ttaacagact ggccaaaaat aaacgaacgg tctgtactat 23640gtgaaaaaga ggcagctttg gccatgctgg gccaatgtga gttttcaggg ttgctgggaa 23700tgtctgtgaa tcggaggaag ggcctagctg ggactctcag gagccaaggc cctgaggggc 23760aacttgcctg gtccctgccc tgaggcgttc actgctttct tcctgggcca gatcacaggc 23820ccggaggctg gaccactggg ctggcactct tgccgagctg ctccctgact tcctgaccat 23880gctcctttca gcagccttgc tgcactttag tttccttgaa tgaaaaatgg ggatgagaat 23940agctcctacc tccaaggtga atggagtgag ttcggacagg tgactccctg ggaccagtgc 24000ctggcgcctg acaaggtcca gtcagagccc gcactgctgt tactgatacc cttggctgta 24060ccaggggaga acttggttgc cattgccagg tgttctccca ccacccccac tactgtccct 24120gtttgatgtg tggcgggaat aaagctgtgc acattggagc ttttggcaca tcctggcttt 24180caggtgaaag gtgcgtgtgt gtttgagggt ttagcctggc caacccagcc atgaggtcgg 24240acctgacctg ggggtgagtc ctgagctcgg cacccctgag ctgtgtggct cacggcagca 24300ttcattgtgt ggcttggccg cacccctttc cctgctgggc tgttgatgtt tagactggag 24360cctctgtgtt cgcttccagg aaccaacccg tgtgcggaca ggaacggggg gtgcagccac 24420ctgtgcttct tcacacccca cgcaacccgg tgtggctgcc ccatcggcct ggagctgctg 24480agtgacatga agacctgcat cgtgcctgag gccttcttgg tcttcaccag cagagccgcc 24540atccacagga tctccctcga gaccaataac aacgacgtgg ccatcccgct cacgggcgtc 24600aaggaggcct cagccctgga ctttgatgtg tccaacaacc acatctactg gacagacgtc 24660agcctgaagg tagcgtgggc cagaacgtgc acacaggcag cctttatggg aaaaccttgc 24720ctctgttcct gcctcaaagg cttcagacac ttttcttaaa gcactatcgt atttattgta 24780acgcagttca agctaatcaa atatgagcaa gcctatttaa aaaaaaaaaa gatgattata 24840atgagcaagt ccggtagaca cacataaggg cttttgtgaa atgcttgtgt gaatgtgaaa 24900tatttgttgt ccgttgagct tgacttcaga caccccaccc actcccttgt cggtgcccgt 24960ttgctcagca gactctttct tcatttatag tgcaaatgta aacatccagg acaaatacag 25020gaagactttt tttttttttt tttgagacag agtcttactc tgttgcccag gctggagtac 25080cgtagcgtga gctcagctca ctgcaacctc cgcctcccag gttcaagcga ttcttctgcc 25140tcagcctcct gagtagctgg gactacagac atgcaccacc acacccagct aatttttttt 25200atatttttag tagagacagg gtttcatcat gttggccagg ctggtcttga actcctgacc 25260tcaggtgatc tgcccgcctc ggcctcccaa agtgctgaga taacaggtgt gagccaccgt 25320tcccggcata ggaaaacttt ttgccttcta aagaagagtt tagcaaacta gtctgtgggc 25380tggccttctg attctgtaaa gaaagtttga ttggtggctg ggtgcggtgg ctcacacctg 25440taatcccatc actttgggag gccgacgtgg gcatatcacc tgatgtcggg acttcgagac 25500cagcctcacc aacgtggaga aaccccgtct ctactaaaaa tacaaaaaaa aaattaaccg 25560ggcatggcgg cgcctgcctg taatcgcagc tactcaggag gctgaagcag gagaattgct 25620tgaacctggg aggcggaggt tgtggtgagc tgagatggca ccattgcact ccagcctggg 25680caacaaaagt gaaactccgt ctcagaaaaa aaaaagtttg attggtgtaa ccaaagcgca 25740tttgtttatg gattgtctgt ggcagctttt gttctgccga gatgagttgt gacagatctg 25800tatgggctct aaagcctaaa acatgtgcca tccgcccctt tacagaaaaa gtgtgctgac 25860ctctgttcta aagtattgga caactacaat gtttgctcat ttattattct atgatttgtt 25920ttctgctttt tgttgttgtt gttgttgttg agatagggtt tccctctgtc actcaggctg 25980gagtgcagtg gtgtaatctc agctcactgc agcctcgacc tcctgggctc tagtgatcct 26040ctcatctcag cctccctagt agctgggact acaggcacac accaccactc ctggctgatt 26100tttttttttt tttttttttt ttgtggagac agggtttccg catgttgccc aggctggttt 26160caaactccta ggctcaaaca cccacctcag cctcccaaag tgctgggatt acaggcgtga 26220gccaccatgc ccagcctatt ctactgtttg tattacatag ctttaaaaga ttttttatga 26280ctttaagtca caagggttct ttgtagaaaa aaatatatat ataggaaagt ataaaaagaa 26340agtaaaaatt gtccataacc tctccagcca gagacgaccg ttgctgacac ctcagcatat 26400tgcctttaag tcttttttct ctaagatagc atttctcttc atcacagtca tatgctacgc 26460agaattctgt atcctgattt tttcacttga cattacaaca ggtatttgat ggcgctgtga 26520caaactcttt ggcacaatct tttaaatgta tgaaatactc cactgcacag atgtttgctt 26580ttaggcttaa ctgttctttt attttgcgtg tgctggttac agccgggcac agtggctcat 26640gcctgtaatc acaacacttt gagagggtga ggcaggagga tcacttgagc ccagaagttt 26700gagaccggcc tgggcaacat agtgagaccc catctctaca aaaaactttt ttaataagtc 26760gggcgtagtg gtgcatagct gtagtcccag ccaccaagga ggctgagttg ggaggattgc 26820ttgagcccca ggaggttgat gctgcagtga cctgagatta ctccactgta ctccaacctg 26880agcgacagag caagacttgt ctggggaaaa aaaaaaaaaa aatatatata tatatatata 26940tatatataca tatatacata cacgcacaca cacataatat aaaaatatat atttataaat 27000atataatata taatataaaa atatatattt ataaataaaa tttataaatt atatttataa 27060gtaaatatat aatatataat ataaaaatat atattatata atatataata aaatatataa 27120tataaaaata tatatttata aataatatat aatacatact tataagtata tatttaaaat 27180atatgtaatg tatatttttt aatgtatgat atataatata catttataaa tacacattta 27240tattatttta tataaaatat atataaaatc tccaagttgc tttttccaaa aaggtgtctt 27300gctgcatttc aaacattcat ttaaaaactt gaatgctggt gatctggtcc agaatgtgtt 27360cagtagctgc tgccagtggc caagcatctc gggagatgtc tacaaaacac gctggttctg 27420gcctggcgtg gtggctcacg cctgtaatct cagcactttg ggaggctgag gcaggtggat 27480caactgaggt ctggatttcg agaccagcct tgccagcttg gtgaaacccc atctctacta 27540agaatacaaa aaaattagcc aggcgtggtg gcatgtgcct gtaatcccac ctacttggga 27600ggctaaggct ggagaatcgc ttgaacccag ggggcagagg ttgcagtgag ccgagatcgc 27660accattgcac tccaggctgg gcaagaagag cgaaactccg tctcaaaaaa aaaaaaaaag 27720atgctggttc ctaaaatgtg gcccttttcc tcctcacctg ctgccagacc atcagccgcg 27780ccttcatgaa cgggagctcg gtggagcacg tggtggagtt tggccttgac taccccgagg 27840gcatggccgt tgactggatg ggcaagaacc tctactgggc cgacactggg accaacagaa 27900tcgaagtggc gcggctggac gggcagttcc ggcaagtcct cgtgtggagg gacttggaca 27960acccgaggtc gctggccctg gatcccacca aggggtaagt gtttgcctgt cccgtgcgtc 28020cttgtgttca cctcgtatga gacagtgcgg gggtgccaac tgggcaaggt ggcaggctgt 28080ccgtgtggcc ctcagtgatt agagctgtac tgatgtcatt agccttgatg gtggccagga 28140ctggtagggc cctcagaggt catggagttc cttcgtggag cgggtgctga ggctgtatca 28200ggcacagtgc tggctgcttt cacctgggcc gtctcaccga agtgtccatg gagcctgcgt 28260agggtgggta tctgtgtcga ttttacagat gcagaaacag gctcagagaa accgagtgac 28320ttccctaagg tcacataccc agttagagca gagctgggcc aggaagtgct gtctcaggct 28380cctgaccagg tctccttgct ttgcactctt gccaaaacca tgatccagaa ctgactttga 28440ggtccccgga cctcaggctc ctccgaaatg gcctcttgga ggctgctgag ccacagctta 28500ggacccacct cgagaggcaa atgtgctttg agctgccagg cgtcctgggg gccctgcctt 28560gggcacgggg ttcagacagg ccccagatgt gtggggcgtc tttctggact tgagttttct 28620tttctgtgtg gtggacacag tgctcacccc ttaaagcacc tgtgatgtgt gcagcagccc 28680aatccctgcc tgtcgcctgt tctgctaggg aaggaaggaa gacttcagga tggcaggaca 28740acagaaagag gtccaggttt tagagcaagg gcaggtcaaa cttagaaaat tctggaatga 28800ggatgtgcat ttcctcttct ggatctgcta aaagaagagg gaaggagggg ctgctggggg 28860aggagcccag agccgagttt acatccggat cccgcaaggc ctcccctgcc ctgaggtctt 28920gttttgtgat gtgcttgtgt ccatcctggt ttctgccgtg tccccaacat ccggccaagc 28980ttaggtggat gttccagcac acactcaccc tgtctgtgca cctgtttttg tgtccgtaag 29040tgggtattta ctcaccttac gagtgagcca ctgtgggaat tcagggaggt ggcgcagtga 29100ccacccctgg agggatatgt gtgtggcagg ggtcgagggt ctcgcccttc cctgcttcct 29160gcgcgtggct ttctccagga cggggagggc tgagctgaag aggtggggac agttgcgtcc 29220ccccgccacc cactgtcctg cggtgagagc agactcactg agcctgccct tctcccttgt 29280gccttccagc tacatctact ggaccgagtg gggcggcaag ccgaggatcg tgcgggcctt 29340catggacggg accaactgca tgacgctggt ggacaaggtg ggccgggcca acgacctcac 29400cattgactac gctgaccagc gcctctactg gaccgacctg gacaccaaca tgatcgagtc 29460gtccaacatg ctgggtgagg gccgggctgg ggccttctgg tcatggaggg cggggcagcc 29520gggcgttggc cacctcccag cctcgccgca cgtaccctgt ggcctgcaag ttccccaacc 29580tggcaggagc tgtggccaca cccacgactg cccagcagcc tcaccctctg ctgtgggagt 29640tgtccccgtc cacccctggg tgcctttgct gcagttatgt cgggagaggc tctggtgaca 29700gctgtttcct gtgcacctgc tgggcactag gtcccagcta atccctgtgc caggactcta 29760atttcaccct aacacacatg gtggttttca ttgctgggga agctgaggcc tgagcacatg 29820acttgcctta ggtcacatag ctggtgagtt caggatcccc cagagatacc agggccagca 29880ctcgatcccc acccagccct gaaccccacc atgtgctggg attgtgctgg gagtgtccac 29940acgcctggga ccccagggct ggtgctctca tctccttttt ccagatcatg agaatgaggc 30000tcagggaagt ttgaaaaaaa cctatcccaa gtcacacagc aacaggagca ggatttgaac 30060ccagaaaagg ggaccgcaca ctctgttctg ctagagtagt tagctgtcct gggtgatatg 30120gcaggtgaca ggggcaactg tgcttaacaa aggaaccccc atcccccctg ccaagttggg 30180agactagaag gtcaggggca gaagctctga agggccaggt gcagtggctg acacctctaa 30240tcccagcact ttgtgaggcc aaggcgggca gatgatttga gcccaggagt tcaagatcag 30300cctgggtaat gtagtgagac gccatctcta caaaaaaatt ttttaaaaat tagctgggca 30360tggtggttca tgcctgtagt ccaagctact tgggaggctc aggtgggagg attgcttgag 30420cccaggaggt tgaggttgtg gtgagctgtg atcatgccac tgcactccag cctgggcaat 30480agagtgagac cgtctccaaa aaaaaaaaaa gaagaagaaa aagaagctct gaggctccaa 30540gtccccaggc accccttggc ttgagggcag acaagggagg agagggtcac ctgggcagcc 30600ctgacttttg tcccctggca aagggacctt cagtgacctt ggccctagga gagcctctga 30660gcacgtcagc catgtcgaac cgctcaggaa gggcagcaag aatttggctt ctgacctctg 30720cctctcctac tcgccatctg cactgggtgt ggttgtgccc attttacaga tgaggaggct 30780ggggcatcga ccagctgaat gccttgtccc aggtactgcg taggcagagc tggcagttga 30840accccgtgtc ctggttgtcg ctgggggtgg gctgcaccct gacttgtgag gccagtagca 30900aggtttgcac gtgacttcgt gaccgtcacc cagctctgca gcacatcccg tgacccagct 30960catccaggcc gcatgcaaac ctgttgccag gcgagaaacc agtcaccgca cagctgtggt 31020tgcctgaaat gattaagctc attaatcacc ccggagtgag gacagactca gatgaaaacc 31080agcaaaagcc ctggaaactc atgtgaccct gccaatgagg gcggccatgt gcattgcagc 31140ctggccgtca ctcctcggta cgtgttttgg acttaaacgc tccggatgtt tactgagtgc 31200ttgattaata acatggaagg cctggtctca ttgctgtggg agtgaaggat gcacagccag 31260gcctgacatg atgagaacaa gaacctggag tctcgctgcc tgggtggtaa tcctggccct 31320gccacttagc aactgtgtga ctgtagccag gtcacttaat tttgctagat cctgcctgcg 31380cttcagtgga tcttgctggt tttccaaggt ggccaaacac tttaaggcat tcatgtggtc 31440gctaggctgc agggttgaac cctggctcac cccgcagggc gccgtgtgct ctgtggcctg 31500gctgtgcctt tgctgacacc gtgcccgtgt gtgttcatgc aggtcaggag cgggtcgtga 31560ttgccgacga tctcccgcac ccgttcggtc tgacgcagta cagcgattat atctactgga 31620cagactggaa tctgcacagc attgagcggg ccgacaagac tagcggccgg aaccgcaccc 31680tcatccaggg ccacctggac ttcgtgatgg acatcctggt gttccactcc tcccgccagg 31740atggcctcaa tgactgtatg cacaacaacg ggcagtgtgg gcagctgtgc cttgccatcc 31800ccggcggcca ccgctgcggc tgcgcctcac actacaccct ggaccccagc agccgcaact 31860gcagccgtaa gtgcctcatg gtcccccgca cctcactccc tcgttagatc aggctggttc 31920tgggagctga cgctgaaagg agcttctcat ctggggttcc tgggtgtaca tagatggttg 31980ggtaggttgt gcactgcaca agctgcatga tgctacctgg gggtccaggt ccaggctgga 32040tggacttgtt gcttcatcag gacatagata aatggccaaa actcctcagc tggaaggtcc 32100tgggcaggat ctttgggtgt gaaaaccagt cacaggggaa gggtgcttgc tcatactgcc 32160agcacagtgc tgagtgcttt ccatagcgct cgtttactcc tcaagcctgg agggtgggga 32220gtagcatggt cccatttcac gtacaaggaa cccgatgcac agagaggtgt ggcaacccat 32280ccaaggccat acaactgggg tgggttgagc cggggttgac tgtggcaggc tggctcaaga 32340gtccctgctc ctgaaccctt gccaggcagc ctggcatcag ctcggggaat ttttgccctg 32400acccttggaa gcaagtgggc ctctttgttc tcatgtcagt gatgagaaga gtgactttcc 32460tatggcccct ctggagtaca ggtgtttcct gttggcgggc tcttccccca tgacatcagc 32520agcgagctgg ttatgattcc ctacgcagaa cttgatagtt tataaagctc tttgtcatcc 32580aggccccgtt ggagtctcac gcagacctgg tcgcaggcgg ggctggtctt gcctgtccca 32640gctgcatgga tggggaactt gaggcttgca aaggttaagg ggctgttcga ggcccaggct 32700ggcaggagat gggcctgggc cagagtctgg gacttcccat gcctgggctg tctttggtcc 32760tgttgctcac catccctccc tggggccatg accttagaga gccaaatgga ggtgcaggta 32820acccacggca aggaggggtt gccatgactc agagtccccg tcctgtggcc ggcagtacct 32880ggtgcaacga cttggatttc agaccagcca ctgtagcccg ctgacggtgc gctcgaagtg 32940ccacagcttc tgaagccagg caggactcag gccaggagac tctgttagct gttgagaggg 33000agaggccaac ggatgttctg gttctgctag agagctggtt cttcggatcc tggtaccagt 33060gcactgagag gaggcccagc ttgattctgg ggctgccttg tggtggcatg tgctgctcac 33120tgacaccctc gaggagtgtc ttctctcggg cttgttgact gtgcccggtt ttccgcagtt 33180cactggtgca cacataggca catagcaaac cgcacacaca gtcgtgggta tgagtttcac 33240tacattccac caccagtgtt cactaccatt acctgccttc cgtcttaagt gttcatcatt 33300taaaaataaa tttattgggc tggacgcggt ggctcatgac tgttatccca gcactttggg 33360aggctgaggc gggcagatca cctgaggtca ggagttcaag accagcctgg ccaatatggt 33420gaaactccat ctctactaaa aatacaaaat tagctgggca tggtggggca tgcctataat 33480cccagctact caggaggctg aggcaggaga atggcgtgaa cccgagaggc agagcttaca 33540gtgagcccag atagcaccac tgcagtccag cgtgggcaac agtgcgagac tccatctcaa 33600aaaaaaaata aataaataaa agaaaaataa atttatgatc tatttcaaaa ataacacatg 33660tactttgaaa cagcagagac acatatgaca cggagaatga aattccccat agcgcacccc 33720caagagacag ccctggtccc cccgtctttc ccgtggacct ccagcggggc agatgctgag 33780ccgcctgttg tcgagtggcg tgctatcccg tcctccagct cctctgtggc ttacagacac 33840ccacctgcag ccctgtcttt gcctcctcta gcgcccacca ccttcttgct gttcagccag 33900aaatctgcca tcagtcggat gatcccggac gaccagcaca gcccggatct catcctgccc 33960ctgcatggac tgaggaacgt caaagccatc gactatgacc cactggacaa gttcatctac 34020tgggtggatg ggcgccagaa catcaagcga gccaaggacg acgggaccca ggcaggtgcc 34080ctgtgggaag ggtgcggggt gtgcttccca aggcgctcct cttgctggtt tccaggctgc 34140tgcccctgtc cttagcagag ggaggaaaca gaggatggct ctgggtgaat gatgacttgg 34200gcttcgatta tgtagtcaca gggtatgacc ctgagatgcg tggaaccccg agactgtgat 34260tatatgtaga aactgggttt ccccgttgtt taagtagtca tggtggggtc agaccccaca 34320ggacttttgt cttttcaaga aagaaaatgg tcgtgtgtca tgcaggggta gttggtactg 34380gttaatccag gtttatcctt tattttgtgg gaactgtaca gtcatttctg ctacaatgct 34440gtatatgctc ttctgaaaga cacctatgca aaatcgcaca gtaaaaatga cacaactcat 34500agggaaagcg gggccagggc acagccctca aaatctccat caatgacatg taagaaaaga 34560gaggaacctg ggaaatagca aagtgccttt tgcacattaa atggttagct atatcccaca 34620atactgtgca ttcgtaaacg ttaatgctgc aataaatacg gcacttcacc ttgggaagat 34680ctggagttgg cttatgagtg tggaagggtg tagcgcatga gtttttgtga aacactggaa 34740ggaggattgt gggaaatcaa atggaaagtt ctcaccccag gcgtggagaa gagtgggtca 34800tggccccagc agtgagccca gggaggtcag agacggaggt gtgtgtgtgg gtgtgaccct 34860gcgcagttcc ctgccggctg tagttttttg cattcgctta atgtttctcg tggaggaaat 34920tgtgcatgag caaatgtgaa accgtgctgt gctcaaattg tcctaataca tcattgcatt 34980ggaacagatt ggcttttttt tttttttttt tttttttttt tttttgagat ggagtctcac 35040tctgtcacca gcctggagtg cagtggcatg atcttggctc actgcaacct ttgcctccta 35100tgttcaagtg attttcctgc ctcagcctcc tgagtaactg ggattacagg catgagccac 35160cgcggccggc cagatttgca tttttgaaac aactgctagg ctgggcgcgg tggctcacac 35220ctgtaatccc agcactgtgg gaggccgagg caggtggatc acctgaggtc aggggttcga 35280gaccagcctg gccaacatgg tgaaaccccg tctctactga atatacaaaa atcagctggg 35340tgtggtggcg ggtgcctgta atcccagcta ctcaggaggc tgaggcagga gaattgcttg 35400aacccaggag gcagaggttg cggtgagccg agatcacacc attgcactcc agcctgggca 35460acaagagcaa aactccatct caaaaaataa aaaatagaaa aacaagtgct gtagcggaag 35520tgagcacttt gcggagtcag gcttgtgtgg cctgttccac aaatgatgtg ctcacggtgg 35580cctcaggccc acctggagtc tgcagcatgg ggcacaacag gttcattagt gtagaattcc 35640aggacaggcc tggctcctaa gcagccttct tttacaaaaa ctgcagagcc cgcctgtatc 35700ctagcacttt gggaggccga agtgggtgga tcacgaggtc aggagttcaa gaccagcctg 35760gccaacatgg tgaaacccca tctctactaa atatacgaaa attagctggg tgtggtggca 35820cgcgcctgta gtcccagcta ctcgggaggc tgaggcagaa ttgcttgaac ctgggaggtg 35880gaggttgcag ggatctgaga ccatgtcatt gcactccagc ctgggcaaca gagcgagacg 35940ccatctcaaa aaaaaaaaac ctacagagcc acacggcctc tttctccacc gagtgttggt 36000gtgggagctt gtgttattgt ggtgaaatct tggtactttc ttgaggcaga gagaggctga 36060gcgcctggag agactttcac atgggtcgcc atgtccgccg tcggtttcgc tgttgtgctc 36120cccatctgaa ggctggtgcc gtccagacag gctggacgcc cctttccacc agatccttcc 36180tcccgcagca gtttctagtt acgttgtact gtgaggtctg tgtccttggt tgatggcaaa 36240agtcagccga attgaaattc agagccatgc ctggctccct ggagcttctc tcctgggcag 36300ctgtgatcat tgcctctgct gtggtgtggg tggtggaaat ggattccttt catcttgctt 36360gctacaggtg actgtcacgt ggagtccttt ggagagaggg acgtgttaat tgatggatgt 36420ggctcccatg ctgagaaagc tcctgggcgt acattgcctt agagtttcat tggagctgcg 36480ttcttttatg gtgtctgcta ggcagaagtg atgaagactt ggaagaaaac ccagaaggtt 36540ttccacttaa tttggaaaat gtgcttttcc cctcctgtgt cttttgctaa ggtccagcct 36600cctgcagcct ccccgctctg tggactctgg ctttgattct ttattaggag tccccctgct 36660cccccaaaag atggtgtcta aattatcatc caattggccg aggttttgtt ttctattaat 36720tgtttttatt ttttattgtg gtaaatttat ataacataaa atttgccatt ttaattgttt 36780tgttattgtt gtttttgaga cagggtctca ccccagtgcc caggctggag tgcagtggtg 36840cgatcatggc tcactgcagc ctcagcctcc agggctccag tgatcctctc acctcagcct 36900ctctagtagc cgggactaca ggcatacact accacatctg gctgattttt tgtatttttt 36960ttttattgta gagacccgct atgttgccca ggctggtctc aactcctgga ctcaagccat 37020cctcccacct caccctccca aagtgctggg attacaggca tgagccacaa cacccagcca 37080ttttaatttt tttttttttt tttgagatgg agtctcactc tatcgcccag gctggagtgc 37140agtggcgtgg tatcaactca ctgcaacctc tgcctcccag gttcaagcga ctctcctgcc 37200tcagcctcct cccgagtagc tgggattaca ggtgcccatc actatgcctg gctaattttt 37260gtatttttta gcagagacgg ggtttcacca tgttggccag gctggtcttg aactcctaac 37320ctggtgatcc gcccgcctcg gcctcccaaa atgctgagat tacaggtgtg agccaccgtg 37380cccggccttt ttttgttttt gagacagggt cttgccctgt cacccagact ggagtgcaat 37440ggtgggctct tggctcactg cagcctccgc ctcccaggct caagttgtgc acctccacac 37500ctggctaact gtattttatg tagagacaga tttcaccatg ttgcccaggc tgggcttgaa 37560atggactcaa gcagtccacc cacctcagcc tcccaaagtg ctgagattac aggcgcgagc 37620caccgcaccc agcccatttt acctattctg cagttgacag ttcagtggca ttcagtcagt 37680tcacgaggta accatcactg ccattcatct ccagactact tcaccttctc ggcagatgtc 37740cgaaactgtc cgcattgaac acactcctca tctccctctg acagccacca ttctactttg 37800tatctctctc tgccttctct aggtacctca tgtaagtgga attataccaa tatttgccct 37860tgtgtgactg gcttctttca tgtgacatgg tgtcctcaag gttcatctgt gttatagcct 37920gtgtcagaat ttccttcctt aaagcctgaa taataacccg ttgtaaaggc tgggcgcggt 37980ggctcacacc ctctaatccc agcattttgg gagtccgagg tgggcagatc acttgaggtc 38040aggagtttga gaccagcctg gccaacatag tgaaaccctg gctctactaa aagtacaaaa 38100ttagctgggt gtggtggcgc gcacctgtaa tcccagttac tcaggaggct gaggcaggag 38160aatcgcttgt acccgggagg cagaggttgc agtgaaccaa gattgtgcct ctgcagtcca 38220gcctgggtaa cagagtgaga cttcctgtct caaaaaaaaa aaaaatcatc ggatggatgg 38280acggaccact tcttgttatt tatccatcca cgggtgctag gtttcttcca cctttggttg 38340tcgtgaataa ggccactatg aacatttcct tccgtggtga aggttttgta ctagtgagga 38400aaaggcgtgt ttgtggtgtt gcataggatt ctggtaagaa agtttgcact aaccataagt 38460atttgtacta cattaaaatg aaagctcagg ggccgggcgc ggtggctcac gcctgtaatc 38520ccagcacttt gggaggccag ggcgggcgga tcatgaggtc aggagatcaa gaccatcctg 38580gccaacatgg tgaaaccccg tctctactaa aaataccaaa aaactagcca ggtgtggtgg 38640cgggcacctg tagtcccagc tacttgggag gctgaggcag gagaatggcg tgaacccggg 38700aggcggagct tgcggtgagc cgagatcgct tcactgcact cgagcctggg caacagagca 38760agactccgtc tcacgcaaaa ctctgtctca cgcaagactc cgtctcaaaa aaaaaaagag 38820ttcagggttt atgaaactgg ccagccgcgt aaagtttgct gtgttgtttt tgtgcccggg 38880aggagtgtgg ccagggtgtc acgtcacaca gtacacgttt ctcagatggt ggttctccag 38940actgctgtcc caaagtctgt ttttgcatct ggttcccaca gacccaccct ccacggtgag 39000cctgattttg gccagggtag ctggaatctt gcttgtcttt cagcccggca gctgtaccag 39060tccagggtcc acagctagtg gcttttagga aggaatttgt tcagttggct ttgacacatg 39120gccccctagg gtccacagct ctgtagtgat gtggatgttg ttatctacaa agacacatga 39180tccttcgtgt ccagatgaaa gtgatgatgt ctttgcagct gcccagcaag gctgtgtgtg 39240tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tggtgtgtgt gtggtgtgtg tgtgtgtatg 39300ggggagggag gcaccctttc catctggggg tgtgtgtgtg tggggtgtgt gtgtgtgtgt 39360gcgcgtgtgt gtggtgtgtg gtgtgtgtgt gtgtatgggg gaggcaccct ttccatctgg 39420gtccaagaga ctgggcctgg ggaagacgct tctttttatc tacttagaga ctttgtttta 39480tttgtatttt tttgagacag ggtctcactc tgtcacccag gctggggtat ggtgatatga 39540gcatagctca ctgcagcctc ggcctcccag gctgaagcga tcctcccacc tcagccttct 39600gaatagctgg gactgtaggc gtgcgtcacc atactgagct attgtttttt ttgtttggtt 39660ggtttaattt tttttgatac agatggagtc ttgctatgtt gcccagacta gtctcaaact 39720cctgaactca agtgattctc ccacctcagt ttcccgacat tctgggatca caggtgtgag 39780ccactgctgt ctccctgttt tattaactgc tgaaagacct agataaagaa agtctgaaaa 39840gacttactat cagagcacca tcctaagatg attccctctg actcaatgga gagggagggg 39900agcttttcct tcaggcctgg gtggcaggag cccaggtgct ccaggcccca tttgccccag 39960gccaaatcac tcgggaactt ggatgcagct gtctttcagg gtaacccaaa ggaaccagat 40020ccccgcaggc agtaggcttc tgggctgtcc tctcctccta cgtcagctca gtaagagccc 40080ttcgaaggga tgctgtgtcg gaggccccaa aagcccaggc tcatccctga gatgcacagg 40140gtgggctggg cttaggcagc gctcgagcat ctcctggacg gtgaccccag agagtgtgga 40200gacggagagt ccttgagagt cactgagaga cgtggctgcc ctgccttccc aagaggggct 40260ctgagtcatt ccccacactc acctgcccct acccaccctc acctggcccc cagcctcacc 40320tacccccaca tctgtaccga tccctttacc cgcaccttcc ctacccaccc tcacctcccc 40380tgtaccttca cctcccccac tcacccgccc ctgcaccctc acctgtcccc caccttcacc 40440taacccccac cctcacctgc cctcccctca cctggcctcc ttccgttggg gaaggggttg 40500taaggggcgg cccccaaact gtctgtcctg gtgccctgca gagaaaacag tacgtgaggg 40560ccgcagtcca aaagcttgag tcctggaagg tggaggagac agggatgtgt tgggaagggc 40620cccatggtct tggatccctt ctcgactgtc aatggggcct tcatgggagc gccagtctag 40680tgatgcacag ctgggtgccc ggcgggtggc tgaggaggcc taaagtccga ggcggcaaga 40740gctcttccag aggctgttgt cctaatcgct ctggcatact caggcgggca cgtagttagg 40800agctgattgg agaggagaga cccccacacc aatactggga tttgactttc aggctaaact 40860tgagaagtgt ggcctctgct gtcctgccag agctctccag ccagtgccca gggctctcca 40920gccagtgccc gggggtctcc accagtgccc gggggtctcc gccagtgcca ggggtctccg 40980ccagtgccca ggggtctccg ccagtgctca ggagtcttgg tttctttgtc ttacagccct 41040ttgttttgac ctctctgagc caaggccaaa acccagacag gcagccccac gacctcagca 41100tcgacatcta cagccggaca ctgttctgga cgtgcgaggc caccaatacc atcaacgtcc 41160acaggctgag cggggaagcc atgggggtgg tgctgcgtgg ggaccgcgac aagcccaggg 41220ccatcgtcgt caacgcggag cgagggtagg aggccaacgg gtgggtgggg gtgctgcccg 41280tccaggcgtg cccgccgtgt cttatgccga atgccagcct ctcacaggct ggggagactt 41340tccacctggg gatccaatgg gtggctttcc agggtcccaa aagcaaacac aggtttttca 41400cagcccgtcc gggaaagcag aaagccccaa ggggctggaa ggggaaaggg ggagctctgc 41460tgagaggtta caaggcagcg ctggccgacg ggagttgcag ttgataggtt ttgtatcatc 41520cttgttaaac ttgaaccctg tgcagaaatc ccttccacgg catgggggct gcctgttgac 41580tcgctcctgt tccaccacag ggagctcctg ggcttcttcc tcccagaggc ccccgacgct 41640cccacctgtt ggtcgtcaga gcttctggtt ggtgggaagg cacccaggac cttgaggtct 41700ccagagagaa aagccaggga aagagggaga ccgaaaccca tgtgacatga aactcaggct 41760ccaaactgag cacgggaacg tttggggaca ggagcgcgat ggccttcctc agatagctgg 41820ggggctggca tgaagacggg agctacagcc agcacaggtc ctgggccggg agcccagaga 41880ttgagccctg actctgtcac ttactggcca cgtgaccttg ggcgggtggc atagcctctt 41940ggagactcag tttcctcatt ggtaggagtg acggccacag tggtgcggcc tctgcagcac 42000acggggggct cggtgggcgg aagccccggg tctataaggc ggctgtgcag gagccagccg 42060agctggtctc ccaacagcca gggctccggg gtccttagca gctgtggggg gcctgcacct 42120gtttcccatg gctgctgtca gaaattacca gaagccaggt ggctgagagt aatggacact 42180tgttctctca cagttcctga gggctgaagc ccgagatcga ggtgtgggca gggccctgcg 42240ccctctgaag gctctgaggg aacctttggg cttctggtgg ctccaggcac cccttgactt 42300gtggtcctgt cactccagtc tctctgtctg gctgcacatg gcgtggcctc ttctgtacca 42360ttgaaggaca cttcagttgg atttagggcc taccctcacc cattgtggtc gtatcttgat 42420ccttcatgac atttgtaaag accctgcttc caaataagct cacattctga ggttctgggg 42480tgagcgggaa tttggagagc attgttcaac tagtatagaa tgtgacctgt cagcctcggg 42540cagccctgag aggcaggggc tttccacagc ccagctgggt gccctgggct ccgtgctgtc 42600cgaggagacg ccatccccac acccgtcctt cacccgccac cctcccgcag gtacctgtac 42660ttcaccaaca tgcaggaccg ggcagccaag atcgaacgcg cagccctgga cggcaccgag 42720cgcgaggtcc tcttcaccac cggcctcatc cgccctgtgg ccctggtggt agacaacaca 42780ctgggcaagc tgttctgggt ggacgcggac ctgaagcgca ttgagagctg tgacctgtca 42840ggtacgcgcc ccggggcctg ccctaaccgc agacacccgg ccttcattgt cagtaatggc 42900agcagctgcc acattgtccg agacctgccg tgagcccagt gccgcgccag gggctttgtg 42960tgtagcgtgt tttgtcctca cactgacagc tgtaggctgg ggttctgagt gagccccaca 43020gggcagaggc agaaaatgag tctcagagag ggtgagcgag ctgcttgggg ccccacagca 43080ggagatggag caggactgca gcctagcctc tgcccccagc acctgcgcaa gaagctgctc 43140tgctctggac tgtgttaggc tgcgagggct ggagagaaat gagagttggt gcttagagag 43200ggggcgcagg tccccatggc ttttcctctt atgatgaggt agatgggtga agggaggggc 43260catgcttgca ggggccagtg accgaggccc gccgttggaa ctgatggcct tcatcccgag 43320cccagcccag gtgggagcag ggctttccga gggcttgtct tgggtcggcc tgcttccagg 43380gactctgctg cagctcccac ccctgtccaa agcatggaat cccccaggct ccctggcagt 43440cctgtcaacc tctgtcctcc caagctgagt gtggggcaag ttctggaggt cagcactgct 43500caggggggcc cacgggctgc ttgcaggggc caaccgcctg accctggagg acgccaacat 43560cgtgcagcct ctgggcctga ccatccttgg caagcatctc tactggatcg accgccagca 43620gcagatgatc gagcgtgtgg agaagaccac cggggacaag cggactcgca tccagggccg 43680tgtcgcccac ctcactggca tccatgcagt ggaggaagtc agcctggagg agttctgtac 43740gtgggggctg gcagtggggt gggcagggtg gcctctaaac ccgacccctg gaggaggctg 43800gaggccagtg caagatcctg tgtggcctca gccaggcggt ggtctctgcc agatgccaac 43860tgttgcccgc tggggttcag cgacatgtcc gaatgtcccg aggcctctga ggttgttttc 43920ttttgccgca gaacaaatca ccacgaacag cgttttaaga caacaccaac tctttttttt 43980tttttttttt tgagtcagga tcttgctctg ttgcccaggc tggggtgccc tggtgcaaac 44040acagttcact gcagcctcga cctctgggct taattaagtg aacaccttgc ctcagcctcc 44100caggtagctg ggactacagg tgggcaccac cacacctggc taattttttt ttgtagagac 44160ggggtttccc catgttgccc aggctggtct gcaactcctg ggcacaagct atctgcctgc 44220tgtggcctcc caaagtgcta ggattatagg tgtgagccac tggcctgaca acacccacgg 44280attgtctctc agttctgtaa ggcaaagtcc aggcacagcg tggctcacct gggttctctg 44340ctcagggtct cacggggcca gaatcaaggt gtcaggaacg ctgggccctc agcggaggct 44400ctgtggagaa attagcttcc ttgctcactc agcaggtagc agttgtggga tcgaggttct 44460gttttctctc tggttattgg tcggggacca ctctcagctc ctagaggcca ccacaggtcc 44520ttgccccgtg gccctctctg cctcagcagt gggggctccc tgcgtcagtc cctcccacac 44580cttgagtctc tctgatttgc ttctaaaggg ccctgtgatt cggctcagcc acctttagat 44640taggttagcc tcccctttga tagactccaa gtcggctgat taataacctt aatcacatct 44700gcagaatccc ttctgccaca taaggtcatg acgccgtgct ggggactggg gtgggaaatt 44760acggggtcat ttaggattct gcctgccact gccttgctgt gtcccagggc ttgggggagg 44820ggcctccaca gctgggacca cagtccttcc tcccctccat ggtaaccatc tgaggattac 44880ttgagaccag cctgggcaac atggtgagaa cccatcccta caaaaaatac aaacaaaaag 44940ggaccaggct gggcttggtg gctcatgcct ataatcccag cactttggga gaccaaggtg 45000ggctgatcac ttgaggttgg gagttcgaga ccagcctgcc caacatagtg aaatcccgtc 45060tctactaaaa atacaaaaat tagctgggtg tggtggcagg cgcctgtatt cccagctact 45120ggggaggctg aggtgggaga attacttgaa cctgggaggc ggaagttgca gtgagccaaa 45180attacgccac tgcactccag cctaggcaat agagtgagac tccgtctcaa aaaaaaaaaa 45240gggccagggg tggtagtgac aaagagaccc tatcccaaaa aaaccgaaca ctgaatcctt 45300gagactgagt aaggacactg tgaaattttt ctgggtgggg cagggaacag agcgtcttct 45360gtcatttctt ccacctgggt gtggtcagct ctccctccaa gctgcctcct cttcttctca 45420ttgtccgggt gttggacaca tttggttaac tggatagaat aacgcgagtt cccagggact 45480tggtccattt gctattttat tttattttat tttattttat tttatttatt tatttattta 45540tttatttatt tattgagatg gagtttcgtt tttgtcgccc aggctggagt gcagtggcgc 45600gatctcggtt cactgcaacc tctgcctccc aggttcaagt gattctccta cctcagcctt 45660ccaagtaact gggattacag gcacccacca ccataccagg ctaatttttt tgtattttta 45720gtagagacgg gttttcgcca ttttgcccag gctggtcttc aactcctagc ctcaggtgat 45780ccacgcacct cggcctccca aagtgctggg attacaggca tgagccacca cgcctggcac 45840catttgctat tttaattccc atgtgtatta gtgtcccacg gctgctgtaa caaatgacca 45900caaactggat ggcttaaagc aacagaaatg gattccccca atgtgctgga gaccagaagc 45960ctgcgaccaa actgttggga gggctgtgct tcctctgggg gctccaggga ggatctattt 46020gttggccctt ccagtgctgt gggtgccagc gttccacact tgtggatgcg ccgcctcaac 46080ctctgcccat cttcatgtgt ccatctcctt tgtgtctgcg tctttacctc ttcttcttgt 46140ctgtgttgcc tcttataagg acgtttgtca ttgggtttag ggcccaccca aatcatccga 46200gatgacctcg tcttgagatc cttaacctgc aaagaccctt tttccaaaaa aaggttatgc 46260tcacagattc taggccttaa gacatgggtg tatctttctg gggggcacta tccaacccct 46320tatacaatga aagacgggaa gagggccagg tgtggtagtt cacgcctgta atctcagcac 46380tttaggaagc tgaagcggga ggatcacttg agcccaggag tttacaagta gctaggcaac 46440atgatgagac cccatttcta caaaaagtga aaaaaaaaaa aaaaaaaaaa aagccaggtg 46500tggtggctca cacctgtaat cccagcactt tgggaggctg aggcaggcag atcacgaggt 46560caggagattg agaccatcct ggctaacacg gtgaaacccc gtctctacta aaaatacaaa 46620aaattatggc cgggcgcagt ggctcccgcc tgtaatccca gcactttggg aggccgaggt 46680gggtgaatta caaggtcaag agatcgagac catcttggct aacacggtga aaccccatca 46740agatcacaag gtcaagagat ggagaccatc ctggctaaca cggtgaaacc ccgtctctac 46800taaaaataca aaaaattagc cgggcatggt agcgggcgcc tgtagtccca gctgctcggg 46860aggctgaggc aggagaatgg cgtgaacccg ggaggcggag cttgcggtga gccgagatcg 46920ctccatgcca ctgcactcca gcctgggtga cagagtgaga ctccgtctca aaaaaaaaaa 46980aaaaaaaaaa aaaaaaagaa aattagccag gcacagtggc aggtgcctat tgtcccagct 47040acttgggagg ctaaggcagg agaatggcat gaacccggga ggtggagttt gcagtgagcc 47100gagatcatgc cactgcgctc cagcctgggc gatagagcaa gactctgtct caaaaaaaaa 47160agccaggcat ggtggtgcat gcctgtagtc ccagctactc aagaggctga ggcaggaggg 47220ttgttcgacc cacggagatc aaggctacag tgagccatga tcgcaccact gccctccagc 47280ctgggtgaca gagtgtgacc ctgtctcaaa gtaagtaaat aggaggagag acaagtgggc 47340agttcagact gatggtatgg gcacagtaga gactggtgca gacaggctgg cctgtgatgt 47400caagcaactt ctgtaattgt ttccggcatc catttgtgtg tcaatttccg tgtcagtagg 47460aagactctgt aggctgccaa gaggaataag tgggaggatc ctcccagaga ggccgggcct 47520gcaggagggc cagttctcat gagttctcat ttggccccta ccctccaggc tgtggttctg 47580aggtgggaga cagagcctga cctctgtttg tcttgttttg tctttgcagc agcccaccca 47640tgtgcccgtg acaatggtgg ctgctcccac atctgtattg ccaagggtga tgggacacca 47700cggtgctcat gcccagtcca cctcgtgctc ctgcagaacc tgctgacctg tggaggtagg 47760tgtgacctag gtgctccttt ggggtgatgg acaggtacct gattctctgc ctgctaggct 47820gctgcctggc atccttttaa aatcacagtc cctgtggcat ccagtttcca aagctgattg 47880tgtcttcctt tgccctcctt tcttttctac tatgtgcatt cggtgctatg aattttcctc 47940taagtactgc gtttcctgca tctcacaaat tttgttacat tttcattttc aggtagtttg 48000aatattttta cacttctcct gagatgacat ctttggctca tgtgttattt agaagtgttg 48060cttagtttct aaagagttgg ggcttttcca gctgtctctc tgcaactgat ttctaattta 48120attctactgt agtctgagag cttattttat atgatttctg ttattttaaa tgtgttgggt 48180gtggtgtttt tgttgttatt gtttttgtgt ctttttgttt tgttttgctt cgtttgtttt 48240gtttttgaga cagtgtcttg ctctgtcact caggctggag tgcaatggcg cgatctcagc 48300tcaccgcaac ctctgcctcc cgggttcaag tgatcctctt gcctcagcct cctgagtagc 48360tgggattaca ggtgcacgcc accataccca gctaattttt gtatttttag tagagacggg 48420gtttcaccat gttggtcagg ctggtctcga actcctgacc tcgtgatccg cccacctcgg 48480cctcccaaag tgctgggatt ataggcgtga gccactgtgc ctggccatta ggtgtgtttt 48540atcacccagc atcatgcagt ttatcttggt gaatgttctg tgtactcttg aaaagaatgt 48600ggattctgct gttgttgggt ggagtgttcc agaaacatca attagatcca gttggttaat 48660agtgctcatc aggttgtctc tatccttcct tcctgactgc ctgcttgagc tgtcagttat 48720tgacaggggt gtggagtctc caactctaat ggtggatttg tttatttctc ctagtagttc 48780tatctttttc tctccttcta cccttgatcc tcttctcccc ctagggcttc ctggtgttag 48840tggtgggaga gtggggtagt gaagaacctg gactttaggg ccaaagaggc cagggttcaa 48900atcctggctc tgtcacttcc cagttgagtg accctggctg gtgcctgaat ctctgtgagc 48960ctccacttcc tcctctgtga aattgagagc acttacctgg caggctgtca tgggcatcaa 49020gtaacagggc actccacctg gaccctgaca cgtgatgcac aggaatgcca gctgctatgc 49080catgggtgtg gcagtagtaa taaagtgacc atctgtatcc tcaccacagt gaagcctgtc 49140cagggctttc tctcctatgc ccccatgcct ccaggtggcc ttggatcctg ttggttctgt 49200gctctgctca gcgacctttc tcccgtggga gttcctgggg gttcagcttc atcctacaga 49260cagcagcaca cactggctgt gcaccctttt tttttttttt tttttttttt tgagatggag 49320tctcgctttt ttcgcgcagg ctgaagtgca gtggtgtgat cttggctcac tgcaacctct 49380acctcctggg ttcaagtgat tttcctgcct caccctccca agtagctggg attacaggct 49440cccaccacca cgcccggcta atttttgtat tttcagtaga gatggtgttt caccatgttg 49500gccaggatgg tcttgaactc ctgacctcag gtgatccgcc cacctcagcc tcccaaagtg 49560cagggattac aggcgtgagc caccacaccc ggagtgccgg ttgtttttag cagtttgtct 49620tgttcctgga gagactggct cctgcccagg agctcgggga gtagggccgc ggggtgctgc 49680ctcacacctc gagtttggcc gtaagcagag gggacatttt gtgactgtcc ccctcctgag 49740cttcccagca gcttttctcc aagttacagc ccaaaagctc aggtggattt gcaacccaac 49800ggtgtctgtg cacctcccac tgatgcccga actgccctgg ccaagaaacg gggccgtcag 49860aacgctgcac taactgcagc cttgggcctc catgccagag gccatgccct tccatccacc 49920accccctggc ctgggccctg ggccctcctg gctcgggaac tccaggcccc ttcctcacgg 49980ctcgagagac gtgtatttac cgcacaggtg cttgtcattc tcttgtggcc tcttctccag 50040ggagatcaca gaaggacagg gcctcactga ggtctcggac atggaccctt tgatagtggc 50100aggagccagg ctgggcaaga ggcggccaca gtcacctcag cagtgccatc accaccgcca 50160ttcagccctt ccctgagccg ggcgcgcccc tggctctggc cccagtgtcc cagttacagc 50220tcacaggagc ttgtggtgcc cagcggctgc ttctgattga gagtcgaggt cggaggcttt 50280gggaggctga gaggctgctc ggtttcacaa ctgctgaggg agacttgggc tccatctcag 50340gtatgcccca tgtcgccctc aacctccagc caccggtcct ccgtgtcccc catggccagg 50400cacggcttgc agacatctgt cgttggctcc tctcagccgt cgtgggctga ccctggcacg 50460tcctcctgtg gctgagccca gtggggacag ctgcttcctt ttattaccct agaactctcg 50520tctttgatca ggccccctcc cctatgccac acagtccctg tcactcgggt gagcccagta 50580gtcatgggga aggcctgcgg gttccaaaca tccaaaggct tgcgtgcagc atgacagctt 50640gaaaccgatg ttttttacct tgatcagatt tcagcttggc gggggctttg ctcagctttc 50700agtgaggcct gggccgattt cccagcatcc cctcctgagg ccagcctctg tttcctgtga 50760ttttctgcac aaagtgggag ggaggagtcc taggaaatgg ggggccacct cgaagcctag 50820gcctcctctg gcttctctgt gccagtgccc ccacgctttg tgtctgtgtc cccagcccat 50880gggactctgc tattccctga gtgctgccgc atgcccagcc cgcactgagg acgtggagcc 50940ccgaggggca ggatggcctc catggtcaca cgtaggaagt ggcctccacc ctccgatgat 51000cctctccctc ctccctttca gcgccctccc cgggggtgtc ctcagccctc ctgcctgtgc 51060tttgtcccgt cttctgcagg cgcctgggac gtgctgacag gtcctctgcc ggctcctgcc 51120ttgctatgcg cacgctggtc accacagagg cctggccctt cttctgtagc agtcccacac 51180ccgcaacagg tgtggctgct gaccacctgc tttctgcccc tctggtcctg aggagggcgc 51240agtgggcact caggcgtggc tgagcagatg tgtgttgccg ggaggaggaa ggactgctcc 51300agtcagggct gaatttccca cccggagcat ttctgctgta tttggtgtag cgcctgctgc 51360ttaaagctct gattcccagt tggcaccctt tcccttctgc attgaaaaac atacggatgc 51420atgtcttctt gcagtgaatg tgtattctcc cagcctctct tctgggttgg ggctggaggt 51480ggagcggcac acaggagccg cagcgatgga ggatgtgcgg gtgcagcacc ccgtacagca 51540gggatgccaa acccgcgctg agtccctctc aacttctgct ttgaagccca gtcacgccat 51600tgcctgggtt ttgctgggcg gggctgcgtg tgatgttctc ctctgtccct cccccagagc 51660cgcccacctg ctccccggac cagtttgcat gtgccacagg ggagatcgac tgtatccccg 51720gggcctggcg ctgtgacggc tttcccgagt gcgatgacca gagcgacgag gagggctgcc 51780ccgtgtgctc cgccgcccag ttcccctgcg cgcggggtca gtgtgtggac ctgcgcctgc 51840gctgcgacgg cgaggcagac tgtcaggacc gctcagacga ggcggactgt gacggtgagg 51900ccctccccgt caaggctctg ccaagaccct ggccctgccc tccgggatac gagcttgggg 51960ctgcctccgg cctcacagga gtaggggctc tgaaaacctt tgcttgcagg gagattgcca 52020agtctgtctt ttaggcccaa caaggaaaac tctgcagttc cacccatcct gtcccaccag 52080gtagtgtggc ttgaaggcag actgtgaggg tctatctcac cttcctgcat taggtcagga 52140gtttcacaga aacctgaggc acattcaggg gtgggctgca gaggtccatg gctcacaccc 52200tggaaaatcc gcccccaaaa gacagtgctg tctccactga ccagtctgtg ggatagtgct 52260taagcctgag tggtttctat caacatgtag aatcaggagg tataaagaga tttgctcagg 52320catcctgggc cctctctgac cagcaggatc ttcctttaga tcttgacagt gaaacacatc 52380tcttctgtgc cccctgtgag ttttctttca ttcattcatt cattcattca ttcattcatt 52440cattcattcg agacagagtc ttgctctgtc acccaggctg gagtgccctg gtgtaatctc 52500ggctcactgc aacctctgcc tccagggttc aatcgattct cctgcctcag cctcccgagt 52560agctgggatg acaggtgcgc accaccatgc ctggctaatt tttgtatttt tagtagagac 52620agggtttcac catgttggcc aggctggtct cgaactcctg acctcaggtg atccgcccgc 52680ctcagcctcc caaagtgctg ggattacagg catgagccac cgcgcccggc ctgagttttc 52740cttttatgaa ggacctgctt ggttggttgc ctgccacatg ttgtcagcac catgggccca 52800ggactgctga ggagctgttg atgccctcgc tctcccagag ccaccggctc tgttagataa 52860ttcacatgca gtctggccac tgtcctacgt cctcattcac aaagagcaga catttcgtag 52920aagatgaggg cctgggagta acctccctgc atgtttttct ataaaggcat agtggttaag 52980tccttccagc tcattgacca ttggagaatt ttatggaggc tgtagactag gggctggtaa 53040actaagggcc caggggccaa atccagcctg ccacctactt ttgtaaataa agttttcttg 53100gtgcacagcc atgcccattc attcatttgc acaatgtctg tggctgcttt catgccaaaa 53160gcaagagaac tgagtggtta tgctggagac ctacggcctt caaagcccca gacctcacgt 53220ctggcccttg acagacagag cttccccagc cctgctgcgc atcctggccc agcatgtgct 53280gtgtgtgtga tttcagcttg caggagccgt ggttaggaat tgtccctgtg ttggtccatt 53340ttgcattgct atgaaggagc acctgaggcc gggtagatta tgaaggaaag aggtctgtct 53400ggctcatggt tctgtaggca gcaccagtat ggcacccgca tctgctcagc ttctagtgag 53460gtctcaggaa gctttgactc atggtgaaag tcgaagcggg agcaggtgca tcacatggtg 53520agagagggag caacggagag agagagagag cgcctctccc tcttgccctc accttgagag 53580gagatgccag gctcctttaa gtaaccagct cccatgtgaa ctcacagtga gagcccattt 53640gctactgcgg agagggcacc aggcatctgc tcccatgacc caaacactgc ccaccaggcc 53700ctacctccaa ccttggggtc atattttatt ctgttctatg ctatgctatg ctatgccatg 53760ccatgccatg ccatgctatt cctattctat tatttgagac agaatctcgc tctgttgccc 53820aggctggagt gcagtggcat gatcttggct cactgcaacc tccacctccc aggttcaagc 53880gattctcctg tatcagcctc ccgagtagct gggattacag gcacacacca ccacacccgg 53940ctaatttttg tattttcaat agagatgggg tttcaccatg ttggccaggc tggtctcaaa 54000ctcctggcct caagtgatcc acctacctcg gcctcccaaa gtgccatgat tacagatgtg 54060agtcactgcg cccagtgagg gtcacatttc cgttgagatt tggaggggca gacgttggag 54120ccatctgagc cccctcgtcc cgctctagct tctcctcccg tgtgccccgc ggtgctggtg 54180gcaggccctt acgccggttc tggctgcatg ctctgttcca gaagctttct tccctgcttg 54240gttaccagaa aatcatccca tccattacaa ggacagggtc cccttatctc ccattcccag 54300ggcaggacac cgggggcagg gcaggtgggg aactgagcaa gttctctggg ggcaggcgtg 54360gctatggctc cctctgggtg ggcgtctggg gaggggtgga ggcagccgtc agcgccctgg 54420cttgctcttc ctccctggcc agagactgtg gccttgtgct gctcccgtgt gggctgcctg 54480cacctccagt gggttgtgct ccctcccctc ccctcccctc aagctctgct gagcaccact 54540gccttccaca gcccccactc tcgggaggcg aggctcctcg tggccattcc tgtccttggc 54600acccaccccc ccaccaacct ggtagagcct tgggcggggt ctgttactcc ttgcatggcg 54660tagacctccc cacagtaggc acctgacaca tacctcctgg ggggcaggca ggaggtgcgt 54720tgaggtctca gccctggcag tccctcccct gcgtggcata ggcctcgcca cagggtcatc 54780gagggtgggt ggagactgta ctagaccact ccccgctggt cctagaaagg gtcccatctg 54840tctgctctct gtttggagtc cagaccttgg ttgctgtgcc ctgcatggtg ggctgggggg 54900caccctccag cctctctgag tgcatggcct ctccttgcag ccatctgcct gcccaaccag 54960ttccggtgtg cgagcggcca gtgtgtcctc atcaaacagc agtgcgactc cttccccgac 55020tgtatcgacg gctccgacga gctcatgtgt ggtgagccag cttctggcac ggggaagggg 55080cgtccgggct gggttccccc aggaacgtgg agtttagggg aggagacgtg cctttccagc 55140ggggctgggg gctgtgtggg agactcaggc ggctgggagg ctccttgcgg gaggcaggga 55200agcctttccc agggcagcgg ccaggaggac agactgtgag ctgtgggctc ggcggctaca 55260gagtctgcct cagtgggcgg ggctgatggt gtccaggtgc ctgcagcacg cacccaccca 55320cgggaccttg ctgagcagcg tctgtcaggc agcaagatta cccgagggct gcagtggtcc 55380tgttccctgg cagcttactg tctggctgag gaggagtgat gttcacatat gcacacatgt 55440catgtgcaca cacatgtaca tgacaacatc ccacatgctc ctcaaatagc atgacctgta 55500cagtcacgga tatagggcct aggggatagg aggccaagac agtcagggaa gactttccag 55560aggcagtggc tcctgaaagg ctgtctgatt caggcaggaa gggagctgag ttcagatagg 55620aagtagcaat gagtcattgt gtctggggac atggccactc cttcgctgca gagggacctg 55680ggctgagagc tcctctctta tggctgcagt cgggagagaa gtctgttggg gggagaaggg 55740ggcttcctca agggactccc tgtgcccttt ggcaccttcg tgccaggtca ggcttgaggc 55800ctgaaggcag tggtgggggc caccaagggt cgcctcctct gctgggcaag ttcccagtct 55860gacgggcctg tgccgtgggc cccagctgtg ggggcgctgt tgatgcgcag ccaggcctcg 55920ccgccagagc ccgcacgctt ccattccgct gacttcatcg acgccctcag gatcgctggg 55980ccggccctgt gggagagtga atgtggcttt tgccaaagtt gagtctggag cctggaaact 56040tccctatggg cagccttgat agtggagtgg cccaaggagc ccacccagcc gaccctgccc 56100ctcccgtggc tggtgggcgg caccaggggc tgcctggctt tgctcgttca ccaacatcac 56160ctgggctggc cagggcgcgc tcacttctgc caccaccgag ggccctgggc gaaggagtga 56220ataccaggct gccttggcag ggatgtgttg agggctgtgg ggagtcggac agcggcgggg 56280gtcagaggag gaggagggtg caccgtgcag gctgaagggc cacgttaccc tgaggttggc 56340caggctcccc aggcctagcc tcccagctcc cccactttct ccccaccctc caccagtggc 56400aaagccagcc ccttcagggc gcacggtgtc tgcccccaag gagggcccat tccgttgggg 56460ttaatgttgg ccacctcttt ctgtttgtct ctggcagaaa tcaccaagcc gccctcagac 56520gacagcccgg cccacagcag tgccatcggg cccgtcattg gcatcatcct ctctctcttc 56580gtcatgggtg gtgtctattt tgtgtgccag cgcgtggtgt gccagcgcta tgcgggggcc 56640aacgggccct tcccgcacga gtatgtcagc gggaccccgc acgtgcccct caatttcata 56700gccccgggcg gttcccagca tggccccttc acaggtaagg agcctgagat atggaatgat 56760ctggaggagg caggagagta gtctgggcag ctttggggag tggagcaggg atgtgctacc 56820ccaggccctc ttgcacatgt ggcagacatt gctaatcgat cacagcattc agcctttccc 56880actgagcctg tgcttggcat cagaatcctt caacacagag gcctgcatgg ctgtagcaac 56940ccaccctttg gcactgtagg tgtggagaaa gctccttgga cttgaccttc atattctagt 57000aggacatgtg ctgtgttgtc cacaaatcct catgtaccct agaaatgaat gtgggggcgg 57060ctgggctctc tccagagctg aaggaatcac tctgtaccat acagcagctt tgtcttgagt 57120gcagctggga tttgtggctg agcagttaca attcctacgt ggcccaggca ccaggaacgc 57180aggctgtgtt tgtagatggc tgggcagccg caccgcagag ctgcaccatg ctggtttgta 57240tcacatgggt gaccatggta tgtctaagaa ggtggagtcc ctgtgaggtc tgcaggtgcc 57300cccacagctc caggccacct tgaggattgc ctctgcctgc ccagccctga gttccctctc 57360ccctgtcctg tcccactgtc accccaagcc ggcctcattg ggagcctgtt ggatggcagg 57420gtatagatgt aacctgattc tctctgggga gcggggttat ctggcttctc aagagctcct 57480aggagcccac agtggtggca ccatcacagt cgcagcagcc cccagagaac gcggccctgt 57540ctgttcctgg cgtgctctgt gctgccccgc ctgggttccc tgccccagtc gcaggcccct 57600tggaggaggt accatgtgtc tcccgtttca cagatgagcc ccggggagct cactctagta 57660gtggccagag aggcctgcgg ctcagggagc ggggcacatt tccaacagga cacaccgccc 57720tggtctgagt ctcgtgggta gtgggagcag aggagagcgc cctatgtctg tggggcggct 57780tggctgagcc tggaagccac ctgacctccc ccgtcccttc cctgccaggc atcgcatgcg 57840gaaagtccat gatgagctcc gtgagcctga tggggggccg gggcggggtg cccctctacg 57900accggaacca cgtcacaggg gcctcgtcca gcagctcgtc cagcacgaag gccacgctgt 57960acccgccggt gaggggcggg gccggggagg ggcggggcgg gatggggctg tgggcccctc 58020ccaccgtcag tgctggccac cggaggcttc ccgggttcct gggggctgtg ccaccgcctc 58080tgaggcatgc ttgctttctt cccttttcaa acccttctgc ttccttcttt aatgacattg 58140ttgattgtgg ataatctgaa aactacacaa aaatataaag agccaaaatc tcacccaaat 58200ccacctccta gagtggctgt tgggctccgt cagcatccag gcggccgtct gtgttccgca 58260cggcccagcc catcgatagc cgcctgcacc aggcctgtct gccctctgtg agcctcccca 58320cagggttccc tccacaaaca ccctgttctc ccacccaggg ctggctgctt cctggaaaac 58380agctggatgg ttttgtgcat gacagacaaa cacagggtga ttttcgtggc taaaatactc 58440cctggagctt ttggcagggt gaggggctgg ctccagctga gccacgcctt gagtgaaatg 58500actgtgagga gaataaactg ccgctgccct ccaggatcac tggggctggc tggggagaac 58560ccccgtttct gggagcacag tcccaggatg ccaaggcgag cttggtgccg agatgtgaac 58620tcctgagtgt aaacagcggg ggctgacttg acatgctttg tatgcttttc atttgttcct 58680gcagctgtat gcccctaagg tgagtccagc ccccttctgc ttcctctggg gcctcgccag 58740tgagccccac cttgctgggg ctggttcctc ctgcccttct gggtatccct cacatctggg 58800gtcttgtctt cttgttttct ttttcttttt tttttgagac ggagtttcac ttttgttgcc 58860caggcttcag tgcaatggtg tgatctctag gctcaccgca acctctgcct cccaggttca 58920agcagttccc ctgcctcagc ctccctagta gctgggatta caggcatgtg ccaccacgcc 58980cagctaattt tgtattttta gtagagatgg ggtttctcca tgttggtcag gctgatcttg 59040aactccctac ctcaggtgat ccgcccacct tggcctccca aagtgctggg attacaggcg 59100tgagccaccg cacctggcct ttttcttttc ttttcttttc ttttttctga gacagggtct 59160cgctctgtca cccaggctgg agtgcaatgg tgtcatcatg gctaactgca gcctctacct 59220tctaggctca agcaatcctc ccatctcagc ccctaagtag ctaggactgc acgcatgcat 59280ccccatgccc agctaatatt tacatttttt gtagagatga agtttcacta tattgcccag 59340gctggtctcc aactcctgga ctcgagcgat cctcctgcct cggcctcccc aggtgctggg 59400attacaggcg tgagccaccg tgcctggcct ggggtattgt cttcttatgg cacctgactg 59460tggtgggccc tgggaaggaa gtagcagaag agggttcttc ttggtttcct ggacagtaac 59520tgagtgttct ggaggcccca gggcctggct ttgtttaggg acaaagggaa ctggtaacca 59580gaagccgaga gtttaaacac ccactgccct tcttccctgc tcctgctgct gcaacccagc 59640ttaaccagcc aggagtgcta ggaacccaag cagggccccc gagcacacag caggcagctc 59700acgaattctc ttttcctgtt ctcccttggg agctgggagg atcttaatca ggcaataaga 59760gatggcactg agcagccagc taatttttta aatcacttta ttgtttaacc atatgactca 59820cccacttaaa aaagggtaca gttcagtggg ttttagtgta ttcacagatg tgtgcaaccc 59880tcaccacagt taattttaga acattttcct gcccctaaaa gaaactctgc atgaagccag 59940ctgtttttaa attagcaaag ttattttgca tcctttaaat atatgttcat ggtacaaaat 60000tcaaaagata cagaagagtc tgcagtccaa agagactccg cccccatgac gccaagcagg 60060catccctggg aggcatggcc tcctgcagtg tgtttcttct atgtcccccc aggggtcatc 60120tgtacatatg caagcataca agagcgtgga ctttgttttc caagccagaa gataattgta 60180gatttatgtg cagttgtgag aaagagcaca gacccattta tcctctgcct ggtttccccc 60240agtgctgcct gccatcttgc atgacttcca ttcctatcat aagcaagaca ctgataacga 60300ttctttcacc ttattcagat tgacataagt gttttttgtt tgttcttgag acaaacttcc 60360tctgtcaccc agtgggagtg cagtggcaca atcacagctc actgcagcct caaactcctg 60420ggctcaagcg attctcctgc ctcagtcccc tcaagtagct cagatggcag gtgtgcacca 60480tcatgccagg ctaattttta aattttttgt ggaggtgagg cctcactaaa tttcctgggc 60540tagtcttgaa ctcctgagct aaagtgatcc tcctgcctca gcctcccaaa gtggtaggat 60600tacaggcatg agccactgcg cctgggctga catatgtgtt ttcgtaagcc cgaaagatag 60660catctgaaga gtcaacattg agccttgcct tttgctgcta acgatgtata aaagctgctg 60720ttctgagcat ttcggaggct cccagctgcc gtgtgcaccc tgcctagagc tctaccgtaa 60780cccatctccg ggaggaggtg ctattgtttt cctcattttg caacaaggag gctgaagaac 60840tgagcatgaa ccactggcct gggtcgttcg gttggtaggc agtggggcca ggccatccaa 60900ctcacaacca ccttctactc tgcttccccc gcaccctgaa gtttgttctg ttttgaggac 60960acagccgtca cattcttggt ggctgaacag cactccttgt caggcgtggc tgggccccca 61020ctggagggca tcatggtcct ctctcctgct gcggttgaac cttggctgtt tcaaccactc 61080ctgccaagtg gccctctgaa agggacagtc catcttttct cagcagaggg ccacactggc 61140aaaacggtcc ctggcaccct ttctctccac ctgtctaata tagagtaaaa atggtatcat 61200gttaagatct tcatttatat ttattttatc atgaatgatg taagcatcat tttgtgtgtt 61260taagaacctt tgggcccagc gtgatggctt gcagctgtaa tctcagcact ttaggaggct 61320gagatgagcg gatcacttga ggccgggagt ttgagaccag cctggccaac atggagaaac 61380cccgtctcta gtaaaaattt aaaaattagc cgggtatggt gatcccagct acttgggagt 61440ctgaagcatg agaattgctt gaacatggga ggcggaggtt gcagtgagcc gagatcgcgc 61500cattgcactc cagcctgggc gacagagcga gactctgtct caaaaaaaaa aaaaaaaaag 61560aaaagaaaag aaattatcaa tctcctcttt tatggcatat atatatatat atatatatat 61620atatatatat atatatattt tttttttttg gttatgttca gaaaggcctt ccctgctctg 61680atcataaaaa acaacttatt ttcacactct ctctcttttt tttttgagac agagttttgc 61740tcctgttgcc caggctggag tgcagtggcg caatctcagc tcactgtaac ctccgcctcc 61800cgggttggag tgattctcct gccttacctt cccgagtagc tgggattata ggcatgcacc 61860accatgcctg gctaattttg tacttttagt agagacgggg gtttctccat gttggtcagg 61920ctggtctcga actcgcgacc tcaggtgatc cacccacctc ggcctcccaa agtgctggga 61980ttacagacgt gagccaccat gcccagccca cactctcttt cttaacgtcc tcctcctttc 62040gttttacgtt cacatcttta attcttctgg gatgtaatta gatttgatga gcaaggtggg 62100catccagctt gtttcttggc tgatggctta tgggtggcgt gaattagtcg gggtctatca 62160ggaggcagaa actctatgag aatttgaaca gagaaagttc cgtctacagg cttattacca 62220gggactggaa tagcagaaat tgaacagtga gatgtacaga gaactctaag aatgcaggaa 62280taggccaggc atggtggctc acacctgtca tcccagcact ttgggagacc aaggcgggtg 62340gatcacctga ggtcaggagt tcgagaccag cctggccaac atagtgaaac cccatctcta 62400ctaaaaatac aaaaaaatta gctgggtgtg gtggcgcatg cctgtaatcc cagctactcg 62460ggaggctgag gcaggagaat cacttgaacc tgggaggcag aggttgcagt gagccgagat 62520catgccactg tactccagcc tgggtggaag agcggaactc tgtctgaaaa aaaaaaaaaa 62580aacaagaagt tcaacttgaa gggaaaaatg ccgtattgtc tttccctttg ttatgtcacc 62640agggcacagt ccatcccagg ctggcgctga tccacgggct ggagaggggc tgccccagaa 62700gaggacatgc caggaagggc ttggctggtg ttcaggagcc caggccaggt caggtcaaga 62760ggtgttgagg ctggacggga gaggccagct aggggctcat gtaggatatg aggggtcggc 62820ccatttcaac gtggaaactg agctcttctg cttctctttc ttcttcactg cattaagatt 62880caataccgct tgggaagcag gtatttccct tcctataaag gatggttggg agcctgagtg 62940ttgggagaaa gtgtagccgc tgagttacta acaactaggg ctgccgtcaa gcctatgggg 63000aaagagagaa gaggacattt ggaaggagag agatcaagct gtggcaccct gggagaggac 63060cacagaaaag aggccagtga gggggttccc cggtggcatc tgaaggtgtg gcccaaccag 63120gaggtccaga ggctgccagc cgagtggccc aggagaggga acctcacagg ggctgagtgg 63180gacccaagcc ctatccaccg tcctaaccac ccacatttct cgggaacaag acctcccaca 63240gtggcctccc cggcagtgga aatagccaaa ctggcaacat ggactttctt caactgcccg 63300ggcgatgctg cctcagtgcc ccagggcagg caggaagctc ccacacccat tctggaatga 63360ggggttggag gaaggctgag ctgagcaaag gacccatctc tgctctggtt ggtggggagg 63420gagcccatta tacaagagac ccctcagggc tcagtgaggg gtgacagaga cttggggagt 63480agtggctgtc actgcagagg tgagagggtt tggagagaag gtacatgcct ttttggccac 63540attgagtagc acctggtagc cagttagtaa cgtgtattgg ataaacaaaa gattaaacgg 63600atgcaaaaaa aaatgttggc tttgcttctt tttacccaaa cctcagttcc ctcaagtaga 63660ttctgggaac accccctacc tggctggact gttgtgaagt ttaaataagc caggttaact 63720tcacctcctc ctttaagaca cagctcagac actgcctcct ccaagaagcc ccctctggct 63780tcctgtgtga atatgacggc cctctgggct ctagggtatc ttagaacaat gcttccttat 63840ggctttggaa ccccgctgtc tcctggattg ggagcaaatg caggggagga gccacacctg 63900actaatctct gggtctccca gcacataagt ggcataaggg cagggctgtg cccgcttcag 63960gcacttactg aaggatgtac ttggcagagg gtaggcagcc ggcggatgag cccctcactc 64020tccccagctg actgcgtggg cgggaaaggc gggttcagga gacccagcct ccctgggctg 64080tcaccacctc tgcacatcca gccccattga tcaagggttc aatttttggg gtcctgttgg 64140gaggccagga gactctctcc aggcacttct tccaggtctt tgtgttaggg tgtgtgtgtg 64200tgtgtgtgtg tgtgtgtgtg tgtgttgttt gttttatttt atttatttat ttatttattt 64260atttatttat ttatttattt tgagacgcag tctcgctctg ttgcccaggt tggagggtgg 64320tggcatgatc tcggctcact gcaagctccg cctcccgggt tcacgccatt ctcctgcctc 64380actcttcctg agtagccgga ttacaggcgc acgcaccatg cctggctaat tattttgttt 64440ttttagtaga gacagggttt cgccacgttg cccaggctgg tcttgaatcc ctggcctcaa 64500gcgatccgcc cgcctcagcc tcccaaagtg ctgggattac aggcgtgagc caccgtgccc 64560gcccagccta ggggtacatg aaactttttt tttttttttt ttgagacaga gtttcactct 64620gtcctcaggc tggagtgcag tggcgtgatc tcggcgtact gcaatctccg cctcccggtt 64680caagcgattc tcctgcctca gcctcccgag tagctgggat tgcaggcacg cgccaccaca 64740cccagctaat ttttgtattt ttagtagaga cgggctttca ccatgtggga caggatggtc 64800tcgatctcct gacctcgtga tccgcccgcc tcagcctccg aaagtgctgg gattacaggc 64860ctgagccacc gtgcccagcc atgatgtttt gatacaggca tataacgtat aataatcaca 64920tcagggtaaa tgatgtaacc atcacatcaa gcatttatcc tttgtgttac aaaaaaaaat 64980ctaattatac tttcctactt attctttttt tttttttttt ttgagacgga gtctccctca 65040gtcgcccagg ctggagtgca gtggcatgat ctcagttcac tgcaagctct gcctcctagc 65100tctgcctcct gggttcatgc cattctcctg tctcagcctc gcgagtagct gggactacag 65160gcgcctgcca ccgtgcccgg ctaatttttt tttttgtatt tttggtagag acagggtttc 65220accgtgttag ccaggatggt ctcgatctcc tgacctcata atccgcccgt ctcggcctcc 65280caaagtgctg ggattacagg catgagccac cgcccccagc ctatttattc ttaaatgtac 65340aataaattat tgttgactcc agtcaccctg ctgtgctacc aaatacggat cttcttcatt 65400ctatctaact gtatttctgt acctgttaac catctctcct ccacctcacc ccccaaaccc 65460actacccttc tcagcctctg gtaaccatcc ttctactctc tatctctatg agttcaattg 65520tattaatttt tagctccccg gccgggcacg gtggctcacg cctgtaatcc cagcacttca 65580ggaggctgag gcaggtggat cacgaggtca ggagtttgag accagcctgg ccaacatggt 65640ggaaccccat ctctactaaa aacacaaaaa ttagctgggc gtggtggtgg gcgcttgtag 65700tcccagctac ttgggaggct gaggcaggag aatcgcttga aactgggagg cagaggttgc 65760agtgagccaa gattgcgcca ctgcactcca gtctgggtga cagagtaaga ttccatcccg 65820aaaaaaaaaa agtttagctc ccacaaataa gtgagaacac gtgaagtttc tctttctgtg 65880cctcgcttgt ttcacttaac ataatgacct ccagttccat ccacgttgtt gctttgttat 65940aaatgacagg atcttggtca ggcgcagtgg ctcatgcctg taatcccagc actttgggag 66000gctgaggtgg actgatcatg aggtcaagag atcgagacca tcctggctaa cacagtgaaa 66060ccccgtctct actaaaaata caagaaatta gccgggcgtg gtggtgggca cccatttccg 66120ccccttctcg ggacgctgat gcacgacata ttacccatcc ccggaagact aatcctcccc 66180cactctatat tgtacctctt cctttctcct ccacgcgatt ccccgagtaa cccgtcttcc 66240ctccctcctc ggattacgct cacctttccg cttcaatcac gttgctccgt ccccttcccc 66300attcgtacca ctcctcactt tcgtcttcct acccccacta tcccttttcg tcctctctat 66360tccttactta ctcctccccc ttctcttcat acttcattcc ctccgctctt cccactcgcg 66420ctcccacttt cacctagttg ccctcaccta cgttgccatc tcgccccttc ttcagctctc 66480ggcctctcac ccatctgtcc tctctcttac ctctctcctc atctcgctca gacatctctc 66540tagactatcc ctcactttac cttctcagtc gtcttcttcc tatccttcgt tctccatgat 66600cttcacgtcg ccatctcttt tcgccccttt catatgtctc tcttcatgtt ctcactatca 66660ttctcatgat cactatcgtt ctcactactt atcactcccc tctttcttca tcaattcctc 66720tccgtcattc tcgtctctct cttacaaccg ccttccttgt gctatctaac tcaaccatgc 66780ctctcctact ctctctctat cgcccctcca tcgcttatgc atcctcttct attgcacacc 66840cgcccctcca tcgcttatgc atcctcttct attgcacacc gcccctccat cgcttatgca 66900tcctcttcta ttgcacatcc tcttctattg cac 66933<210>12<211>21<212>DNA<213>人工序列<220><223>人工序列为引物。<400>12ctgagcggaa ttcgtgagac c 21<210>13<211>23<212>DNA<213>人工序列<220><223>人工序列为引物。<400>13ttggtctcac gtattccgct cga 23<210>14<211>20<212>DNA<213>人工序列<220><223>人工序列为引物。<400>14ctcgagaatt ctggatcctc 20<210>15<211>22<212>DNA<213>人工序列<220><223>人工序列为引物.<400>15ttgaggatcc agaattctcg ag 22<210>16<211>21<212>DNA<213>人工序列<220><223>人工序列为引物.<400>16tgtatgcgaa ttcgctgcgc g 21<210>17<211>23<212>DNA<213>人工序列<220><223>人工序列为引物.<400>17ttcgcgcagc gaattcgcat aca 23<210>18<211>21<212>DNA<213>人工序列<220><223>人工序列为引物.<400>18gtccactgaa ttctcagtga g 21<210>19<211>23<212>DNA<213>人工序列<220><223>人工序列为引物.<400>19ttgtcactga gaattcagtg gac 23<210>20<211>21<212>DNA<213>人工序列<220><223>人工序列为引物.<400>20gaatccgaat tcctggtcag c 21<210>21<211>23<212>DNA<213>人工序列<220><223>人工序列为引物.<400>21ttgctgacca ggaattcgga ttc 23<210>22<211>33<212>DNA<213>人工序列<220><223>人工序列为引物.<400>22cuacuacuac uactgagcgg aattcgtgag acc 33<210>23<211>32<212>DNA<213>人工序列<220><223>人工序列为引物.<400>23cuacuacuac uactcgagaa ttctggatcc tc 32<210>24<211>33<212>DNA<213>人工序列<220><223>人工序列为引物.<400>24cuacuacuac uatgtatgcg aattcgctgc gcg 33<210>25<211>33<212>DNA<213>人工序列<220><223>人工序列为引物.<400>25cuacuacuac uagtccactg aattctcagt gag 33<210>26<211>33<212>DNA<213>人工序列<220><223>人工序列为引物.<400>26cuacuacuac uagaatccga attcctggtc agc 33<210>27<211>45<212>DNA<213>人工序列<220><223>人工序列为引物.<400>27aactggaaga attcgcggcc gcaggaattt tttttttttt ttttt 45<210>28<211>13<212>DNA<213>人工序列<220><223>人工序列为引物.<400>28aattcggcac gag 13<210>29<211>9<212>DNA<213>人工序列<220><223>人工序列为引物.<400>29ctcgtgccg 9<210>30<211>14<212>DNA<213>人工序列<220><223>人工序列为引物.<400>30gtacgacggc cagt 14<210>31<211>16<212>DNA<213>人工序列<220><223>人工序列为引物.<400>31aacagctatg accatg 16<210>32<211>18<212>DNA<213>人工序列<220><223>人工序列为引物.<400>32ccaagttctg agaagtcc 18<210>33<211>20<212>DNA<213>人工序列<220><223>人工序列为引物.<400>33aatacctgaa accatacctg 20<210>34<211>57<212>DNA<213>人工序列<220><223>人工序列为引物.<400>34agctgctcgt agctgtctct ccctggatca cgggtacatg tactggacag actgggt 57<210>35<211>56<212>DNA<213>人工序列<220><223>人工序列为引物.<400>35tgagacgccc ggattgagcg ggcagggata gcttattccc tgtgccgcat tacggc 56<210>36<211>27<212>DNA<213>人工序列<220><223>人工序列为引物.<400>36agctgctcgt agctgtctct ccctgga 27<210>37<211>27<212>DNA<213>人工序列<220><223>人工序列为引物.<400>37gccgtaatgc ggcacaggga ataagct 27<210>38<211>20<212>DNA<213>人工序列<220><223>人工序列为引物.<400>38gagaggctat atccctgggc 20<210>39<211>20<212>DNA<213>人工序列<220><223>人工序列为引物.<400>39acagcacgtg tttaaagggg 20<210>40<211>163<212>DNA<213>人(Homo sapiens)<400>40actaaagcgc cgccgccgcg ccatggagcc cgagtgagct cggcgcgggc ccgtccggcc 60gccggacaac atggaggcag ctccgcccgg gccgccgtgg ccgctgctgc tgctgctgct 120gctgctgctg gcgctgtgcg gctgcccggc ccccgccgcg gcc 163<210>41<211>419<212>DNA<213>人(Homo sapiens)<400>41gccccacagc ctcgccgctc ctgctatttg ccaaccgccg ggacgtacgg ctggtggacg 60ccggcggagt caagctggag tccaccatcg tggtcagcgg cctggaggat gcggccgcag 120tggacttcca gttttccaag ggagccgtgt actggacaga cgtgagcgag gaggccatca 180agcagaccta cctgaaccag acgggggccg ccgtgcagaa cgtggtcatc tccggcctgg 240tctctcccga cggcctcgcc tgcgactggg tgggcaagaa gctgtactgg acggactcag 300agaccaaccg catcgaggtg gccaacctca atggcacatc ccggaaggtg ctcttctggc 360aggaccttga ccagccgagg gccatcgcct tggaccccgc tcacgggtaa accctgctg 419<210>42<211>221<212>DNA<213>人(Homo sapiens)<400>42ccccgtcaca ggtacatgta ctggacagac tggggtgaga cgccccggat tgagcgggca 60gggatggatg gcagcacccg gaagatcatt gtggactcgg acatttactg gcccaatgga 120ctgaccatcg acctggagga gcagaagctc tactgggctg acgccaagct cagcttcatc 180caccgtgcca acctggacgg ctcgttccgg taggtaccca c 221<210>43<211>221<212>DNA<213>人(Homo sapiens)<400>43tccctgactg caggcagaag gtggtggagg gcagcctgac gcaccccttc gccctgacgc 60tctccgggga cactctgtac tggacagact ggcagacccg ctccatccat gcctgcaaca 120agcgcactgg ggggaagagg aaggagatcc tgagtgccct atactcaccc atggacatcc 180aggtgctgag ccaggagcgg cagccttttt gtgagtgccg g 221<210>44<211>156<212>DNA<213>人(Homo sapiens)<400>44tttctcagtc cacactcgct gtgaggagga caatggcggc tggtcccacc tgtgcctgct 60gtccccaagc gagccttttt acacatgcgc ctgccccacg ggtgtgcaga tgcaggacaa 120cggcaggacg tgtaaggcag gtgaggcggt gggacg 156<210>45<211>416<212>DNA<213>人(Homo sapiens)<400>45ctccacagga gccgaggagg tgctgctgct ggcccggcgg acggacctac ggaggatctc 60gctggacacg ccggacttca ccgacatcgt gctgcaggtg gacgacatcc ggcacgccat 120tgccatcgac tacgacccgc tagagggcta tgtctactgg acagatgacg aggtgcgggc 180catccgcagg gcgtacctgg acgggtctgg ggcgcagacg ctggtcaaca ccgagatcaa 240cgaccccgat ggcatcgcgg tcgactgggt ggcccgaaac ctctactgga ccgacacggg 300cacggaccgc atcgaggtga cgcgcctcaa cggcacctcc cgcaagatcc tggtgtcgga 360ggacctggac gagccccgag ccatcgcact gcaccccgtg atggggtaag acgggc 416<210>46<211>198<212>DNA<213>人(Homo sapiens)<400>46ttcttctcca gcctcatgta ctggacagac tggggagaga accctaaaat cgagtgtgcc 60aacttggatg ggcaggagcg gcgtgtgctg gtcaatgcct ccctcgggtg gcccaacggc 120ctggccctgg acctgcagga ggggaagctc tactggggag acgccaagac agacaagatc 180gaggtgaggc tcctgtgg 198<210>47<211>244<212>DNA<213>人(Homo sapiens)<400>47ccgtcctgca ggtgatcaat gttgatggga cgaagaggcg gaccctcctg gaggacaagc 60tcccgcacat tttcgggttc acgctgctgg gggacttcat ctactggact gactggcagc 120gccgcagcat cgagcgggtg cacaaggtca aggccagccg ggacgtcatc attgaccagc 180tgcccgacct gatggggctc aaagctgtga atgtggccaa ggtcgtcggt gagtccgggg 240ggtc 244<210>48<211>313<212>DNA<213>人(Homo sapiens)<400>48gttcgcttcc aggaaccaac ccgtgtgcgg acaggaacgg ggggtgcagc cacctgtgct 60tctgcacacc ccacgcaacc cggtgtggct gccccatcgg cctggagctg ctgagtgaca 120tgaagacctg catcgtgcct gaggcctttt tggtcttcac cagcagagcc gccatccaca 180ggatctccct cgagaccaat aacaacgacg tggccatccc gctcacgggc gtcaaggagg 240cctcagccct ggactttgat gtgtccaaca accacatcta ctggacagac gtcagcctga 300aggtagcgtg ggc 313<210>49<211>255<212>DNA<213>人(Homo sapiens)<400>49cctgctgcca gaccatcagc cgcgccttca tgaacgggag ctcggtggag cacgtggtgg 60agtttggcct tgactacccc gagggcatgg ccgttgactg gatgggcaag aacctctact 120gggccgacac tgggaccaac agaatcgaag tggcgcggct ggacgggcag ttccggcaag 180tcctcgtgtg gagggacttg gacaacccga ggtcgctggc cctggatccc accaaggggt 240aagtgtttgc ctgtc 255<210>50<211>210<212>DNA<213>人(Homo sapiens)<400>50gtgccttcca gctacatcta ctggaccgag tggggcggca agccgaggat cgtgcgggcc 60ttcatggacg ggaccaactg catgacgctg gtggacaagg tgggccgggc caacgacctc 120accattgact acgctgacca gcgcctctac tggaccgacc tggacaccaa catgatcgag 180tcgtccaaca tgctgggtga gggccgggct 210<210>51<211>352<212>DNA<213>人(Homo sapiens)<400>51gtgttcatgc aggtcaggag cgggtcgtga ttgccgacga tctcccgcac ccgttcggtc 60tgacgcagta cagcgattat atctactgga cagactggaa tctgcacagc attgagcggg 120ccgacaagac tagcggccgg aaccgcaccc tcatccaggg ccacctggac ttcgtgatgg 180acatcctggt gttccactcc tcccgccagg atggcctcaa tgactgtatg cacaacaacg 240ggcagtgtgg gcagctgtgc cttgccatcc ccggcggcca ccgctgcggc tgcgcctcac 300actacaccct ggaccccagc agccgcaact gcagccgtaa gtgcctcatg gt 352<210>52<211>225<212>DNA<213>人(Homo sapiens)<400>52gcctcctcta cgcccaccac cttcttgctg ttcagccaga aatctgccat cagtcggatg 60atcccggacg accagcacag cccggatctc atcctgcccc tgcatggact gaggaacgtc 120aaagccatcg actatgaccc actggacaag ttcatctact gggtggatgg gcgccagaac 180atcaagcgag ccaaggacga cgggacccag gcaggtgccc tgtgg 225<210>53<211>235<212>DNA<213>人(Homo sapiens)<400>53ctttgtctta cagccctttg ttttgacctc tctgagccaa ggccaaaacc cagacaggca 60gccccacgac ctcagcatcg acatctacag ccggacactg ttctggacgt gcgaggccac 120caataccatc aacgtccaca ggctgagcgg ggaagccatg ggggtggtgc tgcgtgggga 180ccgcgacaag cccagggcca tcgtcgtcaa cgcggagcga gggtaggagg ccaac 235<210>54<211>218<212>DNA<213>人(Homo sapiens)<400>54ccaccctccc gcaggtacct gtacttcacc aacatgcagg accgggcagc caagatcgaa 60cgcgcagccc tggacggcac cgagcgcgag gtcctcttca ccaccggcct catccgccct 120gtggccctgg tggtggacaa cacactgggc aagctgttct gggtggacgc ggacctgaag 180cgcattgaga gctgtgacct gtcaggtacg cgccccgg 218<210>55<211>234<212>DNA<213>人(Homo sapiens)<400>55ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 60gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 120gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 180ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggc 234<210>56<211>157<212>DNA<213>人(Homo sapiens)<400>56ttgtctttgc agcagcccac ccatgtgccc gtgacaatgg tggctgctcc cacatctgta 60ttgccaaggg tgatgggaca ccacggtgct catgcccagt ccacctcgtg ctcctgcaga 120acctgctgac ctgtggaggt aggtgtgacc taggtgc 157<210>57<211>272<212>DNA<213>人(Homo sapiens)<400>57gttctcctct gtccctcccc cagagccgcc cacctgctcc ccggaccagt ttgcatgtgc 60cacaggggag atcgactgta tccccggggc ctggcgctgt gacggctttc ccgagtgcga 120tgaccagagc gacgaggagg gctgccccgt gtgctccgcc gcccagttcc cctgcgcgcg 180gggtcagtgt gtggacctgc gcctgcgctg cgacggcgag gcagactgtc aggaccgctc 240agacgaggtg gactgtgacg gtgaggccct cc 272<210>58<211>134<212>DNA<213>人(Homo sapiens)<400>58tctccttgca gccatctgcc tgcccaacca gttccggtgt gcgagcggcc agtgtgtcct 60catcaaacag cagtgcgact ccttccccga ctgtatcgac ggctccgacg agctcatgtg 120tggtgagcca gctt 134<210>59<211>274<212>DNA<213>人(Homo sapiens)<400>59gtttgtctct ggcagaaatc accaagccgc cctcagacga cagcccggcc cacagcagtg 60ccatcgggcc cgtcattggc atcatcctct ctctcttcgt catgggtggt gtctattttg 120tgtgccagcg cgtggtgtgc cagcgctatg cgggggccaa cgggcccttc ccgcacgagt 180atgtcagcgg gaccccgcac gtgcccctca atttcatagc cccgggcggt tcccagcatg 240gccccttcac aggtaaggag cctgagatat ggaa 274<210>60<211>164<212>DNA<213>人(Homo sapiens)<400>60cttccctgcc aggcatcgca tgcggaaagt ccatgatgag ctccgtgagc ctgatggggg 60gccggggcgg ggtgcccctc tacgaccgga accacgtcac aggggcctcg tccagcagct 120cgtccagcac gaaggccacg ctgtacccgc cggtgagggg cggg 164<210>61<211>130<212>DNA<213>人(Homo sapiens)<400>61ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc tccctgtaca 60acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac aggtaggaca 120tcccctgcag 130<210>62<211>496<212>DNA<213>人(Homo sapiens)<400>62tcaaacattc cggccactgc gagaccgtac aggccctaca tcattcgagg aatggcgccc 60ccgacgacgc cctgcagcac cgacgtgtgt gacagcgact acagcgccag ccgctggaag 120gccagcaagt actacctgga tttgaactcg gactcagacc cctatccacc cccacccacg 180ccccacagcc agtacctgtc ggcggaggac agctgcccgc cctcgcccgc caccgagagg 240agctacttcc atctcttccc gccccctccg tccccctgca cggactcatc ctgacctcgg 300ccgggccact ctggcttctc tgtgcccctg taaatagttt taaatatgaa caaagaaaaa 360aatatatttt atgatttaaa aaataaatat aattgggatt ttaaaaacat gagaaatgtg 420aactgtgatg gggtgggcag ggctgggaga actttgtaca gtggagaaat atttataaac 480ttaattttgt aaaaca 496<210>63<211>3081<212>DNA<213>人(Homo sapiens)<400>63cccgccagcc cagcccagcc caaccctact ccctccccac gccagggcag cagccgttgc 60tcagagagaa ggtggaggaa gaaatccaga ccctagcacg cgcgcaccat catggaccat 120tatgattctc agcaaaccaa cgattacatg cagccagaag aggactggga ccgggacctg 180ctcctggacc cggcctggga gaagcagcag agaaagacat tcacggcatg gtgtaactcc 240cacctccgga aggcggggac acagatcgag aacatcgaag aggacttccg ggatggcctg 300aagctcatgc tgctgctgga ggtcatctca ggtgaacgct tggccaagcc agagcgaggc 360aagatgagag tgcacaagat ctccaacgtc aacaaggccc tggatttcat agccagcaaa 420ggcgtcaaac tggtgtccat cggagccgaa gaaatcgtgg atgggaatgt gaagatgacc 480ctgggcatga tctggaccat catcctgcgc tttgccatcc aggacatctc cgtggaagag 540acttcagcca aggaagggct gctcctgtgg tgtcagagaa agacagcccc ttacaaaaat 600gtcaacatcc agaacttcca cataagctgg aaggatggcc tcggcttctg tgctttgatc 660caccgacacc ggcccgagct gattgactac gggaagctgc ggaaggatga tccactcaca 720aatctgaata cggcttttga cgtggcagag aagtacctgg acatccccaa gatgctggat 780gccgaagaca tcgttggaac tgcccgaccg gatgagaaag ccatcatgac ttacgtgtct 840agcttctacc acgccttctc tggagcccag aaggcggaga cagcagccaa tcgcatctgc 900aaggtgttgg ccgtcaacca ggagaacgag cagcttatgg aagactacga gaagctggcc 960agtgatctgt tggagtggat ccgccgcaca atcccgtggc tggagaaccg ggtgcccgag 1020aacaccatgc atgccatgca acagaagctg gaggacttcc gggactaccg gcgcctgcac 1080aagccgccca aggtgcagga gaagtgccag ctggagatca acttcaacac gctgcagacc 1140aagctgcggc tcagcaaccg gcctgccttc atgccctctg agggcaggat ggtctcggac 1200atcaacaatg cctggggctg cctggagcag gtggagaagg gctatgagga gtggttgctg 1260aatgagatcc ggaggctgga gcgactggac cacctggcag agaagttccg gcagaaggcc 1320tccatccacg aggcctggac tgacggcaaa gaggccatgc tgcgacagaa ggactatgag 1380accgccaccc tctcggagat caaggccttg ctcaagaagc atgaggcctt cgagagtgac 1440ctggctgccc accaggaccg tgtggagcag attgccgcca tcgcacagga gctcaatgag 1500ctggactatt atgactcacc cagtgtcaac gcccgttgcc aaaagatctg tgaccagtgg 1560gacaatctgg gggccctaac tcagaagcga agggaagctc tggagcggac cgagaaactg 1620ctggagacca ttgaccagct gtacttggag tatgccaagc gggctgcacc cttcaacaac 1680tggatggagg gggccatgga ggacctgcag gacaccttca ttgtgcacac cattgaggag 1740atccagggac tgaccacagc ccatgagcag ttcaaggcca ccctccctga tgccgacaag 1800gagcgcctgg ccatcctggg catccacaat gaggtgtcca agattgtcca gacctaccac 1860gtcaatatgg cgggcaccaa cccctacaca accatcacgc ctcaggagat caatggcaaa 1920tgggaccacg tgcggcagct ggtgcctcgg agggaccaag ctctgacgga ggagcatgcc 1980cgacagcagc acaatgagag gctacgcaag cagtttggag cccaggccaa tgtcatcggg 2040ccctggatcc agaccaagat ggaggagatc gggaggatct ccattgagat gcatgggacc 2100ctggaggacc agctcagcca cctgcggcag tatgagaaga gcatcgtcaa ctacaagcca 2160aagattgatc agctggaggg cgaccaccag ctcatccagg aggcgctcat cttcgacaac 2220aagcacacca actacaccat ggagcacatc cgtgtgggct gggagcagct gctcaccacc 2280atcgccagga ccatcaatga ggtagagaac cagatcctga cccgggatgc caagggcatc 2340agccaggagc agatgaatga gttccgggcc tccttcaacc actttgaccg ggatcactcc 2400ggcacactgg gtcccgagga gttcaaagcc tgcctcatca gcttgggtta tgatattggc 2460aacgaccccc agggagaagc agaatttgcc cgcatcatga gcattgtgga ccccaaccgc 2520ctgggggtag tgacattcca ggccttcatt gacttcatgt cccgcgagac agccgacaca 2580gatacagcag accaagtcat ggcttccttc aagatcctgg ctggggacaa gaactacatt 2640accatggacg agctgcgccg cgagctgcca cccgaccagg ctgagtactg catcgcgcgg 2700atggccccct acaccggccc cgactccgtg ccaggtgctc tggactacat gtccttctcc 2760acggcgctgt acggcgagag tgacctctaa tccaccccgc ccggccgccc tcgtcttgtg 2820cgccgtgccc acagatgtga aatgaatgta atctaataga agcctaatca gcccaccatg 2880ttctccactg aaaaatcctc tttctttggg gtttttcttt ctttcttttt tgattttgca 2940ctggacggtg acgtcagcct gtacaggctc ccaggggtgg cgtcaaatgc tattgaaatt 3000gcgctgaatc gtatgctttt tccttttgat aaataaacaa tgtaaaaatg tttcaaaaac 3060ctaataaaat aaataaatac g 3081<210>64<211>1324<212>DNA<213>人(Homo sapiens)<220><221>misc_特征<222>(1)...(1324)<223>n=A,T,C or G<400>64ggccgcccgg cgcccccagc agnccgagcc ggggcgcaca gncggggcgc agcccgcgcc 60ccccgccgcg attgacatga tgtttccaca aagcaggcat tcgggctcct cgcacctacc 120ccagcaactc aaattcacca cctcggactc ctgcgaccgc atcaaagacg aatttcagct 180actgcaagct cagtaccaca gcctcaagct cgaatgtgac aagttggcca gtgagaagtc 240agagatgcag cgtcactatg tgatgtacta cgagatgtcc tacggcttga acatcgagat 300gcacaaacag gctgagatcg tcaaaaggct gaacgggatt tgtgcccagg tcctgcccta 360cctctcccaa gagcaccagc agcaggtctt gggagccatt gagagggcca agcaggtcac 420cgctcccgag ctgaactcta tcatccgaca gcagctccaa gcccaccagc tgtcccagct 480gcaggccctg gccctgccct tgaccccact acccgtgggg ctgcagccgc cttcgctgcc 540ggcggtcagc gcaggcaccg gcctcctctc gctgtccgcg ctgggttccc aggcccacct 600ctccaaggaa gacaagaacg ggcacgatgg tgacacccac caggaggatg atggcgagaa 660gtcggattag cagggggccg ggacagggag gttgggaggg gggacagagg ggagacagag 720gcacggagag aaaggaatgt ttagcacaag acacagcgga gctcgggatt ggctaatctc 780ccatagtatt tatggtggcg ccggcggggc cccagcccag cttgcaggcc acctctagct 840ttcttcctac cccattccgg cttccctcct cctcccctgc agcctggtta ggtggatacc 900tgccctgaca tgtgaggcaa gctaaggcct ggagggtcag atgggagacc aggtcccaag 960ggagcaagac ctgcgaagcg cagcagcccc ggcccttccc ccgttttgaa catgtgtaac 1020cgacagtctg ccctgggcca cagccctctc accctggtac tgcatgcacg caatgctagc 1080tgcccctttc ccgtcctggg caccccgagt ctcccccgac cccgggtccc aggtatgctc 1140ccacctccac ctgccccact caccacctct gctagttcca gacacctcca cgcccacctg 1200gtcctctccc atcgcccaca aaaggggggg cacgagggac gagcttagct gagctgggag 1260gagcagggtg agggtgggcg acccaggatt ccccctcccc ttcccaaata aagatgaggg 1320tact 1324<210>65<211>2377<212>DNA<213>人(Homo sapiens)<400>65ggtgacaaag agccaacaga gacaatagga gacttgtcaa tttgtcttga tgggctacag 60ttagagtctg aagttgttac caatggtgaa actacatgtt cagaaagtgc ttctcagaat 120gatgatggct ccagatccaa ggatgaaaca agagtgagca caaatggatc agatgaccct 180gaagatgcag gagctggtga aaataggaga gtcagtggga ataattctcc atcactctca 240aatggtggtt ttaaaccttc tagacctcca agaccttcac gaccaccacc acccacccca 300cgtagaccag catctgtcaa tggttcacca tctgccactt ctgaaagtga tgggtctagt 360acaggctctc tgccgccgac aaatacaaat acaaatacat ctgaaggagc aacatctgga 420ttaataattc ctcttactat atctggaggc tcaggcccta ggccattaaa tcctgtaact 480caagctccct tgccacctgg ttgggagcag agagtggacc agcacgggcg agtttactat 540gtagatcatg ttgagaaaag aacaacatgg gatagaccag aacctctacc tcctggctgg 600gaacggcggg ttgacaacat gggacgtatt tattatgttg accatttcac aagaacaaca 660acgtggcaga ggccaacact ggaatccgtc cggaactatg aacaatggca gctacagcgt 720agtcagcttc aaggagcaat gcagcagttt aaccagagat tcatttatgg gaatcaagat 780ttatttgcta catcacaaag taaagaattt gatcctcttg gtccattgcc acctggatgg 840gagaagagaa cagacagcaa tggcagagta tatttcgtca accacaacac acgaattaca 900caatgggaag accccagaag tcaaggtcaa ttaaatgaaa agcccttacc tgaaggttgg 960gaaatgagat tcacagtgga tggaattcca tattttgtgg accacaatag aagaactacc 1020acctatatag atccccgcac aggaaaatct gccctagaca atggacctca gatagcctat 1080gttcgggact tcaaagcaaa ggttcagtat ttccggttct ggtgtcagca actggccatg 1140ccacagcaca taaagattac agtgacaaga aaaacattgt ttgaggattc ctttcaacag 1200ataatgagct tcagtcccca agatctgcga agacgtttgt gggtgatttt tccaggagaa 1260gaaggtttag attatggagg tgtagcaaga gaatggttct ttcttttgtc acatgaagtg 1320ttgaacccaa tgtattgcct gtttgaatat gcagggaagg ataactactg cttgcagata 1380aaccccgctt cttacatcaa tccagatcac ctgaaatatt ttcgttttat tggcagattt 1440attgccatgg ctctgttcca tgggaaattc atagacacgg gtttttcttt accattctat 1500aagcgtatct tgaacaaacc agttggactc aaggatttag aatctattga tccagaattt 1560tacaattctc tcatctgggt taaggaaaac aatattgagg aatgtgattt ggaaatgtac 1620ttctccgttg acaaagaaat tctaggtgaa attaagagtc atgatctgaa acctaatggt 1680ggcaatattc ttgtaacaga agaaaataaa gaggaataca tcagaatggt agctgagtgg 1740aggttgtctc gaggtgttga agaacagaca caagctttct ttgaaggctt taatgaaatt 1800cttccccagc aatatttgca atactttgat gcaaaggaat tagaggtcct tttatgtgga 1860atgcaagaga ttgatttgaa tgactggcaa agacatgcca tctaccgtca ttatgcaagg 1920accagcaaac aaatcatgtg gttttggcag tttgttaaag aaattgataa tgagaagaga 1980atgagacttc tgcagtttgt tactggaacc tgccgattgc cagtaggagg atttgctgat 2040ctcatgggga gcaatggacc acagaaattc tgcattgaaa aagttgggaa agaaaattgg 2100ctacccagaa gtcatacctg ttttaatcgc ctggacctgc caccatacaa gagctatgag 2160caactgaagg aaaagctgtt gtttgccata gaagaaacag aaggatttgg acaagagtaa 2220cttctgagaa cttgcaccat gaatgggcaa gaacttattt gcaatgtttg tccttctctg 2280cctgttgcac atcttgtaaa attggacaat ggctctttag agagttatct gagtgtaagt 2340aaattaatgt tctcatttaa aaaaaaaaaa aaaaaaa 2377<210>66<211>1295<212>DNA<213>人(Homo sapiens)<400>66gggccgccgc ccacccgggc cttgcctcta cctcagtcgt tgccccccga ttttcggctg 60gagcccacgg ccccggccct cagcccccgc tctagcttcg ccagtagctc ggccagcgac 120gcgagcaagc cgtccagccc ccggggcagc ctgctgctgg acggggcggg ggctggcgga 180gctggaggta gccggccctg cagcaatcgc accagcggca tcagcatggg ctacgaccag 240cgccacggga gccccttgcc agcggggccg tgcctgtttg gcccacccct ggccggagca 300ccggcaggct attctcccgg aggggtcccg tccgcctacc cggagctcca cgccgccctg 360gaccgattgt acgctcagcg gcccgcgggg ttcggctgcc aggaaagccg ccactcgtat 420cccccggccc tgggcagccc tggagctcta gccggggccc gagtgggagc ggcggggccc 480ttggagagac ggggggcgca acccggacga cactctgtga ccggctacgg ggactgcgcc 540gtgggcgccc ggtaccagga cgagctaaca gctttgcttc gcctgacggt gggcaccggt 600gggcgagaag ccggagcccg cggagaaccc tcggggattg agccgtcggg tctggaggag 660ccaccaggtc ctttcgttcc ggaggccgcc cgggcccgga tgcgggagcc agaggccagg 720gaggactact tcggcacctg tatcaagtgc aacaaaggca tctatgggca gagcaatgcc 780tgccaggccc tggacagcct ctaccacacc cagtgctttg tttgctgctc ttgtgggcga 840actttgcgtt gcaaggcttt ctacagtgtc aatggctctg tgtactgtga ggaagattat 900ctgttttcag ggtttcagga ggcagctgag aaatgctgtg tctgtggtca cttgattttg 960gagaagatcc tacaagcaat ggggaagtcc tatcatccag gctgtttccg atgcattgtt 1020tgcaacaagt gcctggatgg catccccttc acagtggact tctccaacca agtatactgt 1080gtcaccgact accacaaaaa ttatgctcct aagtgtgcag cctgtggcca acccatcctc 1140ccctctgagg gctgtgagga catcgtgagg gtgatatcca tggaccggga ttatcacttt 1200gagtgctacc actgtgagga ctgccggatg cagctgagtg atgaggaagg ctgctgctgt 1260ttccctctgg atgggcactt gctctgccat ggttg 1295<210>67<211>3411<212>DNA<213>人(Homo sapiens)<400>67gggcccgggg tcccgccacc accgcgcgcg ggacagattg attcactttg gagctgtaag 60tactgatgta ttagggtgca gcgctcattg ttcattgacg cagagtccca aaatgaatat 120ccaagagcag ggtttcccct tggacctcgg agcaagtttc accgaagatg ctccccgacc 180cccagtgcct ggtgaggagg gagaactggt gtccacagac ccgaggcccg ccagctacag 240tttctgctcc gggaaaggtg ttggcattaa aggtgagact tcgacggcca ctccgaggcg 300ctcggatctg gacctggggt atgagcctga gggcagtgcc tcccccaccc caccatactt 360gaagtgggct gagtcactgc attccctgct ggatgaccaa gatgggataa gcctgttcag 420gactttcctg aagcaggagg gctgtgccga cttgctggac ttctggtttg cctgcactgg 480cttcaggaag ctggagccct gtgactcgaa cgaggagaag aggctgaagc tggcgagagc 540catctaccga aagtacattc ttgataacaa tggcatcgtg tcccggcaga ccaagccagc 600caccaagagc ttcataaagg gctgcatcat gaagcagctg atcgatcctg ccatgtttga 660ccaggcccag accgaaatcc aggccactat ggaggaaaac acctatccct ccttccttaa 720gtctgatatt tatttggaat atacgaggac aggctcggag agccccaaag tctgtagtga 780ccagagctct gggtcaggga cagggaaggg catatctgga tacctgccga ccttaaatga 840agatgaggaa tggaagtgtg accaggacat ggacgaggac gatggcagag acgctgctcc 900ccccggaaga ctccctcaga agctgctcct ggagacagct gccccgaggg tctcctccag 960tagacggtac agcgaaggca gagagttcag gtatggatcc tggcgggagc cagtcaaccc 1020ctattatgtc aatgccggct atgccctggc cccagccacc agtgccaacg acagcgagca 1080gcagagcctg tccagcgatg cagacaccct gtccctcacg gacagcagcg tggatgggat 1140ccccccatac aggatccgta agcagcaccg cagggagatg caggagagcg cgcaggtcaa 1200tgggcgggtg cccctacctc acattccccg cacgtaccgg gtgccgaagg aggtccgcgt 1260ggagcctcag aagttcgcgg aggagctcat ccaccgcctg gaggctgtgc agcgcacgcg 1320ggaggccgag gagaagctgg aggagcggct gaagcgcgtg cgcatggagg aggaaggtga 1380ggacggcgat ccatcgtcag ggcccccagg gccgtgtcac aagctgcctc ccgcccccgc 1440ttggcaccac ttcccgcccc gcttgtgttg gacatgggct tgtgccgggc tccgggatgc 1500acacgaggag aaccctgaga gcatcctgga cgagcacgta cagcgtgtgc tgaggacaac 1560tggccgccag tcgcctgggc ctggccatcg ctccccggac agtgggcacg tggccaagat 1620gccagtggca ctggggggtg ccgcctcggg gcacgggaag cacgtaccca agtcaggggc 1680gaagctggac gcggccggcc tgcaccacca ccgacacgtc caccaccacg tccaccacag 1740cacagcccgg cccaaggagc aggtggaggc cgaggccacc cgcagggccc agagcagctt 1800cgcctggggc ctggaaccac acagccatgg ggcaaggtcc cgaggctact cagagagtgt 1860tggcgctgcc cccaacgcca gtgatggcct cgcccacagt gggaaggtgg gcgttgcgtg 1920caaaagaaat gccaagaagg ctgagtcggg gaagagcgcc agcaccgagg tgccaggtgc 1980ctcggaggat gcggagaaga accagaaaat catgcagtgg atcattgagg gggaaaagga 2040gatcagcagg caccgcagga ccggccacgg gtcttcgggg acgaggaagc cacagcccca 2100tgagaactcc agaccyttgt cccttgagca cccctgggcc ggccctcagc tccggacctc 2160cgtgcagccc tcccacctct tcatccaaga ccccaccatg ccaccccacc cagctcccaa 2220ccccctaacc cagctggagg aggcgcgccg acgtctggag gaggaagaaa agagagccag 2280ccgagcaccc tccaagcaga ggtatgtgca ggaggttatg cggcggggac gcgcctgcgt 2340caggccagcg tgcgcgccgg tgctgcacgt ggtaccagcc gtgtcggaca tggagctctc 2400cgagacagag acaagatcgc agaggaaggt gggcggcggg agtgcccagc cgtgtgacag 2460catcgttgtg gcgtactact tctgcgggga acccatcccc taccgcaccc tggtgagggg 2520ccgcgctgtc accctgggcc agttcaagga gctgctgacc aaaaagggca gctacagata 2580ctacttcaag aaagtgagcg acgagtttga ctgtggggtg gtgtttgagg aggttcgaga 2640ggacgaggcc gtcctgcccg tctttgagga gaagatcatc ggcaaagtgg agaaggtgga 2700ctgataggct ggtgggctgg ccgctgtgcc aggcgaggcc cttggcgggc acgggtgtca 2760cggccaggca gatgacctcg tactcaggag cccgatgggg aacagtgttg ggtgtaccac 2820ccatccctgt ggtctacccg tgtctagagg caggtagggg gtccctccaa gtggtccaca 2880agcttctgtc ctgcccccaa ggaggcagcc tggaccactc ctcatagcaa tacttggagg 2940gcccagccca agtgaggcag ccgaggtccc tgctgccagc ttcaggtgac ccccccccat 3000cccccggcac ctcccttggg cacgtgtgct gggatctact ttccctctgg gatttgccca 3060cgtacccagg tctggctggg gcccaggccc ggatgcagag gcctgcaggg cctctgtcaa 3120ttgtacgcgc caccaagtgc cttcaacaca gcttgtctct tgcctgccac tgtgtgaatc 3180ggcgacggag cactgcacct gcctccagcc gccggctgtg cagtcctggg tcctcctttc 3240tgagggcccg tgtaaatatg tacatttctc aggctagggc cagcaggggc tgcccgagtc 3300tgtttttcat gcgatgacac ttgtacaatt atcttttcaa aggtacttgg ataataatga 3360aataaaactg tttttgaacc tgaataaaaa aaaaaaaaaa aaaaaaaaaa a 3411<210>68<211>3140<212>DNA<213>人(Homo sapiens)<400>68ggctgcgagt acctccatgg tcccggtggc tgtgacggcg gcagtggcgc ctgtcctgtc 60cataaacagc gatttctcag atttgcggga aattaaaaag caactgctgc ttattgcggg 120ccttacccgg gagcggggcc tactacacag tagcaaatgg tcggcggagt tggctttctc 180tctccctgca ttgcctctgg ccgagctgca accgcctccg cctattacag aggaagatgc 240ccaggatatg gatgcctata ccctggccaa ggcctacttt gacgttaaag agtatgatcg 300ggcagcacat ttcctgcatg gctgcaatag caagaaagcc tattttctgt atatgtattc 360cagatatctg tctggagaaa aaaagaagga cgatgaaaca gttgatagct taggccccct 420ggaaaaagga caagtgaaaa atgaggcgct tagagaattg agagtggagc tcagcaaaaa 480acaccaagct cgagaacttg atggatttgg actttatctg tatggtgtgg tgcttcgaaa 540actggacttg gttaaagagg ccattgatgt gtttgtggaa gctactcatg ttttgccctt 600gcattgggga gcctggttag aactctgtaa cctgatcaca gacaaagaga tgctgaagtt 660cctgtctttg ccagacacct ggatgaaaga gttttttctg gctcatatat acacagagtt 720gcagttgata gaggaggccc tgcaaaagta tcagaatctc attgatgtgg gcttctctaa 780gagctcgtat attgtttccc aaattgcagt tgcctatcac aatatcagag atattgacaa 840agccctctcc atttttaatg agctaaggaa acaagaccct tacaggattg aaaatatgga 900cacattctcc aaccttcttt atgtcaggag catgaaatcg gagttgagtt atctggctca 960taacctctgt gagattgata aataccgtgt agaaacgtgc tgtgtaattg gcaattatta 1020cagtttacgt tctcagcatg agaaagcagc cttatatttc cagagagccc tgaaattaaa 1080tcctcggtat cttggtgcct ggacactaat gggacatgag tacatggaga tgaagaacac 1140gtctgctgct atccaggctt atagacatgc cattgaggtc aacaaacggg actacagagc 1200ttggtatggc ctcgggcaga cctatgaaat ccttaagatg ccattttact gcctttatta 1260ttatagacgg gcccaccagc ttcgacccaa tgattctcgc atgctggttg ctttaggaga 1320atgttacgag aaactcaatc aactagtgga agccaaaaag tgttattgga gagcttacgc 1380cgtgggagat gtggagaaaa tggctctggt gaaactggca aagcttcatg aacagttgac 1440tgagtcagaa caggctgccc agtgttacat caaatatatc caagatatct attcctgtgg 1500ggaaatagta gaacacttgg aggaaagcac tgcctttcgc tatctggccc agtactattt 1560taagtgcaaa ctgtgggatg aagcttcaac ttgtgcacaa aagtgttgtg catttaatga 1620tacccgggaa gaaggtaagg ccttactccg gcaaatccta cagcttcgga accaaggcga 1680gactcctacc accgaggtgc ctgctccctt tttcctacct gcttcactct ctgctaacaa 1740tacccccaca cgcagagttt ctccactcaa cttgtcttct gtcacgccat agttggctac 1800tctcaagcca gcacattgtt agacccatct taattaagcc ttacctccat gtaaagaaca 1860gcacgtctgt tccaaggacc tcagctcttc ttgtttctac agatggcaac agctccatag 1920ggacagcttg tataattacc ttcagaggcc aactgacaga atcctggcag gaacagacat 1980tatcttgcca gttagaagta cttctgtctc acttatgtcc aaagagtggc tatagatctt 2040ggccttcttc cctgaatgct tttttttttt ggcccccaag aaagtccctt ttatagcact 2100ttagcacagg caatgctaca ggaacaaagt ttcaatgctg ctgagagtga aagaaaggag 2160gaaagtctgc cactctaccc tgagctggca gtagggcact gagtaccctt aggaagaagt 2220cagagcaatg gatacaaatg accttgctct tggatttgct gagcatgatc cctattctga 2280tgtcagagat taggtttaaa tggaatagag ctatccattt gttcttactc tctagggaga 2340caatcttcca aaacagtttt gggggggtct tctaaagctt tcaaattgga agtaacttta 2400ttcaactaga gttgaataaa agaagggcaa aaataatctc acagagcttg gaactgctga 2460tagcccttac tgagggcaaa agatggctat attgttagct atactcctac caaagcaagc 2520aaggagatag gattatagat aatttcacgg acatttggaa ataacattgg tgattataca 2580gacaagaata aactcacttc aagctggtct gttttaataa attttcaacg taattgtcta 2640tttttttccc tcccatctgc aacagaatac atttttttca gcctttatct agatgaggta 2700aagggaatca ttcttatggt gctcttggag agtttcaggc ctgtgcatgt gtgtacagca 2760ggaggtaata tgctataatg tctgctgtaa tatatttgca cagtagatgc tatggatcat 2820tctgagctca gggtccagac tttattctta ttcccagaat tttgtgttac gtttttacct 2880cctaacatat gacacttcat cttatattaa ggaaggttta gaatatctaa tacgacttga 2940attcatttgt tactaagcct tctcaggcaa gctgtatact agttactggt ctccactgcc 3000atgccttttc aaggttccca tggtccagaa tgatgtttga ttcttaattt ttctgtccct 3060tttataattt gttttaatga ttttgctaca tttggaattc aataaaaaat gtgaacaata 3120ataaaaaaaa aaaaaaaaaa 3140<210>69<211>3513<212>DNA<213>人(Homo sapiens)<400>69ccgtgtacca ggtgctgcta gtgggaagca cgctgctgaa ggaagtgcct tccgggctgc 60agctggagca gttgccttct cagagcctgc tgacccacat cccaacggcg gggctgccca 120cttcgctagg aggaggcctg ccttactgcc accaggcctg gctggatttc cgaaggcggc 180tggaagctct actacagaac tgccaggcag cttgtgccct gctccagggg gccatcgaaa 240gtgtgaaggc tgtgccccag cccatggagc ctggggaggt cggtcagctg ctacagcaga 300cagaggtcct gatgcagcag gtgctagact cgccatggct ggcatggcta caatgccagg 360ggggccggga gctgacatgg ctgaagcaag aggtcccaga ggtgaccctg agcccagact 420acaggacggc aatggacaag gctgacgagc tatatgaccg ggtggatgga ttgctgcacc 480aactgaccct gcagagcaac cagcgaatac aggccctaga gttggtccaa acactggagg 540cccgggaaag cggactgcac cagattgaag tgtggctgca gcaggtgggc tggccagcac 600tggaggaggc tggggagccc tcgctggaca tgctgctcca ggcccaaggc tcttttcagg 660agctgtacca ggttgcccag gagcaggtca ggcaagggga gaagtttctg cagccgctga 720ctggctggga ggcggctgaa ctggaccccc ctggggcacg ctttctggcc ctgcgagccc 780agctgactga attctctagg gctttggccc agcggtgcca gcggctggcg gatgctgaga 840ggctgtttca gctcttcagg gaggccttga cgtgggctga ggaggggcag cgagtgttgg 900cagagctgga gcaggaacgc ccgggggttg tgttgcagca gctgcagctg cactggacca 960ggcaccctga cttgcctcct gcccacttcc gaaagatgtg ggctctggcc acggggctgg 1020gctcagaggc catccgccag gagtgccgct gggcctgggc gcggtgccag gacacctggc 1080tggccctgga ccaaaagctt gaggcttcac tgaagctacc accggtgggc agcacagcta 1140gcctgtgtgt cagccaggtc cccgctgcac ctgcccaccc tcccctgagg aaggcctaca 1200gcttcgatcg gaatctgggg cagagtctca gtgaacctgc ctgccactgc caccatgcgg 1260ccactattgc tgcctgccgc agaccagagg ctggaggagg tgccctgccc caggcatccc 1320ctactgtgcc tccaccaggc agctctgacc ccaggagcct caacaggcta cagctggtgc 1380tggcagagat ggtggccacg gagcgggagt atgtccgggc tctagagtac actatggaga 1440actatttccc cgagctggat cgccccgatg tgccccaggg cctccgcggt cagcgtgccc 1500acctctttgg caacctggag aagctgcggg acttccactg ccacttcttc ctgcgtgagc 1560tggaggcttg cacccggcac ccaccacgag tggcctatgc cttcctgcgc catagggtgc 1620agtttgggat gtacgcgctc tacagcaaga ataagcctcg ctccgatgcc ctgatgtcaa 1680gctatgggca caccttcttc aaggacaagc agcaagcact gggggaccac ctggacctgg 1740cctcctacct gctaaagccc atccagcgca tgggcaagta cgcactgctg ctgcaggagc 1800tggcacgggc ctgcgggggc cccacgcagg agctcagtgc gctgcgggag gcccagagcc 1860ttgtgcactt ccagctgcgg cacggaaacg acctgctggc catggacgcc atccagggct 1920gtgatgttaa cctcaaggaa caggggcagc tggtgcgaca ggatgagttt gtggtgcgca 1980ctgggcgcca caagtccgtg cgccgcatct tcctttttga ggagctgctg ctcttcagca 2040agcctcgcca tgggcccaca ggggttgaca catttgccta caagcgctcc ttcaagatgg 2100cagaccttgg tctcactgag tgctgtggga acagcaacct gcgcttcgag atctggttcc 2160gccgccgcaa ggccagggac acctttgtgc tgcaggcctc cagcctggct atcaagcagg 2220cctggacagc tgacatctcc cacctgcttt ggaggcaggc cgtccacaac aaggaggtgc 2280gcatggctga gatggtgtcc atgggtgtgg ggaacaaggc cttccgagac attgctccca 2340gcgaggaagc catcaacgac cgcaccgtca actatgtcct gaagtgccga gaagttcgct 2400ctcgggcgtc cattgccgta gccccgtttg accatgacag cctctacctg ggggcctcga 2460actcccttcc tggagaccct gcctcttgct ctgttctggg gtccctcaac ctgcacctgt 2520acagagaccc agctcttctg ggtctccgct gtcccctgta tcccagcttc ctagaggaag 2580cagcactgga ggctgaggca gagctgggcg gccagccctc tttgactgct gaggactcag 2640agatctcgtc ccaatgccca tcagccagtg gctccagtgg ctctgacagc agctgtgtgt 2700cagggcaggc cctgggtagg ggcctggagg acttaccctg tgtctgagcc cgggactgga 2760cgagcagtag atccagcagc ctgcagctcc aaggaacatt gcctctctgg atctgctgtg 2820accagggtgt ggctgacacc tgggctacct ccaacctaca tgtgcaacgc tgttgactac 2880cctttctgat gtgtgtggcc attggactaa ctggcacggg gcctctctag ggaagtctgg 2940ttgtagagcc tgaataggct cctggcccca tgaccccttc tcctgtcccc agctcccatc 3000ccagttgtgg gttaagaata ggctagagca gacattgggt gtttccatgc tgtaggctgg 3060tgggggacca tgtgcctcta ggcagtgact agggtgcccc cacccctcag gaagaacaca 3120ggtgggctcc tagcagctga tccccaatgc ctggccttaa agccgagctc agttaccata 3180gggacaggtc cacctctact gggccctcat gcttgccttt cctggccccc aggcccagcc 3240cctttttact ggggcagttt cgttattttg acttgatgcc ttttgaataa ctttcaatag 3300aattgtctaa aattatctta ctggttgtta ggcctttggt gtctcagaga aggagtctag 3360gtctttgatg tgtgatttaa tcttttattt gtttataata aaaaatagac tgatttgtaa 3420aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3480aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 3513<210>70<211>3597<212>DNA<213>人(Homo sapiens)<400>70catgccagtt acttcctcag gaaaatattt tcttgccttc ttctttcagt atggttttaa 60atttgggaac agtggataac ccaagtgtcc cacaggccaa ggtatattcc aatggcagca 120tgatccctgc acccaaagcc agcccctaaa gcctacccct tgtgcacccg cagcctggta 180agtgagcttg gctgcttgtg aggagctaca agtgaaagag aagttatttt aaataaatcc 240caaagtttga ggcagactgt ccaggactgt tcccaggaag aagcaggagt tacccacagg 300aaaagtctct gacctggtcc cctcaggccc agctacctgc gcccaccagc agtgaaggtt 360gatgtactgg cccagcatct ccacctcccc catgcaacca ggtccctggt accgtgtctc 420ccgttgcatg tctggcttct gcctgtgctc ctcctgccac gagcatcctc cctgtccctc 480ctcattccac cgtgtctctc ctgcacacat agcctctgtc ccagggcgat ttatccactt 540gagtacagga gctgctcaga cctctcagcc cagccctctg tgactgcccc agccccatcc 600taccccaccc aaagctgcct tcctggctgt aggagctccc tcgtctagcc aaggccctat 660gggtccccat ccgaggatcc acaagcaatg acttcccaaa tgacctccac tgcaagaaga 720atccttacca ctgtttccag agccgtgaac gatgctgtga tggcccaggt ctcagcacca 780ccctctgtga cctaaaaaga aaagctcaat ttccatctgt cttctttccc aggaccaagg 840ggacacagta atgtgaagtc aaatacttaa ccgagcaaag ggccagtgtt gttatcagtc 900aaggacaaac ctcccacctc acagacagcc aagcagtgag ggaaagacag acagacatag 960gtaggaaggt gctctgcagg cacaaggccc agagaagccc ctctccggga acttcccctg 1020ctccttccag gaacagtgag cccagtgagc agtcccagcc agctcttcaa ggccttcaag 1080gggtctttcc atgactgagt cacctccagg agctcacctg acccccagag aagacctacc 1140ccaggcagct ccgtgccctg gcttctcccc atgccccaaa tcccccccag ccatccctcc 1200tggtcctcgt ctacatcaag ggcctcttcc cctcttcctc ccagctctca ggacaggtga 1260ctgggagacc ttgaaccctc agcctcttcc tttaaaaaaa acaaaacaaa acaaa ctgt 1320gggccattta tttgggattt tggagttgtt tggtttttgt ttttatatct taatagttcg 1380aaagtaagaa gggagccctg ctatggatgt taagtccaaa ttactcggtt agtgggagca 1440aaacctatga cttccaaggg gatgaggaga ggttcagagg acaggaggag cctcccccat 1500tgaaaaaaaa aaaatgggtc aggacattcc ctggatgagg acaatgctag gggtggcatc 1560tcacatggct gctgctattc ctggtgcttc cccacacttt tgacagatgg agtccttctc 1620ctaccgcctc ctgccacctc accctacagg cattctctat gtaggaaaca agagccttat 1680cttatagagt ggggagctga gacacagcct caggtaacac tgacacagct cccgaatgag 1740gctgggacac tctgcaaacc tctcctcatg gtgctaaggg tggcatgctc ttgacaggaa 1800acctaaatga ccactcctct catttggaaa gtaatccact gcagtaaaag tttcagacat 1860gcaagagaga gttttttttt ttttttacta caaatttttg ctcccccata aaattatttt 1920tttattagag ggagtatcca agttttaaaa gtatatagaa ttttttggtt gtaagagaaa 1980tacatactca ttaggatccc gattaaattc cttgagtaga ctggtgccta ccagaaagca 2040aagcaaagtt aaacaaaacg aaacaaaatc cttcatatac aaaaagaact ttctgtttgt 2100attggcagag gtagtgaggt gattcaggta ggctgaaaat cctgggttgc gggagcctca 2160ctttattcca ttcccacccg ctttgatgtc tatgcttggc tctctgggct gcccctggta 2220ctgccgaatc ctacacatct cttatcagct ttcctcaaac tttaaggagg ctctgtgagg 2280gatgggtcat gggaagaccc aagctttccc tccgccagga ttgcaaaagc aagtagactt 2340ggtctatgca gctcttcttc caacaatttc tttatttgga attagaactt cctttgttag 2400tatctttgat cttttgactc aagcacattt tggaagggct cccttacaaa agtagaattt 2460aaaacagagg atacagttaa agagcaaccc aaaggacgct taagaaaccg agaccacttc 2520accgaacagg actaaggaac actttcgtgc acagaagtca gccgcaatcc aggcacagga 2580cgaagatggg atacacgtgc tcatctgtct gtcctccttt cctctccctc cccgacgttc 2640tagttagctt gttgacttgt taaaccttct gttcttaaaa tgaaaagcta gcttacctca 2700aagaatcttg tttccattcg gaaaccaacg attttgtgtt ttagaatgga cagccctccc 2760ctcaccactc cctaccttgg cctggtgtcc ttgagacata cggtctttgc ttagtcgtgt 2820gttggctgct ttgagcagga acgaggcctc caggccctga ggtgggaagg aaggattgga 2880tgccactgcc ctcctcccca ctttagcatg taggggccag cccatctctt ccagcagggt 2940cctgctgagt taccatagca accagcaact ccagggtacc acaacagaca atggctcagc 3000gagccgacgt gtggggatga tgcaggggtt ttggcccagc cagaggaccc agagttgagc 3060ttcaaatgct agagaagggg agaaacagga tggaagggtg gtttaaggaa ctggcagggg 3120tctttgagtc acatagagaa gccgttgaag gaggtagggc aggttatctc tgttccagtc 3180acccccttcc agccccatcc cacttctgtt tcaaactaaa gctcccacct cgaacattga 3240ccctttgtta gaacaaagca aagcatatct ttaaacaaca gtgttaaaat gagcctcaaa 3300tgtatgtgga tgagatctct aagaagaggg tcttctggtt ttgattttta aagaagagta 3360tcctagtaaa atattaaaaa aaaattaaaa agtttttaaa aaggaaacct atgctattta 3420aattggagcc cagttgtaac ttggtaaagg caagcttctg tacctttgtt ataattaatt 3480gtatacctgt gtatgtaaat ataaggcatt cctattttgc agttcagaac aaaaaaaact 3540tatttgtaat atagaataaa gtttattaaa aaataagaaa aaaaaaaaaa aaaaaaa 3597<210>71<211>855<212>DNA<213>人(Homo sapiens)<400>71cgctcaatta tctactcgag tctagactcg aggcggccgc ccattgtgca ctaaagcagg 60ggatagcaac ggcgtccctc ctccccgctc agctgcagcc cgcagtcctc acagtggtaa 120catgccacgt ggtagtctct gtccatggac accacacgga tggttgtctc gcagccctgt 180gcagggagga taggacgggc acaggaggcg cattttggtg caaaaaccgt gtgatagtct 240cgcacgcagt agatgttgtt ctccacgtcc acggtgaagg gaaccccgtc caggcactca 300ttgcacacgg agcaccggaa gcagcctggg tggtaggact tgcccagggc ctgcaggatc 360atttccatga tgagatgtcc acacacgctg catttgtcgg ccgtctgctg gaacccggag 420tacaggaagt cctcctggca gtacactttc tcacccacgt tgtagaacgc cttcccacgg 480agtcgtctcc cacacgagtc gcaggtgaag cagtcagtgt gataaagact ccccattgcc 540tggcacgcct gctgggctcc gtagatgcca agcccacact tgatgcaaat gccgaagtag 600tcccgcgccg tgcgcgcctc gagcgcccgc tccagctccc gggtgagcgc ctccagccgc 660cgctcggccg cgcttgggcc gccctcccgg ccagggggca gcgggagtgc aggcagcggg 720aagggggccg gccccgcagg ctccggggag cgagcgggcg cggggcaggc gccgggcggg 780aggaagtcag cgtaggcttt agtgcacaat gggaattcgg atccggtcga cactagttct 840aggatctctt ttttt 855<210>72<211>3791<212>DNA<213>人(Homo sapiens)<400>72acagacggcg ggtgaacatg gcgtcctcga cttggtctga gacgtgatag gcctgccttc 60tggttgaaga tgtggcgagt gaaaaaactg agcctcagcc tgtcgccttc gccccagacg 120ggaaaaccat ctatgagaac tcctctccgt gaacttaccc tgcagcccgg tgccctcacc 180acctctggaa aaagatcccc cgcttgctcc tcgctgaccc catcactgtg caagctgggg 240ctgcaggaag gcagcaacaa ctcgtctcca gtggattttg taaataacaa gaggacagac 300ttatcttcag aacatttcag tcattcctca aagtggctag aaacttgtca gcatgaatca 360gatgagcagc ctctagatcc aattccccaa attagctcta ctcctaaaac gtctgaggaa 420gcagtagacc cactgggcaa ttatatggtt aaaaccatcg tccttgtacc atctccactg 480gggcagcaac aagacatgat atttgaggcc cgtttagata ccatggcaga gacaaacagc 540atatctttaa atggaccttt gagaacagac gatctggtga gagaggaggt ggcaccctgc 600atgggagaca ggttttcaga agttgctgct gtatctgaga aacctatctt tcaggaatct 660ccgtcccatc tcttagagga gtctccacca aatccctgtt ctgaacaact acattgctcc 720aaggaaagcc tgagcagtag aactgaggct gtgcgtgagg acttagtacc ttctgaaagt 780aacgccttct tgccttcctc tgttctctgg ctttcccctt caactgcctt ggcagcagat 840ttccgtgtca atcatgtgga cccagaggag gaaattgtag agcatggagc tatggaggaa 900agagaaatga ggtttcccac acatcctaag gagtctgaaa cagaagatca agcacttgtc 960tcaagtgtgg aagatattct gtccacatgc ctgacaccaa atctagtaga aatggaatcc 1020caagaagctc caggcccagc agtagaagat gttggtagga ttcttggctc tgatacagag 1080tcttggatgt ccccactggc ctggctggaa aaaggtgtaa atacctccgt catgctggaa 1140aatctccgcc aaagcttatc ccttccctcg atgcttcggg atgctgcaat tggcactacc 1200cctttctcta cttgctcggt ggggacttgg tttactcctt cagcaccaca ggaaaagagt 1260acaaacacat cccagacagg cctggttggc accaagcaca gtacttctga gacagagcag 1320ctcctgtgtg gccggcctcc agatctgact gccttgtctc gacatgactt ggaagataac 1380ctgctgagct ctcttgtcat tgtggagttt ctctcccgcc agcttcggga ctggaagagc 1440cagctggctg tccctcaccc agaaacccag gacagtagca cacagactga cacatctcac 1500agtgggataa ctaataaact tcagcatctt aaggagagcc atgagatggg acaggcccta 1560cagcaggcca gaaatgtcat gcaatcatgg gtgcttatct ctaaagagct gatatccttg 1620cttcacctat ccctgttgca tttagaagaa gataagacta ctgtgaatca ggagtctcgg 1680cgtgcagaaa cattggtctg ttgctgtttt gatttgctga agaaattgag ggcaaagctc 1740cagagcctca aagcagaaag ggaggaggca aggcacagag aggaaatggc tctcagaggc 1800aaggatgcgg cagagatagt gttggaggct ttctgtgcac acgccagcca gcgcatcagc 1860cagctggaac aggacctagc atccatgcgg gaattcagag gccttctgaa ggatgcccag 1920acccaactgg tagggcttca tgccaagcaa gaagagctgg ttcagcagac agtgagtctt 1980acttctacct tgcaacaaga ctggaggtcc atgcaactgg attatacaac atggacagct 2040ttgctgagtc ggtcccgaca actcacagag aaactcacag tcaagagcca gcaagccctg 2100caggaacgtg atgtggcaat tgaggaaaag caggaggttt ctagggtgct ggaacaagtc 2160tctgcccagt tagaggagtg caaaggccaa acagaacaac tggagttgga aaacattcgt 2220ctagcaacag atctccgggc tcagttgcag attctggcca acatggacag ccagctaaaa 2280gagctacaga gtcagcatac ccattgtgcc caggacctgg ctatgaagga tgagttactc 2340tgccagctta cccagagcaa tgaggagcag gctgctcaat gcgtaaagga agagatggca 2400ctaaaacaca tgcaggcaga actgcagcag caacaagctg tcctggccaa agaggtgcgg 2460gacctgaaag agaccttgga gtttgcagac caggagaatc aggttgctca cctggagctg 2520ggtcaggttg agtgtcaatt gaaaaccaca ctggaagtgc tccgggagcg cagcttgcag 2580tgtgagaacc tcaaggacac tgtagagaac ctaacggcta aactggccag caccatagca 2640gataaccagg agcaagatct ggagaaaaca cggcagtact ctcaaaagct agggctgctg 2700actgagcaac tacagagcct gactctcttt ctacagacaa aactaaagga gaagactgaa 2760caagagaccc ttctgctgag tacagcctgt cctcccaccc aggaacaccc tctgcctaat 2820gacaggacct tcctgggaag catcttgaca gcagtggcag atgaagagcc agaatcaact 2880cctgtgccct tgcttggaag tgacaagagt gctttcaccc gagtagcatc aatggtttcc 2940cttcagcccg cagagacccc aggcatggag gagagcctgg cagaaatgag tattatgact 3000actgagcttc agagtctttg ttccctgcta caagagtcta aagaagaagc catcaggact 3060ctgcagcgaa aaatttgtga gctgcaagct aggctgcagg cccaggaaga acagcatcag 3120gaagtccaga aggcaaaaga agcagacata gagaagctga accaggcctt gtgcttgcgc 3180tacaagaatg aaaaggagct ccaggaagtg atacagcaga atgagaagat cctagaacag 3240atagacaaga gtggcgagct cataagcctt agagaggagg tgacccacct tacccgctca 3300cttcggcgtg cggagacaga gaccaaagtg ctccaggagg cctggcaggc cagctggact 3360ccaactgcca gcctatggcc accaattgga tccaggagaa agtgtggctc tctcaggagg 3420tggacaaact gagagtgatg ttcctggaga tgaaaaatga gaaggaaaac tcctgatcaa 3480gttccagagc ccatagaaat atcctagagg agaaccttcg gcgctctgac aaggagttag 3540aaaaactaga tgacattgtt cagcatattt ataagaccct gctctctatt ccagaggtgg 3600tgaggggatg caaagaacta cagggattgc tggaatttct gagctaagaa actgaaagcc 3660agaatttgtt tcacctcttt ttacctgcaa taccccctta ccccaatacc aagaccaact 3720ggcatagagc caactgagat aaatgctatt taaataaagt gtatttaatg aaaaaaaaaa 3780aaaaaaaaaa a 3791<210>73<211>1683<212>DNA<213>人(Homo sapiens)<400>73ctctgagtgt ccagtggtca gttgccccag gatggggacc acagccagag cagccttggt 60cttgacctat ttggctgttg cttctgctgc ctctgaggga ggcttcacgg ctacaggaca 120gaggcagctg aggccagagc actttcaaga agttggctac gcagctcccc cctccccacc 180cctatcccga agcctcccca tggatcaccc tgactcctct cagcatggcc ctccctttga 240gggacagagt caagtgcagc cccctccctc tcaggaggcc acccctctcc aacaggaaaa 300gctgctacct gcccaactcc ctgctgaaaa ggaagtgggt ccccctctcc ctcaggaagc 360tgtccccctc caaaaagagc tgccctctct ccagcacccc aatgaacaga aggaaggaac 420gccagctcca tttggggacc agagccatcc agaacctgag tcctggaatg cagcccagca 480ctgccaacag gaccggtccc aagggggctg gggccaccgg ctggatggct tcccccctgg 540gcggccttct ccagacaatc tgaaccaaat ctgccttcct aaccgtcagc atgtggtata 600tggtccctgg aacctaccac agtccagcta ctcccacctc actcgccagg gtgagaccct 660caatttcctg gagattggat attcccgctg ctgccactgc cgcagccaca caaaccgcct 720agagtgtgcc aaacttgtgt gggaggaagc aatgagccga ttctgtgagg ccgagttctc 780ggtcaagacc cgaccccact ggtgctgcac gcggcagggg gaggctcggt tctcctgctt 840ccaggaggaa gctccccagc cacactacca gctccgggcc tgccccagcc atcagcctga 900tatttcctcg ggtcttgagc tgcctttccc tcctggggtg cccacattgg acaatatcaa 960gaacatctgc cacctgaggc gcttccgctc tgtgccacgc aacctgccag ctactgaccc 1020cctacaaagg gagctgctgg cactgatcca gctggagagg gagttccagc gctgctgccg 1080ccaggggaac aatcacacct gtacatggaa ggcctgggag gatacccttg acaaatactg 1140tgaccgggag tatgctgtga agacccacca ccacttgtgt tgccgccacc ctcccagccc 1200tactcgggat gagtgctttg cccgtcgggc tccttacccc aactatgacc gggacatctt 1260gaccattgac atcagtcgag tcacccccaa cctcatgggc cacctctgtg gaaaccaaag 1320agttctcacc aagcataaac atattcctgg gctgatccac aacatgactg cccgctgctg 1380tgacctgcca tttccagaac aggcctgctg tgcagaggag gagaaattaa ccttcatcaa 1440tgatctgtgt ggtccccgac gtaacatctg gcgagaccct gccctctgct gttacctgag 1500tcctggggat gaacaggtca actgcttcaa catcaattat ctgaggaacg tggctctagt 1560gtctggagac actgagaacg ccaagggcca gggggagcag ggctcaactg gaggaacaaa 1620tatcagctcc acctctgagc ccaaggaaga atgagtcacc ccagagccct agagggtcag 1680atg 1683<210>74<211>1696<212>DNA<213>人(Homo sapiens)<400>74cacctaaaag ccaaaatggg aaaggaaaag actcatatca acattgtcgt cattggacac 60gtagattcgg gcaagtccac cactactggc catctgatct ataaatgcgg tggcatcgac 120aaaagaacca ttgaaaaatt tgagaaggag gctgctgaga tgggaaaggg ctccttcaag 180tatgcctggg tcttggataa actgaaagct gagcgtgaac gtggtatcac cattgatatc 240tccttgtgga aatttgagac cagcaagtac tatgtgacta tcattgatgc cccaggacac 300agagacttta tcaaaaacat gattacaggg acatctcagg ctgactgtgc tgtcctgatt 360gttgctgctg gtgttggtga atttgaagct ggtatctcca agaatgggca gacccgagag 420catgcccttc tggcttacac actgggtgtg aaacaactaa ttgtcggtgt taacaaaatg 480gattccactg agccacccta cagccagaag agatatgagg aaattgttaa ggaagtcagc 540acttacatta agaaaattgg ctacaacccc gacacagtag catttgtgcc aatttctggt 600tggaatggtg acaacatgct ggagccaagt gctaacatgc cttggttcaa gggatggaaa 660gtcacccgta aggatggcaa tgccagtgga accacgctgc ttgaggctgt ggactgcatc 720ctaccaccaa ctcgtccaac tgacaagccc ttgcgcctgc ctctccagga tgtctacaaa 780attggtggta ttggtactgt tcctgttggc cgagtggaga ctggtgttct caaacccggt 840atggtggtca cctttgctcc agtcaacgtt acaacggaag taaaatctgt cgaaatgcac 900catgaagctt tgagtgaagc tcttcctggg gacaatgtgg gcttcaatgt caagaatgtg 960tctgtcaagg atgttcgtcg tggcaacgtt gctggtgaca gcaaaaatga cccaccaatg 1020gaagcagctg gcttcactgc tcaggtgatt atcctgaacc atccaggcca aataagcgcc 1080ggctatgccc ctgtattgga ttgccacacg gctcacattg catgcaagtt tgctgagctg 1140aaggaaaaga ttgatcgccg ttctggtaaa aagctggaag atggccctaa attcttgaag 1200tctggtgatg ctgccattgt tgatatggtt cctggcaagc ccatgtgtgt tgagagcttc 1260tcagactatc cacctttggg tcgctttgct gttcgtgata tgagacagac agttgcggtg 1320ggtgtcatca aagcagtgga caagaaggct gctggagctg gcaaggtcac caagtctgcc 1380cagaaagctc agaaggctaa atgaatatta tccctaatac ctgccacccc actcttaatc 1440agtggtggaa gaacggtctc agaactgttt gtttcaattg gccatttaag tttagtagta 1500aaagactggt taatgataac aatgcatcgt aaaaccttca gaaggaaagg agaatgtttt 1560gtggaccact ttggttttct tttttgcgtg tggcagtttt aagttattag tttttaaaat 1620cagtactttt taatggaaac aacttgacca aaaatttgtc acagaatttt gagacccatt 1680aaaaaagtta aatgag 1696<210>75<211>7680<212>DNA<213>人(Homo sapiens)<400>75gaagagcaag aggcaggctc agcaaatggt tcagccccag tccccggtgg ctgtcagtca 60aagcaagccc ggttgttatg acaatggaaa acactatcag ataaatcaac agtgggagcg 120gacctaccta ggtaatgtgt tggtttgtac ttgttatgga ggaagccgag gttttaactg 180cgaaagtaaa cctgaagctg aagagacttg ctttgacaag tacactggga acacttaccg 240agtgggtgac acttatgagc gtcctaaaga ctccatgatc tgggactgta cctgcatcgg 300ggctgggcga gggagaataa gctgtaccat cgcaaaccgc tgccatgaag ggggtcagtc 360ctacaagatt ggtgacacct ggaggagacc acatgagact ggtggttaca tgttagagtg 420tgtgtgtctt ggtaatggaa aaggagaatg gacctgcaag cccatagctg agaagtgttt 480tgatcatgct gctgggactt cctatgtggt cggagaaacg tgggagaagc cctaccaagg 540ctggatgatg gtagattgta cttgcctggg agaaggcagc ggacgcatca cttgcacttc 600tagaaataga tgcaacgatc aggacacaag gacatcctat agaattggag acacctggag 660caagaaggat aatcgaggaa acctgctcca gtgcatctgc acaggcaacg gccgaggaga 720gtggaagtgt gagaggcaca cctctgtgca gaccacatcg agcggatctg gccccttcac 780cgatgttcgt gcagctgttt accaaccgca gcctcacccc cagcctcctc cctatggcca 840ctgtgtcaca gacagtggtg tggtctactc tgtggggatg cagtggttga agacacaagg 900aaataagcaa atgctttgca cgtgcctggg caacggagtc agctgccaag agacagctgt 960aacccagact tacggtggca acttaaatgg agagccatgt gtcttaccat tcacctacaa 1020tggcaggacg ttctactcct gcaccacgga agggcgacag gacggacatc tttggtgcag 1080cacaacttcg aattatgagc aggaccagaa atactctttc tgcacagacc acactgtttt 1140ggttcagact caaggaggaa attccaatgg tgccttgtgc cacttcccct tcctatacaa 1200caaccacaat tacactgatt gcacttctga gggcagaaga gacaacatga agtggtgtgg 1260gaccacacag aactatgatg ccgaccagaa gtttgggttc tgccccatgg ctgcccacga 1320ggaaatctgc acaaccaatg aaggggtcat gtaccgcatt ggagatcagt gggataagca 1380gcatgacatg ggtcacatga tgaggtgcac gtgtgttggg aatggtcgtg gggaatggac 1440atgcattgcc tactcgcaac ttcgagatca gtgcattgtt gatgacatca cttacaatgt 1500gaacgacaca ttccacaagc gtcatgaaga ggggcacatg ctgaactgta catgcttcgg 1560tcagggtcgg ggcaggtgga agtgtgatcc cgtcgaccaa tgccaggatt cagagactgg 1620gacgttttat caaattggag attcatggga gaagtatgtg catggtgtca gataccagtg 1680ctactgctat ggccgtggca ttggggagtg gcattgccaa cctttacaga cctatccaag 1740ctcaagtggt cctgtcgaag tatttatcac tgagactccg agtcagccca actcccaccc 1800catccagtgg aatgcaccac agccatctca catttccaag tacattctca ggtggagacc 1860taaaaattct gtaggccgtt ggaaggaagc taccatacca ggccacttaa actcctacac 1920catcaaaggc ctgaagcctg gtgtggtata cgagggccag ctcatcagca tccagcagta 1980cggccaccaa gaagtgactc gctttgactt caccaccacc agcaccagca cacctgtgac 2040cagcaacacc gtgacaggag agacgactcc cttttctcct cttgtggcca cttctgaatc 2100tgtgaccgaa atcacagcca gtagctttgt ggtctcctgg gtctcagctt ccgacaccgt 2160gtcgggattc cgggtggaat atgagctgag tgaggaggga gatgagccac agtacctgga 2220tcttccaagc acagccactt ctgtgaacat ccctgacctg cttcctggcc gaaaatacat 2280tgtaaatgtc tatcagatat ctgaggatgg ggagcagagt ttgatcctgt ctacttcaca 2340aacaacagcg cctgatgccc ctcctgaccc gactgtggac caagttgatg acacctcaat 2400tgttgttcgc tggagcagac cccaggctcc catcacaggg tacagaatag tctattcgcc 2460atcagtagaa ggtagcagca cagaactcaa ccttcctgaa actgcaaact ccgtcaccct 2520cagtgacttg caacctggtg ttcagtataa catcactatc tatgctgtgg aagaaaatca 2580agaaagtaca cctgttgtca ttcaacaaga aaccactggc accccacgct cagatacagt 2640gccctctccc agggacctgc agtttgtgga agtgacagac gtgaaggtca ccatcatgtg 2700gacaccgcct gagagtgcag tgaccggcta ccgtgtggat gtgatccccg tcaacctgcc 2760tggcgagcac gggcagaggc tgcccatcag caggaacacc tttgcagaag tcaccgggct 2820gtcccctggg gtcacctatt acttcaaagt ctttgcagtg agccatggga gggagagcaa 2880gcctctgact gctcaacaga caaccaaact ggatgctccc actaacctcc agtttgtcaa 2940tgaaactgat tctactgtcc tggtgagatg gactccacct cgggcccaga taacaggata 3000ccgactgacc gtgggcctta cccgaagagg ccagcccagg cagtacaatg tgggtccctc 3060tgtctccaag taccccctga ggaatctgca gcctgcatct gagtacaccg tatccctcgt 3120ggccataaag ggcaaccaag agagccccaa agccactgga gtctttacca cactgcagcc 3180tgggagctct attccacctt acaacaccga ggtgactgag accaccatcg tgatcacatg 3240gacgcctgct ccaagaattg gttttaagct gggtgtacga ccaagccagg gaggagaggc 3300accacgagaa gtgacttcag actcaggaag catcgttgtg tccggcttga ctccaggagt 3360agaatacgtc tacaccatcc aagtcctgag agatggacag gaaagagatg cgccaattgt 3420aaacaaagtg gtgacaccat tgtctccacc aacaaacttg catctggagg caaaccctga 3480cactggagtg ctcacagtct cctgggagag gagcaccacc ccagacatta ctggttatag 3540aattaccaca acccctacaa acggccagca gggaaattct ttggaagaag tggtccatgc 3600tgatcagagc tcctgcactt ttgataacct gagtcccggc ctggagtaca atgtcagtgt 3660ttacactgtc aaggatgaca aggaaagtgt ccctatctct gataccatca tcccagctgt 3720tcctcctccc actgacctgc gattcaccaa cattggtcca gacaccatgc gtgtcacctg 3780ggctccaccc ccatccattg atttaaccaa cttcctggtg cgttactcac ctgtgaaaaa 3840tgaggaagat gttgcagagt tgtcaatttc tccttcagac aatgcagtgg tcttaacaaa 3900tctcctgcct ggtacagaat atgtagtgag tgtctccagt gtctacgaac aacatgagag 3960cacacctctt agaggaagac agaaaacagg tcttgattcc ccaactggca ttgacttttc 4020tgatattact gccaactctt ttactgtgca ctggattgct cctcgagcca ccatcactgg 4080ctacaggatc cgccatcatc ccgagcactt cagtgggaga cctcgagaag atcgggtgcc 4140ccactctcgg aattccatca ccctcaccaa cctcactcca ggcacagagt atgtggtcag 4200catcgttgct cttaatggca gagaggaaag tcccttattg attggccaac aatcaacagt 4260ttctgatgtt ccgagggacc tggaagttgt tgctgcgacc cccaccagcc tactgatcag 4320ctgggatgct cctgctgtca cagtgagata ttacaggatc acttacggag aaacaggagg 4380aaatagccct gtccaggagt tcactgtgcc tgggagcaag tctacagcta ccatcagcgg 4440ccttaaacct ggagttgatt ataccatcac tgtgtatgct gtcactggcc gtggagacag 4500ccccgcaagc agcaagccaa tttccattaa ttaccgaaca gaaattgaca aaccatccca 4560gatgcaagtg accgatgttc aggacaacag cattagtgtc aagtggctgc cttcaagttc 4620ccctgttact ggttacagag taaccaccac tcccaaaaat ggaccaggac caacaaaaac 4680taaaactgca ggtccagatc aaacagaaat gactattgaa ggcttgcagc ccacagtgga 4740gtatgtggtt agtgtctatg ctcagaatcc aagcggagag agtcagcctc tggttcagac 4800tgcagtaacc aacattgatc gccctaaagg actggcattc actgatgtgg atgtcgattc 4860catcaaaatt gcttgggaaa gcccacaggg gcaagtttcc aggtacaggg tgacctactc 4920gagccctgag gatggaatcc atgagctatt ccctgcacct gatggtgaag aagacactgc 4980agagctgcaa ggcctcagac cgggttctga gtacacagtc agtgtggttg ccttgcacga 5040tgatatggag agccagcccc tgattggaac ccagtccaca gctattcctg caccaactga 5100cctgaagttc actcaggtca cacccacaag cctgagcgcc cagtggacac cacccaatgt 5160tcagctcact ggatatcgag tgcgggtgac ccccaaggag aagaccggac caatgaaaga 5220aatcaacctt gctcctgaca gctcatccgt ggttgtatca ggacttatgg tggccaccaa 5280atatgaagtg agtgtctatg ctcttaagga cactttgaca agcagaccag ctcagggtgt 5340tgtcaccact ctggagaatg tcagcccacc aagaagggct cgtgtgacag atgctactga 5400gaccaccatc accattagct ggagaaccaa gactgagacg atcactggct tccaagttga 5460tgccgttcca gccaatggcc agactccaat ccagagaacc atcaagccag atgtcagaag 5520ctacaccatc acaggtttac aaccaggcac tgactacaag atctacctgt acaccttgaa 5580tgacaatgct cggagctccc ctgtggtcat cgacgcctcc actgccattg atgcaccatc 5640caacctgcgt ttcctggcca ccacacccaa ttccttgctg gtatcatggc agccgccacg 5700tgccaggatt accggctaca tcatcaagta tgagaagcct gggtctcctc ccagagaagt 5760ggtccctcgg ccccgccctg gtgtcacaga ggctactatt actggcctgg aaccgggaac 5820cgaatataca atttatgtca ttgccctgaa gaataatcag aagagcgagc ccctgattgg 5880aaggaaaaag acagacgagc ttccccaact ggtaaccctt ccacacccca atcttcatgg 5940accagagatc ttggatgttc cttccacagt tcaaaagacc cctttcgtca cccaccctgg 6000gtatgacact ggaaatggta ttcagcttcc tggcacttct ggtcagcaac ccagtgttgg 6060gcaacaaatg atctttgagg aacatggttt taggcggacc acaccgccca caacggccac 6120ccccataagg cataggccaa gaccataccc gccgaatgta ggacaagaag ctctctctca 6180gacaaccatc tcatgggccc cattccagga cacttctgag tacatcattt catgtcatcc 6240tgttggcact gatgaagaac ccttacagtt cagggttcct ggaacttcta ccagtgccac 6300tctgacaggc ctcaccagag gtgccaccta caacatcata gtggaggcac tgaaagacca 6360gcagaggcat aaggttcggg aagaggttgt taccgtgggc aactctgtca acgaaggctt 6420gaaccaacct acggatgact cgtgctttga cccctacaca gtttcccatt atgccgttgg 6480agatgagtgg gaacgaatgt ctgaatcagg ctttaaactg ttgtgccagt gcttaggctt 6540tggaagtggt catttcagat gtgattcatc tagatggtgc catgacaatg gtgtgaacta 6600caagattgga gagaagtggg accgtcaggg agaaaatggc cagatgatga gctgcacatg 6660tcttgggaac ggaaaaggag aattcaagtg tgaccctcat gaggcaacgt gttacgatga 6720tgggaagaca taccacgtag gagaacagtg gcagaaggaa tatctcggtg ccatttgctc 6780ctgcacatgc tttggaggcc agcggggctg gcgctgtgac aactgccgca gacctggggg 6840tgaacccagt cccgaaggca ctactggcca gtcctacaac cagtattctc agagatacca 6900tcagagaaca aacactaatg ttaattgccc aattgagtgc ttcatgcctt tagatgtaca 6960ggctgacaga gaagattccc gagagtaaat catctttcca atccagagga acaagcatgt 7020ctctctgcca agatccatct aaactggagt gatgttagca gacccagctt agagttcttc 7080tttctttctt aagccctttg ctctggagga agttctccag cttcagctca actcacagct 7140tctccaagca tcaccctggg agtttcctga gggttttctc ataaatgagg gctgcacatt 7200gcctgttctg cttcgaagta ttcaataccg ctcagtattt taaatgaagt gattctaaga 7260tttggtttgg gatcaatagg aaagcatatg cagccaacca agatgcaaat gttttgaaat 7320gatatgacca aaattttaag taggaaagtc acccaaacac ttctgctttc acttaagtgt 7380ctggcccgca atactgtagg aacaagcatg atcttgttac tgtgatattt taaatatcca 7440cagtactcac tttttccaaa tgatcctagt aattgcctag aaatatcttt ctcttacctg 7500ttatttatca atttttccca gtatttttat acggaaaaaa ttgtattgaa aacacttagt 7560atgcagttga taagaggaat ttggtataat tatggtgggt gattattttt tatactgtat 7620gtgccaaagc tttactactg tggaaagaca actgttttaa taaaagattt acattccaca 7680<210>76<211>1316<212>DNA<213>人(Homo sapiens)<400>76tcctaatacg actcactata gggctcgagc ggccgcccgg gcaggtcgaa tgcaggcgac 60ttgcgagctg ggagcgattt aaaacgcttt ggattccccc ggcctgggtg gggagagcga 120gctgggtgcc ccctagattc cccgcccccg cacctcatga gccgaccctc ggctccatgg 180agcccggcaa ttatgccacc ttggatggag ccaaggatat cgaaggcttg ctgggagcgg 240gaggggggcg gaatctggtc gcccactccc ctctgaccag ccacccagcg gcgcctacgc 300tgatgcctgc tgtcaactat gcccccttgg atctgccagg ctcggcggag ccgccaaagc 360aatgccaccc atgccctggg gtgccccagg ggacgtcccc agctcccgtg ccttatggtt 420actttggagg cgggtactac tcctgccgag tgtcccggag ctcgctgaaa ccctgtgccc 480aggcagccac cctggccgcg taccccgcgg agactcccac ggccggggaa gagtacccca 540gtcgccccac tgagtttgcc ttctatccgg gatatccggg aacctaccac gctatggcca 600gttacctgga cgtgtctgtg gtgcagactc tgggtgctcc tggagaaccg cgacatgact 660ccctgttgcc tgtggacagt taccagtctt gggctctcgc tggtggctgg aacagccaga 720tgtgttgcca gggagaacag aacccaccag gtcccttttg gaaggcagca tttgcagact 780ccagcgggca gcaccctcct gacgcctgcg cctttcgtcg cggccgcaag aaacgcattc 840cgtacagcaa ggggcagttg cgggagctgg agcgggagta tgcggctaac aagttcatca 900ccaaggacaa gaggcgcaag atctcggcag ccaccagcct ctcggagcgc cagattacca 960tctggtttca gaaccgccgg gtcaaagaga agaaggttct cgccaaggtg aagaacagcg 1020ctacccctta agagatctcc ttgcctgggt gggaggagcg aaagtggggg tgtcctgggg 1080agaccagaaa cctgccaagc ccaggctggg gccaaggact ctgctgagag gcccctagag 1140acaacaccct tcccaggcca ctggctgctg gactgttcct caggagcggc ctgggtaccc 1200agtatgtgca gggagacgga accccatgtg acaggcccac tccaccaggg ttcccaaaga 1260acctggccca gtcataatca ttcatcctca cagtggcaat aatcacgata accagt 1316<210>77<211>566<212>DNA<213>人(Homo sapiens)<400>77cccaccaaac ccataaagag ggtgggtcga cccacgcgtc cgcggacgcg tgggaaatta 60ttgaattgga aacagaaata gaaaagttta aagctgagaa cgcatcttta gctaaacttc 120gcattgaacg agaaagtgcc ttggaaaaac tcaggaaaga aattgcagac ttcgaacaac 180agaaagcaaa agaattagct cgaatagaag agtttaaaaa ggaggagatg aggaagctac 240aaaaggaacg taaagttttt gaaaagtata ctacagctgc aagaactttt ccagataaaa 300aggaacgtga agaaatacag actttaaaac agcaaatagc agatttacgg gaagatttga 360aaagaaagga gaccaaatgg tcaagtacac acagccgtct cagaagccag atacaaatgt 420tagtcagaga gaacacagac ctccgggaag aaataaaagt gatggaaaga ttccgactgg 480atgcctggaa gagagcagaa gccatagaga gcagcctcga ggtggagaag aaggacaagc 540ttgcgaacac atctgttcga tttcaa 566<210>78<211>5067<212>DNA<213>人(Homo sapiens)<400>78gcccggacac ctgtctgcag catggataag tatgacgacc tgggcctgga ggccagtaaa 60ttcatcgagg acctgaacat gtatgaggcc tctaaggatg ggctcttccg agtggacaag 120ggtgcaggca acaaccccga gtttgaggaa actcgcaggg tgttcgccac caagatggcc 180aaaatccacc tccagcagca gcagcagcag ctcctgcagg aggagactct gcccaggggg 240agtagaggcc ctgtcaatgg agggggccgc ctgggcccac aggcccgttg ggaagttgtg 300ggcagcaagc tgactgtgga tggtgctgcc aagcctcctc ttgctgcctc gacaggggca 360cctggggcag tcaccaccct cgctgctggg cagcccccgt acccaccgca ggagcagaga 420tccaggccat acctgcatgg cacgaggcat ggcagccagg actgtggttc cagggagagc 480ctggcgactt ctgagatgtc tgctttccac cagccaggcc cctgtgagga tccttcctgc 540ctcactcatg gagactatta tgacaacctc tccttggcaa gcccaaagtg gggtgacaaa 600ccaggagtgt cccccagcat cggcctgagt gtagggagtg ggtggcctag ctccccgggg 660agtgacccac cactgcccaa accctgcggg gaccatcccc taaatcaccg acagctctcc 720ctgagctcca gcaggtcttc tgagggtagc ctcggtggtc agaatagtgg cattggtggc 780cgcagcagcg agaagccaac aggcctttgg tccactgcct cctcccagcg ggtgagccct 840ggcctgcctt ccccaaactt ggagaacgga gcaccagctg tggggcctgt tcagcccagg 900actccttctg tgtcagcacc cttggccctg agctgcccca ggcaaggagg tcttccaaga 960tcaaactcgg ggctgggggg tgaggtttca ggtgtgatgt ccaaacccaa tgtggacccc 1020caaccctggt tccaggatgg gcccaaatct tacctttcca gttctgcccc gtcatcctcg 1080ccagctggcc tggacggttc acagcagggt gcggtccctg ggctggggcc gaagcctggc 1140tgcacagacc ttggcactgg tcccaagctc agccccacca gtcttgtcca tccagtgatg 1200tccaccctgc ctgagttatc ttgtaaagag ggtcccctgg gctggtcttc tgatggtagc 1260ctgggatctg tgctcctgga cagccccagc tcccctaggg taaggctgcc ctgccagccc 1320ctcgtcccag gtcctgagct gagaccctct gctgctgagt tgaaattaga agccctcacc 1380caacgtctgg agcgagagat ggatgctcac ccgaaggctg attactttgg agcctgtgtg 1440aaatgcagca aaggggtgtt tggggctggc caggcctgtc aggccatggg gaacctctac 1500catgacacat gcttcacctg tgcagcttgc agccggaagc tgagaggaaa agccttttat 1560tttgtcaacg gcaaagtgtt ttgtgaagaa gacttcctgt actctggttt ccagcagtcg 1620gctgacaggt gttttctttg tggacatctg atcatggaca tgatcctgca agccctgggg 1680aagtcctacc accccggctg tttccgctgt gtcatctgta atgagtgttt ggatggggtg 1740cccttcaccg tggactcaga gaacaagatc tactgtgtcc gagattacca caaggtgctg 1800gcccccaagt gtgcagcctg tgggcttccc atccttccac ctgagggctc agatgagacc 1860atccgtgtcg tgtccatgga cagagactac cacgtggagt gttaccactg cgaggactgt 1920ggtctggagc tcaatgatga agatggccac cgctgttatc cgctggagga ccacctgttc 1980tgtcactcct gccacgtgaa gaggctggag aagagaccct catctacagc ccttcaccag 2040caccacttct agccagagcc acttgcagac atcacggcag gggatgagga gccggggttg 2100ctgctgctgc ttccggtggc ccctggggtg gaagtggggt aggggaagag gaggggcagg 2160agggagagtt cctgtgagca tgtggggggt gcctttcctt taaccaggga ggtgaacact 2220acctgcctcc tgcgtgtatt ttccaagtgc ttttctctgt tgccacattt tcctcaggtt 2280actcaggaaa atgctccagc atgtgcgagc acatgacctg aggttgcatc atagcaccaa 2340aggaatcctc ctgtcccctc tgggaacatt tcatgcttca gagggagagg tttttattga 2400gcttgtttca caatatcccc ttgaagggac agctcagctg ccaatacatt caaccctttc 2460tcttccttca ggaaaatacc tatacccaaa tgttccctcc cccgacatat atcatggcat 2520gacttaaggc ttcttttcac ctgagagctt cagttcttct gcagaatggc tgcaaattta 2580attgcattaa ggcaagaagg aagctctaat gtgtgctttg tatcctaaga taaatttgct 2640tagaaaacca gagtcaagat ttgaaatagg tgaggcaggg tttcctcctt agacactgac 2700agcattctcc gtaccccttc aaatccttac tctcctaaag gcagctgagt ccgcgacaga 2760aatttgccct atgggagtaa aacatacttt gggagaagaa cttggtgcag gcaccaggat 2820tttttttttt gcccacgtgt ttgcgctgtt tttctctgga gttctcaaga gttggtgact 2880tggaaggccg cttctgcaag gcaagtctca ggaacccatg caggtacatc gcttgcacct 2940gtttttagct tatttaatga cgggcttttg ggaagagctg cccgcatact gagagacagc 3000ttcttataaa caaggagagt ttttgtgtgt gcgagatctc taagccagcg tgggagggag 3060cgcctcagga taagttatta tattcatttc gttggtttct ctcctgccca attcttggca 3120caggcattat gtttgaagaa accaggataa ggtacactgc ttttgtctgt ttaatttttt 3180tagttgtttc ccttcacttt cagtcttcca cacacaaaaa atacctcaca gagcttcacc 3240aaatcacaga ttcaggagga atttggcttt cacactggac tcagatacct tcttcagtgt 3300gttggaaatc actggcttca cacaggccca actccaactg gtcagggcag agtgatcgta 3360actaaaggtc agtggggaat agatccgatt cagtgctttt gccttatgca tttcagcatc 3420ctggctcccc agggtggcag gagctgagga agggccacac actggcaaga tttcaagacc 3480actctctgca ctgaagaggt aaaatttgca ctgcaagtca catccctgag gccagaggtc 3540agtacccttt ggtatttcga ttagaagaag ctgcaaaaga aaggcagccc attttaccat 3600tgccagccag gccggggaca caggagccgg tgtgtgcact ctgcctccta acattgcacc 3660cagagcaaga ggactgggtg ctgggctgca gaggccggtc agtggagccc ctagcacgtg 3720tgaactcagg cttttcattg ggcccggctc cacttctagg ccatgttttg actcatttgg 3780taaccattgc ctgtaagcag cacagaattg gtgccatgga ttatcttttc catgttgatg 3840gaattcattc tgttggaatc ctttggccag atgtcacttc agccagggtg tgcatcatca 3900ttggttcttt ttcacaggct gagcctcctg aaaacccatg aacgctgggg ctggggaagt 3960gaaccctgag gtggggaccc tctcttccca tcaaatcatc cagctcagtg tggggcgtgg 4020caggggggta aatgaagcca gccaatgtgt taacctgtct ctgtcaacct aagaatgttg 4080gccttactga cacacctttg ctccatgttc aagaccagaa gtagctggga tttgtttgca 4140aattgggtaa ttagtttaaa aatctgtgat tacattttta aatgaaattt tcaaagtggc 4200ctagattgag gtgattcaga taggtttgcg aatataccat tttatattgt tgagaaagaa 4260caaaaaggga atttccagat gtcctagaaa tcctagcaac agatttctct ggttgtcagt 4320ttccctggag aaggcgccag ataggaatct ccaatcagtt gtttttctct tcgcttcagg 4380cccttacaca aaagccatga agagatgttc acctacccgg tattttaaat gttctgtaaa 4440ttattagcca aatagaactg taatggggtt gtatttatgg gcgcctagaa agaaaacaca 4500aggacttggt aggccaggaa gaaaagattt taaaatttag aatgaatagc ccttctgggt 4560tttctttttg acaattcttg gacttgaggt aaaacaagga ggattgtggc cggatttcag 4620atcccaaagc cagcctccat cttaggcctt tgcctcattg tgccttttag gttttcttac 4680ccaccgtctc ctgttttgtc ttttttttct tttctcctac ccctatcttg ggacattcag 4740aaactgcctg ggtggtttga gaagagacaa cccagtttga tctgcaatac aaggatccat 4800tcgtaatctc tctctcactg atgttattcc cccatctgcc gtcttggttc atctcaccac 4860agaagggcat ttagtcctac ccagccatcg gctgcggtat gacagcagga tggcacttcc 4920catttctctg tggttagtgc tcgagtgaaa acctctttca gctgagtcct ctgaggttct 4980gctgttgagt cctgggtggc tgatggaatg attgaggagg tctggtcacc ctcaagcgcc 5040gtcatcgcct tgtttccatg ggcttct 5067<210>79<211>950<212>DNA<213>人(Homo sapiens)<400>79tcgaccggat ccgaattccc attgtgcact aaagcgtctc cctgctccgc ggcccgggct 60ggcgggcggg cgctcggctg gcggctgcag cagcagaggg agacccgcgg caaccccggc 120aacccagggc tcggcgtcgc tgccaccatg acgggaagca atatgtcgga cgccttggcc 180aacgccgtgt gccagcgctg ccaggcccgc ttctcccccg ccgagcgcat tgtcaacagc 240aatggggagc tgtaccatga gcactgcttc gtgtgtgccc agtgcttccg gcccttcccc 300gaggggctct tctatgagtt tgaaggccgg aagtactgcg aacacgactt ccaaatgctg 360tttgctccgt gctgtggatc ctgcggtgag ttcatcattg gccgcgtcat caaggccatg 420aacaacaact ggcacccggg ctgcttccgc tgcgagctgt gtgatgtgga gctggctgac 480ctgggctttg tgaagaatgc cggcaggcat ctctgccggc cttgccacaa ccgtgagaag 540gccaagggcc tgggcaagta catctgccag cggtgccacc tggtcatcga cgagcagccc 600ctcatgttca ggagcgacgc ctaccaccct gaccacttca actgcaccca ctgtgggaag 660gagctgacag ccgaggcccg cgagctgaag ggtgagctct actgcctgcc ctgccatgac 720aagatgggcg ttcccatctg cggggcctgc cgccggccca tcgagggccg agtggtcaac 780gcgctgggca agcagtggca cgtggagcac tttgtctgtg ccaagtgtga gaagccattc 840ctggggcacc ggcactatga gaagaagggc ctggcctact gcgagcttta gtgcacaatg 900ggcggccgcc tcgagtctag actcgagtag ataattgagc ggaatttctt 950<210>80<211>2346<212>DNA<213>人(Homo sapiens)<400>80ccgccgtcgc ccccgcctcc ccctgcctca gcggctgccc ccgccagcgg gccgcccgct 60cccccgggcc ttgcagcggg ccccggcccg gctggagggg ccccgacccc agctctggtg 120gcgggcagca gcgccgcggc ccccttccct cacggggact cggccctgaa cgagcaggag 180aaggagttgc agcggcggct gaagcgtctc tacccggccg tggacgaaca agagacgccg 240ctgcctcggt cctggagccc gaaggacaag ttcagctaca tcggcctctc tcagaacaac 300ctgcgggtgc actacaaagg tcatggcaaa accccaaaag atgccgcgtc agttcgagcc 360acgcatccaa taccagcagc ctgtgggatt tattattttg aagtaaaaat tgtcagtaag 420ggaagagatg gttacatggg aattggtctt tctgctcaag gtgtgaacat gaatagacta 480ccaggttggg ataagcattc atatggttac catggggatg atggacattc gttttgttct 540tctggaactg gacaacctta tggaccaact ttcactactg gtgatgtcat tggctgttgt 600gttaatctta tcaacaatac ctgcttttac accaagaatg gacatagttt aggtattgct 660ttcactgacc taccgccaaa tttgtatcct actgtggggc ttcaaacacc aggagaagtg 720gtcgatgcca attttgggca acatcctttc gtgtttgata tagaagacta tatgcgggag 780tggagaacca aaatccaggc acagatagat cgatttccta tcggagatcg agaaggagaa 840tggcagacca tgatacaaaa aatggtttca tcttatttag tccaccatgg gtactgtgcc 900acagcagagg cctttgccag atctacagac cagaccgttc tagaagaatt agcttccatt 960aagaatagac aaagaattca gaaattggta ttagcaggaa gaatgggaga agccattgaa 1020acaacacaac agttataccc aagtttactt gaaagaaatc ctaatctcct tttcacatta 1080aaagtgcgtc agtttataga aatggtgaat ggtacagata gtgaagtacg atgtttggga 1140ggccgaagtc caaagtctca agacagttat cctgttagtc ctcgaccttt tagtagtcca 1200agtatgagcc ccagccatgg aatgaatatc cacaatttag catcaggcaa aggaagcacc 1260gcacattttt caggttttga aagttgtagt aatggtgtaa tatcaaataa agcacatcaa 1320tcatattgcc atagtaataa acaccagtca tccaacttga atgtaccaga actaaacagt 1380ataaatatgt caagatcaca gcaagttaat aacttcacca gtaatgatgt agacatggaa 1440acagatcact actccaatgg agttggagaa acttcatcca atggtttcct aaatggtagc 1500tctaaacatg accacgaaat ggaagattgt gacaccgaaa tggaagttga ttcaagtcag 1560ttgagacgcc agttgtgtgg aggaagtcag gccgccatag aaagaatgat ccactttgga 1620cgagagctgc aagcaatgag tgaacagcta aggagagact gtggcaagaa cactgcaaac 1680aaaaaaatgt tgaaggatgc attcagtcta ctagcatatt cagatccctg gaacagccca 1740gttggaaatc agcttgaccc gattcagaga gaacctgtgt gctcagctct taacagtgca 1800atattagaaa cccacaatct gccaaagcaa cctccacttg ccctagcaat gggacaggcc 1860acacaatgtc taggactgat ggctcgatca ggaattggat cctgcgcatt tgccacagtg 1920gaagactacc tacattagct atgcatttca agagctcaca cttatattgt ggcatatagt 1980caacatggaa gtagaccagc tctgctgatt tgaaatttag attttttaaa ttatgtactg 2040gggacaggtt tttgtcgctt tacattgctt cctagtttac agcatgatgc aaatgatttt 2100ctaacttagt gttaggagaa attattttcc atctttaacc tcttagttgt ctaagagtta 2160aatattactg aatttcagac gttcaaattg atcatcacaa atcctttaaa acaattacct 2220aaaagaaacc aaaaatcctg ccttctttgt gggggagggg ggagagaggg gaaggaaatg 2280gaacaagttg tgtttgtgtt agcatgtggg tgatgtaaac ttcaaattgg gagatgttcc 2340gacccc 2346<210>81<211>2512<212>DNA<213>人(Homo sapiens)<400>81caatgcactg acggatatga gtgggatcct gtgagacagc aatgcaaaga tattgatgaa 60tgtgacattg tcccagacgc ttgtaaaggt ggaatgaagt gtgtcaacca ctatggagga 120tacctctgcc ttccgaaaac agcccagatt attgtcaata atgaacagcc tcagcaggaa 180acacaaccag cagaaggaac ctcaggggca accaccgggg ttgtagctgc cagcagcatg 240gcaaccagtg gagtgttgcc cgggggtggt tttgtggcca gtgctgctgc agtcgcaggc 300cctgaaatgc agactggccg aaataacttt gtcatccggc ggaacccagc tgaccctcag 360cgcattccct ccaacccttc ccaccgtatc cagtgtgcag caggctacga gcaaagtgaa 420cacaacgtgt gccaagacat agacgagtgc actgcaggga cgcacaactg tagagcagac 480caagtgtgca tcaatttacg gggatccttt gcatgtcagt gccctcctgg atatcagaag 540cgaggggagc agtgcgtaga catagatgaa tgtaccatcc ctccatattg ccaccaaaga 600tgcgtgaata caccaggctc attttattgc cagtgcagtc ctgggtttca attggcagca 660aacaactata cctgcgtaga tataaatgaa tgtgatgcca gcaatcaatg tgctcagcag 720tgctacaaca ttcttggttc attcatctgt cagtgcaatc aaggatatga gctaagcagt 780gacaggctca actgtgaaga cattgatgaa tgcagaacct caagctacct gtgtcaatat 840caatgtgtca atgaacctgg gaaattctca tgtatgtgcc cccagggata ccaagtggtg 900agaagtagaa catgtcaaga tataaatgag tgtgagacca caaatgaatg ccgggaggat 960gaaatgtgtt ggaattatca tggcggcttc cgttgttatc cacgaaatcc ttgtcaagat 1020ccctacattc taacaccaga gaaccgatgt gtttgcccag tctcaaatgc catgtgccga 1080gaactgcccc agtcaatagt ctacaaatac atgagcatcc gatctgatag gtctgtgcca 1140tcagacatct tccagataca ggccacaact atttatgcca acaccatcaa tacttttcgg 1200attaaatctg gaaatgaaaa tggagagttc tacctacgac aaacaagtcc tgtaagtgca 1260atgcttgtgc tcgtgaagtc attatcagga ccaagagaac atatcgtgga cctggagatg 1320ctgacagtca gcagtatagg gaccttccgc acaagctctg tgttaagatt gacaataata 1380gtggggccat tttcatttta gtcttttcta agagtcaacc acaggcattt aagtcagcca 1440aagaatattg ttaccttaaa gcactatttt atttatagat atatctagtg catctacatc 1500tctatactgt acactcaccc ataacaaaca attacaccat ggtataaagt gggcatttaa 1560tatgtaaaga ttcaaagttt gtctttatta ctatatgtaa attagacatt aatccactaa 1620actggtcttc ttcaagagag ctaagtatac actatctggt gaaacttgga ttctttccta 1680taaaagtggg accaagcaat gatgatcttc tgtggtgctt aaggaaactt actagagctc 1740cactaacagt ctcataagga ggcagccatc ataaccattg aatagcatgc aagggtaaga 1800atgagttttt aactgctttg taagaaaatg gaaaaggtca ataaagatat atttctttag 1860aaaatgggga tctgccatat ttgtgttggt ttttattttc atatccagcc taaaggtggt 1920tgtttattat atagtaataa atcattgctg tacaacatgc tggtttctgt agggtatttt 1980taattttgtc agaaatttta gattgtgaat attttgtaaa aaacagtaag caaaattttc 2040cagaattccc aaaatgaacc agataccccc tagaaaatta tactattgag aaatctatgg 2100ggaggatatg agaaaataaa ttccttctaa accacattgg aactgacctg aagaagcaaa 2160ctcggaaaat ataataacat ccctgaattc aggcattcac aagatgcaga acaaaatgga 2220taaaaggtat ttcactggag aagttttaat ttctaagtaa aatttaaatc ctaacacttc 2280actaatttat aactaaaatt tctcatcttc gtacttgatg ctcacagagg aagaaaatga 2340tgatggtttt tattcctggc atccagagtg acagtgaact taagcaaatt accctcctac 2400ccaattctat ggaatatttt atacgtctcc ttgtttaaaa tctgactgct ttactttgat 2460gtatcatatt tttaaataaa aataaatatt cctttagaag atcactctaa aa 2512<210>82<211>2306<212>DNA<213>人(Homo sapiens)<400>82gggcgggagc tgcacgcgcc gtggctccgg atctcttcgt ctttgcagcg tacgcccgag 60tcggtcagcg ccggaggacc tcagcagcca tgtcgaagcc ccatagtgaa gccgggactg 120ccttcattca gacccagcag ctgcacgcag ccatggctga cacattcctg gagcacatgt 180gccgcctgga cattgattca ccacccatca cagcccggaa cactggcatc atctgtacca 240ttggcccagc ttcccgatca gtggagacgt tgaaggagat gattaagtct ggaatgaatg 300tggctcgtct gaacttctct catggaactc atgagtacca tgcggagacc atcaagaatg 360tgcgcacagc cacggaaagc tttgcttctg acccctacct ctaccggccc gttgctgtgg 420ctctagacac taaaggacct gagatccgaa ctgggctcat caagggcagc ggcactgcag 480agctggagct gaagaaggga gccactctca aaatcacgct ggataacgcc tacatggaaa 540agtgtgacga gaacatcctg tggctggact acaagaacat ctgcaaggtg gtggaagtgg 600gcagcaagat ctacgtggat gatgggctta tttctctcca ggtgaagcag aaaggtgccg 660acttcctggt gacggaggtg gaaaatggtg gctccttggg cagcaagaag ggtgtgaacc 720ttcctggggc tgctgtggac ttgcctgctg tgtcggagaa ggacatccag gatctgaagt 780ttggggtcga gcaggatgtt gatatggtgt ttgcgtcatt catccgcaag gcatctgatg 840tccatgaagt taggaaggtc ctgggagaga agggaaagaa catcaagatt atcagcaaaa 900tcgagaatca tgagggggtt cggaggtttg atgaaatcct ggaggccagt gatgggatca 960tggtggctcg tggtgatcta ggcattgaga ttcctgcaga gaaggtcttc cttgctcaga 1020agatgatgat tggacggtgc aaccgagctg ggaagcctgt catctgtgct actcagatgc 1080tggagagcat gatcaagaag ccccgcccca ctcgggctga aggcagtgat gtggccaatg 1140cagtcctgga tggagccgac tgcatcatgc tgtctggaga aacagccaaa ggggactatc 1200ctctggaggc tgtgcgcatg cagcacctga ttgcccgtga ggcagaggct gccatctacc 1260acttgcaatt atttgaggaa ctccgccgcc tggcgcccat taccagcgac cccacagaag 1320ccaccgccgt gggtgccgtg gaggcctcct tcaagtgctg cagtggggcc ataatcgtcc 1380tcaccaagtc tggcaggtct gctcaccagg tggccagata ccgcccacgt gcccccatca 1440ttgctgtgac ccggaatccc cagacagctc gtcaggccca cctgtaccgt ggcatcttcc 1500ctgtgctgtg caaggaccca gtccaggagg cctgggctga ggacgtggac ctccgggtga 1560actttgccat gaatgttggc aaggcccgag gcttcttcaa gaagggagat gtggtcattg 1620tgctgaccgg atggcgccct ggctccggct tcaccaacac catgcgtgtt gttcctgtgc 1680cgtgatggac cccagagccc ctcctccagc ccctgtccca cccccttccc ccagcccatc 1740cattaggcca gcaacgcttg tagaactcac tctgggctgt aacgtggcac tggtaggttg 1800ggacaccagg gaagaagatc aacgcctcac tgaaacatgg ctgtgtttgc agcctgctct 1860agtgggacag cccagagcct ggctgcccat catgtggccc cacccaatca agggaagaag 1920gaggaatgct ggactggagg cccctggagc cagatggcaa gagggtgaca gcttcctttc 1980ctgtgtgtac tctgtccagt tcctttagaa aaaatggatg cccagaggac tcccaaccct 2040ggcttggggt caagaaacag ccagcaagag ttaggggcct tagggcactg ggctgttgtt 2100ccattgaagc cgactctggc cctggccctt acttgcttct ctagctctct aggcctctcc 2160agtttgcacc tgtccccacc ctccactcag ctgtcctgca gcaaacactc caccctccac 2220cttccatttt cccccactac tgcagcacct ccaggcctgt tgctatagag cctacctgta 2280tgtcaataaa caacagctga agcacc 2306<210>83<211>2656<212>DNA<213>人(Homo sapiens)<400>83gaattcgcgg ccgcagagtc cccgggccaa gatggctgcg cggtgctcca cacgctggtt 60gctggtggtt gtggggaccc cgcggctgcc ggctatatcg ggtagagggg cccggccgcc 120cagggagggc gtggtggggg catggctgag ccgcaagctg agcgtccccg cctttgcgtc 180ttccctgacc tcttgcggcc cccgagcgct gctgacattg agacctggtg tcagccttac 240aggaacaaaa cataaccctt tcatttgtac tgcctccttc cacacgagtg cccctttggc 300caaagaagat tattatcaga tattaggagt gcctcgaaat gccagccaga aagagatcaa 360gaaagcctat tatcagcttg ccaagaagta tcaccctgac acaaataagg atgatcccaa 420agccaaggag aagttctccc agctggcaga agcctatgag gttttgagtg atgaggtgaa 480gaggaagcag tacgatgcct acggctctgc aggcttcgat cctggggcca gcggctccca 540gcatagctac tggaagggag gccccactgt ggaccccgag gagctgttca ggaagatctt 600tggcgagttc tcatcctctt catttggaga tttccagacc gtgtttgatc agcctcagga 660atacttcatg gagttgacat tcaatcaagc tgcaaagggg gtcaacaagg agttcaccgt 720gaacatcatg gacacgtgtg agcgctgcaa cggcaagggg aacgagcccg gcaccaaggt 780gcagcattgc cactactgtg gcggctccgg catggaaacc atcaacacag gcccttttgt 840gatgcgttcc acgtgtagga gatgtggtgg ccgcggctcc atcatcatat cgccctgtgt 900ggtctgcagg ggagcaggac aagccaagca gaaaaagcga gtgatgatcc ctgtgcctgc 960aggagtcgag gatggccaga ccgtgaggat gcctgtggga aaaagggaaa ttttcattac 1020gttcagggtg cagaaaagcc ctgtgttccg gagggacggc gcagacatcc actccgacct 1080ctttatttct atagctcagg ctcttcttgg gggaacagcc agagcccagg gcctgtacga 1140gacgatcaac gtgacgatcc cccctgggac tcagacagac cagaagattc ggatgggtgg 1200gaaaggcatc ccccggatta acagctacgg ctacggagac cactacatcc acatcaagat 1260acgagttcca aagaggctaa cgagccggca gcagagcctg atcctgagct acgccgagga 1320cgagacagat gtggagggga cggtgaacgg cgtcaccctc accagctctg gtggcagcac 1380catggatagc tccgcaggaa gcaaggctag gcgtgaggct ggggaggacg aggagggatt 1440cctttccaaa cttaagaaaa tgtttacctc atgatatccc agccgaggaa aaagatccac 1500tggaaactag gccgggaagc agcagcccct ccaagggcca gggcacctgg gagacgggag 1560gattccagaa cagcagcact gagctcccac ccgcagagcc tctggacggc cttggcaaca 1620gcaaaatcat gggacaacac ctctctccac ggaaaggtca cagtggacag cccgggcagt 1680aggatgcagc cccagaggct ggtggcagtt tcctgtccat tggtaggtga cggccccctg 1740gtcagcagag gagaggttag atcttgcagg ctaaaactct aatttggaat tgaatattgt 1800ggatatctta gttaaaggcc atgcttacag cttagaaatg aagccttaag ctgcatcaag 1860ttacgaagtg attaatttcc ttctcagcaa acctccggga ggttccagaa tgagttcttc 1920ctgacaggtt gtcttcactg ggagcgtggg gcccccaggc cccaccagca ccgtcctccc 1980ctaatgaggg gccctgccga ggcatcagct gctctgctca gttagttttt attcccgggg 2040taccaagcag ctgcacagtc ggtgcctggg aagcacgtta aaggcccaga gagatcctgg 2100gggttctgct ctgaccgtgt gggtggtgat ccttgtcagg atgtacagtc cttgctccca 2160ccccatccgg gatggccgcc tgtccctgac tattgagtcc tgttgttgta agccaggcat 2220ggagggctcc tgcccttctg ctgagccaca gcccattgca gcactgtgct ggccagactt 2280cagctgcctt gggaactgaa gccctgccac tgttgctagt caggggcttg gttctcccac 2340ttacactgtt gacatctatt ttctgaagtg tgtttaaatt attcagtgct aatcattgtt 2400ttttcctttg taaatgttga ttcagaaaag gaaagcacag gctaagcagt tgaaggttcc 2460ccaccattca gtgagagcag aacccccatt ccccagcctc tgctggtagc atgtcgcagt 2520ttccatgtgt ttcaggatct tcgggctgtc gttagacagg ttaatgaaga acacttctca 2580acagtttcct ttttgttttc ctttataatt cactaaaata aagcatctat tagtgtctga 2640aaaaaaaaaa aaaaaa 2656<210>84<211>2217<212>DNA<213>人(Homo sapiens)<400>84gcggacccgg cgccgaggcg gccacccgag acgcggcgcg cacgctccgg cctgcgcagc 60ccggcccggc catggcggcc ccccgcccgt ctcccgcgat ctccgtttcg gtctcggctc 120cggcttttta cgccccgcag aagaagttcg gccctgtggt ggccccaaag cccaaagtga 180atcccttccg gcccggggac agcgagcctc ccccggcacc cggggcccag cgcgcacaga 240tgggccgggt gggcgagatt cccccgccgc ccccggaaga ctttcccctg cctccacctc 300cccttgctgg ggatggcgac gatgcagagg gtgctctggg aggtgccttc ccgccgcccc 360ctcccccgat cgaggaatca tttccccctg cgcctctgga ggaggagatc ttcccttccc 420cgccgcctcc tccggaggag gagggagggc ctgaggcccc cataccgccc ccaccacagc 480ccagggagaa ggtgagcagt attgatttgg agatcgactc tctgtcctca ctgctggatg 540acatgaccaa gaatgatcct ttcaaagccc gggtgtcatc tggatatgtg cccccaccag 600tggccactcc attcagttcc aagtccagta ccaagcctgc agccgggggc acagcacccc 660tgcctccttg gaagtcccct tccagctccc agcctctgcc ccaggttccg gctccggctc 720agagccagac acagttccat gttcagcccc agccccagcc caagcctcag gtccaactcc 780atgtccagtc ccagacccag cctgtgtctt tggctaacac ccagccccga gggcccccag 840cctcatctcc ggctccagcc cctaagtttt ctccagtgac tcctaagttt actcctgtgg 900cttccaagtt cagtcctgga gccccaggtg gatctgggtc acaaccaaat caaaaattgg 960ggcaccccga agctctttct gctggcacag gctcccctca acctcccagc ttcacctatg 1020cccagcagag ggagaagccc cgagtgcagg agaagcagca ccccgtgccc ccaccggctc 1080agaaccaaaa ccaggtgcgc tcccctgggg ccccagggcc cctgactctg aaggaggtgg 1140aggagctgga gcagctgacc cagcagctaa tgcaggacat ggagcatcct cagaggcaga 1200atgtggctgt caacgaactc tgcggccgat gccatcaacc cctggcccgg gcgcagccag 1260ccgtccgcgc tctagggcag ctgttccaca tcgcctgctt cacctgccac cagtgtgcgc 1320agcagctcca gggccagcag ttctacagtc tggagggggc gccgtactgc gagggctgtt 1380acactgacac cctggagaag tgtaacacct gcggggagcc catcactgac cgcatgctga 1440gggccacggg caaggcctat cacccgcact gcttcacctg tgtggtctgc gcccgccccc 1500tggagggcac ctccttcatc gtggaccagg ccaaccggcc ccactgtgtc cccgactacc 1560acaagcagta cgccccgagg tgctccgtct gctctgagcc catcatgcct gagcctggcc 1620gagatgagac tgtgcgagtg gtcgccctgg acaagaactt ccacatgaag tgttacaagt 1680gtgaggactg cgggaagccc ctgtcgattg aggcagatga caatggctgc ttccccctgg 1740acggtcacgt gctctgtcgg aagtgccaca ctgctagagc ccagacctga gtgaggacag 1800gccctcttca gaccgcagtc catgccccat tgtggaccac ccacactgag accacctgcc 1860cccacctcag ttattgtttt gatgtctagc ccctcccatt tccaacccct ccctagcatc 1920ccaggtgccc tgacccagga cccaacatgg tctagggatg caggatcccc gccctggggt 1980ctggtcctcg cccatcctgc agggattgcc caccgtcttc cagacacccc acctgagggg 2040ggcaccaggt ttagtgctgc tgctttcact gctgcacccg cgccctcggc cggccccccg 2100agcagccttt gtactctgct tgcggagggc tgggagaccc tccaggacat tcccaccctc 2160ccccatgctg ccaagttgta gctatagcta caaataaaaa aaaaccttgt tttccag 2217<210>85<211>8906<212>DNA<213>人(Homo sapiens)<400>85gaggcggcca aggacctggc cgacatcgcg gccttcttcc gatccgggtt tcgaaaaaac 60gatgaaatga aagctatgga tgttttacca attttgaagg aaaaagttgc atacctttca 120ggtgggagag ataaacgtgg aggtcccatt ttaacgtttc cggcccgcag caatcatgac 180agaatacgac aggaggatct caggagactc atttcctatc tagcctgtat tcccagcgag 240gaggtctgca agcgtggctt cacggtgatc gtggacatgc gtgggtccaa gtgggactcc 300atcaagcccc ttctgaagat cctgcaggag tccttcccct gctgcatcca tgtggccctg 360atcatcaagc cagacaactt ctggcagaaa cagaggacta attttggcag ttctaaattt 420gaatttgaga caaatatggt ctctttagaa ggccttacca aagtagttga tccttctcag 480ctaactcctg agtttgatgg ctgcctggaa tacaaccacg aagaatggat tgaaatcaga 540gttgcttttg aagactacat tagcaatgcc acccacatgc tgtctcggct ggaggaactt 600caggacatcc tagctaagaa ggagctgcct caggatttag agggggctcg gaatatgatc 660gaggaacatt ctcagctgaa gaagaaggtg attaaggccc ccatcgagga cctggatttg 720gagggacaga agctgcttca gaggatacag agcagtgaaa gctttcccaa aaagaactca 780ggctcaggca atgcggacct gcagaacctc ttgcccaagg tgtccaccat gctggaccgg 840ctgcactcga cacggcagca tctgcaccag atgtggcatg tgaggaagct gaagctggac 900cagtgcttcc agctgaggct gtttgaacag gatgctgaga agatgtttga ctggatcaca 960cacaacaaag gcctgtttct aaacagctac acagagattg ggaccagcca ccctcatgcc 1020atggagcttc agacgcagca caatcacttt gccatgaact gtatgaacgt gtatgtaaat 1080ataaaccgca tcatgtcggt ggccaatcgt ctggtggagt ctggccacta tgcctcgcag 1140cagatcaggc agatcgcgag tcagctggag caggagtgga aggcgtttgc ggcagccctg 1200gatgagcgga gcaccttgct ggacatgtcc tccattttcc accagaaggc cgaaaagtat 1260atgagcaacg tggattcatg gtgtaaagct tgcggtgagg tagaccttcc ctcagagctg 1320caggacctag aagatgccat tcatcaccac cagggaatat atgaacatat cactcttgct 1380tattctgagg tcagccaaga tgggaagtcg ctccttgaca agctccagcg gcccttgact 1440cccggcagct ccgattccct gacagcctct gccaactact ccaaggccgt gcaccatgtc 1500ctggatgtca tccacgaggt gctgcaccac cagcggcacg tgagaacaat ctggcaacac 1560cgcaaggtcc ggctgcatca gaggctgcag ctgtgtgttt tccagcagga agttcagcag 1620gtgctagact ggatcgagaa ccacggagaa gcatttctga gcaaacatac aggtgtgggg 1680aaatctcttc atcgggccag agcattgcag aaacgtcatg aagattttga agaagtggca 1740cagaacacat acaccaatgc ggataaatta ctggaagcag cagaacagct ggctcagact 1800ggggaatgtg accccgaaga gatttatcag gctgcccatc agctggaaga ccggattcaa 1860gatttcgttc ggcgtgttga gcagcgaaag atcctactgg acatgtcagt gtcctttcac 1920acccatgtga aagagctgtg gacgtggctg gaggagctgc agaaggagct gctggacgac 1980gtgtatgccg agtcggtgga ggccgtgcag gacctcatca agcgctttgg ccagcagcag 2040cagaccaccc tgcaggtgac tgtcaacgtg atcaaggaag gggaggacct catccagcag 2100ctcagggact ctgccatctc cagtaacaag accccccaca acagctccat caaccacatt 2160gagacggtgc tgcagcagct ggacgaggcg cagtcgcaga tggaggagct cttccaggag 2220cgcaagatca agctggagct cttcctgcac gtgcgcatct tcgagaggga cgccatcgac 2280attatctcag acctcgagtc ttggaatgat gagctttctc agcaaatgaa tgacttcgac 2340acagaagatc tcacgattgc agagcagcgc ctccagcacc atgcagacaa agccttgacc 2400atgaacaact tgacttttga cgtcatccac caagggcaag atcttctgca gtatgtcaat 2460gaggtccagg cctctggtgt ggagctgctg tgtgatagag atgtagacat ggcaactcgg 2520gtccaggacc tgctggagtt tcttcatgaa aaacagcagg aattggattt agccgcagag 2580cagcatcgga aacacctgga gcagtgcgtg cagctgcgcc acctgcaggc agaagtgaaa 2640caggtgctgg gttggatccg caacggagag tccatgttaa atgccggact tatcacagcc 2700agctcgttac aagaggcaga gcagctccag cgagagcacg agcagttcca gcatgccatt 2760gagaaaacac atcagagcgc gctgcaggtg cagcagaagg cagaagccat gctacaggcc 2820aaccactacg acatggacat gatccgggac tgcgccgaga aggtggcgtc tcactggcaa 2880cagctcatgc tcaagatgga agatcgcctc aagctcgtca acgcctctgt cgctttctac 2940aaaacctcag agcaggtctg cagcgtcctc gagagcctgg aacaggagta caagagagaa 3000gaagactggt gtggcggggc ggataagctg ggcccaaact ctgagacgga ccacgtgacg 3060cccatgatca gcaagcacct ggagcagaag gaggcattcc tgaaggcttg cacccttgct 3120cggaggaatg cagacgtctt cctgaaatac ctgcacagga acagcgtgaa catgccagga 3180atggtgacgc acatcaaagc tcctgaacag caagtgaaaa atatcttgaa tgaactcttc 3240caacgggaga acagggtatt gcattactgg accatgagga agagacggct ggaccagtgt 3300cagcagtacg tggtctttga gaggagtgcc aagcaggctt tggaatggat ccatgacaat 3360ggcgagttct acctttccac acacacctcc acgggctcca gtatacagca cacccaggag 3420ctcctgaaag agcacgagga gttccagata actgcaaagc aaaccaaaga gagagtgaag 3480ctattgatac agctggctga tggcttttgt gaaaaagggc atgcccatgc ggcagagata 3540aaaaaatgtg ttactgctgt ggataagagg tacagagatt tctctctgcg gatggagaag 3600tacaggacct ctttggaaaa agccctgggg atttcttcag attccaacaa atcgagtaaa 3660agtctccagc tagatatcat tccagccagt atccctggct cagaggtgaa acttcgagat 3720gctgctcatg aacttaatga agagaagcgg aaatctgccc gcaggaaaga gttcataatg 3780gctgagctca ttcaaactga aaaggcttat gtaagagacc tccgggaatg tatggatacg 3840tacctgtggg aaatgaccag tggcgtggaa gagattccac ctggcattgt aaacaaagaa 3900ctcatcatct tcggaaacat gcaagaaatc tacgaatttc ataataacat attcctaaag 3960gagctggaaa aatatgaaca gttgccagag gatgttggac attgttttgt tacttgggca 4020gacaagtttc agatgtatgt cacatattgc aaaaataagc ctgattctac tcagctgata 4080ttggaacatg cagggtccta ttttgacgag atacagcagc gacatggatt agccaattcc 4140atttcttcct accttattaa accagttcag cgaataacga aatatcagct ccttttaaaa 4200gagctgctga cgtgctgtga ggaaggaaag ggagagatta aagatggcct ggaggtgatg 4260ctcagcgtgc cgaagcgagc caatgacgcc atgcacctca gcatgctgga agggtttgat 4320gaaaacattg agtctcaggg agaactcatc ctacaggaat ccttccaagt gtgggaccca 4380aaaaccttaa ttcgaaaggg tcgagaacgg catctcttcc tttttgaaat gtccttagta 4440tttagtaaag aagtgaaaga ttccagtggg agaagcaagt acctttataa aagcaaattg 4500tttacctcag agttgggtgt cacagaacat gttgaaggag acccttgcaa atttgcactg 4560tgggtgggga gaacaccaac ttcagataat aaaattgtcc ttaaggcttc cagcatagag 4620aacaagcagg actggataaa gcatatccgc gaagtcatcc aggagcggac gatccacctg 4680aagggagccc tgaaggagcc cattcacatc cctaagaccg ctcccgccac aagacagaag 4740ggaaggaggg atggagagga tctggacagc caaggagacg gcagcagcca gcctgatacg 4800atttccatcg cctcacggac gtctcagaac acgctggaca gcgataagct ctctggtggc 4860tgtgagctga cagtggtgat ccatgacttc accgcttgca acagcaacga gctgaccatc 4920cgacggggcc agaccgtgga agttctggag cggccgcatg acaagcctga ctggtgtctg 4980gtgcggacca ctgaccgctc cccagcggca gaaggcctgg tcccctgtgg ttcactgtgc 5040atcgcccact ccagaagtag catggaaatg gagggcatct tcaaccacaa agactcgctc 5100tccgtctcca gcaatgacgc cagtccaccc gcatccgtgg cttccctcca gccccacatg 5160atcggggccc agagctcgcc gggccccaag cggccgggca acaccctgcg caagtggctc 5220accagccccg tgcggcggct cagcagcggc aaggccgacg ggcacgtgaa gaagctggcg 5280cacaagcaca agaagagccg cgaggtccgc aagagcgccg acgccggctc gcagaaggac 5340tccgacgaca gtgcggccac cccgcaggac gagacggtcg aggagagagg ccggaacgag 5400ggcctgagca gcggtactct ctccaaatcc tcctcctcgg ggatgcagag ctgtggagaa 5460gaggaaggcg aggagggggc cgacgccgtg cccctgccgc cacccatggc catccagcag 5520cacagcctcc tccagccaga ctcacaggat gacaaggcct cttctcggtt attagtccgc 5580cccaccagct ccgaaacacc gagtgcagcc gagctcgtca gtgcaattga ggaactcgtg 5640aaaagcaaga tggcactgga ggatcgcccc agctcactcc ttgttgacca gggagatagt 5700agcagccctt ccttcaaccc ttcggataat tcccttctct cttcctcctc gcccattgat 5760gagatggaag aaaggaaatc cagctcttta aagagaagac actacgtttt gcaagaacta 5820gtggagacag agcgtgacta tgtgcgggac cttggctatg tggttgaggg ctacatggca 5880cttatgaaag aagatggtgt tcctgatgac atgaaaggaa aagacaaaat tgtgttcggc 5940aacatccatc agatttacga ctggcacaga gacttttttt taggagagtt agagaagtgc 6000cttgaagatc cagaaaaact aggatccctt tttgttaaac acgagagaag gttgcacatg 6060tacatagctt attgtcaaaa taaaccaaag tctgagcaca ttgtctcaga atacattgat 6120accttttttg aggacttaaa gcagcgtctt ggccacaggt tacagctcac agatctgttg 6180atcaaaccag tgcagagaat catgaagtat cagctgttac tgaaggactt cctcaagtat 6240tccaaaaagg ccagcctgga tacatcagaa ttagagagag ctgtggaagt catgtgcata 6300gtacccaggc ggtgcaacga catgatgaac gtggggcggc tgcaaggatt cgacgggaaa 6360atcgttgccc agggtaaact gctcttgcag gacacattct tggtcacaga ccaagatgca 6420ggacttctgc ctcgctgcag agagaggcgc atcttcctct ttgagcagat cgtcatattc 6480agcgaaccac ttgataaaaa gaagggcttc tccatgccgg gattcctgtt taagaacagt 6540atcaaggtga gttgcctttg cctggaggaa aatgtggaaa atgatccctg taaatttgct 6600ctgacatcga ggacgggtga cgtggtagag accttcattt tgcattcatc tagtccaagt 6660gtccggcaaa cttggatcca tgaaatcaac caaattttag aaaaccagcg caatttttta 6720aatgccttga catcgccaat cgagtaccag aggaaccaca gcgggggcgg cggcggcggc 6780ggcagcgggg cagcggcggg ggtgggggca gcggcggcgg cggggccccc agtggcggca 6840gcggccacag tggcggcccc agcagctgcg gcggcgcccc cagcacgagc aggagccggc 6900cctcccggat cccccagcct gtccgacacc acccccccgt gctggtctcc tctgcagcct 6960cgagccaggc agaggcagac aagatgtcag agtgaaagca gcagcagtag caacatctcc 7020accatgttgg tgacacacga ttacacggca gtgaaggagg atgagatcaa cgtctaccaa 7080ggagaggtcg ttcaaattct ggccagcaac cagcagaaca tgtttctggt gttccgagcc 7140gccactgacc agtgccccgc agctgagggc tggattccag gctttgtcct gggccacacc 7200agtgcagtca tcgtggagaa cccggacggg actctcaaga agtcaacatc ttggcacaca 7260gcactccgtt taaggaaaaa atctgagaaa aaagataaag acggcaaaag ggaaggcaag 7320ttagagaacg gttatcggaa gtcacgggaa ggactcagca acaaggtatc tgtgaagctt 7380ctcaatccca actacattta tgacgttccc ccagaattcg tcattccatt gagtgaggtc 7440acgtgtgaga caggggagac cgttgttctt agatgtcgag tctgtggccg ccccaaagcc 7500tcaattacct ggaagggccc tgaacacaac accttgaaca acgatggtca ctacagcatc 7560tcctacagtg acctgggaga ggccacgctg aagattgtgg gcgtgaccac ggaagatgac 7620ggcatctaca cgtgcatcgc tgtcaatgac atgggttcag cctcatcatc ggccagcctg 7680agggtcctag gtccagggat ggatgggatc atggtgacct ggaaagacaa ctttgactcc 7740ttctacagtg aagtggctga gcttggcagg ggcagattct ctgtcgttaa gaaatgtgat 7800cagaaaggaa ccaagcgagc agtggccact aagtttgtga acaagaagtt gatgaagcgc 7860gaccaggtca cccatgagct tggcatcctg cagagcctcc agcaccccct gcttgtcggc 7920ctcctcgaca cctttgagac ccccaccagc tacatcctgg tcttagaaat ggctgaccag 7980ggtcgcctcc tggactgcgt ggtgcgatgg ggaagcctca ctgaagggaa gatcagggcg 8040cacctggggg aggttctgga agctgtccgg tacctgcaca actgcaggat agcacacctg 8100gacctaaagc ctgagaatat cctggtggat gagagtttag ccaagccaac catcaaactg 8160gctgactttg gagatgctgt tcagctcaac acgacctact acatccacca gttactgggg 8220aaccctgaat tcgcagcccc tgaaatcatc ctcgggaacc ctgtctccct gacctcggat 8280acgtggagtg ttggagtgct cacatacgta cttcttagtg gcgtgtcccc cttcctggat 8340gacagtgtgg aagagacctg cctgaacatt tgccgcttag actttagctt cccagatgac 8400tactttaaag gagtgagcca gaaggccaag gagttcgtgt gcttcctcct gcaggaggac 8460cccgccaagc gtccctcggc tgcgctggcc ctccaggagc agtggctgca ggccggcaac 8520ggcagaagca cgggcgtcct cgacacgtcc agactgactt ccttcattga gcggcgcaaa 8580caccagaatg atgttcgacc tatccgtagc attaaaaact ttctgcagag caggcttctg 8640cctagagttt gacctatcca gaagttcttt ctcattctct ttcacctgcc aatcagctgt 8700taatctgaat tttcaagaga aaacaagcaa acataactga tcagctgccg gtatgttcat 8760cgtgtgaaat tgcattccaa gtgagctgtg ctcagcagtg cttggacaca gagctgcaag 8820ctgcgctggg gtggaggacc gtcacttaca ctctgccaag gacggaggtc gcattgctgt 8880atcacagtat tttttacgga tttctg 8906<210>86<211>1204<212>DNA<213>人(Homo sapiens)<400>86tcggcggcgg tggtatcggc ggcagctgtg agggggttcc gggaagatgg tgctgatcaa 60ggaattccgt gtggttttgc catgttctgt tcaggagtat caggttgggc agctttactc 120tgttgcagaa gctagtaaga atgagactgg tggtggagaa ggaattgaag tcttaaagaa 180tgaaccttat gagaaggatg gagaaaaggg acagtatacg cacaaaattt atcacctaaa 240gagcaaagtg cctgcattcg tgaggatgat tgctcccgag ggctccttgg tgtttcatga 300gaaagcctgg aatgcgtacc cctactgtag aacaattgta acgaatgaat atatgaaaga 360tgatttcttc attaaaatcg aaacatggca caaaccagac ttgggaacat tagaaaatgt 420acatggttta gatccaaaca catggaaaac tgttgaaatt gtccatatag atattgcaga 480tagaagtcaa gttgaaccag cagactacaa agctgatgaa gacccagcat tattccagtc 540agtcaagacc aagagaggcc ctttgggacc caactggaag aaggagctgg caaacagccc 600tgactgtccc cagatgtgtg cctataagct ggtgaccatc aaattcaagt ggtggggact 660gcaaagcaaa gtagaaaact tcattcaaaa gcaagaaaaa cggatattta caaacttcca 720tcgccagctt ttttgttgga ttgacaagtg gatcgatctc acgatggaag acattaggag 780aatggaagac gagactcaga aagaactaga aacaatgcgt aagaggggtt ccgttcgagg 840cacgtcggct gctgatgtct agatgagtcc cctgtagggt cagagacaat gtcaaactgt 900ttacgtaatc aaggtcaagt gaggggaaca agcgcagcca gtgatgagtg aacaacaatc 960tgaccagtat cttgcagtgt tgacgtttcc cagatgtgtg cttgtgatga tacacacaca 1020tgcacaggtt ctcaaccacg tgtgtatata tgtatgtgtg catatgtctg tagctgtata 1080taaagcgcat gtagagctac agatccagat acacacactt gtgtatatat gtacatacag 1140acatactgaa gggattagta caat ttctcc aaagtactgt acctatcttc agcaagaatg 1200caaa 1204<210>87<211>892<212>PRT<213>人(Homo sapiens)<400>87Met Asp His Tyr Asp Ser Gln Gln Thr Asn Asp Tyr Met Gln Pro Glu1 5 10 15Glu Asp Trp Asp Arg Asp Leu Leu Leu Asp Pro Ala Trp Glu Lys Gln
20 25 30Gln Arg Lys Thr Phe Thr Ala Trp Cys Asn Ser His Leu Arg Lys Ala
35 40 45Gly Thr Gln Ile Glu Asn Ile Glu Glu Asp Phe Arg Asp Gly Leu Lys
50 55 60Leu Met Leu Leu Leu Glu Val Ile Ser Gly Glu Arg Leu Ala Lys Pro65 70 75 80Glu Arg Gly Lys Met Arg Val His Lys Ile Ser Asn Val Asn Lys Ala
85 90 95Leu Asp Phe Ile Ala Ser Lys Gly Val Lys Leu Val Ser Ile Gly Ala
100 105 110Glu Glu Ile Val Asp Gly Asn Val Lys Met Thr Leu Gly Met Ile Trp
115 120 125Thr Ile Ile Leu Arg Phe Ala Ile Gln Asp Ile Ser Val Glu Glu Thr
130 135 140Ser Ala Lys Glu Gly Leu Leu Leu Trp Cys Gln Arg Lys Thr Ala Pro145 150 155 160Tyr Lys Asn Val Asn Ile Gln Asn Phe His Ile Ser Trp Lys Asp Gly
165 170 175Leu Gly Phe Cys Ala Leu Ile His Arg His Arg Pro Glu Leu Ile Asp
180 185 190Tyr Gly Lys Leu Arg Lys Asp Asp Pro Leu Thr Asn Leu Asn Thr Ala
195 200 205Phe Asp Val Ala Glu Lys Tyr Leu Asp Ile Pro Lys Met Leu Asp Ala
210 215 220Glu Asp Ile Val Gly Thr Ala Arg Pro Asp Glu Lys Ala Ile Met Thr225 230 235 240Tyr Val Ser Ser Phe Tyr His Ala Phe Ser Gly Ala Gln Lys Ala Glu
245 250 255Thr Ala Ala Asn Arg Ile Cys Lys Val Leu Ala Val Asn Gln Glu Asn
260 265 270Glu Gln Leu Met Glu Asp Tyr Glu Lys Leu Ala Ser Asp Leu Leu Glu
275 280 285Trp Ile Arg Arg Thr Ile Pro Trp Leu Glu Asn Arg Val Pro Glu Asn
290 295 300Thr Met His Ala Met Gln Gln Lys Leu Glu Asp Phe Arg Asp Tyr Arg305 310 315 320Arg Leu His Lys Pro Pro Lys Val Gln Glu Lys Cys Gln Leu Glu Ile
325 330 335Asn Phe Asn Thr Leu Gln Thr Lys Leu Arg Leu Ser Asn Arg Pro Ala
340 345 350Phe Met Pro Ser Glu Gly Arg Met Val Ser Asp Ile Asn Asn Ala Trp
355 360 365Gly Cys Leu Glu Gln Val Glu Lys Gly Tyr Glu Glu Trp Leu Leu Asn
370 375 380Glu Ile Arg Arg Leu Glu Arg Leu Asp His Leu Ala Glu Lys Phe Arg385 390 395 400Gln Lys Ala Ser Ile His Glu Ala Trp Thr Asp Gly Lys Glu Ala Met
405 410 415Leu Arg Gln Lys Asp Tyr Glu Thr Ala Thr Leu Ser Glu Ile Lys Ala
420 425 430Leu Leu Lys Lys His Glu Ala Phe Glu Ser Asp Leu Ala Ala His Gln
435 440 445Asp Arg Val Glu Gln Ile Ala Ala Ile Ala Gln Glu Leu Asn Glu Leu
450 455 460Asp Tyr Tyr Asp Ser Pro Ser Val Asn Ala Arg Cys Gln Lys Ile Cys465 470 475 480Asp Gln Trp Asp Asn Leu Gly Ala Leu Thr Gln Lys Arg Arg Glu Ala
485 490 495Leu Glu Arg Thr Glu Lys Leu Leu Glu Thr Ile Asp Gln Leu Tyr Leu
500 505 510Glu Tyr Ala Lys Arg Ala Ala Pro Phe Asn Asn Trp Met Glu Gly Ala
515 520 525Met Glu Asp Leu Gln Asp Thr Phe Ile Val His Thr Ile Glu Glu Ile
530 535 540Gln Gly Leu Thr Thr Ala His Glu Gln Phe Lys Ala Thr Leu Pro Asp545 550 555 560Ala Asp Lys Glu Arg Leu Ala Ile Leu Gly Ile His Asn Glu Val Ser
565 570 575Lys Ile Val Gln Thr Tyr His Val Asn Met Ala Gly Thr Asn Pro Tyr
580 585 590Thr Thr Ile Thr Pro Gln Glu Ile Asn Gly Lys Trp Asp His Val Arg
595 600 605Gln Leu Val Pro Arg Arg Asp Gln Ala Leu Thr Glu Glu His Ala Arg
610 615 620Gln Gln His Asn Glu Arg Leu Arg Lys Gln Phe Gly Ala Gln Ala Asn625 630 635 640Val Ile Gly Pro Trp Ile Gln Thr Lys Met Glu Glu Ile Gly Arg Ile
645 650 655Ser Ile Glu Met His Gly Thr Leu Glu Asp Gln Leu Ser His Leu Arg
660 665 670Gln Tyr Glu Lys Ser Ile Val Asn Tyr Lys Pro Lys Ile Asp Gln Leu
675 680 685Glu Gly Asp His Gln Leu Ile Gln Glu Ala Leu Ile Phe Asp Asn Lys
690 695 700His Thr Asn Tyr Thr Met Glu His Ile Arg Val Gly Trp Glu Gln Leu705 710 715 720Leu Thr Thr Ile Ala Arg Thr Ile Asn Glu Val Glu Asn Gln Ile Leu
725 730 735Thr Arg Asp Ala Lys Gly Ile Ser Gln Glu Gln Met Asn Glu Phe Arg
740 745 750Ala Ser Phe Asn His Phe Asp Arg Asp His Ser Gly Thr Leu Gly Pro
755 760 765Glu Glu Phe Lys Ala Cys Leu Ile Ser Leu Gly Tyr Asp Ile Gly Asn
770 775 780Asp Pro Gln Gly Glu Ala Glu Phe Ala Arg Ile Met Ser Ile Val Asp785 790 795 800Pro Asn Arg Leu Gly Val Val Thr Phe Gln Ala Phe Ile Asp Phe Met
805 810 815Ser Arg Glu Thr Ala Asp Thr Asp Thr Ala Asp Gln Val Met Ala Ser
820 825 830Phe Lys Ile Leu Ala Gly Asp Lys Asn Tyr Ile Thr Met Asp Glu Leu
835 840 845Arg Arg Glu Leu Pro Pro Asp Gln Ala Glu Tyr Cys Ile Ala Arg Met
850 855 860Ala Pro Tyr Thr Gly Pro Asp Ser Val Pro Gly Ala Leu Asp Tyr Met865 870 875 880Ser Phe Ser Thr Ala Leu Tyr Gly Glu Ser Asp Leu
885 890<210>88<211>197<212>PRT<213>人(Homo sapiens)<400>88Met Met Phe Pro Gln Ser Arg His Ser Gly Ser Ser His Leu Pro Gln1 5 10 15Gln Leu Lys Phe Thr Thr Ser Asp Ser Cys Asp Arg Ile Lys Asp Glu
20 25 30Phe Gln Leu Leu Gln Ala Gln Tyr His Ser Leu Lys Leu Glu Cys Asp
35 40 45Lys Leu Ala Ser Glu Lys Ser Glu Met Gln Arg His Tyr Val Met Tyr
50 55 60Tyr Glu Met Ser Tyr Gly Leu Asn Ile Glu Met His Lys Gln Ala Glu65 70 75 80Ile Val Lys Arg Leu Asn Gly Ile Cys Ala Gln Val Leu Pro Tyr Leu
85 90 95Ser Gln Glu His Gln Gln Gln Val Leu Gly Ala Ile Glu Arg Ala Lys
100 105 110Gln Val Thr Ala Pro Glu Leu Asn Ser Ile Ile Arg Gln Gln Leu Gln
115 120 125Ala His Gln Leu Ser Gln Leu Gln Ala Leu Ala Leu Pro Leu Thr Pro
130 135 140Leu Pro Val Gly Leu Gln Pro Pro Ser Leu Pro Ala Val Ser Ala Gly145 150 155 160Thr Gly Leu Leu Ser Leu Ser Ala Leu Gly Ser Gln Ala His Leu Ser
165 170 175Lys Glu Asp Lys Asn Gly His Asp Gly Asp Thr His Gln Glu Asp Asp
180 185 190Gly Glu Lys Ser Asp
195<210>89<211>739<212>PRT<213>人(Homo sapiens)<400>89Gly Asp Lys Glu Pro Thr Glu Thr Ile Gly Asp Leu Ser Ile Cys Leu1 5 10 15Asp Gly Leu Gln Leu Glu Ser Glu Val Val Thr Asn Gly Glu Thr Thr
20 25 30Cys Ser Glu Ser Ala Ser Gln Asn Asp Asp Gly Ser Arg Ser Lys Asp
35 40 45Glu Thr Arg Val Ser Thr Asn Gly Ser Asp Asp Pro Glu Asp Ala Gly
50 55 60Ala Gly Glu Asn Arg Arg Val Ser Gly Asn Asn Ser Pro Ser Leu Ser65 70 75 80Asn Gly Gly Phe Lys Pro Ser Arg Pro Pro Arg Pro Ser Arg Pro Pro
85 90 95Pro Pro Thr Pro Arg Arg Pro Ala Ser Val Asn Gly Ser Pro Ser Ala
100 105 110Thr Ser Glu Ser Asp Gly Ser Ser Thr Gly Ser Leu Pro Pro Thr Asn
115 120 125Thr Asn Thr Asn Thr Ser Glu Gly Ala Thr Ser Gly Leu Ile Ile Pro
130 135 140Leu Thr Ile Ser Gly Gly Ser Gly Pro Arg Pro Leu Asn Pro Val Thr145 150 155 160Gln Ala Pro Leu Pro Pro Gly Trp Glu Gln Arg Val Asp Gln His Gly
165 170 175Arg Val Tyr Tyr Val Asp His Val Glu Lys Arg Thr Thr Trp Asp Arg
180 185 190Pro Glu Pro Leu Pro Pro Gly Trp Glu Arg Arg Val Asp Asn Met Gly
195 200 205Arg Ile Tyr Tyr Val Asp His Phe Thr Arg Thr Thr Thr Trp Gln Arg
210 215 220Pro Thr Leu Glu Ser Val Arg Asn Tyr Glu Gln Trp Gln Leu Gln Arg225 230 235 240Ser Gln Leu Gln Gly Ala Met Gln Gln Phe Asn Gln Arg Phe Ile Tyr
245 250 255Gly Asn Gln Asp Leu Phe Ala Thr Ser Gln Ser Lys Glu Phe Asp Pro
260 265 270Leu Gly Pro Leu Pro Pro Gly Trp Glu Lys Arg Thr Asp Ser Asn Gly
275 280 285Arg Val Tyr Phe Val Asn His Asn Thr Arg Ile Thr Gln Trp Glu Asp
290 295 300Pro Arg Ser Gln Gly Gln Leu Asn Glu Lys Pro Leu Pro Glu Gly Trp305 310 315 320Glu Met Arg Phe Thr Val Asp Gly Ile Pro Tyr Phe Val Asp His Asn
325 330 335Arg Arg Thr Thr Thr Tyr Ile Asp Pro Arg Thr Gly Lys Ser Ala Leu
340 345 350Asp Asn Gly Pro Gln Ile Ala Tyr Val Arg Asp Phe Lys Ala Lys Val
355 360 365Gln Tyr Phe Arg Phe Trp Cys Gln Gln Leu Ala Met Pro Gln His Ile
370 375 380Lys Ile Thr Val Thr Arg Lys Thr Leu Phe Glu Asp Ser Phe Gln Gln385 390 395 400Ile Met Ser Phe Ser Pro Gln Asp Leu Arg Arg Arg Leu Trp Val Ile
405 410 415Phe Pro Gly Glu Glu Gly Leu Asp Tyr Gly Gly Val Ala Arg Glu Trp
420 425 430Phe Phe Leu Leu Ser His Glu Val Leu Asn Pro Met Tyr Cys Leu Phe
435 440 445Glu Tyr Ala Gly Lys Asp Asn Tyr Cys Leu Gln Ile Asn Pro Ala Ser
450 455 460Tyr Ile Asn Pro Asp His Leu Lys Tyr Phe Arg Phe Ile Gly Arg Phe465 470 475 480Ile Ala Met Ala Leu Phe His Gly Lys Phe Ile Asp Thr Gly Phe Ser
485 490 495Leu Pro Phe Tyr Lys Arg Ile Leu Asn Lys Pro Val Gly Leu Lys Asp
500 505 510Leu Glu Ser Ile Asp Pro Glu Phe Tyr Asn Ser Leu Ile Trp Val Lys
515 520 525Glu Asn Asn Ile Glu Glu Cys Asp Leu Glu Met Tyr Phe Ser Val Asp
530 535 540Lys Glu Ile Leu Gly Glu Ile Lys Ser His Asp Leu Lys Pro Asn Gly545 550 555 560Gly Asn Ile Leu Val Thr Glu Glu Asn Lys Glu Glu Tyr Ile Arg Met
565 570 575Val Ala Glu Trp Arg Leu Ser Arg Gly Val Glu Glu Gln Thr Gln Ala
580 585 590Phe Phe Glu Gly Phe Asn Glu Ile Leu Pro Gln Gln Tyr Leu Gln Tyr
595 600 605Phe Asp Ala Lys Glu Leu Glu Val Leu Leu Cys Gly Met Gln Glu Ile
610 615 620Asp Leu Asn Asp Trp Gln Arg His Ala Ile Tyr Arg His Tyr Ala Arg625 630 635 640Thr Ser Lys Gln Ile Met Trp Phe Trp Gln Phe Val Lys Glu Ile Asp
645 650 655Asn Glu Lys Arg Met Arg Leu Leu Gln Phe Val Thr Gly Thr Cys Arg
660 665 670Leu Pro Val Gly Gly Phe Ala Asp Leu Met Gly Ser Asn Gly Pro Gln
675 680 685Lys Phe Cys Ile Glu Lys Val Gly Lys Glu Asn Trp Leu Pro Arg Ser
690 695 700His Thr Cys Phe Asn Arg Leu Asp Leu Pro Pro Tyr Lys Ser Tyr Glu705 710 715 720Gln Leu Lys Glu Lys Leu Leu Phe Ala Ile Glu Glu Thr Glu Gly Phe
725 730 735Gly Gln Glu<210>90<211>431<212>PRT<213>人(Homo sapiens)<400>90Gly Pro Pro Pro Thr Arg Ala Leu Pro Leu Pro Gln Ser Leu Pro Pro1 5 10 15Asp Phe Arg Leu Glu Pro Thr Ala Pro Ala Leu Ser Pro Arg Ser Ser
20 25 30Phe Ala Ser Ser Ser Ala Ser Asp Ala Ser Lys Pro Ser Ser Pro Arg
35 40 45Gly Ser Leu Leu Leu Asp Gly Ala Gly Ala Gly Gly Ala Gly Gly Ser
50 55 60Arg Pro Cys Ser Asn Arg Thr Ser Gly Ile Ser Met Gly Tyr Asp Gln65 70 75 80Arg His Gly Ser Pro Leu Pro Ala Gly Pro Cys Leu Phe Gly Pro Pro
85 90 95Leu Ala Gly Ala Pro Ala Gly Tyr Ser Pro Gly Gly Val Pro Ser Ala
100 105 110Tyr Pro Glu Leu His Ala Ala Leu Asp Arg Leu Tyr Ala Gln Arg Pro
115 120 125Ala Gly Phe Gly Cys Gln Glu Ser Arg His Ser Tyr Pro Pro Ala Leu
130 135 140Gly Ser Pro Gly Ala Leu Ala Gly Ala Arg Val Gly Ala Ala Gly Pro145 150 155 160Leu Glu Arg Arg Gly Ala Gln Pro Gly Arg His Ser Val Thr Gly Tyr
165 170 175Gly Asp Cys Ala Val Gly Ala Arg Tyr Gln Asp Glu Leu Thr Ala Leu
180 185 190Leu Arg Leu Thr Val Gly Thr Gly Gly Arg Glu Ala Gly Ala Arg Gly
195 200 205Glu Pro Ser Gly Ile Glu Pro Ser Gly Leu Glu Glu Pro Pro Gly Pro
210 215 220Phe Val Pro Glu Ala Ala Arg Ala Arg Met Arg Glu Pro Glu Ala Arg225 230 235 240Glu Asp Tyr Phe Gly Thr Cys Ile Lys Cys Asn Lys Gly Ile Tyr Gly
245 250 255Gln Ser Asn Ala Cys Gln Ala Leu Asp Ser Leu Tyr His Thr Gln Cys
260 265 270Phe Val Cys Cys Ser Cys Gly Arg Thr Leu Arg Cys Lys Ala Phe Tyr
275 280 285Ser Val Asn Gly Ser Val Tyr Cys Glu Glu Asp Tyr Leu Phe Ser Gly
290 295 300Phe Gln Glu Ala Ala Glu Lys Cys Cys Val Cys Gly His Leu Ile Leu305 310 315 320Glu Lys Ile Leu Gln Ala Met Gly Lys Ser Tyr His Pro Gly Cys Phe
325 330 335Arg Cys Ile Val Cys Asn Lys Cys Leu Asp Gly Ile Pro Phe Thr Val
340 345 350Asp Phe Ser Asn Gln Val Tyr Cys Val Thr Asp Tyr His Lys Asn Tyr
355 360 365Ala Pro Lys Cys Ala Ala Cys Gly Gln Pro Ile Leu Pro Ser Glu Gly
370 375 380Cys Glu Asp Ile Val Arg Val Ile Ser Met Asp Arg Asp Tyr His Phe385 390 395 400Glu Cys Tyr His Cys Glu Asp Cys Arg Met Gln Leu Ser Asp Glu Glu
405 410 415Gly Cys Cys Cys Phe Pro Leu Asp Gly His Leu Leu Cys His Gly
420 425 430<210>91<211>900<212>PRT<213>人(Homo sapiens)<400>91Gly Pro Gly Ser Arg His His Arg Ala Arg Asp Arg Leu Ile His Phe1 5 10 15Gly Ala Val Ser Thr Asp Val Leu Gly Cys Ser Ala His Cys Ser Leu
20 25 30Thr Gln Ser Pro Lys Met Asn Ile Gln Glu Gln Gly Phe Pro Leu Asp
35 40 45Leu Gly Ala Ser Phe Thr Glu Asp Ala Pro Arg Pro Pro Val Pro Gly
50 55 60Glu Glu Gly Glu Leu Val Ser Thr Asp Pro Arg Pro Ala Ser Tyr Ser65 70 75 80Phe Cys Ser Gly Lys Gly Val Gly Ile Lys Gly Glu Thr Ser Thr Ala
85 90 95Thr Pro Arg Arg Ser Asp Leu Asp Leu Gly Tyr Glu Pro Glu Gly Ser
100 105 110Ala Ser Pro Thr Pro Pro Tyr Leu Lys Trp Ala Glu Ser Leu His Ser
115 120 125Leu Leu Asp Asp Gln Asp Gly Ile Ser Leu Phe Arg Thr Phe Leu Lys
130 135 140Gln Glu Gly Cys Ala Asp Leu Leu Asp Phe Trp Phe Ala Cys Thr Gly145 150 155 160Phe Arg Lys Leu Glu Pro Cys Asp Ser Asn Glu Glu Lys Arg Leu Lys
165 170 175Leu Ala Arg Ala Ile Tyr Arg Lys Tyr Ile Leu Asp Asn Asn Gly Ile
180 185 190Val Ser Arg Gln Thr Lys Pro Ala Thr Lys Ser Phe Ile Lys Gly Cys
195 200 205Ile Met Lys Gln Leu Ile Asp Pro Ala Met Phe Asp Gln Ala Gln Thr
210 215 220Glu Ile Gln Ala Thr Met Glu Glu Asn Thr Tyr Pro Ser Phe Leu Lys225 230 235 240Ser Asp Ile Tyr Leu Glu Tyr Thr Arg Thr Gly Ser Glu Ser Pro Lys
245 250 255Val Cys Ser Asp Gln Ser Ser Gly Ser Gly Thr Gly Lys Gly Ile Ser
260 265 270Gly Tyr Leu Pro Thr Leu Asn Glu Asp Glu Glu Trp Lys Cys Asp Gln
275 280 285Asp Met Asp Glu Asp Asp Gly Arg Asp Ala Ala Pro Pro Gly Arg Leu
290 295 300Pro Gln Lys Leu Leu Leu Glu Thr Ala Ala Pro Arg Val Ser Ser Ser305 310 315 320Arg Arg Tyr Ser Glu Gly Arg Glu Phe Arg Tyr Gly Ser Trp Arg Glu
325 330 335Pro Val Asn Pro Tyr Tyr Val Asn Ala Gly Tyr Ala Leu Ala Pro Ala
340 345 350Thr Ser Ala Asn Asp Ser Glu Gln Gln Ser Leu Ser Ser Asp Ala Asp
355 360 365Thr Leu Ser Leu Thr Asp Ser Ser Val Asp Gly Ile Pro Pro Tyr Arg
370 375 380Ile Arg Lys Gln His Arg Arg Glu Met Gln Glu Ser Ala Gln Val Asn385 390 395 400Gly Arg Val Pro Leu Pro His Ile Pro Arg Thr Tyr Arg Val Pro Lys
405 410 415Glu Val Arg Val Glu Pro Gln Lys Phe Ala Glu Glu Leu Ile His Arg
420 425 430Leu Glu Ala Val Gln Arg Thr Arg Glu Ala Glu Glu Lys Leu Glu Glu
435 440 445Arg Leu Lys Arg Val Arg Met Glu Glu Glu Gly Glu Asp Gly Asp Pro
450 455 460Ser Ser Gly Pro Pro Gly Pro Cys His Lys Leu Pro Pro Ala Pro Ala465 470 475 480Trp His His Phe Pro Pro Arg Leu Cys Trp Thr Trp Ala Cys Ala Gly
485 490 495Leu Arg Asp Ala His Glu Glu Asn Pro Glu Ser Ile Leu Asp Glu His
500 505 510Val Gln Arg Val Leu Arg Thr Thr Gly Arg Gln Ser Pro Gly Pro Gly
515 520 525His Arg Ser Pro Asp Ser Gly His Val Ala Lys Met Pro Val Ala Leu
530 535 540Gly Gly Ala Ala Ser Gly His Gly Lys His Val Pro Lys Ser Gly Ala545 550 555 560Lys Leu Asp Ala Ala Gly Leu His His His Arg His Val His His His
565 570 575Val His His Ser Thr Ala Arg Pro Lys Glu Gln Val Glu Ala Glu Ala
580 585 590Thr Arg Arg Ala Gln Ser Ser Phe Ala Trp Gly Leu Glu Pro His Ser
595 600 605His Gly Ala Arg Ser Arg Gly Tyr Ser Glu Ser Val Gly Ala Ala Pro
610 615 620Asn Ala Ser Asp Gly Leu Ala His Ser Gly Lys Val Gly Val Ala Cys625 630 635 640Lys Arg Asn Ala Lys Lys Ala Glu Ser Gly Lys Ser Ala Ser Thr Glu
645 650 655Val Pro Gly Ala Ser Glu Asp Ala Glu Lys Asn Gln Lys Ile Met Gln
660 665 670Trp Ile Ile Glu Gly Glu Lys Glu Ile Ser Arg His Arg Arg Thr Gly
675 680 685His Gly Ser Ser Gly Thr Arg Lys Pro Gln Pro His Glu Asn Ser Arg
690 695 700Pro Leu Ser Leu Glu His Pro Trp Ala Gly Pro Gln Leu Arg Thr Ser705 710 715 720Val Gln Pro Ser His Leu Phe Ile Gln Asp Pro Thr Met Pro Pro His
725 730 735Pro Ala Pro Asn Pro Leu Thr Gln Leu Glu Glu Ala Arg Arg Arg Leu
740 745 750Glu Glu Glu Glu Lys Arg Ala Ser Arg Ala Pro Ser Lys Gln Arg Tyr
755 760 765Val Gln Glu Val Met Arg Arg Gly Arg Ala Cys Val Arg Pro Ala Cys
770 775 780Ala Pro Val Leu His Val Val Pro Ala Val Ser Asp Met Glu Leu Ser785 790 795 800Glu Thr Glu Thr Arg Ser Gln Arg Lys Val Gly Gly Gly Ser Ala Gln
805 810 815Pro Cys Asp Ser Ile Val Val Ala Tyr Tyr Phe Cys Gly Glu Pro Ile
820 825 830Pro Tyr Arg Thr Leu Val Arg Gly Arg Ala Val Thr Leu Gly Gln Phe
835 840 845Lys Glu Leu Leu Thr Lys Lys Gly Ser Tyr Arg Tyr Tyr Phe Lys Lys
850 855 860Val Ser Asp Glu Phe Asp Cys Gly Val Val Phe Glu Glu Val Arg Glu865 870 875 880Asp Glu Ala Val Leu Pro Val Phe Glu Glu Lys Ile Ile Gly Lys Val
885 890 895Glu Lys Val Asp
900<210>92<211>591<212>PRT<213>人(Homo sapiens)<400>92Met Val Pro Val Ala Val Thr Ala Ala Val Ala Pro Val Leu Ser Ile1 5 10 15Asn Ser Asp Phe Ser Asp Leu Arg Glu Ile Lys Lys Gln Leu Leu Leu
20 25 30Ile Ala Gly Leu Thr Arg Glu Arg Gly Leu Leu His Ser Ser Lys Trp
35 40 45Ser Ala Glu Leu Ala Phe Ser Leu Pro Ala Leu Pro Leu Ala Glu Leu
50 55 60Gln Pro Pro Pro Pro Ile Thr Glu Glu Asp Ala Gln Asp Met Asp Ala65 70 75 80Tyr Thr Leu Ala Lys Ala Tyr Phe Asp Val Lys Glu Tyr Asp Arg Ala
85 90 95Ala His Phe Leu His Gly Cys Asn Ser Lys Lys Ala Tyr Phe Leu Tyr
100 105 110Met Tyr Ser Arg Tyr Leu Ser Gly Glu Lys Lys Lys Asp Asp Glu Thr
115 120 125Val Asp Ser Leu Gly Pro Leu Glu Lys Gly Gln Val Lys Asn Glu Ala
130 135 140Leu Arg Glu Leu Arg Val Glu Leu Ser Lys Lys His Gln Ala Arg Glu145 150 155 160Leu Asp Gly Phe Gly Leu Tyr Leu Tyr Gly Val Val Leu Arg Lys Leu
165 170 175Asp Leu Val Lys Glu Ala Ile Asp Val Phe Val Glu Ala Thr His Val
180 185 190Leu Pro Leu His Trp Gly Ala Trp Leu Glu Leu Cys Asn Leu Ile Thr
195 200 205Asp Lys Glu Met Leu Lys Phe Leu Ser Leu Pro Asp Thr Trp Met Lys
210 215 220Glu Phe Phe Leu Ala His Ile Tyr Thr Glu Leu Gln Leu Ile Glu Glu225 230 235 240Ala Leu Gln Lys Tyr Gln Asn Leu Ile Asp Val Gly Phe Ser Lys Ser
245 250 255Ser Tyr Ile Val Ser Gln Ile Ala Val Ala Tyr His Asn Ile Arg Asp
260 265 270Ile Asp Lys Ala Leu Ser Ile Phe Asn Glu Leu Arg Lys Gln Asp Pro
275 280 285Tyr Arg Ile Glu Asn Met Asp Thr Phe Ser Asn Leu Leu Tyr Val Arg
290 295 300Ser Met Lys Ser Glu Leu Ser Tyr Leu Ala His Asn Leu Cys Glu Ile305 310 315 320Asp Lys Tyr Arg Val Glu Thr Cys Cys Val Ile Gly Asn Tyr Tyr Ser
325 330 335Leu Arg Ser Gln His Glu Lys Ala Ala Leu Tyr Phe Gln Arg Ala Leu
340 345 350Lys Leu Asn Pro Arg Tyr Leu Gly Ala Trp Thr Leu Met Gly His Glu
355 360 365Tyr Met Glu Met Lys Asn Thr Ser Ala Ala Ile Gln Ala Tyr Arg His
370 375 380Ala Ile Glu Val Asn Lys Arg Asp Tyr Arg Ala Trp Tyr Gly Leu Gly385 390 395 400Gln Thr Tyr Glu Ile Leu Lys Met Pro Phe Tyr Cys Leu Tyr Tyr Tyr
405 410 415Arg Arg Ala His Gln Leu Arg Pro Asn Asp Ser Arg Met Leu Val Ala
420 425 430Leu Gly Glu Cys Tyr Glu Lys Leu Asn Gln Leu Val Glu Ala Lys Lys
435 440 445Cys Tyr Trp Arg Ala Tyr Ala Val Gly Asp Val Glu Lys Met Ala Leu
450 455 460Val Lys Leu Ala Lys Leu His Glu Gln Leu Thr Glu Ser Glu Gln Ala465 470 475 480Ala Gln Cys Tyr Ile Lys Tyr Ile Gln Asp Ile Tyr Ser Cys Gly Glu
485 490 495Ile Val Glu His Leu Glu Glu Ser Thr Ala Phe Arg Tyr Leu Ala Gln
500 505 510Tyr Tyr Phe Lys Cys Lys Leu Trp Asp Glu Ala Ser Thr Cys Ala Gln
515 520 525Lys Cys Cys Ala Phe Asn Asp Thr Arg Glu Glu Gly Lys Ala Leu Leu
530 535 540Arg Gln Ile Leu Gln Leu Arg Asn Gln Gly Glu Thr Pro Thr Thr Glu545 550 555 560Val Pro Ala Pro Phe Phe Leu Pro Ala Ser Leu Ser Ala Asn Asn Thr
565 570 575Pro Thr Arg Arg Val Ser Pro Leu Asn Leu Ser Ser Val Thr Pro
580 585 590<210>93<211>914<212>PRT<213>人(Homo sapiens)<400>93Val Tyr Gln Val Leu Leu Val Gly Ser Thr Leu Leu Lys Glu Val Pro1 5 10 15Ser Gly Leu Gln Leu Glu Gln Leu Pro Ser Gln Ser Leu Leu Thr His
20 25 30Ile Pro Thr Ala Gly Leu Pro Thr Ser Leu Gly Gly Gly Leu Pro Tyr
35 40 45Cys His Gln Ala Trp Leu Asp Phe Arg Arg Arg Leu Glu Ala Leu Leu
50 55 60Gln Asn Cys Gln Ala Ala Cys Ala Leu Leu Gln Gly Ala Ile Glu Ser65 70 75 80Val Lys Ala Val Pro Gln Pro Met Glu Pro Gly Glu Val Gly Gln Leu
85 90 95Leu Gln Gln Thr Glu Val Leu Met Gln Gln Val Leu Asp Ser Pro Trp
100 105 110Leu Ala Trp Leu Gln Cys Gln Gly Gly Arg Glu Leu Thr Trp Leu Lys
115 120 125Gln Glu Val Pro Glu Val Thr Leu Ser Pro Asp Tyr Arg Thr Ala Met
130 135 140Asp Lys Ala Asp Glu Leu Tyr Asp Arg Val Asp Gly Leu Leu His Gln145 150 155 160Leu Thr Leu Gln Ser Asn Gln Arg Ile Gln Ala Leu Glu Leu Val Gln
165 170 175Thr Leu Glu Ala Arg Glu Ser Gly Leu His Gln Ile Glu Val Trp Leu
180 185 190Gln Gln Val Gly Trp Pro Ala Leu Glu Glu Ala Gly Glu Pro Ser Leu
195 200 205Asp Met Leu Leu Gln Ala Gln Gly Ser Phe Gln Glu Leu Tyr Gln Val
210 215 220Ala Gln Glu Gln Val Arg Gln Gly Glu Lys Phe Leu Gln Pro Leu Thr225 230 235 240Gly Trp Glu Ala Ala Glu Leu Asp Pro Pro Gly Ala Arg Phe Leu Ala
245 250 255Leu Arg Ala Gln Leu Thr Glu Phe Ser Arg Ala Leu Ala Gln Arg Cys
260 265 270Gln Arg Leu Ala Asp Ala Glu Arg Leu Phe Gln Leu Phe Arg Glu Ala
275 280 285Leu Thr Trp Ala Glu Glu Gly Gln Arg Val Leu Ala Glu Leu Glu Gln
290 295 300Glu Arg Pro Gly Val Val Leu Gln Gln Leu Gln Leu His Trp Thr Arg305 310 315 320His Pro Asp Leu Pro Pro Ala His Phe Arg Lys Met Trp Ala Leu Ala
325 330 335Thr Gly Leu Gly Ser Glu Ala Ile Arg Gln Glu Cys Arg Trp Ala Trp
340 345 350Ala Arg Cys Gln Asp Thr Trp Leu Ala Leu Asp Gln Lys Leu Glu Ala
355 360 365Ser Leu Lys Leu Pro Pro Val Gly Ser Thr Ala Ser Leu Cys Val Ser
370 375 380Gln Val Pro Ala Ala Pro Ala His Pro Pro Leu Arg Lys Ala Tyr Ser385 390 395 400Phe Asp Arg Asn Leu Gly Gln Ser Leu Ser Glu Pro Ala Cys His Cys
405 410 415His His Ala Ala Thr Ile Ala Ala Cys Arg Arg Pro Glu Ala Gly Gly
420 425 430Gly Ala Leu Pro Gln Ala Ser Pro Thr Val Pro Pro Pro Gly Ser Ser
435 440 445Asp Pro Arg Ser Leu Asn Arg Leu Gln Leu Val Leu Ala Glu Met Val
450 455 460Ala Thr Glu Arg Glu Tyr Val Arg Ala Leu Glu Tyr Thr Met Glu Asn465 470 475 480Tyr Phe Pro Glu Leu Asp Arg Pro Asp Val Pro Gln Gly Leu Arg Gly
485 490 495Gln Arg Ala His Leu Phe Gly Asn Leu Glu Lys Leu Arg Asp Phe His
500 505 510Cys His Phe Phe Leu Arg Glu Leu Glu Ala Cys Thr Arg His Pro Pro
515 520 525Arg Val Ala Tyr Ala Phe Leu Arg His Arg Val Gln Phe Gly Met Tyr
530 535 540Ala Leu Tyr Ser Lys Asn Lys Pro Arg Ser Asp Ala Leu Met Ser Ser545 550 555 560Tyr Gly His Thr Phe Phe Lys Asp Lys Gln Gln Ala Leu Gly Asp His
565 570 575Leu Asp Leu Ala Ser Tyr Leu Leu Lys Pro Ile Gln Arg Met Gly Lys
580 585 590Tyr Ala Leu Leu Leu Gln Glu Leu Ala Arg Ala Cys Gly Gly Pro Thr
595 600 605Gln Glu Leu Ser Ala Leu Arg Glu Ala Gln Ser Leu Val His Phe Gln
610 615 620Leu Arg His Gly Asn Asp Leu Leu Ala Met Asp Ala Ile Gln Gly Cys625 630 635 640Asp Val Asn Leu Lys Glu Gln Gly Gln Leu Val Arg Gln Asp Glu Phe
645 650 655Val Val Arg Thr Gly Arg His Lys Ser Val Arg Arg Ile Phe Leu Phe
660 665 670Glu Glu Leu Leu Leu Phe Ser Lys Pro Arg His Gly Pro Thr Gly Val
675 680 685Asp Thr Phe Ala Tyr Lys Arg Ser Phe Lys Met Ala Asp Leu Gly Leu
690 695 700Thr Glu Cys Cys Gly Asn Ser Asn Leu Arg Phe Glu Ile Trp Phe Arg705 710 715 720Arg Arg Lys Ala Arg Asp Thr Phe Val Leu Gln Ala Ser Ser Leu Ala
725 730 735Ile Lys Gln Ala Trp Thr Ala Asp Ile Ser His Leu Leu Trp Arg Gln
740 745 750Ala Val His Asn Lys Glu Val Arg Met Ala Glu Met Val Ser Met Gly
755 760 765Val Gly Asn Lys Ala Phe Arg Asp Ile Ala Pro Ser Glu Glu Ala Ile
770 775 780Asn Asp Arg Thr Val Asn Tyr Val Leu Lys Cys Arg Glu Val Arg Ser785 790 795 800Arg Ala Ser Ile Ala Val Ala Pro Phe Asp His Asp Ser Leu Tyr Leu
805 810 815Gly Ala Ser Asn Ser Leu Pro Gly Asp Pro Ala Ser Cys Ser Val Leu
820 825 830Gly Ser Leu Asn Leu His Leu Tyr Arg Asp Pro Ala Leu Leu Gly Leu
835 840 845Arg Cys Pro Leu Tyr Pro Ser Phe Leu Glu Glu Ala Ala Leu Glu Ala
850 855 860Glu Ala Glu Leu Gly Gly Gln Pro Ser Leu Thr Ala Glu Asp Ser Glu865 870 875 880Ile Ser Ser Gln Cys Pro Ser Ala Ser Gly Ser Ser Gly Ser Asp Ser
885 890 895Ser Cys Val Ser Gly Gln Ala Leu Gly Arg Gly Leu Glu Asp Leu Pro
900 905 910Cys Val<210>94<211>277<212>PRT<213>人(Homo sapiens)<400>94Leu Asn Tyr Leu Leu Glu Ser Arg Leu Glu Ala Ala Ala His Cys Ala1 5 10 15Leu Lys Gln Gly Ile Ala Thr Ala Ser Leu Leu Pro Ala Gln Leu Gln
20 25 30Pro Ala Val Leu Thr Val Val Thr Cys His Val Val Val Ser Val His
35 40 45Gly His His Thr Asp Gly Cys Leu Ala Ala Leu Cys Arg Glu Asp Arg
50 55 60Thr Gly Thr Gly Gly Ala Phe Trp Cys Lys Asn Arg Val Ile Val Ser65 70 75 80His Ala Val Asp Val Val Leu His Val His Gly Glu Gly Asn Pro Val
85 90 95Gln Ala Leu Ile Ala His Gly Ala Pro Glu Ala Ala Trp Val Val Gly
100 105 110Leu Ala Gln Gly Leu Gln Asp His Phe His Asp Glu Met Ser Thr His
115 120 125Ala Ala Phe Val Gly Arg Leu Leu Glu Pro Gly Val Gln Glu Val Leu
130 135 140Leu Ala Val His Phe Leu Thr His Val Val Glu Arg Leu Pro Thr Glu145 150 155 160Ser Ser Pro Thr Arg Val Ala Gly Glu Ala Val Ser Val Ile Lys Thr
165 170 175Pro His Cys Leu Ala Arg Leu Leu Gly Ser Val Asp Ala Lys Pro Thr
180 185 190Leu Asp Ala Asn Ala Glu Val Val Pro Arg Arg Ala Arg Leu Glu Arg
195 200 205Pro Leu Gln Leu Pro Gly Glu Arg Leu Gln Pro Pro Leu Gly Arg Ala
210 215 220Trp Ala Ala Leu Pro Ala Arg Gly Gln Arg Glu Cys Arg Gln Arg Glu225 230 235 240Gly Gly Arg Pro Arg Arg Leu Arg Gly Ala Ser Gly Arg Gly Ala Gly
245 250 255Ala Gly Arg Glu Glu Val Ser Val Gly Phe Ser Ala Gln Trp Glu Phe
260 265 270Gly Ser Gly Arg His
275<210>95<211>1120<212>PRT<213>人(Homo sapiens)<400>95Met Trp Arg Val Lys Lys Leu Ser Leu Ser Leu Ser Pro Ser Pro Gln1 5 10 15Thr Gly Lys Pro Ser Met Arg Thr Pro Leu Arg Glu Leu Thr Leu Gln
20 25 30Pro Gly Ala Leu Thr Thr Ser Gly Lys Arg Ser Pro Ala Cys Ser Ser
35 40 45Leu Thr Pro Ser Leu Cys Lys Leu Gly Leu Gln Glu Gly Ser Asn Asn
50 55 60Ser Ser Pro Val Asp Phe Val Asn Asn Lys Arg Thr Asp Leu Ser Ser65 70 75 80Glu His Phe Ser His Ser Ser Lys Trp Leu Glu Thr Cys Gln His Glu
85 90 95Ser Asp Glu Gln Pro Leu Asp Pro Ile Pro Gln Ile Ser Ser Thr Pro
100 105 110Lys Thr Ser Glu Glu Ala Val Asp Pro Leu Gly Asn Tyr Met Val Lys
115 120 125Thr Ile Val Leu Val Pro Ser Pro Leu Gly Gln Gln Gln Asp Met Ile
130 135 140Phe Glu Ala Arg Leu Asp Thr Met Ala Glu Thr Asn Ser Ile Ser Leu145 150 155 160Asn Gly Pro Leu Arg Thr Asp Asp Leu Val Arg Glu Glu Val Ala Pro
165 170 175Cys Met Gly Asp Arg Phe Ser Glu Val Ala Ala Val Ser Glu Lys Pro
180 185 190Ile Phe Gln Glu Ser Pro Ser His Leu Leu Glu Glu Ser Pro Pro Asn
195 200 205Pro Cys Ser Glu Gln Leu His Cys Ser Lys Glu Ser Leu Ser Ser Arg210 215 220Thr Glu Ala Val Arg Glu Asp Leu Val Pro Ser Glu Ser Asn Ala Phe225 230 235 240Leu Pro Ser Ser Val Leu Trp Leu Ser Pro Ser Thr Ala Leu Ala Ala
245 250 255Asp Phe Arg Val Asn His Val Asp Pro Glu Glu Glu Ile Val Glu His
260 265 270Gly Ala Met Glu Glu Arg Glu Met Arg Phe Pro Thr His Pro Lys Glu
275 280 285Ser Glu Thr Glu Asp Gln Ala Leu Val Ser Ser Val Glu Asp Ile Leu
290 295 300Ser Thr Cys Leu Thr Pro Asn Leu Val Glu Met Glu Ser Gln Glu Ala305 310 315 320Pro Gly Pro Ala Val Glu Asp Val Gly Arg Ile Leu Gly Ser Asp Thr
325 330 335Glu Ser Trp Met Ser Pro Leu Ala Trp Leu Glu Lys Gly Val Asn Thr
340 345 350Ser Val Met Leu Glu Asn Leu Arg Gln Ser Leu Ser Leu Pro Ser Met
355 360 365Leu Arg Asp Ala Ala Ile Gly Thr Thr Pro Phe Ser Thr Cys Ser Val
370 375 380Gly Thr Trp Phe Thr Pro Ser Ala Pro Gln Glu Lys Ser Thr Asn Thr385 390 395 400Ser Gln Thr Gly Leu Val Gly Thr Lys His Ser Thr Ser Glu Thr Glu
405 410 415Gln Leu Leu Cys Gly Arg Pro Pro Asp Leu Thr Ala Leu Ser Arg His
420 425 430Asp Leu Glu Asp Asn Leu Leu Ser Ser Leu Val Ile Val Glu Phe Leu
435 440 445Ser Arg Gln Leu Arg Asp Trp Lys Ser Gln Leu Ala Val Pro His Pro
450 455 460Glu Thr Gln Asp Ser Ser Thr Gln Thr Asp Thr Ser His Ser Gly Ile465 470 475 480Thr Asn Lys Leu Gln His Leu Lys Glu Ser His Glu Met Gly Gln Ala
485 490 495Leu Gln Gln Ala Arg Asn Val Met Gln Ser Trp Val Leu Ile Ser Lys
500 505 510Glu Leu Ile Ser Leu Leu His Leu Ser Leu Leu His Leu Glu Glu Asp
515 520 525Lys Thr Thr Val Asn Gln Glu Ser Arg Arg Ala Glu Thr Leu Val Cys
530 535 540Cys Cys Phe Asp Leu Leu Lys Lys Leu Arg Ala Lys Leu Gln Ser Leu545 550 555 560Lys Ala Glu Arg Glu Glu Ala Arg His Arg Glu Glu Met Ala Leu Arg
565 570 575Gly Lys Asp Ala Ala Glu Ile Val Leu Glu Ala Phe Cys Ala His Ala
580 585 590Ser Gln Arg Ile Ser Gln Leu Glu Gln Asp Leu Ala Ser Met Arg Glu
595 600 605Phe Arg Gly Leu Leu Lys Asp Ala Gln Thr Gln Leu Val Gly Leu His
610 615 620Ala Lys Gln Glu Glu Leu Val Gln Gln Thr Val Ser Leu Thr Ser Thr625 630 635 640Leu Gln Gln Asp Trp Arg Ser Met Gln Leu Asp Tyr Thr Thr Trp Thr
645 650 655Ala Leu Leu Ser Arg Ser Arg Gln Leu Thr Glu Lys Leu Thr Val Lys
660 665 670Ser Gln Gln Ala Leu Gln Glu Arg Asp Val Ala Ile Glu Glu Lys Gln
675 680 685Glu Val Ser Arg Val Leu Glu Gln Val Ser Ala Gln Leu Glu Glu Cys
690 695 700Lys Gly Gln Thr Glu Gln Leu Glu Leu Glu Asn Ile Arg Leu Ala Thr705 710 715 720Asp Leu Arg Ala Gln Leu Gln Ile Leu Ala Asn Met Asp Ser Gln Leu
725 730 735Lys Glu Leu Gln Ser Gln His Thr His Cys Ala Gln Asp Leu Ala Met
740 745 750Lys Asp Glu Leu Leu Cys Gln Leu Thr Gln Ser Asn Glu Glu Gln Ala
755 760 765Ala Gln Cys Val Lys Glu Glu Met Ala Leu Lys His Met Gln Ala Glu
770 775 780Leu Gln Gln Gln Gln Ala Val Leu Ala Lys Glu Val Arg Asp Leu Lys785 790 795 800Glu Thr Leu Glu Phe Ala Asp Gln Glu Asn Gln Val Ala His Leu Glu
805 810 815Leu Gly Gln Val Glu Cys Gln Leu Lys Thr Thr Leu Glu Val Leu Arg
820 825 830Glu Arg Ser Leu Gln Cys Glu Asn Leu Lys Asp Thr Val Glu Asn Leu
835 840 845Thr Ala Lys Leu Ala Ser Thr Ile Ala Asp Asn Gln Glu Gln Asp Leu
850 855 860Glu Lys Thr Arg Gln Tyr Ser Gln Lys Leu Gly Leu Leu Thr Glu Gln865 870 875 880Leu Gln Ser Leu Thr Leu Phe Leu Gln Thr Lys Leu Lys Glu Lys Thr
885 890 895Glu Gln Glu Thr Leu Leu Leu Ser Thr Ala Cys Pro Pro Thr Gln Glu
900 905 910His Pro Leu Pro Asn Asp Arg Thr Phe Leu Gly Ser Ile Leu Thr Ala
915 920 925Val Ala Asp Glu Glu Pro Glu Ser Thr Pro Val Pro Leu Leu Gly Ser
930 935 940Asp Lys Ser Ala Phe Thr Arg Val Ala Ser Met Val Ser Leu Gln Pro945 950 955 960Ala Glu Thr Pro Gly Met Glu Glu Ser Leu Ala Glu Met Ser Ile Met
965 970 975Thr Thr Glu Leu Gln Ser Leu Cys Ser Leu Leu Gln Glu Ser Lys Glu
980 985 990Glu Ala Ile Arg Thr Leu Gln Arg Lys Ile Cys Glu Leu Gln Ala Arg
995 1000 1005Leu Gln Ala Gln Glu Glu Gln His Gln Glu Val Gln Lys Ala Lys Glu
1010 1015 1020Ala Asp Ile Glu Lys Leu Asn Gln Ala Leu Cys Leu Arg Tyr Lys Asn1025 1030 1035 1040Glu Lys Glu Leu Gln Glu Val Ile Gln Gln Asn Glu Lys Ile Leu Glu
1045 1050 1055Gln Ile Asp Lys Ser Gly Glu Leu Ile Ser Leu Arg Glu Glu Val Thr
1060 1065 1070His Leu Thr Arg Ser Leu Arg Arg Ala Glu Thr Glu Thr Lys Val Leu
1075 1080 1085Gln Glu Ala Trp Gln Ala Ser Trp Thr Pro Thr Ala Ser Leu Trp Pro
1090 1095 1100Pro Ile Gly Ser Arg Arg Lys Cys Gly Ser Leu Arg Arg Trp Thr Asn1105 1110 1115 1120<210>96<211>540<212>PRT<213>人(Homo sapiens)<400>96Met Gly Thr Thr Ala Arg Ala Ala Leu Val Leu Thr Tyr Leu Ala Val1 5 10 15Ala Ser Ala Ala Ser Glu Gly Gly Phe Thr Ala Thr Gly Gln Arg Gln
20 25 30Leu Arg Pro Glu His Phe Gln Glu Val Gly Tyr Ala Ala Pro Pro Ser
35 40 45Pro Pro Leu Ser Arg Ser Leu Pro Met Asp His Pro Asp Ser Ser Gln
50 55 60His Gly Pro Pro Phe Glu Gly Gln Ser Gln Val Gln Pro Pro Pro Ser65 70 75 80Gln Glu Ala Thr Pro Leu Gln Gln Glu Lys Leu Leu Pro Ala Gln Leu
85 90 95Pro Ala Glu Lys Glu Val Gly Pro Pro Leu Pro Gln Glu Ala Val Pro
100 105 110Leu Gln Lys Glu Leu Pro Ser Leu Gln His Pro Asn Glu Gln Lys Glu
115 120 125Gly Thr Pro Ala Pro Phe Gly Asp Gln Ser His Pro Glu Pro Glu Ser
130 135 140Trp Asn Ala Ala Gln His Cys Gln Gln Asp Arg Ser Gln Gly Gly Trp145 150 155 160Gly His Arg Leu Asp Gly Phe Pro Pro Gly Arg Pro Ser Pro Asp Asn
165 170 175Leu Asn Gln Ile Cys Leu Pro Asn Arg Gln His Val Val Tyr Gly Pro
180 185 190Trp Asn Leu Pro Gln Ser Ser Tyr Ser His Leu Thr Arg Gln Gly Glu
195 200 205Thr Leu Asn Phe Leu Glu Ile Gly Tyr Ser Arg Cys Cys His Cys Arg
210 215 220Ser His Thr Asn Arg Leu Glu Cys Ala Lys Leu Val Trp Glu Glu Ala225 230 235 240Met Ser Arg Phe Cys Glu Ala Glu Phe Ser Val Lys Thr Arg Pro His
245 250 255Trp Cys Cys Thr Arg Gln Gly Glu Ala Arg Phe Ser Cys Phe Gln Glu
260 265 270Glu Ala Pro Gln Pro His Tyr Gln Leu Arg Ala Cys Pro Ser His Gln
275 280 285Pro Asp Ile Ser Ser Gly Leu Glu Leu Pro Phe Pro Pro Gly Val Pro
290 295 300Thr Leu Asp Asn Ile Lys Asn Ile Cys His Leu Arg Arg Phe Arg Ser305 310 315 320Val Pro Arg Asn Leu Pro Ala Thr Asp Pro Leu Gln Arg Glu Leu Leu
325 330 335Ala Leu Ile Gln Leu Glu Arg Glu Phe Gln Arg Cys Cys Arg Gln Gly
340 345 350Asn Asn His Thr Cys Thr Trp Lys Ala Trp Glu Asp Thr Leu Asp Lys
355 360 365Tyr Cys Asp Arg Glu Tyr Ala Val Lys Thr His His His Leu Cys Cys
370 375 380Arg His Pro Pro Ser Pro Thr Arg Asp Glu Cys Phe Ala Arg Arg Ala385 390 395 400Pro Tyr Pro Asn Tyr Asp Arg Asp Ile Leu Thr Ile Asp Ile Ser Arg
405 410 415Val Thr Pro Asn Leu Met Gly His Leu Cys Gly Asn Gln Arg Val Leu
420 425 430Thr Lys His Lys His Ile Pro Gly Leu Ile His Asn Met Thr Ala Arg
435 440 445Cys Cys Asp Leu Pro Phe Pro Glu Gln Ala Cys Cys Ala Glu Glu Glu
450 455 460Lys Leu Thr Phe Ile Asn Asp Leu Cys Gly Pro Arg Arg Asn Ile Trp465 470 475 480Arg Asp Pro Ala Leu Cys Cys Tyr Leu Ser Pro Gly Asp Glu Gln Val
485 490 495Asn Cys Phe Asn Ile Asn Tyr Leu Arg Asn Val Ala Leu Val Ser Gly
500 505 510Asp Thr Glu Asn Ala Lys Gly Gln Gly Glu Gln Gly Ser Thr Gly Gly
515 520 525Thr Asn Ile Ser Ser Thr Ser Glu Pro Lys Glu Glu
530 535 540<210>97<211>462<212>PRT<213>人(Homo sapiens)<400>97Met Gly Lys Glu Lys Thr His Ile Asn Ile Val Val Ile Gly His Val1 5 10 15Asp Ser Gly Lys Ser Thr Thr Thr Gly His Leu Ile Tyr Lys Cys Gly
20 25 30Gly Ile Asp Lys Arg Thr Ile Glu Lys Phe Glu Lys Glu Ala Ala Glu
35 40 45Met Gly Lys Gly Ser Phe Lys Tyr Ala Trp Val Leu Asp Lys Leu Lys
50 55 60Ala Glu Arg Glu Arg Gly Ile Thr Ile Asp Ile Ser Leu Trp Lys Phe65 70 75 80Glu Thr Ser Lys Tyr Tyr Val Thr Ile Ile Asp Ala Pro Gly His Arg
85 90 95Asp Phe Ile Lys Asn Met Ile Thr Gly Thr Ser Gln Ala Asp Cys Ala
100 105 110Val Leu Ile Val Ala Ala Gly Val Gly Glu Phe Glu Ala Gly Ile Ser
115 120 125Lys Asn Gly Gln Thr Arg Glu His Ala Leu Leu Ala Tyr Thr Leu Gly
130 135 140Val Lys Gln Leu Ile Val Gly Val Asn Lys Met Asp Ser Thr Glu Pro145 150 155 160Pro Tyr Ser Gln Lys Arg Tyr Glu Glu Ile Val Lys Glu Val Ser Thr
165 170 175Tyr Ile Lys Lys Ile Gly Tyr Asn Pro Asp Thr Val Ala Phe Val Pro
180 185 190Ile Ser Gly Trp Asn Gly Asp Asn Met Leu Glu Pro Ser Ala Asn Met
195 200 205Pro Trp Phe Lys Gly Trp Lys Val Thr Arg Lys Asp Gly Asn Ala Ser
210 215 220Gly Thr Thr Leu Leu Glu Ala Val Asp Cys Ile Leu Pro Pro Thr Arg225 230 235 240Pro Thr Asp Lys Pro Leu Arg Leu Pro Leu Gln Asp Val Tyr Lys Ile
245 250 255Gly Gly Ile Gly Thr Val Pro Val Gly Arg Val Glu Thr Gly Val Leu
260 265 270Lys Pro Gly Met Val Val Thr Phe Ala Pro Val Asn Val Thr Thr Glu
275 280 285Val Lys Ser Val Glu Met His His Glu Ala Leu Ser Glu Ala Leu Pro
290 295 300Gly Asp Asn Val Gly Phe Asn Val Lys Asn Val Ser Val Lys Asp Val305 310 315 320Arg Arg Gly Asn Val Ala Gly Asp Ser Lys Asn Asp Pro Pro Met Glu
325 330 335Ala Ala Gly Phe Thr Ala Gln Val Ile Ile Leu Asn His Pro Gly Gln
340 345 350Ile Ser Ala Gly Tyr Ala Pro Val Leu Asp Cys His Thr Ala His Ile
355 360 365Ala Cys Lys Phe Ala Glu Leu Lys Glu Lys Ile Asp Arg Arg Ser Gly
370 375 380Lys Lys Leu Glu Asp Gly Pro Lys Phe Leu Lys Ser Gly Asp Ala Ala385 390 395 400Ile Val Asp Met Val Pro Gly Lys Pro Met Cys Val Glu Ser Phe Ser
405 410 415Asp Tyr Pro Pro Leu Gly Arg Phe Ala Val Arg Asp Met Arg Gln Thr
420 425 430Val Ala Val Gly Val Ile Lys Ala Val Asp Lys Lys Ala Ala Gly Ala
435 440 445Gly Lys Val Thr Lys Ser Ala Gln Lys Ala Gln Lys Ala Lys
450 455 460<210>98<211>2328<212>PRT<213>人(Homo sapiens)<400>98Lys Ser Lys Arg Gln Ala Gln Gln Met Val Gln Pro Gln Ser Pro Val1 5 10 15Ala Val Ser Gln Ser Lys Pro Gly Cys Tyr Asp Asn Gly Lys His Tyr
20 25 30Gln Ile Asn Gln Gln Trp Glu Arg Thr Tyr Leu Gly Asn Val Leu Val
35 40 45Cys Thr Cys Tyr Gly Gly Ser Arg Gly Phe Asn Cys Glu Ser Lys Pro
50 55 60Glu Ala Glu Glu Thr Cys Phe Asp Lys Tyr Thr Gly Asn Thr Tyr Arg65 70 75 80Val Gly Asp Thr Tyr Glu Arg Pro Lys Asp Ser Met Ile Trp Asp Cys
85 90 95Thr Cys Ile Gly Ala Gly Arg Gly Arg Ile Ser Cys Thr Ile Ala Asn
100 105 110Arg Cys His Glu Gly Gly Gln Ser Tyr Lys Ile Gly Asp Thr Trp Arg
115 120 125Arg Pro His Glu Thr Gly Gly Tyr Met Leu Glu Cys Val Cys Leu Gly
130 135 140Asn Gly Lys Gly Glu Trp Thr Cys Lys Pro Ile Ala Glu Lys Cys Phe145 150 155 160Asp His Ala Ala Gly Thr Ser Tyr Val Val Gly Glu Thr Trp Glu Lys
165 170 175Pro Tyr Gln Gly Trp Met Met Val Asp Cys Thr Cys Leu Gly Glu Gly
180 185 190Ser Gly Arg Ile Thr Cys Thr Ser Arg Asn Arg Cys Asn Asp Gln Asp
195 200 205Thr Arg Thr Ser Tyr Arg Ile Gly Asp Thr Trp Ser Lys Lys Asp Asn
210 215 220Arg Gly Asn Leu Leu Gln Cys Ile Cys Thr Gly Asn Gly Arg Gly Glu225 230 235 240Trp Lys Cys Glu Arg His Thr Ser Val Gln Thr Thr Ser Ser Gly Ser
245 250 255Gly Pro Phe Thr Asp Val Arg Ala Ala Val Tyr Gln Pro Gln Pro His
260 265 270Pro Gln Pro Pro Pro Tyr Gly His Cys Val Thr Asp Ser Gly Val Val
275 280 285Tyr Ser Val Gly Met Gln Trp Leu Lys Thr Gln Gly Asn Lys Gln Met
290 295 300Leu Cys Thr Cys Leu Gly Asn Gly Val Ser Cys Gln Glu Thr Ala Val305 310 315 320Thr Gln Thr Tyr Gly Gly Asn Leu Asn Gly Glu Pro Cys Val Leu Pro
325 330 335Phe Thr Tyr Asn Gly Arg Thr Phe Tyr Ser Cys Thr Thr Glu Gly Arg
340 345 350Gln Asp Gly His Leu Trp Cys Ser Thr Thr Ser Asn Tyr Glu Gln Asp
355 360 365Gln Lys Tyr Ser Phe Cys Thr Asp His Thr Val Leu Val Gln Thr Gln
370 375 380Gly Gly Asn Ser Asn Gly Ala Leu Cys His Phe Pro Phe Leu Tyr Asn385 390 395 400Asn His Asn Tyr Thr Asp Cys Thr Ser Glu Gly Arg Arg Asp Asn Met
405 410 415Lys Trp Cys Gly Thr Thr Gln Asn Tyr Asp Ala Asp Gln Lys Phe Gly
420 425 430Phe Cys Pro Met Ala Ala His Glu Glu Ile Cys Thr Thr Asn Glu Gly
435 440 445Val Met Tyr Arg Ile Gly Asp Gln Trp Asp Lys Gln His Asp Met Gly
450 455 460His Met Met Arg Cys Thr Cys Val Gly Asn Gly Arg Gly Glu Trp Thr465 470 475 480Cys Ile Ala Tyr Ser Gln Leu Arg Asp Gln Cys Ile Val Asp Asp Ile
485 490 495Thr Tyr Asn Val Asn Asp Thr Phe His Lys Arg His Glu Glu Gly His
500 505 510Met Leu Asn Cys Thr Cys Phe Gly Gln Gly Arg Gly Arg Trp Lys Cys
515 520 525Asp Pro Val Asp Gln Cys Gln Asp Ser Glu Thr Gly Thr Phe Tyr Gln
530 535 540Ile Gly Asp Ser Trp Glu Lys Tyr Val His Gly Val Arg Tyr Gln Cys545 550 555 560Tyr Cys Tyr Gly Arg Gly Ile Gly Glu Trp His Cys Gln Pro Leu Gln
565 570 575Thr Tyr Pro Ser Ser Ser Gly Pro Val Glu Val Phe Ile Thr Glu Thr
580 585 590Pro Ser Gln Pro Asn Ser His Pro Ile Gln Trp Asn Ala Pro Gln Pro
595 600 605Ser His Ile Ser Lys Tyr Ile Leu Arg Trp Arg Pro Lys Asn Ser Val
610 615 620Gly Arg Trp Lys Glu Ala Thr Ile Pro Gly His Leu Asn Ser Tyr Thr625 630 635 640Ile Lys Gly Leu Lys Pro Gly Val Val Tyr Glu Gly Gln Leu Ile Ser
645 650 655Ile Gln Gln Tyr Gly His Gln Glu Val Thr Arg Phe Asp Phe Thr Thr
660 665 670Thr Ser Thr Ser Thr Pro Val Thr Ser Asn Thr Val Thr Gly Glu Thr
675 680 685Thr Pro Phe Ser Pro Leu Val Ala Thr Ser Glu Ser Val Thr Glu Ile
690 695 700Thr Ala Ser Ser Phe Val Val Ser Trp Val Ser Ala Ser Asp Thr Val705 710 715 720Ser Gly Phe Arg Val Glu Tyr Glu Leu Ser Glu Glu Gly Asp Glu Pro
725 730 735Gln Tyr Leu Asp Leu Pro Ser Thr Ala Thr Ser Val Asn Ile Pro Asp
740 745 750Leu Leu Pro Gly Arg Lys Tyr Ile Val Asn Val Tyr Gln Ile Ser Glu
755 760 765Asp Gly Glu Gln Ser Leu Ile Leu Ser Thr Ser Gln Thr Thr Ala Pro
770 775 780Asp Ala Pro Pro Asp Pro Thr Val Asp Gln Val Asp Asp Thr Ser Ile785 790 795 800Val Val Arg Trp Ser Arg Pro Gln Ala Pro Ile Thr Gly Tyr Arg Ile
805 810 815Val Tyr Ser Pro Ser Val Glu Gly Ser Ser Thr Glu Leu Asn Leu Pro
820 825 830Glu Thr Ala Asn Ser Val Thr Leu Ser Asp Leu Gln Pro Gly Val Gln
835 840 845Tyr Asn Ile Thr Ile Tyr Ala Val Glu Glu Asn Gln Glu Ser Thr Pro
850 855 860Val Val Ile Gln Gln Glu Thr Thr Gly Thr Pro Arg Ser Asp Thr Val865 870 875 880Pro Ser Pro Arg Asp Leu Gln Phe Val Glu Val Thr Asp Val Lys Val
885 890 895Thr Ile Met Trp Thr Pro Pro Glu Ser Ala Val Thr Gly Tyr Arg Val
900 905 910Asp Val Ile Pro Val Asn Leu Pro Gly Glu His Gly Gln Arg Leu Pro
915 920 925Ile Ser Arg Asn Thr Phe Ala Glu Val Thr Gly Leu Ser Pro Gly Val
930 935 940Thr Tyr Tyr Phe Lys Val Phe Ala Val Ser His Gly Arg Glu Ser Lys945 950 955 960Pro Leu Thr Ala Gln Gln Thr Thr Lys Leu Asp Ala Pro Thr Asn Leu
965 970 975Gln Phe Val Asn Glu Thr Asp Ser Thr Val Leu Val Arg Trp Thr Pro
980 985 990Pro Arg Ala Gln Ile Thr Gly Tyr Arg Leu Thr Val Gly Leu Thr Arg
995 1000 1005Arg Gly Gln Pro Arg Gln Tyr Asn Val Gly Pro Ser Val Ser Lys Tyr
1010 1015 1020Pro Leu Arg Asn Leu Gln Pro Ala Ser Glu Tyr Thr Val Ser Leu Val1025 1030 1035 1040Ala Ile Lys Gly Asn Gln Glu Ser Pro Lys Ala Thr Gly Val Phe Thr
1045 1050 1055Thr Leu Gln Pro Gly Ser Ser Ile Pro Pro Tyr Asn Thr Glu Val Thr
1060 1065 1070Glu Thr Thr Ile Val Ile Thr Trp Thr Pro Ala Pro Arg Ile Gly Phe
1075 1080 1085Lys Leu Gly Val Arg Pro Ser Gln Gly Gly Glu Ala Pro Arg Glu Val
1090 1095 1100Thr Ser Asp Ser Gly Ser Ile Val Val Ser Gly Leu Thr Pro Gly Val1105 1110 1115 1120Glu Tyr Val Tyr Thr Ile Gln Val Leu Arg Asp Gly Gln Glu Arg Asp
1125 1130 1135Ala Pro Ile Val Asn Lys Val Val Thr Pro Leu Ser Pro Pro Thr Asn
1140 1145 1150Leu His Leu Glu Ala Asn Pro Asp Thr Gly Val Leu Thr Val Ser Trp
1155 1160 1165Glu Arg Ser Thr Thr Pro Asp Ile Thr Gly Tyr Arg Ile Thr Thr Thr
1170 1175 1180Pro Thr Asn Gly Gln Gln Gly Asn Ser Leu Glu Glu Val Val His Ala1185 1190 1195 1200Asp Gln Ser Ser Cys Thr Phe Asp Asn Leu Ser Pro Gly Leu Glu Tyr
1205 1210 1215Asn Val Ser Val Tyr Thr Val Lys Asp Asp Lys Glu Ser Val Pro Ile
1220 1225 1230Ser Asp Thr Ile Ile Pro Ala Val Pro Pro Pro Thr Asp Leu Arg Phe
1235 1240 1245Thr Asn Ile Gly Pro Asp Thr Met Arg Val Thr Trp Ala Pro Pro Pro
1250 1255 1260Ser Ile Asp Leu Thr Asn Phe Leu Val Arg Tyr Ser Pro Val Lys Asn1265 1270 1275 1280Glu Glu Asp Val Ala Glu Leu Ser Ile Ser Pro Ser Asp Asn Ala Val
1285 1290 1295Val Leu Thr Asn Leu Leu Pro Gly Thr Glu Tyr Val Val Ser Val Ser
1300 1305 1310Ser Val Tyr Glu Gln His Glu Ser Thr Pro Leu Arg Gly Arg Gln Lys
1315 1320 1325Thr Gly Leu Asp Ser Pro Thr Gly Ile Asp Phe Ser Asp Ile Thr Ala
1330 1335 1340Asn Ser Phe Thr Val His Trp Ile Ala Pro Arg Ala Thr Ile Thr Gly1345 1350 1355 1360Tyr Arg Ile Arg His His Pro Glu His Phe Ser Gly Arg Pro Arg Glu
1365 1370 1375Asp Arg Val Pro His Ser Arg Asn Ser Ile Thr Leu Thr Asn Leu Thr
1380 1385 1390Pro Gly Thr Glu Tyr Val Val Ser Ile Val Ala Leu Asn Gly Arg Glu
1395 1400 1405Glu Ser Pro Leu Leu Ile Gly Gln Gln Ser Thr Val Ser Asp Val Pro
1410 1415 1420Arg Asp Leu Glu Val Val Ala Ala Thr Pro Thr Ser Leu Leu Ile Ser1425 1430 1435 1440Trp Asp Ala Pro Ala Val Thr Val Arg Tyr Tyr Arg Ile Thr Tyr Gly
1445 1450 1455Glu Thr Gly Gly Asn Ser Pro Val Gln Glu Phe Thr Val Pro Gly Ser
1460 1465 1470Lys Ser Thr Ala Thr Ile Ser Gly Leu Lys Pro Gly Val Asp Tyr Thr
1475 1480 1485Ile Thr Val Tyr Ala Val Thr Gly Arg Gly Asp Ser Pro Ala Ser Ser
1490 1495 1500Lys Pro Ile Ser Ile Asn Tyr Arg Thr Glu Ile Asp Lys Pro Ser Gln1505 1510 1515 1520Met Gln Val Thr Asp Val Gln Asp Asn Ser Ile Ser Val Lys Trp Leu
1525 1530 1535Pro Ser Ser Ser Pro Val Thr Gly Tyr Arg Val Thr Thr Thr Pro Lys
1540 1545 1550Asn Gly Pro Gly Pro Thr Lys Thr Lys Thr Ala Gly Pro Asp Gln Thr
1555 1560 1565Glu Met Thr Ile Glu Gly Leu Gln Pro Thr Val Glu Tyr Val Val Ser
1570 1575 1580Val Tyr Ala Gln Asn Pro Ser Gly Glu Ser Gln Pro Leu Val Gln Thr1585 1590 1595 1600Ala Val Thr Asn Ile Asp Arg Pro Lys Gly Leu Ala Phe Thr Asp Val
1605 1610 1615Asp Val Asp Ser Ile Lys Ile Ala Trp Glu Ser Pro Gln Gly Gln Val
1620 1625 1630Ser Arg Tyr Arg Val Thr Tyr Ser Ser Pro Glu Asp Gly Ile His Glu
1635 1640 1645Leu Phe Pro Ala Pro Asp Gly Glu Glu Asp Thr Ala Glu Leu Gln Gly
1650 1655 1660Leu Arg Pro Gly Ser Glu Tyr Thr Val Ser Val Val Ala Leu His Asp1665 1670 1675 1680Asp Met Glu Ser Gln Pro Leu Ile Gly Thr Gln Ser Thr Ala Ile Pro
1685 1690 1695Ala Pro Thr Asp Leu Lys Phe Thr Gln Val Thr Pro Thr Ser Leu Ser
1700 1705 1710Ala Gln Trp Thr Pro Pro Asn Val Gln Leu Thr Gly Tyr Arg Val Arg
1715 1720 1725Val Thr Pro Lys Glu Lys Thr Gly Pro Met Lys Glu Ile Asn Leu Ala
1730 1735 1740Pro Asp Ser Ser Ser Val Val Val Ser Gly Leu Met Val Ala Thr Lys1745 1750 1755 1760Tyr Glu Val Ser Val Tyr Ala Leu Lys Asp Thr Leu Thr Ser Arg Pro
1765 1770 1775Ala Gln Gly Val Val Thr Thr Leu Glu Asn Val Ser Pro Pro Arg Arg
1780 1785 1790Ala Arg Val Thr Asp Ala Thr Glu Thr Thr Ile Thr Ile Ser Trp Arg
1795 1800 1805Thr Lys Thr Glu Thr Ile Thr Gly Phe Gln Val Asp Ala Val Pro Ala
1810 1815 1820Asn Gly Gln Thr Pro Ile Gln Arg Thr Ile Lys Pro Asp Val Arg Ser1825 1830 1835 1840Tyr Thr Ile Thr Gly Leu Gln Pro Gly Thr Asp Tyr Lys Ile Tyr Leu
1845 1850 1855Tyr Thr Leu Asn Asp Asn Ala Arg Ser Ser Pro Val Val Ile Asp Ala
1860 1865 1870Ser Thr Ala Ile Asp Ala Pro Ser Asn Leu Arg Phe Leu Ala Thr Thr
1875 1880 1885Pro Asn Ser Leu Leu Val Ser Trp Gln Pro Pro Arg Ala Arg Ile Thr
1890 1895 1900Gly Tyr Ile Ile Lys Tyr Glu Lys Pro Gly Ser Pro Pro Arg Glu Val1905 1910 1915 1920Val Pro Arg Pro Arg Pro Gly Val Thr Glu Ala Thr Ile Thr Gly Leu
1925 1930 1935Glu Pro Gly Thr Glu Tyr Thr Ile Tyr Val Ile Ala Leu Lys Asn Asn
1940 1945 1950Gln Lys Ser Glu Pro Leu Ile Gly Arg Lys Lys Thr Asp Glu Leu Pro
1955 1960 1965Gln Leu Val Thr Leu Pro His Pro Asn Leu His Gly Pro Glu Ile Leu
1970 1975 1980Asp Val Pro Ser Thr Val Gln Lys Thr Pro Phe Val Thr His Pro Gly1985 1990 1995 2000Tyr Asp Thr Gly Asn Gly Ile Gln Leu Pro Gly Thr Ser Gly Gln Gln
2005 2010 2015Pro Ser Val Gly Gln Gln Met Ile Phe Glu Glu His Gly Phe Arg Arg
2020 2025 2030Thr Thr Pro Pro Thr Thr Ala Thr Pro Ile Arg His Arg Pro Arg Pro
2035 2040 2045Tyr Pro Pro Asn Val Gly Gln Glu Ala Leu Ser Gln Thr Thr Ile Ser
2050 2055 2060Trp Ala Pro Phe Gln Asp Thr Ser Glu Tyr Ile Ile Ser Cys His Pro2065 2070 2075 2080Val Gly Thr Asp Glu Glu Pro Leu Gln Phe Arg Val Pro Gly Thr Ser
2085 2090 2095Thr Ser Ala Thr Leu Thr Gly Leu Thr Arg Gly Ala Thr Tyr Asn Ile
2100 2105 2110Ile Val Glu Ala Leu Lys Asp Gln Gln Arg His Lys Val Arg Glu Glu
2115 2120 2125Val Val Thr Val Gly Asn Ser Val Asn Glu Gly Leu Asn Gln Pro Thr
2130 2135 2140Asp Asp Ser Cys Phe Asp Pro Tyr Thr Val Ser His Tyr Ala Val Gly2145 2150 2155 2160Asp Glu Trp Glu Arg Met Ser Glu Ser Gly Phe Lys Leu Leu Cys Gln
2165 2170 2175Cys Leu Gly Phe Gly Ser Gly His Phe Arg Cys Asp Ser Ser Arg Trp
2180 2185 2190Cys His Asp Asn Gly Val Asn Tyr Lys Ile Gly Glu Lys Trp Asp Arg
2195 2200 2205Gln Gly Glu Asn Gly Gln Met Met Ser Cys Thr Cys Leu Gly Asn Gly
2210 2215 2220Lys Gly Glu Phe Lys Cys Asp Pro His Glu Ala Thr Cys Tyr Asp Asp2225 2230 2235 2240Gly Lys Thr Tyr His Val Gly Glu Gln Trp Gln Lys Glu Tyr Leu Gly
2245 2250 2255Ala Ile Cys Ser Cys Thr Cys Phe Gly Gly Gln Arg Gly Trp Arg Cys
2260 2265 2270Asp Asn Cys Arg Arg Pro Gly Gly Glu Pro Ser Pro Glu Gly Thr Thr
2275 2280 2285Gly Gln Ser Tyr Asn Gln Tyr Ser Gln Arg Tyr His Gln Arg Thr Asn
2290 2295 2300Thr Asn Val Asn Cys Pro Ile Glu Cys Phe Met Pro Leu Asp Val Gln2305 2310 2315 2320Ala Asp Arg Glu Asp Ser Arg Glu
2325<210>99<211>188<212>PRT<213>人(Homo sapiens)<400>99His Gln Thr His Lys Glu Gly Gly Ser Thr His Ala Ser Ala Asp Ala1 5 10 15Trp Glu Ile Ile Glu Leu Glu Thr Glu Ile Glu Lys Phe Lys Ala Glu
20 25 30Asn Ala Ser Leu Ala Lys Leu Arg Ile Glu Arg Glu Ser Ala Leu Glu
35 40 45Lys Leu Arg Lys Glu Ile Ala Asp Phe Glu Gln Gln Lys Ala Lys Glu
50 55 60Leu Ala Arg Ile Glu Glu Phe Lys Lys Glu Glu Met Arg Lys Leu Gln65 70 75 80Lys Glu Arg Lys Val Phe Glu Lys Tyr Thr Thr Ala Ala Arg Thr Phe
85 90 95Pro Asp Lys Lys Glu Arg Glu Glu Ile Gln Thr Leu Lys Gln Gln Ile
100 105 110Ala Asp Leu Arg Glu Asp Leu Lys Arg Lys Glu Thr Lys Trp Ser Ser
115 120 125Thr His Ser Arg Leu Arg Ser Gln Ile Gln Met Leu Val Arg Glu Asn
130 135 140Thr Asp Leu Arg Glu Glu Ile Lys Val Met Glu Arg Phe Arg Leu Asp145 150 155 160Ala Trp Lys Arg Ala Glu Ala Ile Glu Ser Ser Leu Glu Val Glu Lys
165 170 175Lys Asp Lys Leu Ala Asn Thr Ser Val Arg Phe Gln
180 185<210>100<211>284<212>PRT<213>人(Homo sapiens)<400>100Met Glu Pro Gly Asn Tyr Ala Thr Leu Asp Gly Ala Lys Asp Ile Glu1 5 10 15Gly Leu Leu Gly Ala Gly Gly Gly Arg Asn Leu Val Ala His Ser Pro
20 25 30Leu Thr Ser His Pro Ala Ala Pro Thr Leu Met Pro Ala Val Asn Tyr
35 40 45Ala Pro Leu Asp Leu Pro Gly Ser Ala Glu Pro Pro Lys Gln Cys His
50 55 60Pro Cys Pro Gly Val Pro Gln Gly Thr Ser Pro Ala Pro Val Pro Tyr65 70 75 80Gly Tyr Phe Gly Gly Gly Tyr Tyr Ser Cys Arg Val Ser Arg Ser Ser
85 90 95Leu Lys Pro Cys Ala Gln Ala Ala Thr Leu Ala Ala Tyr Pro Ala Glu
100 105 110Thr Pro Thr Ala Gly Glu Glu Tyr Pro Ser Arg Pro Thr Glu Phe Ala
115 120 125Phe Tyr Pro Gly Tyr Pro Gly Thr Tyr His Ala Met Ala Ser Tyr Leu
130 135 140Asp Val Ser Val Val Gln Thr Leu Gly Ala Pro Gly Glu Pro Arg His145 150 155 160Asp Ser Leu Leu Pro Val Asp Ser Tyr Gln Ser Trp Ala Leu Ala Gly
165 170 175Gly Trp Asn Ser Gln Met Cys Cys Gln Gly Glu Gln Asn Pro Pro Gly
180 185 190Pro Phe Trp Lys Ala Ala Phe Ala Asp Ser Ser Gly Gln His Pro Pro
195 200 205Asp Ala Cys Ala Phe Arg Arg Gly Arg Lys Lys Arg Ile Pro Tyr Ser
210 215 220Lys Gly Gln Leu Arg Glu Leu Glu Arg Glu Tyr Ala Ala Asn Lys Phe225 230 235 240Ile Thr Lys Asp Lys Arg Arg Lys Ile Ser Ala Ala Thr Ser Leu Ser
245 250 255Glu Arg Gln Ile Thr Ile Trp Phe Gln Asn Arg Arg Val Lys Glu Lys
260 265 270Lys Val Leu Ala Lys Val Lys Asn Ser Ala Thr Pro
275 280<210>101<211>676<212>PRT<213>人(Homo sapiens)<400>101Met Asp Lys Tyr Asp Asp Leu Gly Leu Glu Ala Ser Lys Phe Ile Glu1 5 10 15Asp Leu Asn Met Tyr Glu Ala Ser Lys Asp Gly Leu Phe Arg Val Asp
20 25 30Lys Gly Ala Gly Asn Asn Pro Glu Phe Glu Glu Thr Arg Arg Val Phe
35 40 45Ala Thr Lys Met Ala Lys Ile His Leu Gln Gln Gln Gln Gln Gln Leu
50 55 60Leu Gln Glu Glu Thr Leu Pro Arg Gly Ser Arg Gly Pro Val Asn Gly65 70 75 80Gly Gly Arg Leu Gly Pro Gln Ala Arg Trp Glu Val Val Gly Ser Lys
85 90 95Leu Thr Val Asp Gly Ala Ala Lys Pro Pro Leu Ala Ala Ser Thr Gly
100 105 110Ala Pro Gly Ala Val Thr Thr Leu Ala Ala Gly Gln Pro Pro Tyr Pro
115 120 125Pro Gln Glu Gln Arg Ser Arg Pro Tyr Leu His Gly Thr Arg His Gly
130 135 140Ser Gln Asp Cys Gly Ser Arg Glu Ser Leu Ala Thr Ser Glu Met Ser145 150 155 160Ala Phe His Gln Pro Gly Pro Cys Glu Asp Pro Ser Cys Leu Thr His
165 170 175Gly Asp Tyr Tyr Asp Asn Leu Ser Leu Ala Ser Pro Lys Trp Gly Asp
180 185 190Lys Pro Gly Val Ser Pro Ser Ile Gly Leu Ser Val Gly Ser Gly Trp
195 200 205Pro Ser Ser Pro Gly Ser Asp Pro Pro Leu Pro Lys Pro Cys Gly Asp
210 215 220His Pro Leu Asn His Arg Gln Leu Ser Leu Ser Ser Ser Arg Ser Ser225 230 235 240Glu Gly Ser Leu Gly Gly Gln Asn Ser Gly Ile Gly Gly Arg Ser Ser
245 250 255Glu Lys Pro Thr Gly Leu Trp Ser Thr Ala Ser Ser Gln Arg Val Ser
260 265 270Pro Gly Leu Pro Ser Pro Asn Leu Glu Asn Gly Ala Pro Ala Val Gly
275 280 285Pro Val Gln Pro Arg Thr Pro Ser Val Ser Ala Pro Leu Ala Leu Ser
290 295 300Cys Pro Arg Gln Gly Gly Leu Pro Arg Ser Asn Ser Gly Leu Gly Gly305 310 315 320Glu Val Ser Gly Val Met Ser Lys Pro Asn Val Asp Pro Gln Pro Trp
325 330 335Phe Gln Asp Gly Pro Lys Ser Tyr Leu Ser Ser Ser Ala Pro Ser Ser
340 345 350Ser Pro Ala Gly Leu Asp Gly Ser Gln Gln Gly Ala Val Pro Gly Leu
355 360 365Gly Pro Lys Pro Gly Cys Thr Asp Leu Gly Thr Gly Pro Lys Leu Ser
370 375 380Pro Thr Ser Leu Val His Pro Val Met Ser Thr Leu Pro Glu Leu Ser385 390 395 400Cys Lys Glu Gly Pro Leu Gly Trp Ser Ser Asp Gly Ser Leu Gly Ser
405 410 415Val Leu Leu Asp Ser Pro Ser Ser Pro Arg Val Arg Leu Pro Cys Gln
420 425 430Pro Leu Val Pro Gly Pro Glu Leu Arg Pro Ser Ala Ala Glu Leu Lys
435 440 445Leu Glu Ala Leu Thr Gln Arg Leu Glu Arg Glu Met Asp Ala His Pro
450 455 460Lys Ala Asp Tyr Phe Gly Ala Cys Val Lys Cys Ser Lys Gly Val Phe465 470 475 480Gly Ala Gly Gln Ala Cys Gln Ala Met Gly Asn Leu Tyr His Asp Thr
485 490 495Cys Phe Thr Cys Ala Ala Cys Ser Arg Lys Leu Arg Gly Lys Ala Phe
500 505 510Tyr Phe Val Asn Gly Lys Val Phe Cys Glu Glu Asp Phe Leu Tyr Ser
515 520 525Gly Phe Gln Gln Ser Ala Asp Arg Cys Phe Leu Cys Gly His Leu Ile
530 535 540Met Asp Met Ile Leu Gln Ala Leu Gly Lys Ser Tyr His Pro Gly Cys545 550 555 560Phe Arg Cys Val Ile Cys Asn Glu Cys Leu Asp Gly Val Pro Phe Thr
565 570 575Val Asp Ser Glu Asn Lys Ile Tyr Cys Val Arg Asp Tyr His Lys Val
580 585 590Leu Ala Pro Lys Cys Ala Ala Cys Gly Leu Pro Ile Leu Pro Pro Glu
595 600 605Gly Ser Asp Glu Thr Ile Arg Val Val Ser Met Asp Arg Asp Tyr His
610 615 620Val Glu Cys Tyr His Cys Glu Asp Cys Gly Leu Glu Leu Asn Asp Glu625 630 635 640Asp Gly His Arg Cys Tyr Pro Leu Glu Asp His Leu Phe Cys His Ser
645 650 655Cys His Val Lys Arg Leu Glu Lys Arg Pro Ser Ser Thr Ala Leu His
660 665 670Gln His His Phe
675<210>102<211>296<212>PRT<213>人(Homo sapiens)<400>102Ser Thr Gly Ser Glu Phe Pro Leu Cys Thr Lys Ala Ser Pro Cys Ser1 5 10 15Ala Ala Arg Ala Gly Gly Arg Ala Leu Gly Trp Arg Leu Gln Gln Gln
20 25 30Arg Glu Thr Arg Gly Asn Pro Gly Asn Pro Gly Leu Gly Val Ala Ala
35 40 45Thr Met Thr Gly Ser Asn Met Ser Asp Ala Leu Ala Asn Ala Val Cys
50 55 60Gln Arg Cys Gln Ala Arg Phe Ser Pro Ala Glu Arg Ile Val Asn Ser65 70 75 80Asn Gly Glu Leu Tyr His Glu His Cys Phe Val Cys Ala Gln Cys Phe
85 90 95Arg Pro Phe Pro Glu Gly Leu Phe Tyr Glu Phe Glu Gly Arg Lys Tyr
100 105 110Cys Glu His Asp Phe Gln Met Leu Phe Ala Pro Cys Cys Gly Ser Cys
115 120 125Gly Glu Phe Ile Ile Gly Arg Val Ile Lys Ala Met Asn Asn Asn Trp
130 135 140His Pro Gly Cys Phe Arg Cys Glu Leu Cys Asp Val Glu Leu Ala Asp145 150 155 160Leu Gly Phe Val Lys Asn Ala Gly Arg His Leu Cys Arg Pro Cys His
165 170 175Asn Arg Glu Lys Ala Lys Gly Leu Gly Lys Tyr Ile Cys Gln Arg Cys
180 185 190His Leu Val Ile Asp Glu Gln Pro Leu Met Phe Arg Ser Asp Ala Tyr
195 200 205His Pro Asp His Phe Asn Cys Thr His Cys Gly Lys Glu Leu Thr Ala
210 215 220Glu Ala Arg Glu Leu Lys Gly Glu Leu Tyr Cys Leu Pro Cys His Asp225 230 235 240Lys Met Gly Val Pro Ile Cys Gly Ala Cys Arg Arg Pro Ile Glu Gly
245 250 255Arg Val Val Asn Ala Leu Gly Lys Gln Trp His Val Glu His Phe Val
260 265 270Cys Ala Lys Cys Glu Lys Pro Phe Leu Gly His Arg His Tyr Glu Lys
275 280 285Lys Gly Leu Ala Tyr Cys Glu Leu
290 295<210>103<211>500<212>PRT<213>人(Homo sapiens)<400>103Met Gly Ile Gly Leu Ser Ala Gln Gly Val Asn Met Asn Arg Leu Pro1 5 10 15Gly Trp Asp Lys His Ser Tyr Gly Tyr His Gly Asp Asp Gly His Ser
20 25 30Phe Cys Ser Ser Gly Thr Gly Gln Pro Tyr Gly Pro Thr Phe Thr Thr
35 40 45Gly Asp Val Ile Gly Cys Cys Val Asn Leu Ile Asn Asn Thr Cys Phe
50 55 60Tyr Thr Lys Asn Gly His Ser Leu Gly Ile Ala Phe Thr Asp Leu Pro65 70 75 80Pro Asn Leu Tyr Pro Thr Val Gly Leu Gln Thr Pro Gly Glu Val Val
85 90 95Asp Ala Asn Phe Gly Gln His Pro Phe Val Phe Asp Ile Glu Asp Tyr
100 105 110Met Arg Glu Trp Arg Thr Lys Ile Gln Ala Gln Ile Asp Arg Phe Pro
115 120 125Ile Gly Asp Arg Glu Gly Glu Trp Gln Thr Met Ile Gln Lys Met Val
130 135 140Ser Ser Tyr Leu Val His His Gly Tyr Cys Ala Thr Ala Glu Ala Phe145 150 155 160Ala Arg Ser Thr Asp Gln Thr Val Leu Glu Glu Leu Ala Ser Ile Lys
165 170 175Asn Arg Gln Arg Ile Gln Lys Leu Val Leu Ala Gly Arg Met Gly Glu
180 185 190Ala Ile Glu Thr Thr Gln Gln Leu Tyr Pro Ser Leu Leu Glu Arg Asn
195 200 205Pro Asn Leu Leu Phe Thr Leu Lys Val Arg Gln Phe Ile Glu Met Val
210 215 220Asn Gly Thr Asp Ser Glu Val Arg Cys Leu Gly Gly Arg Ser Pro Lys225 230 235 240Ser Gln Asp Ser Tyr Pro Val Ser Pro Arg Pro Phe Ser Ser Pro Ser
245 250 255Met Ser Pro Ser His Gly Met Asn Ile His Asn Leu Ala Ser Gly Lys
260 265 270Gly Ser Thr Ala His Phe Ser Gly Phe Glu Ser Cys Ser Asn Gly Val
275 280 285Ile Ser Asn Lys Ala His Gln Ser Tyr Cys His Ser Asn Lys His Gln
290 295 300Ser Ser Asn Leu Asn Val Pro Glu Leu Asn Ser Ile Asn Met Ser Arg305 310 315 320Ser Gln Gln Val Asn Asn Phe Thr Ser Asn Asp Val Asp Met Glu Thr
325 330 335Asp His Tyr Ser Asn Gly Val Gly Glu Thr Ser Ser Asn Gly Phe Leu
340 345 350Asn Gly Ser Ser Lys His Asp His Glu Met Glu Asp Cys Asp Thr Glu
355 360 365Met Glu Val Asp Ser Ser Gln Leu Arg Arg Gln Leu Cys Gly Gly Ser
370 375 380Gln Ala Ala Ile Glu Arg Met Ile His Phe Gly Arg Glu Leu Gln Ala385 390 395 400Met Ser Glu Gln Leu Arg Arg Asp Cys Gly Lys Asn Thr Ala Asn Lys
405 410 415Lys Met Leu Lys Asp Ala Phe Ser Leu Leu Ala Tyr Ser Asp Pro Trp
420 425 430Asn Ser Pro Val Gly Asn Gln Leu Asp Pro Ile Gln Arg Glu Pro Val
435 440 445Cys Ser Ala Leu Asn Ser Ala Ile Leu Glu Thr His Asn Leu Pro Lys
450 455 460Gln Pro Pro Leu Ala Leu Ala Met Gly Gln Ala Thr Gln Cys Leu Gly465 470 475 480Leu Met Ala Arg Ser Gly Ile Gly Ser Cys Ala Phe Ala Thr Val Glu
485 490 495Asp Tyr Leu His
500<210>104<211>387<212>PRT<213>人(Homo sapiens)<400>104Met Ala Thr Ser Gly Val Leu Pro Gly Gly Gly Phe Val Ala Ser Ala1 5 10 15Ala Ala Val Ala Gly Pro Glu Met Gln Thr Gly Arg Asn Asn Phe Val
20 25 30Ile Arg Arg Asn Pro Ala Asp Pro Gln Arg Ile Pro Ser Asn Pro Ser
35 40 45His Arg Ile Gln Cys Ala Ala Gly Tyr Glu Gln Ser Glu His Asn Val
50 55 60Cys Gln Asp Ile Asp Glu Cys Thr Ala Gly Thr His Asn Cys Arg Ala65 70 75 80Asp Gln Val Cys Ile Asn Leu Arg Gly Ser Phe Ala Cys Gln Cys Pro
85 90 95Pro Gly Tyr Gln Lys Arg Gly Glu Gln Cys Val Asp Ile Asp Glu Cys
100 105 110Thr Ile Pro Pro Tyr Cys His Gln Arg Cys Val Asn Thr Pro Gly Ser
115 120 125Phe Tyr Cys Gln Cys Ser Pro Gly Phe Gln Leu Ala Ala Asn Asn Tyr
130 135 140Thr Cys Val Asp Ile Asn Glu Cys Asp Ala Ser Asn Gln Cys Ala Gln145 150 155 160Gln Cys Tyr Asn Ile Leu Gly Ser Phe Ile Cys Gln Cys Asn Gln Gly
165 170 175Tyr Glu Leu Ser Ser Asp Arg Leu Asn Cys Glu Asp Ile Asp Glu Cys
180 185 190Arg Thr Ser Ser Tyr Leu Cys Gln Tyr Gln Cys Val Asn Glu Pro Gly
195 200 205Lys Phe Ser Cys Met Cys Pro Gln Gly Tyr Gln Val Val Arg Ser Arg
210 215 220Thr Cys Gln Asp Ile Asn Glu Cys Glu Thr Thr Asn Glu Cys Arg Glu225 230 235 240Asp Glu Met Cys Trp Asn Tyr His Gly Gly Phe Arg Cys Tyr Pro Arg
245 250 255Asn Pro Cys Gln Asp Pro Tyr Ile Leu Thr Pro Glu Asn Arg Cys Val
260 265 270Cys Pro Val Ser Asn Ala Met Cys Arg Glu Leu Pro Gln Ser Ile Val
275 280 285Tyr Lys Tyr Met Ser Ile Arg Ser Asp Arg Ser Val Pro Ser Asp Ile
290 295 300Phe Gln Ile Gln Ala Thr Thr Ile Tyr Ala Asn Thr Ile Asn Thr Phe305 310 315 320Arg Ile Lys Ser Gly Asn Glu Asn Gly Glu Phe Tyr Leu Arg Gln Thr
325 330 335Ser Pro Val Ser Ala Met Leu Val Leu Val Lys Ser Leu Ser Gly Pro
340 345 350Arg Glu His Ile Val Asp Leu Glu Met Leu Thr Val Ser Ser Ile Gly
355 360 365Thr Phe Arg Thr Ser Ser Val Leu Arg Leu Thr Ile Ile Val Gly Pro
370 375 380Phe Ser Phe385<210>105<211>531<212>PRT<213>人(Homo sapiens)<400>105Met Ser Lys Pro His Ser Glu Ala Gly Thr Ala Phe Ile Gln Thr Gln1 5 10 15Gln Leu His Ala Ala Met Ala Asp Thr Phe Leu Glu His Met Cys Arg
20 25 30Leu Asp Ile Asp Ser Pro Pro Ile Thr Ala Arg Asn Thr Gly Ile Ile
35 40 45Cys Thr Ile Gly Pro Ala Ser Arg Ser Val Glu Thr Leu Lys Glu Met
50 55 60Ile Lys Ser Gly Met Asn Val Ala Arg Leu Asn Phe Ser His Gly Thr65 70 75 80His Glu Tyr His Ala Glu Thr Ile Lys Asn Val Arg Thr Ala Thr Glu
85 90 95Ser Phe Ala Ser Asp Pro Tyr Leu Tyr Arg Pro Val Ala Val Ala Leu
100 105 110Asp Thr Lys Gly Pro Glu Ile Arg Thr Gly Leu Ile Lys Gly Ser Gly
115 120 125Thr Ala Glu Leu Glu Leu Lys Lys Gly Ala Thr Leu Lys Ile Thr Leu
130 135 140Asp Asn Ala Tyr Met Glu Lys Cys Asp Glu Asn Ile Leu Trp Leu Asp145 150 155 160Tyr Lys Asn Ile Cys Lys Val Val Glu Val Gly Ser Lys Ile Tyr Val
165 170 175Asp Asp Gly Leu Ile Ser Leu Gln Val Lys Gln Lys Gly Ala Asp Phe
180 185 190Leu Val Thr Glu Val Glu Asn Gly Gly Ser Leu Gly Ser Lys Lys Gly
195 200 205Val Asn Leu Pro Gly Ala Ala Val Asp Leu Pro Ala Val Ser Glu Lys
210 215 220Asp Ile Gln Asp Leu Lys Phe Gly Val Glu Gln Asp Val Asp Met Val225 230 235 240Phe Ala Ser Phe Ile Arg Lys Ala Ser Asp Val His Glu Val Arg Lys
245 250 255Val Leu Gly Glu Lys Gly Lys Asn Ile Lys Ile Ile Ser Lys Ile Glu
260 265 270Asn His Glu Gly Val Arg Arg Phe Asp Glu Ile Leu Glu Ala Ser Asp
275 280 285Gly Ile Met Val Ala Arg Gly Asp Leu Gly Ile Glu Ile Pro Ala Glu
290 295 300Lys Val Phe Leu Ala Gln Lys Met Met Ile Gly Arg Cys Asn Arg Ala305 310 315 320Gly Lys Pro Val Ile Cys Ala Thr Gln Met Leu Glu Ser Met Ile Lys
325 330 335Lys Pro Arg Pro Thr Arg Ala Glu Gly Ser Asp Val Ala Asn Ala Val
340 345 350Leu Asp Gly Ala Asp Cys Ile Met Leu Ser Gly Glu Thr Ala Lys Gly
355 360 365Asp Tyr Pro Leu Glu Ala Val Arg Met Gln His Leu Ile Ala Arg Glu
370 375 380Ala Glu Ala Ala Ile Tyr His Leu Gln Leu Phe Glu Glu Leu Arg Arg385 390 395 400Leu Ala Pro Ile Thr Ser Asp Pro Thr Glu Ala Thr Ala Val Gly Ala
405 410 415Val Glu Ala Ser Phe Lys Cys Cys Ser Gly Ala Ile Ile Val Leu Thr
420 425 430Lys Ser Gly Arg Ser Ala His Gln Val Ala Arg Tyr Arg Pro Arg Ala
435 440 445Pro Ile Ile Ala Val Thr Arg Asn Pro Gln Thr Ala Arg Gln Ala His
450 455 460Leu Tyr Arg Gly Ile Phe Pro Val Leu Cys Lys Asp Pro Val Gln Glu465 470 475 480Ala Trp Ala Glu Asp Val Asp Leu Arg Val Asn Phe Ala Met Asn Val
485 490 495Gly Lys Ala Arg Gly Phe Phe Lys Lys Gly Asp Val Val Ile Val Leu
500 505 510Thr Gly Trp Arg Pro Gly Ser Gly Phe Thr Asn Thr Met Arg Val Val
515 520 525Pro Val Pro
530<210>106<211>480<212>PRT<213>人(Homo sapiens)<400>106Met Ala Ala Arg Cys Ser Thr Arg Trp Leu Leu Val Val Val Gly Thr1 5 10 15Pro Arg Leu Pro Ala Ile Ser Gly Arg Gly Ala Arg Pro Pro Arg Glu
20 25 30Gly Val Val Gly Ala Trp Leu Ser Arg Lys Leu Ser Val Pro Ala Phe
35 40 45Ala Ser Ser Leu Thr Ser Cys Gly Pro Arg Ala Leu Leu Thr Leu Arg
50 55 60Pro Gly Val Ser Leu Thr Gly Thr Lys His Asn Pro Phe Ile Cys Thr65 70 75 80Ala Ser Phe His Thr Ser Ala Pro Leu Ala Lys Glu Asp Tyr Tyr Gln
85 90 95Ile Leu Gly Val Pro Arg Asn Ala Ser Gln Lys Glu Ile Lys Lys Ala
100 105 110Tyr Tyr Gln Leu Ala Lys Lys Tyr His Pro Asp Thr Asn Lys Asp Asp
115 120 125Pro Lys Ala Lys Glu Lys Phe Ser Gln Leu Ala Glu Ala Tyr Glu Val
130 135 140Leu Ser Asp Glu Val Lys Arg Lys Gln Tyr Asp Ala Tyr Gly Ser Ala145 150 155 160Gly Phe Asp Pro Gly Ala Ser Gly Ser Gln His Ser Tyr Trp Lys Gly
165 170 175Gly Pro Thr Val Asp Pro Glu Glu Leu Phe Arg Lys Ile Phe Gly Glu
180 185 190Phe Ser Ser Ser Ser Phe Gly Asp Phe Gln Thr Val Phe Asp Gln Pro
195 200 205Gln Glu Tyr Phe Met Glu Leu Thr Phe Asn Gln Ala Ala Lys Gly Val
210 215 220Asn Lys Glu Phe Thr Val Asn Ile Met Asp Thr Cys Glu Arg Cys Asn225 230 235 240Gly Lys Gly Asn Glu Pro Gly Thr Lys Val Gln His Cys His Tyr Cys
245 250 255Gly Gly Ser Gly Met Glu Thr Ile Asn Thr Gly Pro Phe Val Met Arg
260 265 270Ser Thr Cys Arg Arg Cys Gly Gly Arg Gly Ser Ile Ile Ile Ser Pro
275 280 285Cys Val Val Cys Arg Gly Ala Gly Gln Ala Lys Gln Lys Lys Arg Val
290 295 300Met Ile Pro Val Pro Ala Gly Val Glu Asp Gly Gln Thr Val Arg Met305 310 315 320Pro Val Gly Lys Arg Glu Ile Phe Ile Thr Phe Arg Val Gln Lys Ser
325 330 335Pro Val Phe Arg Arg Asp Gly Ala Asp Ile His Ser Asp Leu Phe Ile
340 345 350Ser Ile Ala Gln Ala Leu Leu Gly Gly Thr Ala Arg Ala Gln Gly Leu
355 360 365Tyr Glu Thr Ile Asn Val Thr Ile Pro Pro Gly Thr Gln Thr Asp Gln
370 375 380Lys Ile Arg MeT Gly Gly Lys Gly Ile Pro Arg Ile Asn Ser Tyr Gly385 390 395 400Tyr Gly Asp His Tyr Ile His Ile Lys Ile Arg Val Pro Lys Arg Leu
405 410 415Thr Ser Arg Gln Gln Ser Leu Ile Leu Ser Tyr Ala Glu Asp Glu Thr
420 425 430Asp Val Glu Gly Thr Val Asn Gly Val Thr Leu Thr Ser Ser Gly Gly
435 440 445Ser Thr MeT Asp Ser Ser Ala Gly Ser Lys Ala Arg Arg Glu Ala Gly
450 455 460Glu Asp Glu Glu Gly Phe Leu Ser Lys Leu Lys Lys MeT Phe Thr Ser465 470 475 480<210>107<211>572<212>PRT<213>人(Homo sapiens)<400>107Met Ala Ala Pro Arg Pro Ser Pro Ala Ile Ser Val Ser Val Ser Ala1 5 10 15Pro Ala Phe Tyr Ala Pro Gln Lys Lys Phe Gly Pro Val Val Ala Pro
20 25 30Lys Pro Lys Val Asn Pro Phe Arg Pro Gly Asp Ser Glu Pro Pro Pro
35 40 45Ala Pro Gly Ala Gln Arg Ala Gln Met Gly Arg Val Gly Glu Ile Pro
50 55 60Pro Pro Pro Pro Glu Asp Phe Pro Leu Pro Pro Pro Pro Leu Ala Gly65 70 75 80Asp Gly Asp Asp Ala Glu Gly Ala Leu Gly Gly Ala Phe Pro Pro Pro
85 90 95Pro Pro Pro Ile Glu Glu Ser Phe Pro Pro Ala Pro Leu Glu Glu Glu
100 105 110Ile Phe Pro Ser Pro Pro Pro Pro Pro Glu Glu Glu Gly Gly Pro Glu
115 120 125Ala Pro Ile Pro Pro Pro Pro Gln Pro Arg Glu Lys Val Ser Ser Ile
130 135 140Asp Leu Glu Ile Asp Ser Leu Ser Ser Leu Leu Asp Asp Met Thr Lys145 150 155 160Asn Asp Pro Phe Lys Ala Arg Val Ser Ser Gly Tyr Val Pro Pro Pro
165 170 175Val Ala Thr Pro Phe Ser Ser Lys Ser Ser Thr Lys Pro Ala Ala Gly
180 185 190Gly Thr Ala Pro Leu Pro Pro Trp Lys Ser Pro Ser Ser Ser Gln Pro
195 200 205Leu Pro Gln Val Pro Ala Pro Ala Gln Ser Gln Thr Gln Phe His Val
210 215 220Gln Pro Gln Pro Gln Pro Lys Pro Gln Val Gln Leu His Val Gln Ser225 230 235 240Gln Thr Gln Pro Val Ser Leu Ala Asn Thr Gln Pro Arg Gly Pro Pro
245 250 255Ala Ser Ser Pro Ala Pro Ala Pro Lys Phe Ser Pro Val Thr Pro Lys
260 265 270Phe Thr Pro Val Ala Ser Lys Phe Ser Pro Gly Ala Pro Gly Gly Ser
275 280 285Gly Ser Gln Pro Asn Gln Lys Leu Gly His Pro Glu Ala Leu Ser Ala
290 295 300Gly Thr Gly Ser Pro Gln Pro Pro Ser Phe Thr Tyr Ala Gln Gln Arg305 310 315 320Glu Lys Pro Arg Val Gln Glu Lys Gln His Pro Val Pro Pro Pro Ala
325 330 335Gln Asn Gln Asn Gln Val Arg Ser Pro Gly Ala Pro Gly Pro Leu Thr
340 345 350Leu Lys Glu Val Glu Glu Leu Glu Gln Leu Thr Gln Gln Leu Met Gln
355 360 365Asp Met Glu His Pro Gln Arg Gln Asn Val Ala Val Asn Glu Leu Cys
370 375 380Gly Arg Cys His Gln Pro Leu Ala Arg Ala Gln Pro Ala Val Arg Ala385 390 395 400Leu Gly Gln Leu Phe His Ile Ala Cys Phe Thr Cys His Gln Cys Ala
405 410 415Gln Gln Leu Gln Gly Gln Gln Phe Tyr Ser Leu Glu Gly Ala Pro Tyr
420 425 430Cys Glu Gly Cys Tyr Thr Asp Thr Leu Glu Lys Cys Asn Thr Cys Gly
435 440 445Glu Pro Ile Thr Asp Arg Met Leu Arg Ala Thr Gly Lys Ala Tyr His
450 455 460Pro His Cys Phe Thr Cys Val Val Cys Ala Arg Pro Leu Glu Gly Thr465 470 475 480Ser Phe Ile Val Asp Gln Ala Asn Arg Pro His Cys Val Pro Asp Tyr
485 490 495His Lys Gln Tyr Ala Pro Arg Cys Ser Val Cys Ser Glu Pro Ile Met
500 505 510Pro Glu Pro Gly Arg Asp Glu Thr Val Arg Val Val Ala Leu Asp Lys
5l5 520 525Asn Phe His Met Lys Cys Tyr Lys Cys Glu Asp Cys Gly Lys Pro Leu
530 535 540Ser Ile Glu Ala Asp Asp Asn Gly Cys Phe Pro Leu Asp Gly His Val545 550 555 560Leu Cys Arg Lys Cys His Thr Ala Arg Ala Gln Thr
565 570<210>108<211>2861<212>PRT<213>人(Homo sapiens)<400>108Met Lys Ala Met Asp Val Leu Pro Ile Leu Lys Glu Lys Val Ala Tyr1 5 10 15Leu Ser Gly Gly Arg Asp Lys Arg Gly Gly Pro Ile Leu Thr Phe Pro
20 25 30Ala Arg Ser Asn His Asp Arg Ile Arg Gln Glu Asp Leu Arg Arg Leu
35 40 45Ile Ser Tyr Leu Ala Cys Ile Pro Ser Glu Glu Val Cys Lys Arg Gly
50 55 60Phe Thr Val Ile Val Asp Met Arg Gly Ser Lys Trp Asp Ser Ile Lys65 70 75 80Pro Leu Leu Lys Ile Leu Gln Glu Ser Phe Pro Cys Cys Ile His Val
85 90 95Ala Leu Ile Ile Lys Pro Asp Asn Phe Trp Gln Lys Gln Arg Thr Asn
100 105 110Phe Gly Ser Ser Lys Phe Glu Phe Glu Thr Asn Met Val Ser Leu Glu
115 120 125Gly Leu Thr Lys Val Val Asp Pro Ser Gln Leu Thr Pro Glu Phe Asp
130 135 140Gly Cys Leu Glu Tyr Asn His Glu Glu Trp Ile Glu Ile Arg Val Ala145 150 155 160Phe Glu Asp Tyr Ile Ser Asn Ala Thr His Met Leu Ser Arg Leu Glu
165 170 175Glu Leu Gln Asp Ile Leu Ala Lys Lys Glu Leu Pro Gln Asp Leu Glu
180 185 190Gly Ala Arg Asn Met Ile Glu Glu His Ser Gln Leu Lys Lys Lys Val
195 200 205Ile Lys Ala Pro Ile Glu Asp Leu Asp Leu Glu Gly Gln Lys Leu Leu
210 215 220Gln Arg Ile Gln Ser Ser Glu Ser Phe Pro Lys Lys Asn Ser Gly Ser225 230 235 240Gly Asn Ala Asp Leu Gln Asn Leu Leu Pro Lys Val Ser Thr Met Leu
245 250 255Asp Arg Leu His Ser Thr Arg Gln His Leu His Gln Met Trp His Val
260 265 270Arg Lys Leu Lys Leu Asp Gln Cys Phe Gln Leu Arg Leu Phe Glu Gln
275 280 285Asp Ala Glu Lys Met Phe Asp Trp Ile Thr His Asn Lys Gly Leu Phe
290 295 300Leu Asn Ser Tyr Thr Glu Ile Gly Thr Ser His Pro His Ala Met Glu305 3l0 315 320Leu Gln Thr Gln His Asn His Phe Ala Met Asn Cys Met Asn Val Tyr
325 330 335Val Asn Ile Asn Arg Ile Met Ser Val Ala Asn Arg Leu Val Glu Ser
340 345 350Gly His Tyr Ala Ser Gln Gln Ile Arg Gln Ile Ala Ser Gln Leu Glu
355 360 365Gln Glu Trp Lys Ala Phe Ala Ala Ala Leu Asp Glu Arg Ser Thr Leu
370 375 380Leu Asp Met Ser Ser Ile Phe His Gln Lys Ala Glu Lys Tyr Met Ser385 390 395 400Asn Val Asp Ser Trp Cys Lys Ala Cys Gly Glu Val Asp Leu Pro Ser
405 410 415Glu Leu Gln Asp Leu Glu Asp Ala Ile His His His Gln Gly Ile Tyr
420 425 430Glu His Ile Thr Leu Ala Tyr Ser Glu Val Ser Gln Asp Gly Lys Ser
435 440 445Leu Leu Asp Lys Leu Gln Arg Pro Leu Thr Pro Gly Ser Ser Asp Ser
450 455 460Leu Thr Ala Ser Ala Asn Tyr Ser Lys Ala Val His His Val Leu Asp465 470 475 480Val Ile His Glu Val Leu His His Gln Arg His Val Arg Thr Ile Trp
485 490 495Gln His Arg Lys Val Arg Leu His Gln Arg Leu Gln Leu Cys Val Phe
500 505 510Gln Gln Glu Val Gln Gln Val Leu Asp Trp Ile Glu Asn His Gly Glu
515 520 525Ala Phe Leu Ser Lys His Thr Gly Val Gly Lys Ser Leu His Arg Ala
530 535 540Arg Ala Leu Gln Lys Arg His Glu Asp Phe Glu Glu Val Ala Gln Asn545 550 555 560Thr Tyr Thr Asn Ala Asp Lys Leu Leu Glu Ala Ala Glu Gln Leu Ala
565 570 575Gln Thr Gly Glu Cys Asp Pro Glu Glu Ile Tyr Gln Ala Ala His Gln
580 585 590Leu Glu Asp Arg Ile Gln Asp Phe Val Arg Arg Val Glu Gln Arg Lys
595 600 605Ile Leu Leu Asp Met Ser Val Ser Phe His Thr His Val Lys Glu Leu
610 615 620Trp Thr Trp Leu Glu Glu Leu Gln Lys Glu Leu Leu Asp Asp Val Tyr625 630 635 640Ala Glu Ser Val Glu Ala Val Gln Asp Leu Ile Lys Arg Phe Gly Gln
645 650 655Gln Gln Gln Thr Thr Leu Gln Val Thr Val Asn Val Ile Lys Glu Gly
660 665 670Glu Asp Leu Ile Gln Gln Leu Arg Asp Ser Ala Ile Ser Ser Asn Lys
675 680 685Thr Pro His Asn Ser Ser Ile Asn His Ile Glu Thr Val Leu Gln Gln
690 695 700Leu Asp Glu Ala Gln Ser Gln Met Glu Glu Leu Phe Gln Glu Arg Lys705 710 715 720Ile Lys Leu Glu Leu Phe Leu His Val Arg Ile Phe Glu Arg Asp Ala
725 730 735Ile Asp Ile Ile Ser Asp Leu Glu Ser Trp Asn Asp Glu Leu Ser Gln
740 745 750Gln Met Asn Asp Phe Asp Thr Glu Asp Leu Thr Ile Ala Glu Gln Arg
755 760 765Leu Gln His His Ala Asp Lys Ala Leu Thr Met Asn Asn Leu Thr Phe
770 775 780Asp Val Ile His Gln Gly Gln Asp Leu Leu Gln Tyr Val Asn Glu Val785 790 795 800Gln Ala Ser Gly Val Glu Leu Leu Cys Asp Arg Asp Val Asp Met Ala
805 810 815Thr Arg Val Gln Asp Leu Leu Glu Phe Leu His Glu Lys Gln Gln Glu
820 825 830Leu Asp Leu Ala Ala Glu Gln His Arg Lys His Leu Glu Gln Cys Val
835 840 845Gln Leu Arg His Leu Gln Ala Glu Val Lys Gln Val Leu Gly Trp Ile
850 855 860Arg Asn Gly Glu Ser Met Leu Asn Ala Gly Leu Ile Thr Ala Ser Ser865 870 875 880Leu Gln Glu Ala Glu Gln Leu Gln Arg Glu His Glu Gln Phe Gln His
885 890 895Ala Ile Glu Lys Thr His Gln Ser Ala Leu Gln Val Gln Gln Lys Ala
900 905 910Glu Ala Met Leu Gln Ala Asn His Tyr Asp Met Asp Met Ile Arg Asp
915 920 925Cys Ala Glu Lys Val Ala Ser His Trp Gln Gln Leu Met Leu Lys Met
930 935 940Glu Asp Arg Leu Lys Leu Val Asn Ala Ser Val Ala Phe Tyr Lys Thr945 950 955 960Ser Glu Gln Val Cys Ser Val Leu Glu Ser Leu Glu Gln Glu Tyr Lys
965 970 975Arg Glu Glu Asp Trp Cys Gly Gly Ala Asp Lys Leu Gly Pro Asn Ser
980 985 990Glu Thr Asp His Val Thr Pro Met Ile Ser Lys His Leu Glu Gln Lys
995 1000 1005Glu Ala Phe Leu Lys Ala Cys Thr Leu Ala Arg Arg Asn Ala Asp Val
1010 1015 1020Phe Leu Lys Tyr Leu His Arg Asn Ser Val Asn Met Pro Gly Met Val1025 1030 1035 1040Thr His Ile Lys Ala Pro Glu Gln Gln Val Lys Asn Ile Leu Asn Glu
1045 1050 1055Leu Phe Gln Arg Glu Asn Arg Val Leu His Tyr Trp Thr Met Arg Lys
1060 1065 1070Arg Arg Leu Asp Gln Cys Gln Gln Tyr Val Val Phe Glu Arg Ser Ala
1075 1080 1085Lys Gln Ala Leu Glu Trp Ile His Asp Asn Gly Glu Phe Tyr Leu Ser
1090 1095 1100Thr His Thr Ser Thr Gly Ser Ser Ile Gln His Thr Gln Glu Leu Leu1105 1110 1115 1120Lys Glu His Glu Glu Phe Gln Ile Thr Ala Lys Gln Thr Lys Glu Arg
1125 1130 1135Val Lys Leu Leu Ile Gln Leu Ala Asp Gly Phe Cys Glu Lys Gly His
1140 1145 1150Ala His Ala Ala Glu Ile Lys Lys Cys Val Thr Ala Val Asp Lys Arg
1155 1160 1165Tyr Arg Asp Phe Ser Leu Arg Met Glu Lys Tyr Arg Thr Ser Leu Glu
1170 1175 1180Lys Ala Leu Gly Ile Ser Ser Asp Ser Asn Lys Ser Ser Lys Ser Leu1185 1190 1195 1200Gln Leu Asp Ile Ile Pro Ala Ser Ile Pro Gly Ser Glu Val Lys Leu
1205 1210 1215Arg Asp Ala Ala His Glu Leu Asn Glu Glu Lys Arg Lys Ser Ala Arg
1220 1225 1230Arg Lys Glu Phe Ile Met Ala Glu Leu Ile Gln Thr Glu Lys Ala Tyr
1235 1240 1245Val Arg Asp Leu Arg Glu Cys Met Asp Thr Tyr Leu Trp Glu Met Thr
1250 1255 1260Ser Gly Val Glu Glu Ile Pro Pro Gly Ile Val Asn Lys Glu Leu Ile1265 1270 1275 1280Ile Phe Gly Asn Met Gln Glu Ile Tyr Glu Phe His Asn Asn Ile Phe
1285 1290 1295Leu Lys Glu Leu Glu Lys Tyr Glu Gln Leu Pro Glu Asp Val Gly His
1300 1305 1310Cys Phe Val Thr Trp Ala Asp Lys Phe Gln Met Tyr Val Thr Tyr Cys
1315 1320 1325Lys Asn Lys Pro Asp Ser Thr Gln Leu Ile Leu Glu His Ala Gly Ser
1330 1335 1340Tyr Phe Asp Glu Ile Gln Gln Arg His Gly Leu Ala Asn Ser Ile Ser1345 1350 1355 1360Ser Tyr Leu Ile Lys Pro Val Gln Arg Ile Thr Lys Tyr Gln Leu Leu
1365 1370 1375Leu Lys Glu Leu Leu Thr Cys Cys Glu Glu Gly Lys Gly Glu Ile Lys
1380 1385 1390Asp Gly Leu Glu Val Met Leu Ser Val Pro Lys Arg Ala Asn Asp Ala
1395 1400 1405Met His Leu Ser Met Leu Glu Gly Phe Asp Glu Asn Ile Glu Ser Gln
1410 1415 1420Gly Glu Leu Ile Leu Gln Glu Ser Phe Gln Val Trp Asp Pro Lys Thr1425 1430 1435 1440Leu Ile Arg Lys Gly Arg Glu Arg His Leu Phe Leu Phe Glu Met Ser
1445 1450 1455Leu Val Phe Ser Lys Glu Val Lys Asp Ser Ser Gly Arg Ser Lys Tyr
1460 1465 1470Leu Tyr Lys Ser Lys Leu Phe Thr Ser Glu Leu Gly Val Thr Glu His
1475 1480 1485Val Glu Gly Asp Pro Cys Lys Phe Ala Leu Trp Val Gly Arg Thr Pro
1490 1495 1500Thr Ser Asp Asn Lys Ile Val Leu Lys Ala Ser Ser Ile Glu Asn Lys1505 1510 1515 1520Gln Asp Trp Ile Lys His Ile Arg Glu Val Ile Gln Glu Arg Thr Ile
1525 1530 1535His Leu Lys Gly Ala Leu Lys Glu Pro Ile His Ile Pro Lys Thr Ala
1540 1545 1550Pro Ala Thr Arg Gln Lys Gly Arg Arg Asp Gly Glu Asp Leu Asp Ser
1555 1560 1565Gln Gly Asp Gly Ser Ser Gln Pro Asp Thr Ile Ser Ile Ala Ser Arg
1570 1575 1580Thr Ser Gln Asn Thr Leu Asp Ser Asp Lys Leu Ser Gly Gly Cys Glu1585 1590 1595 1600Leu Thr Val Val Ile His Asp Phe Thr Ala Cys Asn Ser Asn Glu Leu
1605 1610 1615Thr Ile Arg Arg Gly Gln Thr Val Glu Val Leu Glu Arg Pro His Asp
1620 1625 1630Lys Pro Asp Trp Cys Leu Val Arg Thr Thr Asp Arg Ser Pro Ala Ala
1635 1640 1645Glu Gly Leu Val Pro Cys Gly Ser Leu Cys Ile Ala His Ser Arg Ser
1650 1655 1660Ser Met Glu Met Glu Gly Ile Phe Asn His Lys Asp Ser Leu Ser Val1665 1670 1675 1680Ser Ser Asn Asp Ala Ser Pro Pro Ala Ser Val Ala Ser Leu Gln Pro
1685 1690 1695His Met Ile Gly Ala Gln Ser Ser Pro Gly Pro Lys Arg Pro Gly Asn
1700 1705 1710Thr Leu Arg Lys Trp Leu Thr Ser Pro Val Arg Arg Leu Ser Ser Gly
1715 1720 1725Lys Ala Asp Gly His Val Lys Lys Leu Ala His Lys His Lys Lys Ser
1730 1735 1740Arg Glu Val Arg Lys Ser Ala Asp Ala Gly Ser Gln Lys Asp Ser Asp1745 1750 1755 1760Asp Ser Ala Ala Thr Pro Gln Asp Glu Thr Val Glu Glu Arg Gly Arg
1765 1770 1775Asn Glu Gly Leu Ser Ser Gly Thr Leu Ser Lys Ser Ser Ser Ser Gly
1780 1785 1790Met Gln Ser Cys Gly Glu Glu Glu Gly Glu Glu Gly Ala Asp Ala Val
1795 1800 1805Pro Leu Pro Pro Pro Met Ala Ile Gln Gln His Ser Leu Leu Gln Pro
1810 1815 1820Asp Ser Gln Asp Asp Lys Ala Ser Ser Arg Leu Leu Val Arg Pro Thr1825 1830 1835 1840Ser Ser Glu Thr Pro Ser Ala Ala Glu Leu Val Ser Ala Ile Glu Glu
1845 1850 1855Leu Val Lys Ser Lys Met Ala Leu Glu Asp Arg Pro Ser Ser Leu Leu
1860 1865 1870Val Asp Gln Gly Asp Ser Ser Ser Pro Ser Phe Asn Pro Ser Asp Asn
1875 1880 1885Ser Leu Leu Ser Ser Ser Ser Pro Ile Asp Glu Met Glu Glu Arg Lys
1890 1895 1900Ser Ser Ser Leu Lys Arg Arg His Tyr Val Leu Gln Glu Leu Val Glu1905 1910 1915 1920Thr Glu Arg Asp Tyr Val Arg Asp Leu Gly Tyr Val Val Glu Gly Tyr
1925 1930 1935Met Ala Leu Met Lys Glu Asp Gly Val Pro Asp Asp Met Lys Gly Lys
1940 1945 1950Asp Lys Ile Val Phe Gly Asn Ile His Gln Ile Tyr Asp Trp His Arg
1955 1960 1965Asp Phe Phe Leu Gly Glu Leu Glu Lys Cys Leu Glu Asp Pro Glu Lys
1970 1975 1980Leu Gly Ser Leu Phe Val Lys His Glu Arg Arg Leu His Met Tyr Ile1985 1990 1995 2000Ala Tyr Cys Gln Asn Lys Pro Lys Ser Glu His Ile Val Ser Glu Tyr
2005 2010 2015Ile Asp Thr Phe Phe Glu Asp Leu Lys Gln Arg Leu Gly His Arg Leu
2020 2025 2030Gln Leu Thr Asp Leu Leu Ile Lys Pro Val Gln Arg Ile Met Lys Tyr
2035 2040 2045Gln Leu Leu Leu Lys Asp Phe Leu Lys Tyr Ser Lys Lys Ala Ser Leu
2050 2055 2060Asp Thr Ser Glu Leu Glu Arg Ala Val Glu Val Met Cys Ile Val Pro2065 2070 2075 2080Arg Arg Cys Asn Asp Met Met Asn Val Gly Arg Leu Gln Gly Phe Asp
2085 2090 2095Gly Lys Ile Val Ala Gln Gly Lys Leu Leu Leu Gln Asp Thr Phe Leu
2100 2105 2110Val Thr Asp Gln Asp Ala Gly Leu Leu Pro Arg Cys Arg Glu Arg Arg
2115 2120 2125Ile Phe Leu Phe Glu Gln Ile Val Ile Phe Ser Glu Pro Leu Asp Lys
2130 2135 2140Lys Lys Gly Phe Ser Met Pro Gly Phe Leu Phe Lys Asn Ser Ile Lys2145 2150 2155 2160Val Ser Cys Leu Cys Leu Glu Glu Asn Val Glu Asn Asp Pro Cys Lys
2165 2170 2175Phe Ala Leu Thr Ser Arg Thr Gly Asp Val Val Glu Thr Phe Ile Leu
2180 2185 2190His Ser Ser Ser Pro Ser Val Arg Gln Thr Trp Ile His Glu Ile Asn
2195 2200 2205Gln Ile Leu Glu Asn Gln Arg Asn Phe Leu Asn Ala Leu Thr Ser Pro
2210 2215 2220Ile Glu Tyr Gln Arg Asn His Ser Gly Gly Gly Gly Gly Gly Gly Ser2225 2230 2235 2240Gly Ala Ala Ala Gly Val Gly Ala Ala Ala Ala Ala Gly Pro Pro Val
2245 2250 2255Ala Ala Ala Ala Thr Val Ala Ala Pro Ala Ala Ala Ala Ala Pro Pro
2260 2265 2270Ala Arg Ala Gly Ala Gly Pro Pro Gly Ser Pro Ser Leu Ser Asp Thr
2275 2280 2285Thr Pro Pro Cys Trp Ser Pro Leu Gln Pro Arg Ala Arg Gln Arg Gln
2290 2295 2300Thr Arg Cys Gln Ser Glu Ser Ser Ser Ser Ser Asn Ile Ser Thr Met2305 2310 2315 2320Leu Val Thr His Asp Tyr Thr Ala Val Lys Glu Asp Glu Ile Asn Val
2325 2330 2335Tyr Gln Gly Glu Val Val Gln Ile Leu Ala Ser Asn Gln Gln Asn Met
2340 2345 2350Phe Leu Val Phe Arg Ala Ala Thr Asp Gln Cys Pro Ala Ala Glu Gly
2355 2360 2365Trp Ile Pro Gly Phe Val Leu Gly His Thr Ser Ala Val Ile Val Glu
2370 2375 2380Asn Pro Asp Gly Thr Leu Lys Lys Ser Thr Ser Trp His Thr Ala Leu2385 2390 2395 2400Arg Leu Arg Lys Lys Ser Glu Lys Lys Asp Lys Asp Gly Lys Arg Glu
2405 2410 2415Gly Lys Leu Glu Asn Gly Tyr Arg Lys Ser Arg Glu Gly Leu Ser Asn
2420 2425 2430Lys Val Ser Val Lys Leu Leu Asn Pro Asn Tyr Ile Tyr Asp Val Pro
2435 2440 2445Pro Glu Phe Val Ile Pro Leu Ser Glu Val Thr Cys Glu Thr Gly Glu
2450 2455 2460Thr Val Val Leu Arg Cys Arg Val Cys Gly Arg Pro Lys Ala Ser Ile2465 2470 2475 2480Thr Trp Lys Gly Pro Glu His Asn Thr Leu Asn Asn Asp Gly His Tyr
2485 2490 2495Ser Ile Ser Tyr Ser Asp Leu Gly Glu Ala Thr Leu Lys Ile Val Gly
2500 2505 2510Val Thr Thr Glu Asp Asp Gly Ile Tyr Thr Cys Ile Ala Val Asn Asp
2515 2520 2525Met Gly Ser Ala Ser Ser Ser Ala Ser Leu Arg Val Leu Gly Pro Gly
2530 2535 2540Met Asp Gly Ile Met Val Thr Trp Lys Asp Asn Phe Asp Ser Phe Tyr2545 2550 2555 2560Ser Glu Val Ala Glu Leu Gly Arg Gly Arg Phe Ser Val Val Lys Lys
2565 2570 2575Cys Asp Gln Lys Gly Thr Lys Arg Ala Val Ala Thr Lys Phe Val Asn
2580 2585 2590Lys Lys Leu Met Lys Arg Asp Gln Val Thr His Glu Leu Gly Ile Leu
2595 2600 2605Gln Ser Leu Gln His Pro Leu Leu Val Gly Leu Leu Asp Thr Phe Glu
2610 2615 2620Thr Pro Thr Ser Tyr Ile Leu Val Leu Glu Met Ala Asp Gln Gly Arg2625 2630 2635 2640Leu Leu Asp Cys Val Val Arg Trp Gly Ser Leu Thr Glu Gly Lys Ile
2645 2650 2655Arg Ala His Leu Gly Glu Val Leu Glu Ala Val Arg Tyr Leu His Asn
2660 2665 2670Cys Arg Ile Ala His Leu Asp Leu Lys Pro Glu Asn Ile Leu Val Asp
2675 2680 2685Glu Ser Leu Ala Lys Pro Thr Ile Lys Leu Ala Asp Phe Gly Asp Ala
2690 2695 2700Val Gln Leu Asn Thr Thr Tyr Tyr Ile His Gln Leu Leu Gly Asn Pro2705 2710 2715 2720Glu Phe Ala Ala Pro Glu Ile Ile Leu Gly Asn Pro Val Ser Leu Thr
2725 2730 2735Ser Asp Thr Trp Ser Val Gly Val Leu Thr Tyr Val Leu Leu Ser Gly
2740 2745 2750Val Ser Pro Phe Leu Asp Asp Ser Val Glu Glu Thr Cys Leu Asn Ile
2755 2760 2765Cys Arg Leu Asp Phe Ser Phe Pro Asp Asp Tyr Phe Lys Gly Val Ser
2770 2775 2780Gln Lys Ala Lys Glu Phe Val Cys Phe Leu Leu Gln Glu Asp Pro Ala2785 2790 2795 2800Lys Arg Pro Ser Ala Ala Leu Ala Leu Gln Glu Gln Trp Leu Gln Ala
2805 2810 2815Gly Asn Gly Arg Ser Thr Gly Val Leu Asp Thr Ser Arg Leu Thr Ser
2820 2825 2830Phe Ile Glu Arg Arg Lys His Gln Asn Asp Val Arg Pro Ile Arg Ser
2835 2840 2845Ile Lys Asn Phe Leu Gln Ser Arg Leu Leu Pro Arg Val
2850 2855 2860<210>109<211>271<212>PRT<213>人(Homo sapiens)<400>109Met Val Leu Ile Lys Glu Phe Arg Val Val Leu Pro Cys Ser Val Gln1 5 10 15Glu Tyr Gln Val Gly Gln Leu Tyr Ser Val Ala Glu Ala Ser Lys Asn
20 25 30Glu Thr Gly Gly Gly Glu Gly Ile Glu Val Leu Lys Asn Glu Pro Tyr
35 40 45Glu Lys Asp Gly Glu Lys Gly Gln Tyr Thr His Lys Ile Tyr His Leu
50 55 60Lys Ser Lys Val Pro Ala Phe Val Arg Met Ile Ala Pro Glu Gly Ser65 70 75 80Leu Val Phe His Glu Lys Ala Trp Asn Ala Tyr Pro Tyr Cys Arg Thr
85 90 95Ile Val Thr Asn Glu Tyr Met Lys Asp Asp Phe Phe Ile Lys Ile Glu
100 105 110Thr Trp His Lys Pro Asp Leu Gly Thr Leu Glu Asn Val His Gly Leu
115 120 125Asp Pro Asn Thr Trp Lys Thr Val Glu Ile Val His Ile Asp Ile Ala
130 135 140Asp Arg Ser Gln Val Glu Pro Ala Asp Tyr Lys Ala Asp Glu Asp Pro145 150 155 160Ala Leu Phe Gln Ser Val Lys Thr Lys Arg Gly Pro Leu Gly Pro Asn
165 170 175Trp Lys Lys Glu Leu Ala Asn Ser Pro Asp Cys Pro Gln Met Cys Ala
180 185 190Tyr Lys Leu Val Thr Ile Lys Phe Lys Trp Trp Gly Leu Gln Ser Lys
195 200 205Val Glu Asn Phe Ile Gln Lys Gln Glu Lys Arg Ile Phe Thr Asn Phe
210 215 220His Arg Gln Leu Phe Cys Trp Ile Asp Lys Trp Ile Asp Leu Thr Met225 230 235 240Glu Asp Ile Arg Arg Met Glu Asp Glu Thr Gln Lys Glu Leu Glu Thr
245 250 255Met Arg Lys Arg Gly Ser Val Arg Gly Thr Ser Ala Ala Asp Val
260 265 270
Claims (93)
1.一种分离的SEQ ID NO:2的核酸序列。
2.权利要求1的分离核酸序列,其中核酸序列是DNA。
3.一种分离的SEQ ID NO:4的氨基酸序列。
4.编码SEQ ID NO:4的氨基酸序列的核酸序列。
5.一种复制性克隆载体,其中含有权利要求1的核酸序列和在分离的宿主细胞中可操作的复制子。
6.一种分离的宿主细胞,它已用权利要求5的复制性克隆载体转化。
7.一种表达载体,其中含有权利要求1的核酸序列,该序列可操作性连接到转录调节区。
8.一种分离的宿主细胞,它已用权利要求7的表达载体转化。
9.一种检验物质作为宿主中骨调节治疗剂的方法,包括将权利要求1的核酸给予宿主,并评估是否有骨调节发生。
10.权利要求9的方法,其中宿主是细胞或动物。
11.权利要求10的方法,其中动物是人、啮齿动物或鸟。
12.一种鉴定参与骨调节的分子的方法,包括鉴定与HBM结合的分子、或抑制分子与HBM结合的分子。
13.权利要求12的方法,其中所述分子是蛋白质。
14.一种鉴定参与骨调节的蛋白的方法,包括鉴定一种蛋白,该蛋白在一种含Zmax1基因的宿主中的表达水平与在含HBM基因的另一种宿主中的表达水平不同。
15.权利要求14的方法,其中宿主是细胞或动物。
16.一种鉴定参与骨调节的候选蛋白的方法,包括
鉴定在第一种有高骨量表型的个体中的蛋白;
鉴定在第二种不具有高骨量表型的个体中的蛋白;
比较第一种个体的蛋白与第二种个体的蛋白,其中(i)存在于第一种个体中但不存在于第二种个体中的蛋白是候选蛋白或(ii)在第一种个体中存在量高于第二种个体中存在量的蛋白是候选蛋白或(iii)在第一种个体中存在量低于第二种个体中存在量的蛋白是候选蛋白。
17.权利要求16的方法,进一步包括产生抗候选蛋白的抗体。
18.一种鉴定参与骨调节的候选蛋白的方法,包括
鉴定在第一种有高骨量表型的个体中的蛋白;
鉴定在第二种不具有高骨量表型的个体中的蛋白;
比较第一种个体的蛋白与第二种个体的蛋白,其中(i)存在于第二种个体中但不存在于第一种个体中的蛋白是候选蛋白或(ii)在第二种个体中存在量高于第一种个体中存在量的蛋白是候选蛋白或(iii)在第二种个体中存在量低于第一种个体中存在量的蛋白是候选蛋白。
19.权利要求18的方法,进一步包括产生抗候选蛋白的抗体。
20.一种检验HBM活性的方法,包括固定化HBM蛋白,结合蛋白到HBM蛋白上,和测量结合程度。
21.权利要求20的方法,其中蛋白是ApoE。
22.一种鉴定参与骨调节的候选分子的方法,包括
鉴定与SEQ ID NO:1的核酸序列结合的分子、或抑制分子与SEQ IDNO:1的核酸序列结合的分子;
鉴定与SEQ ID NO:2的核酸序列结合的分子、或抑制分子与SEQ IDNO:2的核酸序列结合的分子;和
比较该分子与每种核酸序列的结合程度、或抑制结合的程度,其中与SEQ ID NO:2的核酸序列或SEQ ID NO:1的核酸序列结合、或者或多或少抑制其结合的分子是候选分子。
23.权利要求22的方法,其中候选分子是蛋白质或mRNA。
24.一种治疗骨发育疾病的药物开发的方法,包括鉴定与SEQ IDNO:4的氨基酸序列结合的分子。
25.权利要求24的方法,其中该分子抑制或增强该氨基酸的功能。
26.一种治疗骨发育疾病的药物开发的方法,包括
构建第一种含Zmax1基因或蛋白的宿主;
构建第二种含HBM基因或蛋白的宿主;
分析第一和第二种宿主间的差别;
鉴定一种分子,当加入第一种宿主中时,引起第一种宿主表现第二种宿主的特征。
27.权利要求26的方法,其中宿主是无细胞提取物、细胞或动物。
28.权利要求26的方法,其中差别是替代标记。
29.一种治疗动物骨发育疾病的方法,包括将权利要求1的核酸序列转移入患骨发育疾病的动物的体细胞。
30.权利要求29的方法,其中动物是人或鸟。
31.一种治疗动物骨发育疾病的方法,包括将权利要求1的核酸序列转移入患骨发育疾病的动物的生殖细胞。
32.权利要求31的方法,其中动物是人或鸟。
33.一种改变宿主骨发育的方法,包括将权利要求3的氨基酸序列给予患骨发育疾病的宿主的体细胞。
34.权利要求33的方法,其中宿主是人或鸟。
35.一种改变宿主骨发育的方法,包括将权利要求3的氨基酸序列给予患骨发育疾病的宿主的生殖细胞。
36.权利要求35的方法,其中宿主是人或鸟。
37.一种治疗骨质疏松的方法,包括将权利要求3的氨基酸序列给予需要的患者。
38.权利要求37的方法,其中患者是人或鸟。
39.一种治疗骨质疏松的方法,包括将权利要求3的氨基酸序列的细胞外结构域给予需要的患者。
40.权利要求39的方法,其中患者是人或鸟。
41.一种治疗骨质疏松的方法,包括将权利要求3的氨基酸序列的细胞内结构域给予需要的患者。
42.权利要求41的方法,其中患者是人或鸟。
43.一种治疗骨发育疾病的方法,包括将结合权利要求1的核酸序列的分子给予需要的患者。
44.权利要求43的方法,其中患者是人或鸟。
45.一种治疗骨发育疾病的方法,包括将抗体给予需要的患者,其中抗体抗权利要求3的氨基酸序列。
46.一种诊断性筛选对骨发育疾病的遗传倾向性的方法,包括筛选来自患者的样品,其中含有来源于HBM的基因组或cDNA核酸序列的核苷酸序列。
47.一种骨发育疾病的诊断测试,包括抗HBM蛋白的抗体。
48.一种鉴定对骨发育疾病的遗传倾向性的方法,包括用权利要求1的核酸序列进行单倍型分析。
49.一种在骨组织中表达HBM蛋白的方法,包括构建含指导骨组织中表达的启动子的表达载体,该启动子可操作性连接到权利要求1的核酸序列上。
50.权利要求49的方法,其中指导骨中表达的启动子是骨钙蛋白启动子,骨唾液酸糖蛋白启动子或AML-3启动子。
51.一种含SEQ ID NO:5,6,7,8,9,10或11的核酸序列的细菌人工染色体。
52.一种扩增Zmax1基因中核苷酸多态性的方法,包括使用权利要求51的细菌人工染色体。
53.一种扩增HBM基因中核苷酸多态性的方法,包括使用权利要求51的细菌人工染色体。
54.一种鉴定HBM基因的调控元件的方法,包括使用权利要求1或权利要求51的细菌人工染色体。
55.一种分离的核酸序列,其中含有SEQ ID NO:2的核酸序列的至少15个连续核苷酸,其中至少15个连续核苷酸之一是582位的胸腺嘧啶。
56.权利要求55的分离核酸序列,该核酸序列是DNA。
57.权利要求55的分离核酸序列,该核酸序列是DNA。
58.一种复制性克隆载体,其中含有权利要求55的核酸序列和在分离的宿主细胞中可操作的复制子。
59.一种分离的宿主细胞,已用权利要求58的复制性克隆载体转化。
60.一种表达载体,其中含有权利要求55的核酸序列,它可操作性连接到转录调控区。
61.一种分离的宿主细胞,已用权利要求60的表达载体转化。
62.一种分离的核酸序列,其中含有SEQ ID NO:2的核酸序列的至少15个连续核苷酸,其中至少15个连续核苷酸之一是582位的胸腺嘧啶,且其编码包括缬氨酸的氨基酸序列,该缬氨酸对应于SEQ IDNO:4的171位缬氨酸。
63.权利要求62的核酸序列,该核酸序列是DNA。
64.一种至少15个连续核苷酸的分离核酸片段,其中包括SEQ IDNO:2的核酸序列中的多态性位点,其中582位的G由T取代,和与之互补的序列。
65.权利要求64的分离核酸片段,其中所述互补序列是反向互补。
66.权利要求65的分离核酸片段,其中所述反向互补序列是mRNA。
67.权利要求64的分离核酸片段,该核酸片段是DNA。
68.权利要求64的分离核酸片段,该核酸片段是cDNA。
69.权利要求65的分离核酸片段,该核酸片段是RNA。
70.一种至少15个连续核苷酸的分离核酸片段,其中包括来自选自下组的外显子序列的单核苷酸多态性位点:
SEQ ID NO:9,其中核苷酸69169由A取代,
SEQ ID NO:9,其中核苷酸27402由G取代,
SEQ ID NO:9,其中核苷酸27841由C取代,
SEQ ID NO:9,其中核苷酸35600由G取代,
SEQ ID NO:9,其中核苷酸45619由A取代,
SEQ ID NO:9,其中核苷酸46018由G取代,
SEQ ID NO:9,其中核苷酸46093由G取代,
SEQ ID NO:9,其中核苷酸46190由G取代,
SEQ ID NO:9,其中核苷酸50993由C取代,
SEQ ID NO:9,其中核苷酸51124由T取代,
SEQ ID NO:9,其中核苷酸55461由T取代,
SEQ ID NO:9,其中核苷酸63645由A取代,
SEQ ID NO:9,其中核苷酸63646由C取代,
SEQ ID NO:9,其中核苷酸24809由G取代,
SEQ ID NO:9,其中核苷酸27837由C取代,
SEQ ID NO:9,其中核苷酸31485由T取代,
SEQ ID NO:9,其中核苷酸31683由G取代,
SEQ ID NO:9,其中核苷酸24808由G取代,
SEQ ID NO:8,其中核苷酸31340由C取代,
SEQ ID NO:8,其中核苷酸32538由G取代,
SEQ ID NO:8,其中核苷酸13224由G取代,
SEQ ID NO:8,其中核苷酸21119由A取代,
SEQ ID NO:8,其中核苷酸30497由A取代,
SEQ ID NO:9,其中核苷酸24811由C取代,
SEQ ID NO:9,其中核苷酸68280由A取代,和与之互补的序列。
71.权利要求70的分离核酸片段,其中所述SEQ ID NO:8的外显子序列的核苷酸21119由A取代。
72.权利要求70的分离核酸片段,该核酸片段是DNA。
73.权利要求70的分离核酸片段,该核酸片段是RNA。
74.权利要求64或权利要求70的分离核酸片段,该核酸片段是探针或引物。
75.一种鉴定参与骨调节的分子的方法,包括鉴定与参与粘着斑信号传递的蛋白结合的分子或抑制分子与该蛋白结合的分子。
76.权利要求75的方法,其中参与粘着斑信号传递的分子结合一种选自SEQ ID NO:87-109的蛋白。
77.权利要求75的方法,其中参与粘着斑信号传递的分子结合一种蛋白,该蛋白选自:SEQ ID NO:90,SEQ ID NO:93,SEQ ID NO:94,SEQ ID NO:99和SEQ ID NO:102。
78.一种通过给予一种试剂调节受试者骨密度的方法,该试剂调节参与粘着斑信号传递的核酸或由其编码的多肽。
79.权利要求78的方法,其中核酸包括选自SEQ ID NOS:63-86的核酸。
80.权利要求78的方法,其中核酸包括SEQ ID NO:66,SEQ IDNO:71,SEQ ID NO:77或SEQ ID NO:79。
81.权利要求78的方法,其中多肽选自SEQ ID NOS:87-109。
82.权利要求78的方法,其中多肽是SEQ ID NO:90,SEQ ID NO:93,SEQ ID NO:94,SEQ ID NO:99或SEQ ID NO:102。
83.一种核酸,包括SEQ ID NO:66,SEQ ID NO:71,SEQ ID NO:77或SEQ ID NO:79。
84.权利要求83的核酸,其中核酸是RNA或DNA。
85.一种复制性克隆载体,包含权利要求83的核酸和可在宿主细胞中操作的复制子。
86.一种分离的宿主细胞,已用权利要求85的复制性克隆载体转化。
87.一种含权利要求83的核酸序列的表达载体。
88.一种分离的宿主细胞,已用权利要求87的表达载体转化。
89.一种多肽,包括SEQ ID NO:90,SEQ ID NO:93,SEQ ID NO:94,SEQ ID NO:99或SEQ ID NO:102。
90.一种编码选自下组多肽的核酸:SEQ ID NO:90,SEQ ID NO:93,SEQ ID NO:94,SEQ ID NO:99或SEQ ID NO:102。
91.一种治疗骨发育疾病的方法,包括给予调节参与粘着斑信号传递的核酸或多肽的试剂的步骤。
92.权利要求91的方法,其中由所述试剂调节的核酸选自SEQ IDNOS:63-86的任意一个。
93.权利要求91的方法,其中由所述试剂调节的多肽选自SEQ IDNOS:87-109的任意一个。
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/544,398 | 2000-04-05 | ||
US09/543,771 | 2000-04-05 | ||
US09/544,398 US6770461B1 (en) | 1998-10-23 | 2000-04-05 | High bone mass gene of 11q13.3 |
US09/543,771 US6780609B1 (en) | 1998-10-23 | 2000-04-05 | High bone mass gene of 1.1q13.3 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1454256A true CN1454256A (zh) | 2003-11-05 |
Family
ID=27067425
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN00819619A Pending CN1454256A (zh) | 2000-04-05 | 2000-06-21 | 11q13.3的高骨量基因 |
Country Status (11)
Country | Link |
---|---|
EP (1) | EP1268775B1 (zh) |
JP (1) | JP2004515209A (zh) |
CN (1) | CN1454256A (zh) |
AR (1) | AR028316A1 (zh) |
AU (2) | AU5626900A (zh) |
BR (1) | BR0017197A (zh) |
CA (1) | CA2402410A1 (zh) |
IL (1) | IL151904A0 (zh) |
MX (1) | MXPA02009791A (zh) |
NZ (1) | NZ521769A (zh) |
WO (1) | WO2001077327A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113564254A (zh) * | 2013-09-16 | 2021-10-29 | 分子医学研究中心责任有限公司 | 用于骨髓恶性肿瘤诊断的突变钙网蛋白 |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6545137B1 (en) | 1997-04-15 | 2003-04-08 | John A. Todd | Receptor |
US6555654B1 (en) | 1997-04-15 | 2003-04-29 | The Wellcome Trust Limited As Trustee For The Wellcome Trust | LDL-receptor |
US7244577B2 (en) | 1997-04-15 | 2007-07-17 | Merck & Co., Inc. | Method of screening for modulator of LRP5 activity |
IL154473A0 (en) * | 2000-08-18 | 2003-09-17 | Proskelia | Regulator gene and system useful for the diagnosis and therapy of osteoporosis |
US7514594B2 (en) | 2001-05-11 | 2009-04-07 | Wyeth | Transgenic animal model of bone mass modulation |
EP1483288A4 (en) * | 2001-05-11 | 2005-09-21 | Genome Therapeutics Corp | HBM VARIANTS MODULATING BONE MASS AND LIPID LEVELS |
EP1395285A4 (en) * | 2001-05-17 | 2005-06-01 | Genome Therapeutics Corp | REAGENTS AND METHODS FOR MODULATING INDUCED DKK INTERACTIONS |
AU2003292404A1 (en) * | 2002-12-13 | 2004-07-09 | Oxagen Limited | Genetic susceptibility |
WO2006004066A1 (ja) * | 2004-07-02 | 2006-01-12 | Locomogene, Inc. | S1-5を含有するタンパク質製剤 |
US20090276863A1 (en) * | 2004-07-02 | 2009-11-05 | Toshihiro Nakajima | Protein formulations comprising s1-5 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997012903A1 (en) * | 1995-10-04 | 1997-04-10 | Warner-Lambert Company | Compounds, compositions and methods for inhibiting the binding of proteins containing an sh2 domain to cognate phosphorylated proteins |
US5691153A (en) * | 1996-09-06 | 1997-11-25 | Creighton University | Genetic markers to detect high bone mass |
US6545137B1 (en) * | 1997-04-15 | 2003-04-08 | John A. Todd | Receptor |
BE1011331A6 (fr) * | 1997-08-20 | 1999-07-06 | Univ Catholique De Louvain Hal | Polypeptide associe au peroxysome, sequence nucleotidique encodant ledit polypeptide et leur utilisation dans le diagnostic et/ou le traitement de maladies ou de lesions pulmonaires. |
JP2002506873A (ja) * | 1998-03-18 | 2002-03-05 | アリアド・ファーマシューティカルズ・インコーポレイテッド | 複素環式シグナル伝達阻害剤、それを含む組成物 |
JP2002539843A (ja) * | 1999-03-26 | 2002-11-26 | ヒューマン ジノーム サイエンシーズ, インコーポレイテッド | 50個のヒト分泌タンパク質 |
ES2290155T3 (es) * | 2000-05-26 | 2008-02-16 | Oscient Pharmaceuticals Corporation | Regulacion de las tasas lipidicas por mediacion del gen zmax1 y del gen hbm. |
IL154473A0 (en) * | 2000-08-18 | 2003-09-17 | Proskelia | Regulator gene and system useful for the diagnosis and therapy of osteoporosis |
-
2000
- 2000-06-21 MX MXPA02009791A patent/MXPA02009791A/es active IP Right Grant
- 2000-06-21 WO PCT/US2000/016951 patent/WO2001077327A1/en active Application Filing
- 2000-06-21 NZ NZ521769A patent/NZ521769A/xx unknown
- 2000-06-21 AU AU5626900A patent/AU5626900A/xx active Pending
- 2000-06-21 CA CA002402410A patent/CA2402410A1/en not_active Abandoned
- 2000-06-21 JP JP2001575181A patent/JP2004515209A/ja active Pending
- 2000-06-21 CN CN00819619A patent/CN1454256A/zh active Pending
- 2000-06-21 EP EP00941578A patent/EP1268775B1/en not_active Expired - Lifetime
- 2000-06-21 IL IL15190400A patent/IL151904A0/xx unknown
- 2000-06-21 AU AU2000256269A patent/AU2000256269B2/en not_active Ceased
- 2000-06-21 BR BR0017197-2A patent/BR0017197A/pt not_active IP Right Cessation
-
2001
- 2001-04-05 AR ARP010101640A patent/AR028316A1/es unknown
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113564254A (zh) * | 2013-09-16 | 2021-10-29 | 分子医学研究中心责任有限公司 | 用于骨髓恶性肿瘤诊断的突变钙网蛋白 |
Also Published As
Publication number | Publication date |
---|---|
CA2402410A1 (en) | 2001-10-18 |
AU5626900A (en) | 2001-10-23 |
NZ521769A (en) | 2004-12-24 |
EP1268775A1 (en) | 2003-01-02 |
AR028316A1 (es) | 2003-05-07 |
AU2000256269B2 (en) | 2006-11-02 |
JP2004515209A (ja) | 2004-05-27 |
BR0017197A (pt) | 2003-01-14 |
IL151904A0 (en) | 2003-04-10 |
WO2001077327A1 (en) | 2001-10-18 |
AU2000256269A1 (en) | 2002-01-10 |
MXPA02009791A (es) | 2004-09-06 |
EP1268775B1 (en) | 2008-11-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1100895B1 (en) | Abc1 polypeptide and methods and reagents for modulating cholesterol levels | |
US7416849B2 (en) | HBM variants that modulate bone mass and lipid levels | |
CN1446227A (zh) | 新型成纤维细胞生长因子(fgf23)及其使用方法 | |
US20090023905A1 (en) | Transgenic animal model of bone mass modulation | |
CN1182452A (zh) | 促骨生长素 | |
CN1367829A (zh) | 用于肺癌治疗和诊断的组合物和方法 | |
AU2007200880A1 (en) | Regulating lipid levels via the ZMAX1 or HBM gene | |
JP2005500815A6 (ja) | タンパク質およびそれをコードする核酸 | |
JP2005500815A (ja) | タンパク質およびそれをコードする核酸 | |
CN1454256A (zh) | 11q13.3的高骨量基因 | |
JP2003530814A (ja) | 不整脈に関連するヒトmink遺伝子突然変異 | |
CN1371390A (zh) | 人g-蛋白偶联受体 | |
CN1379816A (zh) | 人g-蛋白偶联受体 | |
CN1358228A (zh) | 来自基质细胞的多核苷酸序列和推断的氨基酸序列 | |
CN1468306A (zh) | 与精神分裂症相关的基因及蛋白质 | |
US7285400B2 (en) | High bone mass gene of 11q13.3 | |
CN1777676A (zh) | 与精神分裂症相关的电压门控离子通道基因及蛋白质 | |
EP1964921A2 (en) | The high bone mass gene of 11q13.3 | |
US20040221326A1 (en) | Transgenic animal model of bone mass modulation | |
CN1399644A (zh) | 新的人g蛋白偶联受体 | |
US20030219793A1 (en) | High bone mass gene of 11q13.3 | |
CN1498271A (zh) | 钾通道相互作用蛋白及其用途 | |
AU2001269712B2 (en) | Regulating lipid levels via the ZMAXI or HBM gene | |
US20030064489A1 (en) | Novel polypeptides and nucleic acids encoding same | |
AU2007200352A1 (en) | The high bone mass gene of 11q13.3 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |